Skip to content
Definition

Inference

When an AI model generates responses based on its training.

Full Definition

Inference is the process of using a trained AI model to generate outputs (responses) from inputs (prompts). During inference, the model applies what it learned during training to new queries. Every time you ask ChatGPT a question, you're running inference on the model. Understanding inference helps explain AI behavior—the model isn't 'thinking' or searching in real-time but rather generating responses based on patterns learned during training, which explains why it might have outdated information or inconsistent knowledge.

Related Terms

Tools & Resources

Monitor Your AI Visibility

See how ChatGPT, Claude, and Perplexity mention your brand.

Free AI Visibility Check