Definition

Training Data

The information used to teach AI models how to respond to queries.

Full Definition

Training data is the large collection of text, documents, and other content used to train large language models. Its quality, recency, and content directly influence what AI systems know about brands and how they respond to related queries. Training data typically comes from web pages, books, articles, and other text sources. For GEO (generative engine optimization), understanding training data helps explain AI behavior: if your brand isn't well represented in the authoritative sources likely to appear in training data, AI systems may not mention you.
