Definition
Training Data
The information used to teach AI models how to respond to queries.
Full Definition
Training data is the massive collection of text, documents, and other content used to train large language models. The quality, recency, and content of training data directly influences what AI systems know about brands and how they respond to related queries. Training data typically comes from web pages, books, articles, and other text sources. For GEO, understanding training data helps explain AI behavior—if your brand isn't well-represented in authoritative sources that likely appear in training data, AI systems may not mention you.
Related Terms
Tools & Resources
Monitor Your AI Visibility
See how ChatGPT, Claude, and Perplexity mention your brand.
Free AI Visibility Check