What is Retrieval-Augmented Generation?

NexChron

Retrieval-Augmented Generation

Definition A technique that combines a language model with a live retrieval system, fetching relevant documents from an external knowledge base before generating a response. RAG grounds LLM outputs in up-to-date, verifiable facts rather than relying solely on trained parameters.

In Depth

In a RAG pipeline, a query is first converted to an embedding, then used to search a vector database for semantically similar passages. Those passages are injected into the model prompt as context. RAG reduces hallucination, enables citations, and allows knowledge to be updated without retraining the model.

Browse more terms

AI Agent AI Alignment AI Audit AI Bill of Rights AI Compute AI Governance AI Orchestration AI Readiness AI Risk Management AI Watermarking AI-as-a-Service Activation Function Active Learning Adversarial Attack Agentic AI Agentic Workflow Algorithmic Fairness Arctic Artificial General Intelligence Artificial Superintelligence