In Depth
Perplexity is the exponential of the average negative log-likelihood per token: for a sequence of N tokens, PPL = exp(−(1/N) Σᵢ log p(xᵢ | x<ᵢ)), so lower values indicate a better fit. It is used to compare models trained on the same data distribution (perplexities over different tokenizers or vocabularies are not directly comparable) and to track training progress. While perplexity correlates loosely with downstream task performance, it does not capture instruction-following ability, factual accuracy, or safety — which is why benchmark suites and human evaluation remain essential complements.
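The definition above can be sketched in a few lines; the function name and the toy uniform-model inputs below are illustrative, not from any particular library.

```python
import math

def perplexity(token_log_probs):
    # Perplexity = exp of the average negative log-likelihood per token.
    avg_nll = -sum(token_log_probs) / len(token_log_probs)
    return math.exp(avg_nll)

# Sanity check: a model that assigns uniform probability over a
# 4-symbol vocabulary gives each token probability 0.25, so its
# perplexity is 4 — it is "as confused as" a 4-way coin flip.
uniform_log_probs = [math.log(0.25)] * 10
print(perplexity(uniform_log_probs))
```

In practice the per-token log-probabilities come from the model's output distribution over a held-out corpus; averaging in log space before exponentiating avoids numerical underflow on long sequences.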