NexChron

MMLU

Definition Massive Multitask Language Understanding — a benchmark covering 57 academic subjects from elementary mathematics to professional law and medicine, used to measure the breadth of world knowledge in language models. MMLU has become one of the most widely cited evaluation suites for LLM capability.

In Depth

MMLU consists of multiple-choice questions drawn from standardized tests and academic exams. A model that scores well demonstrates it has absorbed broad factual and reasoning knowledge across STEM, humanities, and professional domains. Frontier models now exceed average human performance on MMLU, prompting researchers to develop harder successors like MMLU-Pro and GPQA to maintain discriminative power.

Browse more terms

AI Agent AI Alignment AI Audit AI Bill of Rights AI Compute AI Governance AI Orchestration AI Readiness AI Risk Management AI Watermarking AI-as-a-Service Activation Function Active Learning Adversarial Attack Agentic AI Agentic Workflow Algorithmic Fairness Arctic Artificial General Intelligence Artificial Superintelligence