In Depth
Self-supervised learning (SSL) is a paradigm in which models generate their own supervisory signals from unlabeled data, eliminating the need for expensive manual annotation. The model creates a pretext task from the data itself, such as predicting masked words in a sentence (BERT), predicting the next token (GPT), or, for images, reconstructing masked patches or matching different augmented views of the same image (as in contrastive methods).
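The two text pretext tasks above can be sketched as pure data transformations: the training labels are carved out of the unlabeled sequence itself, with no human annotation involved. The following is a minimal illustrative sketch (the function names, the `[MASK]` placeholder string, and the 15% default masking rate are assumptions for illustration, loosely following BERT-style conventions):

```python
import random

MASK = "[MASK]"

def make_mlm_example(tokens, mask_prob=0.15, rng=None):
    """BERT-style pretext task: hide some tokens, ask the model to recover them.

    Returns (inputs, labels). At masked positions, labels hold the original
    token; elsewhere they are None, so a loss would only be computed on masks.
    """
    rng = rng or random.Random(0)  # fixed seed here just for reproducibility
    inputs, labels = [], []
    for tok in tokens:
        if rng.random() < mask_prob:
            inputs.append(MASK)
            labels.append(tok)  # the supervision signal comes from the data itself
        else:
            inputs.append(tok)
            labels.append(None)
    return inputs, labels

def make_next_token_example(tokens):
    """GPT-style pretext task: each position predicts the token after it."""
    return tokens[:-1], tokens[1:]

tokens = "the cat sat on the mat".split()
mlm_inputs, mlm_labels = make_mlm_example(tokens, mask_prob=0.3)
ctx, targets = make_next_token_example(tokens)
```

Either transformation turns a raw, unlabeled corpus into an effectively unlimited stream of (input, target) training pairs, which is exactly what makes SSL so scalable.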
SSL has become the dominant pre-training strategy for foundation models because it can leverage virtually unlimited unlabeled data from the internet. This makes it far more scalable than supervised learning, which requires human-labeled examples. The representations learned through self-supervised pre-training typically transfer well to many downstream tasks with minimal fine-tuning.
The success of self-supervised learning has fundamentally changed AI development. Instead of collecting task-specific labeled datasets, practitioners now pre-train on massive unlabeled corpora and adapt to specific tasks with small amounts of labeled data. This pre-train-then-fine-tune paradigm underlies virtually all modern foundation models and is a key reason why AI capabilities have advanced so rapidly.