In Depth

CLIP understands what images depict in natural language terms. It powers image search, content moderation, and is a key component in image generation systems like DALL-E and Stable Diffusion. CLIP bridges the gap between visual and textual AI.