In Depth
Edge AI is enabled by model compression techniques (quantization, pruning, knowledge distillation) and dedicated neural processing units (NPUs) in chips from Apple, Qualcomm, and Google. Applications include real-time translation on-device, autonomous vehicle perception, industrial quality inspection, and smart home processing. The trade-off is reduced model capability compared to large cloud-hosted models.