NexChron

Edge AI

Definition Running AI model inference directly on local devices — smartphones, cameras, sensors, vehicles — rather than sending data to cloud servers. Edge AI reduces latency, preserves privacy, and enables AI in offline or bandwidth-constrained environments.

In Depth

Edge AI is enabled by model compression techniques (quantization, pruning, knowledge distillation) and dedicated neural processing units (NPUs) in chips from Apple, Qualcomm, and Google. Applications include real-time translation on-device, autonomous vehicle perception, industrial quality inspection, and smart home processing. The trade-off is reduced model capability compared to large cloud-hosted models.

Browse more terms

AI Agent AI Alignment AI Audit AI Bill of Rights AI Compute AI Governance AI Orchestration AI Readiness AI Risk Management AI Watermarking AI-as-a-Service Activation Function Active Learning Adversarial Attack Agentic AI Agentic Workflow Algorithmic Fairness Arctic Artificial General Intelligence Artificial Superintelligence