Book a meeting
VentureBeatLarge Language Models

On-device AI agents hit a hard memory limit. Apple's new architecture routes around it.

On-device AI agents hit a hard memory limit. Apple's new architecture routes around it.

On-device AI models have stayed small because the entire weight set has to live in DRAM, capping practical parameter counts well below what server-side deployments use. Enterprise architects evaluating agentic workloads have had to choose between capable cloud-dependent models and limited on-device ones. Apple's third-generation foundation models, announced at WWDC26, break that constraint by moving the weight set off DRAM entirely.The AFM 3 family was developed in collaboration with Google

Implement AI in your business?

Stekz helps businesses implement AI and automation. From strategy to working code.

Book an appointment