TL;DR

Perplexity AI unveiled a hybrid orchestrator that autonomously routes workloads between local and cloud models, keeping sensitive data on-device. For IT managers, this cuts cloud costs, lowers latency, and makes investment in domestic AI infrastructure less urgent.

Perplexity AI, the fast-growing search startup now valued at $20 billion, unveiled what it calls the first hybrid local-server inference orchestrator at Computex 2026 on Monday night, demonstrating software that autonomously decides — in real time and mid-task — which AI workloads stay on a user's device and which get routed to frontier models in the cloud.

CEO Aravind Srinivas demonstrated the system onstage alongside Intel CEO Lip-Bu Tan during Intel's keynote address, using Perplexity's "Personal Computer" agent to process confidential deal materials. In the demonstration, local models running on Intel Core Ultra Series 3 processors determined which information should remain on the device and which could be sent to cloud-based models. Srinivas said the approach balances intelligence, accuracy, privacy, and cost.

The key claim is not that a model can run locally — dozens of tools already do that. It is that Perplexity's system makes the routing decision itself, task by task, without requiring the user to choose in advance. Sensitive data like financial records or health information stays on the local machine; the heavier reasoning tasks that require frontier-scale models get sent to the cloud. One task, multiple execution locations, automatic orchestration.
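The article does not describe how the routing decision is made, so the following is only a minimal sketch of the general idea of per-subtask routing, not Perplexity's implementation. Every name in it (`Subtask`, `is_sensitive`, `route`, `plan`) is hypothetical, the keyword patterns stand in for whatever local model would actually judge sensitivity, and the `needs_frontier_model` flag is assumed to be estimated elsewhere.

```python
import re
from dataclasses import dataclass

# Hypothetical stand-in for a local sensitivity classifier.
# A real system would presumably use a local model, not keyword patterns.
SENSITIVE_PATTERNS = [
    re.compile(r"\b\d{3}-\d{2}-\d{4}\b"),  # SSN-like number
    re.compile(r"\b(diagnosis|patient|salary|account number)\b", re.I),
]


@dataclass
class Subtask:
    text: str
    needs_frontier_model: bool  # assumed to be estimated upstream


def is_sensitive(text: str) -> bool:
    """Return True if any assumed sensitivity pattern matches the text."""
    return any(p.search(text) for p in SENSITIVE_PATTERNS)


def route(subtask: Subtask) -> str:
    """Pick an execution location for one subtask.

    Assumed policy: privacy outranks capability, so sensitive text
    stays local even when a larger model would do better.
    """
    if is_sensitive(subtask.text):
        return "local"
    return "cloud" if subtask.needs_frontier_model else "local"


def plan(subtasks: list[Subtask]) -> list[tuple[str, str]]:
    """Route each subtask of a single task independently."""
    return [(s.text, route(s)) for s in subtasks]


if __name__ == "__main__":
    task = [
        Subtask("Summarize patient diagnosis notes", True),
        Subtask("Compare public market trends across sectors", True),
        Subtask("Rename these files", False),
    ]
    for text, location in plan(task):
        print(f"{location:5s} <- {text}")
```

In this sketch the sensitivity check runs first, so a subtask that would benefit from a frontier model still stays on the device if it touches sensitive data. That ordering is one possible design choice under the stated assumptions; a real orchestrator might instead redact the sensitive portion and send the remainder to the cloud.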

venturebeat.com

Perplexity AI unveils hybrid local-cloud inference system at Computex 2026

Perplexity AI unveiled a hybrid local-cloud inference system at Computex 2026 that automatically routes AI tasks between a user’s device and the cloud, signaling a major shift in enterprise AI, privacy, and on-device computing.

Tuesday, June 2, 2026
