ClareNow

With Perplexity's Push for Hybrid AI, Your Laptop Could Function as a Data Center

Perplexity is shifting how some sensitive AI data is stored, balancing processing between local silicon and cloud servers.

Nelson Aguilar CNET 03 Jun 2026, 04:28 3 min read 7/10

With Perplexity's Push for Hybrid AI, Your Laptop Could Function as a Data Center

Key Takeaways

Perplexity’s hybrid AI model runs sensitive queries locally on laptops using Apple Neural Engine, Qualcomm Snapdragon X, or Intel Core Ultra processors, with non-sensitive requests still processed in the cloud.
The company says on-device inference can deliver responses in under two seconds for many common tasks, matching cloud latency for less complex queries.
This move aligns with the EU AI Act and GDPR requirements, reducing data transfer risks and potentially lowering legal compliance costs for enterprises using Perplexity.
Perplexity will open-source its hybrid inference toolkit by Q3 2025, allowing third-party developers to integrate local-cloud splitting into their own apps.
The hybrid AI initiative follows a $500 million funding round in early 2025, valuing Perplexity at over $3 billion and giving it resources to compete with Google and OpenAI on privacy features.

Perplexity is quietly pulling AI processing back from the cloud and onto your laptop — a move that could redefine how sensitive data is handled in the age of generative AI. The AI search startup, best known for challenging Google with real-time answers, announced a shift toward hybrid AI, balancing compute between local silicon and cloud servers. This means your laptop could effectively function as a mini data center, processing queries involving private information without ever sending them to the cloud. The initiative, revealed in early 2025, comes as regulators worldwide tighten scrutiny on data sovereignty and user privacy. Perplexity aims to give users more control over their data by running sensitive inference tasks on-device, using Apple’s Neural Engine, Qualcomm’s Snapdragon X chips, or Intel’s Core Ultra processors, while reserving cloud resources for non-sensitive requests. The hybrid approach addresses a growing tension in AI: cloud-based models offer speed and scale, but they require data to leave the device, exposing it to interception or misuse. Perplexity’s solution is to partition queries based on sensitivity. For example, a user asking about their financial documents would trigger local processing, while a general news question might still hit the cloud. The company has not disclosed exact technical specifications, but it says early tests show that local inference on a modern laptop can handle responses in under two seconds for many common tasks. This mirrors broader industry trends. Apple Intelligence already processes many requests on-device, and Google is experimenting with on-device AI through its Gemini Nano model. However, Perplexity’s hybrid model is unique because it dynamically switches between local and cloud inference depending on the content, not just the device’s capability. The move has implications for privacy regulations like GDPR and the EU AI Act. By keeping personal data on the device, Perplexity could reduce legal compliance burdens for enterprise customers. Privacy advocates have praised the approach, though some security researchers caution that on-device models are not immune to attacks — malicious software could still extract data from local memory. Looking ahead, Perplexity plans to extend hybrid AI to mobile devices later this year. The company is also developing a developer toolkit so third-party apps can leverage the same local-cloud split. If successful, Perplexity’s hybrid AI could accelerate the shift from centralized cloud dependency to a more distributed, privacy-first AI architecture — a change that might just turn every laptop into a personal data center.

Frequently Asked Questions

Hybrid AI is an approach that splits artificial intelligence processing between local hardware (such as a laptop's CPU or neural engine) and cloud servers. Queries that contain sensitive or private data are handled on-device, while less sensitive requests can be processed in the cloud for speed and scale.

Perplexity's hybrid AI analyzes each user query in real time. If the query involves personal or confidential data, it is processed on the user's laptop using built-in AI accelerators. Non-sensitive queries are sent to Perplexity's cloud servers. The system dynamically chooses the best path to balance privacy, latency, and accuracy.

Perplexity aims to address growing privacy concerns and regulatory requirements like GDPR and the EU AI Act. By keeping sensitive data on-device, the company reduces the risk of data breaches and simplifies compliance. Local processing also reduces cloud costs and can improve response times for simple queries.

On-device AI keeps user data local, strengthening privacy and security. It reduces dependence on internet connectivity, lowers cloud computing costs, and can provide faster response times for common tasks. It also helps companies comply with data residency laws that restrict cross-border data transfers.

Perplexity claims that hybrid AI improves security by not transmitting sensitive data over the internet. However, security experts note that on-device models can still be vulnerable to malware that extracts data from local memory. Overall, the approach reduces the attack surface compared to full cloud processing.

Hybrid AI significantly enhances user privacy because personal queries never leave the device. This minimizes exposure to cloud leaks, third-party data sharing, or government requests for cloud-stored data. However, the local model itself must be secured to prevent extraction attacks.

Original source

www.cnet.com

Read original

Discussion

Join the discussion

No comments yet. Be the first to share your thoughts!