Nvidia is developing the RTX Spark platform and new chips to turn Windows laptops into AI-agent powerhouses capable of running large language models locally [1, 2].

This shift represents a move away from cloud-dependency, allowing users to operate personal AI assistants offline. By processing data on the device, Nvidia aims to increase privacy and reduce the latency associated with remote server requests [3, 2].

CEO Jensen Huang said these developments during a September 2024 episode of The Vergecast [3, 4]. The strategy involves a combination of software optimization through RTX Spark and the introduction of specialized hardware. Specifically, Nvidia is working on N2X and N3X chips designed to handle the heavy computational loads required by local AI [2].

Performance targets for these new chips are significant. The N2X chip is intended to provide approximately double the performance of current RTX 40-series GPUs [2]. The N3X chip is designed for even greater capacity, targeting approximately triple the performance of the current RTX 40-series [2].

While Nvidia pushes for local processing, other industry players are adopting different strategies. Perplexity is exploring a hybrid model that balances AI workloads between local silicon and cloud servers [3]. This contrast highlights a broader industry debate over whether the future of AI is centralized in the cloud or distributed across personal hardware [3, 1].

Local AI testing has already shown some viability on older hardware. Reports indicate that certain local AI models can work effectively on laptops as old as six years, even those lacking a dedicated GPU [5]. However, the RTX Spark ecosystem is intended to move beyond basic functionality to create a high-performance environment for complex AI agents [1].

Nvidia is developing the RTX Spark platform and new chips to turn Windows laptops into AI-agent powerhouses.

The transition toward local AI execution marks a strategic attempt to decouple the AI experience from internet connectivity and subscription-based cloud models. If Nvidia successfully implements the N2X and N3X architectures, the laptop will evolve from a terminal that accesses cloud intelligence into a self-sufficient compute node. This could shift the competitive landscape of the PC market, making NPU and GPU performance the primary driver of hardware sales over traditional CPU clock speeds.