Volvo and Google have integrated the Gemini AI model into the cameras of the new EX60 electric SUV [1, 2].

This partnership marks a shift in vehicle interfaces by allowing a car to perceive and discuss the physical world in real time. By merging visual data with large language models, the vehicle moves beyond simple voice commands toward contextual awareness.

The EX60's built-in cameras allow the AI to analyze buildings, street signs, and other surroundings [1, 2]. This integration enables the vehicle to converse with occupants about what it sees through its lenses [1, 2]. The technology is designed to enhance the user experience and differentiate the SUV within a competitive electric vehicle market [3, 2].

Unveiled in Sweden, the EX60 aims to combine high-end artificial intelligence with long-distance utility [2]. The vehicle has a range of 400 miles [4].

Volvo has positioned the EX60 as a tech-forward offering that leverages Google's ecosystem to provide a more intuitive driving experience [3]. Gemini allows the car to process complex visual information and translate it into natural conversation, a capability not previously seen in this form in consumer vehicles [2].

The integration of multimodal AI like Gemini into automotive hardware signals a transition from 'smart' cars to 'perceptive' cars. By using external cameras as the AI's eyes, Volvo is shifting the role of the in-car assistant from a navigation tool to a real-time tour guide and environmental analyzer, potentially setting a new industry standard for passenger interaction.