Thousands of Indian workers are filming everyday activities to provide training data for AI-powered robots [1].
This process is critical because AI models require massive amounts of physical-world data to understand how to interact with real objects. By recording human movement, tech firms can teach machines to perform household and industrial work without relying solely on simulated environments.
Workers are recording routine tasks such as chopping mangoes, cleaning, and folding towels [1]. These videos build large-scale datasets that allow AI systems to learn the nuances of object manipulation [2]. This data collection is part of a broader global effort to speed up the development of humanoid robots intended for home and industrial use [1].
While data collection is happening in India, other robot learning centers are operating in China where people teach machines directly [3]. The demand for this human-centric data has fueled the growth of the robotics sector. One startup focused on physical intelligence is already valued at $1 billion [2].
Industry projections suggest the scale of this integration will be vast. Billions of autonomous AI-powered robots are projected to work alongside people in the coming decades [4]. To reach that scale, companies need a diverse array of human behaviors captured in high detail, often using head-mounted cameras to mimic a human's perspective.
This workforce provides the essential bridge between digital intelligence and physical action. Without these thousands of recorded examples, robots would struggle to perform a simple task like grasping a piece of clothing or navigating a kitchen [1].
“Thousands of Indian workers are filming everyday activities to provide training data for AI-powered robots.”
The shift toward 'physical intelligence' marks a transition from AI that processes text and images to AI that masters the physical world. By outsourcing data collection to workers in India and China, global tech firms are creating a global labor pipeline for robotic training, effectively treating human movement as a raw commodity to be harvested for machine learning.

