It’s a race to capture real-world AI training data
CHENNAI: Home services startup Pronto’s admission recently that it was piloting in-home video recordings to train physical AI systems shines light on a fast-growing and loosely regulated industry of AI data capture and labelling for the global robotics supply chain.
Pronto is not alone. Startups such as Human Archive, Humyn Labs, Egolab AI and Neocambrian are collecting what is called egocentric data or first-person video captured through wearables or head-mounted cameras. They partner with cloud kitchens, hotels, home services platforms, small textile and garment factories, and warehouse operators to record everyday tasks from cooking meals and washing dishes to stitching garments, assembling components and sorting inventory. In some cases, startups have built dedicated ‘data factories’ with motion-tracking rigs.
“Typical clients are robotics, vision-language-action model and world model companies,” said Abhinav Kukreja, founder of Neocambrian AI, which raised funds from angels, including Dalmia Family Office Trust. “There is no equivalent repository of physical behaviour on the internet. Robots need to learn from messy homes, crowded factories, small shops and repair stations, which India offers. When done right, it can become an additional source of paid work for many workers and households, and we compensate both environment owners and data collectors,” he said.
This data trains world models and physical AI systems, teaching robots to navigate and act in messy, unstructured environments and smart glasses for object recognition. One industry insider said there is significant demand from the defence industry, particularly for autonomous drone applications. The practice also raises questions about privacy, legality and compensation as in some cases videos are recorded without pay and consent from the workers. TOI learnt that some factories have paused such pilots after the recent backlash.
Manish Agarwal, co-founder of Humyn Labs, which works with leading frontier labs, said demand is growing from robotics OEMs, software makers and enterprises. “We collect and convert this into episodic strings for robot memory, which helps build low to mid-level agentic capabilities including physical action, voice, sight and mobility,” he said. “We are using verified networks of workers across 16 countries as robots cannot be trained only in Indian environments. For European domestic robotics to navigate better, we need training data similar to that environment.”
Startups argue that this is India’s entry into the global AI value chain, and that working with frontier labs could help the country train competitive models of its own. But sceptics see a cost-arbitrage play. Madhukar Yarra, CEO of Bengaluru-based NextWealth, which annotates these videos, called it a flash in the pan. Much of the data is collected through unorganised gig work, he said.
Ready to Make a Smarter Property Decision? Build Your Legacy with TOI Homes.
Burgeoning data
“Typical clients are robotics, vision-language-action model and world model companies,” said Abhinav Kukreja, founder of Neocambrian AI, which raised funds from angels, including Dalmia Family Office Trust. “There is no equivalent repository of physical behaviour on the internet. Robots need to learn from messy homes, crowded factories, small shops and repair stations, which India offers. When done right, it can become an additional source of paid work for many workers and households, and we compensate both environment owners and data collectors,” he said.
This data trains world models and physical AI systems, teaching robots to navigate and act in messy, unstructured environments and smart glasses for object recognition. One industry insider said there is significant demand from the defence industry, particularly for autonomous drone applications. The practice also raises questions about privacy, legality and compensation as in some cases videos are recorded without pay and consent from the workers. TOI learnt that some factories have paused such pilots after the recent backlash.
Manish Agarwal, co-founder of Humyn Labs, which works with leading frontier labs, said demand is growing from robotics OEMs, software makers and enterprises. “We collect and convert this into episodic strings for robot memory, which helps build low to mid-level agentic capabilities including physical action, voice, sight and mobility,” he said. “We are using verified networks of workers across 16 countries as robots cannot be trained only in Indian environments. For European domestic robotics to navigate better, we need training data similar to that environment.”
Startups argue that this is India’s entry into the global AI value chain, and that working with frontier labs could help the country train competitive models of its own. But sceptics see a cost-arbitrage play. Madhukar Yarra, CEO of Bengaluru-based NextWealth, which annotates these videos, called it a flash in the pan. Much of the data is collected through unorganised gig work, he said.
Ready to Make a Smarter Property Decision? Build Your Legacy with TOI Homes.
Comments
Be the first to share a thought and become theFirst Voiceof this News Article
end of article
Trending Stories
- FIFA World Cup 2026 Opening Ceremony Highlights: Shakira, Burna Boy bring azteca to life as World Cup opening ceremony concludes
- US-Israel-Iran War News Live Updates: 'We'll negotiate with bombs' - US on Iran strikes as Middle East enters day 3 of renewed fire
- FIFA World Cup 2026: How to watch match replays and highlights in India, the US and across global streaming platforms
- Phil Mickelson faces misconduct allegations while golf fans question sudden end of San Diego club membership
- CM Vijay urges Centre to exempt Tamil Nadu from NEET, release pending funds to state and declare Thirukkural as a ‘national literature’
- March to dominance: How victories in states is consolidating BJP in Parliament
- SpaceX IPO tomorrow: Could Elon Musk become world's first trillionaire? Decoding the math
Featured in Business
- Countries protecting domestic industries, India no exception: Piyush Goyal on global trade
- Seeds of change: How India’s agriculture went high-output in 12 years
- US stocks today: Wall Street rebounds as AI stocks recover despite Iran war worries
- India retains growth edge with 6.6% growth, World Bank forecasts; cuts global outlook
- SpaceX IPO tomorrow: Could Elon Musk become world's first trillionaire? Decoding the math
- Stock market: Which are top 10 gainers and losers on NSE & BSE today? Check list
Photostories
- Dipika Kakar shares health update on her Immunotherapy treatment; Opens up about being away from son Ruhaan
- Is your child carrying emotional baggage? 6 subtle signs and what parents can do to help
- Inland Taipan changes colour with the seasons: Inside the strange and dramatic seasonal color change explained
- Fresh Nipah case in India: What the symptoms look like, how it spreads, and what you should actually do
- “This will help prevent…and improve..” Why did Amit Shah praise this rice variety
- 7 key things to know about Siri AI that Apple announced at WWDC 2026
- Vintage '90s-inspired blouse designs approved by Bengali actresses that deserve a strong comeback
- Shoaib Malik to Glenn Maxwell: 7 international cricketers who married Indian women
- 5 venomous snakes you’ll only find in Africa and where to spot them
- No crowds, no chaos: This remote Himalayan valley in Uttarkhand feels frozen in time
Hot Picks
Top Trends
Up Next
Follow Us On Social Media