Explained: What is India’s Sarvam AI model that Google CEO Sundar Pichai is quite impressed with
Google CEO Sundar Pichai said that he is impressed with the work done by Sarvam AI. Speaking at the ongoing India AI Impact Summit 2026, Pichai said “The developer energy I find in India every time I travel, it’s bar none, second to none,” adding that the entrepreneurship ecosystem in the country is “thriving”. Pichai specifically highlighted Sarvam AI for developing local AI models tailored to Indian languages and contexts saying "The work Sarvam has done developing local AI models ....I just don't see any impediments to that, and I think it is very, very well positioned". The AI startup has recently taken the internet by storm with the company claiming that its AI model has outperformed some of the biggest names in ai, including Google’s Gemini and OpenAI’s ChatGPT.
“Sarvam Vision achieves state-of-the-art accuracy of 84.3% on the olmOCR-Bench (English only subset) outperforming frontier models like Gemini 3 Pro and recent OCR models like DeepSeek OCR 2,” wrote Pratyush Kumar, CEO, Sarvam AI.
Sarvam was founded by Vivek Raghavan and Pratyush Kumar in August 2023. In a blog post, the company explained that its Sarvam AI model is capable of a range of visual understanding tasks, including image captioning, scene text recognition, chart interpretation, and complex table parsing. One of the company aims is to unlock India's knowledge that remains embedded in physical documents, scanned archives, and historical collections.
Another key problem that the company is working on is to bring AI functionality to Indian users. “Most global models treat Indian languages as secondary, often resulting in lower accuracy for regional scripts. Along with pushing the frontiers of accuracy, our VLM is an inference-efficient 3B state-space model,” the company said.
Sarvam AI model, the company says, is trained on high-quality datasets covering 22 official Indian languages, including varied financial documents, literature, newspapers, historic texts, and more.
Sarvam AI’s speech recognition model supports 10 Indian languages within a single 74-million-parameter model that occupies approximately 294MB on a device. It can automatically identify the language being spoken, without requiring the user to select it. The model can process speech at about 8.5x real-time and provides a time-to-first-token of less than 300 milliseconds on a Qualcomm Snapdragon 8 Gen 3 chipset.
Its speech synthesis model has a device footprint of about 60 MB and 24 million parameters. The model achieves a mean character error rate of 0.0173 on a standard benchmark, indicating that synthesised speech closely matches the intended text across languages. Custom voice cloning is also supported on it which means a new voice can be added using about one hour of audio data and deployed within the same 60MB model file.
The translation model, on the other hand, has 150 million parameters and an on-device footprint of around 334MB. It handles bidirectional translation across 110 language pairs, including 10 Indian languages and English, without routing through an intermediate language.
One of the key differentiators between India’s Sarvam AI, and Gemini and ChatGPT is the former’s focus on Indian languages prioritising English and treating the rest secondary. Since it is trained in 22 Indian languages, it can give higher accuracy for regional scripts.
While other models are only capable enough to extract text from documents or images, the SarvamAI can also interpret visual elements for better understanding and additional knowledge. This ensures better performance on a variety of complex documents in the level of understanding with a large-scale Indic OCR benchmark for Indian languages.
The Document Intelligence API is free for February 2026, allowing users to explore and build with Sarvam Vision at scale, with getting started today for completely free.
Here’s a brief summary of major features of India’s Sarvam AI model are:
“Sarvam Vision achieves state-of-the-art accuracy of 84.3% on the olmOCR-Bench (English only subset) outperforming frontier models like Gemini 3 Pro and recent OCR models like DeepSeek OCR 2,” wrote Pratyush Kumar, CEO, Sarvam AI.
What is India’s Sarvam AI that Sundar Pichai praised
Sarvam was founded by Vivek Raghavan and Pratyush Kumar in August 2023. In a blog post, the company explained that its Sarvam AI model is capable of a range of visual understanding tasks, including image captioning, scene text recognition, chart interpretation, and complex table parsing. One of the company aims is to unlock India's knowledge that remains embedded in physical documents, scanned archives, and historical collections.
Another key problem that the company is working on is to bring AI functionality to Indian users. “Most global models treat Indian languages as secondary, often resulting in lower accuracy for regional scripts. Along with pushing the frontiers of accuracy, our VLM is an inference-efficient 3B state-space model,” the company said.
Sarvam AI model, the company says, is trained on high-quality datasets covering 22 official Indian languages, including varied financial documents, literature, newspapers, historic texts, and more.
Sarvam AI’s speech recognition model supports 10 Indian languages within a single 74-million-parameter model that occupies approximately 294MB on a device. It can automatically identify the language being spoken, without requiring the user to select it. The model can process speech at about 8.5x real-time and provides a time-to-first-token of less than 300 milliseconds on a Qualcomm Snapdragon 8 Gen 3 chipset.
Its speech synthesis model has a device footprint of about 60 MB and 24 million parameters. The model achieves a mean character error rate of 0.0173 on a standard benchmark, indicating that synthesised speech closely matches the intended text across languages. Custom voice cloning is also supported on it which means a new voice can be added using about one hour of audio data and deployed within the same 60MB model file.
The translation model, on the other hand, has 150 million parameters and an on-device footprint of around 334MB. It handles bidirectional translation across 110 language pairs, including 10 Indian languages and English, without routing through an intermediate language.
How Sarvam AI differs from Gemini and ChatGPT
While other models are only capable enough to extract text from documents or images, the SarvamAI can also interpret visual elements for better understanding and additional knowledge. This ensures better performance on a variety of complex documents in the level of understanding with a large-scale Indic OCR benchmark for Indian languages.
Sarvam AI model availability
The Document Intelligence API is free for February 2026, allowing users to explore and build with Sarvam Vision at scale, with getting started today for completely free.
India’s Sarvam AI: Key features
Here’s a brief summary of major features of India’s Sarvam AI model are:
- Multimodal vision-language: This helps in ensuring to understand the images and texts together for enabling the image captioning, chart, or table interpretation more easily.
- Document understanding (Indian languages focused): It has high-accuracy OCR and knowledge extraction for 22 Indian languages, including historic texts and scanned documents.
- Charts and data interpretation: Sarvam AI is capable of understanding more than texts. The charts, data, illustrations, and visual analysis of the documents.
- Multilingual visual: The AI model understands and interprets visual elements across multiple languages in the same document.
- Leading performance: Sarvam AI excels in global English benchmarks and introduces the Sarvam Indic OCR Bench for Indian languages.
- Accessible API: Its document intelligence APIs are production-ready and free to use for experimentation in February 2026.
Top Comment
P
Prasanna Kumar
23 hours ago
This Govt has the habit of using people from other countries to speak about Indian products/services, perhaps to create an impression that it is very popular!! Why is it that the leading Indian IT companies are not able to convince both Indians and foreign countries?!! The same happens at Governance level also. If you observe Macaron's speech, it sounded very much like a speech written by the PM's speech writer, glorifying Indian's digital footprint and AI capabilities!!!Read allPost comment
Popular from Technology
- Are Elon Musk and Sam Altman not ‘human’? Internet erupts after podcaster’s claim goes viral as Donald Trump teases UFO files
- Infosys CEO Salil Parekh on AI tools replacing engineers: It is not that overnight everything is going to be replaced as in large companies ...
- Who is Ranvir Sachdeva, 8-year-old who became the youngest speaker at India AI Impact Summit 2026; He met with Google CEO Sundar Pichai, OpenAI’s Sam Altman
- OpenAI CEO Sam Altman calls out tech companies for mass layoffs; says: Can't blame everything on...
- AI knows how caste works in India. Here’s why that’s a worry
end of article
Trending Stories
- Montreal Canadiens Could Break Rivalry Taboo on Trade Deadline Deal With Toronto Maple Leafs To Acquire Some Depth Pieces
- Travis Kelce’s luxurious mansions revealed: Inside his 6-bedroom, multi-floor $6 million property
- Rashee Rice net worth in 2026: Breaking down contract, salary, and career earnings
- US Supreme Court Ruling Trump Tariffs Live Updates: Top court's decision impacts some, but not all of Trump's levies
- Ronda Rousey vs Gina Carano: What makes the MMA showdown so special
- AUS vs OMAN, T20 WC: Australia beat Oman by nine wickets
- Alysa Liu family: Inside the story of Olympic figure skater's father Arthur Liu, surrogacy journey, and close bond with her siblings
Featured in technology
- Sung Jinwoo unveils new ‘Monarch of the Heaven-Annihilating Dragons’ form in Solo Leveling: Arise
- Asha Sharma named new CEO of Microsoft Gaming: Read Satya Nadella’s message to staff
- Microsoft responds to the report that US ICE uses company's tech for mass spying of civilians; says: Microsoft policies and terms of service do not …
- Project Silica: Microsoft finds affordable way to write data on glass and preserve it for 10,000 years
- After layoffs and shutting down studios, Meta is making more changes to the 'team' Mark Zuckerberg changed company's name for
- Infosys CEO Salil Parekh on AI tools replacing engineers: It is not that overnight everything is going to be replaced as in large companies ...
Photostories
- 6 iconic Butter Chicken dishes from around the world
- Spices you should carry for good fortune; based on your birth number
- How does Shark Tank India judge Aman Gupta’s home look from inside: A sneak peak into his aesthetic Gurgaon apartment
- Baby names inspired by mountains and peaks
- 8 Indian breakfasts with more protein than eggs
- 10 easy herbs and plants to grow in a compact vertical garden
- Which Lakshmi is associated with your birth number?
- Just one month to go for ‘Dhurandhar 2’ vs ‘Toxic’: Here’s what the big box-office clash promises
- How to make classic Gobhi Matar Pulao for lunch
- From being bullied for making rotis to watching his mother clean gutters; When MasterChef India judge Vikas Khanna spoke about his early struggles
Up Next