India's Public AI Stack: BharatGPT, Sarvam, Krutrim
India's "public AI stack" in 2026 is no longer a slogan. There is now a real budget line, a real compute pool, and a real catalog of foundation models that were trained from scratch on Indian languages. The shape it has taken — public-private, mission-funded, ecosystem-led — is meaningfully different from how the US, EU, or China have approached the same problem.
The IndiaAI Mission, in one paragraph
Approved in 2024 and operational from 2025 onward, the IndiaAI Mission is a ₹10,372 crore (roughly US$1.25B) multi-year program with seven pillars: compute capacity, foundation models, datasets, applications, talent, startup financing, and safe/trusted AI (Indian government program summary, 2026). The thing builders care about most is the foundation-model pillar and the GPU pool that backs it.
The Ministry of Electronics and Information Technology (MeitY) selected an initial cohort of startups to build indigenous foundation models with subsidized compute access. Sarvam AI, SoketAI, Gan AI, and Gnani AI were the founding selections in early 2025, with Sarvam announced first in April 2025 (Wikipedia, "Sarvam AI"). Additional cohorts have been added since.
Sarvam AI
Sarvam, headquartered in Bengaluru, is the highest-profile of the IndiaAI cohort. In February 2026 it announced two foundational models trained from scratch with Indic-first datasets:
Sarvam also ships smaller production-friendly models (the Sarvam-1 / Sarvam-2 family) plus speech models for Indian-language STT and TTS. Their distinguishing bet is treating Indic languages as first-class citizens during pre-training, rather than as a fine-tune layer on top of a primarily-English base. [Inference: this is consistent with their public statements but the exact training-data ratios are not all public.]
Krutrim
Krutrim, founded by Bhavish Aggarwal (also of Ola), launched in 2023 and operates separately from the IndiaAI Mission cohort. The flagship Krutrim model handles 22 Indian languages and was trained on 2 trillion+ tokens (Rest of World, 2026). Krutrim has emphasized building the underlying infrastructure stack — compute, cloud, model serving — alongside the model itself, and has positioned itself as a vertically integrated alternative.
BharatGPT and BharatGen
BharatGPT is the conversational-AI product line developed primarily by CoRover (the AI assistant vendor behind IRCTC's "AskDisha"). By the IRCTC deployment, BharatGPT is already handling millions of queries per month in Hindi and 11+ other Indian languages, which is one of the largest production Indic NLP workloads in the country (Organiser, 2026). [Unverified at the exact volume — public claims; not externally audited.]
BharatGen is the parallel academic-led foundation-model initiative anchored at IIT Bombay and TIH-Foundation for IoT and IoE, focused on building large multimodal models for Indian languages with public datasets and open-research norms.
Bhashini
Bhashini is the Government of India's National Language Translation Mission — a public dataset, model, and API platform for translation, ASR, and TTS across Indian languages. It is the layer most non-AI software companies actually integrate, because the API is free at modest tiers and the datasets are publicly available. Bhashini sits below the foundation-model layer and above the application layer, and a lot of the indigenous AI activity in 2026 is some combination of "fine-tune a foundation model on Bhashini-derived data and ship a vertical product."
Compute: the IndiaAI Compute Portal
The compute pillar has been the most operationally important piece of the mission. The IndiaAI Compute Portal aggregates GPU capacity from empanelled cloud providers, and selected startups, researchers, and government projects can apply for subsidized hours. The targeted scale is in the 10,000+ GPU range as of 2026, dominated by H100 and H200-class hardware. [Unverified — the portal's published capacity has been moving up through 2026; check the official IndiaAI announcements for current numbers.]
What the stack actually does well
What it has not yet matched
Two honest gaps:
What this means for builders
If you are building for an Indian-market user base in 2026:
The shorter version: India's public AI stack in 2026 is not theoretical anymore. It is a real set of models, a real compute pool, and a real set of customers. The honest thing to say is that the global frontier is still elsewhere, but the gap on Indic-language production workloads has closed materially, and the funding ramp suggests it will keep closing.


