Overview
- India Today’s OSINT evaluation found Sarvam Vision produced more accurate, ready-to-use outputs than ChatGPT, Google Gemini, xAI Grok and DeepSeek on Sanskrit translation, a Gujarati rural-governance application, handwritten OCR and table extraction.
- Sarvam reports 93%+ on OmniDocBench v1.5 and 84.3% on the English subset of olmOCR-Bench, and claims to outperform models such as Gemini 3 Pro and DeepSeek OCR 2 on document and OCR benchmarks.
- The company introduced a new brand identity ahead of the India AI Impact Summit and is positioning its offerings as India-first tools focused on Indic scripts, multilingual support and local use cases.
- Sarvam Vision targets document understanding across 22 languages, preserving complex tables, charts and graphs, while Bulbul V3 delivers text-to-speech with 35+ voices across 11 Indian languages with support for code-mixing and regional accents.
- India Today noted that its tests did not cover reasoning depth, coding or long-context performance, so Sarvam’s advantages appear task-specific rather than a broad lead over frontier models.