The Economic Times

A groundbreaking speech recognition benchmark called *Voice of India*, developed by Josh Talks and AI4Bharat at IIT Madras, has exposed significant performance failures in global AI systems when processing Indian languages and accents. The evaluation, covering 15 languages and approximately 35,000 speakers, reveals that leading international models from OpenAI and Microsoft struggle dramatically with how Indians actually speak.
India-focused Sarvam Audio consistently outperforms global competitors; OpenAI's models in particular trail by more than 50 percentage points in accuracy. The benchmark also highlights disparities between language families: all models perform better on Indo-Aryan languages such as Hindi and Bengali (a word error rate, or WER, of 5-6%) than on Dravidian languages such as Tamil and Telugu (15-20% WER). For regional Hindi dialects such as Bhojpuri and Chhattisgarhi, spoken by tens of millions, error rates jump to 20-30%.
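For readers unfamiliar with the metric, word error rate is conventionally computed as the number of word substitutions, deletions, and insertions needed to turn a system's transcript into the reference transcript, divided by the number of words in the reference. The sketch below illustrates that standard definition only; the function name `word_error_rate` and the sample sentences are hypothetical, and nothing here is taken from the benchmark's own scoring code, which may normalize text differently.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Standard WER: word-level edit distance divided by reference length.

    Illustrative only; assumes whitespace tokenization and exact word match.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]  # deleting all i reference words matches an empty hypothesis
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1      # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr.append(min(substitution, deletion, insertion))
        prev = curr

    return prev[-1] / len(ref)


# Hypothetical example: one substituted word and one dropped word
# against a four-word reference gives 2 / 4.
print(word_error_rate("open my bank account", "open the bank"))  # 0.5
```

On this definition, a 5% WER means roughly one word in twenty is wrong, while a 20-30% WER means as many as one word in every three to five is wrong.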
The evaluation uniquely incorporates code-switched speech, background noise, and geographic dialect variations that reflect real-world Indian conversations. With voice becoming the primary digital interface for banking, healthcare, and government services, these high error rates have serious implications for digital inclusion across India's diverse linguistic landscape.
MP Jugal Kishore Sharma Addresses Lok Sabha in Dogri, Demands Railway Expansion for Jammu-Katra Route
Kannada Sahitya Sammelana calls for literature to preserve human values amid technological advancement
VoM News Launches Kashmiri Language Website to Preserve Regional Journalism
AIIMS Jammu Conducts Hindi Workshop to Promote Official Language Use
Karnataka Excludes Third Language Marks From SSLC Total Score, Students To Receive Only Grades
Bihar to Appoint 2,000 Urdu Translators to Strengthen Second Official Language Status
