An independent mechanistic and adversarial audit of Sarvam-30B and Sarvam-105B across 14 Indian languages. The models are 6x more likely to comply with harmful requests in those languages than in English.
Releasing the First Open 1B+ Language Model with Hyperconnections