Anoop Kunchukuttan

Nilekani Centre at AI4Bharat, Indian Institute of Technology Madras, India; Microsoft, India

The Reasoning Lingua Franca: A Double-Edged Sword for Multilingual AI

Oct 23, 2025, via arXiv

RomanLens: Latent Romanization and its role in Multilinguality in LLMs

Feb 11, 2025, via arXiv

Pralekha: An Indic Document Alignment Evaluation Benchmark

Nov 28, 2024, via arXiv

BhasaAnuvaad: A Speech Translation Dataset for 14 Indian Languages

Nov 07, 2024, via arXiv

Cross-Lingual Auto Evaluation for Assessing Multilingual LLMs

Oct 17, 2024, via arXiv

An Empirical Comparison of Vocabulary Expansion and Initialization Approaches for Language Models

Jul 08, 2024, via arXiv

How Good is Zero-Shot MT Evaluation for Low Resource Indian Languages?

Jun 06, 2024, via arXiv

Synthetic Data Generation and Joint Learning for Robust Code-Mixed Translation

Mar 25, 2024, via arXiv

IndicLLMSuite: A Blueprint for Creating Pre-training and Fine-Tuning Datasets for Indian Languages

Mar 11, 2024, via arXiv

Airavata: Introducing Hindi Instruction-tuned LLM

Jan 26, 2024, via arXiv