Amit Agarwal

CommonLID: Re-evaluating State-of-the-Art Language Identification Performance on Web Data

Jan 25, 2026

LLM-Guided Lifecycle-Aware Clustering of Multi-Turn Customer Support Conversations

Jan 07, 2026

FlexDoc: Parameterized Sampling for Diverse Multilingual Synthetic Documents for Training Document Understanding Models

Oct 02, 2025

Hard Negative Mining for Domain-Specific Retrieval in Enterprise Systems

May 23, 2025

SweEval: Do LLMs Really Swear? A Safety Benchmark for Testing Limits for Enterprise Use

May 22, 2025

FS-DAG: Few Shot Domain Adapting Graph Networks for Visually Rich Document Understanding

May 22, 2025

Tokenization Matters: Improving Zero-Shot NER for Indic Languages

Apr 23, 2025

Crowdsource, Crawl, or Generate? Creating SEA-VL, a Multicultural Vision-Language Dataset for Southeast Asia

Mar 10, 2025

Improving Clinical Question Answering with Multi-Task Learning: A Joint Approach for Answer Extraction and Medical Categorization

Feb 18, 2025

MVTamperBench: Evaluating Robustness of Vision-Language Models

Dec 27, 2024