Roberto Navigli

Word Sense Linking: Disambiguating Outside the Sandbox

Dec 12, 2024

Truth or Mirage? Towards End-to-End Factuality Evaluation with LLM-Oasis

Dec 02, 2024

Multimodal Large Language Models and Tunings: Vision, Language, Sensors, Audio, and Beyond

Oct 08, 2024

Beyond Correlation: Interpretable Evaluation of Machine Translation Metrics

Oct 07, 2024

ZEBRA: Zero-Shot Example-Based Retrieval Augmentation for Commonsense Question Answering

Oct 07, 2024

Guardians of the Machine Translation Meta-Evaluation: Sentinel Metrics Fall In!

Aug 25, 2024

AutoML-guided Fusion of Entity and LLM-based representations

Aug 19, 2024

Maverick: Efficient and Accurate Coreference Resolution Defying Recent Trends

Jul 31, 2024

ReLiK: Retrieve and LinK, Fast and Accurate Entity Linking and Relation Extraction on an Academic Budget

Jul 31, 2024

ALERT: A Comprehensive Benchmark for Assessing Large Language Models' Safety through Red Teaming

Apr 06, 2024