Picture for Thibault Sellam

Thibault Sellam

Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

Add code
Mar 08, 2024
Viaarxiv icon

Gemini: A Family of Highly Capable Multimodal Models

Add code
Dec 19, 2023
Viaarxiv icon

SEAHORSE: A Multilingual, Multifaceted Dataset for Summarization Evaluation

Add code
May 22, 2023
Figure 1 for SEAHORSE: A Multilingual, Multifaceted Dataset for Summarization Evaluation
Figure 2 for SEAHORSE: A Multilingual, Multifaceted Dataset for Summarization Evaluation
Figure 3 for SEAHORSE: A Multilingual, Multifaceted Dataset for Summarization Evaluation
Figure 4 for SEAHORSE: A Multilingual, Multifaceted Dataset for Summarization Evaluation
Viaarxiv icon

Reward Gaming in Conditional Text Generation

Add code
Nov 16, 2022
Figure 1 for Reward Gaming in Conditional Text Generation
Figure 2 for Reward Gaming in Conditional Text Generation
Figure 3 for Reward Gaming in Conditional Text Generation
Figure 4 for Reward Gaming in Conditional Text Generation
Viaarxiv icon

Dialect-robust Evaluation of Generated Text

Add code
Nov 02, 2022
Figure 1 for Dialect-robust Evaluation of Generated Text
Figure 2 for Dialect-robust Evaluation of Generated Text
Figure 3 for Dialect-robust Evaluation of Generated Text
Figure 4 for Dialect-robust Evaluation of Generated Text
Viaarxiv icon

SQuId: Measuring Speech Naturalness in Many Languages

Add code
Oct 12, 2022
Figure 1 for SQuId: Measuring Speech Naturalness in Many Languages
Figure 2 for SQuId: Measuring Speech Naturalness in Many Languages
Figure 3 for SQuId: Measuring Speech Naturalness in Many Languages
Figure 4 for SQuId: Measuring Speech Naturalness in Many Languages
Viaarxiv icon

Repairing the Cracked Foundation: A Survey of Obstacles in Evaluation Practices for Generated Text

Add code
Feb 14, 2022
Viaarxiv icon

Learning Compact Metrics for MT

Add code
Oct 12, 2021
Figure 1 for Learning Compact Metrics for MT
Figure 2 for Learning Compact Metrics for MT
Figure 3 for Learning Compact Metrics for MT
Figure 4 for Learning Compact Metrics for MT
Viaarxiv icon

The MultiBERTs: BERT Reproductions for Robustness Analysis

Add code
Jun 30, 2021
Figure 1 for The MultiBERTs: BERT Reproductions for Robustness Analysis
Figure 2 for The MultiBERTs: BERT Reproductions for Robustness Analysis
Figure 3 for The MultiBERTs: BERT Reproductions for Robustness Analysis
Figure 4 for The MultiBERTs: BERT Reproductions for Robustness Analysis
Viaarxiv icon

The GEM Benchmark: Natural Language Generation, its Evaluation and Metrics

Add code
Feb 03, 2021
Figure 1 for The GEM Benchmark: Natural Language Generation, its Evaluation and Metrics
Figure 2 for The GEM Benchmark: Natural Language Generation, its Evaluation and Metrics
Figure 3 for The GEM Benchmark: Natural Language Generation, its Evaluation and Metrics
Figure 4 for The GEM Benchmark: Natural Language Generation, its Evaluation and Metrics
Viaarxiv icon