Picture for James Thorne

James Thorne

Evaluating the Consistency of LLM Evaluators

Add code
Nov 30, 2024
Viaarxiv icon

The Automated Verification of Textual Claims (AVeriTeC) Shared Task

Add code
Oct 31, 2024
Figure 1 for The Automated Verification of Textual Claims (AVeriTeC) Shared Task
Figure 2 for The Automated Verification of Textual Claims (AVeriTeC) Shared Task
Figure 3 for The Automated Verification of Textual Claims (AVeriTeC) Shared Task
Figure 4 for The Automated Verification of Textual Claims (AVeriTeC) Shared Task
Viaarxiv icon

Cross-lingual Transfer of Reward Models in Multilingual Alignment

Add code
Oct 23, 2024
Figure 1 for Cross-lingual Transfer of Reward Models in Multilingual Alignment
Figure 2 for Cross-lingual Transfer of Reward Models in Multilingual Alignment
Figure 3 for Cross-lingual Transfer of Reward Models in Multilingual Alignment
Figure 4 for Cross-lingual Transfer of Reward Models in Multilingual Alignment
Viaarxiv icon

Stable Language Model Pre-training by Reducing Embedding Variability

Add code
Sep 12, 2024
Viaarxiv icon

Margin-aware Preference Optimization for Aligning Diffusion Models without Reference

Add code
Jun 10, 2024
Viaarxiv icon

Block Transformer: Global-to-Local Language Modeling for Fast Inference

Add code
Jun 04, 2024
Figure 1 for Block Transformer: Global-to-Local Language Modeling for Fast Inference
Figure 2 for Block Transformer: Global-to-Local Language Modeling for Fast Inference
Figure 3 for Block Transformer: Global-to-Local Language Modeling for Fast Inference
Figure 4 for Block Transformer: Global-to-Local Language Modeling for Fast Inference
Viaarxiv icon

Epistemology of Language Models: Do Language Models Have Holistic Knowledge?

Add code
Mar 19, 2024
Viaarxiv icon

BEnQA: A Question Answering and Reasoning Benchmark for Bengali and English

Add code
Mar 16, 2024
Viaarxiv icon

CLIcK: A Benchmark Dataset of Cultural and Linguistic Intelligence in Korean

Add code
Mar 15, 2024
Viaarxiv icon

ORPO: Monolithic Preference Optimization without Reference Model

Add code
Mar 14, 2024
Viaarxiv icon