Picture for Dongkeun Yoon

Dongkeun Yoon

A 2-step Framework for Automated Literary Translation Evaluation: Its Promises and Pitfalls

Add code
Dec 02, 2024
Viaarxiv icon

MM-Eval: A Multilingual Meta-Evaluation Benchmark for LLM-as-a-Judge and Reward Models

Add code
Oct 23, 2024
Figure 1 for MM-Eval: A Multilingual Meta-Evaluation Benchmark for LLM-as-a-Judge and Reward Models
Figure 2 for MM-Eval: A Multilingual Meta-Evaluation Benchmark for LLM-as-a-Judge and Reward Models
Figure 3 for MM-Eval: A Multilingual Meta-Evaluation Benchmark for LLM-as-a-Judge and Reward Models
Figure 4 for MM-Eval: A Multilingual Meta-Evaluation Benchmark for LLM-as-a-Judge and Reward Models
Viaarxiv icon

The BiGGen Bench: A Principled Benchmark for Fine-grained Evaluation of Language Models with Language Models

Add code
Jun 09, 2024
Figure 1 for The BiGGen Bench: A Principled Benchmark for Fine-grained Evaluation of Language Models with Language Models
Figure 2 for The BiGGen Bench: A Principled Benchmark for Fine-grained Evaluation of Language Models with Language Models
Figure 3 for The BiGGen Bench: A Principled Benchmark for Fine-grained Evaluation of Language Models with Language Models
Figure 4 for The BiGGen Bench: A Principled Benchmark for Fine-grained Evaluation of Language Models with Language Models
Viaarxiv icon

LangBridge: Multilingual Reasoning Without Multilingual Supervision

Add code
Jan 19, 2024
Viaarxiv icon

Gradient Ascent Post-training Enhances Language Model Generalization

Add code
Jun 12, 2023
Viaarxiv icon

Knowledge Unlearning for Mitigating Privacy Risks in Language Models

Add code
Oct 04, 2022
Figure 1 for Knowledge Unlearning for Mitigating Privacy Risks in Language Models
Figure 2 for Knowledge Unlearning for Mitigating Privacy Risks in Language Models
Figure 3 for Knowledge Unlearning for Mitigating Privacy Risks in Language Models
Figure 4 for Knowledge Unlearning for Mitigating Privacy Risks in Language Models
Viaarxiv icon