Picture for Se-Young Yun

Se-Young Yun

When Debate Fails: Bias Reinforcement in Large Language Models

Add code
Mar 21, 2025
Viaarxiv icon

MAVFlow: Preserving Paralinguistic Elements with Conditional Flow Matching for Zero-Shot AV2AV Multilingual Translation

Add code
Mar 14, 2025
Viaarxiv icon

Probability-Flow ODE in Infinite-Dimensional Function Spaces

Add code
Mar 13, 2025
Viaarxiv icon

DistiLLM-2: A Contrastive Approach Boosts the Distillation of LLMs

Add code
Mar 10, 2025
Viaarxiv icon

Self-Training Elicits Concise Reasoning in Large Language Models

Add code
Feb 28, 2025
Viaarxiv icon

What is the Alignment Objective of GRPO?

Add code
Feb 25, 2025
Viaarxiv icon

Conditional Synthesis of 3D Molecules with Time Correction Sampler

Add code
Nov 01, 2024
Viaarxiv icon

FlickerFusion: Intra-trajectory Domain Generalizing Multi-Agent RL

Add code
Oct 21, 2024
Figure 1 for FlickerFusion: Intra-trajectory Domain Generalizing Multi-Agent RL
Figure 2 for FlickerFusion: Intra-trajectory Domain Generalizing Multi-Agent RL
Figure 3 for FlickerFusion: Intra-trajectory Domain Generalizing Multi-Agent RL
Figure 4 for FlickerFusion: Intra-trajectory Domain Generalizing Multi-Agent RL
Viaarxiv icon

Automated Filtering of Human Feedback Data for Aligning Text-to-Image Diffusion Models

Add code
Oct 14, 2024
Figure 1 for Automated Filtering of Human Feedback Data for Aligning Text-to-Image Diffusion Models
Figure 2 for Automated Filtering of Human Feedback Data for Aligning Text-to-Image Diffusion Models
Figure 3 for Automated Filtering of Human Feedback Data for Aligning Text-to-Image Diffusion Models
Figure 4 for Automated Filtering of Human Feedback Data for Aligning Text-to-Image Diffusion Models
Viaarxiv icon

Stable Language Model Pre-training by Reducing Embedding Variability

Add code
Sep 12, 2024
Viaarxiv icon