Picture for Minghao Wu

Minghao Wu

New Trends for Modern Machine Translation with Large Reasoning Models

Add code
Mar 13, 2025
Viaarxiv icon

Towards Widening The Distillation Bottleneck for Reasoning Models

Add code
Mar 03, 2025
Viaarxiv icon

Demystifying Multilingual Chain-of-Thought in Process Reward Modeling

Add code
Feb 18, 2025
Viaarxiv icon

Findings of the WMT 2024 Shared Task on Discourse-Level Literary Translation

Add code
Dec 16, 2024
Viaarxiv icon

Bridging the Language Gaps in Large Language Models with Inference-Time Cross-Lingual Intervention

Add code
Oct 16, 2024
Figure 1 for Bridging the Language Gaps in Large Language Models with Inference-Time Cross-Lingual Intervention
Figure 2 for Bridging the Language Gaps in Large Language Models with Inference-Time Cross-Lingual Intervention
Figure 3 for Bridging the Language Gaps in Large Language Models with Inference-Time Cross-Lingual Intervention
Figure 4 for Bridging the Language Gaps in Large Language Models with Inference-Time Cross-Lingual Intervention
Viaarxiv icon

The Best of Both Worlds: Bridging Quality and Diversity in Data Selection with Bipartite Graph

Add code
Oct 16, 2024
Viaarxiv icon

Rewarding What Matters: Step-by-Step Reinforcement Learning for Task-Oriented Dialogue

Add code
Jun 20, 2024
Figure 1 for Rewarding What Matters: Step-by-Step Reinforcement Learning for Task-Oriented Dialogue
Figure 2 for Rewarding What Matters: Step-by-Step Reinforcement Learning for Task-Oriented Dialogue
Figure 3 for Rewarding What Matters: Step-by-Step Reinforcement Learning for Task-Oriented Dialogue
Figure 4 for Rewarding What Matters: Step-by-Step Reinforcement Learning for Task-Oriented Dialogue
Viaarxiv icon

Mixture-of-Skills: Learning to Optimize Data Usage for Fine-Tuning Large Language Models

Add code
Jun 13, 2024
Figure 1 for Mixture-of-Skills: Learning to Optimize Data Usage for Fine-Tuning Large Language Models
Figure 2 for Mixture-of-Skills: Learning to Optimize Data Usage for Fine-Tuning Large Language Models
Figure 3 for Mixture-of-Skills: Learning to Optimize Data Usage for Fine-Tuning Large Language Models
Figure 4 for Mixture-of-Skills: Learning to Optimize Data Usage for Fine-Tuning Large Language Models
Viaarxiv icon

(Perhaps) Beyond Human Translation: Harnessing Multi-Agent Collaboration for Translating Ultra-Long Literary Texts

Add code
May 20, 2024
Viaarxiv icon

EmbSum: Leveraging the Summarization Capabilities of Large Language Models for Content-Based Recommendations

Add code
May 19, 2024
Viaarxiv icon