Rongxiang Weng

Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision

Nov 25, 2024
Via arXiv

Multi-Programming Language Sandbox for LLMs

Oct 30, 2024
Via arXiv

Length Desensitization in Directed Preference Optimization

Sep 10, 2024
Via arXiv

What's Wrong with Your Code Generated by Large Language Models? An Extensive Study

Jul 08, 2024
Via arXiv

The Rise and Potential of Large Language Model Based Agents: A Survey

Sep 19, 2023
Via arXiv

Secrets of RLHF in Large Language Models Part I: PPO

Jul 18, 2023
Via arXiv

Towards Reliable Neural Machine Translation with Consistency-Aware Meta-Learning

Mar 20, 2023
Via arXiv

Learning Decoupled Retrieval Representation for Nearest Neighbour Neural Machine Translation

Sep 20, 2022
Via arXiv

Learning to Generalize to More: Continuous Semantic Augmentation for Neural Machine Translation

Apr 14, 2022
Via arXiv

Uncertainty-Aware Semantic Augmentation for Neural Machine Translation

Oct 09, 2020
Via arXiv