Picture for Pengzhi Gao

Pengzhi Gao

STEP: Success-Rate-Aware Trajectory-Efficient Policy Optimization

Add code
Nov 17, 2025
Viaarxiv icon

Revisiting Entropy in Reinforcement Learning for Large Reasoning Models

Add code
Nov 08, 2025
Viaarxiv icon

BacktrackAgent: Enhancing GUI Agent with Error Detection and Backtracking Mechanism

Add code
May 27, 2025
Figure 1 for BacktrackAgent: Enhancing GUI Agent with Error Detection and Backtracking Mechanism
Figure 2 for BacktrackAgent: Enhancing GUI Agent with Error Detection and Backtracking Mechanism
Figure 3 for BacktrackAgent: Enhancing GUI Agent with Error Detection and Backtracking Mechanism
Figure 4 for BacktrackAgent: Enhancing GUI Agent with Error Detection and Backtracking Mechanism
Viaarxiv icon

Enhance Mobile Agents Thinking Process Via Iterative Preference Learning

Add code
May 18, 2025
Figure 1 for Enhance Mobile Agents Thinking Process Via Iterative Preference Learning
Figure 2 for Enhance Mobile Agents Thinking Process Via Iterative Preference Learning
Figure 3 for Enhance Mobile Agents Thinking Process Via Iterative Preference Learning
Figure 4 for Enhance Mobile Agents Thinking Process Via Iterative Preference Learning
Viaarxiv icon

Multilingual Machine Translation with Open Large Language Models at Practical Scale: An Empirical Study

Add code
Feb 04, 2025
Viaarxiv icon

LLMs Assist NLP Researchers: Critique Paper (Meta-)Reviewing

Add code
Jun 25, 2024
Figure 1 for LLMs Assist NLP Researchers: Critique Paper (Meta-)Reviewing
Figure 2 for LLMs Assist NLP Researchers: Critique Paper (Meta-)Reviewing
Figure 3 for LLMs Assist NLP Researchers: Critique Paper (Meta-)Reviewing
Figure 4 for LLMs Assist NLP Researchers: Critique Paper (Meta-)Reviewing
Viaarxiv icon

Towards Boosting Many-to-Many Multilingual Machine Translation with Large Language Models

Add code
Jan 11, 2024
Viaarxiv icon

An Empirical Study of Consistency Regularization for End-to-End Speech-to-Text Translation

Add code
Aug 28, 2023
Figure 1 for An Empirical Study of Consistency Regularization for End-to-End Speech-to-Text Translation
Figure 2 for An Empirical Study of Consistency Regularization for End-to-End Speech-to-Text Translation
Figure 3 for An Empirical Study of Consistency Regularization for End-to-End Speech-to-Text Translation
Figure 4 for An Empirical Study of Consistency Regularization for End-to-End Speech-to-Text Translation
Viaarxiv icon

Learning Multilingual Sentence Representations with Cross-lingual Consistency Regularization

Add code
Jun 12, 2023
Figure 1 for Learning Multilingual Sentence Representations with Cross-lingual Consistency Regularization
Figure 2 for Learning Multilingual Sentence Representations with Cross-lingual Consistency Regularization
Figure 3 for Learning Multilingual Sentence Representations with Cross-lingual Consistency Regularization
Figure 4 for Learning Multilingual Sentence Representations with Cross-lingual Consistency Regularization
Viaarxiv icon

Improving Zero-shot Multilingual Neural Machine Translation by Leveraging Cross-lingual Consistency Regularization

Add code
May 12, 2023
Viaarxiv icon