Wentian Zhao

HiPRAG: Hierarchical Process Rewards for Efficient Agentic Retrieval Augmented Generation

Oct 09, 2025
Via arXiv

EPO: Entropy-regularized Policy Optimization for LLM Agents Reinforcement Learning

Sep 26, 2025
Via arXiv

Give Me FP32 or Give Me Death? Challenges and Solutions for Reproducible Reasoning

Jun 11, 2025
Via arXiv

DUMP: Automated Distribution-Level Curriculum Learning for RL-based LLM Post-training

Apr 13, 2025
Via arXiv

Efficient Self-Improvement in Multimodal Large Language Models: A Model-Level Judge-Free Approach

Nov 26, 2024
Via arXiv

Quadratic Is Not What You Need For Multimodal Large Language Models

Oct 08, 2024
Via arXiv

DL3DV-10K: A Large-Scale Scene Dataset for Deep Learning-based 3D Vision

Dec 29, 2023
Via arXiv

Text2Layer: Layered Image Generation using Latent Diffusion Model

Jul 19, 2023
Via arXiv

Boosting Entity-aware Image Captioning with Multi-modal Knowledge Graph

Jul 26, 2021
Via arXiv

Video Question Answering on Screencast Tutorials

Aug 02, 2020
Via arXiv