Picture for Zhong Zhang

Zhong Zhang

2D-Curri-DPO: Two-Dimensional Curriculum Learning for Direct Preference Optimization

Add code
Apr 10, 2025
Viaarxiv icon

Learning to Generate Structured Output with Schema Reinforcement Learning

Add code
Feb 26, 2025
Viaarxiv icon

AgentRM: Enhancing Agent Generalization with Reward Modeling

Add code
Feb 25, 2025
Viaarxiv icon

WorkflowLLM: Enhancing Workflow Orchestration Capability of Large Language Models

Add code
Nov 08, 2024
Viaarxiv icon

Distance between Relevant Information Pieces Causes Bias in Long-Context LLMs

Add code
Oct 18, 2024
Figure 1 for Distance between Relevant Information Pieces Causes Bias in Long-Context LLMs
Figure 2 for Distance between Relevant Information Pieces Causes Bias in Long-Context LLMs
Figure 3 for Distance between Relevant Information Pieces Causes Bias in Long-Context LLMs
Figure 4 for Distance between Relevant Information Pieces Causes Bias in Long-Context LLMs
Viaarxiv icon

Proactive Agent: Shifting LLM Agents from Reactive Responses to Active Assistance

Add code
Oct 16, 2024
Figure 1 for Proactive Agent: Shifting LLM Agents from Reactive Responses to Active Assistance
Figure 2 for Proactive Agent: Shifting LLM Agents from Reactive Responses to Active Assistance
Figure 3 for Proactive Agent: Shifting LLM Agents from Reactive Responses to Active Assistance
Figure 4 for Proactive Agent: Shifting LLM Agents from Reactive Responses to Active Assistance
Viaarxiv icon

Learning Evolving Tools for Large Language Models

Add code
Oct 09, 2024
Figure 1 for Learning Evolving Tools for Large Language Models
Figure 2 for Learning Evolving Tools for Large Language Models
Figure 3 for Learning Evolving Tools for Large Language Models
Figure 4 for Learning Evolving Tools for Large Language Models
Viaarxiv icon

Self-Introspective Decoding: Alleviating Hallucinations for Large Vision-Language Models

Add code
Aug 04, 2024
Viaarxiv icon

Mix-Domain Contrastive Learning for Unpaired H&E-to-IHC Stain Translation

Add code
Jun 17, 2024
Viaarxiv icon

MGCP: A Multi-Grained Correlation based Prediction Network for Multivariate Time Series

Add code
May 30, 2024
Figure 1 for MGCP: A Multi-Grained Correlation based Prediction Network for Multivariate Time Series
Figure 2 for MGCP: A Multi-Grained Correlation based Prediction Network for Multivariate Time Series
Figure 3 for MGCP: A Multi-Grained Correlation based Prediction Network for Multivariate Time Series
Figure 4 for MGCP: A Multi-Grained Correlation based Prediction Network for Multivariate Time Series
Viaarxiv icon