Picture for Xiao Liu

Xiao Liu

School of Computer Science and Technology, Anhui University

Research advances on fish feeding behavior recognition and intensity quantification methods in aquaculture

Add code
Feb 21, 2025
Viaarxiv icon

GATE: Graph-based Adaptive Tool Evolution Across Diverse Tasks

Add code
Feb 20, 2025
Viaarxiv icon

Redistribute Ensemble Training for Mitigating Memorization in Diffusion Models

Add code
Feb 13, 2025
Viaarxiv icon

Optimizing Large Language Model Training Using FP4 Quantization

Add code
Jan 28, 2025
Figure 1 for Optimizing Large Language Model Training Using FP4 Quantization
Figure 2 for Optimizing Large Language Model Training Using FP4 Quantization
Figure 3 for Optimizing Large Language Model Training Using FP4 Quantization
Figure 4 for Optimizing Large Language Model Training Using FP4 Quantization
Viaarxiv icon

Sigma: Differential Rescaling of Query, Key and Value for Efficient Language Models

Add code
Jan 23, 2025
Figure 1 for Sigma: Differential Rescaling of Query, Key and Value for Efficient Language Models
Figure 2 for Sigma: Differential Rescaling of Query, Key and Value for Efficient Language Models
Figure 3 for Sigma: Differential Rescaling of Query, Key and Value for Efficient Language Models
Figure 4 for Sigma: Differential Rescaling of Query, Key and Value for Efficient Language Models
Viaarxiv icon

EpiCoder: Encompassing Diversity and Complexity in Code Generation

Add code
Jan 08, 2025
Viaarxiv icon

VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation

Add code
Dec 30, 2024
Viaarxiv icon

Ensembling Large Language Models with Process Reward-Guided Tree Search for Better Complex Reasoning

Add code
Dec 20, 2024
Figure 1 for Ensembling Large Language Models with Process Reward-Guided Tree Search for Better Complex Reasoning
Figure 2 for Ensembling Large Language Models with Process Reward-Guided Tree Search for Better Complex Reasoning
Figure 3 for Ensembling Large Language Models with Process Reward-Guided Tree Search for Better Complex Reasoning
Figure 4 for Ensembling Large Language Models with Process Reward-Guided Tree Search for Better Complex Reasoning
Viaarxiv icon

SPaR: Self-Play with Tree-Search Refinement to Improve Instruction-Following in Large Language Models

Add code
Dec 16, 2024
Viaarxiv icon

Does RLHF Scale? Exploring the Impacts From Data, Model, and Method

Add code
Dec 08, 2024
Figure 1 for Does RLHF Scale? Exploring the Impacts From Data, Model, and Method
Figure 2 for Does RLHF Scale? Exploring the Impacts From Data, Model, and Method
Figure 3 for Does RLHF Scale? Exploring the Impacts From Data, Model, and Method
Figure 4 for Does RLHF Scale? Exploring the Impacts From Data, Model, and Method
Viaarxiv icon