Xiao Liu

School of Computer Science and Technology, Anhui University

Sigma: Differential Rescaling of Query, Key and Value for Efficient Language Models

Jan 23, 2025

EpiCoder: Encompassing Diversity and Complexity in Code Generation

Jan 08, 2025

VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation

Dec 30, 2024

Ensembling Large Language Models with Process Reward-Guided Tree Search for Better Complex Reasoning

Dec 20, 2024

SPaR: Self-Play with Tree-Search Refinement to Improve Instruction-Following in Large Language Models

Dec 16, 2024

Does RLHF Scale? Exploring the Impacts From Data, Model, and Method

Dec 08, 2024

Velocitune: A Velocity-based Dynamic Domain Reweighting Method for Continual Pre-training

Nov 21, 2024

CLMIA: Membership Inference Attacks via Unsupervised Contrastive Learning

Nov 17, 2024

Aligned Vector Quantization for Edge-Cloud Collabrative Vision-Language Models

Nov 08, 2024

WebRL: Training LLM Web Agents via Self-Evolving Online Curriculum Reinforcement Learning

Nov 04, 2024