Qingyang Li

Facet: highly efficient E(3)-equivariant networks for interatomic potentials

Sep 10, 2025
Via arXiv

Towards Reward Fairness in RLHF: From a Resource Allocation Perspective

May 29, 2025
Via arXiv

MCBlock: Boosting Neural Radiance Field Training Speed by MCTS-based Dynamic-Resolution Ray Sampling

Apr 14, 2025
Via arXiv

SPPD: Self-training with Process Preference Learning Using Dynamic Value Margin

Feb 19, 2025
Via arXiv

Video-Text Dataset Construction from Multi-AI Feedback: Promoting Weak-to-Strong Preference Learning for Video Large Language Models

Nov 25, 2024
Via arXiv

Investigating the Synergistic Effects of Dropout and Residual Connections on Language Model Training

Oct 01, 2024
Via arXiv

TSO: Self-Training with Scaled Preference Optimization

Aug 31, 2024
Via arXiv

Towards Comprehensive Preference Data Collection for Reward Modeling

Jun 24, 2024
Via arXiv

DoLLM: How Large Language Models Understanding Network Flow Data to Detect Carpet Bombing DDoS

May 13, 2024
Via arXiv

Synthetic Dialogue Dataset Generation using LLM Agents

Jan 30, 2024
Via arXiv