Picture for Yixiao Li

Yixiao Li

Adaptive Preference Scaling for Reinforcement Learning with Human Feedback

Add code
Jun 04, 2024
Viaarxiv icon

DBPF: A Framework for Efficient and Robust Dynamic Bin-Picking

Add code
Mar 25, 2024
Viaarxiv icon

LoftQ: LoRA-Fine-Tuning-Aware Quantization for Large Language Models

Add code
Oct 23, 2023
Viaarxiv icon

Deep Reinforcement Learning from Hierarchical Weak Preference Feedback

Add code
Sep 06, 2023
Viaarxiv icon

LoSparse: Structured Compression of Large Language Models based on Low-Rank and Sparse Approximation

Add code
Jun 26, 2023
Viaarxiv icon

A Review of Changepoint Detection Models

Add code
Aug 20, 2019
Viaarxiv icon