Yimin Hu

Teaching Language Models to Self-Improve by Learning from Language Feedback

Jun 11, 2024
Via arXiv

Prior Constraints-based Reward Model Training for Aligning Large Language Models

Apr 01, 2024
Via arXiv

BioDrone: A Bionic Drone-based Single Object Tracking Benchmark for Robust Vision

Feb 07, 2024
Via arXiv

Anomaly Detection of Particle Orbit in Accelerator using LSTM Deep Learning Technology

Jan 28, 2024
Via arXiv

ESRL: Efficient Sampling-based Reinforcement Learning for Sequence Generation

Aug 04, 2023
Via arXiv

Improved Knowledge Distillation for Pre-trained Language Models via Knowledge Selection

Feb 01, 2023
Via arXiv