Picture for Zhichao Wang

Zhichao Wang

James

Rate, Explain and Cite (REC): Enhanced Explanation and Attribution in Automatic Evaluation by Large Language Models

Add code
Nov 03, 2024
Viaarxiv icon

UFT: Unifying Fine-Tuning of SFT and RLHF/DPO/UNA through a Generalized Implicit Reward Function

Add code
Oct 28, 2024
Viaarxiv icon

FedMABA: Towards Fair Federated Learning through Multi-Armed Bandits Allocation

Add code
Oct 26, 2024
Viaarxiv icon

UNA: Unifying Alignments of RLHF/PPO, DPO and KTO by a Generalized Implicit Reward Function

Add code
Aug 27, 2024
Viaarxiv icon

StreamVoice+: Evolving into End-to-end Streaming Zero-shot Voice Conversion

Add code
Aug 05, 2024
Figure 1 for StreamVoice+: Evolving into End-to-end Streaming Zero-shot Voice Conversion
Figure 2 for StreamVoice+: Evolving into End-to-end Streaming Zero-shot Voice Conversion
Figure 3 for StreamVoice+: Evolving into End-to-end Streaming Zero-shot Voice Conversion
Viaarxiv icon

Universality of kernel random matrices and kernel regression in the quadratic regime

Add code
Aug 02, 2024
Viaarxiv icon

McGAN: Generating Manufacturable Designs by Embedding Manufacturing Rules into Conditional Generative Adversarial Network

Add code
Jul 24, 2024
Viaarxiv icon

A Comprehensive Survey of LLM Alignment Techniques: RLHF, RLAIF, PPO, DPO and More

Add code
Jul 23, 2024
Viaarxiv icon

PAFT: A Parallel Training Paradigm for Effective LLM Fine-Tuning

Add code
Jun 25, 2024
Viaarxiv icon

Save It All: Enabling Full Parameter Tuning for Federated Large Language Models via Cycle Black Gradient Descent

Add code
Jun 17, 2024
Viaarxiv icon