Xiaoying Zhang

Conversational Dueling Bandits in Generalized Linear Models

Jul 26, 2024

User-Creator Feature Dynamics in Recommender Systems with Dual Influence

Jul 19, 2024

Toward Optimal LLM Alignments Using Two-Player Games

Jun 16, 2024

Self-Tuning: Instructing LLMs to Effectively Acquire New Knowledge through Self-Teaching

Jun 11, 2024

GI-Free Pilot-Aided Channel Estimation for Affine Frequency Division Multiplexing Systems

Apr 01, 2024

Improving Reinforcement Learning from Human Feedback Using Contrastive Rewards

Mar 14, 2024

Overcoming Reward Overoptimization via Adversarial Policy Optimization with Lightweight Uncertainty Estimation

Mar 08, 2024

Self-Alignment for Factuality: Mitigating Hallucinations in LLMs via Self-Evaluation

Feb 14, 2024

Human-Instruction-Free LLM Self-Alignment with Limited Samples

Jan 06, 2024

Rethinking Machine Ethics -- Can LLMs Perform Moral Reasoning through the Lens of Moral Theories?

Aug 29, 2023