Picture for Yufeng Zhang

Yufeng Zhang

School of Artificial Intelligence, Sun Yat-sen University, Zhuhai 519082, Guangdong Key Laboratory of Big Data Analysis and Processing, 510006, China

Reward-Augmented Data Enhances Direct Preference Alignment of LLMs

Add code
Oct 10, 2024
Figure 1 for Reward-Augmented Data Enhances Direct Preference Alignment of LLMs
Figure 2 for Reward-Augmented Data Enhances Direct Preference Alignment of LLMs
Figure 3 for Reward-Augmented Data Enhances Direct Preference Alignment of LLMs
Figure 4 for Reward-Augmented Data Enhances Direct Preference Alignment of LLMs
Viaarxiv icon

BabelBench: An Omni Benchmark for Code-Driven Analysis of Multimodal and Multistructured Data

Add code
Oct 01, 2024
Viaarxiv icon

Robust Beamforming Design for Near-Field DMA-NOMA mmWave Communications With Imperfect Position Information

Add code
Sep 24, 2024
Viaarxiv icon

Pattern-Aware Chain-of-Thought Prompting in Large Language Models

Add code
Apr 23, 2024
Viaarxiv icon

A Mean-Field Analysis of Neural Gradient Descent-Ascent: Applications to Functional Conditional Moment Equations

Add code
Apr 18, 2024
Viaarxiv icon

Super-resolution of biomedical volumes with 2D supervision

Add code
Apr 15, 2024
Viaarxiv icon

$\mathbf{}$-Puzzle: A Cost-Efficient Testbed for Benchmarking Reinforcement Learning Algorithms in Generative Language Model

Add code
Mar 11, 2024
Figure 1 for $\mathbf{}$-Puzzle: A Cost-Efficient Testbed for Benchmarking Reinforcement Learning Algorithms in Generative Language Model
Figure 2 for $\mathbf{}$-Puzzle: A Cost-Efficient Testbed for Benchmarking Reinforcement Learning Algorithms in Generative Language Model
Figure 3 for $\mathbf{}$-Puzzle: A Cost-Efficient Testbed for Benchmarking Reinforcement Learning Algorithms in Generative Language Model
Figure 4 for $\mathbf{}$-Puzzle: A Cost-Efficient Testbed for Benchmarking Reinforcement Learning Algorithms in Generative Language Model
Viaarxiv icon

Can Large Language Models Play Games? A Case Study of A Self-Play Approach

Add code
Mar 08, 2024
Figure 1 for Can Large Language Models Play Games? A Case Study of A Self-Play Approach
Figure 2 for Can Large Language Models Play Games? A Case Study of A Self-Play Approach
Figure 3 for Can Large Language Models Play Games? A Case Study of A Self-Play Approach
Figure 4 for Can Large Language Models Play Games? A Case Study of A Self-Play Approach
Viaarxiv icon

CPT: Competence-progressive Training Strategy for Few-shot Node Classification

Add code
Feb 01, 2024
Viaarxiv icon

Answering Subjective Induction Questions on Products by Summarizing Multi-sources Multi-viewpoints Knowledge

Add code
Sep 12, 2023
Viaarxiv icon