Picture for Yuzi Yan

Yuzi Yan

Boosting Deductive Reasoning with Step Signals In RLHF

Add code
Oct 12, 2024
Viaarxiv icon

Uncertainty-aware Reward Model: Teaching Reward Models to Know What is Unknown

Add code
Oct 01, 2024
Viaarxiv icon

3D-Properties: Identifying Challenges in DPO and Charting a Path Forward

Add code
Jun 11, 2024
Figure 1 for 3D-Properties: Identifying Challenges in DPO and Charting a Path Forward
Figure 2 for 3D-Properties: Identifying Challenges in DPO and Charting a Path Forward
Figure 3 for 3D-Properties: Identifying Challenges in DPO and Charting a Path Forward
Figure 4 for 3D-Properties: Identifying Challenges in DPO and Charting a Path Forward
Viaarxiv icon

Exploring the LLM Journey from Cognition to Expression with Linear Representations

Add code
May 27, 2024
Figure 1 for Exploring the LLM Journey from Cognition to Expression with Linear Representations
Figure 2 for Exploring the LLM Journey from Cognition to Expression with Linear Representations
Figure 3 for Exploring the LLM Journey from Cognition to Expression with Linear Representations
Figure 4 for Exploring the LLM Journey from Cognition to Expression with Linear Representations
Viaarxiv icon

Distributed Policy Gradient for Linear Quadratic Networked Control with Limited Communication Range

Add code
Mar 05, 2024
Figure 1 for Distributed Policy Gradient for Linear Quadratic Networked Control with Limited Communication Range
Figure 2 for Distributed Policy Gradient for Linear Quadratic Networked Control with Limited Communication Range
Figure 3 for Distributed Policy Gradient for Linear Quadratic Networked Control with Limited Communication Range
Figure 4 for Distributed Policy Gradient for Linear Quadratic Networked Control with Limited Communication Range
Viaarxiv icon

Mixed-Phoneme BERT: Improving BERT with Mixed Phoneme and Sup-Phoneme Representations for Text to Speech

Add code
Mar 31, 2022
Figure 1 for Mixed-Phoneme BERT: Improving BERT with Mixed Phoneme and Sup-Phoneme Representations for Text to Speech
Figure 2 for Mixed-Phoneme BERT: Improving BERT with Mixed Phoneme and Sup-Phoneme Representations for Text to Speech
Figure 3 for Mixed-Phoneme BERT: Improving BERT with Mixed Phoneme and Sup-Phoneme Representations for Text to Speech
Figure 4 for Mixed-Phoneme BERT: Improving BERT with Mixed Phoneme and Sup-Phoneme Representations for Text to Speech
Viaarxiv icon

Relative Distributed Formation and Obstacle Avoidance with Multi-agent Reinforcement Learning

Add code
Nov 14, 2021
Figure 1 for Relative Distributed Formation and Obstacle Avoidance with Multi-agent Reinforcement Learning
Figure 2 for Relative Distributed Formation and Obstacle Avoidance with Multi-agent Reinforcement Learning
Figure 3 for Relative Distributed Formation and Obstacle Avoidance with Multi-agent Reinforcement Learning
Figure 4 for Relative Distributed Formation and Obstacle Avoidance with Multi-agent Reinforcement Learning
Viaarxiv icon

Full Attention Bidirectional Deep Learning Structure for Single Channel Speech Enhancement

Add code
Aug 27, 2021
Figure 1 for Full Attention Bidirectional Deep Learning Structure for Single Channel Speech Enhancement
Figure 2 for Full Attention Bidirectional Deep Learning Structure for Single Channel Speech Enhancement
Figure 3 for Full Attention Bidirectional Deep Learning Structure for Single Channel Speech Enhancement
Figure 4 for Full Attention Bidirectional Deep Learning Structure for Single Channel Speech Enhancement
Viaarxiv icon

AdaSpeech 3: Adaptive Text to Speech for Spontaneous Style

Add code
Jul 06, 2021
Figure 1 for AdaSpeech 3: Adaptive Text to Speech for Spontaneous Style
Figure 2 for AdaSpeech 3: Adaptive Text to Speech for Spontaneous Style
Figure 3 for AdaSpeech 3: Adaptive Text to Speech for Spontaneous Style
Figure 4 for AdaSpeech 3: Adaptive Text to Speech for Spontaneous Style
Viaarxiv icon

AdaSpeech 2: Adaptive Text to Speech with Untranscribed Data

Add code
Apr 20, 2021
Figure 1 for AdaSpeech 2: Adaptive Text to Speech with Untranscribed Data
Figure 2 for AdaSpeech 2: Adaptive Text to Speech with Untranscribed Data
Figure 3 for AdaSpeech 2: Adaptive Text to Speech with Untranscribed Data
Figure 4 for AdaSpeech 2: Adaptive Text to Speech with Untranscribed Data
Viaarxiv icon