Picture for Jialian Li

Jialian Li

Boosting Deductive Reasoning with Step Signals In RLHF

Add code
Oct 12, 2024
Viaarxiv icon

3D-Properties: Identifying Challenges in DPO and Charting a Path Forward

Add code
Jun 11, 2024
Figure 1 for 3D-Properties: Identifying Challenges in DPO and Charting a Path Forward
Figure 2 for 3D-Properties: Identifying Challenges in DPO and Charting a Path Forward
Figure 3 for 3D-Properties: Identifying Challenges in DPO and Charting a Path Forward
Figure 4 for 3D-Properties: Identifying Challenges in DPO and Charting a Path Forward
Viaarxiv icon

Exploring the LLM Journey from Cognition to Expression with Linear Representations

Add code
May 27, 2024
Figure 1 for Exploring the LLM Journey from Cognition to Expression with Linear Representations
Figure 2 for Exploring the LLM Journey from Cognition to Expression with Linear Representations
Figure 3 for Exploring the LLM Journey from Cognition to Expression with Linear Representations
Figure 4 for Exploring the LLM Journey from Cognition to Expression with Linear Representations
Viaarxiv icon

Reward Informed Dreamer for Task Generalization in Reinforcement Learning

Add code
Mar 09, 2023
Figure 1 for Reward Informed Dreamer for Task Generalization in Reinforcement Learning
Figure 2 for Reward Informed Dreamer for Task Generalization in Reinforcement Learning
Figure 3 for Reward Informed Dreamer for Task Generalization in Reinforcement Learning
Figure 4 for Reward Informed Dreamer for Task Generalization in Reinforcement Learning
Viaarxiv icon

LiDARCap: Long-range Marker-less 3D Human Motion Capture with LiDAR Point Clouds

Add code
Mar 28, 2022
Figure 1 for LiDARCap: Long-range Marker-less 3D Human Motion Capture with LiDAR Point Clouds
Figure 2 for LiDARCap: Long-range Marker-less 3D Human Motion Capture with LiDAR Point Clouds
Figure 3 for LiDARCap: Long-range Marker-less 3D Human Motion Capture with LiDAR Point Clouds
Figure 4 for LiDARCap: Long-range Marker-less 3D Human Motion Capture with LiDAR Point Clouds
Viaarxiv icon

Policy Learning for Robust Markov Decision Process with a Mismatched Generative Model

Add code
Mar 15, 2022
Figure 1 for Policy Learning for Robust Markov Decision Process with a Mismatched Generative Model
Figure 2 for Policy Learning for Robust Markov Decision Process with a Mismatched Generative Model
Figure 3 for Policy Learning for Robust Markov Decision Process with a Mismatched Generative Model
Figure 4 for Policy Learning for Robust Markov Decision Process with a Mismatched Generative Model
Viaarxiv icon

Nearly Horizon-Free Offline Reinforcement Learning

Add code
Mar 25, 2021
Figure 1 for Nearly Horizon-Free Offline Reinforcement Learning
Figure 2 for Nearly Horizon-Free Offline Reinforcement Learning
Viaarxiv icon

Fast Regularity-Constrained Plane Reconstruction

Add code
May 20, 2019
Figure 1 for Fast Regularity-Constrained Plane Reconstruction
Figure 2 for Fast Regularity-Constrained Plane Reconstruction
Figure 3 for Fast Regularity-Constrained Plane Reconstruction
Figure 4 for Fast Regularity-Constrained Plane Reconstruction
Viaarxiv icon

Lazy-CFR: a fast regret minimization algorithm for extensive games with imperfect information

Add code
Oct 10, 2018
Figure 1 for Lazy-CFR: a fast regret minimization algorithm for extensive games with imperfect information
Figure 2 for Lazy-CFR: a fast regret minimization algorithm for extensive games with imperfect information
Viaarxiv icon

The YouTube-8M Kaggle Competition: Challenges and Methods

Add code
Jul 13, 2017
Figure 1 for The YouTube-8M Kaggle Competition: Challenges and Methods
Figure 2 for The YouTube-8M Kaggle Competition: Challenges and Methods
Figure 3 for The YouTube-8M Kaggle Competition: Challenges and Methods
Figure 4 for The YouTube-8M Kaggle Competition: Challenges and Methods
Viaarxiv icon