Jialian Li

Boosting Deductive Reasoning with Step Signals In RLHF

Oct 12, 2024 (via arXiv)

3D-Properties: Identifying Challenges in DPO and Charting a Path Forward

Jun 11, 2024 (via arXiv)

Exploring the LLM Journey from Cognition to Expression with Linear Representations

May 27, 2024 (via arXiv)

Reward Informed Dreamer for Task Generalization in Reinforcement Learning

Mar 09, 2023 (via arXiv)

LiDARCap: Long-range Marker-less 3D Human Motion Capture with LiDAR Point Clouds

Mar 28, 2022 (via arXiv)

Policy Learning for Robust Markov Decision Process with a Mismatched Generative Model

Mar 15, 2022 (via arXiv)

Nearly Horizon-Free Offline Reinforcement Learning

Mar 25, 2021 (via arXiv)

Fast Regularity-Constrained Plane Reconstruction

May 20, 2019 (via arXiv)

Lazy-CFR: a fast regret minimization algorithm for extensive games with imperfect information

Oct 10, 2018 (via arXiv)

The YouTube-8M Kaggle Competition: Challenges and Methods

Jul 13, 2017 (via arXiv)