Picture for Yaodong Yang

Yaodong Yang

Stream Aligner: Efficient Sentence-Level Alignment via Distribution Induction

Add code
Jan 09, 2025
Viaarxiv icon

Libra-Leaderboard: Towards Responsible AI through a Balanced Leaderboard of Safety and Capability

Add code
Dec 24, 2024
Viaarxiv icon

Align Anything: Training All-Modality Models to Follow Instructions with Language Feedback

Add code
Dec 20, 2024
Viaarxiv icon

Safe Reinforcement Learning using Finite-Horizon Gradient-based Estimation

Add code
Dec 15, 2024
Viaarxiv icon

RAT: Adversarial Attacks on Deep Reinforcement Agents for Targeted Behaviors

Add code
Dec 14, 2024
Viaarxiv icon

Random Feature Models with Learnable Activation Functions

Add code
Nov 29, 2024
Viaarxiv icon

Object-Centric Dexterous Manipulation from Human Motion Data

Add code
Nov 06, 2024
Figure 1 for Object-Centric Dexterous Manipulation from Human Motion Data
Figure 2 for Object-Centric Dexterous Manipulation from Human Motion Data
Figure 3 for Object-Centric Dexterous Manipulation from Human Motion Data
Figure 4 for Object-Centric Dexterous Manipulation from Human Motion Data
Viaarxiv icon

Neural Attention Field: Emerging Point Relevance in 3D Scenes for One-Shot Dexterous Grasping

Add code
Oct 30, 2024
Figure 1 for Neural Attention Field: Emerging Point Relevance in 3D Scenes for One-Shot Dexterous Grasping
Figure 2 for Neural Attention Field: Emerging Point Relevance in 3D Scenes for One-Shot Dexterous Grasping
Figure 3 for Neural Attention Field: Emerging Point Relevance in 3D Scenes for One-Shot Dexterous Grasping
Figure 4 for Neural Attention Field: Emerging Point Relevance in 3D Scenes for One-Shot Dexterous Grasping
Viaarxiv icon

Magnetic Preference Optimization: Achieving Last-iterate Convergence for Language Models Alignment

Add code
Oct 22, 2024
Figure 1 for Magnetic Preference Optimization: Achieving Last-iterate Convergence for Language Models Alignment
Figure 2 for Magnetic Preference Optimization: Achieving Last-iterate Convergence for Language Models Alignment
Figure 3 for Magnetic Preference Optimization: Achieving Last-iterate Convergence for Language Models Alignment
Figure 4 for Magnetic Preference Optimization: Achieving Last-iterate Convergence for Language Models Alignment
Viaarxiv icon

Computing Ex Ante Equilibrium in Heterogeneous Zero-Sum Team Games

Add code
Oct 02, 2024
Viaarxiv icon