Yaodong Yang

ThinkPatterns-21k: A Systematic Study on the Impact of Thinking Patterns in LLMs

Mar 17, 2025
Via arXiv

SafeVLA: Towards Safety Alignment of Vision-Language-Action Model via Safe Reinforcement Learning

Mar 05, 2025
Via arXiv

Differentiable Information Enhanced Model-Based Reinforcement Learning

Mar 03, 2025
Via arXiv

DexGraspVLA: A Vision-Language-Action Framework Towards General Dexterous Grasping

Feb 28, 2025
Via arXiv

Amulet: ReAlignment During Test Time for Personalized Preference Adaptation of LLMs

Feb 26, 2025
Via arXiv

Retrieval Dexterity: Efficient Object Retrieval in Clutters with Dexterous Hand

Feb 26, 2025
Via arXiv

SAE-V: Interpreting Multimodal Models for Enhanced Alignment

Feb 22, 2025
Via arXiv

Model Evolution Framework with Genetic Algorithm for Multi-Task Reinforcement Learning

Feb 19, 2025
Via arXiv

Dual Ensembled Multiagent Q-Learning with Hypernet Regularizer

Feb 04, 2025
Via arXiv

RedStar: Does Scaling Long-CoT Data Unlock Better Slow-Reasoning Systems?

Jan 20, 2025
Via arXiv