Xiaodong He

Department of R and D, UnionString Technology Co. Ltd

PAC: Pronunciation-Aware Contextualized Large Language Model-based Automatic Speech Recognition

Sep 16, 2025

ChartMaster: Advancing Chart-to-Code Generation with Real-World Charts and Chart Similarity Reinforcement Learning

Aug 25, 2025

Learning Temporal Abstractions via Variational Homomorphisms in Option-Induced Abstract MDPs

Jul 24, 2025

Object-Focus Actor for Data-efficient Robot Generalization Dexterous Manipulation

May 21, 2025

HOIGen-1M: A Large-scale Dataset for Human-Object Interaction Video Generation

Mar 31, 2025

Scaling Down Text Encoders of Text-to-Image Diffusion Models

Mar 25, 2025

Dexterous Hand Manipulation via Efficient Imitation-Bootstrapped Online Reinforcement Learning

Mar 06, 2025

An Atomic Skill Library Construction Method for Data-Efficient Embodied Manipulation

Jan 25, 2025

UME: Upcycling Mixture-of-Experts for Scalable and Efficient Automatic Speech Recognition

Dec 23, 2024

LMAgent: A Large-scale Multimodal Agents Society for Multi-user Simulation

Dec 13, 2024