Picture for Yichen Zhu

Yichen Zhu

RAGraph: A General Retrieval-Augmented Graph Learning Framework

Add code
Oct 31, 2024
Viaarxiv icon

EDT: An Efficient Diffusion Transformer Framework Inspired by Human-like Sketching

Add code
Oct 31, 2024
Viaarxiv icon

Discrete Policy: Learning Disentangled Action Space for Multi-Task Robotic Manipulation

Add code
Sep 27, 2024
Viaarxiv icon

Scaling Diffusion Policy in Transformer to 1 Billion Parameters for Robotic Manipulation

Add code
Sep 22, 2024
Viaarxiv icon

An Adaptive Second-order Method for a Class of Nonconvex Nonsmooth Composite Optimization

Add code
Jul 24, 2024
Viaarxiv icon

MMRo: Are Multimodal LLMs Eligible as the Brain for In-Home Robotics?

Add code
Jun 28, 2024
Viaarxiv icon

Non-confusing Generation of Customized Concepts in Diffusion Models

Add code
May 11, 2024
Viaarxiv icon

Retrieval-Augmented Embodied Agents

Add code
Apr 17, 2024
Viaarxiv icon

Mipha: A Comprehensive Overhaul of Multimodal Assistant with Small Language Models

Add code
Mar 15, 2024
Viaarxiv icon

Safety of Multimodal Large Language Models on Images and Text

Add code
Feb 01, 2024
Viaarxiv icon