Picture for Zhengyuan Li

Zhengyuan Li

Purdue University, West Lafayette, IN, USA

Toward Cognitive Supersensing in Multimodal Large Language Model

Add code
Feb 02, 2026
Viaarxiv icon

PALM: Progress-Aware Policy Learning via Affordance Reasoning for Long-Horizon Robotic Manipulation

Add code
Jan 11, 2026
Viaarxiv icon

MDD: A Dataset for Text-and-Music Conditioned Duet Dance Generation

Add code
Aug 23, 2025
Viaarxiv icon

EfficientEQA: An Efficient Approach for Open Vocabulary Embodied Question Answering

Add code
Oct 26, 2024
Figure 1 for EfficientEQA: An Efficient Approach for Open Vocabulary Embodied Question Answering
Figure 2 for EfficientEQA: An Efficient Approach for Open Vocabulary Embodied Question Answering
Figure 3 for EfficientEQA: An Efficient Approach for Open Vocabulary Embodied Question Answering
Figure 4 for EfficientEQA: An Efficient Approach for Open Vocabulary Embodied Question Answering
Viaarxiv icon

Practical Region-level Attack against Segment Anything Models

Add code
Apr 12, 2024
Figure 1 for Practical Region-level Attack against Segment Anything Models
Figure 2 for Practical Region-level Attack against Segment Anything Models
Figure 3 for Practical Region-level Attack against Segment Anything Models
Figure 4 for Practical Region-level Attack against Segment Anything Models
Viaarxiv icon

InterDiff: Generating 3D Human-Object Interactions with Physics-Informed Diffusion

Add code
Aug 31, 2023
Viaarxiv icon