Picture for Yanwei Fu

Yanwei Fu

DecoFuse: Decomposing and Fusing the "What", "Where", and "How" for Brain-Inspired fMRI-to-Video Decoding

Add code
Apr 01, 2025
Viaarxiv icon

ReasonGrounder: LVLM-Guided Hierarchical Feature Splatting for Open-Vocabulary 3D Visual Grounding and Reasoning

Add code
Mar 30, 2025
Viaarxiv icon

EDEN: Enhanced Diffusion for High-quality Large-motion Video Frame Interpolation

Add code
Mar 20, 2025
Viaarxiv icon

Sequential Multi-Object Grasping with One Dexterous Hand

Add code
Mar 12, 2025
Viaarxiv icon

CineBrain: A Large-Scale Multi-Modal Brain Dataset During Naturalistic Audiovisual Narrative Processing

Add code
Mar 10, 2025
Viaarxiv icon

Online Dense Point Tracking with Streaming Memory

Add code
Mar 09, 2025
Viaarxiv icon

HOP: Heterogeneous Topology-based Multimodal Entanglement for Co-Speech Gesture Generation

Add code
Mar 03, 2025
Viaarxiv icon

Revisiting Large Language Model Pruning using Neuron Semantic Attribution

Add code
Mar 03, 2025
Viaarxiv icon

Human2Robot: Learning Robot Actions from Paired Human-Robot Videos

Add code
Feb 23, 2025
Viaarxiv icon

CrossVTON: Mimicking the Logic Reasoning on Cross-category Virtual Try-on guided by Tri-zone Priors

Add code
Feb 20, 2025
Figure 1 for CrossVTON: Mimicking the Logic Reasoning on Cross-category Virtual Try-on guided by Tri-zone Priors
Figure 2 for CrossVTON: Mimicking the Logic Reasoning on Cross-category Virtual Try-on guided by Tri-zone Priors
Figure 3 for CrossVTON: Mimicking the Logic Reasoning on Cross-category Virtual Try-on guided by Tri-zone Priors
Figure 4 for CrossVTON: Mimicking the Logic Reasoning on Cross-category Virtual Try-on guided by Tri-zone Priors
Viaarxiv icon