Picture for Hao Li

Hao Li

Jack

From Monocular Vision to Autonomous Action: Guiding Tumor Resection via 3D Reconstruction

Add code
Mar 20, 2025
Viaarxiv icon

DiffPortrait360: Consistent Portrait Diffusion for 360 View Synthesis

Add code
Mar 19, 2025
Viaarxiv icon

Let Synthetic Data Shine: Domain Reassembly and Soft-Fusion for Single Domain Generalization

Add code
Mar 17, 2025
Viaarxiv icon

Spatial-Temporal Graph Diffusion Policy with Kinematic Modeling for Bimanual Robotic Manipulation

Add code
Mar 13, 2025
Viaarxiv icon

GoT: Unleashing Reasoning Capability of Multimodal Large Language Model for Visual Generation and Editing

Add code
Mar 13, 2025
Viaarxiv icon

Cockatiel: Ensembling Synthetic and Human Preferenced Training for Detailed Video Caption

Add code
Mar 12, 2025
Viaarxiv icon

Astrea: A MOE-based Visual Understanding Model with Progressive Alignment

Add code
Mar 12, 2025
Viaarxiv icon

Silent Hazards of Token Reduction in Vision-Language Models: The Hidden Impact on Consistency

Add code
Mar 11, 2025
Viaarxiv icon

Materials Map Integrating Experimental and Computational Data through Graph-Based Machine Learning for Enhanced Materials Discovery

Add code
Mar 11, 2025
Viaarxiv icon

SARA: Structural and Adversarial Representation Alignment for Training-efficient Diffusion Models

Add code
Mar 11, 2025
Viaarxiv icon