Picture for Hanhui Li

Hanhui Li

FireEdit: Fine-grained Instruction-based Image Editing via Region-aware Vision Language Model

Add code
Mar 25, 2025
Viaarxiv icon

Can Atomic Step Decomposition Enhance the Self-structured Reasoning of Multimodal Large Models?

Add code
Mar 08, 2025
Viaarxiv icon

AtomThink: A Slow Thinking Framework for Multimodal Mathematical Reasoning

Add code
Nov 18, 2024
Viaarxiv icon

Learning Interaction-aware 3D Gaussian Splatting for One-shot Hand Avatars

Add code
Oct 11, 2024
Figure 1 for Learning Interaction-aware 3D Gaussian Splatting for One-shot Hand Avatars
Figure 2 for Learning Interaction-aware 3D Gaussian Splatting for One-shot Hand Avatars
Figure 3 for Learning Interaction-aware 3D Gaussian Splatting for One-shot Hand Avatars
Figure 4 for Learning Interaction-aware 3D Gaussian Splatting for One-shot Hand Avatars
Viaarxiv icon

GarmentAligner: Text-to-Garment Generation via Retrieval-augmented Multi-level Corrections

Add code
Aug 23, 2024
Figure 1 for GarmentAligner: Text-to-Garment Generation via Retrieval-augmented Multi-level Corrections
Figure 2 for GarmentAligner: Text-to-Garment Generation via Retrieval-augmented Multi-level Corrections
Figure 3 for GarmentAligner: Text-to-Garment Generation via Retrieval-augmented Multi-level Corrections
Figure 4 for GarmentAligner: Text-to-Garment Generation via Retrieval-augmented Multi-level Corrections
Viaarxiv icon

AutoStudio: Crafting Consistent Subjects in Multi-turn Interactive Image Generation

Add code
Jun 03, 2024
Viaarxiv icon

TheaterGen: Character Management with LLM for Consistent Multi-turn Image Generation

Add code
Apr 29, 2024
Figure 1 for TheaterGen: Character Management with LLM for Consistent Multi-turn Image Generation
Figure 2 for TheaterGen: Character Management with LLM for Consistent Multi-turn Image Generation
Figure 3 for TheaterGen: Character Management with LLM for Consistent Multi-turn Image Generation
Figure 4 for TheaterGen: Character Management with LLM for Consistent Multi-turn Image Generation
Viaarxiv icon

ConsistentID: Portrait Generation with Multimodal Fine-Grained Identity Preserving

Add code
Apr 25, 2024
Figure 1 for ConsistentID: Portrait Generation with Multimodal Fine-Grained Identity Preserving
Figure 2 for ConsistentID: Portrait Generation with Multimodal Fine-Grained Identity Preserving
Figure 3 for ConsistentID: Portrait Generation with Multimodal Fine-Grained Identity Preserving
Figure 4 for ConsistentID: Portrait Generation with Multimodal Fine-Grained Identity Preserving
Viaarxiv icon

3D Visibility-aware Generalizable Neural Radiance Fields for Interacting Hands

Add code
Jan 02, 2024
Figure 1 for 3D Visibility-aware Generalizable Neural Radiance Fields for Interacting Hands
Figure 2 for 3D Visibility-aware Generalizable Neural Radiance Fields for Interacting Hands
Figure 3 for 3D Visibility-aware Generalizable Neural Radiance Fields for Interacting Hands
Figure 4 for 3D Visibility-aware Generalizable Neural Radiance Fields for Interacting Hands
Viaarxiv icon

Monocular 3D Hand Mesh Recovery via Dual Noise Estimation

Add code
Dec 26, 2023
Figure 1 for Monocular 3D Hand Mesh Recovery via Dual Noise Estimation
Figure 2 for Monocular 3D Hand Mesh Recovery via Dual Noise Estimation
Figure 3 for Monocular 3D Hand Mesh Recovery via Dual Noise Estimation
Figure 4 for Monocular 3D Hand Mesh Recovery via Dual Noise Estimation
Viaarxiv icon