Picture for Huchuan Lu

Huchuan Lu

Towards Real-Time Open-Vocabulary Video Instance Segmentation

Add code
Dec 05, 2024
Viaarxiv icon

MoTrans: Customized Motion Transfer with Text-driven Video Diffusion Models

Add code
Dec 02, 2024
Viaarxiv icon

Inspiring the Next Generation of Segment Anything Models: Comprehensively Evaluate SAM and SAM 2 with Diverse Prompts Towards Context-Dependent Concepts under Different Scenes

Add code
Dec 02, 2024
Viaarxiv icon

Bootstraping Clustering of Gaussians for View-consistent 3D Scene Understanding

Add code
Nov 29, 2024
Viaarxiv icon

DreamMix: Decoupling Object Attributes for Enhanced Editability in Customized Image Inpainting

Add code
Nov 26, 2024
Viaarxiv icon

OASIS: Open Agent Social Interaction Simulations with One Million Agents

Add code
Nov 26, 2024
Figure 1 for OASIS: Open Agent Social Interaction Simulations with One Million Agents
Figure 2 for OASIS: Open Agent Social Interaction Simulations with One Million Agents
Figure 3 for OASIS: Open Agent Social Interaction Simulations with One Million Agents
Figure 4 for OASIS: Open Agent Social Interaction Simulations with One Million Agents
Viaarxiv icon

OASIS: Open Agents Social Interaction Simulations on One Million Agents

Add code
Nov 21, 2024
Figure 1 for OASIS: Open Agents Social Interaction Simulations on One Million Agents
Figure 2 for OASIS: Open Agents Social Interaction Simulations on One Million Agents
Figure 3 for OASIS: Open Agents Social Interaction Simulations on One Million Agents
Figure 4 for OASIS: Open Agents Social Interaction Simulations on One Million Agents
Viaarxiv icon

GLDesigner: Leveraging Multi-Modal LLMs as Designer for Enhanced Aesthetic Text Glyph Layouts

Add code
Nov 18, 2024
Viaarxiv icon

LLMs Can Evolve Continually on Modality for X-Modal Reasoning

Add code
Oct 26, 2024
Viaarxiv icon

GSSF: Generalized Structural Sparse Function for Deep Cross-modal Metric Learning

Add code
Oct 20, 2024
Viaarxiv icon