Haifeng Huang

Towards a Multimodal Large Language Model with Pixel-Level Insight for Biomedicine

Dec 12, 2024
Via arXiv

Improving Retrieval Augmented Language Model with Self-Reasoning

Jul 29, 2024
Via arXiv

A Refer-and-Ground Multimodal Large Language Model for Biomedicine

Jun 26, 2024
Via arXiv

MMScan: A Multi-Modal 3D Scene Dataset with Hierarchical Grounded Language Annotations

Jun 13, 2024
Via arXiv

Grounded 3D-LLM with Referent Tokens

May 16, 2024
Via arXiv

FreeBind: Free Lunch in Unified Multimodal Space via Knowledge Fusion

May 10, 2024
Via arXiv

Molecule-Space: Free Lunch in Unified Multimodal Space via Knowledge Fusion

May 08, 2024
Via arXiv

Multi-Modal Domain Adaptation Across Video Scenes for Temporal Video Grounding

Dec 21, 2023
Via arXiv

Chat-3D v2: Bridging 3D Scene and Large Language Models with Object Identifiers

Dec 15, 2023
Via arXiv

Extending Multi-modal Contrastive Representations

Oct 13, 2023
Via arXiv