Hao Fei

Derm1M: A Million-scale Vision-Language Dataset Aligned with Clinical Ontology Knowledge for Dermatology

Mar 19, 2025
Via arXiv

Learning 4D Panoptic Scene Graph Generation from Rich 2D Visual Scene

Mar 19, 2025
Via arXiv

Universal Scene Graph Generation

Mar 19, 2025
Via arXiv

Multimodal Chain-of-Thought Reasoning: A Comprehensive Survey

Mar 16, 2025
Via arXiv

Multi-Granular Multimodal Clue Fusion for Meme Understanding

Mar 16, 2025
Via arXiv

TAIL: Text-Audio Incremental Learning

Mar 06, 2025
Via arXiv

Watch Out Your Album! On the Inadvertent Privacy Memorization in Multi-Modal Large Language Models

Mar 03, 2025
Via arXiv

Investigating the Adaptive Robustness with Knowledge Conflicts in LLM-based Multi-Agent Systems

Feb 21, 2025
Via arXiv

Semantic Role Labeling: A Systematical Survey

Feb 09, 2025
Via arXiv

CHiP: Cross-modal Hierarchical Direct Preference Optimization for Multimodal LLMs

Jan 28, 2025
Via arXiv