Picture for Feilong Tang

Feilong Tang

MMRC: A Large-Scale Benchmark for Understanding Multimodal Large Language Model in Real-World Conversation

Add code
Feb 17, 2025
Viaarxiv icon

Incomplete Modality Disentangled Representation for Ophthalmic Disease Grading and Diagnosis

Add code
Feb 17, 2025
Viaarxiv icon

Beyond Words: AuralLLM and SignMST-C for Precise Sign Language Production and Bidirectional Accessibility

Add code
Jan 01, 2025
Viaarxiv icon

Decoding the Flow: CauseMotion for Emotional Causality Analysis in Long-form Conversations

Add code
Jan 01, 2025
Figure 1 for Decoding the Flow: CauseMotion for Emotional Causality Analysis in Long-form Conversations
Figure 2 for Decoding the Flow: CauseMotion for Emotional Causality Analysis in Long-form Conversations
Figure 3 for Decoding the Flow: CauseMotion for Emotional Causality Analysis in Long-form Conversations
Figure 4 for Decoding the Flow: CauseMotion for Emotional Causality Analysis in Long-form Conversations
Viaarxiv icon

Toward Modality Gap: Vision Prototype Learning for Weakly-supervised Semantic Segmentation with CLIP

Add code
Dec 27, 2024
Figure 1 for Toward Modality Gap: Vision Prototype Learning for Weakly-supervised Semantic Segmentation with CLIP
Figure 2 for Toward Modality Gap: Vision Prototype Learning for Weakly-supervised Semantic Segmentation with CLIP
Figure 3 for Toward Modality Gap: Vision Prototype Learning for Weakly-supervised Semantic Segmentation with CLIP
Figure 4 for Toward Modality Gap: Vision Prototype Learning for Weakly-supervised Semantic Segmentation with CLIP
Viaarxiv icon

Neighbor Does Matter: Density-Aware Contrastive Learning for Medical Semi-supervised Segmentation

Add code
Dec 27, 2024
Viaarxiv icon

Meta Curvature-Aware Minimization for Domain Generalization

Add code
Dec 16, 2024
Viaarxiv icon

VideoGen-of-Thought: A Collaborative Framework for Multi-Shot Video Generation

Add code
Dec 03, 2024
Viaarxiv icon

OphCLIP: Hierarchical Retrieval-Augmented Learning for Ophthalmic Surgical Video-Language Pretraining

Add code
Nov 23, 2024
Figure 1 for OphCLIP: Hierarchical Retrieval-Augmented Learning for Ophthalmic Surgical Video-Language Pretraining
Figure 2 for OphCLIP: Hierarchical Retrieval-Augmented Learning for Ophthalmic Surgical Video-Language Pretraining
Figure 3 for OphCLIP: Hierarchical Retrieval-Augmented Learning for Ophthalmic Surgical Video-Language Pretraining
Figure 4 for OphCLIP: Hierarchical Retrieval-Augmented Learning for Ophthalmic Surgical Video-Language Pretraining
Viaarxiv icon

SAM2-UNet: Segment Anything 2 Makes Strong Encoder for Natural and Medical Image Segmentation

Add code
Aug 16, 2024
Viaarxiv icon