Picture for Xiaomao Fan

Xiaomao Fan

SAM-Swin: SAM-Driven Dual-Swin Transformers with Adaptive Lesion Enhancement for Laryngo-Pharyngeal Tumor Detection

Add code
Oct 29, 2024
Viaarxiv icon

Mamba-Enhanced Text-Audio-Video Alignment Network for Emotion Recognition in Conversations

Add code
Sep 08, 2024
Figure 1 for Mamba-Enhanced Text-Audio-Video Alignment Network for Emotion Recognition in Conversations
Figure 2 for Mamba-Enhanced Text-Audio-Video Alignment Network for Emotion Recognition in Conversations
Figure 3 for Mamba-Enhanced Text-Audio-Video Alignment Network for Emotion Recognition in Conversations
Figure 4 for Mamba-Enhanced Text-Audio-Video Alignment Network for Emotion Recognition in Conversations
Viaarxiv icon

3D-LSPTM: An Automatic Framework with 3D-Large-Scale Pretrained Model for Laryngeal Cancer Detection Using Laryngoscopic Videos

Add code
Sep 02, 2024
Viaarxiv icon

SAM-FNet: SAM-Guided Fusion Network for Laryngo-Pharyngeal Tumor Detection

Add code
Aug 15, 2024
Viaarxiv icon

Domain-invariant Representation Learning via Segment Anything Model for Blood Cell Classification

Add code
Aug 14, 2024
Viaarxiv icon

Towards Cross-Domain Single Blood Cell Image Classification via Large-Scale LoRA-based Segment Anything Model

Add code
Aug 13, 2024
Viaarxiv icon

Core Knowledge Learning Framework for Graph Adaptation and Scalability Learning

Add code
Jul 02, 2024
Viaarxiv icon

Video-based Smoky Vehicle Detection with A Coarse-to-Fine Framework

Add code
Jul 08, 2022
Figure 1 for Video-based Smoky Vehicle Detection with A Coarse-to-Fine Framework
Figure 2 for Video-based Smoky Vehicle Detection with A Coarse-to-Fine Framework
Figure 3 for Video-based Smoky Vehicle Detection with A Coarse-to-Fine Framework
Figure 4 for Video-based Smoky Vehicle Detection with A Coarse-to-Fine Framework
Viaarxiv icon