Picture for Caifeng Shan

Caifeng Shan

VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction

Add code
Jan 03, 2025
Viaarxiv icon

T2Vid: Translating Long Text into Multi-Image is the Catalyst for Video-LLMs

Add code
Dec 02, 2024
Figure 1 for T2Vid: Translating Long Text into Multi-Image is the Catalyst for Video-LLMs
Figure 2 for T2Vid: Translating Long Text into Multi-Image is the Catalyst for Video-LLMs
Figure 3 for T2Vid: Translating Long Text into Multi-Image is the Catalyst for Video-LLMs
Figure 4 for T2Vid: Translating Long Text into Multi-Image is the Catalyst for Video-LLMs
Viaarxiv icon

MME-Survey: A Comprehensive Survey on Evaluation of Multimodal LLMs

Add code
Nov 22, 2024
Figure 1 for MME-Survey: A Comprehensive Survey on Evaluation of Multimodal LLMs
Figure 2 for MME-Survey: A Comprehensive Survey on Evaluation of Multimodal LLMs
Figure 3 for MME-Survey: A Comprehensive Survey on Evaluation of Multimodal LLMs
Figure 4 for MME-Survey: A Comprehensive Survey on Evaluation of Multimodal LLMs
Viaarxiv icon

VITA: Towards Open-Source Interactive Omni Multimodal LLM

Add code
Aug 09, 2024
Figure 1 for VITA: Towards Open-Source Interactive Omni Multimodal LLM
Figure 2 for VITA: Towards Open-Source Interactive Omni Multimodal LLM
Figure 3 for VITA: Towards Open-Source Interactive Omni Multimodal LLM
Figure 4 for VITA: Towards Open-Source Interactive Omni Multimodal LLM
Viaarxiv icon

GAPNet: Granularity Attention Network with Anatomy-Prior-Constraint for Carotid Artery Segmentation

Add code
Jun 27, 2024
Figure 1 for GAPNet: Granularity Attention Network with Anatomy-Prior-Constraint for Carotid Artery Segmentation
Figure 2 for GAPNet: Granularity Attention Network with Anatomy-Prior-Constraint for Carotid Artery Segmentation
Figure 3 for GAPNet: Granularity Attention Network with Anatomy-Prior-Constraint for Carotid Artery Segmentation
Figure 4 for GAPNet: Granularity Attention Network with Anatomy-Prior-Constraint for Carotid Artery Segmentation
Viaarxiv icon

DSCA: A Digital Subtraction Angiography Sequence Dataset and Spatio-Temporal Model for Cerebral Artery Segmentation

Add code
Jun 01, 2024
Figure 1 for DSCA: A Digital Subtraction Angiography Sequence Dataset and Spatio-Temporal Model for Cerebral Artery Segmentation
Figure 2 for DSCA: A Digital Subtraction Angiography Sequence Dataset and Spatio-Temporal Model for Cerebral Artery Segmentation
Figure 3 for DSCA: A Digital Subtraction Angiography Sequence Dataset and Spatio-Temporal Model for Cerebral Artery Segmentation
Figure 4 for DSCA: A Digital Subtraction Angiography Sequence Dataset and Spatio-Temporal Model for Cerebral Artery Segmentation
Viaarxiv icon

HAGAN: Hybrid Augmented Generative Adversarial Network for Medical Image Synthesis

Add code
May 08, 2024
Figure 1 for HAGAN: Hybrid Augmented Generative Adversarial Network for Medical Image Synthesis
Figure 2 for HAGAN: Hybrid Augmented Generative Adversarial Network for Medical Image Synthesis
Figure 3 for HAGAN: Hybrid Augmented Generative Adversarial Network for Medical Image Synthesis
Figure 4 for HAGAN: Hybrid Augmented Generative Adversarial Network for Medical Image Synthesis
Viaarxiv icon

Biphasic Face Photo-Sketch Synthesis via Semantic-Driven Generative Adversarial Network with Graph Representation Learning

Add code
Jan 05, 2022
Figure 1 for Biphasic Face Photo-Sketch Synthesis via Semantic-Driven Generative Adversarial Network with Graph Representation Learning
Figure 2 for Biphasic Face Photo-Sketch Synthesis via Semantic-Driven Generative Adversarial Network with Graph Representation Learning
Figure 3 for Biphasic Face Photo-Sketch Synthesis via Semantic-Driven Generative Adversarial Network with Graph Representation Learning
Figure 4 for Biphasic Face Photo-Sketch Synthesis via Semantic-Driven Generative Adversarial Network with Graph Representation Learning
Viaarxiv icon

Medical Instrument Segmentation in 3D US by Hybrid Constrained Semi-Supervised Learning

Add code
Jul 30, 2021
Figure 1 for Medical Instrument Segmentation in 3D US by Hybrid Constrained Semi-Supervised Learning
Figure 2 for Medical Instrument Segmentation in 3D US by Hybrid Constrained Semi-Supervised Learning
Figure 3 for Medical Instrument Segmentation in 3D US by Hybrid Constrained Semi-Supervised Learning
Figure 4 for Medical Instrument Segmentation in 3D US by Hybrid Constrained Semi-Supervised Learning
Viaarxiv icon

Constructing Stronger and Faster Baselines for Skeleton-based Action Recognition

Add code
Jun 29, 2021
Figure 1 for Constructing Stronger and Faster Baselines for Skeleton-based Action Recognition
Figure 2 for Constructing Stronger and Faster Baselines for Skeleton-based Action Recognition
Figure 3 for Constructing Stronger and Faster Baselines for Skeleton-based Action Recognition
Figure 4 for Constructing Stronger and Faster Baselines for Skeleton-based Action Recognition
Viaarxiv icon