Zongyuan Ge

Toward Modality Gap: Vision Prototype Learning for Weakly-supervised Semantic Segmentation with CLIP

Dec 27, 2024
Via arXiv

Neighbor Does Matter: Density-Aware Contrastive Learning for Medical Semi-supervised Segmentation

Dec 27, 2024
Via arXiv

A General-Purpose Multimodal Foundation Model for Dermatology

Oct 19, 2024
Via arXiv

SynFER: Towards Boosting Facial Expression Recognition with Synthetic Data

Oct 13, 2024
Via arXiv

MONICA: Benchmarking on Long-tailed Medical Image Classification

Oct 02, 2024
Via arXiv

Adaptive Transformer Modelling of Density Function for Nonparametric Survival Analysis

Sep 10, 2024
Via arXiv

Fundus2Video: Cross-Modal Angiography Video Generation from Static Fundus Photography with Clinical Knowledge Guidance

Aug 27, 2024
Via arXiv

TP-DRSeg: Improving Diabetic Retinopathy Lesion Segmentation with Explicit Text-Prompts Assisted SAM

Jun 22, 2024
Via arXiv

OphNet: A Large-Scale Video Benchmark for Ophthalmic Surgical Workflow Understanding

Jun 12, 2024
Via arXiv

CARES: A Comprehensive Benchmark of Trustworthiness in Medical Vision Language Models

Jun 10, 2024
Via arXiv