Picture for Tao Zhou

Tao Zhou

MM-Retinal V2: Transfer an Elite Knowledge Spark into Fundus Vision-Language Pretraining

Add code
Jan 27, 2025
Figure 1 for MM-Retinal V2: Transfer an Elite Knowledge Spark into Fundus Vision-Language Pretraining
Figure 2 for MM-Retinal V2: Transfer an Elite Knowledge Spark into Fundus Vision-Language Pretraining
Figure 3 for MM-Retinal V2: Transfer an Elite Knowledge Spark into Fundus Vision-Language Pretraining
Figure 4 for MM-Retinal V2: Transfer an Elite Knowledge Spark into Fundus Vision-Language Pretraining
Viaarxiv icon

Self-adaptive vision-language model for 3D segmentation of pulmonary artery and vein

Add code
Jan 07, 2025
Viaarxiv icon

Self-Calibrated Dual Contrasting for Annotation-Efficient Bacteria Raman Spectroscopy Clustering and Classification

Add code
Dec 28, 2024
Viaarxiv icon

Enhancing Large-scale UAV Route Planing with Global and Local Features via Reinforcement Graph Fusion

Add code
Dec 20, 2024
Viaarxiv icon

Learnable Prompting SAM-induced Knowledge Distillation for Semi-supervised Medical Image Segmentation

Add code
Dec 18, 2024
Figure 1 for Learnable Prompting SAM-induced Knowledge Distillation for Semi-supervised Medical Image Segmentation
Figure 2 for Learnable Prompting SAM-induced Knowledge Distillation for Semi-supervised Medical Image Segmentation
Figure 3 for Learnable Prompting SAM-induced Knowledge Distillation for Semi-supervised Medical Image Segmentation
Figure 4 for Learnable Prompting SAM-induced Knowledge Distillation for Semi-supervised Medical Image Segmentation
Viaarxiv icon

Bridging the User-side Knowledge Gap in Knowledge-aware Recommendations with Large Language Models

Add code
Dec 18, 2024
Figure 1 for Bridging the User-side Knowledge Gap in Knowledge-aware Recommendations with Large Language Models
Figure 2 for Bridging the User-side Knowledge Gap in Knowledge-aware Recommendations with Large Language Models
Figure 3 for Bridging the User-side Knowledge Gap in Knowledge-aware Recommendations with Large Language Models
Figure 4 for Bridging the User-side Knowledge Gap in Knowledge-aware Recommendations with Large Language Models
Viaarxiv icon

Bringing Multimodality to Amazon Visual Search System

Add code
Dec 17, 2024
Figure 1 for Bringing Multimodality to Amazon Visual Search System
Figure 2 for Bringing Multimodality to Amazon Visual Search System
Figure 3 for Bringing Multimodality to Amazon Visual Search System
Figure 4 for Bringing Multimodality to Amazon Visual Search System
Viaarxiv icon

CALA: A Class-Aware Logit Adapter for Few-Shot Class-Incremental Learning

Add code
Dec 17, 2024
Viaarxiv icon

DiffRaman: A Conditional Latent Denoising Diffusion Probabilistic Model for Bacterial Raman Spectroscopy Identification Under Limited Data Conditions

Add code
Dec 11, 2024
Viaarxiv icon

Exploring the Interplay Between Video Generation and World Models in Autonomous Driving: A Survey

Add code
Nov 05, 2024
Figure 1 for Exploring the Interplay Between Video Generation and World Models in Autonomous Driving: A Survey
Figure 2 for Exploring the Interplay Between Video Generation and World Models in Autonomous Driving: A Survey
Figure 3 for Exploring the Interplay Between Video Generation and World Models in Autonomous Driving: A Survey
Figure 4 for Exploring the Interplay Between Video Generation and World Models in Autonomous Driving: A Survey
Viaarxiv icon