Picture for Youshan Zhang

Youshan Zhang

UHD Image Dehazing via anDehazeFormer with Atmospheric-aware KV Cache

Add code
May 20, 2025
Viaarxiv icon

AgentPolyp: Accurate Polyp Segmentation via Image Enhancement Agent

Add code
Apr 15, 2025
Figure 1 for AgentPolyp: Accurate Polyp Segmentation via Image Enhancement Agent
Figure 2 for AgentPolyp: Accurate Polyp Segmentation via Image Enhancement Agent
Figure 3 for AgentPolyp: Accurate Polyp Segmentation via Image Enhancement Agent
Figure 4 for AgentPolyp: Accurate Polyp Segmentation via Image Enhancement Agent
Viaarxiv icon

Automatic Teaching Platform on Vision Language Retrieval Augmented Generation

Add code
Mar 07, 2025
Viaarxiv icon

SST-EM: Advanced Metrics for Evaluating Semantic, Spatial and Temporal Aspects in Video Editing

Add code
Jan 13, 2025
Viaarxiv icon

Confident Pseudo-labeled Diffusion Augmentation for Canine Cardiomegaly Detection

Add code
Jan 13, 2025
Figure 1 for Confident Pseudo-labeled Diffusion Augmentation for Canine Cardiomegaly Detection
Figure 2 for Confident Pseudo-labeled Diffusion Augmentation for Canine Cardiomegaly Detection
Figure 3 for Confident Pseudo-labeled Diffusion Augmentation for Canine Cardiomegaly Detection
Figure 4 for Confident Pseudo-labeled Diffusion Augmentation for Canine Cardiomegaly Detection
Viaarxiv icon

Leapfrog Latent Consistency Model (LLCM) for Medical Images Generation

Add code
Nov 22, 2024
Figure 1 for Leapfrog Latent Consistency Model (LLCM) for Medical Images Generation
Figure 2 for Leapfrog Latent Consistency Model (LLCM) for Medical Images Generation
Figure 3 for Leapfrog Latent Consistency Model (LLCM) for Medical Images Generation
Figure 4 for Leapfrog Latent Consistency Model (LLCM) for Medical Images Generation
Viaarxiv icon

SparrowVQE: Visual Question Explanation for Course Content Understanding

Add code
Nov 12, 2024
Figure 1 for SparrowVQE: Visual Question Explanation for Course Content Understanding
Figure 2 for SparrowVQE: Visual Question Explanation for Course Content Understanding
Figure 3 for SparrowVQE: Visual Question Explanation for Course Content Understanding
Figure 4 for SparrowVQE: Visual Question Explanation for Course Content Understanding
Viaarxiv icon

KatzBot: Revolutionizing Academic Chatbot for Enhanced Communication

Add code
Oct 21, 2024
Figure 1 for KatzBot: Revolutionizing Academic Chatbot for Enhanced Communication
Figure 2 for KatzBot: Revolutionizing Academic Chatbot for Enhanced Communication
Figure 3 for KatzBot: Revolutionizing Academic Chatbot for Enhanced Communication
Figure 4 for KatzBot: Revolutionizing Academic Chatbot for Enhanced Communication
Viaarxiv icon

Vision Transformer Segmentation for Visual Bird Sound Denoising

Add code
Jun 13, 2024
Figure 1 for Vision Transformer Segmentation for Visual Bird Sound Denoising
Figure 2 for Vision Transformer Segmentation for Visual Bird Sound Denoising
Figure 3 for Vision Transformer Segmentation for Visual Bird Sound Denoising
Viaarxiv icon

Complex Image-Generative Diffusion Transformer for Audio Denoising

Add code
Jun 13, 2024
Figure 1 for Complex Image-Generative Diffusion Transformer for Audio Denoising
Figure 2 for Complex Image-Generative Diffusion Transformer for Audio Denoising
Figure 3 for Complex Image-Generative Diffusion Transformer for Audio Denoising
Figure 4 for Complex Image-Generative Diffusion Transformer for Audio Denoising
Viaarxiv icon