Picture for Youshan Zhang

Youshan Zhang

Automatic Teaching Platform on Vision Language Retrieval Augmented Generation

Add code
Mar 07, 2025
Viaarxiv icon

SST-EM: Advanced Metrics for Evaluating Semantic, Spatial and Temporal Aspects in Video Editing

Add code
Jan 13, 2025
Viaarxiv icon

Confident Pseudo-labeled Diffusion Augmentation for Canine Cardiomegaly Detection

Add code
Jan 13, 2025
Figure 1 for Confident Pseudo-labeled Diffusion Augmentation for Canine Cardiomegaly Detection
Figure 2 for Confident Pseudo-labeled Diffusion Augmentation for Canine Cardiomegaly Detection
Figure 3 for Confident Pseudo-labeled Diffusion Augmentation for Canine Cardiomegaly Detection
Figure 4 for Confident Pseudo-labeled Diffusion Augmentation for Canine Cardiomegaly Detection
Viaarxiv icon

Leapfrog Latent Consistency Model (LLCM) for Medical Images Generation

Add code
Nov 22, 2024
Figure 1 for Leapfrog Latent Consistency Model (LLCM) for Medical Images Generation
Figure 2 for Leapfrog Latent Consistency Model (LLCM) for Medical Images Generation
Figure 3 for Leapfrog Latent Consistency Model (LLCM) for Medical Images Generation
Figure 4 for Leapfrog Latent Consistency Model (LLCM) for Medical Images Generation
Viaarxiv icon

SparrowVQE: Visual Question Explanation for Course Content Understanding

Add code
Nov 12, 2024
Figure 1 for SparrowVQE: Visual Question Explanation for Course Content Understanding
Figure 2 for SparrowVQE: Visual Question Explanation for Course Content Understanding
Figure 3 for SparrowVQE: Visual Question Explanation for Course Content Understanding
Figure 4 for SparrowVQE: Visual Question Explanation for Course Content Understanding
Viaarxiv icon

KatzBot: Revolutionizing Academic Chatbot for Enhanced Communication

Add code
Oct 21, 2024
Figure 1 for KatzBot: Revolutionizing Academic Chatbot for Enhanced Communication
Figure 2 for KatzBot: Revolutionizing Academic Chatbot for Enhanced Communication
Figure 3 for KatzBot: Revolutionizing Academic Chatbot for Enhanced Communication
Figure 4 for KatzBot: Revolutionizing Academic Chatbot for Enhanced Communication
Viaarxiv icon

Vision Transformer Segmentation for Visual Bird Sound Denoising

Add code
Jun 13, 2024
Viaarxiv icon

Complex Image-Generative Diffusion Transformer for Audio Denoising

Add code
Jun 13, 2024
Figure 1 for Complex Image-Generative Diffusion Transformer for Audio Denoising
Figure 2 for Complex Image-Generative Diffusion Transformer for Audio Denoising
Figure 3 for Complex Image-Generative Diffusion Transformer for Audio Denoising
Figure 4 for Complex Image-Generative Diffusion Transformer for Audio Denoising
Viaarxiv icon

Diffusion Gaussian Mixture Audio Denoise

Add code
Jun 13, 2024
Viaarxiv icon

DCHT: Deep Complex Hybrid Transformer for Speech Enhancement

Add code
Oct 30, 2023
Viaarxiv icon