Ying Hu

Med-2E3: A 2D-Enhanced 3D Medical Multimodal Large Language Model

Nov 19, 2024
Via arXiv

DiffuseST: Unleashing the Capability of the Diffusion Model for Style Transfer

Oct 19, 2024
Via arXiv

Magnet: We Never Know How Text-to-Image Diffusion Models Work, Until We Learn How Vision-Language Models Function

Sep 30, 2024
Via arXiv

Uni-Med: A Unified Medical Generalist Foundation Model For Multi-Task Learning Via Connector-MoE

Sep 26, 2024
Via arXiv

SparseSSP: 3D Subcellular Structure Prediction from Sparse-View Transmitted Light Images

Jul 03, 2024
Via arXiv

Ultrasound Report Generation with Cross-Modality Feature Alignment via Unsupervised Guidance

Jun 02, 2024
Via arXiv

TinyLLaVA Factory: A Modularized Codebase for Small-scale Large Multimodal Models

May 20, 2024
Via arXiv

Personalized Forgetting Mechanism with Concept-Driven Knowledge Tracing

Apr 18, 2024
Via arXiv

TinyLLaVA: A Framework of Small-scale Large Multimodal Models

Feb 22, 2024
Via arXiv

Intelligent Robotic Sonographer: Mutual Information-based Disentangled Reward Learning from Few Demonstrations

Jul 07, 2023
Via arXiv