Xiaobo Zhang

CT2C-QA: Multimodal Question Answering over Chinese Text, Table and Chart

Oct 28, 2024

A Medical Multimodal Large Language Model for Pediatric Pneumonia

Sep 04, 2024

Anatomical Structure-Guided Medical Vision-Language Pre-training

Mar 14, 2024

Let All be Whitened: Multi-teacher Distillation for Efficient Visual Retrieval

Dec 15, 2023

Large Language Models are Complex Table Parsers

Dec 13, 2023

Enhanced Knowledge Injection for Radiology Report Generation

Nov 01, 2023

Learning Segment Similarity and Alignment in Large-Scale Content Based Video Retrieval

Sep 20, 2023

Video Infringement Detection via Feature Disentanglement and Mutual Information Maximization

Sep 13, 2023

Web Photo Source Identification based on Neural Enhanced Camera Fingerprint

Feb 18, 2023

Boosting COVID-19 Severity Detection with Infection-aware Contrastive Mixup Classification

Dec 01, 2022