Picture for Bing Han

Bing Han

Towards Fine-Grained and Multi-Granular Contrastive Language-Speech Pre-training

Add code
Jan 06, 2026
Viaarxiv icon

Multi-Agent Intelligence for Multidisciplinary Decision-Making in Gastrointestinal Oncology

Add code
Dec 23, 2025
Viaarxiv icon

MedBench v4: A Robust and Scalable Benchmark for Evaluating Chinese Medical Language Models, Multimodal Models, and Intelligent Agents

Add code
Nov 19, 2025
Viaarxiv icon

Can Large Language Models Function as Qualified Pediatricians? A Systematic Evaluation in Real-World Clinical Contexts

Add code
Nov 17, 2025
Viaarxiv icon

TCM-5CEval: Extended Deep Evaluation Benchmark for LLM's Comprehensive Clinical Research Competence in Traditional Chinese Medicine

Add code
Nov 17, 2025
Viaarxiv icon

Towards Responsible Evaluation for Text-to-Speech

Add code
Oct 08, 2025
Figure 1 for Towards Responsible Evaluation for Text-to-Speech
Figure 2 for Towards Responsible Evaluation for Text-to-Speech
Figure 3 for Towards Responsible Evaluation for Text-to-Speech
Figure 4 for Towards Responsible Evaluation for Text-to-Speech
Viaarxiv icon

FoMo4Wheat: Toward reliable crop vision foundation models with globally curated data

Add code
Sep 08, 2025
Figure 1 for FoMo4Wheat: Toward reliable crop vision foundation models with globally curated data
Figure 2 for FoMo4Wheat: Toward reliable crop vision foundation models with globally curated data
Figure 3 for FoMo4Wheat: Toward reliable crop vision foundation models with globally curated data
Figure 4 for FoMo4Wheat: Toward reliable crop vision foundation models with globally curated data
Viaarxiv icon

HyperDiff: Hypergraph Guided Diffusion Model for 3D Human Pose Estimation

Add code
Aug 20, 2025
Viaarxiv icon

Exploring Self-Supervised Audio Models for Generalized Anomalous Sound Detection

Add code
Aug 17, 2025
Viaarxiv icon

FISHER: A Foundation Model for Multi-Modal Industrial Signal Comprehensive Representation

Add code
Jul 22, 2025
Viaarxiv icon