Picture for Bin Fu

Bin Fu

GMAI-VL & GMAI-VL-5.5M: A Large Vision-Language Model and A Comprehensive Multimodal Dataset Towards General Medical AI

Add code
Nov 21, 2024
Viaarxiv icon

Intent-Aware Dialogue Generation and Multi-Task Contrastive Learning for Multi-Turn Intent Classification

Add code
Nov 21, 2024
Viaarxiv icon

Interactive Medical Image Segmentation: A Benchmark Dataset and Baseline

Add code
Nov 19, 2024
Viaarxiv icon

Balancing Accuracy and Efficiency in Multi-Turn Intent Classification for LLM-Powered Dialog Systems in Production

Add code
Nov 19, 2024
Viaarxiv icon

MVPaint: Synchronized Multi-View Diffusion for Painting Anything 3D

Add code
Nov 04, 2024
Figure 1 for MVPaint: Synchronized Multi-View Diffusion for Painting Anything 3D
Figure 2 for MVPaint: Synchronized Multi-View Diffusion for Painting Anything 3D
Figure 3 for MVPaint: Synchronized Multi-View Diffusion for Painting Anything 3D
Figure 4 for MVPaint: Synchronized Multi-View Diffusion for Painting Anything 3D
Viaarxiv icon

Responsible Multilingual Large Language Models: A Survey of Development, Applications, and Societal Impact

Add code
Oct 23, 2024
Viaarxiv icon

Blocks as Probes: Dissecting Categorization Ability of Large Multimodal Models

Add code
Sep 03, 2024
Figure 1 for Blocks as Probes: Dissecting Categorization Ability of Large Multimodal Models
Figure 2 for Blocks as Probes: Dissecting Categorization Ability of Large Multimodal Models
Figure 3 for Blocks as Probes: Dissecting Categorization Ability of Large Multimodal Models
Figure 4 for Blocks as Probes: Dissecting Categorization Ability of Large Multimodal Models
Viaarxiv icon

GMAI-MMBench: A Comprehensive Multimodal Evaluation Benchmark Towards General Medical AI

Add code
Aug 06, 2024
Viaarxiv icon

AppAgent v2: Advanced Agent for Flexible Mobile Interactions

Add code
Aug 05, 2024
Viaarxiv icon

LinguaLinker: Audio-Driven Portraits Animation with Implicit Facial Control Enhancement

Add code
Jul 26, 2024
Viaarxiv icon