Picture for Bin Fu

Bin Fu

Lumina-Video: Efficient and Flexible Video Generation with Multi-scale Next-DiT

Add code
Feb 10, 2025
Viaarxiv icon

GMAI-VL & GMAI-VL-5.5M: A Large Vision-Language Model and A Comprehensive Multimodal Dataset Towards General Medical AI

Add code
Nov 21, 2024
Viaarxiv icon

Intent-Aware Dialogue Generation and Multi-Task Contrastive Learning for Multi-Turn Intent Classification

Add code
Nov 21, 2024
Viaarxiv icon

Balancing Accuracy and Efficiency in Multi-Turn Intent Classification for LLM-Powered Dialog Systems in Production

Add code
Nov 19, 2024
Viaarxiv icon

Interactive Medical Image Segmentation: A Benchmark Dataset and Baseline

Add code
Nov 19, 2024
Viaarxiv icon

MVPaint: Synchronized Multi-View Diffusion for Painting Anything 3D

Add code
Nov 04, 2024
Figure 1 for MVPaint: Synchronized Multi-View Diffusion for Painting Anything 3D
Figure 2 for MVPaint: Synchronized Multi-View Diffusion for Painting Anything 3D
Figure 3 for MVPaint: Synchronized Multi-View Diffusion for Painting Anything 3D
Figure 4 for MVPaint: Synchronized Multi-View Diffusion for Painting Anything 3D
Viaarxiv icon

Responsible Multilingual Large Language Models: A Survey of Development, Applications, and Societal Impact

Add code
Oct 23, 2024
Viaarxiv icon

Blocks as Probes: Dissecting Categorization Ability of Large Multimodal Models

Add code
Sep 03, 2024
Figure 1 for Blocks as Probes: Dissecting Categorization Ability of Large Multimodal Models
Figure 2 for Blocks as Probes: Dissecting Categorization Ability of Large Multimodal Models
Figure 3 for Blocks as Probes: Dissecting Categorization Ability of Large Multimodal Models
Figure 4 for Blocks as Probes: Dissecting Categorization Ability of Large Multimodal Models
Viaarxiv icon

GMAI-MMBench: A Comprehensive Multimodal Evaluation Benchmark Towards General Medical AI

Add code
Aug 06, 2024
Viaarxiv icon

AppAgent v2: Advanced Agent for Flexible Mobile Interactions

Add code
Aug 05, 2024
Figure 1 for AppAgent v2: Advanced Agent for Flexible Mobile Interactions
Figure 2 for AppAgent v2: Advanced Agent for Flexible Mobile Interactions
Figure 3 for AppAgent v2: Advanced Agent for Flexible Mobile Interactions
Figure 4 for AppAgent v2: Advanced Agent for Flexible Mobile Interactions
Viaarxiv icon