Picture for Bin Fu

Bin Fu

MVPaint: Synchronized Multi-View Diffusion for Painting Anything 3D

Add code
Nov 04, 2024
Viaarxiv icon

Responsible Multilingual Large Language Models: A Survey of Development, Applications, and Societal Impact

Add code
Oct 23, 2024
Viaarxiv icon

Blocks as Probes: Dissecting Categorization Ability of Large Multimodal Models

Add code
Sep 03, 2024
Figure 1 for Blocks as Probes: Dissecting Categorization Ability of Large Multimodal Models
Figure 2 for Blocks as Probes: Dissecting Categorization Ability of Large Multimodal Models
Figure 3 for Blocks as Probes: Dissecting Categorization Ability of Large Multimodal Models
Figure 4 for Blocks as Probes: Dissecting Categorization Ability of Large Multimodal Models
Viaarxiv icon

GMAI-MMBench: A Comprehensive Multimodal Evaluation Benchmark Towards General Medical AI

Add code
Aug 06, 2024
Viaarxiv icon

AppAgent v2: Advanced Agent for Flexible Mobile Interactions

Add code
Aug 05, 2024
Viaarxiv icon

LinguaLinker: Audio-Driven Portraits Animation with Implicit Facial Control Enhancement

Add code
Jul 26, 2024
Viaarxiv icon

EMMA: Your Text-to-Image Diffusion Model Can Secretly Accept Multi-Modal Prompts

Add code
Jun 13, 2024
Viaarxiv icon

MeshXL: Neural Coordinate Field for Generative 3D Foundation Models

Add code
May 31, 2024
Viaarxiv icon

ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment

Add code
Mar 08, 2024
Viaarxiv icon

Paint3D: Paint Anything 3D with Lighting-Less Texture Diffusion Models

Add code
Dec 22, 2023
Viaarxiv icon