Picture for Bin Fu

Bin Fu

A Survey of Scientific Large Language Models: From Data Foundations to Agent Frontiers

Add code
Aug 28, 2025
Viaarxiv icon

InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency

Add code
Aug 25, 2025
Viaarxiv icon

Lumina-mGPT 2.0: Stand-Alone AutoRegressive Image Modeling

Add code
Jul 23, 2025
Viaarxiv icon

Resurrect Mask AutoRegressive Modeling for Efficient and Scalable Image Generation

Add code
Jul 17, 2025
Viaarxiv icon

Adapting a Segmentation Foundation Model for Medical Image Classification

Add code
May 09, 2025
Viaarxiv icon

Topo-VM-UNetV2: Encoding Topology into Vision Mamba UNet for Polyp Segmentation

Add code
May 09, 2025
Figure 1 for Topo-VM-UNetV2: Encoding Topology into Vision Mamba UNet for Polyp Segmentation
Figure 2 for Topo-VM-UNetV2: Encoding Topology into Vision Mamba UNet for Polyp Segmentation
Figure 3 for Topo-VM-UNetV2: Encoding Topology into Vision Mamba UNet for Polyp Segmentation
Figure 4 for Topo-VM-UNetV2: Encoding Topology into Vision Mamba UNet for Polyp Segmentation
Viaarxiv icon

White Light Specular Reflection Data Augmentation for Deep Learning Polyp Detection

Add code
May 08, 2025
Viaarxiv icon

GMAI-VL-R1: Harnessing Reinforcement Learning for Multimodal Medical Reasoning

Add code
Apr 02, 2025
Viaarxiv icon

Lumina-Image 2.0: A Unified and Efficient Image Generative Framework

Add code
Mar 27, 2025
Viaarxiv icon

LeX-Art: Rethinking Text Generation via Scalable High-Quality Data Synthesis

Add code
Mar 27, 2025
Viaarxiv icon