Picture for Bohan Zhuang

Bohan Zhuang

ZipAR: Accelerating Autoregressive Image Generation through Spatial Locality

Add code
Dec 05, 2024
Figure 1 for ZipAR: Accelerating Autoregressive Image Generation through Spatial Locality
Figure 2 for ZipAR: Accelerating Autoregressive Image Generation through Spatial Locality
Figure 3 for ZipAR: Accelerating Autoregressive Image Generation through Spatial Locality
Figure 4 for ZipAR: Accelerating Autoregressive Image Generation through Spatial Locality
Viaarxiv icon

Enhancing Perception Capabilities of Multimodal LLMs with Training-free Fusion

Add code
Dec 02, 2024
Viaarxiv icon

Evaluating and Advancing Multimodal Large Language Models in Ability Lens

Add code
Nov 22, 2024
Viaarxiv icon

MVSplat360: Feed-Forward 360 Scene Synthesis from Sparse Views

Add code
Nov 07, 2024
Figure 1 for MVSplat360: Feed-Forward 360 Scene Synthesis from Sparse Views
Figure 2 for MVSplat360: Feed-Forward 360 Scene Synthesis from Sparse Views
Figure 3 for MVSplat360: Feed-Forward 360 Scene Synthesis from Sparse Views
Figure 4 for MVSplat360: Feed-Forward 360 Scene Synthesis from Sparse Views
Viaarxiv icon

ZipVL: Efficient Large Vision-Language Models with Dynamic Token Sparsification and KV Cache Compression

Add code
Oct 11, 2024
Viaarxiv icon

McCaD: Multi-Contrast MRI Conditioned, Adaptive Adversarial Diffusion Model for High-Fidelity MRI Synthesis

Add code
Sep 01, 2024
Figure 1 for McCaD: Multi-Contrast MRI Conditioned, Adaptive Adversarial Diffusion Model for High-Fidelity MRI Synthesis
Figure 2 for McCaD: Multi-Contrast MRI Conditioned, Adaptive Adversarial Diffusion Model for High-Fidelity MRI Synthesis
Figure 3 for McCaD: Multi-Contrast MRI Conditioned, Adaptive Adversarial Diffusion Model for High-Fidelity MRI Synthesis
Figure 4 for McCaD: Multi-Contrast MRI Conditioned, Adaptive Adversarial Diffusion Model for High-Fidelity MRI Synthesis
Viaarxiv icon

GMAI-MMBench: A Comprehensive Multimodal Evaluation Benchmark Towards General Medical AI

Add code
Aug 06, 2024
Viaarxiv icon

InfiniMotion: Mamba Boosts Memory in Transformer for Arbitrary Long Motion Generation

Add code
Jul 14, 2024
Viaarxiv icon

SAM-Med3D-MoE: Towards a Non-Forgetting Segment Anything Model via Mixture of Experts for 3D Medical Image Segmentation

Add code
Jul 06, 2024
Figure 1 for SAM-Med3D-MoE: Towards a Non-Forgetting Segment Anything Model via Mixture of Experts for 3D Medical Image Segmentation
Figure 2 for SAM-Med3D-MoE: Towards a Non-Forgetting Segment Anything Model via Mixture of Experts for 3D Medical Image Segmentation
Figure 3 for SAM-Med3D-MoE: Towards a Non-Forgetting Segment Anything Model via Mixture of Experts for 3D Medical Image Segmentation
Figure 4 for SAM-Med3D-MoE: Towards a Non-Forgetting Segment Anything Model via Mixture of Experts for 3D Medical Image Segmentation
Viaarxiv icon

ME-Switch: A Memory-Efficient Expert Switching Framework for Large Language Models

Add code
Jun 13, 2024
Viaarxiv icon