Picture for Alan Yuille

Alan Yuille

Johns Hopkins University

PulseCheck457: A Diagnostic Benchmark for 6D Spatial Reasoning of Large Multimodal Models

Add code
Feb 13, 2025
Viaarxiv icon

PulseCheck457: A Diagnostic Benchmark for Comprehensive Spatial Reasoning of Large Multimodal Models

Add code
Feb 12, 2025
Viaarxiv icon

EigenLoRAx: Recycling Adapters to Find Principal Subspaces for Resource-Efficient Adaptation and Inference

Add code
Feb 07, 2025
Viaarxiv icon

Scaling Laws in Patchification: An Image Is Worth 50,176 Tokens And More

Add code
Feb 06, 2025
Viaarxiv icon

How Well Do Supervised 3D Models Transfer to Medical Imaging Tasks?

Add code
Jan 20, 2025
Viaarxiv icon

VideoAuteur: Towards Long Narrative Video Generation

Add code
Jan 10, 2025
Viaarxiv icon

RadGPT: Constructing 3D Image-Text Tumor Datasets

Add code
Jan 08, 2025
Viaarxiv icon

Text-Driven Tumor Synthesis

Add code
Dec 24, 2024
Figure 1 for Text-Driven Tumor Synthesis
Figure 2 for Text-Driven Tumor Synthesis
Figure 3 for Text-Driven Tumor Synthesis
Figure 4 for Text-Driven Tumor Synthesis
Viaarxiv icon

Flowing from Words to Pixels: A Framework for Cross-Modality Evolution

Add code
Dec 19, 2024
Figure 1 for Flowing from Words to Pixels: A Framework for Cross-Modality Evolution
Figure 2 for Flowing from Words to Pixels: A Framework for Cross-Modality Evolution
Figure 3 for Flowing from Words to Pixels: A Framework for Cross-Modality Evolution
Figure 4 for Flowing from Words to Pixels: A Framework for Cross-Modality Evolution
Viaarxiv icon

FlowAR: Scale-wise Autoregressive Image Generation Meets Flow Matching

Add code
Dec 19, 2024
Viaarxiv icon