Picture for Jieneng Chen

Jieneng Chen

PulseCheck457: A Diagnostic Benchmark for 6D Spatial Reasoning of Large Multimodal Models

Add code
Feb 13, 2025
Viaarxiv icon

PulseCheck457: A Diagnostic Benchmark for Comprehensive Spatial Reasoning of Large Multimodal Models

Add code
Feb 12, 2025
Viaarxiv icon

GenEx: Generating an Explorable World

Add code
Dec 12, 2024
Viaarxiv icon

3DSRBench: A Comprehensive 3D Spatial Reasoning Benchmark

Add code
Dec 10, 2024
Viaarxiv icon

Generative World Explorer

Add code
Nov 19, 2024
Figure 1 for Generative World Explorer
Figure 2 for Generative World Explorer
Figure 3 for Generative World Explorer
Figure 4 for Generative World Explorer
Viaarxiv icon

Touchstone Benchmark: Are We on the Right Way for Evaluating AI Algorithms for Medical Segmentation?

Add code
Nov 06, 2024
Figure 1 for Touchstone Benchmark: Are We on the Right Way for Evaluating AI Algorithms for Medical Segmentation?
Figure 2 for Touchstone Benchmark: Are We on the Right Way for Evaluating AI Algorithms for Medical Segmentation?
Figure 3 for Touchstone Benchmark: Are We on the Right Way for Evaluating AI Algorithms for Medical Segmentation?
Figure 4 for Touchstone Benchmark: Are We on the Right Way for Evaluating AI Algorithms for Medical Segmentation?
Viaarxiv icon

LLaVolta: Efficient Multi-modal Models via Stage-wise Visual Context Compression

Add code
Jun 28, 2024
Figure 1 for LLaVolta: Efficient Multi-modal Models via Stage-wise Visual Context Compression
Figure 2 for LLaVolta: Efficient Multi-modal Models via Stage-wise Visual Context Compression
Figure 3 for LLaVolta: Efficient Multi-modal Models via Stage-wise Visual Context Compression
Figure 4 for LLaVolta: Efficient Multi-modal Models via Stage-wise Visual Context Compression
Viaarxiv icon

ViTamin: Designing Scalable Vision Models in the Vision-Language Era

Add code
Apr 03, 2024
Figure 1 for ViTamin: Designing Scalable Vision Models in the Vision-Language Era
Figure 2 for ViTamin: Designing Scalable Vision Models in the Vision-Language Era
Figure 3 for ViTamin: Designing Scalable Vision Models in the Vision-Language Era
Figure 4 for ViTamin: Designing Scalable Vision Models in the Vision-Language Era
Viaarxiv icon

3D-TransUNet for Brain Metastases Segmentation in the BraTS2023 Challenge

Add code
Mar 23, 2024
Figure 1 for 3D-TransUNet for Brain Metastases Segmentation in the BraTS2023 Challenge
Figure 2 for 3D-TransUNet for Brain Metastases Segmentation in the BraTS2023 Challenge
Figure 3 for 3D-TransUNet for Brain Metastases Segmentation in the BraTS2023 Challenge
Figure 4 for 3D-TransUNet for Brain Metastases Segmentation in the BraTS2023 Challenge
Viaarxiv icon

Prompt-Based Exemplar Super-Compression and Regeneration for Class-Incremental Learning

Add code
Nov 30, 2023
Viaarxiv icon