Picture for Dimitris N. Metaxas

Dimitris N. Metaxas

Rutgers University

Show and Segment: Universal Medical Image Segmentation via In-Context Learning

Add code
Mar 25, 2025
Viaarxiv icon

LED: LLM Enhanced Open-Vocabulary Object Detection without Human Curated Data Generation

Add code
Mar 18, 2025
Viaarxiv icon

Snapmoji: Instant Generation of Animatable Dual-Stylized Avatars

Add code
Mar 15, 2025
Viaarxiv icon

Towards Universal Learning-based Model for Cardiac Image Reconstruction: Summary of the CMRxRecon2024 Challenge

Add code
Mar 05, 2025
Viaarxiv icon

LUCAS: Layered Universal Codec Avatars

Add code
Feb 27, 2025
Viaarxiv icon

The Hidden Life of Tokens: Reducing Hallucination of Large Vision-Language Models via Visual Information Steering

Add code
Feb 05, 2025
Figure 1 for The Hidden Life of Tokens: Reducing Hallucination of Large Vision-Language Models via Visual Information Steering
Figure 2 for The Hidden Life of Tokens: Reducing Hallucination of Large Vision-Language Models via Visual Information Steering
Figure 3 for The Hidden Life of Tokens: Reducing Hallucination of Large Vision-Language Models via Visual Information Steering
Figure 4 for The Hidden Life of Tokens: Reducing Hallucination of Large Vision-Language Models via Visual Information Steering
Viaarxiv icon

LoR-VP: Low-Rank Visual Prompting for Efficient Vision Model Adaptation

Add code
Feb 04, 2025
Viaarxiv icon

RankFlow: A Multi-Role Collaborative Reranking Workflow Utilizing Large Language Models

Add code
Feb 04, 2025
Viaarxiv icon

MLLM-as-a-Judge for Image Safety without Human Labeling

Add code
Dec 31, 2024
Figure 1 for MLLM-as-a-Judge for Image Safety without Human Labeling
Figure 2 for MLLM-as-a-Judge for Image Safety without Human Labeling
Figure 3 for MLLM-as-a-Judge for Image Safety without Human Labeling
Figure 4 for MLLM-as-a-Judge for Image Safety without Human Labeling
Viaarxiv icon

Accelerating Multimodel Large Language Models by Searching Optimal Vision Token Reduction

Add code
Nov 30, 2024
Figure 1 for Accelerating Multimodel Large Language Models by Searching Optimal Vision Token Reduction
Figure 2 for Accelerating Multimodel Large Language Models by Searching Optimal Vision Token Reduction
Figure 3 for Accelerating Multimodel Large Language Models by Searching Optimal Vision Token Reduction
Figure 4 for Accelerating Multimodel Large Language Models by Searching Optimal Vision Token Reduction
Viaarxiv icon