Picture for Miguel P. Eckstein

Miguel P. Eckstein

IRIS: Intent Resolution via Inference-time Saccades for Open-Ended VQA in Large Vision-Language Models

Add code
Feb 18, 2026
Viaarxiv icon

Predicting Reaction Time to Comprehend Scenes with Foveated Scene Understanding Maps

Add code
May 19, 2025
Figure 1 for Predicting Reaction Time to Comprehend Scenes with Foveated Scene Understanding Maps
Figure 2 for Predicting Reaction Time to Comprehend Scenes with Foveated Scene Understanding Maps
Figure 3 for Predicting Reaction Time to Comprehend Scenes with Foveated Scene Understanding Maps
Figure 4 for Predicting Reaction Time to Comprehend Scenes with Foveated Scene Understanding Maps
Viaarxiv icon

Convolutional Neural Network Model Observers Discount Signal-like Anatomical Structures During Search in Virtual Digital Breast Tomosynthesis Phantoms

Add code
May 23, 2024
Viaarxiv icon

A deep Q-learning method for optimizing visual search strategies in backgrounds of dynamic noise

Add code
Jan 28, 2022
Figure 1 for A deep Q-learning method for optimizing visual search strategies in backgrounds of dynamic noise
Figure 2 for A deep Q-learning method for optimizing visual search strategies in backgrounds of dynamic noise
Figure 3 for A deep Q-learning method for optimizing visual search strategies in backgrounds of dynamic noise
Viaarxiv icon

FoveaTer: Foveated Transformer for Image Classification

Add code
May 29, 2021
Figure 1 for FoveaTer: Foveated Transformer for Image Classification
Figure 2 for FoveaTer: Foveated Transformer for Image Classification
Figure 3 for FoveaTer: Foveated Transformer for Image Classification
Figure 4 for FoveaTer: Foveated Transformer for Image Classification
Viaarxiv icon

Comparing Visual Reasoning in Humans and AI

Add code
Apr 29, 2021
Figure 1 for Comparing Visual Reasoning in Humans and AI
Figure 2 for Comparing Visual Reasoning in Humans and AI
Figure 3 for Comparing Visual Reasoning in Humans and AI
Figure 4 for Comparing Visual Reasoning in Humans and AI
Viaarxiv icon

Gaze Perception in Humans and CNN-Based Model

Add code
Apr 17, 2021
Figure 1 for Gaze Perception in Humans and CNN-Based Model
Figure 2 for Gaze Perception in Humans and CNN-Based Model
Figure 3 for Gaze Perception in Humans and CNN-Based Model
Figure 4 for Gaze Perception in Humans and CNN-Based Model
Viaarxiv icon

Language-based Video Editing via Multi-Modal Multi-Level Transformer

Add code
Apr 02, 2021
Figure 1 for Language-based Video Editing via Multi-Modal Multi-Level Transformer
Figure 2 for Language-based Video Editing via Multi-Modal Multi-Level Transformer
Figure 3 for Language-based Video Editing via Multi-Modal Multi-Level Transformer
Figure 4 for Language-based Video Editing via Multi-Modal Multi-Level Transformer
Viaarxiv icon

Medical Image Quality Metrics for Foveated Model Observers

Add code
Feb 09, 2021
Figure 1 for Medical Image Quality Metrics for Foveated Model Observers
Figure 2 for Medical Image Quality Metrics for Foveated Model Observers
Figure 3 for Medical Image Quality Metrics for Foveated Model Observers
Figure 4 for Medical Image Quality Metrics for Foveated Model Observers
Viaarxiv icon

Assessment of Faster R-CNN in Man-Machine collaborative search

Add code
Apr 04, 2019
Figure 1 for Assessment of Faster R-CNN in Man-Machine collaborative search
Figure 2 for Assessment of Faster R-CNN in Man-Machine collaborative search
Figure 3 for Assessment of Faster R-CNN in Man-Machine collaborative search
Figure 4 for Assessment of Faster R-CNN in Man-Machine collaborative search
Viaarxiv icon