Jonathan Roberts

Beyond Outcomes: Transparent Assessment of LLM Reasoning in Games

Dec 18, 2024
Via arXiv

Needle Threading: Can LLMs Follow Threads through Near-Million-Scale Haystacks?

Nov 07, 2024
Via arXiv

GRAB: A Challenging GRaph Analysis Benchmark for Large Multimodal Models

Aug 21, 2024
Via arXiv

SciFIBench: Benchmarking Large Multimodal Models for Scientific Figure Interpretation

May 14, 2024
Via arXiv

Charting New Territories: Exploring the Geographic and Geospatial Capabilities of Multimodal LLMs

Nov 30, 2023
Via arXiv

GPT4GEO: How a Language Model Sees the World's Geography

May 30, 2023
Via arXiv

SATIN: A Multi-Task Metadataset for Classifying Satellite Imagery using Vision-Language Models

Apr 23, 2023
Via arXiv

3D Semantic Mapping from Arthroscopy using Out-of-distribution Pose and Depth and In-distribution Segmentation Training

Jun 10, 2021
Via arXiv

Arthroscopic Multi-Spectral Scene Segmentation Using Deep Learning

Mar 03, 2021
Via arXiv

Real-time Joint Motion Analysis and Instrument Tracking for Robot-Assisted Orthopaedic Surgery

Sep 06, 2019
Via arXiv