Oliver Lemon

Heriot-Watt University

Shaking Up VLMs: Comparing Transformers and Structured State Space Models for Vision & Language Modeling

Sep 09, 2024
Via arXiv

AlanaVLM: A Multimodal Embodied AI Foundation Model for Egocentric Video Understanding

Jun 19, 2024
Via arXiv

Lost in Space: Probing Fine-grained Spatial Understanding in Vision and Language Resamplers

Apr 21, 2024
Via arXiv

Socially Pertinent Robots in Gerontological Healthcare

Apr 11, 2024
Via arXiv

NLP Verification: Towards a General Methodology for Certifying Robustness

Mar 15, 2024
Via arXiv

Visually Grounded Language Learning: a review of language games, datasets, tasks, and models

Dec 05, 2023
Via arXiv

Multitask Multimodal Prompted Training for Interactive Embodied Task Completion

Nov 07, 2023
Via arXiv

Detecting Agreement in Multi-party Conversational AI

Nov 06, 2023
Via arXiv

Detecting agreement in multi-party dialogue: evaluating speaker diarisation versus a procedural baseline to enhance user engagement

Nov 06, 2023
Via arXiv

FurChat: An Embodied Conversational Agent using LLMs, Combining Open and Closed-Domain Dialogue with Facial Expressions

Aug 30, 2023
Via arXiv