Oliver Lemon

Heriot-Watt University

Shaking Up VLMs: Comparing Transformers and Structured State Space Models for Vision & Language Modeling

Sep 09, 2024
Via arXiv

AlanaVLM: A Multimodal Embodied AI Foundation Model for Egocentric Video Understanding

Jun 19, 2024
Via arXiv

Lost in Space: Probing Fine-grained Spatial Understanding in Vision and Language Resamplers

Apr 21, 2024
Via arXiv

Socially Pertinent Robots in Gerontological Healthcare

Apr 11, 2024
Via arXiv

NLP Verification: Towards a General Methodology for Certifying Robustness

Mar 15, 2024
Via arXiv

Visually Grounded Language Learning: a review of language games, datasets, tasks, and models

Dec 05, 2023
Via arXiv

Multitask Multimodal Prompted Training for Interactive Embodied Task Completion

Nov 07, 2023
Via arXiv

Detecting Agreement in Multi-party Conversational AI

Nov 06, 2023
Via arXiv

Detecting agreement in multi-party dialogue: evaluating speaker diarisation versus a procedural baseline to enhance user engagement

Nov 06, 2023
Via arXiv

FurChat: An Embodied Conversational Agent using LLMs, Combining Open and Closed-Domain Dialogue with Facial Expressions

Aug 30, 2023
Via arXiv