Picture for David Semedo

David Semedo

Self-ReS: Self-Reflection in Large Vision-Language Models for Long Video Understanding

Add code
Mar 26, 2025
Viaarxiv icon

Zero-Shot Action Recognition in Surveillance Videos

Add code
Oct 28, 2024
Viaarxiv icon

Multi-trait User Simulation with Adaptive Decoding for Conversational Task Assistants

Add code
Oct 16, 2024
Viaarxiv icon

Show and Guide: Instructional-Plan Grounded Vision and Language Model

Add code
Sep 27, 2024
Figure 1 for Show and Guide: Instructional-Plan Grounded Vision and Language Model
Figure 2 for Show and Guide: Instructional-Plan Grounded Vision and Language Model
Figure 3 for Show and Guide: Instructional-Plan Grounded Vision and Language Model
Figure 4 for Show and Guide: Instructional-Plan Grounded Vision and Language Model
Viaarxiv icon

GlórIA -- A Generative and Open Large Language Model for Portuguese

Add code
Feb 20, 2024
Viaarxiv icon

Plan-Grounded Large Language Models for Dual Goal Conversational Settings

Add code
Feb 01, 2024
Viaarxiv icon

TWIZ: The Wizard of Multimodal Conversational-Stimulus

Add code
Oct 03, 2023
Viaarxiv icon

Grounded Complex Task Segmentation for Conversational Assistants

Add code
Sep 20, 2023
Viaarxiv icon

The Wizard of Curiosities: Enriching Dialogues with Fun Facts

Add code
Sep 20, 2023
Viaarxiv icon

Rating Prediction in Conversational Task Assistants with Behavioral and Conversational-Flow Features

Add code
Sep 20, 2023
Figure 1 for Rating Prediction in Conversational Task Assistants with Behavioral and Conversational-Flow Features
Figure 2 for Rating Prediction in Conversational Task Assistants with Behavioral and Conversational-Flow Features
Figure 3 for Rating Prediction in Conversational Task Assistants with Behavioral and Conversational-Flow Features
Figure 4 for Rating Prediction in Conversational Task Assistants with Behavioral and Conversational-Flow Features
Viaarxiv icon