Picture for Richard Luo

Richard Luo

Shotluck Holmes: A Family of Efficient Small-Scale Large Language Vision Models For Video Captioning and Summarization

Add code
May 31, 2024
Viaarxiv icon

Joint Moment Retrieval and Highlight Detection Via Natural Language Queries

Add code
May 08, 2023
Viaarxiv icon