Picture for Angela Yao

Angela Yao

National University of Singapore

On the Consistency of Video Large Language Models in Temporal Comprehension

Add code
Nov 20, 2024
Viaarxiv icon

OnlineTAS: An Online Baseline for Temporal Action Segmentation

Add code
Nov 02, 2024
Viaarxiv icon

Scene-Text Grounding for Text-Based Video Question Answering

Add code
Sep 22, 2024
Figure 1 for Scene-Text Grounding for Text-Based Video Question Answering
Figure 2 for Scene-Text Grounding for Text-Based Video Question Answering
Figure 3 for Scene-Text Grounding for Text-Based Video Question Answering
Figure 4 for Scene-Text Grounding for Text-Based Video Question Answering
Viaarxiv icon

Question-Answering Dense Video Events

Add code
Sep 10, 2024
Viaarxiv icon

Long-Tail Temporal Action Segmentation with Group-wise Temporal Logit Adjustment

Add code
Aug 19, 2024
Viaarxiv icon

VideoQA in the Era of LLMs: An Empirical Study

Add code
Aug 08, 2024
Viaarxiv icon

RealViformer: Investigating Attention for Real-World Video Super-Resolution

Add code
Jul 19, 2024
Viaarxiv icon

NL2Contact: Natural Language Guided 3D Hand-Object Contact Modeling with Diffusion Model

Add code
Jul 17, 2024
Viaarxiv icon

Pairwise Distance Distillation for Unsupervised Real-World Image Super-Resolution

Add code
Jul 10, 2024
Viaarxiv icon

OfCaM: Global Human Mesh Recovery via Optimization-free Camera Motion Scale Calibration

Add code
Jun 30, 2024
Viaarxiv icon