Picture for Ali Koksal

Ali Koksal

VG-TVP: Multimodal Procedural Planning via Visually Grounded Text-Video Prompting

Add code
Dec 16, 2024
Figure 1 for VG-TVP: Multimodal Procedural Planning via Visually Grounded Text-Video Prompting
Figure 2 for VG-TVP: Multimodal Procedural Planning via Visually Grounded Text-Video Prompting
Figure 3 for VG-TVP: Multimodal Procedural Planning via Visually Grounded Text-Video Prompting
Figure 4 for VG-TVP: Multimodal Procedural Planning via Visually Grounded Text-Video Prompting
Viaarxiv icon

Masked Diffusion with Task-awareness for Procedure Planning in Instructional Videos

Add code
Sep 14, 2023
Viaarxiv icon