Picture for Apoorva Beedu

Apoorva Beedu

Exploring Efficient Foundational Multi-modal Models for Video Summarization

Add code
Oct 09, 2024
Viaarxiv icon

Mamba Fusion: Learning Actions Through Questioning

Add code
Sep 17, 2024
Viaarxiv icon

Limitations in Employing Natural Language Supervision for Sensor-Based Human Activity Recognition -- And Ways to Overcome Them

Add code
Aug 21, 2024
Viaarxiv icon

On the Efficacy of Text-Based Input Modalities for Action Anticipation

Add code
Jan 23, 2024
Viaarxiv icon

Multimodal Contrastive Learning with Hard Negative Sampling for Human Activity Recognition

Add code
Sep 03, 2023
Viaarxiv icon

Multi-Stage Based Feature Fusion of Multi-Modal Data for Human Activity Recognition

Add code
Nov 08, 2022
Viaarxiv icon

Video based Object 6D Pose Estimation using Transformers

Add code
Nov 07, 2022
Viaarxiv icon

End-to-End Multimodal Representation Learning for Video Dialog

Add code
Oct 26, 2022
Viaarxiv icon

VideoPose: Estimating 6D object pose from videos

Add code
Nov 20, 2021
Figure 1 for VideoPose: Estimating 6D object pose from videos
Figure 2 for VideoPose: Estimating 6D object pose from videos
Figure 3 for VideoPose: Estimating 6D object pose from videos
Figure 4 for VideoPose: Estimating 6D object pose from videos
Viaarxiv icon