Title: InternVideo-Ego4D: A Pack of Champion Solutions to Ego4D Challenges

Nov 17, 2022

Guo Chen, Sen Xing, Zhe Chen, Yi Wang, Kunchang Li, Yizhuo Li, Yi Liu, Jiahao Wang, Yin-Dong Zheng, Bingkun Huang (+11 more)

Abstract: In this report, we present our champion solutions to five tracks of the Ego4D challenge. We apply InternVideo, a video foundation model we developed, to five Ego4D tasks: Moment Queries, Natural Language Queries, Future Hand Prediction, State Change Object Detection, and Short-term Object Interaction Anticipation. InternVideo-Ego4D is an effective paradigm for adapting a strong foundation model to downstream egocentric video understanding tasks with simple head designs. On all five tasks, InternVideo-Ego4D surpasses both the baseline methods and the CVPR 2022 champions, demonstrating the strong representation ability of InternVideo as a video foundation model. Our code will be released at https://github.com/OpenGVLab/ego4d-eccv2022-solutions

* Technical report in 2nd International Ego4D Workshop@ECCV 2022. Code will be released at https://github.com/OpenGVLab/ego4d-eccv2022-solutions
