Tobias Weyand

Neptune: The Long Orbit to Benchmarking Long Video Understanding

Dec 12, 2024

Extending Video Masked Autoencoders to 128 frames

Nov 20, 2024

VideoPrism: A Foundational Visual Encoder for Video Understanding

Feb 20, 2024

VideoGLUE: Video General Understanding Evaluation of Foundation Models

Jul 06, 2023

Improving Fairness in Large-Scale Object Recognition by CrowdSourced Demographic Information

Jun 02, 2022

Towards A Fairer Landmark Recognition Dataset

Aug 19, 2021

Nutrition5k: Towards Automatic Nutritional Understanding of Generic Food

Mar 04, 2021

Google Landmarks Dataset v2 -- A Large-Scale Benchmark for Instance-Level Recognition and Retrieval

Apr 03, 2020

CPlaNet: Enhancing Image Geolocalization by Combinatorial Partitioning of Maps

Aug 06, 2018

Large-Scale Image Retrieval with Attentive Deep Local Features

Feb 03, 2018