Picture for Samyak Datta

Samyak Datta

Sid

Movie Gen: A Cast of Media Foundation Models

Add code
Oct 17, 2024
Figure 1 for Movie Gen: A Cast of Media Foundation Models
Figure 2 for Movie Gen: A Cast of Media Foundation Models
Figure 3 for Movie Gen: A Cast of Media Foundation Models
Figure 4 for Movie Gen: A Cast of Media Foundation Models
Viaarxiv icon

The Llama 3 Herd of Models

Add code
Jul 31, 2024
Viaarxiv icon

DISGO: Automatic End-to-End Evaluation for Scene Text OCR

Add code
Aug 25, 2023
Figure 1 for DISGO: Automatic End-to-End Evaluation for Scene Text OCR
Figure 2 for DISGO: Automatic End-to-End Evaluation for Scene Text OCR
Figure 3 for DISGO: Automatic End-to-End Evaluation for Scene Text OCR
Figure 4 for DISGO: Automatic End-to-End Evaluation for Scene Text OCR
Viaarxiv icon

Episodic Memory Question Answering

Add code
May 03, 2022
Figure 1 for Episodic Memory Question Answering
Figure 2 for Episodic Memory Question Answering
Figure 3 for Episodic Memory Question Answering
Figure 4 for Episodic Memory Question Answering
Viaarxiv icon

Integrating Egocentric Localization for More Realistic Point-Goal Navigation Agents

Add code
Sep 07, 2020
Figure 1 for Integrating Egocentric Localization for More Realistic Point-Goal Navigation Agents
Figure 2 for Integrating Egocentric Localization for More Realistic Point-Goal Navigation Agents
Figure 3 for Integrating Egocentric Localization for More Realistic Point-Goal Navigation Agents
Figure 4 for Integrating Egocentric Localization for More Realistic Point-Goal Navigation Agents
Viaarxiv icon

Embodied Question Answering in Photorealistic Environments with Point Cloud Perception

Add code
Apr 06, 2019
Figure 1 for Embodied Question Answering in Photorealistic Environments with Point Cloud Perception
Figure 2 for Embodied Question Answering in Photorealistic Environments with Point Cloud Perception
Figure 3 for Embodied Question Answering in Photorealistic Environments with Point Cloud Perception
Figure 4 for Embodied Question Answering in Photorealistic Environments with Point Cloud Perception
Viaarxiv icon

Align2Ground: Weakly Supervised Phrase Grounding Guided by Image-Caption Alignment

Add code
Mar 27, 2019
Figure 1 for Align2Ground: Weakly Supervised Phrase Grounding Guided by Image-Caption Alignment
Figure 2 for Align2Ground: Weakly Supervised Phrase Grounding Guided by Image-Caption Alignment
Figure 3 for Align2Ground: Weakly Supervised Phrase Grounding Guided by Image-Caption Alignment
Figure 4 for Align2Ground: Weakly Supervised Phrase Grounding Guided by Image-Caption Alignment
Viaarxiv icon

Unsupervised Learning of Face Representations

Add code
Mar 03, 2018
Figure 1 for Unsupervised Learning of Face Representations
Figure 2 for Unsupervised Learning of Face Representations
Figure 3 for Unsupervised Learning of Face Representations
Viaarxiv icon

Embodied Question Answering

Add code
Dec 01, 2017
Figure 1 for Embodied Question Answering
Figure 2 for Embodied Question Answering
Figure 3 for Embodied Question Answering
Figure 4 for Embodied Question Answering
Viaarxiv icon