Picture for Sai Rajeswar

Sai Rajeswar

BigDocs: An Open and Permissively-Licensed Dataset for Training Multimodal Models on Document and Code Tasks

Add code
Dec 05, 2024
Viaarxiv icon

Representing Positional Information in Generative World Models for Object Manipulation

Add code
Sep 19, 2024
Viaarxiv icon

Multimodal foundation world models for generalist embodied agents

Add code
Jun 26, 2024
Viaarxiv icon

RepLiQA: A Question-Answering Dataset for Benchmarking LLMs on Unseen Reference Content

Add code
Jun 17, 2024
Viaarxiv icon

VCR: Visual Caption Restoration

Add code
Jun 10, 2024
Viaarxiv icon

Capture the Flag: Uncovering Data Insights with Large Language Models

Add code
Dec 21, 2023
Viaarxiv icon

Equivariant Adaptation of Large Pre-Trained Models

Add code
Oct 02, 2023
Viaarxiv icon

Efficient Dynamics Modeling in Interactive Environments with Koopman Theory

Add code
Jul 12, 2023
Viaarxiv icon

Choreographer: Learning and Adapting Skills in Imagination

Add code
Nov 23, 2022
Viaarxiv icon

Unsupervised Model-based Pre-training for Data-efficient Control from Pixels

Add code
Sep 24, 2022
Figure 1 for Unsupervised Model-based Pre-training for Data-efficient Control from Pixels
Figure 2 for Unsupervised Model-based Pre-training for Data-efficient Control from Pixels
Figure 3 for Unsupervised Model-based Pre-training for Data-efficient Control from Pixels
Figure 4 for Unsupervised Model-based Pre-training for Data-efficient Control from Pixels
Viaarxiv icon