Michael Ryoo

Instance-Aware Generalized Referring Expression Segmentation

Nov 22, 2024
Via arXiv

xGen-VideoSyn-1: High-fidelity Text-to-Video Synthesis with Compressed Representations

Aug 22, 2024
Via arXiv

SARA-RT: Scaling up Robotics Transformers with Self-Adaptive Robust Attention

Dec 04, 2023
Via arXiv

Limited Data, Unlimited Potential: A Study on ViTs Augmented by Masked Autoencoders

Oct 31, 2023
Via arXiv

RT-2: Vision-Language-Action Models Transfer Web Knowledge to Robotic Control

Jul 28, 2023
Via arXiv

Language-based Action Concept Spaces Improve Video Self-Supervised Learning

Jul 20, 2023
Via arXiv

RT-1: Robotics Transformer for Real-World Control at Scale

Dec 13, 2022
Via arXiv

Neural Neural Textures Make Sim2Real Consistent

Jun 27, 2022
Via arXiv

Socratic Models: Composing Zero-Shot Multimodal Reasoning with Language

Apr 01, 2022
Via arXiv

Self-supervised Video Transformer

Dec 02, 2021
Via arXiv