Picture for Rowan Zellers

Rowan Zellers

Tony

GPT-4o System Card

Add code
Oct 25, 2024
Viaarxiv icon

MAUVE Scores for Generative Models: Theory and Practice

Add code
Dec 30, 2022
Viaarxiv icon

Do Androids Laugh at Electric Sheep? Humor "Understanding" Benchmarks from The New Yorker Caption Contest

Add code
Sep 13, 2022
Figure 1 for Do Androids Laugh at Electric Sheep? Humor "Understanding" Benchmarks from The New Yorker Caption Contest
Figure 2 for Do Androids Laugh at Electric Sheep? Humor "Understanding" Benchmarks from The New Yorker Caption Contest
Figure 3 for Do Androids Laugh at Electric Sheep? Humor "Understanding" Benchmarks from The New Yorker Caption Contest
Figure 4 for Do Androids Laugh at Electric Sheep? Humor "Understanding" Benchmarks from The New Yorker Caption Contest
Viaarxiv icon

Unified-IO: A Unified Model for Vision, Language, and Multi-Modal Tasks

Add code
Jun 17, 2022
Figure 1 for Unified-IO: A Unified Model for Vision, Language, and Multi-Modal Tasks
Figure 2 for Unified-IO: A Unified Model for Vision, Language, and Multi-Modal Tasks
Figure 3 for Unified-IO: A Unified Model for Vision, Language, and Multi-Modal Tasks
Figure 4 for Unified-IO: A Unified Model for Vision, Language, and Multi-Modal Tasks
Viaarxiv icon

Multimodal Knowledge Alignment with Reinforcement Learning

Add code
May 25, 2022
Figure 1 for Multimodal Knowledge Alignment with Reinforcement Learning
Figure 2 for Multimodal Knowledge Alignment with Reinforcement Learning
Figure 3 for Multimodal Knowledge Alignment with Reinforcement Learning
Figure 4 for Multimodal Knowledge Alignment with Reinforcement Learning
Viaarxiv icon

The Abduction of Sherlock Holmes: A Dataset for Visual Abductive Reasoning

Add code
Feb 10, 2022
Figure 1 for The Abduction of Sherlock Holmes: A Dataset for Visual Abductive Reasoning
Figure 2 for The Abduction of Sherlock Holmes: A Dataset for Visual Abductive Reasoning
Figure 3 for The Abduction of Sherlock Holmes: A Dataset for Visual Abductive Reasoning
Figure 4 for The Abduction of Sherlock Holmes: A Dataset for Visual Abductive Reasoning
Viaarxiv icon

MERLOT Reserve: Neural Script Knowledge through Vision and Language and Sound

Add code
Jan 07, 2022
Figure 1 for MERLOT Reserve: Neural Script Knowledge through Vision and Language and Sound
Figure 2 for MERLOT Reserve: Neural Script Knowledge through Vision and Language and Sound
Figure 3 for MERLOT Reserve: Neural Script Knowledge through Vision and Language and Sound
Figure 4 for MERLOT Reserve: Neural Script Knowledge through Vision and Language and Sound
Viaarxiv icon

Connecting the Dots between Audio and Text without Parallel Data through Visual Knowledge Transfer

Add code
Dec 16, 2021
Figure 1 for Connecting the Dots between Audio and Text without Parallel Data through Visual Knowledge Transfer
Figure 2 for Connecting the Dots between Audio and Text without Parallel Data through Visual Knowledge Transfer
Figure 3 for Connecting the Dots between Audio and Text without Parallel Data through Visual Knowledge Transfer
Figure 4 for Connecting the Dots between Audio and Text without Parallel Data through Visual Knowledge Transfer
Viaarxiv icon

NeuroLogic A*esque Decoding: Constrained Text Generation with Lookahead Heuristics

Add code
Dec 16, 2021
Figure 1 for NeuroLogic A*esque Decoding: Constrained Text Generation with Lookahead Heuristics
Figure 2 for NeuroLogic A*esque Decoding: Constrained Text Generation with Lookahead Heuristics
Figure 3 for NeuroLogic A*esque Decoding: Constrained Text Generation with Lookahead Heuristics
Figure 4 for NeuroLogic A*esque Decoding: Constrained Text Generation with Lookahead Heuristics
Viaarxiv icon

MERLOT: Multimodal Neural Script Knowledge Models

Add code
Jun 10, 2021
Figure 1 for MERLOT: Multimodal Neural Script Knowledge Models
Figure 2 for MERLOT: Multimodal Neural Script Knowledge Models
Figure 3 for MERLOT: Multimodal Neural Script Knowledge Models
Figure 4 for MERLOT: Multimodal Neural Script Knowledge Models
Viaarxiv icon