Picture for Mandy Guo

Mandy Guo

Imagen 3

Add code
Aug 13, 2024
Viaarxiv icon

Greedy Growing Enables High-Resolution Pixel-Based Diffusion Models

Add code
May 27, 2024
Figure 1 for Greedy Growing Enables High-Resolution Pixel-Based Diffusion Models
Figure 2 for Greedy Growing Enables High-Resolution Pixel-Based Diffusion Models
Figure 3 for Greedy Growing Enables High-Resolution Pixel-Based Diffusion Models
Figure 4 for Greedy Growing Enables High-Resolution Pixel-Based Diffusion Models
Viaarxiv icon

Gemini: A Family of Highly Capable Multimodal Models

Add code
Dec 19, 2023
Viaarxiv icon

mLongT5: A Multilingual and Efficient Text-To-Text Transformer for Longer Sequences

Add code
May 18, 2023
Figure 1 for mLongT5: A Multilingual and Efficient Text-To-Text Transformer for Longer Sequences
Figure 2 for mLongT5: A Multilingual and Efficient Text-To-Text Transformer for Longer Sequences
Figure 3 for mLongT5: A Multilingual and Efficient Text-To-Text Transformer for Longer Sequences
Figure 4 for mLongT5: A Multilingual and Efficient Text-To-Text Transformer for Longer Sequences
Viaarxiv icon

WikiWeb2M: A Page-Level Multimodal Wikipedia Dataset

Add code
May 09, 2023
Viaarxiv icon

A Suite of Generative Tasks for Multi-Level Multimodal Webpage Understanding

Add code
May 05, 2023
Viaarxiv icon

CoBIT: A Contrastive Bi-directional Image-Text Generation Model

Add code
Mar 23, 2023
Figure 1 for CoBIT: A Contrastive Bi-directional Image-Text Generation Model
Figure 2 for CoBIT: A Contrastive Bi-directional Image-Text Generation Model
Figure 3 for CoBIT: A Contrastive Bi-directional Image-Text Generation Model
Figure 4 for CoBIT: A Contrastive Bi-directional Image-Text Generation Model
Viaarxiv icon

CoLT5: Faster Long-Range Transformers with Conditional Computation

Add code
Mar 17, 2023
Viaarxiv icon

LongT5: Efficient Text-To-Text Transformer for Long Sequences

Add code
Dec 15, 2021
Figure 1 for LongT5: Efficient Text-To-Text Transformer for Long Sequences
Figure 2 for LongT5: Efficient Text-To-Text Transformer for Long Sequences
Figure 3 for LongT5: Efficient Text-To-Text Transformer for Long Sequences
Figure 4 for LongT5: Efficient Text-To-Text Transformer for Long Sequences
Viaarxiv icon

MURAL: Multimodal, Multitask Retrieval Across Languages

Add code
Sep 10, 2021
Figure 1 for MURAL: Multimodal, Multitask Retrieval Across Languages
Figure 2 for MURAL: Multimodal, Multitask Retrieval Across Languages
Figure 3 for MURAL: Multimodal, Multitask Retrieval Across Languages
Figure 4 for MURAL: Multimodal, Multitask Retrieval Across Languages
Viaarxiv icon