Picture for Jiahui Yu

Jiahui Yu

Tony

GPT-4o System Card

Add code
Oct 25, 2024
Viaarxiv icon

ArtVLM: Attribute Recognition Through Vision-Based Prefix Language Modeling

Add code
Aug 07, 2024
Figure 1 for ArtVLM: Attribute Recognition Through Vision-Based Prefix Language Modeling
Figure 2 for ArtVLM: Attribute Recognition Through Vision-Based Prefix Language Modeling
Figure 3 for ArtVLM: Attribute Recognition Through Vision-Based Prefix Language Modeling
Figure 4 for ArtVLM: Attribute Recognition Through Vision-Based Prefix Language Modeling
Viaarxiv icon

Parrot: Pareto-optimal Multi-Reward Reinforcement Learning Framework for Text-to-Image Generation

Add code
Jan 11, 2024
Figure 1 for Parrot: Pareto-optimal Multi-Reward Reinforcement Learning Framework for Text-to-Image Generation
Figure 2 for Parrot: Pareto-optimal Multi-Reward Reinforcement Learning Framework for Text-to-Image Generation
Figure 3 for Parrot: Pareto-optimal Multi-Reward Reinforcement Learning Framework for Text-to-Image Generation
Figure 4 for Parrot: Pareto-optimal Multi-Reward Reinforcement Learning Framework for Text-to-Image Generation
Viaarxiv icon

Gemini: A Family of Highly Capable Multimodal Models

Add code
Dec 19, 2023
Viaarxiv icon

Towards an Automatic AI Agent for Reaction Condition Recommendation in Chemical Synthesis

Add code
Nov 28, 2023
Viaarxiv icon

IG Captioner: Information Gain Captioners are Strong Zero-shot Classifiers

Add code
Nov 27, 2023
Figure 1 for IG Captioner: Information Gain Captioners are Strong Zero-shot Classifiers
Figure 2 for IG Captioner: Information Gain Captioners are Strong Zero-shot Classifiers
Figure 3 for IG Captioner: Information Gain Captioners are Strong Zero-shot Classifiers
Figure 4 for IG Captioner: Information Gain Captioners are Strong Zero-shot Classifiers
Viaarxiv icon

De-Diffusion Makes Text a Strong Cross-Modal Interface

Add code
Nov 01, 2023
Viaarxiv icon

Module-wise Adaptive Distillation for Multimodality Foundation Models

Add code
Oct 06, 2023
Viaarxiv icon

AudioPaLM: A Large Language Model That Can Speak and Listen

Add code
Jun 22, 2023
Figure 1 for AudioPaLM: A Large Language Model That Can Speak and Listen
Figure 2 for AudioPaLM: A Large Language Model That Can Speak and Listen
Figure 3 for AudioPaLM: A Large Language Model That Can Speak and Listen
Figure 4 for AudioPaLM: A Large Language Model That Can Speak and Listen
Viaarxiv icon

PaLM 2 Technical Report

Add code
May 17, 2023
Figure 1 for PaLM 2 Technical Report
Figure 2 for PaLM 2 Technical Report
Figure 3 for PaLM 2 Technical Report
Figure 4 for PaLM 2 Technical Report
Viaarxiv icon