Picture for Hexiang Hu

Hexiang Hu

OmnixR: Evaluating Omni-modality Language Models on Reasoning across Modalities

Add code
Oct 16, 2024
Viaarxiv icon

KITTEN: A Knowledge-Intensive Evaluation of Image Generation on Visual Entities

Add code
Oct 15, 2024
Figure 1 for KITTEN: A Knowledge-Intensive Evaluation of Image Generation on Visual Entities
Figure 2 for KITTEN: A Knowledge-Intensive Evaluation of Image Generation on Visual Entities
Figure 3 for KITTEN: A Knowledge-Intensive Evaluation of Image Generation on Visual Entities
Figure 4 for KITTEN: A Knowledge-Intensive Evaluation of Image Generation on Visual Entities
Viaarxiv icon

MEGA-Bench: Scaling Multimodal Evaluation to over 500 Real-World Tasks

Add code
Oct 14, 2024
Viaarxiv icon

Imagen 3

Add code
Aug 13, 2024
Viaarxiv icon

Can Long-Context Language Models Subsume Retrieval, RAG, SQL, and More?

Add code
Jun 19, 2024
Figure 1 for Can Long-Context Language Models Subsume Retrieval, RAG, SQL, and More?
Figure 2 for Can Long-Context Language Models Subsume Retrieval, RAG, SQL, and More?
Figure 3 for Can Long-Context Language Models Subsume Retrieval, RAG, SQL, and More?
Figure 4 for Can Long-Context Language Models Subsume Retrieval, RAG, SQL, and More?
Viaarxiv icon

MagicLens: Self-Supervised Image Retrieval with Open-Ended Instructions

Add code
Mar 28, 2024
Figure 1 for MagicLens: Self-Supervised Image Retrieval with Open-Ended Instructions
Figure 2 for MagicLens: Self-Supervised Image Retrieval with Open-Ended Instructions
Figure 3 for MagicLens: Self-Supervised Image Retrieval with Open-Ended Instructions
Figure 4 for MagicLens: Self-Supervised Image Retrieval with Open-Ended Instructions
Viaarxiv icon

Instruct-Imagen: Image Generation with Multi-modal Instruction

Add code
Jan 03, 2024
Viaarxiv icon

Gemini: A Family of Highly Capable Multimodal Models

Add code
Dec 19, 2023
Viaarxiv icon

UniIR: Training and Benchmarking Universal Multimodal Information Retrievers

Add code
Nov 28, 2023
Viaarxiv icon

From Pixels to UI Actions: Learning to Follow Instructions via Graphical User Interfaces

Add code
May 31, 2023
Figure 1 for From Pixels to UI Actions: Learning to Follow Instructions via Graphical User Interfaces
Figure 2 for From Pixels to UI Actions: Learning to Follow Instructions via Graphical User Interfaces
Figure 3 for From Pixels to UI Actions: Learning to Follow Instructions via Graphical User Interfaces
Figure 4 for From Pixels to UI Actions: Learning to Follow Instructions via Graphical User Interfaces
Viaarxiv icon