Timothy Ossowski

COMMA: A Communicative Multimodal Multi-Agent Benchmark

Oct 10, 2024
Via arXiv

OLIVE: Object Level In-Context Visual Embeddings

Jun 02, 2024
Via arXiv

How does Multi-Task Training Affect Transformer In-Context Capabilities? Investigations with Function Classes

Apr 04, 2024
Via arXiv

Prompting Large Vision-Language Models for Compositional Reasoning

Jan 20, 2024
Via arXiv

Multimodal Prompt Retrieval for Generative Visual Question Answering

Jun 30, 2023
Via arXiv

Utilizing Language-Image Pretraining for Efficient and Robust Bilingual Word Alignment

May 23, 2022
Via arXiv