Picture for Xindi Wu

Xindi Wu

SWE-bench Multimodal: Do AI Systems Generalize to Visual Software Domains?

Add code
Oct 04, 2024
Figure 1 for SWE-bench Multimodal: Do AI Systems Generalize to Visual Software Domains?
Figure 2 for SWE-bench Multimodal: Do AI Systems Generalize to Visual Software Domains?
Figure 3 for SWE-bench Multimodal: Do AI Systems Generalize to Visual Software Domains?
Figure 4 for SWE-bench Multimodal: Do AI Systems Generalize to Visual Software Domains?
Viaarxiv icon

ConceptMix: A Compositional Image Generation Benchmark with Controllable Difficulty

Add code
Aug 26, 2024
Viaarxiv icon

CharXiv: Charting Gaps in Realistic Chart Understanding in Multimodal LLMs

Add code
Jun 26, 2024
Figure 1 for CharXiv: Charting Gaps in Realistic Chart Understanding in Multimodal LLMs
Figure 2 for CharXiv: Charting Gaps in Realistic Chart Understanding in Multimodal LLMs
Figure 3 for CharXiv: Charting Gaps in Realistic Chart Understanding in Multimodal LLMs
Figure 4 for CharXiv: Charting Gaps in Realistic Chart Understanding in Multimodal LLMs
Viaarxiv icon

Language Models as Science Tutors

Add code
Feb 16, 2024
Viaarxiv icon

Multimodal Dataset Distillation for Image-Text Retrieval

Add code
Aug 15, 2023
Viaarxiv icon

Pix2Map: Cross-modal Retrieval for Inferring Street Maps from Images

Add code
Jan 10, 2023
Viaarxiv icon

Toward Learning Robust and Invariant Representations with Alignment Regularization and Data Augmentation

Add code
Jun 04, 2022
Figure 1 for Toward Learning Robust and Invariant Representations with Alignment Regularization and Data Augmentation
Figure 2 for Toward Learning Robust and Invariant Representations with Alignment Regularization and Data Augmentation
Figure 3 for Toward Learning Robust and Invariant Representations with Alignment Regularization and Data Augmentation
Figure 4 for Toward Learning Robust and Invariant Representations with Alignment Regularization and Data Augmentation
Viaarxiv icon

Ego4D: Around the World in 3,000 Hours of Egocentric Video

Add code
Oct 13, 2021
Figure 1 for Ego4D: Around the World in 3,000 Hours of Egocentric Video
Figure 2 for Ego4D: Around the World in 3,000 Hours of Egocentric Video
Figure 3 for Ego4D: Around the World in 3,000 Hours of Egocentric Video
Figure 4 for Ego4D: Around the World in 3,000 Hours of Egocentric Video
Viaarxiv icon

Squared $\ell_2$ Norm as Consistency Loss for Leveraging Augmented Data to Learn Robust and Invariant Representations

Add code
Nov 25, 2020
Figure 1 for Squared $\ell_2$ Norm as Consistency Loss for Leveraging Augmented Data to Learn Robust and Invariant Representations
Figure 2 for Squared $\ell_2$ Norm as Consistency Loss for Leveraging Augmented Data to Learn Robust and Invariant Representations
Figure 3 for Squared $\ell_2$ Norm as Consistency Loss for Leveraging Augmented Data to Learn Robust and Invariant Representations
Figure 4 for Squared $\ell_2$ Norm as Consistency Loss for Leveraging Augmented Data to Learn Robust and Invariant Representations
Viaarxiv icon

High Frequency Component Helps Explain the Generalization of Convolutional Neural Networks

Add code
May 28, 2019
Figure 1 for High Frequency Component Helps Explain the Generalization of Convolutional Neural Networks
Figure 2 for High Frequency Component Helps Explain the Generalization of Convolutional Neural Networks
Figure 3 for High Frequency Component Helps Explain the Generalization of Convolutional Neural Networks
Figure 4 for High Frequency Component Helps Explain the Generalization of Convolutional Neural Networks
Viaarxiv icon