Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Cultural and Linguistic Diversity Improves Visual Representations

Oct 22, 2023

Andre Ye, Sebastin Santy, Jena D. Hwang, Amy X. Zhang, Ranjay Krishna

Figure 1 for Cultural and Linguistic Diversity Improves Visual Representations

Figure 2 for Cultural and Linguistic Diversity Improves Visual Representations

Figure 3 for Cultural and Linguistic Diversity Improves Visual Representations

Figure 4 for Cultural and Linguistic Diversity Improves Visual Representations

Share this with someone who'll enjoy it:

Abstract:Computer vision often treats perception as objective, and this assumption gets reflected in the way that datasets are collected and models are trained. For instance, image descriptions in different languages are typically assumed to be translations of the same semantic content. However, work in cross-cultural psychology and linguistics has shown that individuals differ in their visual perception depending on their cultural background and the language they speak. In this paper, we demonstrate significant differences in semantic content across languages in both dataset and model-produced captions. When data is multilingual as opposed to monolingual, captions have higher semantic coverage on average, as measured by scene graph, embedding, and linguistic complexity. For example, multilingual captions have on average 21.8% more objects, 24.5% more relations, and 27.1% more attributes than a set of monolingual captions. Moreover, models trained on content from different languages perform best against test data from those languages, while those trained on multilingual content perform consistently well across all evaluation data compositions. Our research provides implications for how diverse modes of perception can improve image understanding.

View paper on

Share this with someone who'll enjoy it:

Title:Cultural and Linguistic Diversity Improves Visual Representations

Paper and Code