Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Hindi as a Second Language: Improving Visually Grounded Speech with Semantically Similar Samples

Mar 30, 2023

Hyeonggon Ryu, Arda Senocak, In So Kweon, Joon Son Chung

Figure 1 for Hindi as a Second Language: Improving Visually Grounded Speech with Semantically Similar Samples

Figure 2 for Hindi as a Second Language: Improving Visually Grounded Speech with Semantically Similar Samples

Figure 3 for Hindi as a Second Language: Improving Visually Grounded Speech with Semantically Similar Samples

Figure 4 for Hindi as a Second Language: Improving Visually Grounded Speech with Semantically Similar Samples

Share this with someone who'll enjoy it:

Abstract:The objective of this work is to explore the learning of visually grounded speech models (VGS) from multilingual perspective. Bilingual VGS models are generally trained with an equal number of spoken captions from both languages. However, in reality, there can be an imbalance among the languages for the available spoken captions. Our key contribution in this work is to leverage the power of a high-resource language in a bilingual visually grounded speech model to improve the performance of a low-resource language. We introduce two methods to distill the knowledge of high-resource language into low-resource languages: (1) incorporating a strong pre-trained high-resource language encoder and (2) using semantically similar spoken captions. Our experiments show that combining these two approaches effectively enables the low-resource language to surpass the performances of monolingual and bilingual counterparts for cross-modal retrieval tasks.

* ICASSP 2023

View paper on

Share this with someone who'll enjoy it:

Title:Hindi as a Second Language: Improving Visually Grounded Speech with Semantically Similar Samples

Paper and Code