Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Shyma Alhuwaider

Revisiting Test Time Adaptation under Online Evaluation

Apr 10, 2023

Motasem Alfarra, Hani Itani, Alejandro Pardo, Shyma Alhuwaider, Merey Ramazanova, Juan C. Pérez, Zhipeng Cai, Matthias Müller, Bernard Ghanem

Figure 1 for Revisiting Test Time Adaptation under Online Evaluation

Figure 2 for Revisiting Test Time Adaptation under Online Evaluation

Figure 3 for Revisiting Test Time Adaptation under Online Evaluation

Figure 4 for Revisiting Test Time Adaptation under Online Evaluation

Abstract:This paper proposes a novel online evaluation protocol for Test Time Adaptation (TTA) methods, which penalizes slower methods by providing them with fewer samples for adaptation. TTA methods leverage unlabeled data at test time to adapt to distribution shifts. Though many effective methods have been proposed, their impressive performance usually comes at the cost of significantly increased computation budgets. Current evaluation protocols overlook the effect of this extra computation cost, affecting their real-world applicability. To address this issue, we propose a more realistic evaluation protocol for TTA methods, where data is received in an online fashion from a constant-speed data stream, thereby accounting for the method's adaptation speed. We apply our proposed protocol to benchmark several TTA methods on multiple datasets and scenarios. Extensive experiments shows that, when accounting for inference speed, simple and fast approaches can outperform more sophisticated but slower methods. For example, SHOT from 2020 outperforms the state-of-the-art method SAR from 2023 under our online setting. Our online evaluation protocol emphasizes the need for developing TTA methods that are efficient and applicable in realistic settings.

* 14 pages, 8 figures, 7 tables

Via

Access Paper or Ask Questions

ArtELingo: A Million Emotion Annotations of WikiArt with Emphasis on Diversity over Language and Culture

Nov 19, 2022

Youssef Mohamed, Mohamed Abdelfattah, Shyma Alhuwaider, Feifan Li, Xiangliang Zhang, Kenneth Ward Church, Mohamed Elhoseiny

Figure 1 for ArtELingo: A Million Emotion Annotations of WikiArt with Emphasis on Diversity over Language and Culture

Figure 2 for ArtELingo: A Million Emotion Annotations of WikiArt with Emphasis on Diversity over Language and Culture

Figure 3 for ArtELingo: A Million Emotion Annotations of WikiArt with Emphasis on Diversity over Language and Culture

Figure 4 for ArtELingo: A Million Emotion Annotations of WikiArt with Emphasis on Diversity over Language and Culture

Abstract:This paper introduces ArtELingo, a new benchmark and dataset, designed to encourage work on diversity across languages and cultures. Following ArtEmis, a collection of 80k artworks from WikiArt with 0.45M emotion labels and English-only captions, ArtELingo adds another 0.79M annotations in Arabic and Chinese, plus 4.8K in Spanish to evaluate "cultural-transfer" performance. More than 51K artworks have 5 annotations or more in 3 languages. This diversity makes it possible to study similarities and differences across languages and cultures. Further, we investigate captioning tasks, and find diversity improves the performance of baseline models. ArtELingo is publicly available at https://www.artelingo.org/ with standard splits and baseline models. We hope our work will help ease future research on multilinguality and culturally-aware AI.

* 9 pages, Accepted at EMNLP 22, for more details see https://www.artelingo.org/

Via

Access Paper or Ask Questions