Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Sameer Jain

Where It Really Matters: Few-Shot Environmental Conservation Media Monitoring for Low-Resource Languages

Feb 19, 2024

Sameer Jain, Sedrick Scott Keh, Shova Chettri, Karun Dewan, Pablo Izquierdo, Johanna Prussman, Pooja Shreshtha, Cesar Suarez, Zheyuan Ryan Shi, Lei Li(+1 more)

Figure 1 for Where It Really Matters: Few-Shot Environmental Conservation Media Monitoring for Low-Resource Languages

Figure 2 for Where It Really Matters: Few-Shot Environmental Conservation Media Monitoring for Low-Resource Languages

Figure 3 for Where It Really Matters: Few-Shot Environmental Conservation Media Monitoring for Low-Resource Languages

Figure 4 for Where It Really Matters: Few-Shot Environmental Conservation Media Monitoring for Low-Resource Languages

Abstract:Environmental conservation organizations routinely monitor news content on conservation in protected areas to maintain situational awareness of developments that can have an environmental impact. Existing automated media monitoring systems require large amounts of data labeled by domain experts, which is only feasible at scale for high-resource languages like English. However, such tools are most needed in the global south where news of interest is mainly in local low-resource languages, and far fewer experts are available to annotate datasets sustainably. In this paper, we propose NewsSerow, a method to automatically recognize environmental conservation content in low-resource languages. NewsSerow is a pipeline of summarization, in-context few-shot classification, and self-reflection using large language models (LLMs). Using at most 10 demonstration example news articles in Nepali, NewsSerow significantly outperforms other few-shot methods and achieves comparable performance with models fully fine-tuned using thousands of examples. The World Wide Fund for Nature (WWF) has deployed NewsSerow for media monitoring in Nepal, significantly reducing their operational burden, and ensuring that AI tools for conservation actually reach the communities that need them the most. NewsSerow has also been deployed for countries with other languages like Colombia.

* AAAI 2024: AI for Social Impact Track

Via

Access Paper or Ask Questions

Multi-Dimensional Evaluation of Text Summarization with In-Context Learning

Jun 01, 2023

Sameer Jain, Vaishakh Keshava, Swarnashree Mysore Sathyendra, Patrick Fernandes, Pengfei Liu, Graham Neubig, Chunting Zhou

Figure 1 for Multi-Dimensional Evaluation of Text Summarization with In-Context Learning

Figure 2 for Multi-Dimensional Evaluation of Text Summarization with In-Context Learning

Figure 3 for Multi-Dimensional Evaluation of Text Summarization with In-Context Learning

Figure 4 for Multi-Dimensional Evaluation of Text Summarization with In-Context Learning

Abstract:Evaluation of natural language generation (NLG) is complex and multi-dimensional. Generated text can be evaluated for fluency, coherence, factuality, or any other dimensions of interest. Most frameworks that perform such multi-dimensional evaluation require training on large manually or synthetically generated datasets. In this paper, we study the efficacy of large language models as multi-dimensional evaluators using in-context learning, obviating the need for large training datasets. Our experiments show that in-context learning-based evaluators are competitive with learned evaluation frameworks for the task of text summarization, establishing state-of-the-art on dimensions such as relevance and factual consistency. We then analyze the effects of factors such as the selection and number of in-context examples on performance. Finally, we study the efficacy of in-context learning based evaluators in evaluating zero-shot summaries written by large language models such as GPT-3.

* ACL Findings '23

Via

Access Paper or Ask Questions