Title: LlamaLens: Specialized Multilingual LLM for Analyzing News and Social Media Content

Oct 20, 2024

Mohamed Bayan Kmainasi, Ali Ezzat Shahroor, Maram Hasanain, Sahinur Rahman Laskar, Naeemul Hassan, Firoj Alam

Abstract: Large Language Models (LLMs) have demonstrated remarkable success as general-purpose task solvers across various fields, including NLP, healthcare, finance, and law. However, their capabilities remain limited when addressing domain-specific problems, particularly in downstream NLP tasks. Research has shown that models fine-tuned on instruction-based downstream NLP datasets outperform those that are not fine-tuned. Most efforts in this area have focused on resource-rich languages such as English and on broad domains, while multilingual settings and specific domains have received little attention. To address this gap, this study develops a specialized LLM, LlamaLens, for analyzing news and social media content in a multilingual context. To the best of our knowledge, this is the first attempt to tackle both domain specificity and multilinguality, with a particular focus on news and social media. Our experimental setup includes 19 tasks, represented by 52 datasets covering Arabic, English, and Hindi. We demonstrate that LlamaLens outperforms the current state-of-the-art (SOTA) on 16 test sets and achieves comparable performance on 10 sets. We make the models and resources publicly available for the research community (https://huggingface.co/QCRI).

* LLMs, Multilingual, Language Diversity, Large Language Models, Social Media, News Media, Specialized LLMs, Fact-checking, Media Analysis
