Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Saeedeh Momtazi

MELAC: Massive Evaluation of Large Language Models with Alignment of Culture in Persian Language

Aug 01, 2025

Farhan Farsi, Farnaz Aghababaloo, Shahriar Shariati Motlagh, Parsa Ghofrani, MohammadAli SadraeiJavaheri, Shayan Bali, Amirhossein Shabani, Farbod Bijary, Ghazal Zamaninejad, AmirMohammad Salehoof(+1 more)

Abstract:As large language models (LLMs) become increasingly embedded in our daily lives, evaluating their quality and reliability across diverse contexts has become essential. While comprehensive benchmarks exist for assessing LLM performance in English, there remains a significant gap in evaluation resources for other languages. Moreover, because most LLMs are trained primarily on data rooted in European and American cultures, they often lack familiarity with non-Western cultural contexts. To address this limitation, our study focuses on the Persian language and Iranian culture. We introduce 19 new evaluation datasets specifically designed to assess LLMs on topics such as Iranian law, Persian grammar, Persian idioms, and university entrance exams. Using these datasets, we benchmarked 41 prominent LLMs, aiming to bridge the existing cultural and linguistic evaluation gap in the field.

* Preprint. Under review

Via

Access Paper or Ask Questions

E2TP: Element to Tuple Prompting Improves Aspect Sentiment Tuple Prediction

May 10, 2024

Mohammad Ghiasvand Mohammadkhani, Niloofar Ranjbar, Saeedeh Momtazi

Figure 1 for E2TP: Element to Tuple Prompting Improves Aspect Sentiment Tuple Prediction

Figure 2 for E2TP: Element to Tuple Prompting Improves Aspect Sentiment Tuple Prediction

Figure 3 for E2TP: Element to Tuple Prompting Improves Aspect Sentiment Tuple Prediction

Figure 4 for E2TP: Element to Tuple Prompting Improves Aspect Sentiment Tuple Prediction

Abstract:Generative approaches have significantly influenced Aspect-Based Sentiment Analysis (ABSA), garnering considerable attention. However, existing studies often predict target text components monolithically, neglecting the benefits of utilizing single elements for tuple prediction. In this paper, we introduce Element to Tuple Prompting (E2TP), employing a two-step architecture. The former step focuses on predicting single elements, while the latter step completes the process by mapping these predicted elements to their corresponding tuples. E2TP is inspired by human problem-solving, breaking down tasks into manageable parts, using the first step's output as a guide in the second step. Within this strategy, three types of paradigms, namely E2TP($diet$), E2TP($f_1$), and E2TP($f_2$), are designed to facilitate the training process. Beyond in-domain task-specific experiments, our paper addresses cross-domain scenarios, demonstrating the effectiveness and generalizability of the approach. By conducting a comprehensive analysis on various benchmarks, we show that E2TP achieves new state-of-the-art results in nearly all cases.

Via

Access Paper or Ask Questions

Multi Sentence Description of Complex Manipulation Action Videos

Nov 13, 2023

Fatemeh Ziaeetabar, Reza Safabakhsh, Saeedeh Momtazi, Minija Tamosiunaite, Florentin Wörgötter

Figure 1 for Multi Sentence Description of Complex Manipulation Action Videos

Figure 2 for Multi Sentence Description of Complex Manipulation Action Videos

Figure 3 for Multi Sentence Description of Complex Manipulation Action Videos

Figure 4 for Multi Sentence Description of Complex Manipulation Action Videos

Abstract:Automatic video description requires the generation of natural language statements about the actions, events, and objects in the video. An important human trait, when we describe a video, is that we are able to do this with variable levels of detail. Different from this, existing approaches for automatic video descriptions are mostly focused on single sentence generation at a fixed level of detail. Instead, here we address video description of manipulation actions where different levels of detail are required for being able to convey information about the hierarchical structure of these actions relevant also for modern approaches of robot learning. We propose one hybrid statistical and one end-to-end framework to address this problem. The hybrid method needs much less data for training, because it models statistically uncertainties within the video clips, while in the end-to-end method, which is more data-heavy, we are directly connecting the visual encoder to the language decoder without any intermediate (statistical) processing step. Both frameworks use LSTM stacks to allow for different levels of description granularity and videos can be described by simple single-sentences or complex multiple-sentence descriptions. In addition, quantitative results demonstrate that these methods produce more realistic descriptions than other competing approaches.

Via

Access Paper or Ask Questions

A Hierarchical Graph-based Approach for Recognition and Description Generation of Bimanual Actions in Videos

Oct 01, 2023

Fatemeh Ziaeetabar, Reza Safabakhsh, Saeedeh Momtazi, Minija Tamosiunaite, Florentin Wörgötter

Figure 1 for A Hierarchical Graph-based Approach for Recognition and Description Generation of Bimanual Actions in Videos

Figure 2 for A Hierarchical Graph-based Approach for Recognition and Description Generation of Bimanual Actions in Videos

Figure 3 for A Hierarchical Graph-based Approach for Recognition and Description Generation of Bimanual Actions in Videos

Figure 4 for A Hierarchical Graph-based Approach for Recognition and Description Generation of Bimanual Actions in Videos

Abstract:Nuanced understanding and the generation of detailed descriptive content for (bimanual) manipulation actions in videos is important for disciplines such as robotics, human-computer interaction, and video content analysis. This study describes a novel method, integrating graph based modeling with layered hierarchical attention mechanisms, resulting in higher precision and better comprehensiveness of video descriptions. To achieve this, we encode, first, the spatio-temporal inter dependencies between objects and actions with scene graphs and we combine this, in a second step, with a novel 3-level architecture creating a hierarchical attention mechanism using Graph Attention Networks (GATs). The 3-level GAT architecture allows recognizing local, but also global contextual elements. This way several descriptions with different semantic complexity can be generated in parallel for the same video clip, enhancing the discriminative accuracy of action recognition and action description. The performance of our approach is empirically tested using several 2D and 3D datasets. By comparing our method to the state of the art we consistently obtain better performance concerning accuracy, precision, and contextual relevance when evaluating action recognition as well as description generation. In a large set of ablation experiments we also assess the role of the different components of our model. With our multi-level approach the system obtains different semantic description depths, often observed in descriptions made by different people, too. Furthermore, better insight into bimanual hand-object interactions as achieved by our model may portend advancements in the field of robotics, enabling the emulation of intricate human actions with heightened precision.

Via

Access Paper or Ask Questions

X-CapsNet For Fake News Detection

Jul 23, 2023

Mohammad Hadi Goldani, Reza Safabakhsh, Saeedeh Momtazi

Figure 1 for X-CapsNet For Fake News Detection

Figure 2 for X-CapsNet For Fake News Detection

Figure 3 for X-CapsNet For Fake News Detection

Figure 4 for X-CapsNet For Fake News Detection

Abstract:News consumption has significantly increased with the growing popularity and use of web-based forums and social media. This sets the stage for misinforming and confusing people. To help reduce the impact of misinformation on users' potential health-related decisions and other intents, it is desired to have machine learning models to detect and combat fake news automatically. This paper proposes a novel transformer-based model using Capsule neural Networks(CapsNet) called X-CapsNet. This model includes a CapsNet with dynamic routing algorithm paralyzed with a size-based classifier for detecting short and long fake news statements. We use two size-based classifiers, a Deep Convolutional Neural Network (DCNN) for detecting long fake news statements and a Multi-Layer Perceptron (MLP) for detecting short news statements. To resolve the problem of representing short news statements, we use indirect features of news created by concatenating the vector of news speaker profiles and a vector of polarity, sentiment, and counting words of news statements. For evaluating the proposed architecture, we use the Covid-19 and the Liar datasets. The results in terms of the F1-score for the Covid-19 dataset and accuracy for the Liar dataset show that models perform better than the state-of-the-art baselines.

Via

Access Paper or Ask Questions

Explaining Recommendation System Using Counterfactual Textual Explanations

Mar 14, 2023

Niloofar Ranjbar, Saeedeh Momtazi, MohammadMehdi Homayounpour

Abstract:Currently, there is a significant amount of research being conducted in the field of artificial intelligence to improve the explainability and interpretability of deep learning models. It is found that if end-users understand the reason for the production of some output, it is easier to trust the system. Recommender systems are one example of systems that great efforts have been conducted to make their output more explainable. One method for producing a more explainable output is using counterfactual reasoning, which involves altering minimal features to generate a counterfactual item that results in changing the output of the system. This process allows the identification of input features that have a significant impact on the desired output, leading to effective explanations. In this paper, we present a method for generating counterfactual explanations for both tabular and textual features. We evaluated the performance of our proposed method on three real-world datasets and demonstrated a +5\% improvement on finding effective features (based on model-based measures) compared to the baseline method.

Via

Access Paper or Ask Questions

A Semantic Modular Framework for Events Topic Modeling in Social Media

Jan 21, 2023

Arya Hadizadeh Moghaddam, Saeedeh Momtazi

Figure 1 for A Semantic Modular Framework for Events Topic Modeling in Social Media

Figure 2 for A Semantic Modular Framework for Events Topic Modeling in Social Media

Figure 3 for A Semantic Modular Framework for Events Topic Modeling in Social Media

Figure 4 for A Semantic Modular Framework for Events Topic Modeling in Social Media

Abstract:The advancement of social media contributes to the growing amount of content they share frequently. This framework provides a sophisticated place for people to report various real-life events. Detecting these events with the help of natural language processing has received researchers' attention, and various algorithms have been developed for this goal. In this paper, we propose a Semantic Modular Model (SMM) consisting of 5 different modules, namely Distributional Denoising Autoencoder, Incremental Clustering, Semantic Denoising, Defragmentation, and Ranking and Processing. The proposed model aims to (1) cluster various documents and ignore the documents that might not contribute to the identification of events, (2) identify more important and descriptive keywords. Compared to the state-of-the-art methods, the results show that the proposed model has a higher performance in identifying events with lower ranks and extracting keywords for more important events in three English Twitter datasets: FACup, SuperTuesday, and USElection. The proposed method outperformed the best reported results in the mean keyword-precision metric by 7.9\%.

* 32 pages, 2 figures

Via

Access Paper or Ask Questions

PQuAD: A Persian Question Answering Dataset

Feb 13, 2022

Kasra Darvishi, Newsha Shahbodagh, Zahra Abbasiantaeb, Saeedeh Momtazi

Figure 1 for PQuAD: A Persian Question Answering Dataset

Figure 2 for PQuAD: A Persian Question Answering Dataset

Figure 3 for PQuAD: A Persian Question Answering Dataset

Figure 4 for PQuAD: A Persian Question Answering Dataset

Abstract:We present Persian Question Answering Dataset (PQuAD), a crowdsourced reading comprehension dataset on Persian Wikipedia articles. It includes 80,000 questions along with their answers, with 25% of the questions being adversarially unanswerable. We examine various properties of the dataset to show the diversity and the level of its difficulty as an MRC benchmark. By releasing this dataset, we aim to ease research on Persian reading comprehension and development of Persian question answering systems. Our experiments on different state-of-the-art pre-trained contextualized language models show 74.8% Exact Match (EM) and 87.6% F1-score that can be used as the baseline results for further research on Persian QA.

Via

Access Paper or Ask Questions

Neural Relation Prediction for Simple Question Answering over Knowledge Graph

Feb 18, 2020

Amin Abolghasemi, Saeedeh Momtazi

Figure 1 for Neural Relation Prediction for Simple Question Answering over Knowledge Graph

Figure 2 for Neural Relation Prediction for Simple Question Answering over Knowledge Graph

Figure 3 for Neural Relation Prediction for Simple Question Answering over Knowledge Graph

Figure 4 for Neural Relation Prediction for Simple Question Answering over Knowledge Graph

Abstract:Relation extraction from simple questions aims to capture the relation of a factoid question with one underlying relation from a set of predefined ones ina knowledge base. Most recent methods take advantage of neural networks for matching a question with all relations in order to find the best relation that is expressed by that question. In this paper, we propose an instance-based method to find similar questions of a new question, in the sense of their relations, to predict its mentioned relation. The motivation roots in the fact that a relation can be expressed with different forms of question and these forms mostly share similar terms or concepts. Our experiments on the SimpleQuestions dataset show that the proposed model achieved better accuracy compared to the state-of-the-art relation extraction models.

Via

Access Paper or Ask Questions

Text-based Question Answering from Information Retrieval and Deep Neural Network Perspectives: A Survey

Feb 16, 2020

Zahra Abbasiyantaeb, Saeedeh Momtazi

Figure 1 for Text-based Question Answering from Information Retrieval and Deep Neural Network Perspectives: A Survey

Figure 2 for Text-based Question Answering from Information Retrieval and Deep Neural Network Perspectives: A Survey

Figure 3 for Text-based Question Answering from Information Retrieval and Deep Neural Network Perspectives: A Survey

Figure 4 for Text-based Question Answering from Information Retrieval and Deep Neural Network Perspectives: A Survey

Abstract:Text-based Question Answering (QA) is a challenging task which aims at finding short concrete answers for users' questions. This line of research has been widely studied with information retrieval techniques and has received increasing attention in recent years by considering deep neural network approaches. Deep learning approaches, which are the main focus of this paper, provide a powerful technique to learn multiple layers of representations and interaction between questions and texts. In this paper, we provide a comprehensive overview of different models proposed for the QA task, including both traditional information retrieval perspective, and more recent deep neural network perspective. We also introduce well-known datasets for the task and present available results from the literature to have a comparison between different techniques.

Via

Access Paper or Ask Questions