Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Ashish Anand

Indian Institute of Technology Guwahati

AmalREC: A Dataset for Relation Extraction and Classification Leveraging Amalgamation of Large Language Models

Dec 29, 2024

Mansi, Pranshu Pandya, Mahek Bhavesh Vora, Soumya Bharadwaj, Ashish Anand

Figure 1 for AmalREC: A Dataset for Relation Extraction and Classification Leveraging Amalgamation of Large Language Models

Figure 2 for AmalREC: A Dataset for Relation Extraction and Classification Leveraging Amalgamation of Large Language Models

Figure 3 for AmalREC: A Dataset for Relation Extraction and Classification Leveraging Amalgamation of Large Language Models

Figure 4 for AmalREC: A Dataset for Relation Extraction and Classification Leveraging Amalgamation of Large Language Models

Abstract:Existing datasets for relation classification and extraction often exhibit limitations such as restricted relation types and domain-specific biases. This work presents a generic framework to generate well-structured sentences from given tuples with the help of Large Language Models (LLMs). This study has focused on the following major questions: (i) how to generate sentences from relation tuples, (ii) how to compare and rank them, (iii) can we combine strengths of individual methods and amalgamate them to generate an even bette quality of sentences, and (iv) how to evaluate the final dataset? For the first question, we employ a multifaceted 5-stage pipeline approach, leveraging LLMs in conjunction with template-guided generation. We introduce Sentence Evaluation Index(SEI) that prioritizes factors like grammatical correctness, fluency, human-aligned sentiment, accuracy, and complexity to answer the first part of the second question. To answer the second part of the second question, this work introduces a SEI-Ranker module that leverages SEI to select top candidate generations. The top sentences are then strategically amalgamated to produce the final, high-quality sentence. Finally, we evaluate our dataset on LLM-based and SOTA baselines for relation classification. The proposed dataset features 255 relation types, with 15K sentences in the test set and around 150k in the train set organized in, significantly enhancing relational diversity and complexity. This work not only presents a new comprehensive benchmark dataset for RE/RC task, but also compare different LLMs for generation of quality sentences from relational tuples.

* 18 pages, 5 Figures

Via

Access Paper or Ask Questions

Enhancing Event Extraction from Short Stories through Contextualized Prompts

Dec 14, 2024

Chaitanya Kirti, Ayon Chattopadhyay, Ashish Anand, Prithwijit Guha

Figure 1 for Enhancing Event Extraction from Short Stories through Contextualized Prompts

Figure 2 for Enhancing Event Extraction from Short Stories through Contextualized Prompts

Figure 3 for Enhancing Event Extraction from Short Stories through Contextualized Prompts

Figure 4 for Enhancing Event Extraction from Short Stories through Contextualized Prompts

Abstract:Event extraction is an important natural language processing (NLP) task of identifying events in an unstructured text. Although a plethora of works deal with event extraction from new articles, clinical text etc., only a few works focus on event extraction from literary content. Detecting events in short stories presents several challenges to current systems, encompassing a different distribution of events as compared to other domains and the portrayal of diverse emotional conditions. This paper presents \texttt{Vrittanta-EN}, a collection of 1000 English short stories annotated for real events. Exploring this field could result in the creation of techniques and resources that support literary scholars in improving their effectiveness. This could simultaneously influence the field of Natural Language Processing. Our objective is to clarify the intricate idea of events in the context of short stories. Towards the objective, we collected 1,000 short stories written mostly for children in the Indian context. Further, we present fresh guidelines for annotating event mentions and their categories, organized into \textit{seven distinct classes}. The classes are {\tt{COGNITIVE-MENTAL-STATE(CMS), COMMUNICATION(COM), CONFLICT(CON), GENERAL-ACTIVITY(GA), LIFE-EVENT(LE), MOVEMENT(MOV), and OTHERS(OTH)}}. Subsequently, we apply these guidelines to annotate the short story dataset. Later, we apply the baseline methods for automatically detecting and categorizing events. We also propose a prompt-based method for event detection and classification. The proposed method outperforms the baselines, while having significant improvement of more than 4\% for the class \texttt{CONFLICT} in event classification task.

* 47 pages, 8 figures, Planning to submit in Elsevier (Computer Speech and Language Journal)

Via

Access Paper or Ask Questions

End-to-End Argument Mining as Augmented Natural Language Generation

Jun 12, 2024

Nilmadhab Das, Vishal Choudhary, V. Vijaya Saradhi, Ashish Anand

Abstract:Argument Mining (AM) is a crucial aspect of computational argumentation, which deals with the identification and extraction of Argumentative Components (ACs) and their corresponding Argumentative Relations (ARs). Most prior works have solved these problems by dividing them into multiple subtasks. And the available end-to-end setups are mostly based on the dependency parsing approach. This work proposes a unified end-to-end framework based on a generative paradigm, in which the argumentative structures are framed into label-augmented text, called Augmented Natural Language (ANL). Additionally, we explore the role of different types of markers in solving AM tasks. Through different marker-based fine-tuning strategies, we present an extensive study by integrating marker knowledge into our generative model. The proposed framework achieves competitive results to the state-of-the-art (SoTA) model and outperforms several baselines.

Via

Access Paper or Ask Questions

Noise in Relation Classification Dataset TACRED: Characterization and Reduction

Nov 21, 2023

Akshay Parekh, Ashish Anand, Amit Awekar

Abstract:The overarching objective of this paper is two-fold. First, to explore model-based approaches to characterize the primary cause of the noise. in the RE dataset TACRED Second, to identify the potentially noisy instances. Towards the first objective, we analyze predictions and performance of state-of-the-art (SOTA) models to identify the root cause of noise in the dataset. Our analysis of TACRED shows that the majority of the noise in the dataset originates from the instances labeled as no-relation which are negative examples. For the second objective, we explore two nearest-neighbor-based strategies to automatically identify potentially noisy examples for elimination and reannotation. Our first strategy, referred to as Intrinsic Strategy (IS), is based on the assumption that positive examples are clean. Thus, we have used false-negative predictions to identify noisy negative examples. Whereas, our second approach, referred to as Extrinsic Strategy, is based on using a clean subset of the dataset to identify potentially noisy negative examples. Finally, we retrained the SOTA models on the eliminated and reannotated dataset. Our empirical results based on two SOTA models trained on TACRED-E following the IS show an average 4% F1-score improvement, whereas reannotation (TACRED-R) does not improve the original results. However, following ES, SOTA models show the average F1-score improvement of 3.8% and 4.4% when trained on respective eliminated (TACRED-EN) and reannotated (TACRED-RN) datasets respectively. We further extended the ES for cleaning positive examples as well, which resulted in an average performance improvement of 5.8% and 5.6% for the eliminated (TACRED-ENP) and reannotated (TACRED-RNP) datasets respectively.

* Work in Progress

Via

Access Paper or Ask Questions

VQA with Cascade of Self- and Co-Attention Blocks

Feb 28, 2023

Aakansha Mishra, Ashish Anand, Prithwijit Guha

Abstract:The use of complex attention modules has improved the performance of the Visual Question Answering (VQA) task. This work aims to learn an improved multi-modal representation through dense interaction of visual and textual modalities. The proposed model has an attention block containing both self-attention and co-attention on image and text. The self-attention modules provide the contextual information of objects (for an image) and words (for a question) that are crucial for inferring an answer. On the other hand, co-attention aids the interaction of image and text. Further, fine-grained information is obtained from two modalities by using a Cascade of Self- and Co-Attention blocks (CSCA). This proposal is benchmarked on the widely used VQA2.0 and TDIUC datasets. The efficacy of key components of the model and cascading of attention modules are demonstrated by experiments involving ablation analysis.

Via

Access Paper or Ask Questions

Budget Sensitive Reannotation of Noisy Relation Classification Data Using Label Hierarchy

Dec 26, 2021

Akshay Parekh, Ashish Anand, Amit Awekar

Figure 1 for Budget Sensitive Reannotation of Noisy Relation Classification Data Using Label Hierarchy

Figure 2 for Budget Sensitive Reannotation of Noisy Relation Classification Data Using Label Hierarchy

Figure 3 for Budget Sensitive Reannotation of Noisy Relation Classification Data Using Label Hierarchy

Figure 4 for Budget Sensitive Reannotation of Noisy Relation Classification Data Using Label Hierarchy

Abstract:Large crowd-sourced datasets are often noisy and relation classification (RC) datasets are no exception. Reannotating the entire dataset is one probable solution however it is not always viable due to time and budget constraints. This paper addresses the problem of efficient reannotation of a large noisy dataset for the RC. Our goal is to catch more annotation errors in the dataset while reannotating fewer instances. Existing work on RC dataset reannotation lacks the flexibility about how much data to reannotate. We introduce the concept of a reannotation budget to overcome this limitation. The immediate follow-up problem is: Given a specific reannotation budget, which subset of the data should we reannotate? To address this problem, we present two strategies to selectively reannotate RC datasets. Our strategies utilize the taxonomic hierarchy of relation labels. The intuition of our work is to rely on the graph distance between actual and predicted relation labels in the label hierarchy graph. We evaluate our reannotation strategies on the well-known TACRED dataset. We design our experiments to answer three specific research questions. First, does our strategy select novel candidates for reannotation? Second, for a given reannotation budget is our reannotation strategy more efficient at catching annotation errors? Third, what is the impact of data reannotation on RC model performance measurement? Experimental results show that our both reannotation strategies are novel and efficient. Our analysis indicates that the current reported performance of RC models on noisy TACRED data is inflated.

Via

Access Paper or Ask Questions

CL-NERIL: A Cross-Lingual Model for NER in Indian Languages

Nov 23, 2021

Akshara Prabhakar, Gouri Sankar Majumder, Ashish Anand

Figure 1 for CL-NERIL: A Cross-Lingual Model for NER in Indian Languages

Abstract:Developing Named Entity Recognition (NER) systems for Indian languages has been a long-standing challenge, mainly owing to the requirement of a large amount of annotated clean training instances. This paper proposes an end-to-end framework for NER for Indian languages in a low-resource setting by exploiting parallel corpora of English and Indian languages and an English NER dataset. The proposed framework includes an annotation projection method that combines word alignment score and NER tag prediction confidence score on source language (English) data to generate weakly labeled data in a target Indian language. We employ a variant of the Teacher-Student model and optimize it jointly on the pseudo labels of the Teacher model and predictions on the generated weakly labeled data. We also present manually annotated test sets for three Indian languages: Hindi, Bengali, and Gujarati. We evaluate the performance of the proposed framework on the test sets of the three Indian languages. Empirical results show a minimum 10% performance improvement compared to the zero-shot transfer learning model on all languages. This indicates that weakly labeled data generated using the proposed annotation projection method in target Indian languages can complement well-annotated source language data to enhance performance. Our code is publicly available at https://github.com/aksh555/CL-NERIL

* Accepted in AAAI 2022 Student Abstract

Via

Access Paper or Ask Questions

IQ-VQA: Intelligent Visual Question Answering

Jul 08, 2020

Vatsal Goel, Mohit Chandak, Ashish Anand, Prithwijit Guha

Figure 1 for IQ-VQA: Intelligent Visual Question Answering

Figure 2 for IQ-VQA: Intelligent Visual Question Answering

Figure 3 for IQ-VQA: Intelligent Visual Question Answering

Figure 4 for IQ-VQA: Intelligent Visual Question Answering

Abstract:Even though there has been tremendous progress in the field of Visual Question Answering, models today still tend to be inconsistent and brittle. To this end, we propose a model-independent cyclic framework which increases consistency and robustness of any VQA architecture. We train our models to answer the original question, generate an implication based on the answer and then also learn to answer the generated implication correctly. As a part of the cyclic framework, we propose a novel implication generator which can generate implied questions from any question-answer pair. As a baseline for future works on consistency, we provide a new human annotated VQA-Implications dataset. The dataset consists of ~30k questions containing implications of 3 types - Logical Equivalence, Necessary Condition and Mutual Exclusion - made from the VQA v2.0 validation dataset. We show that our framework improves consistency of VQA models by ~15% on the rule-based dataset, ~7% on VQA-Implications dataset and robustness by ~2%, without degrading their performance. In addition, we also quantitatively show improvement in attention maps which highlights better multi-modal understanding of vision and language.

Via

Access Paper or Ask Questions

CQ-VQA: Visual Question Answering on Categorized Questions

Feb 17, 2020

Aakansha Mishra, Ashish Anand, Prithwijit Guha

Figure 1 for CQ-VQA: Visual Question Answering on Categorized Questions

Figure 2 for CQ-VQA: Visual Question Answering on Categorized Questions

Figure 3 for CQ-VQA: Visual Question Answering on Categorized Questions

Figure 4 for CQ-VQA: Visual Question Answering on Categorized Questions

Abstract:This paper proposes CQ-VQA, a novel 2-level hierarchical but end-to-end model to solve the task of visual question answering (VQA). The first level of CQ-VQA, referred to as question categorizer (QC), classifies questions to reduce the potential answer search space. The QC uses attended and fused features of the input question and image. The second level, referred to as answer predictor (AP), comprises of a set of distinct classifiers corresponding to each question category. Depending on the question category predicted by QC, only one of the classifiers of AP remains active. The loss functions of QC and AP are aggregated together to make it an end-to-end model. The proposed model (CQ-VQA) is evaluated on the TDIUC dataset and is benchmarked against state-of-the-art approaches. Results indicate competitive or better performance of CQ-VQA.

Via

Access Paper or Ask Questions

Taxonomical hierarchy of canonicalized relations from multiple Knowledge Bases

Sep 17, 2019

Akshay Parekh, Ashish Anand, Amit Awekar

Figure 1 for Taxonomical hierarchy of canonicalized relations from multiple Knowledge Bases

Figure 2 for Taxonomical hierarchy of canonicalized relations from multiple Knowledge Bases

Figure 3 for Taxonomical hierarchy of canonicalized relations from multiple Knowledge Bases

Figure 4 for Taxonomical hierarchy of canonicalized relations from multiple Knowledge Bases

Abstract:This work addresses two important questions pertinent to Relation Extraction (RE). First, what are all possible relations that could exist between any two given entity types? Second, how do we define an unambiguous taxonomical (is-a) hierarchy among the identified relations? To address the first question, we use three resources Wikipedia Infobox, Wikidata, and DBpedia. This study focuses on relations between person, organization and location entity types. We exploit Wikidata and DBpedia in a data-driven manner, and Wikipedia Infobox templates manually to generate lists of relations. Further, to address the second question, we canonicalize, filter, and combine the identified relations from the three resources to construct a taxonomical hierarchy. This hierarchy contains 623 canonical relations with highest contribution from Wikipedia Infobox followed by DBpedia and Wikidata. The generated relation list subsumes an average of 85% of relations from RE datasets when entity types are restricted.

Via

Access Paper or Ask Questions