Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Parag Jain

Structsum Generation for Faster Text Comprehension

Jan 12, 2024

Parag Jain, Andreea Marzoca, Francesco Piccinno

Figure 1 for Structsum Generation for Faster Text Comprehension

Figure 2 for Structsum Generation for Faster Text Comprehension

Figure 3 for Structsum Generation for Faster Text Comprehension

Figure 4 for Structsum Generation for Faster Text Comprehension

Abstract:We consider the task of generating structured representations of text using large language models (LLMs). We focus on tables and mind maps as representative modalities. Tables are more organized way of representing data, while mind maps provide a visually dynamic and flexible approach, particularly suitable for sparse content. Despite the effectiveness of LLMs on different tasks, we show that current models struggle with generating structured outputs. In response, we present effective prompting strategies for both of these tasks. We introduce a taxonomy of problems around factuality, global and local structure, common to both modalities and propose a set of critiques to tackle these issues resulting in an absolute improvement in accuracy of +37pp (79%) for mind maps and +15pp (78%) for tables. To evaluate semantic coverage of generated structured representations we propose Auto-QA, and we verify the adequacy of Auto-QA using SQuAD dataset. We further evaluate the usefulness of structured representations via a text comprehension user study. The results show a significant reduction in comprehension time compared to text when using table (42.9%) and mind map (31.9%), without loss in accuracy.

Via

Access Paper or Ask Questions

Conversational Semantic Parsing using Dynamic Context Graphs

May 04, 2023

Parag Jain, Mirella Lapata

Abstract:In this paper we consider the task of conversational semantic parsing over general purpose knowledge graphs (KGs) with millions of entities, and thousands of relation-types. We are interested in developing models capable of interactively mapping user utterances into executable logical forms (e.g., SPARQL) in the context of the conversational history. Our key idea is to represent information about an utterance and its context via a subgraph which is created dynamically, i.e., the number of nodes varies per utterance. Moreover, rather than treating the subgraph as a sequence we exploit its underlying structure, and thus encode it using a graph neural network which further allows us to represent a large number of (unseen) nodes. Experimental results show that modeling context dynamically is superior to static approaches, delivering performance improvements across the board (i.e., for simple and complex questions). Our results further confirm that modeling the structure of context is better at processing discourse information, (i.e., at handling ellipsis and resolving coreference) and longer interactions.

Via

Access Paper or Ask Questions

Semantic Parsing for Conversational Question Answering over Knowledge Graphs

Jan 28, 2023

Laura Perez-Beltrachini, Parag Jain, Emilio Monti, Mirella Lapata

Abstract:In this paper, we are interested in developing semantic parsers which understand natural language questions embedded in a conversation with a user and ground them to formal queries over definitions in a general purpose knowledge graph (KG) with very large vocabularies (covering thousands of concept names and relations, and millions of entities). To this end, we develop a dataset where user questions are annotated with Sparql parses and system answers correspond to execution results thereof. We present two different semantic parsing approaches and highlight the challenges of the task: dealing with large vocabularies, modelling conversation context, predicting queries with multiple entities, and generalising to new questions at test time. We hope our dataset will serve as useful testbed for the development of conversational semantic parsers. Our dataset and models are released at https://github.com/EdinburghNLP/SPICE.

* EACL 2023

Via

Access Paper or Ask Questions

Unified Semantic Parsing with Weak Supervision

Jun 12, 2019

Priyanka Agrawal, Parag Jain, Ayushi Dalmia, Abhishek Bansal, Ashish Mittal, Karthik Sankaranarayanan

Figure 1 for Unified Semantic Parsing with Weak Supervision

Figure 2 for Unified Semantic Parsing with Weak Supervision

Figure 3 for Unified Semantic Parsing with Weak Supervision

Figure 4 for Unified Semantic Parsing with Weak Supervision

Abstract:Semantic parsing over multiple knowledge bases enables a parser to exploit structural similarities of programs across the multiple domains. However, the fundamental challenge lies in obtaining high-quality annotations of (utterance, program) pairs across various domains needed for training such models. To overcome this, we propose a novel framework to build a unified multi-domain enabled semantic parser trained only with weak supervision (denotations). Weakly supervised training is particularly arduous as the program search space grows exponentially in a multi-domain setting. To solve this, we incorporate a multi-policy distillation mechanism in which we first train domain-specific semantic parsers (teachers) using weak supervision in the absence of the ground truth programs, followed by training a single unified parser (student) from the domain specific policies obtained from these teachers. The resultant semantic parser is not only compact but also generalizes better, and generates more accurate programs. It further does not require the user to provide a domain label while querying. On the standard Overnight dataset (containing multiple domains), we demonstrate that the proposed model improves performance by 20% in terms of denotation accuracy in comparison to baseline techniques.

* Association for Computational Linguistics (ACL) 2019

Via

Access Paper or Ask Questions

Unsupervised Neural Text Simplification

Oct 18, 2018

Sai Surya, Abhijit Mishra, Anirban Laha, Parag Jain, Karthik Sankaranarayanan

Figure 1 for Unsupervised Neural Text Simplification

Figure 2 for Unsupervised Neural Text Simplification

Figure 3 for Unsupervised Neural Text Simplification

Figure 4 for Unsupervised Neural Text Simplification

Abstract:The paper presents a first attempt towards unsupervised neural text simplification that relies only on unlabeled text corpora. The core framework is comprised of a shared encoder and a pair of attentional-decoders that gains knowledge of both text simplification and complexification through discriminator-based-losses, back-translation and denoising. The framework is trained using unlabeled text collected from en-Wikipedia dump. Our analysis (both quantitative and qualitative involving human evaluators) on a public test data shows the efficacy of our model to perform simplification at both lexical and syntactic levels, competitive to existing supervised methods. We open source our implementation for academic use.

* 8 pages, 3 figures

Via

Access Paper or Ask Questions

Unsupervised Controllable Text Formalization

Sep 10, 2018

Parag Jain, Abhijit Mishra, Amar Prakash Azad, Karthik Sankaranarayanan

Figure 1 for Unsupervised Controllable Text Formalization

Figure 2 for Unsupervised Controllable Text Formalization

Figure 3 for Unsupervised Controllable Text Formalization

Figure 4 for Unsupervised Controllable Text Formalization

Abstract:We propose a novel framework for controllable natural language transformation. Realizing that the requirement of parallel corpus is practically unsustainable for controllable generation tasks, an unsupervised training scheme is introduced. The crux of the framework is a deep neural encoder-decoder that is reinforced with text-transformation knowledge through auxiliary modules (called scorers). The scorers, based on off-the-shelf language processing tools, decide the learning scheme of the encoder-decoder based on its actions. We apply this framework for the text-transformation task of formalizing an input text by improving its readability grade; the degree of required formalization can be controlled by the user at run-time. Experiments on public datasets demonstrate the efficacy of our model towards: (a) transforming a given text to a more formal style, and (b) introducing appropriate amount of formalness in the output text pertaining to the input control. Our code and datasets are released for academic use.

Via

Access Paper or Ask Questions

A Mixed Hierarchical Attention based Encoder-Decoder Approach for Standard Table Summarization

Apr 20, 2018

Parag Jain, Anirban Laha, Karthik Sankaranarayanan, Preksha Nema, Mitesh M. Khapra, Shreyas Shetty

Figure 1 for A Mixed Hierarchical Attention based Encoder-Decoder Approach for Standard Table Summarization

Figure 2 for A Mixed Hierarchical Attention based Encoder-Decoder Approach for Standard Table Summarization

Figure 3 for A Mixed Hierarchical Attention based Encoder-Decoder Approach for Standard Table Summarization

Figure 4 for A Mixed Hierarchical Attention based Encoder-Decoder Approach for Standard Table Summarization

Abstract:Structured data summarization involves generation of natural language summaries from structured input data. In this work, we consider summarizing structured data occurring in the form of tables as they are prevalent across a wide variety of domains. We formulate the standard table summarization problem, which deals with tables conforming to a single predefined schema. To this end, we propose a mixed hierarchical attention based encoder-decoder model which is able to leverage the structure in addition to the content of the tables. Our experiments on the publicly available WEATHERGOV dataset show around 18 BLEU (~ 30%) improvement over the current state-of-the-art.

* Accepted in NAACL-HLT 2018 (Short paper)

Via

Access Paper or Ask Questions

Generating Descriptions from Structured Data Using a Bifocal Attention Mechanism and Gated Orthogonalization

Apr 20, 2018

Preksha Nema, Shreyas Shetty, Parag Jain, Anirban Laha, Karthik Sankaranarayanan, Mitesh M. Khapra

Figure 1 for Generating Descriptions from Structured Data Using a Bifocal Attention Mechanism and Gated Orthogonalization

Figure 2 for Generating Descriptions from Structured Data Using a Bifocal Attention Mechanism and Gated Orthogonalization

Figure 3 for Generating Descriptions from Structured Data Using a Bifocal Attention Mechanism and Gated Orthogonalization

Figure 4 for Generating Descriptions from Structured Data Using a Bifocal Attention Mechanism and Gated Orthogonalization

Abstract:In this work, we focus on the task of generating natural language descriptions from a structured table of facts containing fields (such as nationality, occupation, etc) and values (such as Indian, actor, director, etc). One simple choice is to treat the table as a sequence of fields and values and then use a standard seq2seq model for this task. However, such a model is too generic and does not exploit task-specific characteristics. For example, while generating descriptions from a table, a human would attend to information at two levels: (i) the fields (macro level) and (ii) the values within the field (micro level). Further, a human would continue attending to a field for a few timesteps till all the information from that field has been rendered and then never return back to this field (because there is nothing left to say about it). To capture this behavior we use (i) a fused bifocal attention mechanism which exploits and combines this micro and macro level information and (ii) a gated orthogonalization mechanism which tries to ensure that a field is remembered for a few time steps and then forgotten. We experiment with a recently released dataset which contains fact tables about people and their corresponding one line biographical descriptions in English. In addition, we also introduce two similar datasets for French and German. Our experiments show that the proposed model gives 21% relative improvement over a recently proposed state of the art method and 10% relative improvement over basic seq2seq models. The code and the datasets developed as a part of this work are publicly available.

* Accepted in NAACL-HLT 2018

Via

Access Paper or Ask Questions

Story Generation from Sequence of Independent Short Descriptions

Aug 21, 2017

Parag Jain, Priyanka Agrawal, Abhijit Mishra, Mohak Sukhwani, Anirban Laha, Karthik Sankaranarayanan

Figure 1 for Story Generation from Sequence of Independent Short Descriptions

Figure 2 for Story Generation from Sequence of Independent Short Descriptions

Figure 3 for Story Generation from Sequence of Independent Short Descriptions

Figure 4 for Story Generation from Sequence of Independent Short Descriptions

Abstract:Existing Natural Language Generation (NLG) systems are weak AI systems and exhibit limited capabilities when language generation tasks demand higher levels of creativity, originality and brevity. Effective solutions or, at least evaluations of modern NLG paradigms for such creative tasks have been elusive, unfortunately. This paper introduces and addresses the task of coherent story generation from independent descriptions, describing a scene or an event. Towards this, we explore along two popular text-generation paradigms -- (1) Statistical Machine Translation (SMT), posing story generation as a translation problem and (2) Deep Learning, posing story generation as a sequence to sequence learning problem. In SMT, we chose two popular methods such as phrase based SMT (PB-SMT) and syntax based SMT (SYNTAX-SMT) to `translate' the incoherent input text into stories. We then implement a deep recurrent neural network (RNN) architecture that encodes sequence of variable length input descriptions to corresponding latent representations and decodes them to produce well formed comprehensive story like summaries. The efficacy of the suggested approaches is demonstrated on a publicly available dataset with the help of popular machine translation and summarization evaluation metrics.

* Accepted in SIGKDD Workshop on Machine Learning for Creativity (ML4Creativity), 2017

Via

Access Paper or Ask Questions

Topic Modeling Using Distributed Word Embeddings

Mar 15, 2016

Ramandeep S Randhawa, Parag Jain, Gagan Madan

Figure 1 for Topic Modeling Using Distributed Word Embeddings

Figure 2 for Topic Modeling Using Distributed Word Embeddings

Figure 3 for Topic Modeling Using Distributed Word Embeddings

Figure 4 for Topic Modeling Using Distributed Word Embeddings

Abstract:We propose a new algorithm for topic modeling, Vec2Topic, that identifies the main topics in a corpus using semantic information captured via high-dimensional distributed word embeddings. Our technique is unsupervised and generates a list of topics ranked with respect to importance. We find that it works better than existing topic modeling techniques such as Latent Dirichlet Allocation for identifying key topics in user-generated content, such as emails, chats, etc., where topics are diffused across the corpus. We also find that Vec2Topic works equally well for non-user generated content, such as papers, reports, etc., and for small corpora such as a single-document.

Via

Access Paper or Ask Questions