Debarshi Kumar Sanyal

Evaluating LLMs and Pre-trained Models for Text Summarization Across Diverse Datasets

Feb 26, 2025

Tohida Rehman, Soumabha Ghosh, Kuntal Das, Souvik Bhattacharjee, Debarshi Kumar Sanyal, Samiran Chattopadhyay

Abstract:Text summarization plays a crucial role in natural language processing by condensing large volumes of text into concise and coherent summaries. As digital content continues to grow rapidly and the demand for effective information retrieval increases, text summarization has become a focal point of research in recent years. This study offers a thorough evaluation of four leading pre-trained and open-source large language models (BART, FLAN-T5, LLaMA-3-8B, and Gemma-7B) across five diverse datasets: CNN/DM, Gigaword, News Summary, XSum, and BBC News. The evaluation employs widely recognized automatic metrics, including ROUGE-1, ROUGE-2, ROUGE-L, BERTScore, and METEOR, to assess the models' capabilities in generating coherent and informative summaries. The results reveal the comparative strengths and limitations of these models in processing various text types.

* 5 pages, 2 figures, 6 tables
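
To make the ROUGE-1 metric named in the abstract above concrete, here is a minimal sketch of unigram-overlap precision, recall and F1. It assumes lowercased whitespace tokenization with no stemming; the function name `rouge_1` is illustrative, and the tooling the authors actually used is not stated in the listing.

```python
from collections import Counter


def rouge_1(reference: str, candidate: str) -> dict:
    """Unigram-overlap ROUGE-1 (assumption: whitespace tokens, no stemming)."""
    ref_counts = Counter(reference.lower().split())
    cand_counts = Counter(candidate.lower().split())
    # Clipped overlap: each token counts at most as often as it occurs in both texts.
    overlap = sum((ref_counts & cand_counts).values())
    recall = overlap / max(sum(ref_counts.values()), 1)
    precision = overlap / max(sum(cand_counts.values()), 1)
    if overlap == 0:
        f1 = 0.0
    else:
        f1 = 2 * precision * recall / (precision + recall)
    return {"precision": precision, "recall": recall, "f1": f1}


scores = rouge_1("the cat sat on the mat", "the cat lay on the mat")
print(scores)
```

Published evaluations usually rely on an established ROUGE package with stemming and a fixed tokenizer, so scores from this sketch are not directly comparable to reported numbers.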

How Green are Neural Language Models? Analyzing Energy Consumption in Text Summarization Fine-tuning

Jan 26, 2025

Tohida Rehman, Debarshi Kumar Sanyal, Samiran Chattopadhyay

Abstract:Artificial intelligence systems significantly impact the environment, particularly in natural language processing (NLP) tasks. These tasks often require extensive computational resources to train deep neural networks, including large-scale language models containing billions of parameters. This study analyzes the trade-offs between energy consumption and performance across three neural language models: two pre-trained models (T5-base and BART-base), and one large language model (LLaMA 3-8B). These models were fine-tuned for the text summarization task, focusing on generating research paper highlights that encapsulate the core themes of each paper. A wide range of evaluation metrics, including ROUGE, METEOR, MoverScore, BERTScore, and SciBERTScore, were employed to assess their performance. Furthermore, the carbon footprint associated with fine-tuning each model was measured, offering a comprehensive assessment of their environmental impact. This research underscores the importance of incorporating environmental considerations into the design and implementation of neural language models and calls for the advancement of energy-efficient AI methodologies.

Can pre-trained language models generate titles for research papers?

Sep 22, 2024

Tohida Rehman, Debarshi Kumar Sanyal, Samiran Chattopadhyay

Abstract:The title of a research paper communicates in a succinct style the main theme and, sometimes, the findings of the paper. Coming up with the right title is often an arduous task, and therefore, it would be beneficial to authors if title generation can be automated. In this paper, we fine-tune pre-trained and large language models to generate titles of papers from their abstracts. We also use ChatGPT in a zero-shot setting to generate paper titles. The performance of the models is measured with ROUGE, METEOR, MoverScore, BERTScore and SciBERTScore metrics.

Transfer Learning and Transformer Architecture for Financial Sentiment Analysis

Apr 28, 2024

Tohida Rehman, Raghubir Bose, Samiran Chattopadhyay, Debarshi Kumar Sanyal

Abstract:Financial sentiment analysis allows financial institutions such as banks and insurance companies to better manage the credit scoring of their customers. The financial domain uses specialized mechanisms, which makes sentiment analysis difficult. In this paper, we propose a pre-trained language model that can help solve this problem with less labelled data. We build on the principles of transfer learning and the Transformer architecture, and also take into consideration the recent outbreak of pandemics such as COVID. We apply sentiment analysis to two different sets of data. We also take a smaller training set and fine-tune on it as part of the model.

* Proceedings of International Conference on Computational Intelligence, Data Science and Cloud Computing: IEM-ICDC 2021, pages 17-27
* 12 pages, 9 figures

GINopic: Topic Modeling with Graph Isomorphism Network

Apr 02, 2024

Suman Adhya, Debarshi Kumar Sanyal

Abstract:Topic modeling is a widely used approach for analyzing and exploring large document collections. Recent research efforts have incorporated pre-trained contextualized language models, such as BERT embeddings, into topic modeling. However, they often neglect the intrinsic informational value conveyed by mutual dependencies between words. In this study, we introduce GINopic, a topic modeling framework based on graph isomorphism networks to capture the correlation between words. By conducting intrinsic (quantitative as well as qualitative) and extrinsic evaluations on diverse benchmark datasets, we demonstrate the effectiveness of GINopic compared to existing topic models and highlight its potential for advancing topic modeling.

* Accepted as a long paper for NAACL 2024 main conference
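
As background for the graph isomorphism network mentioned in the abstract above, the following is a rough sketch of the standard GIN neighborhood update: each node's feature is scaled by (1 + eps), summed with its neighbors' features, and passed through a learned transform. How GINopic builds its word graph and configures the network is not described in the listing, so the toy graph, the function name `gin_layer`, and the identity default for `mlp` are all assumptions made for illustration.

```python
from typing import Callable, Dict, List

Vector = List[float]


def gin_layer(
    features: List[Vector],
    adjacency: Dict[int, List[int]],
    eps: float = 0.0,
    mlp: Callable[[Vector], Vector] = lambda h: h,  # stand-in for a learned MLP
) -> List[Vector]:
    """One GIN update: h_v' = mlp((1 + eps) * h_v + sum of neighbor features)."""
    updated = []
    for v, h_v in enumerate(features):
        agg = [(1.0 + eps) * x for x in h_v]
        for u in adjacency.get(v, []):
            agg = [a + b for a, b in zip(agg, features[u])]
        updated.append(mlp(agg))
    return updated


# Toy graph with three nodes: node 0 is linked to nodes 1 and 2.
feats = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
adj = {0: [1, 2], 1: [0], 2: [0]}
print(gin_layer(feats, adj))
```

The sum aggregator, rather than a mean or max, is the design choice that distinguishes GIN from many other message-passing layers.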

Hallucination Reduction in Long Input Text Summarization

Sep 28, 2023

Tohida Rehman, Ronit Mandal, Abhishek Agarwal, Debarshi Kumar Sanyal

Abstract:Hallucination in text summarization refers to the phenomenon where the model generates information that is not supported by the input source document. Hallucination poses significant obstacles to the accuracy and reliability of the generated summaries. In this paper, we aim to reduce hallucinated outputs or hallucinations in summaries of long-form text documents. We have used the PubMed dataset, which contains long scientific research documents and their abstracts. We have incorporated the techniques of data filtering and joint entity and summary generation (JAENS) in the fine-tuning of the Longformer Encoder-Decoder (LED) model to minimize hallucinations and thereby improve the quality of the generated summary. We have used the following metrics to measure factual consistency at the entity level: precision-source, and F1-target. Our experiments show that the fine-tuned LED model performs well in generating the paper abstract. Data filtering techniques based on some preprocessing steps reduce entity-level hallucinations in the generated summaries in terms of some of the factual consistency metrics.

* 9 pages, 1 figure, 1 table
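
The entity-level metrics named in the abstract above can be illustrated with a small sketch. It assumes that named entities have already been extracted upstream (the extraction step is not shown), that precision-source is the share of summary entities found in the source text, and that F1-target combines entity precision and recall against the reference summary. The function names and the case-insensitive matching are illustrative choices, not the authors' implementation.

```python
from typing import Iterable


def precision_source(summary_entities: Iterable[str], source_text: str) -> float:
    """Share of summary entities that appear in the source (case-insensitive)."""
    entities = [e.lower() for e in summary_entities]
    if not entities:
        return 0.0  # convention chosen for this sketch; the metric is undefined here
    source = source_text.lower()
    supported = sum(1 for e in entities if e in source)
    return supported / len(entities)


def f1_target(summary_entities: Iterable[str], target_entities: Iterable[str]) -> float:
    """Entity-level F1 between generated-summary and reference-summary entities."""
    pred = {e.lower() for e in summary_entities}
    gold = {e.lower() for e in target_entities}
    overlap = len(pred & gold)
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred)
    recall = overlap / len(gold)
    return 2 * precision * recall / (precision + recall)


src = "A 2019 trial of aspirin in adults."
print(precision_source(["Aspirin", "Harvard", "2019"], src))
print(f1_target(["Aspirin", "Harvard", "2019"], ["aspirin", "2019", "adults"]))
```

An entity that is absent from the source ("Harvard" in this toy case) lowers precision-source, which is how the metric flags a possible hallucination.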

CitePrompt: Using Prompts to Identify Citation Intent in Scientific Papers

May 03, 2023

Avishek Lahiri, Debarshi Kumar Sanyal, Imon Mukherjee

Abstract:Citations in scientific papers not only help us trace the intellectual lineage but also are a useful indicator of the scientific significance of the work. Citation intents prove beneficial as they specify the role of the citation in a given context. In this paper, we present CitePrompt, a framework which uses the hitherto unexplored approach of prompt-based learning for citation intent classification. We argue that with the proper choice of the pretrained language model, the prompt template, and the prompt verbalizer, we can not only get results that are better than or comparable to those obtained with the state-of-the-art methods but also do it with much less exterior information about the scientific document. We report state-of-the-art results on the ACL-ARC dataset, and also show significant improvement on the SciCite dataset over all baseline models except one. As suitably large labelled datasets for citation intent classification can be quite hard to find, in a first, we propose the conversion of this task to the few-shot and zero-shot settings. For the ACL-ARC dataset, we report a 53.86% F1 score for the zero-shot setting, which improves to 63.61% and 66.99% for the 5-shot and 10-shot settings, respectively.

* Selected for publication at ACM/IEEE Joint Conference on Digital Libraries 2023
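
The roles of the prompt template and the prompt verbalizer mentioned in the abstract above can be sketched as follows. The template wraps a citation sentence around a masked slot, and the verbalizer maps the words a language model predicts for that slot onto intent classes. The template text, the label words, the class names, and the hand-written mask scores are all hypothetical; no language model is called, and none of this is CitePrompt's actual configuration.

```python
from typing import Dict, List

# Hypothetical template: the model would be asked to fill in [MASK].
TEMPLATE = "{citation} This citation is used as [MASK]."

# Hypothetical verbalizer: label words that stand for each intent class.
VERBALIZER: Dict[str, List[str]] = {
    "background": ["background", "context"],
    "method": ["method", "tool"],
    "result": ["result", "comparison"],
}


def build_prompt(citation: str) -> str:
    """Wrap a citation sentence in the prompt template."""
    return TEMPLATE.format(citation=citation)


def verbalize(mask_word_scores: Dict[str, float]) -> str:
    """Pick the class whose label words get the highest mean score at [MASK]."""
    class_scores = {}
    for label, words in VERBALIZER.items():
        scores = [mask_word_scores.get(w, 0.0) for w in words]
        class_scores[label] = sum(scores) / len(scores)
    return max(class_scores, key=class_scores.get)


print(build_prompt("We follow the parser of Smith et al."))
# Scores a masked language model might assign to candidate words (made up here).
print(verbalize({"method": 0.6, "tool": 0.2, "background": 0.1}))
```

Because only the template and the label words change between tasks, the same pre-trained model can be reused in few-shot and zero-shot settings without a new classification head.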

What Does the Indian Parliament Discuss? An Exploratory Analysis of the Question Hour in the Lok Sabha

Apr 01, 2023

Suman Adhya, Debarshi Kumar Sanyal

Abstract:The TCPD-IPD dataset is a collection of questions and answers discussed in the Lower House of the Parliament of India during the Question Hour between 1999 and 2019. Although it is difficult to analyze such a huge collection manually, modern text analysis tools can provide a powerful means to navigate it. In this paper, we perform an exploratory analysis of the dataset. In particular, we present insightful corpus-level statistics and a detailed analysis of three subsets of the dataset. In the latter analysis, the focus is on understanding the temporal evolution of topics using a dynamic topic model. We observe that the parliamentary conversation indeed mirrors the political and socio-economic tensions of each period.

* Accepted at the workshop PoliticalNLP co-located with the conference LREC 2022

Do Neural Topic Models Really Need Dropout? Analysis of the Effect of Dropout in Topic Modeling

Mar 28, 2023

Suman Adhya, Avishek Lahiri, Debarshi Kumar Sanyal

Abstract:Dropout is a widely used regularization trick to resolve the overfitting issue in large feedforward neural networks trained on a small dataset, which performs poorly on the held-out test subset. Although the effectiveness of this regularization trick has been extensively studied for convolutional neural networks, there is a lack of analysis of it for unsupervised models and in particular, VAE-based neural topic models. In this paper, we have analyzed the consequences of dropout in the encoder as well as in the decoder of the VAE architecture in three widely used neural topic models, namely, contextualized topic model (CTM), ProdLDA, and embedded topic model (ETM) using four publicly available datasets. We characterize the dropout effect on these models in terms of the quality and predictive performance of the generated topics.

* Accepted at EACL 2023
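
For readers unfamiliar with the regularizer studied in the abstract above, this is a minimal sketch of inverted dropout on a plain list of activations. It is a generic illustration, not the encoder or decoder of CTM, ProdLDA, or ETM; the function name and the optional `rng` argument are assumptions made so the example is easy to seed.

```python
import random
from typing import List, Optional


def dropout(
    values: List[float],
    p: float,
    training: bool = True,
    rng: Optional[random.Random] = None,
) -> List[float]:
    """Inverted dropout: zero each value with probability p, rescale the rest."""
    if not 0.0 <= p < 1.0:
        raise ValueError("p must be in [0, 1)")
    if not training or p == 0.0:
        return list(values)  # dropout is a no-op at evaluation time
    rng = rng or random.Random()
    keep = 1.0 - p
    # Dividing kept values by `keep` preserves the expected activation.
    return [v / keep if rng.random() < keep else 0.0 for v in values]


activations = [1.0, 2.0, 3.0, 4.0]
print(dropout(activations, p=0.5, rng=random.Random(0)))
print(dropout(activations, p=0.5, training=False))
```

Setting `p=0.0` corresponds to switching dropout off, which is the kind of baseline an analysis of the dropout effect would compare against.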

Improving Contextualized Topic Models with Negative Sampling

Mar 27, 2023

Suman Adhya, Avishek Lahiri, Debarshi Kumar Sanyal, Partha Pratim Das

Abstract:Topic modeling has emerged as a dominant method for exploring large document collections. Recent approaches to topic modeling use large contextualized language models and variational autoencoders. In this paper, we propose a negative sampling mechanism for a contextualized topic model to improve the quality of the generated topics. In particular, during model training, we perturb the generated document-topic vector and use a triplet loss to encourage the document reconstructed from the correct document-topic vector to be similar to the input document and dissimilar to the document reconstructed from the perturbed vector. Experiments for different topic counts on three publicly available benchmark datasets show that in most cases, our approach leads to an increase in topic coherence over that of the baselines. Our model also achieves very high topic diversity.

* Accepted at 19th International Conference on Natural Language Processing (ICON 2022)
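
The negative sampling idea in the abstract above can be sketched with a margin-based triplet loss. The perturbation used here (zeroing the top-k topic weights and renormalizing) and the Euclidean distance are assumptions chosen for illustration, since the listing does not specify either. The vectors passed to the loss are made-up stand-ins for the input document and its two reconstructions.

```python
import math
from typing import List


def perturb_top_k(theta: List[float], k: int = 1) -> List[float]:
    """Assumed perturbation: zero the k largest topic weights, then renormalize."""
    top = sorted(range(len(theta)), key=lambda i: theta[i], reverse=True)[:k]
    zeroed = [0.0 if i in top else w for i, w in enumerate(theta)]
    total = sum(zeroed)
    return [w / total for w in zeroed] if total > 0 else zeroed


def euclidean(a: List[float], b: List[float]) -> float:
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def triplet_loss(
    anchor: List[float],
    positive: List[float],
    negative: List[float],
    margin: float = 1.0,
) -> float:
    """max(0, margin + d(anchor, positive) - d(anchor, negative))."""
    return max(0.0, margin + euclidean(anchor, positive) - euclidean(anchor, negative))


theta = [0.7, 0.2, 0.1]
print(perturb_top_k(theta, k=1))

# Anchor: input document; positive: reconstruction from theta;
# negative: reconstruction from the perturbed theta (all values made up).
print(triplet_loss([1.0, 0.0], [0.9, 0.1], [0.0, 1.0], margin=1.0))
```

The loss reaches zero only once the reconstruction from the perturbed vector is at least `margin` farther from the input than the reconstruction from the correct vector.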
