Amarnath Gupta

Minimally Supervised Hierarchical Domain Intent Learning for CRS

May 04, 2025

Safikureshi Mondal, Subhasis Dasgupta, Amarnath Gupta

Abstract: Modeling domain intent within an evolving domain structure presents a significant challenge for domain-specific conversational recommendation systems (CRS). The conventional approach involves training an intent model using utterance-intent pairs. However, as new intents and patterns emerge, the model must be continuously updated while preserving existing relationships and maintaining efficient retrieval. This process leads to substantial growth in utterance-intent pairs, making manual labeling increasingly costly and impractical. In this paper, we propose an efficient solution for constructing a dynamic hierarchical structure that minimizes the number of user utterances required to achieve adequate domain knowledge coverage. To this end, we introduce a neural network-based attention-driven hierarchical clustering algorithm designed to optimize intent grouping using minimal data. The proposed method builds upon and integrates concepts from two existing flat clustering algorithms, DEC and NAM, both of which utilize neural attention mechanisms. We apply our approach to a curated subset of 44,000 questions from the business food domain. Experimental results demonstrate that constructing the hierarchy using a stratified sampling strategy significantly reduces the number of questions needed to represent the evolving intent structure. Our findings indicate that this approach enables efficient coverage of dynamic domain knowledge without frequent retraining, thereby enhancing scalability and adaptability in domain-specific CRSs.

* This research is funded by the National Institute of Food and Agriculture, U.S. Department of Agriculture (USDA)
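
The stratified sampling idea mentioned in the abstract can be illustrated with a generic sketch. This is not the authors' procedure: the abstract does not say how strata are defined, so the `stratum_of` function, the `fraction` parameter, and the example topic labels below are all hypothetical. The sketch only shows the general technique of drawing a proportional sample from each group so that small groups stay represented.

```python
import random
from collections import defaultdict

def stratified_sample(items, stratum_of, fraction, seed=0):
    """Draw roughly `fraction` of the items from every stratum.

    `stratum_of` maps an item to its stratum key (an assumption for
    illustration; the paper's actual strata are not given in the abstract).
    Each stratum contributes at least one item.
    """
    rng = random.Random(seed)
    strata = defaultdict(list)
    for item in items:
        strata[stratum_of(item)].append(item)

    sample = []
    for key in sorted(strata):
        members = strata[key]
        k = max(1, round(fraction * len(members)))
        sample.extend(rng.sample(members, k))
    return sample

# Hypothetical data: 90 questions about one topic, 10 about another.
questions = [(f"q{i}", "menu") for i in range(90)]
questions += [(f"r{i}", "permits") for i in range(10)]
subset = stratified_sample(questions, stratum_of=lambda q: q[1], fraction=0.1)
```

With a 10% fraction, the rare "permits" group still contributes one question, whereas a uniform random sample of the same size could miss it entirely.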

MISCON: A Mission-Driven Conversational Consultant for Pre-Venture Entrepreneurs in Food Deserts

Jan 24, 2025

Subhasis Dasgupta, Hans Taparia, Laura Schmidt, Amarnath Gupta

Abstract: This work-in-progress report describes MISCON, a conversational consultant being developed for a public mission project called NOURISH. With MISCON, aspiring small business owners in a food-insecure region and their advisors in community-based organizations would be able to get information, recommendations, and analysis regarding setting up food businesses. MISCON conversations are modeled as a state machine that uses a heterogeneous knowledge graph as well as several analytical tools and services, including a variety of LLMs. In this short report, we present the functional architecture and some design considerations behind MISCON.

* 8 pages. Accepted for AAAI 2025 Workshop on AI for Public Missions, March 3rd, 2025

Temporal Relation Extraction with a Graph-Based Deep Biaffine Attention Model

Jan 16, 2022

Bo-Ying Su, Shang-Ling Hsu, Kuan-Yin Lai, Amarnath Gupta

Abstract: Temporal information extraction plays a critical role in natural language understanding. Previous systems have incorporated advanced neural language models and have successfully enhanced the accuracy of temporal information extraction tasks. However, these systems have two major shortcomings. First, they fail to make use of the two-sided nature of temporal relations in prediction. Second, they involve non-parallelizable pipelines in the inference process that bring little performance gain. To this end, we propose a novel temporal information extraction model based on deep biaffine attention to extract temporal relationships between events in unstructured text efficiently and accurately. Our model is performant because we perform relation extraction tasks directly instead of considering event annotation as a prerequisite of relation extraction. Moreover, our architecture uses Multilayer Perceptrons (MLPs) with biaffine attention to predict arcs and relation labels separately, improving relation detection accuracy by exploiting the two-sided nature of temporal relationships. We experimentally demonstrate that our model achieves state-of-the-art performance in temporal relation extraction.
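
As a rough illustration of the biaffine scoring the abstract refers to, the sketch below computes the commonly used form score = hᵀ U d + w · [h; d] + b for a pair of vectors. It is not the paper's implementation: the function names, the tiny hand-picked weights, and the assumption that `head` and `dep` are already MLP-projected event representations are all illustrative. The point it shows is that the bilinear term makes the score direction-sensitive, so (i, j) and (j, i) can receive different scores.

```python
from typing import List

Vector = List[float]
Matrix = List[List[float]]

def biaffine_score(head: Vector, dep: Vector, U: Matrix, w: Vector, b: float) -> float:
    """Return head^T U dep + w . [head; dep] + b for one ordered pair."""
    # Bilinear term: interaction between every head and dep dimension.
    bilinear = sum(
        head[i] * U[i][j] * dep[j]
        for i in range(len(head))
        for j in range(len(dep))
    )
    # Linear term over the concatenation of the two vectors.
    linear = sum(wk * xk for wk, xk in zip(w, head + dep))
    return bilinear + linear + b

def score_all_pairs(events: List[Vector], U: Matrix, w: Vector, b: float) -> Matrix:
    """Score every ordered pair (i, j); each entry is independent of the others."""
    n = len(events)
    return [[biaffine_score(events[i], events[j], U, w, b) for j in range(n)]
            for i in range(n)]

# Hypothetical 2-dimensional event vectors and weights, chosen by hand.
events = [[1.0, 0.0], [0.0, 1.0]]
U = [[0.0, 2.0], [0.0, 0.0]]
w = [0.0, 0.0, 0.0, 0.0]
scores = score_all_pairs(events, U, w, b=0.5)
```

Here `scores[0][1]` and `scores[1][0]` differ because `U` is not symmetric. Since each pair is scored independently, the full matrix can in principle be computed in parallel, in contrast to a sequential pipeline.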

Discovering Technology Gaps using the IntSight Knowledge Navigator

Sep 11, 2021

Aurpon Gupta, Subhasis Dasgupta, Snehasis Sinha, Amarnath Gupta

Abstract: Knowledge analysis is an important application of knowledge graphs. In this paper, we present a complex knowledge analysis problem that discovers the gaps in the technology areas of interest to an organization. Our knowledge graph is developed on a heterogeneous data management platform. The analysis combines semantic search, graph analytics, and polystore query optimization.

News Meets Microblog: Hashtag Annotation via Retriever-Generator

Apr 18, 2021

Xiuwen Zheng, Dheeraj Mekala, Amarnath Gupta, Jingbo Shang

Abstract: Hashtag annotation for microblog posts has recently been formulated as a sequence generation problem to handle emerging hashtags that are unseen in the training set. The state-of-the-art method leverages conversations initiated by posts to enrich contextual information for the short posts. However, it is unrealistic to assume the existence of conversations before the hashtag annotation itself. Therefore, we propose to leverage news articles published before the microblog post to generate hashtags following a Retriever-Generator framework. Extensive experiments on English Twitter datasets demonstrate superior performance and significant advantages of leveraging news articles to generate hashtags.

Ingesting High-Velocity Streaming Graphs from Social Media Sources

May 20, 2019

Subhasis Dasgupta, Aditya Bagchi, Amarnath Gupta

Abstract: Many data science applications like social network analysis use graphs as their primary form of data. However, acquiring graph-structured data from social media presents some interesting challenges. The first challenge is the high data velocity and bursty nature of the social media data. The second challenge is that the complex nature of the data makes the ingestion process expensive. If we want to store the streaming graph data in a graph database, we face a third challenge -- the database is very often unable to sustain the ingestion of high-velocity, high-burst data. We have developed an adaptive buffering mechanism and a graph compression technique that effectively mitigate the problem. A novel aspect of our method is that the adaptive buffering algorithm uses the data rate, the data content, and the CPU resources of the database machine to determine an optimal data ingestion mechanism. We further show that an ingestion-time graph-compression strategy improves the efficiency of the data ingestion into the database. We have verified the efficacy of our ingestion optimization strategy through extensive experiments.
