Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Ranga Raju Vatsavai

CoMAC: Conversational Agent for Multi-Source Auxiliary Context with Sparse and Symmetric Latent Interactions

Mar 25, 2025

Junfeng Liu, Christopher T. Symons, Ranga Raju Vatsavai

Abstract:Recent advancements in AI-driven conversational agents have exhibited immense potential of AI applications. Effective response generation is crucial to the success of these agents. While extensive research has focused on leveraging multiple auxiliary data sources (e.g., knowledge bases and personas) to enhance response generation, existing methods often struggle to efficiently extract relevant information from these sources. There are still clear limitations in the ability to combine versatile conversational capabilities with adherence to known facts and adaptation to large variations in user preferences and belief systems, which continues to hinder the wide adoption of conversational AI tools. This paper introduces a novel method, Conversational Agent for Multi-Source Auxiliary Context with Sparse and Symmetric Latent Interactions (CoMAC), for conversation generation, which employs specialized encoding streams and post-fusion grounding networks for multiple data sources to identify relevant persona and knowledge information for the conversation. CoMAC also leverages a novel text similarity metric that allows bi-directional information sharing among multiple sources and focuses on a selective subset of meaningful words. Our experiments show that CoMAC improves the relevant persona and knowledge prediction accuracies and response generation quality significantly over two state-of-the-art methods.

* The 29th Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD2025)

Via

Access Paper or Ask Questions

Context Retrieval via Normalized Contextual Latent Interaction for Conversational Agent

Dec 01, 2023

Junfeng Liu, Zhuocheng Mei, Kewen Peng, Ranga Raju Vatsavai

Abstract:Conversational agents leveraging AI, particularly deep learning, are emerging in both academic research and real-world applications. However, these applications still face challenges, including disrespecting knowledge and facts, not personalizing to user preferences, and enormous demand for computational resources during training and inference. Recent research efforts have been focused on addressing these challenges from various aspects, including supplementing various types of auxiliary information to the conversational agents. However, existing methods are still not able to effectively and efficiently exploit relevant information from these auxiliary supplements to further unleash the power of the conversational agents and the language models they use. In this paper, we present a novel method, PK-NCLI, that is able to accurately and efficiently identify relevant auxiliary information to improve the quality of conversational responses by learning the relevance among persona, chat history, and knowledge background through low-level normalized contextual latent interaction. Our experimental results indicate that PK-NCLI outperforms the state-of-the-art method, PK-FoCus, by 47.80%/30.61%/24.14% in terms of perplexity, knowledge grounding, and training efficiency, respectively, and maintained the same level of persona grounding performance. We also provide a detailed analysis of how different factors, including language model choices and trade-offs on training weights, would affect the performance of PK-NCLI.

* 2023 IEEE International Conference on Data Mining Workshops (ICDMW)

Via

Access Paper or Ask Questions

Persona-Coded Poly-Encoder: Persona-Guided Multi-Stream Conversational Sentence Scoring

Sep 28, 2023

Junfeng Liu, Christopher Symons, Ranga Raju Vatsavai

Abstract:Recent advances in machine learning and deep learning have led to the widespread use of Conversational AI in many practical applications. However, it is still very challenging to leverage auxiliary information that can provide conversational context or personalized tuning to improve the quality of conversations. For example, there has only been limited research on using an individuals persona information to improve conversation quality, and even state-of-the-art conversational AI techniques are unable to effectively leverage signals from heterogeneous sources of auxiliary data, such as multi-modal interaction data, demographics, SDOH data, etc. In this paper, we present a novel Persona-Coded Poly-Encoder method that leverages persona information in a multi-stream encoding scheme to improve the quality of response generation for conversations. To show the efficacy of the proposed method, we evaluate our method on two different persona-based conversational datasets, and compared against two state-of-the-art methods. Our experimental results and analysis demonstrate that our method can improve conversation quality over the baseline method Poly-Encoder by 3.32% and 2.94% in terms of BLEU score and HR@1, respectively. More significantly, our method offers a path to better utilization of multi-modal data in conversational tasks. Lastly, our study outlines several challenges and future research directions for advancing personalized conversational AI technology.

* The 35th IEEE International Conference on Tools with Artificial Intelligence (ICTAI)

Via

Access Paper or Ask Questions

Persona-Based Conversational AI: State of the Art and Challenges

Dec 04, 2022

Junfeng Liu, Christopher Symons, Ranga Raju Vatsavai

Abstract:Conversational AI has become an increasingly prominent and practical application of machine learning. However, existing conversational AI techniques still suffer from various limitations. One such limitation is a lack of well-developed methods for incorporating auxiliary information that could help a model understand conversational context better. In this paper, we explore how persona-based information could help improve the quality of response generation in conversations. First, we provide a literature review focusing on the current state-of-the-art methods that utilize persona information. We evaluate two strong baseline methods, the Ranking Profile Memory Network and the Poly-Encoder, on the NeurIPS ConvAI2 benchmark dataset. Our analysis elucidates the importance of incorporating persona information into conversational systems. Additionally, our study highlights several limitations with current state-of-the-art methods and outlines challenges and future research directions for advancing personalized conversational AI technology.

* 2022 International Conference on Data Mining Workshops (ICDMW)

Via

Access Paper or Ask Questions

On the Unreasonable Efficiency of State Space Clustering in Personalization Tasks

Dec 24, 2021

Anton Dereventsov, Ranga Raju Vatsavai, Clayton Webster

Figure 1 for On the Unreasonable Efficiency of State Space Clustering in Personalization Tasks

Figure 2 for On the Unreasonable Efficiency of State Space Clustering in Personalization Tasks

Figure 3 for On the Unreasonable Efficiency of State Space Clustering in Personalization Tasks

Figure 4 for On the Unreasonable Efficiency of State Space Clustering in Personalization Tasks

Abstract:In this effort we consider a reinforcement learning (RL) technique for solving personalization tasks with complex reward signals. In particular, our approach is based on state space clustering with the use of a simplistic $k$-means algorithm as well as conventional choices of the network architectures and optimization algorithms. Numerical examples demonstrate the efficiency of different RL procedures and are used to illustrate that this technique accelerates the agent's ability to learn and does not restrict the agent's performance.

Via

Access Paper or Ask Questions

Consistency Regularization with Generative Adversarial Networks for Semi-Supervised Image Classification

Jul 08, 2020

Zexi Chen, Bharathkumar Ramachandra, Ranga Raju Vatsavai

Figure 1 for Consistency Regularization with Generative Adversarial Networks for Semi-Supervised Image Classification

Figure 2 for Consistency Regularization with Generative Adversarial Networks for Semi-Supervised Image Classification

Figure 3 for Consistency Regularization with Generative Adversarial Networks for Semi-Supervised Image Classification

Figure 4 for Consistency Regularization with Generative Adversarial Networks for Semi-Supervised Image Classification

Abstract:Generative Adversarial Networks (GANs) based semi-supervised learning (SSL) approaches are shown to improve classification performance by utilizing a large number of unlabeled samples in conjunction with limited labeled samples. However, their performance still lags behind the state-of-the-art non-GAN based SSL approaches. One main reason we identify is the lack of consistency in class probability predictions on the same image under local perturbations. This problem was addressed in the past in a generic setting using the label consistency regularization, which enforces the class probability predictions for an input image to be unchanged under various semantic-preserving perturbations. In this work, we incorporate the consistency regularization in the vanilla semi-GAN to address this critical limitation. In particular, we present a new composite consistency regularization method which, in spirit, combines two well-known consistency-based techniques -- Mean Teacher and Interpolation Consistency Training. We demonstrate the efficacy of our approach on two SSL image classification benchmark datasets, SVHN and CIFAR-10. Our experiments show that this new composite consistency regularization based semi-GAN significantly improves its performance and achieves new state-of-the-art performance among GAN-based SSL approaches.

* 10 pages, 5 figures

Via

Access Paper or Ask Questions

Local Clustering with Mean Teacher for Semi-supervised Learning

Apr 20, 2020

Zexi Chen, Benjamin Dutton, Bharathkumar Ramachandra, Tianfu Wu, Ranga Raju Vatsavai

Figure 1 for Local Clustering with Mean Teacher for Semi-supervised Learning

Figure 2 for Local Clustering with Mean Teacher for Semi-supervised Learning

Figure 3 for Local Clustering with Mean Teacher for Semi-supervised Learning

Figure 4 for Local Clustering with Mean Teacher for Semi-supervised Learning

Abstract:The Mean Teacher (MT) model of Tarvainen and Valpola has shown favorable performance on several semi-supervised benchmark datasets. MT maintains a teacher model's weights as the exponential moving average of a student model's weights and minimizes the divergence between their probability predictions under diverse perturbations of the inputs. However, MT is known to suffer from confirmation bias, that is, reinforcing incorrect teacher model predictions. In this work, we propose a simple yet effective method called Local Clustering (LC) to mitigate the effect of confirmation bias. In MT, each data point is considered independent of other points during training; however, data points are likely to be close to each other in feature space if they share similar features. Motivated by this, we cluster data points locally by minimizing the pairwise distance between neighboring data points in feature space. Combined with a standard classification cross-entropy objective on labeled data points, the misclassified unlabeled data points are pulled towards high-density regions of their correct class with the help of their neighbors, thus improving model performance. We demonstrate on semi-supervised benchmark datasets SVHN and CIFAR-10 that adding our LC loss to MT yields significant improvements compared to MT and performance comparable to the state of the art in semi-supervised learning.

* 8 pages, 7 figures

Via

Access Paper or Ask Questions

A Survey of Single-Scene Video Anomaly Detection

Apr 13, 2020

Bharathkumar Ramachandra, Michael J. Jones, Ranga Raju Vatsavai

Figure 1 for A Survey of Single-Scene Video Anomaly Detection

Figure 2 for A Survey of Single-Scene Video Anomaly Detection

Figure 3 for A Survey of Single-Scene Video Anomaly Detection

Figure 4 for A Survey of Single-Scene Video Anomaly Detection

Abstract:This survey article summarizes research trends on the topic of anomaly detection in video feeds of a single scene. We discuss the various problem formulations, publicly available datasets and evaluation criteria. We categorize and situate past research into an intuitive taxonomy. Finally, we also provide best practices and suggest some possible directions for future research.

Via

Access Paper or Ask Questions

Learning a distance function with a Siamese network to localize anomalies in videos

Jan 24, 2020

Bharathkumar Ramachandra, Michael J. Jones, Ranga Raju Vatsavai

Figure 1 for Learning a distance function with a Siamese network to localize anomalies in videos

Figure 2 for Learning a distance function with a Siamese network to localize anomalies in videos

Figure 3 for Learning a distance function with a Siamese network to localize anomalies in videos

Figure 4 for Learning a distance function with a Siamese network to localize anomalies in videos

Abstract:This work introduces a new approach to localize anomalies in surveillance video. The main novelty is the idea of using a Siamese convolutional neural network (CNN) to learn a distance function between a pair of video patches (spatio-temporal regions of video). The learned distance function, which is not specific to the target video, is used to measure the distance between each video patch in the testing video and the video patches found in normal training video. If a testing video patch is not similar to any normal video patch then it must be anomalous. We compare our approach to previously published algorithms using 4 evaluation measures and 3 challenging target benchmark datasets. Experiments show that our approach either surpasses or performs comparably to current state-of-the-art methods.

* accepted to WACV 2020

Via

Access Paper or Ask Questions

Estimating a Manifold from a Tangent Bundle Learner

Jun 18, 2019

Bharathkumar Ramachandra, Benjamin Dutton, Ranga Raju Vatsavai

Figure 1 for Estimating a Manifold from a Tangent Bundle Learner

Figure 2 for Estimating a Manifold from a Tangent Bundle Learner

Figure 3 for Estimating a Manifold from a Tangent Bundle Learner

Figure 4 for Estimating a Manifold from a Tangent Bundle Learner

Abstract:Manifold hypotheses are typically used for tasks such as dimensionality reduction, interpolation, or improving classification performance. In the less common problem of manifold estimation, the task is to characterize the geometric structure of the manifold in the original ambient space from a sample. We focus on the role that tangent bundle learners (TBL) can play in estimating the underlying manifold from which data is assumed to be sampled. Since the unbounded tangent spaces natively represent a poor manifold estimate, the problem reduces to one of estimating regions in the tangent space where it acts as a relatively faithful linear approximator to the surface of the manifold. Local PCA methods, such as the Mixtures of Probabilistic Principal Component Analyzers method of Tipping and Bishop produce a subset of the tangent bundle of the manifold along with an assignment function that assigns points in the training data used by the TBL to elements of the estimated tangent bundle. We formulate three methods that use the data assigned to each tangent space to estimate the underlying bounded subspaces for which the tangent space is a faithful estimate of the manifold and offer thoughts on how this perspective is theoretically grounded in the manifold assumption. We seek to explore the conceptual and technical challenges that arise in trying to utilize simple TBL methods to arrive at reliable estimates of the underlying manifold.

Via

Access Paper or Ask Questions