Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Arpan Pal

TCS Research, TATA Consultancy Services, Kolkata, India

Concept-based Anomaly Detection in Retail Stores for Automatic Correction using Mobile Robots

Oct 21, 2023

Aditya Kapoor, Vartika Sengar, Nijil George, Vighnesh Vatsal, Jayavardhana Gubbi, Balamuralidhar P, Arpan Pal

Abstract:Tracking of inventory and rearrangement of misplaced items are some of the most labor-intensive tasks in a retail environment. While there have been attempts at using vision-based techniques for these tasks, they mostly use planogram compliance for detection of any anomalies, a technique that has been found lacking in robustness and scalability. Moreover, existing systems rely on human intervention to perform corrective actions after detection. In this paper, we present Co-AD, a Concept-based Anomaly Detection approach using a Vision Transformer (ViT) that is able to flag misplaced objects without using a prior knowledge base such as a planogram. It uses an auto-encoder architecture followed by outlier detection in the latent space. Co-AD has a peak success rate of 89.90% on anomaly detection image sets of retail objects drawn from the RP2K dataset, compared to 80.81% on the best-performing baseline of a standard ViT auto-encoder. To demonstrate its utility, we describe a robotic mobile manipulation pipeline to autonomously correct the anomalies flagged by Co-AD. This work is ultimately aimed towards developing autonomous mobile robot solutions that reduce the need for human intervention in retail store management.

* 8 pages, 9 figures, 2 tables, IEEE Transactions on Systems, Man and Cybernetics

Via

Access Paper or Ask Questions

DOST -- Domain Obedient Self-supervised Training for Multi Label Classification with Noisy Labels

Aug 09, 2023

Soumadeep Saha, Utpal Garain, Arijit Ukil, Arpan Pal, Sundeep Khandelwal

Abstract:The enormous demand for annotated data brought forth by deep learning techniques has been accompanied by the problem of annotation noise. Although this issue has been widely discussed in machine learning literature, it has been relatively unexplored in the context of "multi-label classification" (MLC) tasks which feature more complicated kinds of noise. Additionally, when the domain in question has certain logical constraints, noisy annotations often exacerbate their violations, making such a system unacceptable to an expert. This paper studies the effect of label noise on domain rule violation incidents in the MLC task, and incorporates domain rules into our learning algorithm to mitigate the effect of noise. We propose the Domain Obedient Self-supervised Training (DOST) paradigm which not only makes deep learning models more aligned to domain rules, but also improves learning performance in key metrics and minimizes the effect of annotation noise. This novel approach uses domain guidance to detect offending annotations and deter rule-violating predictions in a self-supervised manner, thus making it more "data efficient" and domain compliant. Empirical studies, performed over two large scale multi-label classification datasets, demonstrate that our method results in improvement across the board, and often entirely counteracts the effect of noise.

* Submitted to IEEE TNNLS on March 7th 2023. 8 pages, 4 figures

Via

Access Paper or Ask Questions

Challenges in Applying Robotics to Retail Store Management

Aug 18, 2022

Vartika Sengar, Aditya Kapoor, Nijil George, Vighnesh Vatsal, Jayavardhana Gubbi, Balamuralidhar P, Arpan Pal

Figure 1 for Challenges in Applying Robotics to Retail Store Management

Abstract:An autonomous retail store management system entails inventory tracking, store monitoring, and anomaly correction. Recent attempts at autonomous retail store management have faced challenges primarily in perception for anomaly detection, as well as new challenges arising in mobile manipulation for executing anomaly correction. Advances in each of these areas along with system integration are necessary for a scalable solution in this domain.

* Presented at the IEEE ICRA 2022 Workshop on Challenges in Applying Academic Research to Real-World Robotics, 23 May 2022

Via

Access Paper or Ask Questions

Unsupervised Driving Behavior Analysis using Representation Learning and Exploiting Group-based Training

May 12, 2022

Soma Bandyopadhyay, Anish Datta, Shruti Sachan, Arpan Pal

Figure 1 for Unsupervised Driving Behavior Analysis using Representation Learning and Exploiting Group-based Training

Figure 2 for Unsupervised Driving Behavior Analysis using Representation Learning and Exploiting Group-based Training

Figure 3 for Unsupervised Driving Behavior Analysis using Representation Learning and Exploiting Group-based Training

Figure 4 for Unsupervised Driving Behavior Analysis using Representation Learning and Exploiting Group-based Training

Abstract:Driving behavior monitoring plays a crucial role in managing road safety and decreasing the risk of traffic accidents. Driving behavior is affected by multiple factors like vehicle characteristics, types of roads, traffic, but, most importantly, the pattern of driving of individuals. Current work performs a robust driving pattern analysis by capturing variations in driving patterns. It forms consistent groups by learning compressed representation of time series (Auto Encoded Compact Sequence) using a multi-layer seq-2-seq autoencoder and exploiting hierarchical clustering along with recommending the choice of best distance measure. Consistent groups aid in identifying variations in driving patterns of individuals captured in the dataset. These groups are generated for both train and hidden test data. The consistent groups formed using train data, are exploited for training multiple instances of the classifier. Obtained choice of best distance measure is used to select the best train-test pair of consistent groups. We have experimented on the publicly available UAH-DriveSet dataset considering the signals captured from IMU sensors (accelerometer and gyroscope) for classifying driving behavior. We observe proposed method, significantly outperforms the benchmark performance.

* 7 figures, 8 pages , 7 tables, accepted and presented conference AAAI 2022 AI for Transportation Workshop (Prefinal version)

Via

Access Paper or Ask Questions

Automated Label Generation for Time Series Classification with Representation Learning: Reduction of Label Cost for Training

Jul 12, 2021

Soma Bandyopadhyay, Anish Datta, Arpan Pal

Figure 1 for Automated Label Generation for Time Series Classification with Representation Learning: Reduction of Label Cost for Training

Figure 2 for Automated Label Generation for Time Series Classification with Representation Learning: Reduction of Label Cost for Training

Figure 3 for Automated Label Generation for Time Series Classification with Representation Learning: Reduction of Label Cost for Training

Figure 4 for Automated Label Generation for Time Series Classification with Representation Learning: Reduction of Label Cost for Training

Abstract:Time-series generated by end-users, edge devices, and different wearables are mostly unlabelled. We propose a method to auto-generate labels of un-labelled time-series, exploiting very few representative labelled time-series. Our method is based on representation learning using Auto Encoded Compact Sequence (AECS) with a choice of best distance measure. It performs self-correction in iterations, by learning latent structure, as well as synthetically boosting representative time-series using Variational-Auto-Encoder (VAE) to improve the quality of labels. We have experimented with UCR and UCI archives, public real-world univariate, multivariate time-series taken from different application domains. Experimental results demonstrate that the proposed method is very close to the performance achieved by fully supervised classification. The proposed method not only produces close to benchmark results but outperforms the benchmark performance in some cases.

* 8 pages, 5 figures, 3 tables accepted in IJCAI2021 Weakly Supervised Representation Learning (WSRL) Workshop ; https://wsl-workshop.github.io/ijcai21.html

Via

Access Paper or Ask Questions

Hierarchical Clustering using Auto-encoded Compact Representation for Time-series Analysis

Jan 11, 2021

Soma Bandyopadhyay, Anish Datta, Arpan Pal

Figure 1 for Hierarchical Clustering using Auto-encoded Compact Representation for Time-series Analysis

Figure 2 for Hierarchical Clustering using Auto-encoded Compact Representation for Time-series Analysis

Figure 3 for Hierarchical Clustering using Auto-encoded Compact Representation for Time-series Analysis

Figure 4 for Hierarchical Clustering using Auto-encoded Compact Representation for Time-series Analysis

Abstract:Getting a robust time-series clustering with best choice of distance measure and appropriate representation is always a challenge. We propose a novel mechanism to identify the clusters combining learned compact representation of time-series, Auto Encoded Compact Sequence (AECS) and hierarchical clustering approach. Proposed algorithm aims to address the large computing time issue of hierarchical clustering as learned latent representation AECS has a length much less than the original length of time-series and at the same time want to enhance its performance.Our algorithm exploits Recurrent Neural Network (RNN) based under complete Sequence to Sequence(seq2seq) autoencoder and agglomerative hierarchical clustering with a choice of best distance measure to recommend the best clustering. Our scheme selects the best distance measure and corresponding clustering for both univariate and multivariate time-series. We have experimented with real-world time-series from UCR and UCI archive taken from diverse application domains like health, smart-city, manufacturing etc. Experimental results show that proposed method not only produce close to benchmark results but also in some cases outperform the benchmark.

* 6 figures, 8 pages , 6 tables, accepted and presented conference IJCAI-PRICAI LDRC Learning Data Representation for Clustering (LDRC) workshop 2020 https://ldrcworkshop.github.io/LDRC2020/

Via

Access Paper or Ask Questions

Enabling human-like task identification from natural conversation

Aug 29, 2020

Pradip Pramanick, Chayan Sarkar, Balamuralidhar P, Ajay Kattepur, Indrajit Bhattacharya, Arpan Pal

Figure 1 for Enabling human-like task identification from natural conversation

Figure 2 for Enabling human-like task identification from natural conversation

Figure 3 for Enabling human-like task identification from natural conversation

Figure 4 for Enabling human-like task identification from natural conversation

Abstract:A robot as a coworker or a cohabitant is becoming mainstream day-by-day with the development of low-cost sophisticated hardware. However, an accompanying software stack that can aid the usability of the robotic hardware remains the bottleneck of the process, especially if the robot is not dedicated to a single job. Programming a multi-purpose robot requires an on the fly mission scheduling capability that involves task identification and plan generation. The problem dimension increases if the robot accepts tasks from a human in natural language. Though recent advances in NLP and planner development can solve a variety of complex problems, their amalgamation for a dynamic robotic task handler is used in a limited scope. Specifically, the problem of formulating a planning problem from natural language instructions is not studied in details. In this work, we provide a non-trivial method to combine an NLP engine and a planner such that a robot can successfully identify tasks and all the relevant parameters and generate an accurate plan for the task. Additionally, some mechanism is required to resolve the ambiguity or missing pieces of information in natural language instruction. Thus, we also develop a dialogue strategy that aims to gather additional information with minimal question-answer iterations and only when it is necessary. This work makes a significant stride towards enabling a human-like task understanding capability in a robot.

* Published in: 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)

Via

Access Paper or Ask Questions

Demo: Edge-centric Telepresence Avatar Robot for Geographically Distributed Environment

Jul 25, 2020

Ashis Sau, Ruddra Dev Roychoudhury, Hrishav Bakul Barua, Chayan Sarkar, Sayan Paul, Brojeshwar Bhowmick, Arpan Pal, Balamuralidhar P

Abstract:Using a robotic platform for telepresence applications has gained paramount importance in this decade. Scenarios such as remote meetings, group discussions, and presentations/talks in seminars and conferences get much attention in this regard. Though there exist some robotic platforms for such telepresence applications, they lack efficacy in communication and interaction between the remote person and the avatar robot deployed in another geographic location. Also, such existing systems are often cloud-centric which adds to its network overhead woes. In this demo, we develop and test a framework that brings the best of both cloud and edge-centric systems together along with a newly designed communication protocol. Our solution adds to the improvement of the existing systems in terms of robustness and efficacy in communication for a geographically distributed environment.

Via

Access Paper or Ask Questions

I can attend a meeting too! Towards a human-like telepresence avatar robot to attend meeting on your behalf

Jun 28, 2020

Hrishav Bakul Barua, Chayan Sarkar, Achanna Anil Kumar, Arpan Pal, Balamuralidhar P

Abstract:Telepresence robots are used in various forms in various use-cases that helps to avoid physical human presence at the scene of action. In this work, we focus on a telepresence robot that can be used to attend a meeting remotely with a group of people. Unlike a one-to-one meeting, participants in a group meeting can be located at a different part of the room, especially in an informal setup. As a result, all of them may not be at the viewing angle of the robot, a.k.a. the remote participant. In such a case, to provide a better meeting experience, the robot should localize the speaker and bring the speaker at the center of the viewing angle. Though sound source localization can easily be done using a microphone-array, bringing the speaker or set of speakers at the viewing angle is not a trivial task. First of all, the robot should react only to a human voice, but not to the random noises. Secondly, if there are multiple speakers, to whom the robot should face or should it rotate continuously with every new speaker? Lastly, most robotic platforms are resource-constrained and to achieve a real-time response, i.e., avoiding network delay, all the algorithms should be implemented within the robot itself. This article presents a study and implementation of an attention shifting scheme in a telepresence meeting scenario which best suits the needs and expectations of the collocated and remote attendees. We define a policy to decide when a robot should rotate and how much based on real-time speaker localization. Using user satisfaction study, we show the efficacy and usability of our system in the meeting scenario. Moreover, our system can be easily adapted to other scenarios where multiple people are located.

Via

Access Paper or Ask Questions

MOMBAT: Heart Rate Monitoring from Face Video using Pulse Modeling and Bayesian Tracking

May 10, 2020

Puneet Gupta, Brojeshwar Bhowmick, Arpan Pal

Figure 1 for MOMBAT: Heart Rate Monitoring from Face Video using Pulse Modeling and Bayesian Tracking

Figure 2 for MOMBAT: Heart Rate Monitoring from Face Video using Pulse Modeling and Bayesian Tracking

Figure 3 for MOMBAT: Heart Rate Monitoring from Face Video using Pulse Modeling and Bayesian Tracking

Figure 4 for MOMBAT: Heart Rate Monitoring from Face Video using Pulse Modeling and Bayesian Tracking

Abstract:A non-invasive yet inexpensive method for heart rate (HR) monitoring is of great importance in many real-world applications including healthcare, psychology understanding, affective computing and biometrics. Face videos are currently utilized for such HR monitoring, but unfortunately this can lead to errors due to the noise introduced by facial expressions, out-of-plane movements, camera parameters (like focus change) and environmental factors. We alleviate these issues by proposing a novel face video based HR monitoring method MOMBAT, that is, MOnitoring using Modeling and BAyesian Tracking. We utilize out-of-plane face movements to define a novel quality estimation mechanism. Subsequently, we introduce a Fourier basis based modeling to reconstruct the cardiovascular pulse signal at the locations containing the poor quality, that is, the locations affected by out-of-plane face movements. Furthermore, we design a Bayesian decision theory based HR tracking mechanism to rectify the spurious HR estimates. Experimental results reveal that our proposed method, MOMBAT outperforms state-of-the-art HR monitoring methods and performs HR monitoring with an average absolute error of 1.329 beats per minute and the Pearson correlation between estimated and actual heart rate is 0.9746. Moreover, it demonstrates that HR monitoring is significantly

* Computers in Biology and Medicine, 2020

Via

Access Paper or Ask Questions