Abstract: The dark side of the spread of digital commerce is the rise in fraud attempts. To counter these attacks, state-of-the-art fraud detection systems now embed Machine Learning (ML) modules. The design of such modules is communicated only at the research level, and papers mostly focus on results for isolated benchmark datasets and metrics. But research is only one part of the journey: it is preceded by the proper formulation of the business problem and the collection of data, and followed by practical integration. In this paper, we give a wider view of the process, through a case study of transfer learning for fraud detection, from business to research, and back to business.
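As a companion to this abstract, here is a minimal sketch of one simple transfer-learning strategy for fraud detection: a model trained on a data-rich source domain provides its risk score as an extra feature for a model on a label-scarce target domain. The data, domains, and model choices are illustrative assumptions, not the system described in the paper.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(0)

def make_domain(n, shift):
    """Synthetic imbalanced transactions with a domain-specific shift."""
    X = rng.normal(loc=shift, size=(n, 6))
    y = (X[:, 0] + rng.normal(size=n) > 2.5).astype(int)  # ~5% frauds
    return X, y

X_src, y_src = make_domain(20_000, shift=0.0)  # data-rich source domain
X_tgt, y_tgt = make_domain(500, shift=0.3)     # label-scarce target domain

# 1. Fit a model on the source domain.
source_model = LogisticRegression(max_iter=1000).fit(X_src, y_src)

# 2. Transfer: append the source model's risk score to the target features.
def augment(X):
    return np.hstack([X, source_model.predict_proba(X)[:, 1:]])

target_model = LogisticRegression(max_iter=1000).fit(augment(X_tgt), y_tgt)
print("target-domain training accuracy:",
      target_model.score(augment(X_tgt), y_tgt))
```

This "score as a feature" scheme is only one of many transfer strategies; it keeps the target model cheap to retrain while reusing what the source model has learned.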
Abstract: Credit card fraud detection is a very challenging problem because of the specific nature of transaction data and of the labeling process. Transaction data are peculiar: they arrive in a streaming fashion, they are strongly imbalanced, and they are prone to non-stationarity. The labels are the outcome of an active learning process, since every day human investigators contact only a small number of cardholders (those associated with the riskiest transactions) and obtain the class (fraud or genuine) of the related transactions. An adequate selection of this set of cardholders is therefore crucial for an efficient fraud detection process. In this paper, we present a number of active learning strategies and investigate their fraud detection accuracy. We compare different criteria (supervised, semi-supervised, and unsupervised) for querying unlabeled transactions. Finally, we highlight the existence of an exploitation/exploration trade-off for active learning in the context of fraud detection, which has so far been overlooked in the literature.
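To make the exploitation/exploration trade-off concrete, here is a minimal sketch of a budgeted query strategy on synthetic imbalanced transactions: most of the daily investigator budget goes to the riskiest transactions (exploitation), while a fraction is queried at random (exploration). The model, budget, and mixing ratio are illustrative assumptions, not the strategies evaluated in the paper.

```python
import numpy as np
from sklearn.ensemble import RandomForestClassifier

rng = np.random.default_rng(0)

# Synthetic, strongly imbalanced transaction data (~1% frauds).
X = rng.normal(size=(10_000, 8))
y = (X[:, 0] > 2.3).astype(int)

# Small initial labeled pool containing both classes; the rest are unlabeled.
fraud_idx = np.flatnonzero(y == 1)
genuine_idx = np.flatnonzero(y == 0)
labeled = np.concatenate([fraud_idx[:10],
                          rng.choice(genuine_idx, 190, replace=False)])
unlabeled = np.setdiff1d(np.arange(len(X)), labeled)

model = RandomForestClassifier(n_estimators=100, random_state=0)
model.fit(X[labeled], y[labeled])

budget = 100         # daily number of cardholders investigators can contact
explore_ratio = 0.2  # fraction of the budget spent on random exploration

# Exploitation: query the transactions the model deems riskiest.
scores = model.predict_proba(X[unlabeled])[:, 1]
n_exploit = int(budget * (1 - explore_ratio))
exploit_idx = unlabeled[np.argsort(scores)[-n_exploit:]]

# Exploration: query uniformly at random among the remaining transactions.
remaining = np.setdiff1d(unlabeled, exploit_idx)
explore_idx = rng.choice(remaining, size=budget - n_exploit, replace=False)

queried = np.concatenate([exploit_idx, explore_idx])
print(f"Queried {len(queried)} transactions, "
      f"{y[queried].sum()} frauds found in this batch")
```

Pure exploitation maximizes frauds caught today but biases the labeled set toward what the current model already knows; the random slice keeps the training data representative of the full transaction stream.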
Abstract: This paper describes a distributed MapReduce implementation of the minimum Redundancy Maximum Relevance (mRMR) algorithm, a popular feature selection method in bioinformatics and network inference problems. The proposed approach handles both tall/narrow and wide/short datasets. We further provide an open-source implementation based on Hadoop/Spark and illustrate its scalability on datasets involving millions of observations or features.
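For readers unfamiliar with mRMR, the sketch below shows the greedy selection loop on Spark: at each step, pick the feature with the highest relevance to the label minus its mean redundancy with the already selected features. As an assumption for brevity, it uses absolute Pearson correlation (via DataFrame.stat.corr) as a stand-in for the mutual-information estimates used in the paper, and all column names and sizes are illustrative.

```python
import random
from pyspark.sql import SparkSession

spark = SparkSession.builder.appName("mrmr-sketch").getOrCreate()

# Tiny synthetic tall/narrow dataset: rows = observations, columns = features + label.
random.seed(0)
rows = [tuple(random.gauss(0, 1) for _ in range(5)) + (random.randint(0, 1),)
        for _ in range(1000)]
features = [f"f{i}" for i in range(5)]
df = spark.createDataFrame(rows, features + ["label"]).cache()

def assoc(a, b):
    """Absolute correlation as a cheap proxy for mutual information."""
    return abs(df.stat.corr(a, b))

# Relevance of each feature to the label, computed once up front.
relevance = {f: assoc(f, "label") for f in features}

selected, candidates, k = [], set(features), 3
while len(selected) < k:
    def mrmr_score(f):
        # Maximum relevance minus mean redundancy with selected features.
        red = sum(assoc(f, s) for s in selected) / max(len(selected), 1)
        return relevance[f] - red
    best = max(candidates, key=mrmr_score)
    selected.append(best)
    candidates.remove(best)

print("selected features:", selected)
spark.stop()
```

The driver-side loop stays cheap because each association score is a single distributed aggregation; scaling to wide/short datasets mainly means parallelizing over feature pairs rather than over rows, as the paper's implementation does.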