Abstract: Transfer learning is a valuable tool in deep learning, as it allows propagating information from one "source dataset" to another "target dataset", especially in the case of a small number of training examples in the latter. Yet, discrepancies between the underlying distributions of the source and target data are commonplace and are known to have a substantial impact on algorithm performance. In this work, we suggest novel information-theoretic approaches for the analysis of the performance of deep neural networks in the context of transfer learning. We focus on the task of semi-supervised transfer learning, in which unlabeled samples from the target dataset are available during network training on the source dataset. Our theory suggests that one may improve the transferability of a deep neural network by incorporating regularization terms on the target data based on information-theoretic quantities, namely the Mutual Information and the Lautum Information. We demonstrate the effectiveness of the proposed approaches in various semi-supervised transfer learning experiments.
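To make these two quantities concrete: for a bivariate Gaussian with correlation coefficient rho, both measures have closed forms. The mutual information is I(X;Y) = -0.5*ln(1 - rho^2), and the Lautum information, which is the KL divergence taken in the reverse direction, D(P_X P_Y || P_XY), is L(X;Y) = rho^2/(1 - rho^2) + 0.5*ln(1 - rho^2). The following minimal sketch just evaluates these closed forms to illustrate the definitions; it is not the regularizer used in the paper.

```python
import math

def gaussian_mutual_information(rho):
    """I(X;Y) = D_KL(P_XY || P_X P_Y) for a bivariate Gaussian with correlation rho."""
    return -0.5 * math.log(1.0 - rho**2)

def gaussian_lautum_information(rho):
    """L(X;Y) = D_KL(P_X P_Y || P_XY): the KL divergence with its
    arguments swapped relative to the mutual information."""
    return rho**2 / (1.0 - rho**2) + 0.5 * math.log(1.0 - rho**2)

for rho in (0.1, 0.5, 0.9):
    print(f"rho={rho}: MI={gaussian_mutual_information(rho):.4f}, "
          f"Lautum={gaussian_lautum_information(rho):.4f}")
```

Both quantities vanish at independence (rho = 0) and agree to first order for small rho, while the Lautum information grows faster as the dependence strengthens.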
Abstract: In the insurance industry, detecting fraudulent claims is a critical task with a significant financial impact. A common strategy for identifying fraudulent claims is to look for inconsistencies in the supporting evidence. However, this is a laborious and cognitively heavy task for human experts, as insurance claims typically come with a plethora of data from different modalities (e.g., images, text, and metadata). To overcome this challenge, the research community has focused on multimodal machine learning frameworks that can efficiently reason over multiple data sources. Despite recent advances in multimodal learning, these frameworks still suffer from (i) challenges of joint training caused by the different characteristics of different modalities and (ii) overfitting tendencies due to high model complexity. In this work, we address these challenges by introducing a multimodal reasoning framework, AutoFraudNet (Automobile Insurance Fraud Detection Network), for detecting fraudulent auto-insurance claims. AutoFraudNet utilizes a cascaded slow-fusion framework and a state-of-the-art fusion block, BLOCK Tucker, to alleviate the challenges of joint training. Furthermore, it incorporates a lightweight architectural design along with additional losses to prevent overfitting. Through extensive experiments conducted on a real-world dataset, we demonstrate: (i) the merits of multimodal approaches, when compared to unimodal and bimodal methods, and (ii) the effectiveness of AutoFraudNet in fusing various modalities to boost performance (over 3% in PR AUC).
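For intuition about what such a fusion block computes, here is a simplified low-rank bilinear fusion layer in PyTorch. It captures the spirit of Tucker/BLOCK-style multimodal fusion, a rank-constrained bilinear interaction between two modality embeddings, but it is not the actual BLOCK Tucker operator; the class name, dimensions, and rank below are illustrative assumptions.

```python
import torch
import torch.nn as nn

class LowRankBilinearFusion(nn.Module):
    """Fuses two modality embeddings with a rank-constrained bilinear
    interaction (a simplified stand-in for Tucker/BLOCK-style fusion)."""

    def __init__(self, dim_a, dim_b, dim_out, rank=8):
        super().__init__()
        self.proj_a = nn.Linear(dim_a, dim_out * rank)
        self.proj_b = nn.Linear(dim_b, dim_out * rank)
        self.dim_out = dim_out
        self.rank = rank

    def forward(self, a, b):
        # The elementwise product of the two projections realizes a
        # bilinear form; sum-pooling over the rank dimension keeps the
        # parameter count far below that of a full bilinear layer.
        z = self.proj_a(a) * self.proj_b(b)                 # (B, dim_out*rank)
        return z.view(-1, self.dim_out, self.rank).sum(-1)  # (B, dim_out)

# Illustrative usage with assumed feature sizes for image and text inputs.
fusion = LowRankBilinearFusion(dim_a=512, dim_b=300, dim_out=256)
img_feat, txt_feat = torch.randn(4, 512), torch.randn(4, 300)
fused = fusion(img_feat, txt_feat)  # (4, 256)
```

A cascaded slow-fusion design would apply such blocks progressively, merging pairs of modalities at intermediate depths rather than concatenating everything at once.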
Abstract: Transfer learning is a very important tool in deep learning, as it allows propagating information from one "source dataset" to another "target dataset", especially in the case of a small number of training examples in the latter. Yet, discrepancies between the underlying distributions of the source and target data are commonplace and are known to have a substantial impact on algorithm performance. In this work, we suggest a novel information-theoretic approach for the analysis of the performance of deep neural networks in the context of transfer learning. We focus on the task of semi-supervised transfer learning, in which unlabeled samples from the target dataset are available during network training on the source dataset. Our theory suggests that one may improve the transferability of a deep neural network by imposing a Lautum-information-based regularization that relates the network weights to the target data. We demonstrate the effectiveness of the proposed approach in various transfer learning experiments.
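Schematically, such a regularization adds a penalty computed on unlabeled target batches to the supervised loss on the labeled source data. The PyTorch sketch below shows only this overall shape of the objective: `gaussian_kl_to_standard_normal` is a hypothetical Gaussian-approximation stand-in for the Lautum-based term (not the paper's estimator), and it assumes a hypothetical `model` that returns both logits and intermediate features.

```python
import torch
import torch.nn.functional as F

def gaussian_kl_to_standard_normal(feats):
    """KL( N(mu, diag(var)) || N(0, I) ) of batch features; a hypothetical
    stand-in for the Lautum-based penalty on the target data."""
    mu, var = feats.mean(0), feats.var(0) + 1e-6
    return 0.5 * (var + mu**2 - 1.0 - var.log()).sum()

def training_step(model, src_x, src_y, tgt_x_unlabeled, lam=0.1):
    # Supervised loss on a labeled source batch; the model is assumed
    # to return (logits, features).
    logits, _ = model(src_x)
    loss = F.cross_entropy(logits, src_y)
    # Information-theoretic penalty on an unlabeled target batch.
    _, tgt_feats = model(tgt_x_unlabeled)
    return loss + lam * gaussian_kl_to_standard_normal(tgt_feats)
```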
Abstract: Deep neural networks have lately shown tremendous performance in various applications, including vision and speech processing tasks. However, alongside their ability to perform these tasks with such high accuracy, it has been shown that they are highly susceptible to adversarial attacks: a small change in the input can cause the network to err with high confidence. This phenomenon exposes an inherent fault in these networks and their ability to generalize well. For this reason, providing robustness to adversarial attacks is an important challenge in network training, which has led to extensive research. In this work, we suggest a theoretically inspired novel approach to improve the robustness of networks. Our method regularizes using the Frobenius norm of the Jacobian of the network, applied as a post-processing step after regular training has finished. We demonstrate empirically that it leads to enhanced robustness with a minimal change in the original network's accuracy.
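A common way to make such a regularizer tractable is a single random-probe (Hutchinson-style) estimate of the squared Frobenius norm, using the identity E||v^T J||^2 = ||J||_F^2 for a standard Gaussian probe v. The PyTorch sketch below follows this general recipe as a post-processing fine-tuning objective; the function names and the weight `lam` are illustrative assumptions rather than the paper's exact implementation.

```python
import torch
import torch.nn.functional as F

def jacobian_frobenius_penalty(model, x):
    """Single-probe estimate of E_x ||J(x)||_F^2, where J is the Jacobian
    of the network output w.r.t. the input (E_v ||v^T J||^2 = ||J||_F^2)."""
    x = x.clone().requires_grad_(True)
    out = model(x)                 # (B, C) logits
    v = torch.randn_like(out)      # one random probe per sample
    # v^T J for the whole batch via a single backward pass;
    # create_graph=True keeps the penalty differentiable w.r.t. the weights.
    (jv,) = torch.autograd.grad(out, x, grad_outputs=v, create_graph=True)
    return jv.flatten(1).pow(2).sum(1).mean()

def post_training_step(model, x, y, lam=0.01):
    # Fine-tuning objective applied after regular training has finished:
    # retain the task loss while shrinking the input-output Jacobian.
    loss = F.cross_entropy(model(x), y)
    return loss + lam * jacobian_frobenius_penalty(model, x)
```

Because the probe is resampled every step, the estimator is unbiased over training even though each individual step uses only one random direction.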
Abstract: Deep learning models have lately shown great performance in various fields such as computer vision, speech recognition, speech translation, and natural language processing. However, alongside their state-of-the-art performance, the source of their generalization ability generally remains unclear. Thus, an important question is what makes deep neural networks able to generalize well from the training set to new data. In this article, we provide an overview of the existing theory and bounds for the characterization of the generalization error of deep neural networks, combining both classical and more recent theoretical and empirical results.