Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Alexander D'yakonov

Calibration of Neural Networks

Mar 19, 2023

Ruslan Vasilev, Alexander D'yakonov

Abstract:Neural networks solving real-world problems are often required not only to make accurate predictions but also to provide a confidence level in the forecast. The calibration of a model indicates how close the estimated confidence is to the true probability. This paper presents a survey of confidence calibration problems in the context of neural networks and provides an empirical comparison of calibration methods. We analyze problem statement, calibration definitions, and different approaches to evaluation: visualizations and scalar measures that estimate whether the model is well-calibrated. We review modern calibration techniques: based on post-processing or requiring changes in training. Empirical experiments cover various datasets and models, comparing calibration methods according to different criteria.

Via

Access Paper or Ask Questions

Learning to Generate Synthetic Training Data using Gradient Matching and Implicit Differentiation

Mar 16, 2022

Dmitry Medvedev, Alexander D'yakonov

Figure 1 for Learning to Generate Synthetic Training Data using Gradient Matching and Implicit Differentiation

Figure 2 for Learning to Generate Synthetic Training Data using Gradient Matching and Implicit Differentiation

Figure 3 for Learning to Generate Synthetic Training Data using Gradient Matching and Implicit Differentiation

Figure 4 for Learning to Generate Synthetic Training Data using Gradient Matching and Implicit Differentiation

Abstract:Using huge training datasets can be costly and inconvenient. This article explores various data distillation techniques that can reduce the amount of data required to successfully train deep networks. Inspired by recent ideas, we suggest new data distillation techniques based on generative teaching networks, gradient matching, and the Implicit Function Theorem. Experiments with the MNIST image classification problem show that the new methods are computationally more efficient than previous ones and allow to increase the performance of models trained on distilled data.

Via

Access Paper or Ask Questions

New Properties of the Data Distillation Method When Working With Tabular Data

Oct 19, 2020

Dmitry Medvedev, Alexander D'yakonov

Figure 1 for New Properties of the Data Distillation Method When Working With Tabular Data

Figure 2 for New Properties of the Data Distillation Method When Working With Tabular Data

Figure 3 for New Properties of the Data Distillation Method When Working With Tabular Data

Figure 4 for New Properties of the Data Distillation Method When Working With Tabular Data

Abstract:Data distillation is the problem of reducing the volume oftraining data while keeping only the necessary information. With thispaper, we deeper explore the new data distillation algorithm, previouslydesigned for image data. Our experiments with tabular data show thatthe model trained on distilled samples can outperform the model trainedon the original dataset. One of the problems of the considered algorithmis that produced data has poor generalization on models with differenthyperparameters. We show that using multiple architectures during distillation can help overcome this problem.

* 12 pages

Via

Access Paper or Ask Questions

Modern Deep Reinforcement Learning Algorithms

Jul 06, 2019

Sergey Ivanov, Alexander D'yakonov

Figure 1 for Modern Deep Reinforcement Learning Algorithms

Figure 2 for Modern Deep Reinforcement Learning Algorithms

Figure 3 for Modern Deep Reinforcement Learning Algorithms

Figure 4 for Modern Deep Reinforcement Learning Algorithms

Abstract:Recent advances in Reinforcement Learning, grounded on combining classical theoretical results with Deep Learning paradigm, led to breakthroughs in many artificial intelligence tasks and gave birth to Deep Reinforcement Learning (DRL) as a field of research. In this work latest DRL algorithms are reviewed with a focus on their theoretical justification, practical limitations and observed empirical properties.

Via

Access Paper or Ask Questions