Roman Ilin

Enhancing Self-Training Methods

Jan 18, 2023

Aswathnarayan Radhakrishnan, Jim Davis, Zachary Rabin, Benjamin Lewis, Matthew Scherreik, Roman Ilin

Abstract: Semi-supervised learning approaches train on small sets of labeled data along with large sets of unlabeled data. Self-training is a semi-supervised teacher-student approach that often suffers from the problem of "confirmation bias" that occurs when the student model repeatedly overfits to incorrect pseudo-labels given by the teacher model for the unlabeled data. This bias impedes improvements in pseudo-label accuracy across self-training iterations, leading to unwanted saturation in model performance after just a few iterations. In this work, we describe multiple enhancements to improve the self-training pipeline to mitigate the effect of confirmation bias. We evaluate our enhancements over multiple datasets showing performance gains over existing self-training design choices. Finally, we also study the extendability of our enhanced approach to Open Set unlabeled data (containing classes not seen in labeled data).
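
The teacher-student loop described in this abstract can be illustrated with a minimal sketch. This is a generic pseudo-labeling loop, not the paper's enhanced pipeline: the nearest-centroid classifier, the function names, and the `threshold` parameter are all assumptions chosen to keep the example self-contained. It also assumes every class has at least one labeled example.

```python
import numpy as np

def fit_centroids(X, y, n_classes):
    # "Train" a toy classifier: one mean feature vector per class.
    return np.stack([X[y == c].mean(axis=0) for c in range(n_classes)])

def predict_proba(centroids, X):
    # Softmax over negative squared distances to each class centroid.
    d = ((X[:, None, :] - centroids[None, :, :]) ** 2).sum(axis=2)
    logits = -d
    logits -= logits.max(axis=1, keepdims=True)  # numerical stability
    p = np.exp(logits)
    return p / p.sum(axis=1, keepdims=True)

def self_train(X_lab, y_lab, X_unl, n_classes, n_iters=3, threshold=0.9):
    # Initial teacher is trained on labeled data only.
    centroids = fit_centroids(X_lab, y_lab, n_classes)
    for _ in range(n_iters):
        proba = predict_proba(centroids, X_unl)
        pseudo = proba.argmax(axis=1)
        # Hypothetical confidence filter: keep only confident pseudo-labels.
        keep = proba.max(axis=1) >= threshold
        X_all = np.concatenate([X_lab, X_unl[keep]])
        y_all = np.concatenate([y_lab, pseudo[keep]])
        # The student is trained on labeled plus pseudo-labeled data and
        # becomes the teacher for the next iteration.
        centroids = fit_centroids(X_all, y_all, n_classes)
    return centroids
```

Confirmation bias shows up in a loop like this when wrong pseudo-labels pass the filter: they are fed back into training, and the next teacher tends to reproduce them.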

Bottom-up Hierarchical Classification Using Confusion-based Logit Compression

Oct 05, 2021

Tong Liang, Jim Davis, Roman Ilin

Abstract: In this work, we propose a method to efficiently compute label posteriors of a base flat classifier in the presence of few validation examples within a bottom-up hierarchical inference framework. A stand-alone validation set (not used to train the base classifier) is preferred for posterior estimation to avoid overfitting the base classifier; however, a small validation set limits the number of features one can effectively use. We propose a simple yet robust logit vector compression approach based on generalized logits and label confusions for the task of label posterior estimation within the context of hierarchical classification. Extensive comparative experiments with other compression techniques are provided across validation sets of multiple sizes, and a comparison with related hierarchical classification approaches is also conducted. The proposed approach mitigates the problem of not having enough validation examples for reliable posterior estimation while maintaining strong hierarchical classification performance.

A Classification Refinement Strategy for Semantic Segmentation

Jan 23, 2018

James W. Davis, Christopher Menart, Muhammad Akbar, Roman Ilin

Abstract: Based on the observation that semantic segmentation errors are partially predictable, we propose a compact formulation using confusion statistics of the trained classifier to refine (re-estimate) the initial pixel label hypotheses. The proposed strategy is contingent upon computing the classifier confusion probabilities for a given dataset and estimating a relevant prior on the object classes present in the image to be classified. We provide a procedure to robustly estimate the confusion probabilities and explore multiple prior definitions. Experiments are shown comparing performances on multiple challenging datasets using different priors to improve a state-of-the-art semantic segmentation classifier. This study demonstrates the potential to significantly improve semantic labeling and motivates future work for reliable label prior estimation from images.
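
One way to picture how confusion probabilities and a class prior could combine to re-estimate labels is a plain Bayes-rule remapping. The sketch below is an illustration under that assumption, not the paper's formulation: the function name `refine_labels`, the row-normalized confusion layout, and the example numbers are all hypothetical. It also assumes every predicted label has nonzero probability mass under the prior.

```python
import numpy as np

def refine_labels(pred, confusion, prior):
    # confusion[c, k] is assumed to be P(predicted = k | true = c),
    # so each row sums to 1. prior[c] is P(true = c) for this image.
    joint = confusion * prior[:, None]  # P(true = c, predicted = k)
    # Bayes rule: normalize over true classes for each predicted label.
    posterior = joint / joint.sum(axis=0, keepdims=True)
    # For each predicted label, pick the most probable true class.
    remap = posterior.argmax(axis=0)
    return remap[pred]

# Toy two-class example with made-up numbers.
confusion = np.array([[0.7, 0.3],
                      [0.4, 0.6]])
pred = np.array([[0, 1],
                 [1, 0]])
uniform = refine_labels(pred, confusion, np.array([0.5, 0.5]))
skewed = refine_labels(pred, confusion, np.array([0.9, 0.1]))
```

With the uniform prior the labels are unchanged; with the skewed prior every pixel is reassigned to class 0, which shows how strongly the result depends on the prior estimate.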

Formal Concept Analysis of Rodent Carriers of Zoonotic Disease

Aug 25, 2016

Roman Ilin, Barbara A. Han

Abstract: The technique of Formal Concept Analysis is applied to a dataset describing the traits of rodents, with the goal of identifying zoonotic disease carriers, or those species carrying infections that can spill over to cause human disease. The concepts identified among these species together provide rules-of-thumb about the intrinsic biological features of rodents that carry zoonotic diseases, and offer utility for better targeting field surveillance efforts in the search for novel disease carriers in the wild.

* 5 pages, presented at 2016 ICML Workshop on #Data4Good: Machine Learning in Social Good Applications, New York, NY
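
For readers unfamiliar with Formal Concept Analysis, the sketch below enumerates the formal concepts of a tiny binary context by brute force. A formal concept is a pair (extent, intent) where the extent is exactly the set of objects sharing every attribute in the intent, and the intent is exactly the set of attributes shared by every object in the extent. The object names `s1` to `s3` and attributes `x`, `y`, `z` are placeholders, not data from the paper, and the exhaustive subset enumeration is only practical for very small contexts.

```python
from itertools import combinations

def common_attributes(context, objects):
    # Attributes shared by every object in the set
    # (all attributes if the set is empty).
    result = set().union(*context.values())
    for o in objects:
        result &= context[o]
    return frozenset(result)

def objects_having(context, attrs):
    # Objects that possess every attribute in attrs.
    return frozenset(o for o, a in context.items() if attrs <= a)

def formal_concepts(context):
    # Close every subset of objects; duplicates collapse in the set.
    concepts = set()
    objs = list(context)
    for r in range(len(objs) + 1):
        for subset in combinations(objs, r):
            intent = common_attributes(context, subset)
            extent = objects_having(context, intent)
            concepts.add((extent, intent))
    return concepts

# Placeholder context: object -> set of attributes it has.
toy_context = {
    "s1": {"x", "y"},
    "s2": {"y", "z"},
    "s3": {"y"},
}
toy_concepts = formal_concepts(toy_context)
```

Each concept's intent can be read as a rule-of-thumb: every object in the extent has all of those attributes, and no object outside the extent does.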

Beyond Feedforward Models Trained by Backpropagation: a Practical Training Tool for a More Efficient Universal Approximator

Oct 23, 2007

Roman Ilin, Robert Kozma, Paul J. Werbos

Abstract: The Cellular Simultaneous Recurrent Neural Network (SRN) has been shown to be a more powerful function approximator than the MLP. This means that the complexity of an MLP would be prohibitively large for some problems, while an SRN could realize the desired mapping within acceptable computational constraints. The speed of training of complex recurrent networks is crucial to their successful application. The present work improves on previous results by training the network with an extended Kalman filter (EKF). We implemented a generic Cellular SRN and applied it to two challenging problems: 2D maze navigation and a subset of the connectedness problem. The speed of convergence has been improved by several orders of magnitude in comparison with the earlier results in the case of maze navigation, and superior generalization has been demonstrated in the case of connectedness. The implications of these improvements are discussed.