Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Anatoly Shalyto

New Arabic Medical Dataset for Diseases Classification

Jul 05, 2021

Jaafar Hammoud, Aleksandra Vatian, Natalia Dobrenko, Nikolai Vedernikov, Anatoly Shalyto, Natalia Gusarova

Figure 1 for New Arabic Medical Dataset for Diseases Classification

Abstract:The Arabic language suffers from a great shortage of datasets suitable for training deep learning models, and the existing ones include general non-specialized classifications. In this work, we introduce a new Arab medical dataset, which includes two thousand medical documents collected from several Arabic medical websites, in addition to the Arab Medical Encyclopedia. The dataset was built for the task of classifying texts and includes 10 classes (Blood, Bone, Cardiovascular, Ear, Endocrine, Eye, Gastrointestinal, Immune, Liver and Nephrological) diseases. Experiments on the dataset were performed by fine-tuning three pre-trained models: BERT from Google, Arabert that based on BERT with large Arabic corpus, and AraBioNER that based on Arabert with Arabic medical corpus.

Via

Access Paper or Ask Questions

Reinforcement-based Simultaneous Algorithm and its Hyperparameters Selection

Nov 07, 2016

Valeria Efimova, Andrey Filchenkov, Anatoly Shalyto

Figure 1 for Reinforcement-based Simultaneous Algorithm and its Hyperparameters Selection

Figure 2 for Reinforcement-based Simultaneous Algorithm and its Hyperparameters Selection

Abstract:Many algorithms for data analysis exist, especially for classification problems. To solve a data analysis problem, a proper algorithm should be chosen, and also its hyperparameters should be selected. In this paper, we present a new method for the simultaneous selection of an algorithm and its hyperparameters. In order to do so, we reduced this problem to the multi-armed bandit problem. We consider an algorithm as an arm and algorithm hyperparameters search during a fixed time as the corresponding arm play. We also suggest a problem-specific reward function. We performed the experiments on 10 real datasets and compare the suggested method with the existing one implemented in Auto-WEKA. The results show that our method is significantly better in most of the cases and never worse than the Auto-WEKA.

Via

Access Paper or Ask Questions

Symmetry Breaking Predicates for SAT-based DFA Identification

Feb 17, 2016

Vladimir Ulyantsev, Ilya Zakirzyanov, Anatoly Shalyto

Figure 1 for Symmetry Breaking Predicates for SAT-based DFA Identification

Figure 2 for Symmetry Breaking Predicates for SAT-based DFA Identification

Figure 3 for Symmetry Breaking Predicates for SAT-based DFA Identification

Figure 4 for Symmetry Breaking Predicates for SAT-based DFA Identification

Abstract:It was shown before that the NP-hard problem of deterministic finite automata (DFA) identification can be effectively translated to Boolean satisfiability (SAT). Modern SAT-solvers can tackle hard DFA identification instances efficiently. We present a technique to reduce the problem search space by enforcing an enumeration of DFA states in depth-first search (DFS) or breadth-first search (BFS) order. We propose symmetry breaking predicates, which can be added to Boolean formulae representing various DFA identification problems. We show how to apply this technique to DFA identification from both noiseless and noisy data. Also we propose a method to identify all automata of the desired size. The proposed approach outperforms the current state-of-the-art DFASAT method for DFA identification from noiseless data. A big advantage of the proposed approach is that it allows to determine exactly the existence or non-existence of a solution of the noisy DFA identification problem unlike metaheuristic approaches such as genetic algorithms.

* 14 pages, 9 figures, 5 tables, submitted to Journal of Computer and System Science

Via

Access Paper or Ask Questions

An Asynchronous Implementation of the Limited Memory CMA-ES

Oct 01, 2015

Viktor Arkhipov, Maxim Buzdalov, Anatoly Shalyto

Figure 1 for An Asynchronous Implementation of the Limited Memory CMA-ES

Figure 2 for An Asynchronous Implementation of the Limited Memory CMA-ES

Figure 3 for An Asynchronous Implementation of the Limited Memory CMA-ES

Figure 4 for An Asynchronous Implementation of the Limited Memory CMA-ES

Abstract:We present our asynchronous implementation of the LM-CMA-ES algorithm, which is a modern evolution strategy for solving complex large-scale continuous optimization problems. Our implementation brings the best results when the number of cores is relatively high and the computational complexity of the fitness function is also high. The experiments with benchmark functions show that it is able to overcome its origin on the Sphere function, reaches certain thresholds faster on the Rosenbrock and Ellipsoid function, and surprisingly performs much better than the original version on the Rastrigin function.

* 9 pages, 4 figures, 4 tables; this is a full version of a paper which has been accepted as a poster to IEEE ICMLA conference 2015

Via

Access Paper or Ask Questions