Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Konstantin Malanchev

ORACLE: A Real-Time, Hierarchical, Deep-Learning Photometric Classifier for the LSST

Jan 02, 2025

Ved G. Shah, Alex Gagliano, Konstantin Malanchev, Gautham Narayan, The LSST Dark Energy Science Collaboration

Abstract:We present ORACLE, the first hierarchical deep-learning model for real-time, context-aware classification of transient and variable astrophysical phenomena. ORACLE is a recurrent neural network with Gated Recurrent Units (GRUs), and has been trained using a custom hierarchical cross-entropy loss function to provide high-confidence classifications along an observationally-driven taxonomy with as little as a single photometric observation. Contextual information for each object, including host galaxy photometric redshift, offset, ellipticity and brightness, is concatenated to the light curve embedding and used to make a final prediction. Training on $\sim$0.5M events from the Extended LSST Astronomical Time-Series Classification Challenge, we achieve a top-level (Transient vs Variable) macro-averaged precision of 0.96 using only 1 day of photometric observations after the first detection in addition to contextual information, for each event; this increases to $>$0.99 once 64 days of the light curve has been obtained, and 0.83 at 1024 days after first detection for 19-way classification (including supernova sub-types, active galactic nuclei, variable stars, microlensing events, and kilonovae). We also compare ORACLE with other state-of-the-art classifiers and report comparable performance for the 19-way classification task, in addition to delivering accurate top-level classifications much earlier. The code and model weights used in this work are publicly available at our associated GitHub repository (https://github.com/uiucsn/ELAsTiCC-Classification).

* 29 pages, 19 figures, 9 tables. Submitted to ApJ

Via

Access Paper or Ask Questions

Exploring the Universe with SNAD: Anomaly Detection in Astronomy

Oct 24, 2024

Alina A. Volnova, Patrick D. Aleo, Anastasia Lavrukhina, Etienne Russeil, Timofey Semenikhin, Emmanuel Gangler, Emille E. O. Ishida, Matwey V. Kornilov, Vladimir Korolev, Konstantin Malanchev(+2 more)

Abstract:SNAD is an international project with a primary focus on detecting astronomical anomalies within large-scale surveys, using active learning and other machine learning algorithms. The work carried out by SNAD not only contributes to the discovery and classification of various astronomical phenomena but also enhances our understanding and implementation of machine learning techniques within the field of astrophysics. This paper provides a review of the SNAD project and summarizes the advancements and achievements made by the team over several years.

* In: Baixeries, J., Ignatov, D.I., Kuznetsov, S.O., Stupnikov, S. (eds) Data Analytics and Management in Data Intensive Domains. DAMDID/RCDL 2023. Communications in Computer and Information Science, vol 2086. Springer, Cham
* 14 pages, 4 figures

Via

Access Paper or Ask Questions

Multi-View Symbolic Regression

Feb 16, 2024

Etienne Russeil, Fabrício Olivetti de França, Konstantin Malanchev, Bogdan Burlacu, Emille E. O. Ishida, Marion Leroux, Clément Michelin, Guillaume Moinard, Emmanuel Gangler

Figure 1 for Multi-View Symbolic Regression

Figure 2 for Multi-View Symbolic Regression

Figure 3 for Multi-View Symbolic Regression

Figure 4 for Multi-View Symbolic Regression

Abstract:Symbolic regression (SR) searches for analytical expressions representing the relationship between a set of explanatory and response variables. Current SR methods assume a single dataset extracted from a single experiment. Nevertheless, frequently, the researcher is confronted with multiple sets of results obtained from experiments conducted with different setups. Traditional SR methods may fail to find the underlying expression since the parameters of each experiment can be different. In this work we present Multi-View Symbolic Regression (MvSR), which takes into account multiple datasets simultaneously, mimicking experimental environments, and outputs a general parametric solution. This approach fits the evaluated expression to each independent dataset and returns a parametric family of functions f(x; \theta) simultaneously capable of accurately fitting all datasets. We demonstrate the effectiveness of MvSR using data generated from known expressions, as well as real-world data from astronomy, chemistry and economy, for which an a priori analytical expression is not available. Results show that MvSR obtains the correct expression more frequently and is robust to hyperparameters change. In real-world data, it is able to grasp the group behaviour, recovering known expressions from the literature as well as promising alternatives, thus enabling the use SR to a large range of experimental scenarios.

* Submitted to GECCO-2024. 10 pages, 6 figures

Via

Access Paper or Ask Questions

Toward an understanding of the properties of neural network approaches for supernovae light curve approximation

Sep 15, 2022

Mariia Demianenko, Konstantin Malanchev, Ekaterina Samorodova, Mikhail Sysak, Aleksandr Shiriaev, Denis Derkach, Mikhail Hushchyn

Figure 1 for Toward an understanding of the properties of neural network approaches for supernovae light curve approximation

Figure 2 for Toward an understanding of the properties of neural network approaches for supernovae light curve approximation

Figure 3 for Toward an understanding of the properties of neural network approaches for supernovae light curve approximation

Figure 4 for Toward an understanding of the properties of neural network approaches for supernovae light curve approximation

Abstract:The modern time-domain photometric surveys collect a lot of observations of various astronomical objects, and the coming era of large-scale surveys will provide even more information. Most of the objects have never received a spectroscopic follow-up, which is especially crucial for transients e.g. supernovae. In such cases, observed light curves could present an affordable alternative. Time series are actively used for photometric classification and characterization, such as peak and luminosity decline estimation. However, the collected time series are multidimensional, irregularly sampled, contain outliers, and do not have well-defined systematic uncertainties. Machine learning methods help extract useful information from available data in the most efficient way. We consider several light curve approximation methods based on neural networks: Multilayer Perceptrons, Bayesian Neural Networks, and Normalizing Flows, to approximate observations of a single light curve. Tests using both the simulated PLAsTiCC and real Zwicky Transient Facility data samples demonstrate that even few observations are enough to fit networks and achieve better approximation quality than other state-of-the-art methods. We show that the methods described in this work have better computational complexity and work faster than Gaussian Processes. We analyze the performance of the approximation techniques aiming to fill the gaps in the observations of the light curves, and show that the use of appropriate technique increases the accuracy of peak finding and supernova classification. In addition, the study results are organized in a Fulu Python library available on GitHub, which can be easily used by the community.

* Submitted to MNRAS. 14 pages, 6 figures, 9 tables

Via

Access Paper or Ask Questions

Supernova Light Curves Approximation based on Neural Network Models

Jun 27, 2022

Mariia Demianenko, Ekaterina Samorodova, Mikhail Sysak, Aleksandr Shiriaev, Konstantin Malanchev, Denis Derkach, Mikhail Hushchyn

Figure 1 for Supernova Light Curves Approximation based on Neural Network Models

Figure 2 for Supernova Light Curves Approximation based on Neural Network Models

Figure 3 for Supernova Light Curves Approximation based on Neural Network Models

Figure 4 for Supernova Light Curves Approximation based on Neural Network Models

Abstract:Photometric data-driven classification of supernovae becomes a challenge due to the appearance of real-time processing of big data in astronomy. Recent studies have demonstrated the superior quality of solutions based on various machine learning models. These models learn to classify supernova types using their light curves as inputs. Preprocessing these curves is a crucial step that significantly affects the final quality. In this talk, we study the application of multilayer perceptron (MLP), bayesian neural network (BNN), and normalizing flows (NF) to approximate observations for a single light curve. We use these approximations as inputs for supernovae classification models and demonstrate that the proposed methods outperform the state-of-the-art based on Gaussian processes applying to the Zwicky Transient Facility Bright Transient Survey light curves. MLP demonstrates similar quality as Gaussian processes and speed increase. Normalizing Flows exceeds Gaussian processes in terms of approximation quality as well.

* 5 pages, 2 figures and 2 tables

Via

Access Paper or Ask Questions

Photometric Data-driven Classification of Type Ia Supernovae in the Open Supernova Catalog

Jun 18, 2020

Stanislav Dobryakov, Konstantin Malanchev, Denis Derkach, Mikhail Hushchyn

Figure 1 for Photometric Data-driven Classification of Type Ia Supernovae in the Open Supernova Catalog

Figure 2 for Photometric Data-driven Classification of Type Ia Supernovae in the Open Supernova Catalog

Figure 3 for Photometric Data-driven Classification of Type Ia Supernovae in the Open Supernova Catalog

Figure 4 for Photometric Data-driven Classification of Type Ia Supernovae in the Open Supernova Catalog

Abstract:We propose a novel approach for a machine-learning-based detection of the type Ia supernovae using photometric information. Unlike other approaches, only real observation data is used during training. Despite being trained on a relatively small sample, the method shows good results on real data from the Open Supernovae Catalog. We also demonstrate that the quality of a model, trained on PLASTiCC simulated sample, significantly decreases evaluated on real objects.

* 20 pages, 9 figures, submitted to Astronomy and Computing

Via

Access Paper or Ask Questions