Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Sankalp Gilda

Robust Calibration For Improved Weather Prediction Under Distributional Shift

Jan 08, 2024

Sankalp Gilda, Neel Bhandari, Wendy Mak, Andrea Panizza

Abstract:In this paper, we present results on improving out-of-domain weather prediction and uncertainty estimation as part of the \texttt{Shifts Challenge on Robustness and Uncertainty under Real-World Distributional Shift} challenge. We find that by leveraging a mixture of experts in conjunction with an advanced data augmentation technique borrowed from the computer vision domain, in conjunction with robust \textit{post-hoc} calibration of predictive uncertainties, we can potentially achieve more accurate and better-calibrated results with deep neural networks than with boosted tree models for tabular data. We quantify our predictions using several metrics and propose several future lines of inquiry and experimentation to boost performance.

* Presented at the Bayesian Deep Learning workshop at NeurIPS 2021

Via

Access Paper or Ask Questions

Beyond mirkwood: Enhancing SED Modeling with Conformal Predictions

Dec 21, 2023

Sankalp Gilda

Figure 1 for Beyond mirkwood: Enhancing SED Modeling with Conformal Predictions

Figure 2 for Beyond mirkwood: Enhancing SED Modeling with Conformal Predictions

Figure 3 for Beyond mirkwood: Enhancing SED Modeling with Conformal Predictions

Abstract:Traditional spectral energy distribution (SED) fitting techniques face uncertainties due to assumptions in star formation histories and dust attenuation curves. We propose an advanced machine learning-based approach that enhances flexibility and uncertainty quantification in SED fitting. Unlike the fixed NGBoost model used in mirkwood, our approach allows for any sklearn-compatible model, including deterministic models. We incorporate conformalized quantile regression to convert point predictions into error bars, enhancing interpretability and reliability. Using CatBoost as the base predictor, we compare results with and without conformal prediction, demonstrating improved performance using metrics such as coverage and interval width. Our method offers a more versatile and accurate tool for deriving galaxy physical properties from observational data.

* 4 pages + 1 reference page. Accepted to the 3rd AI2ASE workshop at AAAI 2024

Via

Access Paper or Ask Questions

deep-REMAP: Parameterization of Stellar Spectra Using Regularized Multi-Task Learning

Nov 07, 2023

Sankalp Gilda

Abstract:Traditional spectral analysis methods are increasingly challenged by the exploding volumes of data produced by contemporary astronomical surveys. In response, we develop deep-Regularized Ensemble-based Multi-task Learning with Asymmetric Loss for Probabilistic Inference ($\rm{deep-REMAP}$), a novel framework that utilizes the rich synthetic spectra from the PHOENIX library and observational data from the MARVELS survey to accurately predict stellar atmospheric parameters. By harnessing advanced machine learning techniques, including multi-task learning and an innovative asymmetric loss function, $\rm{deep-REMAP}$ demonstrates superior predictive capabilities in determining effective temperature, surface gravity, and metallicity from observed spectra. Our results reveal the framework's effectiveness in extending to other stellar libraries and properties, paving the way for more sophisticated and automated techniques in stellar characterization.

* 5 main pages + 2 figures. Accepted to the ML4PS workshop at NeurIPS 2023

Via

Access Paper or Ask Questions

Unsupervised Domain Adaptation for Constraining Star Formation Histories

Dec 28, 2021

Sankalp Gilda, Antoine de Mathelin, Sabine Bellstedt, Guillaume Richard

Figure 1 for Unsupervised Domain Adaptation for Constraining Star Formation Histories

Figure 2 for Unsupervised Domain Adaptation for Constraining Star Formation Histories

Figure 3 for Unsupervised Domain Adaptation for Constraining Star Formation Histories

Figure 4 for Unsupervised Domain Adaptation for Constraining Star Formation Histories

Abstract:The prevalent paradigm of machine learning today is to use past observations to predict future ones. What if, however, we are interested in knowing the past given the present? This situation is indeed one that astronomers must contend with often. To understand the formation of our universe, we must derive the time evolution of the visible mass content of galaxies. However, to observe a complete star life, one would need to wait for one billion years! To overcome this difficulty, astrophysicists leverage supercomputers and evolve simulated models of galaxies till the current age of the universe, thus establishing a mapping between observed radiation and star formation histories (SFHs). Such ground-truth SFHs are lacking for actual galaxy observations, where they are usually inferred -- with often poor confidence -- from spectral energy distributions (SEDs) using Bayesian fitting methods. In this investigation, we discuss the ability of unsupervised domain adaptation to derive accurate SFHs for galaxies with simulated data as a necessary first step in developing a technique that can ultimately be applied to observational data.

* Accepted for oral presentation at the 1st Annual AAAI Workshop on AI to Accelerate Science and Engineering (AI2ASE). Journal article to follow

Via

Access Paper or Ask Questions

Uncertainty-Aware Learning for Improvements in Image Quality of the Canada-France-Hawaii Telescope

Jun 30, 2021

Sankalp Gilda, Stark C. Draper, Sebastien Fabbro, William Mahoney, Simon Prunet, Kanoa Withington, Matthew Wilson, Yuan-Sen Ting, Andrew Sheinis

Figure 1 for Uncertainty-Aware Learning for Improvements in Image Quality of the Canada-France-Hawaii Telescope

Figure 2 for Uncertainty-Aware Learning for Improvements in Image Quality of the Canada-France-Hawaii Telescope

Figure 3 for Uncertainty-Aware Learning for Improvements in Image Quality of the Canada-France-Hawaii Telescope

Figure 4 for Uncertainty-Aware Learning for Improvements in Image Quality of the Canada-France-Hawaii Telescope

Abstract:We leverage state-of-the-art machine learning methods and a decade's worth of archival data from the Canada-France-Hawaii Telescope (CFHT) to predict observatory image quality (IQ) from environmental conditions and observatory operating parameters. Specifically, we develop accurate and interpretable models of the complex dependence between data features and observed IQ for CFHT's wide field camera, MegaCam. Our contributions are several-fold. First, we collect, collate and reprocess several disparate data sets gathered by CFHT scientists. Second, we predict probability distribution functions (PDFs) of IQ, and achieve a mean absolute error of $\sim0.07''$ for the predicted medians. Third, we explore data-driven actuation of the 12 dome ``vents'', installed in 2013-14 to accelerate the flushing of hot air from the dome. We leverage epistemic and aleatoric uncertainties in conjunction with probabilistic generative modeling to identify candidate vent adjustments that are in-distribution (ID) and, for the optimal configuration for each ID sample, we predict the reduction in required observing time to achieve a fixed SNR. On average, the reduction is $\sim15\%$. Finally, we rank sensor data features by Shapley values to identify the most predictive variables for each observation. Our long-term goal is to construct reliable and real-time models that can forecast optimal observatory operating parameters for optimization of IQ. Such forecasts can then be fed into scheduling protocols and predictive maintenance routines. We anticipate that such approaches will become standard in automating observatory operations and maintenance by the time CFHT's successor, the Maunakea Spectroscopic Explorer (MSE), is installed in the next decade.

* 25 pages, 1 appendix, 12 figures. To be submitted to MNRAS. Comments and feedback welcome

Via

Access Paper or Ask Questions

Feature Selection for Better Spectral Characterization or: How I Learned to Start Worrying and Love Ensembles

Feb 22, 2019

Sankalp Gilda

Figure 1 for Feature Selection for Better Spectral Characterization or: How I Learned to Start Worrying and Love Ensembles

Abstract:An ever-looming threat to astronomical applications of machine learning is the danger of over-fitting data, also known as the `curse of dimensionality.' This occurs when there are fewer samples than the number of independent variables. In this work, we focus on the problem of stellar parameterization from low-mid resolution spectra, with blended absorption lines. We address this problem using an iterative algorithm to sequentially prune redundant features from synthetic PHOENIX spectra, and arrive at an optimal set of wavelengths with the strongest correlation with each of the output variables -- T$_{\rm eff}$, $\log g$, and [Fe/H]. We find that at any given resolution, most features (i.e., absorption lines) are not only redundant, but actually act as noise and decrease the accuracy of parameter retrieval.

* 4 pages, 1 figure, presented at Astronomical Data Analysis Software & Systems (ADASS) 2018

Via

Access Paper or Ask Questions