Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Md Abul Hayat

Estimating Galactic Distances From Images Using Self-supervised Representation Learning

Jan 12, 2021

Md Abul Hayat, Peter Harrington, George Stein, Zarija Lukić, Mustafa Mustafa

Figure 1 for Estimating Galactic Distances From Images Using Self-supervised Representation Learning

Figure 2 for Estimating Galactic Distances From Images Using Self-supervised Representation Learning

Abstract:We use a contrastive self-supervised learning framework to estimate distances to galaxies from their photometric images. We incorporate data augmentations from computer vision as well as an application-specific augmentation accounting for galactic dust. We find that the resulting visual representations of galaxy images are semantically useful and allow for fast similarity searches, and can be successfully fine-tuned for the task of redshift estimation. We show that (1) pretraining on a large corpus of unlabeled data followed by fine-tuning on some labels can attain the accuracy of a fully-supervised model which requires 2-4x more labeled data, and (2) that by fine-tuning our self-supervised representations using all available data labels in the Main Galaxy Sample of the Sloan Digital Sky Survey (SDSS), we outperform the state-of-the-art supervised learning method.

Via

Access Paper or Ask Questions

Self-Supervised Representation Learning for Astronomical Images

Dec 24, 2020

Md Abul Hayat, George Stein, Peter Harrington, Zarija Lukić, Mustafa Mustafa

Figure 1 for Self-Supervised Representation Learning for Astronomical Images

Figure 2 for Self-Supervised Representation Learning for Astronomical Images

Figure 3 for Self-Supervised Representation Learning for Astronomical Images

Figure 4 for Self-Supervised Representation Learning for Astronomical Images

Abstract:Sky surveys are the largest data generators in astronomy, making automated tools for extracting meaningful scientific information an absolute necessity. We show that, without the need for labels, self-supervised learning recovers representations of sky survey images that are semantically useful for a variety of scientific tasks. These representations can be directly used as features, or fine-tuned, to outperform supervised methods trained only on labeled data. We apply a contrastive learning framework on multi-band galaxy photometry from the Sloan Digital Sky Survey (SDSS) to learn image representations. We then use them for galaxy morphology classification, and fine-tune them for photometric redshift estimation, using labels from the Galaxy Zoo 2 dataset and SDSS spectroscopy. In both downstream tasks, using the same learned representations, we outperform the supervised state-of-the-art results, and we show that our approach can achieve the accuracy of supervised models while using 2-4 times fewer labels for training.

Via

Access Paper or Ask Questions