Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Tuan Do

UCLA

Learning the Evolution of Physical Structure of Galaxies via Diffusion Models

Nov 27, 2024

Andrew Lizarraga, Eric Hanchen Jiang, Jacob Nowack, Yun Qi Li, Ying Nian Wu, Bernie Boscoe, Tuan Do

Figure 1 for Learning the Evolution of Physical Structure of Galaxies via Diffusion Models

Figure 2 for Learning the Evolution of Physical Structure of Galaxies via Diffusion Models

Figure 3 for Learning the Evolution of Physical Structure of Galaxies via Diffusion Models

Figure 4 for Learning the Evolution of Physical Structure of Galaxies via Diffusion Models

Abstract:In astrophysics, understanding the evolution of galaxies in primarily through imaging data is fundamental to comprehending the formation of the Universe. This paper introduces a novel approach to conditioning Denoising Diffusion Probabilistic Models (DDPM) on redshifts for generating galaxy images. We explore whether this advanced generative model can accurately capture the physical characteristics of galaxies based solely on their images and redshift measurements. Our findings demonstrate that this model not only produces visually realistic galaxy images but also encodes the underlying changes in physical properties with redshift that are the result of galaxy evolution. This approach marks a significant advancement in using generative models to enhance our scientific insight into cosmic phenomena.

Via

Access Paper or Ask Questions

Using different sources of ground truths and transfer learning to improve the generalization of photometric redshift estimation

Nov 27, 2024

Jonathan Soriano, Srinath Saikrishnan, Vikram Seenivasan, Bernie Boscoe, Jack Singal, Tuan Do

Abstract:In this work, we explore methods to improve galaxy redshift predictions by combining different ground truths. Traditional machine learning models rely on training sets with known spectroscopic redshifts, which are precise but only represent a limited sample of galaxies. To make redshift models more generalizable to the broader galaxy population, we investigate transfer learning and directly combining ground truth redshifts derived from photometry and spectroscopy. We use the COSMOS2020 survey to create a dataset, TransferZ, which includes photometric redshift estimates derived from up to 35 imaging filters using template fitting. This dataset spans a wider range of galaxy types and colors compared to spectroscopic samples, though its redshift estimates are less accurate. We first train a base neural network on TransferZ and then refine it using transfer learning on a dataset of galaxies with more precise spectroscopic redshifts (GalaxiesML). In addition, we train a neural network on a combined dataset of TransferZ and GalaxiesML. Both methods reduce bias by $\sim$ 5x, RMS error by $\sim$ 1.5x, and catastrophic outlier rates by 1.3x on GalaxiesML, compared to a baseline trained only on TransferZ. However, we also find a reduction in performance for RMS and bias when evaluated on TransferZ data. Overall, our results demonstrate these approaches can meet cosmological requirements.

* 10 pages, 4 figures, 2 tables, accepted to NeurIPS 2024 Workshop ML4PS

Via

Access Paper or Ask Questions

Using Galaxy Evolution as Source of Physics-Based Ground Truth for Generative Models

Jul 09, 2024

Yun Qi Li, Tuan Do, Evan Jones, Bernie Boscoe, Kevin Alfaro, Zooey Nguyen

Figure 1 for Using Galaxy Evolution as Source of Physics-Based Ground Truth for Generative Models

Figure 2 for Using Galaxy Evolution as Source of Physics-Based Ground Truth for Generative Models

Figure 3 for Using Galaxy Evolution as Source of Physics-Based Ground Truth for Generative Models

Figure 4 for Using Galaxy Evolution as Source of Physics-Based Ground Truth for Generative Models

Abstract:Generative models producing images have enormous potential to advance discoveries across scientific fields and require metrics capable of quantifying the high dimensional output. We propose that astrophysics data, such as galaxy images, can test generative models with additional physics-motivated ground truths in addition to human judgment. For example, galaxies in the Universe form and change over billions of years, following physical laws and relationships that are both easy to characterize and difficult to encode in generative models. We build a conditional denoising diffusion probabilistic model (DDPM) and a conditional variational autoencoder (CVAE) and test their ability to generate realistic galaxies conditioned on their redshifts (galaxy ages). This is one of the first studies to probe these generative models using physically motivated metrics. We find that both models produce comparable realistic galaxies based on human evaluation, but our physics-based metrics are better able to discern the strengths and weaknesses of the generative models. Overall, the DDPM model performs better than the CVAE on the majority of the physics-based metrics. Ultimately, if we can show that generative models can learn the physics of galaxy evolution, they have the potential to unlock new astrophysical discoveries.

* 20 pages, 14 figures, 1 Table, code: https://github.com/astrodatalab/li2024_public, training data: https://zenodo.org/records/11117528

Via

Access Paper or Ask Questions

Elements of effective machine learning datasets in astronomy

Nov 29, 2022

Bernie Boscoe, Tuan Do, Evan Jones, Yunqi Li, Kevin Alfaro, Christy Ma

Abstract:In this work, we identify elements of effective machine learning datasets in astronomy and present suggestions for their design and creation. Machine learning has become an increasingly important tool for analyzing and understanding the large-scale flood of data in astronomy. To take advantage of these tools, datasets are required for training and testing. However, building machine learning datasets for astronomy can be challenging. Astronomical data is collected from instruments built to explore science questions in a traditional fashion rather than to conduct machine learning. Thus, it is often the case that raw data, or even downstream processed data is not in a form amenable to machine learning. We explore the construction of machine learning datasets and we ask: what elements define effective machine learning datasets? We define effective machine learning datasets in astronomy to be formed with well-defined data points, structure, and metadata. We discuss why these elements are important for astronomical applications and ways to put them in practice. We posit that these qualities not only make the data suitable for machine learning, they also help to foster usable, reusable, and replicable science practices.

* 5 pages, 1 figure, accepted to the peer-reviewed NeurIPS Machine Learning in the Physical Sciences Workshop, 2022

Via

Access Paper or Ask Questions

Multimodal Interactive Learning of Primitive Actions

Oct 01, 2018

Tuan Do, Nikhil Krishnaswamy, Kyeongmin Rim, James Pustejovsky

Figure 1 for Multimodal Interactive Learning of Primitive Actions

Figure 2 for Multimodal Interactive Learning of Primitive Actions

Abstract:We describe an ongoing project in learning to perform primitive actions from demonstrations using an interactive interface. In our previous work, we have used demonstrations captured from humans performing actions as training samples for a neural network-based trajectory model of actions to be performed by a computational agent in novel setups. We found that our original framework had some limitations that we hope to overcome by incorporating communication between the human and the computational agent, using the interaction between them to fine-tune the model learned by the machine. We propose a framework that uses multimodal human-computer interaction to teach action concepts to machines, making use of both live demonstration and communication through natural language, as two distinct teaching modalities, while requiring few training samples.

* Presented at AI-HRI AAAI-FSS, 2018 (arXiv:1809.06606)

Via

Access Paper or Ask Questions

Learning event representation: As sparse as possible, but not sparser

Oct 02, 2017

Tuan Do, James Pustejovsky

Figure 1 for Learning event representation: As sparse as possible, but not sparser

Figure 2 for Learning event representation: As sparse as possible, but not sparser

Figure 3 for Learning event representation: As sparse as possible, but not sparser

Figure 4 for Learning event representation: As sparse as possible, but not sparser

Abstract:Selecting an optimal event representation is essential for event classification in real world contexts. In this paper, we investigate the application of qualitative spatial reasoning (QSR) frameworks for classification of human-object interaction in three dimensional space, in comparison with the use of quantitative feature extraction approaches for the same purpose. In particular, we modify QSRLib, a library that allows computation of Qualitative Spatial Relations and Calculi, and employ it for feature extraction, before inputting features into our neural network models. Using an experimental setup involving motion captures of human-object interaction as three dimensional inputs, we observe that the use of qualitative spatial features significantly improves the performance of our machine learning algorithm against our baseline, while quantitative features of similar kinds fail to deliver similar improvement. We also observe that sequential representations of QSR features yield the best classification performance. A result of our learning method is a simple approach to the qualitative representation of 3D activities as compositions of 2D actions that can be visualized and learned using 2-dimensional QSR.

* Qualitative reasoning Workshop 2017

Via

Access Paper or Ask Questions

Fine-grained Event Learning of Human-Object Interaction with LSTM-CRF

Sep 30, 2017

Tuan Do, James Pustejovsky

Figure 1 for Fine-grained Event Learning of Human-Object Interaction with LSTM-CRF

Figure 2 for Fine-grained Event Learning of Human-Object Interaction with LSTM-CRF

Figure 3 for Fine-grained Event Learning of Human-Object Interaction with LSTM-CRF

Abstract:Event learning is one of the most important problems in AI. However, notwithstanding significant research efforts, it is still a very complex task, especially when the events involve the interaction of humans or agents with other objects, as it requires modeling human kinematics and object movements. This study proposes a methodology for learning complex human-object interaction (HOI) events, involving the recording, annotation and classification of event interactions. For annotation, we allow multiple interpretations of a motion capture by slicing over its temporal span, for classification, we use Long-Short Term Memory (LSTM) sequential models with Conditional Randon Field (CRF) for constraints of outputs. Using a setup involving captures of human-object interaction as three dimensional inputs, we argue that this approach could be used for event types involving complex spatio-temporal dynamics.

* European Symposium on Artificial Neural Networks, Computational Intelligence and Machine Learning (ESANN 2017)

Via

Access Paper or Ask Questions

ECAT: Event Capture Annotation Tool

Oct 05, 2016

Tuan Do, Nikhil Krishnaswamy, James Pustejovsky

Figure 1 for ECAT: Event Capture Annotation Tool

Figure 2 for ECAT: Event Capture Annotation Tool

Figure 3 for ECAT: Event Capture Annotation Tool

Figure 4 for ECAT: Event Capture Annotation Tool

Abstract:This paper introduces the Event Capture Annotation Tool (ECAT), a user-friendly, open-source interface tool for annotating events and their participants in video, capable of extracting the 3D positions and orientations of objects in video captured by Microsoft's Kinect(R) hardware. The modeling language VoxML (Pustejovsky and Krishnaswamy, 2016) underlies ECAT's object, program, and attribute representations, although ECAT uses its own spec for explicit labeling of motion instances. The demonstration will show the tool's workflow and the options available for capturing event-participant relations and browsing visual data. Mapping ECAT's output to VoxML will also be addressed.

* 4 pages, 4 figures, ISA workshop 2015

Via

Access Paper or Ask Questions