Institute for Acoustics and Speech Communication, Technische Universität Dresden, Dresden, Germany
Abstract: This paper introduces a novel multi-modal neural network model for resolution enhancement that leverages inter-diagnostic correlations within a fusion device. Traditional approaches have focused primarily on uni-modal enhancement strategies, such as pixel-based image enhancement or heuristic signal interpolation. In contrast, our model exploits the diagnostic relationships arising from the physics of fusion plasma. We first establish the correlations among diagnostics within the tokamak and then use these correlations to substantially enhance the temporal resolution of the Thomson Scattering diagnostic, which measures plasma density and temperature. By increasing its temporal resolution from the conventional 200 Hz to 500 kHz, we enable a level of insight into plasma behavior previously attainable only through computationally intensive simulations. This enhancement goes beyond simple interpolation and offers new perspectives on the physical phenomena governing plasma dynamics.
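The abstract does not spell out the network architecture or which fast-sampled diagnostics are used, so the following is only a hypothetical sketch of the general idea of inter-diagnostic temporal super-resolution: learn a mapping from windows of a fast diagnostic to the slow Thomson Scattering values at the times where both are available, then evaluate that mapping at the fast rate. The toy signals, window length and the choice of a small MLP are assumptions for illustration, not the paper's method.

```python
# Hypothetical sketch of inter-diagnostic temporal super-resolution (not the paper's model).
import numpy as np
from sklearn.neural_network import MLPRegressor

rng = np.random.default_rng(1)
fs_fast, fs_slow, T = 500_000, 200, 0.25           # sampling rates in Hz, toy shot duration in s
t_fast = np.arange(int(T * fs_fast)) / fs_fast     # time grid of the fast diagnostic

# Toy stand-ins: the quantity measured by Thomson Scattering modulates a fast-sampled signal,
# i.e. the two diagnostics are correlated through the underlying plasma state.
density = 1.0 + 0.5 * np.sin(2 * np.pi * 40 * t_fast)            # toy "electron density"
fast_signal = density * np.sin(2 * np.pi * 3e3 * t_fast) + 0.05 * rng.standard_normal(t_fast.size)

win = 64                                           # window of fast samples used per prediction
def window(i):
    return fast_signal[i - win:i]

# Training pairs exist only where the slow diagnostic actually samples (200 Hz).
slow_idx = np.arange(win, t_fast.size, int(fs_fast / fs_slow))
X_train = np.stack([window(i) for i in slow_idx])
y_train = density[slow_idx]

model = MLPRegressor(hidden_layer_sizes=(64,), max_iter=2000, random_state=0).fit(X_train, y_train)

# Evaluating the learned mapping at the fast rate yields the super-resolved estimate.
fast_idx = np.arange(win, t_fast.size, 50)         # subsampled only to keep the toy example quick
density_hat = model.predict(np.stack([window(i) for i in fast_idx]))
```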
Abstract: As members of the family of recurrent neural networks, and similar to Long Short-Term Memory (LSTM) cells, Echo State Networks (ESNs) are capable of solving temporal tasks, but with a substantially simpler training paradigm based on linear regression. However, optimizing hyper-parameters and implementing the training process efficiently can be overwhelming for first-time users of ESNs. This paper aims to facilitate the understanding of ESNs in theory and practice. Treating ESNs as non-linear filters, we explain the effect of the hyper-parameters using familiar concepts such as impulse responses. Furthermore, the paper introduces the Python toolbox PyRCN (Python Reservoir Computing Network) for developing, training and analyzing ESNs on arbitrarily large datasets. The toolbox is built on widely used scientific packages, such as numpy and scipy, and offers an interface to scikit-learn. Example code and results for classification and regression tasks are provided.
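For readers unfamiliar with the "training by linear regression" paradigm the abstract refers to, the following minimal numpy sketch shows the standard ESN recipe: random input and recurrent weights (rescaled to a target spectral radius), reservoir states collected by driving the network with the input, and only the readout fitted by ridge regression. The hyper-parameter values and the toy task are illustrative; PyRCN wraps this workflow in scikit-learn-style estimators, whose exact interface is not reproduced here.

```python
# Minimal ESN sketch in plain numpy: only the readout (W_out) is trained.
import numpy as np

rng = np.random.default_rng(42)
n_in, n_res = 1, 200                      # input dimension, reservoir size
spectral_radius, ridge = 0.9, 1e-6        # illustrative hyper-parameter values

W_in = rng.uniform(-1.0, 1.0, (n_res, n_in))                   # random input weights
W = rng.uniform(-0.5, 0.5, (n_res, n_res))                     # random recurrent weights
W *= spectral_radius / np.max(np.abs(np.linalg.eigvals(W)))    # rescale to target spectral radius

def collect_states(u):
    """Drive the reservoir with an input sequence u of shape (T, n_in)."""
    x = np.zeros(n_res)
    states = np.empty((len(u), n_res))
    for t, u_t in enumerate(u):
        x = np.tanh(W_in @ u_t + W @ x)   # leaky integration omitted for brevity
        states[t] = x
    return states

# Toy task: one-step-ahead prediction of a sine wave.
u = np.sin(np.linspace(0, 20 * np.pi, 2000))[:, None]
X, y = collect_states(u[:-1]), u[1:, 0]

# The readout is the only trained part: regularized linear (ridge) regression.
W_out = np.linalg.solve(X.T @ X + ridge * np.eye(n_res), X.T @ y)
y_pred = X @ W_out
```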
Abstract: Echo State Networks (ESNs) are a special type of recurrent neural network (RNN) in which the input and recurrent connections are traditionally generated randomly and only the output weights are trained. Despite the recent success of ESNs in various audio, image and radar recognition tasks, we postulate that a purely random initialization is not the ideal way to initialize ESNs. The aim of this work is to propose an unsupervised initialization of the input connections using the K-Means algorithm on the training data. We show that this initialization performs equivalently to or better than a randomly initialized ESN while needing significantly fewer reservoir neurons (2,000 vs. 4,000 for spoken digit recognition, and 300 vs. 8,000 neurons for f0 extraction), thus reducing the training time. Furthermore, we discuss how this approach offers the opportunity to estimate a suitable reservoir size based on prior knowledge about the data.
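The abstract names the technique (K-Means on the training data to initialize the input connections) without giving implementation details. As a rough sketch of one natural reading, the snippet below clusters the training feature frames and uses the centroids as rows of the input weight matrix, so that each reservoir neuron is tuned to a typical region of the input space. The feature layout, the unit-norm scaling and the one-centroid-per-neuron mapping are assumptions for illustration.

```python
# Sketch: K-Means-based initialization of the ESN input weights (assumed mapping, not the paper's code).
import numpy as np
from sklearn.cluster import KMeans

rng = np.random.default_rng(0)
frames = rng.standard_normal((10000, 13))    # placeholder feature frames (T, n_features), e.g. MFCCs

n_res = 300                                  # reservoir size = number of clusters
km = KMeans(n_clusters=n_res, n_init=10, random_state=0).fit(frames)

# Each reservoir neuron's input weights point towards one cluster centre, so the
# neuron responds most strongly to inputs from "its" region of the feature space.
W_in = km.cluster_centers_.copy()
W_in /= np.linalg.norm(W_in, axis=1, keepdims=True) + 1e-12   # unit-norm rows (an assumption)

# W_in (n_res x n_features) replaces the randomly drawn input matrix of a standard ESN;
# the recurrent weights and the linear readout training stay unchanged.
```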