Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Jacob Hochhalter

Diffusion-Based Symbolic Regression

May 30, 2025

Zachary Bastiani, Robert M. Kirby, Jacob Hochhalter, Shandian Zhe

Abstract:Diffusion has emerged as a powerful framework for generative modeling, achieving remarkable success in applications such as image and audio synthesis. Enlightened by this progress, we propose a novel diffusion-based approach for symbolic regression. We construct a random mask-based diffusion and denoising process to generate diverse and high-quality equations. We integrate this generative processes with a token-wise Group Relative Policy Optimization (GRPO) method to conduct efficient reinforcement learning on the given measurement dataset. In addition, we introduce a long short-term risk-seeking policy to expand the pool of top-performing candidates, further enhancing performance. Extensive experiments and ablation studies have demonstrated the effectiveness of our approach.

Via

Access Paper or Ask Questions

Pseudo-Physics-Informed Neural Operators: Enhancing Operator Learning from Limited Data

Feb 04, 2025

Keyan Chen, Yile Li, Da Long, Zhitong Xu, Wei Xing, Jacob Hochhalter, Shandian Zhe

Figure 1 for Pseudo-Physics-Informed Neural Operators: Enhancing Operator Learning from Limited Data

Figure 2 for Pseudo-Physics-Informed Neural Operators: Enhancing Operator Learning from Limited Data

Figure 3 for Pseudo-Physics-Informed Neural Operators: Enhancing Operator Learning from Limited Data

Figure 4 for Pseudo-Physics-Informed Neural Operators: Enhancing Operator Learning from Limited Data

Abstract:Neural operators have shown great potential in surrogate modeling. However, training a well-performing neural operator typically requires a substantial amount of data, which can pose a major challenge in complex applications. In such scenarios, detailed physical knowledge can be unavailable or difficult to obtain, and collecting extensive data is often prohibitively expensive. To mitigate this challenge, we propose the Pseudo Physics-Informed Neural Operator (PPI-NO) framework. PPI-NO constructs a surrogate physics system for the target system using partial differential equations (PDEs) derived from simple, rudimentary physics principles, such as basic differential operators. This surrogate system is coupled with a neural operator model, using an alternating update and learning process to iteratively enhance the model's predictive power. While the physics derived via PPI-NO may not mirror the ground-truth underlying physical laws -- hence the term ``pseudo physics'' -- this approach significantly improves the accuracy of standard operator learning models in data-scarce scenarios, which is evidenced by extensive evaluations across five benchmark tasks and a fatigue modeling application.

Via

Access Paper or Ask Questions

Complexity-Aware Deep Symbolic Regression with Robust Risk-Seeking Policy Gradients

Jun 10, 2024

Zachary Bastiani, Robert M. Kirby, Jacob Hochhalter, Shandian Zhe

Figure 1 for Complexity-Aware Deep Symbolic Regression with Robust Risk-Seeking Policy Gradients

Figure 2 for Complexity-Aware Deep Symbolic Regression with Robust Risk-Seeking Policy Gradients

Figure 3 for Complexity-Aware Deep Symbolic Regression with Robust Risk-Seeking Policy Gradients

Figure 4 for Complexity-Aware Deep Symbolic Regression with Robust Risk-Seeking Policy Gradients

Abstract:This paper proposes a novel deep symbolic regression approach to enhance the robustness and interpretability of data-driven mathematical expression discovery. Despite the success of the state-of-the-art method, DSR, it is built on recurrent neural networks, purely guided by data fitness, and potentially meet tail barriers, which can zero out the policy gradient and cause inefficient model updates. To overcome these limitations, we use transformers in conjunction with breadth-first-search to improve the learning performance. We use Bayesian information criterion (BIC) as the reward function to explicitly account for the expression complexity and optimize the trade-off between interpretability and data fitness. We propose a modified risk-seeking policy that not only ensures the unbiasness of the gradient, but also removes the tail barriers, thus ensuring effective updates from top performers. Through a series of benchmarks and systematic experiments, we demonstrate the advantages of our approach.

Via

Access Paper or Ask Questions

Genetic Programming Based Symbolic Regression for Analytical Solutions to Differential Equations

Feb 07, 2023

Hongsup Oh, Roman Amici, Geoffrey Bomarito, Shandian Zhe, Robert Kirby, Jacob Hochhalter

Figure 1 for Genetic Programming Based Symbolic Regression for Analytical Solutions to Differential Equations

Figure 2 for Genetic Programming Based Symbolic Regression for Analytical Solutions to Differential Equations

Figure 3 for Genetic Programming Based Symbolic Regression for Analytical Solutions to Differential Equations

Figure 4 for Genetic Programming Based Symbolic Regression for Analytical Solutions to Differential Equations

Abstract:In this paper, we present a machine learning method for the discovery of analytic solutions to differential equations. The method utilizes an inherently interpretable algorithm, genetic programming based symbolic regression. Unlike conventional accuracy measures in machine learning we demonstrate the ability to recover true analytic solutions, as opposed to a numerical approximation. The method is verified by assessing its ability to recover known analytic solutions for two separate differential equations. The developed method is compared to a conventional, purely data-driven genetic programming based symbolic regression algorithm. The reliability of successful evolution of the true solution, or an algebraic equivalent, is demonstrated.

* 14 pages, 9 figures

Via

Access Paper or Ask Questions