Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Yinggan Xu

Advancing AI-Scientist Understanding: Making LLM Think Like a Physicist with Interpretable Reasoning

Apr 02, 2025

Yinggan Xu, Hana Kimlee, Yijia Xiao, Di Luo

Abstract:Large Language Models (LLMs) are playing an expanding role in physics research by enhancing reasoning, symbolic manipulation, and numerical computation. However, ensuring the reliability and interpretability of their outputs remains a significant challenge. In our framework, we conceptualize the collaboration between AI and human scientists as a dynamic interplay among three modules: the reasoning module, the interpretation module, and the AI-scientist interaction module. Recognizing that effective physics reasoning demands rigorous logical consistency, quantitative precision, and deep integration with established theoretical models, we introduce the interpretation module to improve the understanding of AI-generated outputs, which is not previously explored in the literature. This module comprises multiple specialized agents, including summarizers, model builders, UI builders, and testers, which collaboratively structure LLM outputs within a physically grounded framework, by constructing a more interpretable science model. A case study demonstrates that our approach enhances transparency, facilitates validation, and strengthens AI-augmented reasoning in scientific discovery.

Via

Access Paper or Ask Questions

Provably Efficient Convergence of Primal-Dual Actor-Critic with Nonlinear Function Approximation

Feb 28, 2022

Jing Dong, Li Shen, Yinggan Xu, Baoxiang Wang

Figure 1 for Provably Efficient Convergence of Primal-Dual Actor-Critic with Nonlinear Function Approximation

Figure 2 for Provably Efficient Convergence of Primal-Dual Actor-Critic with Nonlinear Function Approximation

Figure 3 for Provably Efficient Convergence of Primal-Dual Actor-Critic with Nonlinear Function Approximation

Abstract:We study the convergence of the actor-critic algorithm with nonlinear function approximation under a nonconvex-nonconcave primal-dual formulation. Stochastic gradient descent ascent is applied with an adaptive proximal term for robust learning rates. We show the first efficient convergence result with primal-dual actor-critic with a convergence rate of $\mathcal{O}\left(\sqrt{\frac{\ln \left(N d G^2 \right)}{N}}\right)$ under Markovian sampling, where $G$ is the element-wise maximum of the gradient, $N$ is the number of iterations, and $d$ is the dimension of the gradient. Our result is presented with only the Polyak-\L{}ojasiewicz condition for the dual variables, which is easy to verify and applicable to a wide range of reinforcement learning (RL) scenarios. The algorithm and analysis are general enough to be applied to other RL settings, like multi-agent RL. Empirical results on OpenAI Gym continuous control tasks corroborate our theoretical findings.

Via

Access Paper or Ask Questions