Picture for Yanwei Jia

Yanwei Jia

Sublinear Regret for An Actor-Critic Algorithm in Continuous-Time Linear-Quadratic Reinforcement Learning

Add code
Jul 24, 2024
Viaarxiv icon

Continuous-time Risk-sensitive Reinforcement Learning via Quadratic Variation Penalty

Add code
Apr 19, 2024
Viaarxiv icon

Learning Merton's Strategies in an Incomplete Market: Recursive Entropy Regularization and Biased Gaussian Exploration

Add code
Dec 19, 2023
Viaarxiv icon

q-Learning in Continuous Time

Add code
Jul 02, 2022
Figure 1 for q-Learning in Continuous Time
Figure 2 for q-Learning in Continuous Time
Figure 3 for q-Learning in Continuous Time
Figure 4 for q-Learning in Continuous Time
Viaarxiv icon

Policy Gradient and Actor-Critic Learning in Continuous Time and Space: Theory and Algorithms

Add code
Nov 22, 2021
Figure 1 for Policy Gradient and Actor-Critic Learning in Continuous Time and Space: Theory and Algorithms
Figure 2 for Policy Gradient and Actor-Critic Learning in Continuous Time and Space: Theory and Algorithms
Viaarxiv icon

Policy Evaluation and Temporal-Difference Learning in Continuous Time and Space: A Martingale Approach

Add code
Aug 15, 2021
Figure 1 for Policy Evaluation and Temporal-Difference Learning in Continuous Time and Space: A Martingale Approach
Figure 2 for Policy Evaluation and Temporal-Difference Learning in Continuous Time and Space: A Martingale Approach
Figure 3 for Policy Evaluation and Temporal-Difference Learning in Continuous Time and Space: A Martingale Approach
Figure 4 for Policy Evaluation and Temporal-Difference Learning in Continuous Time and Space: A Martingale Approach
Viaarxiv icon