Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Yota Hashizume

Tsallis Entropy Regularization for Linearly Solvable MDP and Linear Quadratic Regulator

Mar 04, 2024

Yota Hashizume, Koshi Oishi, Kenji Kashima

Abstract:Shannon entropy regularization is widely adopted in optimal control due to its ability to promote exploration and enhance robustness, e.g., maximum entropy reinforcement learning known as Soft Actor-Critic. In this paper, Tsallis entropy, which is a one-parameter extension of Shannon entropy, is used for the regularization of linearly solvable MDP and linear quadratic regulators. We derive the solution for these problems and demonstrate its usefulness in balancing between exploration and sparsity of the obtained control law.

* 6 figures

Via

Access Paper or Ask Questions

Imitation-regularized Optimal Transport on Networks: Provable Robustness and Application to Logistics Planning

Feb 28, 2024

Koshi Oishi, Yota Hashizume, Tomohiko Jimbo, Hirotaka Kaji, Kenji Kashima

Abstract:Network systems form the foundation of modern society, playing a critical role in various applications. However, these systems are at significant risk of being adversely affected by unforeseen circumstances, such as disasters. Considering this, there is a pressing need for research to enhance the robustness of network systems. Recently, in reinforcement learning, the relationship between acquiring robustness and regularizing entropy has been identified. Additionally, imitation learning is used within this framework to reflect experts' behavior. However, there are no comprehensive studies on the use of a similar imitation framework for optimal transport on networks. Therefore, in this study, imitation-regularized optimal transport (I-OT) on networks was investigated. It encodes prior knowledge on the network by imitating a given prior distribution. The I-OT solution demonstrated robustness in terms of the cost defined on the network. Moreover, we applied the I-OT to a logistics planning problem using real data. We also examined the imitation and apriori risk information scenarios to demonstrate the usefulness and implications of the proposed method.

Via

Access Paper or Ask Questions

Resilience Evaluation of Entropy Regularized Logistic Networks with Probabilistic Cost

Dec 05, 2022

Koshi Oishi, Yota Hashizume, Tomohiko Jimbo, Hirotaka Kaji, Kenji Kashima

Figure 1 for Resilience Evaluation of Entropy Regularized Logistic Networks with Probabilistic Cost

Figure 2 for Resilience Evaluation of Entropy Regularized Logistic Networks with Probabilistic Cost

Figure 3 for Resilience Evaluation of Entropy Regularized Logistic Networks with Probabilistic Cost

Figure 4 for Resilience Evaluation of Entropy Regularized Logistic Networks with Probabilistic Cost

Abstract:The demand for resilient logistics networks has increased because of recent disasters. When we consider optimization problems, entropy regularization is a powerful tool for the diversification of a solution. In this study, we proposed a method for designing a resilient logistics network based on entropy regularization. Moreover, we proposed a method for analytical resilience criteria to reduce the ambiguity of resilience. First, we modeled the logistics network, including factories, distribution bases, and sales outlets in an efficient framework using entropy regularization. Next, we formulated a resilience criterion based on probabilistic cost and Kullback--Leibler divergence. Finally, our method was performed using a simple logistics network, and the resilience of the three logistics plans designed by entropy regularization was demonstrated.

* Under review at IFAC World Congress 2023. 6 pages

Via

Access Paper or Ask Questions