Toygun Basaklar

PD-MORL: Preference-Driven Multi-Objective Reinforcement Learning Algorithm

Aug 16, 2022

Toygun Basaklar, Suat Gumussoy, Umit Y. Ogras

Abstract: Many real-world problems involve multiple, possibly conflicting, objectives. Multi-objective reinforcement learning (MORL) approaches have emerged to tackle these problems by maximizing a joint objective function weighted by a preference vector. These approaches find fixed customized policies corresponding to preference vectors specified during training. However, the design constraints and objectives typically change dynamically in real-life scenarios. Furthermore, storing a policy for each potential preference is not scalable. Hence, obtaining a set of Pareto front solutions for the entire preference space in a given domain with a single training is critical. To this end, we propose a novel MORL algorithm that trains a single universal network to cover the entire preference space. The proposed approach, Preference-Driven MORL (PD-MORL), utilizes the preferences as guidance to update the network parameters. After demonstrating PD-MORL using classical Deep Sea Treasure and Fruit Tree Navigation benchmarks, we evaluate its performance on challenging multi-objective continuous control tasks.

* 24 pages, 9 Figures, 9 Tables
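
The abstract refers to a joint objective weighted by a preference vector and to Pareto front solutions. The following is a minimal sketch of those two ideas only, not the PD-MORL algorithm itself. The function names and the candidate outcomes are hypothetical and chosen purely for illustration.

```python
from typing import List, Sequence, Tuple


def scalarize(returns: Sequence[float], preference: Sequence[float]) -> float:
    """Weighted sum of per-objective returns under a preference vector."""
    if len(returns) != len(preference):
        raise ValueError("returns and preference must have the same length")
    return sum(w * r for w, r in zip(preference, returns))


def dominates(a: Sequence[float], b: Sequence[float]) -> bool:
    """True if a is no worse than b in every objective and strictly
    better in at least one (all objectives are maximized)."""
    return all(x >= y for x, y in zip(a, b)) and any(x > y for x, y in zip(a, b))


def pareto_front(points: List[Tuple[float, ...]]) -> List[Tuple[float, ...]]:
    """Keep only the points that no other point dominates."""
    return [p for p in points if not any(dominates(q, p) for q in points if q != p)]


# Hypothetical (reward, negative time cost) outcomes, made up for illustration.
candidates = [(1.0, -1.0), (5.0, -3.0), (4.0, -4.0), (8.0, -7.0)]
front = pareto_front(candidates)

# Different preference vectors select different points on the front.
balanced = max(front, key=lambda p: scalarize(p, (0.5, 0.5)))
reward_heavy = max(front, key=lambda p: scalarize(p, (0.9, 0.1)))
time_heavy = max(front, key=lambda p: scalarize(p, (0.1, 0.9)))
```

In this toy setting, changing the preference vector moves the selected solution along the front, which is the behavior a single preference-conditioned policy would need to reproduce without retraining.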

tinyMAN: Lightweight Energy Manager using Reinforcement Learning for Energy Harvesting Wearable IoT Devices

Feb 18, 2022

Toygun Basaklar, Yigit Tuncel, Umit Y. Ogras

Abstract: Advances in low-power electronics and machine learning techniques lead to many novel wearable IoT devices. These devices have limited battery capacity and computational power. Thus, energy harvesting from ambient sources is a promising solution to power these low-energy wearable devices. They need to manage the harvested energy optimally to achieve energy-neutral operation, which eliminates recharging requirements. Optimal energy management is a challenging task due to the dynamic nature of the harvested energy and the battery energy constraints of the target device. To address this challenge, we present a reinforcement learning-based energy management framework, tinyMAN, for resource-constrained wearable IoT devices. The framework maximizes the utilization of the target device under dynamic energy harvesting patterns and battery constraints. Moreover, tinyMAN does not rely on forecasts of the harvested energy, which makes it a prediction-free approach. We deployed tinyMAN on a wearable device prototype using TensorFlow Lite for Micro thanks to its small memory footprint of less than 100 KB. Our evaluations show that tinyMAN achieves less than 2.36 ms and 27.75 $\mu$J while maintaining up to 45% higher utility compared to prior approaches.

* 7 pages, 4 figures, accepted as "Full Paper" for the 2022 tinyML Research Symposium

Hypervector Design for Efficient Hyperdimensional Computing on Edge Devices

Mar 08, 2021

Toygun Basaklar, Yigit Tuncel, Shruti Yadav Narayana, Suat Gumussoy, Umit Y. Ogras

Abstract: Hyperdimensional computing (HDC) has emerged as a new light-weight learning algorithm with smaller computation and energy requirements compared to conventional techniques. In HDC, data points are represented by high-dimensional vectors (hypervectors), which are mapped to high-dimensional space (hyperspace). Typically, a large hypervector dimension ($\geq1000$) is required to achieve accuracies comparable to conventional alternatives. However, unnecessarily large hypervectors increase hardware and energy costs, which can undermine their benefits. This paper presents a technique to minimize the hypervector dimension while maintaining the accuracy and improving the robustness of the classifier. To this end, we formulate the hypervector design as a multi-objective optimization problem for the first time in the literature. The proposed approach decreases the hypervector dimension by more than $32\times$ while maintaining or increasing the accuracy achieved by conventional HDC. Experiments on a commercial hardware platform show that the proposed approach achieves more than one order of magnitude reduction in model size, inference time, and energy consumption. We also demonstrate the trade-off between accuracy and robustness to noise and provide Pareto front solutions as a design parameter in our hypervector design.

* 9 pages, 6 figures, accepted to tinyML 2021 Research Symposium
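
To make the role of the hypervector dimension more concrete, here is a minimal sketch of generic HDC-style classification with random bipolar hypervectors. It is not the hypervector design technique proposed in the paper. The class labels, the dimension of 1000, the 20% noise level, and all function names are assumptions made for illustration.

```python
import random
from typing import Dict, List


def random_hypervector(dim: int, rng: random.Random) -> List[int]:
    """Random bipolar hypervector with entries in {-1, +1}."""
    return [rng.choice((-1, 1)) for _ in range(dim)]


def similarity(a: List[int], b: List[int]) -> float:
    """Normalized dot product: 1.0 for identical vectors, near 0 for unrelated ones."""
    return sum(x * y for x, y in zip(a, b)) / len(a)


def flip_noise(v: List[int], fraction: float, rng: random.Random) -> List[int]:
    """Flip the sign of a given fraction of randomly chosen entries."""
    flipped = set(rng.sample(range(len(v)), int(fraction * len(v))))
    return [-x if i in flipped else x for i, x in enumerate(v)]


def classify(query: List[int], prototypes: Dict[str, List[int]]) -> str:
    """Return the label of the most similar class prototype."""
    return max(prototypes, key=lambda label: similarity(query, prototypes[label]))


rng = random.Random(0)  # fixed seed so the sketch is repeatable
DIM = 1000              # assumed dimension, in line with the >= 1000 figure above

# Hypothetical class prototypes; real HDC would build these from training data.
prototypes = {
    "walk": random_hypervector(DIM, rng),
    "sit": random_hypervector(DIM, rng),
}

# A query that is the "walk" prototype with 20% of its entries corrupted.
query = flip_noise(prototypes["walk"], 0.2, rng)
label = classify(query, prototypes)
```

Because unrelated random hypervectors are nearly orthogonal in high dimensions, the corrupted query stays much closer to its own prototype than to the other one. Shrinking `DIM` reduces that margin, which is the accuracy and robustness trade-off the abstract describes.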

MGait: Model-Based Gait Analysis Using Wearable Bend and Inertial Sensors

Feb 23, 2021

Sizhe An, Yigit Tuncel, Toygun Basaklar, Gokul Krishna Krishnakumar, Ganapati Bhat, Umit Ogras

Abstract: Movement disorders, such as Parkinson's disease, affect more than 10 million people worldwide. Gait analysis is a critical step in the diagnosis and rehabilitation of these disorders. Specifically, step length provides valuable insights into the gait quality and rehabilitation process. However, traditional approaches for estimating step length are not suitable for continuous daily monitoring since they rely on special mats and clinical environments. To address this limitation, we present a novel and practical step-length estimation technique using low-power wearable bend and inertial sensors. Experimental results show that the proposed model estimates step length with 5.49% mean absolute percentage error and provides accurate real-time feedback to the user.
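
For readers unfamiliar with the metric quoted above, the following is a short sketch of how mean absolute percentage error (MAPE) is commonly computed. The step lengths in the example are invented for illustration and are not data from the paper.

```python
from typing import Sequence


def mape(actual: Sequence[float], predicted: Sequence[float]) -> float:
    """Mean absolute percentage error, in percent.

    Each error is measured relative to the true value, so true values
    must be non-zero.
    """
    if len(actual) != len(predicted) or not actual:
        raise ValueError("inputs must be non-empty and of equal length")
    if any(a == 0 for a in actual):
        raise ValueError("actual values must be non-zero")
    total = sum(abs((a - p) / a) for a, p in zip(actual, predicted))
    return 100.0 * total / len(actual)


# Hypothetical step lengths in meters (reference vs. estimate).
reference = [0.50, 0.60, 0.80]
estimate = [0.55, 0.57, 0.80]
error_percent = mape(reference, estimate)  # roughly 5 percent
```

The relative errors in this example are 10%, 5%, and 0%, so their mean is about 5%, which is the same order as the figure reported in the abstract.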
