Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Nihal Acharya Adde

Robust Evolutionary Multi-Objective Network Architecture Search for Reinforcement Learning (EMNAS-RL)

Jun 10, 2025

Nihal Acharya Adde, Alexandra Gianzina, Hanno Gottschalk, Andreas Ebert

Abstract:This paper introduces Evolutionary Multi-Objective Network Architecture Search (EMNAS) for the first time to optimize neural network architectures in large-scale Reinforcement Learning (RL) for Autonomous Driving (AD). EMNAS uses genetic algorithms to automate network design, tailored to enhance rewards and reduce model size without compromising performance. Additionally, parallelization techniques are employed to accelerate the search, and teacher-student methodologies are implemented to ensure scalable optimization. This research underscores the potential of transfer learning as a robust framework for optimizing performance across iterative learning processes by effectively leveraging knowledge from earlier generations to enhance learning efficiency and stability in subsequent generations. Experimental results demonstrate that tailored EMNAS outperforms manually designed models, achieving higher rewards with fewer parameters. The findings of these strategies contribute positively to EMNAS for RL in autonomous driving, advancing the field toward better-performing networks suitable for real-world scenarios.

* Published at ESANN 2025 Conference

Via

Access Paper or Ask Questions

Hyperparameter Optimization for Driving Strategies Based on Reinforcement Learning

Jul 19, 2024

Nihal Acharya Adde, Hanno Gottschalk, Andreas Ebert

Abstract:This paper focuses on hyperparameter optimization for autonomous driving strategies based on Reinforcement Learning. We provide a detailed description of training the RL agent in a simulation environment. Subsequently, we employ Efficient Global Optimization algorithm that uses Gaussian Process fitting for hyperparameter optimization in RL. Before this optimization phase, Gaussian process interpolation is applied to fit the surrogate model, for which the hyperparameter set is generated using Latin hypercube sampling. To accelerate the evaluation, parallelization techniques are employed. Following the hyperparameter optimization procedure, a set of hyperparameters is identified, resulting in a noteworthy enhancement in overall driving performance. There is a substantial increase of 4\% when compared to existing manually tuned parameters and the hyperparameters discovered during the initialization process using Latin hypercube sampling. After the optimization, we analyze the obtained results thoroughly and conduct a sensitivity analysis to assess the robustness and generalization capabilities of the learned autonomous driving strategies. The findings from this study contribute to the advancement of Gaussian process based Bayesian optimization to optimize the hyperparameters for autonomous driving in RL, providing valuable insights for the development of efficient and reliable autonomous driving systems.

* Submitted and accepted by LOD24 conference https://lod2024.icas.events/

Via

Access Paper or Ask Questions

Finding hidden-feature depending laws inside a data set and classifying it using Neural Network

Jan 25, 2021

Thilo Moshagen, Nihal Acharya Adde, Ajay Navilarekal Rajgopal

Figure 1 for Finding hidden-feature depending laws inside a data set and classifying it using Neural Network

Figure 2 for Finding hidden-feature depending laws inside a data set and classifying it using Neural Network

Figure 3 for Finding hidden-feature depending laws inside a data set and classifying it using Neural Network

Figure 4 for Finding hidden-feature depending laws inside a data set and classifying it using Neural Network

Abstract:The logcosh loss function for neural networks has been developed to combine the advantage of the absolute error loss function of not overweighting outliers with the advantage of the mean square error of continuous derivative near the mean, which makes the last phase of learning easier. It is clear, and one experiences it soon, that in the case of clustered data, an artificial neural network with logcosh loss learns the bigger cluster rather than the mean of the two. Even more so, the ANN, when used for regression of a set-valued function, will learn a value close to one of the choices, in other words, one branch of the set-valued function, while a mean-square-error NN will learn the value in between. This work suggests a method that uses artificial neural networks with logcosh loss to find the branches of set-valued mappings in parameter-outcome sample sets and classifies the samples according to those branches.

* arXiv admin note: substantial text overlap with arXiv:2011.07332

Via

Access Paper or Ask Questions

Classification based on invisible features and thereby finding the effect of tuberculosis vaccine on COVID-19

Nov 14, 2020

Nihal Acharya Adde, Thilo Moshagen

Figure 1 for Classification based on invisible features and thereby finding the effect of tuberculosis vaccine on COVID-19

Figure 2 for Classification based on invisible features and thereby finding the effect of tuberculosis vaccine on COVID-19

Figure 3 for Classification based on invisible features and thereby finding the effect of tuberculosis vaccine on COVID-19

Figure 4 for Classification based on invisible features and thereby finding the effect of tuberculosis vaccine on COVID-19

Abstract:In the case of clustered data, an artificial neural network with logcosh loss function learns the bigger cluster rather than the mean of the two. Even more so, the ANN when used for regression of a set-valued function, will learn a value close to one of the choices, in other words, it learns one branch of the set-valued function with high accuracy. This work suggests a method that uses artificial neural networks with logcosh loss to find the branches of set-valued mappings in parameter-outcome sample sets and classifies the samples according to those branches. The method not only classifies the data based on these branches but also provides an accurate prediction for the majority cluster. The method successfully classifies the data based on an invisible feature. A neural network was successfully established to predict the total number of cases, the logarithmic total number of cases, deaths, active cases and other relevant data of the coronavirus for each German district from a number of input variables. As it has been speculated that the Tuberculosis vaccine provides protection against the virus and since East Germany was vaccinated before reunification, an attempt was made to classify the Eastern and Western German districts by considering the vaccine information as an invisible feature.

Via

Access Paper or Ask Questions