Giuseppe Nuti

Ray-Tracing for Conditionally Activated Neural Networks

Feb 20, 2025

Claudio Gallicchio, Giuseppe Nuti

Abstract: In this paper, we introduce a novel architecture for conditionally activated neural networks that combines a hierarchical construction of multiple Mixture of Experts (MoE) layers with a sampling mechanism that progressively converges to an optimized configuration of expert activation. This methodology enables the dynamic unfolding of the network's architecture, facilitating efficient path-specific training. Experimental results demonstrate that this approach achieves competitive accuracy compared to conventional baselines while significantly reducing the parameter count required for inference. Notably, this parameter reduction correlates with the complexity of the input patterns, a property that emerges naturally from the network's operational dynamics without requiring explicit auxiliary penalty functions.

* submitted to workshop

Hedging using reinforcement learning: Contextual $k$-Armed Bandit versus $Q$-learning

Jul 03, 2020

Loris Cannelli, Giuseppe Nuti, Marzio Sala, Oleg Szehr

Abstract: The construction of replication strategies for contingent claims in the presence of risk and market friction is a key problem of financial engineering. In real markets, continuous replication, such as in the model of Black, Scholes and Merton, is not only unrealistic but also undesirable due to high transaction costs. Over the last decades, stochastic optimal-control methods have been developed to balance effective replication against losses. More recently, with the rise of artificial intelligence, temporal-difference Reinforcement Learning, in particular variations of $Q$-learning in conjunction with Deep Neural Networks, has attracted significant interest. From a practical point of view, however, such methods are often relatively sample inefficient, hard to train, and lacking in performance guarantees. This motivates the investigation of a stable benchmark algorithm for hedging. In this article, the hedging problem is viewed as an instance of a risk-averse contextual $k$-armed bandit problem, for which a large body of theoretical results and well-studied algorithms is available. We find that the $k$-armed bandit model naturally fits the $P\&L$ formulation of hedging, providing a more accurate and sample-efficient approach than $Q$-learning and reducing to the Black-Scholes model in the absence of transaction costs and risks.

* 15 pages, 7 figures

Adaptive Bayesian Reticulum

Jan 29, 2020

Giuseppe Nuti, Lluís Antoni Jiménez Rugama, Kaspar Thommen

Abstract: Neural Networks and Random Forests, two popular techniques for supervised learning that are seemingly disconnected in their formulation and optimization method, have recently been linked in a single construct. The connection pivots on assembling an artificial Neural Network with nodes that allow for a gate-like function to mimic a tree split, optimized using the standard approach of recursively applying the chain rule to update its parameters. Yet two main challenges have impeded wide use of this hybrid approach: (a) the inability of global gradient descent techniques to optimize hierarchical parameters (as introduced by the gate function); and (b) the construction of the tree structure, which has relied either on standard decision tree algorithms to learn the network topology or on incrementally (and heuristically) searching the space at random. We propose a probabilistic construct that exploits the idea of a node's unexplained potential (the total error channeled through the node) in order to decide where to expand further, mimicking the standard tree construction in a Neural Network setting, alongside a modified gradient descent that first locally optimizes an expanded node before a global optimization. The probabilistic approach allows us to evaluate each new split as a ratio of likelihoods that balances the statistical improvement in explaining the evidence against the additional model complexity, thus providing a natural stopping condition. The result is a novel classification and regression technique that leverages the strengths of both: a tree structure that grows naturally and is simple to interpret, combined with the plasticity of Neural Networks, which allows for soft margins and slanted boundaries.

* 23 pages, 7 figures, 3 tables
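
To make the idea of scoring a split by a ratio of likelihoods more concrete, here is a minimal sketch. It is not the paper's construct: it assumes binary labels, a conjugate Beta(1, 1) prior, and a hard (not gate-like) split, and the function names are hypothetical. It compares the marginal likelihood of the data when each side of a split gets its own label probability against the marginal likelihood when both sides share one; because the marginal likelihood integrates over the parameters, the extra parameter of the split model is penalized automatically.

```python
import math


def log_beta(a, b):
    """Log of the Beta function B(a, b)."""
    return math.lgamma(a) + math.lgamma(b) - math.lgamma(a + b)


def log_evidence(ones, zeros, a=1.0, b=1.0):
    """Log marginal likelihood of a binary sequence with the given counts,
    with the Bernoulli parameter integrated out under a Beta(a, b) prior."""
    return log_beta(a + ones, b + zeros) - log_beta(a, b)


def log_split_ratio(left, right, a=1.0, b=1.0):
    """Log ratio of evidence: split model versus no-split model.

    left, right: (ones, zeros) label counts on each side of a candidate split.
    A positive value favours the split; a negative value favours stopping.
    (Assumption: equal prior weight on the two models.)
    """
    l1, l0 = left
    r1, r0 = right
    with_split = log_evidence(l1, l0, a, b) + log_evidence(r1, r0, a, b)
    without_split = log_evidence(l1 + r1, l0 + r0, a, b)
    return with_split - without_split


if __name__ == "__main__":
    # A split that separates the classes cleanly versus one that does not.
    print(log_split_ratio((10, 0), (0, 10)))
    print(log_split_ratio((5, 5), (5, 5)))
```

In this toy setting a split that separates the labels yields a positive log ratio, while a split that leaves both sides equally mixed yields a negative one, which is the sense in which such a ratio can act as a stopping condition.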

A Bayesian Decision Tree Algorithm

Jan 11, 2019

Giuseppe Nuti, Lluís Antoni Jiménez Rugama, Andreea-Ingrid Cross

Abstract: Bayesian Decision Trees are known for their probabilistic interpretability. However, their construction can sometimes be costly. In this article, we present a general Bayesian Decision Tree algorithm applicable to both regression and classification problems. The algorithm does not apply Markov Chain Monte Carlo and does not require a pruning step. While it is possible to construct a weighted probability tree space, we find that one particular tree, the greedy-modal tree (GMT), explains most of the information contained in the numerical examples. This approach seems to perform similarly to Random Forests.

* 15 pages, 5 figures

An Efficient Algorithm for Bayesian Nearest Neighbours

Jun 02, 2017

Giuseppe Nuti

Abstract: K-Nearest Neighbours (k-NN) is a popular classification and regression algorithm, yet one of its main limitations is the difficulty in choosing the number of neighbours. We present a Bayesian algorithm to compute the posterior probability distribution for k given a target point within a data-set, efficiently and without the use of Markov Chain Monte Carlo (MCMC) methods or simulation, alongside an exact solution for distributions within the exponential family. The central idea is that data points around our target are generated by the same probability distribution, extending outwards over the appropriate, though unknown, number of neighbours. Once the data is projected onto a distance metric of choice, we can transform the choice of k into a change-point detection problem, for which there is an efficient solution: we recursively compute the probability of the last change-point as we move towards our target, and thus de facto compute the posterior probability distribution over k. Applying this approach to both a classification and a regression UCI data-set, we compare favourably and, most importantly, by removing the need for simulation, we are able to compute the posterior probability of k exactly and rapidly. As an example, the computational time for the Ripley data-set is a few milliseconds, compared to a few hours when using an MCMC approach.
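
The change-point view of choosing k can be illustrated with a small sketch. This is not the recursive algorithm of the paper: it assumes binary labels, exactly one change-point, a Beta(1, 1) prior on each segment, and a uniform prior on k, and the function names are hypothetical. Labels are ordered by distance from the target; the first k are assumed to share one label probability and the remaining ones another, and the posterior over k is proportional to the product of the two segment evidences.

```python
import math


def log_beta(a, b):
    """Log of the Beta function B(a, b)."""
    return math.lgamma(a) + math.lgamma(b) - math.lgamma(a + b)


def log_evidence(ones, zeros, a=1.0, b=1.0):
    """Log marginal likelihood of a binary sequence with the given counts
    under a Beta(a, b) prior on the Bernoulli parameter."""
    return log_beta(a + ones, b + zeros) - log_beta(a, b)


def posterior_over_k(labels_by_distance, a=1.0, b=1.0):
    """Posterior probabilities for k = 1..n (index 0 corresponds to k = 1).

    labels_by_distance: 0/1 labels of the data points, nearest to the
    target first. Assumes a uniform prior over k.
    """
    n = len(labels_by_distance)
    total_ones = sum(labels_by_distance)
    log_post = []
    ones = 0
    for k in range(1, n + 1):
        ones += labels_by_distance[k - 1]
        zeros = k - ones
        rest_ones = total_ones - ones
        rest_zeros = (n - k) - rest_ones
        log_post.append(
            log_evidence(ones, zeros, a, b)
            + log_evidence(rest_ones, rest_zeros, a, b)
        )
    # Normalize in a numerically stable way.
    top = max(log_post)
    weights = [math.exp(v - top) for v in log_post]
    norm = sum(weights)
    return [w / norm for w in weights]


if __name__ == "__main__":
    labels = [1, 1, 1, 1, 0, 0, 0, 0]  # nearest neighbour first
    post = posterior_over_k(labels)
    best_k = post.index(max(post)) + 1
    print(best_k, [round(p, 3) for p in post])
```

With labels that switch after the fourth neighbour, the posterior mass concentrates on k = 4. Each evidence term here is available in closed form because the Beta prior is conjugate to the Bernoulli likelihood, which is the kind of exponential-family structure the abstract refers to.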
