Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Philippe Chlenski

Even Faster Hyperbolic Random Forests: A Beltrami-Klein Wrapper Approach

Jun 04, 2025

Philippe Chlenski, Itsik Pe'er

Abstract:Decision trees and models that use them as primitives are workhorses of machine learning in Euclidean spaces. Recent work has further extended these models to the Lorentz model of hyperbolic space by replacing axis-parallel hyperplanes with homogeneous hyperplanes when partitioning the input space. In this paper, we show how the hyperDT algorithm can be elegantly reexpressed in the Beltrami-Klein model of hyperbolic spaces. This preserves the thresholding operation used in Euclidean decision trees, enabling us to further rewrite hyperDT as simple pre- and post-processing steps that form a wrapper around existing tree-based models designed for Euclidean spaces. The wrapper approach unlocks many optimizations already available in Euclidean space models, improving flexibility, speed, and accuracy while offering a simpler, more maintainable, and extensible codebase. Our implementation is available at https://github.com/pchlenski/hyperdt.

* 15 pages, 4 figures, 2 tables

Via

Access Paper or Ask Questions

Manify: A Python Library for Learning Non-Euclidean Representations

Mar 12, 2025

Philippe Chlenski, Kaizhu Du, Dylan Satow, Itsik Pe'er

Abstract:We present Manify, an open-source Python library for non-Euclidean representation learning. Leveraging manifold learning techniques, Manify provides tools for learning embeddings in (products of) non-Euclidean spaces, performing classification and regression with data that lives in such spaces, and estimating the curvature of a manifold. Manify aims to advance research and applications in machine learning by offering a comprehensive suite of tools for manifold-based data analysis. Our source code, examples, datasets, results, and documentation are available at https://github.com/pchlenski/manify

* 30 pages, 4 figures, 4 tables. Preprint

Via

Access Paper or Ask Questions

Transcoders Find Interpretable LLM Feature Circuits

Jun 17, 2024

Jacob Dunefsky, Philippe Chlenski, Neel Nanda

Figure 1 for Transcoders Find Interpretable LLM Feature Circuits

Figure 2 for Transcoders Find Interpretable LLM Feature Circuits

Figure 3 for Transcoders Find Interpretable LLM Feature Circuits

Figure 4 for Transcoders Find Interpretable LLM Feature Circuits

Abstract:A key goal in mechanistic interpretability is circuit analysis: finding sparse subgraphs of models corresponding to specific behaviors or capabilities. However, MLP sublayers make fine-grained circuit analysis on transformer-based language models difficult. In particular, interpretable features -- such as those found by sparse autoencoders (SAEs) -- are typically linear combinations of extremely many neurons, each with its own nonlinearity to account for. Circuit analysis in this setting thus either yields intractably large circuits or fails to disentangle local and global behavior. To address this we explore transcoders, which seek to faithfully approximate a densely activating MLP layer with a wider, sparsely-activating MLP layer. We successfully train transcoders on language models with 120M, 410M, and 1.4B parameters, and find them to perform at least on par with SAEs in terms of sparsity, faithfulness, and human-interpretability. We then introduce a novel method for using transcoders to perform weights-based circuit analysis through MLP sublayers. The resulting circuits neatly factorize into input-dependent and input-invariant terms. Finally, we apply transcoders to reverse-engineer unknown circuits in the model, and we obtain novel insights regarding the greater-than circuit in GPT2-small. Our results suggest that transcoders can prove effective in decomposing model computations involving MLPs into interpretable circuits. Code is available at https://github.com/jacobdunefsky/transcoder_circuits.

* 28 pages, 6 figures, 3 tables, 2 algorithms. Under review

Via

Access Paper or Ask Questions

Mixed-Curvature Decision Trees and Random Forests

Jun 07, 2024

Philippe Chlenski, Quentin Chu, Itsik Pe'er

Figure 1 for Mixed-Curvature Decision Trees and Random Forests

Figure 2 for Mixed-Curvature Decision Trees and Random Forests

Figure 3 for Mixed-Curvature Decision Trees and Random Forests

Abstract:We extend decision tree and random forest algorithms to mixed-curvature product spaces. Such spaces, defined as Cartesian products of Euclidean, hyperspherical, and hyperbolic manifolds, can often embed points from pairwise distances with much lower distortion than in single manifolds. To date, all classifiers for product spaces fit a single linear decision boundary, and no regressor has been described. Our method overcomes these limitations by enabling simple, expressive classification and regression in product manifolds. We demonstrate the superior accuracy of our tool compared to Euclidean methods operating in the ambient space for component manifolds covering a wide range of curvatures, as well as on a selection of product manifolds.

Via

Access Paper or Ask Questions

Variational Pseudo Marginal Methods for Jet Reconstruction in Particle Physics

Jun 05, 2024

Hanming Yang, Antonio Khalil Moretti, Sebastian Macaluso, Philippe Chlenski, Christian A. Naesseth, Itsik Pe'er

Figure 1 for Variational Pseudo Marginal Methods for Jet Reconstruction in Particle Physics

Figure 2 for Variational Pseudo Marginal Methods for Jet Reconstruction in Particle Physics

Figure 3 for Variational Pseudo Marginal Methods for Jet Reconstruction in Particle Physics

Figure 4 for Variational Pseudo Marginal Methods for Jet Reconstruction in Particle Physics

Abstract:Reconstructing jets, which provide vital insights into the properties and histories of subatomic particles produced in high-energy collisions, is a main problem in data analyses in collider physics. This intricate task deals with estimating the latent structure of a jet (binary tree) and involves parameters such as particle energy, momentum, and types. While Bayesian methods offer a natural approach for handling uncertainty and leveraging prior knowledge, they face significant challenges due to the super-exponential growth of potential jet topologies as the number of observed particles increases. To address this, we introduce a Combinatorial Sequential Monte Carlo approach for inferring jet latent structures. As a second contribution, we leverage the resulting estimator to develop a variational inference algorithm for parameter learning. Building on this, we introduce a variational family using a pseudo-marginal framework for a fully Bayesian treatment of all variables, unifying the generative model with the inference process. We illustrate our method's effectiveness through experiments using data generated with a collider physics generative model, highlighting superior speed and accuracy across a range of tasks.

Via

Access Paper or Ask Questions

Fast hyperboloid decision tree algorithms

Oct 20, 2023

Philippe Chlenski, Ethan Turok, Antonio Moretti, Itsik Pe'er

Figure 1 for Fast hyperboloid decision tree algorithms

Figure 2 for Fast hyperboloid decision tree algorithms

Figure 3 for Fast hyperboloid decision tree algorithms

Figure 4 for Fast hyperboloid decision tree algorithms

Abstract:Hyperbolic geometry is gaining traction in machine learning for its effectiveness at capturing hierarchical structures in real-world data. Hyperbolic spaces, where neighborhoods grow exponentially, offer substantial advantages and consistently deliver state-of-the-art results across diverse applications. However, hyperbolic classifiers often grapple with computational challenges. Methods reliant on Riemannian optimization frequently exhibit sluggishness, stemming from the increased computational demands of operations on Riemannian manifolds. In response to these challenges, we present hyperDT, a novel extension of decision tree algorithms into hyperbolic space. Crucially, hyperDT eliminates the need for computationally intensive Riemannian optimization, numerically unstable exponential and logarithmic maps, or pairwise comparisons between points by leveraging inner products to adapt Euclidean decision tree algorithms to hyperbolic space. Our approach is conceptually straightforward and maintains constant-time decision complexity while mitigating the scalability issues inherent in high-dimensional Euclidean spaces. Building upon hyperDT we introduce hyperRF, a hyperbolic random forest model. Extensive benchmarking across diverse datasets underscores the superior performance of these models, providing a swift, precise, accurate, and user-friendly toolkit for hyperbolic data analysis.

Via

Access Paper or Ask Questions