Alexander Kolpakov

Fast Geometric Embedding for Node Influence Maximization

Jun 09, 2025

Alexander Kolpakov, Igor Rivin

Abstract: Computing classical centrality measures such as betweenness and closeness is computationally expensive on large-scale graphs. In this work, we introduce an efficient force layout algorithm that embeds a graph into a low-dimensional space, where the radial distance from the origin serves as a proxy for various centrality measures. We evaluate our method on multiple graph families and demonstrate strong correlations with degree, PageRank, and path-based centralities. As an application, the proposed embedding makes it possible to find high-influence nodes in a network, and provides a fast and scalable alternative to the standard greedy algorithm.

* 8 pages, 4 figures, 18 tables; GitHub repository available (https://github.com/sashakolpakov/graphem/); Package available on PyPI (https://pypi.org/project/graphem-jax/)

DiRe-JAX: A JAX based Dimensionality Reduction Algorithm for Large-scale Data

Mar 06, 2025

Alexander Kolpakov, Igor Rivin

Abstract: DiRe-JAX is a new dimensionality reduction toolkit designed to address some of the challenges faced by traditional methods like UMAP and t-SNE, such as loss of global structure and limited computational efficiency. Built on the JAX framework, DiRe leverages modern hardware acceleration to provide an efficient, scalable, and interpretable solution for visualizing complex data structures, and for quantitative analysis of lower-dimensional embeddings. The toolkit shows considerable promise in preserving both local and global structures within the data as compared to state-of-the-art UMAP and t-SNE implementations. This makes it suitable for a wide range of applications in machine learning, bioinformatics, and data science.

* 22 pages, 12 figures; GitHub repository available at https://github.com/sashakolpakov/dire-jax; Package available on PyPI at https://pypi.org/project/dire-jax/

Machine Learning of the Prime Distribution

Mar 19, 2024

Alexander Kolpakov, Aidan Rocke

Abstract: In the present work we use maximum entropy methods to derive several theorems in probabilistic number theory, including a version of the Hardy-Ramanujan Theorem. We also provide a theoretical argument explaining the experimental observations of Y.-H. He about the learnability of primes, and posit that the Erdős-Kac law is very unlikely to be discovered by current machine learning techniques. Numerical experiments that we perform corroborate our theoretical findings.

* 10 pages; parts of arXiv:2308.10817 reworked and amended; author's draft; accepted in PLOS ONE
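
For orientation, the classical forms of the two results named in the abstract can be stated as follows. These are the standard textbook statements, not necessarily the versions derived in the paper. Here $\omega(n)$ denotes the number of distinct prime factors of $n$. Hardy-Ramanujan: for every $\varepsilon > 0$, almost all integers $n$ (in the sense of natural density) satisfy

$$|\omega(n) - \log\log n| < (\log\log n)^{1/2 + \varepsilon}.$$

Erdős-Kac: for any fixed real numbers $a < b$,

$$\lim_{N \to \infty} \frac{1}{N}\,\#\left\{ n \le N : a \le \frac{\omega(n) - \log\log n}{\sqrt{\log\log n}} \le b \right\} = \frac{1}{\sqrt{2\pi}} \int_a^b e^{-t^2/2}\,dt.$$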

A ripple in time: a discontinuity in American history

Dec 02, 2023

Alexander Kolpakov, Igor Rivin

Abstract: In this note we use the State of the Union Address dataset from Kaggle to make some surprising (and some not so surprising) observations pertaining to the general timeline of American history, and the character and nature of the addresses themselves. Our main approach is using vector embeddings, such as BERT (DistilBERT) and GPT-2. While it is widely believed that BERT (and its variations) is most suitable for NLP classification tasks, we find that GPT-2 in conjunction with nonlinear dimension reduction methods such as UMAP provides better separation and stronger clustering. This makes GPT-2 + UMAP an interesting alternative. In our case, no model fine-tuning is required, and the pre-trained out-of-the-box GPT-2 model is enough. We also used a fine-tuned DistilBERT model for classification (detecting which president delivered which address), with very good results (accuracy 93% - 95% depending on the run). All computations can be replicated by using the accompanying code on GitHub.

* 7 pages, 8 figures; GitHub repository https://github.com/sashakolpakov/ripple_in_time

Robust affine feature matching via quadratic assignment on Grassmannians

Mar 07, 2023

Alexander Kolpakov, Michael Werman

Abstract: GraNNI (Grassmannians for Nearest Neighbours Identification), a new algorithm to solve the problem of affine registration, is proposed. The algorithm is based on the Grassmannian of $k$-dimensional planes in $\mathbb{R}^n$ and on minimizing the Frobenius norm distance between two elements of the Grassmannian. The Quadratic Assignment Problem (QAP) is used to find the matching. The results of the experiments show that the algorithm is more robust to noise and point discrepancy in point clouds than previous approaches.

* 12 pages, 18 figures; GitHub repository at (https://github.com/sashakolpakov/granni)
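
As a rough illustration of what a Frobenius norm distance between two elements of the Grassmannian can look like, the sketch below represents each $k$-dimensional subspace by its orthogonal projection matrix and takes the Frobenius norm of the difference. This representation is an assumption made for illustration; the abstract does not say how GraNNI encodes subspaces, and the function names here are hypothetical, not taken from the GraNNI repository.

```python
import numpy as np


def projector(spanning_vectors):
    """Orthogonal projector onto the column span of an n x k matrix.

    Assumes the columns are linearly independent (full column rank).
    """
    # Reduced QR gives an n x k matrix with orthonormal columns
    # spanning the same subspace as the input columns.
    q, _ = np.linalg.qr(spanning_vectors)
    return q @ q.T


def grassmann_frobenius(a, b):
    """Frobenius norm of the difference of the two projectors.

    Hypothetical helper: depends only on the subspaces spanned by the
    columns of a and b, not on the particular bases chosen.
    """
    return np.linalg.norm(projector(a) - projector(b), ord="fro")


# Two orthogonal lines in R^2: projectors diag(1, 0) and diag(0, 1).
line_x = np.array([[1.0], [0.0]])
line_y = np.array([[0.0], [1.0]])
distance = grassmann_frobenius(line_x, line_y)  # sqrt(2)
```

Because the projector depends only on the subspace, replacing the columns of `a` by any other basis of the same span leaves the distance unchanged, which is the property a distance on the Grassmannian needs.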

An approach to robust ICP initialization

Dec 10, 2022

Alexander Kolpakov, Michael Werman

Abstract: In this note, we propose an approach for initializing the Iterative Closest Point (ICP) algorithm that allows us to apply ICP to unlabelled point clouds that are related by rigid transformations. We also give bounds on the robustness of our approach to noise. Numerical experiments confirm our theoretical findings.

* 7 pages, 10 figures; GitHub repository at (https://github.com/sashakolpakov/icp-init)
