Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Amro Alabsi Aljundi

Boosting Graph Embedding on a Single GPU

Oct 19, 2021

Amro Alabsi Aljundi, Taha Atahan Akyıldız, Kamer Kaya

Figure 1 for Boosting Graph Embedding on a Single GPU

Figure 2 for Boosting Graph Embedding on a Single GPU

Figure 3 for Boosting Graph Embedding on a Single GPU

Figure 4 for Boosting Graph Embedding on a Single GPU

Abstract:Graphs are ubiquitous, and they can model unique characteristics and complex relations of real-life systems. Although using machine learning (ML) on graphs is promising, their raw representation is not suitable for ML algorithms. Graph embedding represents each node of a graph as a d-dimensional vector which is more suitable for ML tasks. However, the embedding process is expensive, and CPU-based tools do not scale to real-world graphs. In this work, we present GOSH, a GPU-based tool for embedding large-scale graphs with minimum hardware constraints. GOSH employs a novel graph coarsening algorithm to enhance the impact of updates and minimize the work for embedding. It also incorporates a decomposition schema that enables any arbitrarily large graph to be embedded with a single GPU. As a result, GOSH sets a new state-of-the-art in link prediction both in accuracy and speed, and delivers high-quality embeddings for node classification at a fraction of the time compared to the state-of-the-art. For instance, it can embed a graph with over 65 million vertices and 1.8 billion edges in less than 30 minutes on a single GPU.

* 12 pages, 11 tables, 6 figures, submitted for publication at Special Section on Parallel and Distributed Computing Techniques for AI, ML, and DL

Via

Access Paper or Ask Questions

Understanding Coarsening for Embedding Large-Scale Graphs

Sep 10, 2020

Taha Atahan Akyildiz, Amro Alabsi Aljundi, Kamer Kaya

Figure 1 for Understanding Coarsening for Embedding Large-Scale Graphs

Figure 2 for Understanding Coarsening for Embedding Large-Scale Graphs

Figure 3 for Understanding Coarsening for Embedding Large-Scale Graphs

Figure 4 for Understanding Coarsening for Embedding Large-Scale Graphs

Abstract:A significant portion of the data today, e.g, social networks, web connections, etc., can be modeled by graphs. A proper analysis of graphs with Machine Learning (ML) algorithms has the potential to yield far-reaching insights into many areas of research and industry. However, the irregular structure of graph data constitutes an obstacle for running ML tasks on graphs such as link prediction, node classification, and anomaly detection. Graph embedding is a compute-intensive process of representing graphs as a set of vectors in a d-dimensional space, which in turn makes it amenable to ML tasks. Many approaches have been proposed in the literature to improve the performance of graph embedding, e.g., using distributed algorithms, accelerators, and pre-processing techniques. Graph coarsening, which can be considered a pre-processing step, is a structural approximation of a given, large graph with a smaller one. As the literature suggests, the cost of embedding significantly decreases when coarsening is employed. In this work, we thoroughly analyze the impact of the coarsening quality on the embedding performance both in terms of speed and accuracy. Our experiments with a state-of-the-art, fast graph embedding tool show that there is an interplay between the coarsening decisions taken and the embedding quality.

* 10 pages, 6 figures, submitted to 2020 IEEE International Conference on Big Data

Via

Access Paper or Ask Questions