Title: Hypformer: Exploring Efficient Hyperbolic Transformer Fully in Hyperbolic Space

Jul 01, 2024

Menglin Yang, Harshit Verma, Delvin Ce Zhang, Jiahong Liu, Irwin King, Rex Ying

Abstract: Hyperbolic geometry has shown significant potential in modeling complex structured data, particularly data with underlying tree-like and hierarchical structures. Despite the impressive performance of various hyperbolic neural networks across numerous domains, research on adapting the Transformer to hyperbolic space remains limited. Previous attempts have mainly focused on modifying the self-attention modules in the Transformer. However, these efforts have fallen short of developing a complete hyperbolic Transformer. This stems primarily from: (i) the absence of well-defined modules in hyperbolic space, including linear transformation layers, LayerNorm layers, activation functions, and dropout operations; and (ii) the quadratic time complexity of the existing hyperbolic self-attention module with respect to the number of input tokens, which hinders its scalability. To address these challenges, we propose Hypformer, a novel hyperbolic Transformer based on the Lorentz model of hyperbolic geometry. In Hypformer, we introduce two foundational blocks that define the essential modules of the Transformer in hyperbolic space. Furthermore, we develop a linear self-attention mechanism in hyperbolic space, enabling a hyperbolic Transformer to process billion-scale graph data and long-sequence inputs for the first time. Our experimental results confirm the effectiveness and efficiency of Hypformer across various datasets, demonstrating its potential as an effective and scalable solution for large-scale data representation and large models.
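
To make the two technical ideas in the abstract concrete, here is a minimal, illustrative sketch in plain Python. It is not Hypformer's implementation. It assumes a generic kernelized linear attention (with an `elu(x) + 1` feature map, a common choice that the abstract does not specify) applied to space-like coordinates, followed by recomputing the time-like coordinate so that the result lies on the Lorentz hyperboloid. The function names and the `k` parameter (curvature `-k`, so points satisfy `<x, x>_L = -1/k`) are hypothetical.

```python
import math

def lift_to_hyperboloid(space, k=1.0):
    """Prepend the time-like coordinate so that <x, x>_L = -1/k (assumed convention)."""
    time = math.sqrt(1.0 / k + sum(s * s for s in space))
    return [time] + list(space)

def minkowski_inner(x, y):
    """Lorentzian inner product: -x0*y0 + sum_i xi*yi."""
    return -x[0] * y[0] + sum(a * b for a, b in zip(x[1:], y[1:]))

def feature_map(v):
    """elu(x) + 1, a positive feature map (an assumption, not taken from the paper)."""
    return [x + 1.0 if x > 0 else math.exp(x) for x in v]

def linear_attention(queries, keys, values):
    """Kernelized attention in O(n * d * d_v) instead of O(n^2).

    The sums over keys are accumulated once and reused for every query,
    so no n-by-n attention matrix is ever formed.
    """
    d, dv = len(keys[0]), len(values[0])
    kv = [[0.0] * dv for _ in range(d)]   # sum_j phi(k_j) v_j^T
    ksum = [0.0] * d                      # sum_j phi(k_j)
    for key, val in zip(keys, values):
        pk = feature_map(key)
        for i in range(d):
            ksum[i] += pk[i]
            for j in range(dv):
                kv[i][j] += pk[i] * val[j]
    out = []
    for q in queries:
        pq = feature_map(q)
        norm = sum(pq[i] * ksum[i] for i in range(d))
        out.append([sum(pq[i] * kv[i][j] for i in range(d)) / norm
                    for j in range(dv)])
    return out

# Toy inputs: two tokens with 2-dimensional space-like features.
q = [[0.1, -0.2], [0.3, 0.4]]
k_in = [[0.5, 0.1], [-0.3, 0.2]]
v = [[1.0, 2.0], [3.0, 4.0]]
attended = linear_attention(q, k_in, v)
points = [lift_to_hyperboloid(a) for a in attended]
```

Each entry of `points` satisfies the hyperboloid constraint by construction, and `linear_attention` returns the same values as the quadratic form that first computes all pairwise weights `phi(q_i) · phi(k_j)`; only the order of summation differs.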

* KDD 2024
