Hyperbolic Busemann Neural Networks
About
Hyperbolic spaces provide a natural geometry for representing hierarchical and tree-structured data due to their exponential volume growth. To leverage these benefits, neural networks require intrinsic and efficient components that operate directly in hyperbolic space. In this work, we lift two core components of neural networks, Multinomial Logistic Regression (MLR) and Fully Connected (FC) layers, into hyperbolic space via Busemann functions, resulting in Busemann MLR (BMLR) and Busemann FC (BFC) layers with a unified mathematical interpretation. BMLR provides compact parameters, a point-to-horosphere distance interpretation, batch-efficient computation, and a Euclidean limit, while BFC generalizes FC and activation layers with comparable complexity. Experiments on image classification, genome sequence learning, node classification, and link prediction demonstrate improvements in effectiveness and efficiency over prior hyperbolic layers. The code is available at https://github.com/GitZH-Chen/HBNN.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Image Classification | ImageNet 1k (test) | Top-1 Accuracy73.36 | 798 | |
| Node Classification | DISEASE δ = 0 (test) | F1 Score92.45 | 18 | |
| Node Classification | AIRPORT δ = 1 (test) | F1 Score (Test)86.02 | 18 | |
| Node Classification | PubMed δ=3.5 (test) | Test F1 Score77.36 | 18 | |
| Link Prediction | DISEASE δ = 0 (test) | ROC AUC80.45 | 17 | |
| Link Prediction | AIRPORT δ = 1 (test) | ROC AUC0.9537 | 17 | |
| Link Prediction | CoRA δ=11 (test) | ROC AUC92.28 | 17 | |
| Link Prediction | PubMed δ=3.5 (test) | ROC AUC94.9 | 17 | |
| Classification | Tiny ImageNet 200 (test) | Test Accuracy66.16 | 16 | |
| Core Promoter Detection | tata GUE | MCC82.83 | 10 |