Hyperbolic Fine-Tuning for Large Language Models

About

Large language models (LLMs) have demonstrated remarkable performance across various tasks. However, it remains an open question whether the default Euclidean space is the most suitable choice for LLMs. In this study, we investigate the geometric characteristics of LLMs, focusing specifically on tokens and their embeddings. Our findings reveal that token frequency follows a power-law distribution, where high-frequency tokens (e.g., the, that) constitute the minority, while low-frequency tokens (e.g., apple, dog) constitute the majority. Furthermore, high-frequency tokens cluster near the origin, whereas low-frequency tokens are positioned farther away in the embedding space. Additionally, token embeddings exhibit hyperbolic characteristics, indicating a latent tree-like structure within the embedding space. Motivated by these observations, we propose HypLoRA, an efficient fine-tuning approach that operates in hyperbolic space to better exploit these underlying hierarchical structures. HypLoRA performs low-rank adaptation directly in hyperbolic space, thereby preserving hyperbolic modeling capabilities throughout the fine-tuning process. Extensive experiments across various base models and reasoning benchmarks, specifically arithmetic and commonsense reasoning tasks, demonstrate that HypLoRA substantially improves LLM performance.

Menglin Yang, Ram Samarth B B, Aosong Feng, Bo Xiong, Jihong Liu, Irwin King, Rex Ying • 2024
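
The abstract says that token embeddings show hyperbolic, tree-like structure but does not state how this is measured. One common way to quantify tree-likeness of a finite metric space is Gromov's delta-hyperbolicity via the four-point condition, where delta = 0 for an exact tree metric and larger values mean less tree-like. The sketch below is only an illustration under that assumption, not the authors' procedure. The function names `four_point_delta` and `relative_delta` are hypothetical, and the brute-force loop over all quadruples is only practical for a small sample of points.

```python
import itertools

import numpy as np


def four_point_delta(dist):
    """Gromov delta of a finite metric space given as a square distance matrix.

    For every quadruple (x, y, z, w), form the three pairwise-sum pairings and
    take half the gap between the two largest sums. Delta is the maximum gap.
    Brute force, O(n^4): intended for small samples only.
    """
    dist = np.asarray(dist, dtype=float)
    n = dist.shape[0]
    delta = 0.0
    for x, y, z, w in itertools.combinations(range(n), 4):
        sums = sorted([
            dist[x, y] + dist[z, w],
            dist[x, z] + dist[y, w],
            dist[x, w] + dist[y, z],
        ])
        delta = max(delta, (sums[2] - sums[1]) / 2.0)
    return delta


def relative_delta(dist):
    """Scale-free variant: 2 * delta / diameter (assumes at least two distinct points)."""
    dist = np.asarray(dist, dtype=float)
    return 2.0 * four_point_delta(dist) / dist.max()


def pairwise_euclidean(points):
    """Pairwise Euclidean distance matrix for an (n, d) array of points."""
    points = np.asarray(points, dtype=float)
    diff = points[:, None, :] - points[None, :, :]
    return np.sqrt((diff ** 2).sum(axis=-1))
```

As a sanity check, a star graph (one hub, several leaves, shortest-path distances) is a tree and yields delta = 0, while the four corners of a unit square in the Euclidean plane yield a strictly positive delta. For a real embedding table one would typically evaluate this on a random subsample of tokens rather than on the full vocabulary.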

Related benchmarks

Task	Dataset	Metric	Result	(unlabeled)
Mathematical Reasoning	MATH 500	Accuracy	10.8	442
Mathematical Reasoning	SVAMP	Accuracy	66.33	403
Mathematical Reasoning	GSM8K	Accuracy	64.67	388
Mathematical Reasoning	MAWPS	Accuracy	65.77	279
Commonsense Reasoning	CSQA	CSQA Accuracy	79.85	195
Commonsense Reasoning	OBQA	Accuracy	87.8	187
Mathematical Reasoning	AQUA	Accuracy	27.95	167
Math Reasoning	MATH500	Accuracy	26.4	127
Mathematical Reasoning	MATH500	Accuracy	14.6	82
Arithmetic Reasoning	AQUA	Accuracy	41.34	57

Showing 10 of 14 rows
