Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

THE-X: Privacy-Preserving Transformer Inference with Homomorphic Encryption

About

As more and more pre-trained language models adopt on-cloud deployment, the privacy issues grow quickly, mainly for the exposure of plain-text user data (e.g., search history, medical record, bank account). Privacy-preserving inference of transformer models is on the demand of cloud service users. To protect privacy, it is an attractive choice to compute only with ciphertext in homomorphic encryption (HE). However, enabling pre-trained models inference on ciphertext data is difficult due to the complex computations in transformer blocks, which are not supported by current HE tools yet. In this work, we introduce $\textit{THE-X}$, an approximation approach for transformers, which enables privacy-preserving inference of pre-trained models developed by popular frameworks. $\textit{THE-X}$ proposes a workflow to deal with complex computation in transformer networks, including all the non-polynomial functions like GELU, softmax, and LayerNorm. Experiments reveal our proposed $\textit{THE-X}$ can enable transformer inference on encrypted data for different downstream tasks, all with negligible performance drop but enjoying the theory-guaranteed privacy-preserving advantage.

Tianyu Chen, Hangbo Bao, Shaohan Huang, Li Dong, Binxing Jiao, Daxin Jiang, Haoyi Zhou, Jianxin Li, Furu Wei• 2022

Related benchmarks

TaskDatasetResultRank
Private text generationGPT2-base (124M)
Usage Fraction98.95
7
Private text generationT5 138M
Memory Fraction97.18
7
Private InferenceT5 138M
Embed Inference Time (s)316.2
7
Private InferenceGPT2-base (124M)
Embed Inference Time (s)329.3
7
Showing 4 of 4 rows

Other info

Follow for update