
Open Access Article

Privacy-Preserving Transformer Inference with Optimized Homomorphic Encryption and Secure Collaborative Computing

Tao Bai1, Yang Tang2, Kuan Shao3, Zhenyong Zhang3,*, Yuanteng Liu4
1 Guizhou Provincial Meteorological Data Center, Guiyang, China
2 Technical Department of the People’s Procuratorate of Guizhou Province, Guiyang, China
3 College of Computer Science and Technology, Guizhou University, Guiyang, China
4 Colorful Guizhou Digital Technology Co., Ltd., Guiyang, China
* Corresponding Author: Zhenyong Zhang. Email: email

Computers, Materials & Continua https://doi.org/10.32604/cmc.2026.078473

Received 31 December 2025; Accepted 10 March 2026; Published online 07 April 2026

Abstract

In recent years, the rapid development of artificial intelligence has greatly promoted the adoption of Machine Learning as a Service (MLaaS): users upload their inputs through front-end applications, and the server returns model inference results. However, MLaaS can lead to serious privacy breaches. Large language model services are a typical form of MLaaS, and the Transformer is the core architecture of large language models. This paper therefore proposes a privacy-preserving Transformer inference scheme based on the CKKS fully homomorphic encryption scheme, optimized for computational and communication efficiency. First, we implement efficient matrix multiplication based on ring multiplication and optimize the matrix-partitioning parameters to accommodate different operand types (ciphertext-plaintext and ciphertext-ciphertext) and different matrix dimensions. Second, we design secure Softmax, LayerNorm, and GELU protocols based on parameter fuzzing and collaborative computing, enabling efficient and secure atomic computations over ciphertexts. Finally, we conduct text-classification experiments on the IMDB and AGNEWS datasets. The results show that, under our experimental settings (an AMD Ryzen 7 5700G CPU with 32 GB RAM and 8-thread parallel computation using the Lattigo library), the proposed scheme completes inference within 3 s with communication costs below 1 GB, and its accuracy is comparable to that of plaintext computation.
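The abstract mentions secure Softmax protocols built on parameter fuzzing (masking) and collaborative computing. The paper's actual protocol is not reproduced on this page; the sketch below only illustrates the mathematical property that such blinding designs typically exploit, namely that softmax is invariant under a per-row additive shift. In a collaborative setting, one party can therefore add a random scalar mask to a row of attention scores (e.g., homomorphically under CKKS) before handing it off, and the assisting party still computes the correct softmax without seeing the true scores. All names here are illustrative, not from the paper.

```python
import numpy as np

def softmax(x):
    # Numerically stable softmax over a 1-D score vector.
    e = np.exp(x - np.max(x))
    return e / e.sum()

rng = np.random.default_rng(0)
scores = rng.normal(size=8)   # stand-in for one row of attention scores
c = 10.0 * rng.normal()       # random additive mask (one scalar per row)

# Shift invariance: softmax(scores + c) == softmax(scores), so a party
# that only sees the masked row recovers the correct softmax output
# while the true scores stay hidden behind the mask.
masked = scores + c
assert np.allclose(softmax(masked), softmax(scores))
print("masked softmax matches plaintext softmax")
```

Real protocols of this kind must also hide the softmax output itself (e.g., by re-masking before re-encryption); this toy example deliberately omits those steps.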

Keywords

Machine learning as a service; privacy preservation; Transformer; collaborative computing