Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Chaofan Yu

A Fast, Performant, Secure Distributed Training Framework For Large Language Model

Jan 19, 2024

Wei Huang, Yinggui Wang, Anda Cheng, Aihui Zhou, Chaofan Yu, Lei Wang

Figure 1 for A Fast, Performant, Secure Distributed Training Framework For Large Language Model

Figure 2 for A Fast, Performant, Secure Distributed Training Framework For Large Language Model

Figure 3 for A Fast, Performant, Secure Distributed Training Framework For Large Language Model

Figure 4 for A Fast, Performant, Secure Distributed Training Framework For Large Language Model

Abstract:The distributed (federated) LLM is an important method for co-training the domain-specific LLM using siloed data. However, maliciously stealing model parameters and data from the server or client side has become an urgent problem to be solved. In this paper, we propose a secure distributed LLM based on model slicing. In this case, we deploy the Trusted Execution Environment (TEE) on both the client and server side, and put the fine-tuned structure (LoRA or embedding of P-tuning v2) into the TEE. Then, secure communication is executed in the TEE and general environments through lightweight encryption. In order to further reduce the equipment cost as well as increase the model performance and accuracy, we propose a split fine-tuning scheme. In particular, we split the LLM by layers and place the latter layers in a server-side TEE (the client does not need a TEE). We then combine the proposed Sparsification Parameter Fine-tuning (SPF) with the LoRA part to improve the accuracy of the downstream task. Numerous experiments have shown that our method guarantees accuracy while maintaining security.

* Accepted by ICASSP 2024 (Federated LLM)

Via

Access Paper or Ask Questions

S3ML: A Secure Serving System for Machine Learning Inference

Oct 13, 2020

Junming Ma, Chaofan Yu, Aihui Zhou, Bingzhe Wu, Xibin Wu, Xingyu Chen, Xiangqun Chen, Lei Wang, Donggang Cao

Figure 1 for S3ML: A Secure Serving System for Machine Learning Inference

Figure 2 for S3ML: A Secure Serving System for Machine Learning Inference

Figure 3 for S3ML: A Secure Serving System for Machine Learning Inference

Figure 4 for S3ML: A Secure Serving System for Machine Learning Inference

Abstract:We present S3ML, a secure serving system for machine learning inference in this paper. S3ML runs machine learning models in Intel SGX enclaves to protect users' privacy. S3ML designs a secure key management service to construct flexible privacy-preserving server clusters and proposes novel SGX-aware load balancing and scaling methods to satisfy users' Service-Level Objectives. We have implemented S3ML based on Kubernetes as a low-overhead, high-available, and scalable system. We demonstrate the system performance and effectiveness of S3ML through extensive experiments on a series of widely-used models.

Via

Access Paper or Ask Questions

A Hybrid-Domain Framework for Secure Gradient Tree Boosting

May 18, 2020

Wenjing Fang, Chaochao Chen, Jin Tan, Chaofan Yu, Yufei Lu, Li Wang, Lei Wang, Jun Zhou, Alex X

Figure 1 for A Hybrid-Domain Framework for Secure Gradient Tree Boosting

Figure 2 for A Hybrid-Domain Framework for Secure Gradient Tree Boosting

Figure 3 for A Hybrid-Domain Framework for Secure Gradient Tree Boosting

Figure 4 for A Hybrid-Domain Framework for Secure Gradient Tree Boosting

Abstract:Gradient tree boosting (e.g. XGB) is one of the most widely usedmachine learning models in practice. How to build a secure XGB inface of data isolation problem becomes a hot research topic. However, existing works tend to leak intermediate information and thusraise potential privacy risk. In this paper, we propose a novel framework for two parties to build secure XGB with vertically partitioneddata. Specifically, we associate Homomorphic Encryption (HE) domain with Secret Sharing (SS) domain by providing the two-waytransformation primitives. The framework generally promotes theefficiency for privacy preserving machine learning and offers theflexibility to implement other machine learning models. Then weelaborate two secure XGB training algorithms as well as a corresponding prediction algorithm under the hybrid security domains.Next, we compare our proposed two training algorithms throughboth complexity analysis and experiments. Finally, we verify themodel performance on benchmark dataset and further apply ourwork to a real-world scenario.

Via

Access Paper or Ask Questions