WIP - Outline of Privacy-preserving machine learning: Methods, challenges and directions#
Year: 2021
Author: Xu, Baracaldo, and Joshi
Privacy Preserving {P}hase in PPML#
How privacy techniques can be applied at different stages of the ML pipeline
Data Preparation
Model Training/Evaluation
Model Deployment
Model Serving
Model Generation#
Data Preparation
Model Training/Evaluation
Model Serving#
Model Deployment
Model Serving
Full Pipeline#
Privacy {G}uarantees in PPML#
Object-Oriented Privacy Guarantee#
Data Oriented#
Attackers can not infer private information from input data
e.g. Revealing the column used in training data, which may let attackers link the data to specific individuals
Model Oriented#
Attackers can not infer private information from model, even after multiple queries
Pipeline-Orientied Privacy Guarantee#
In entire ML pipeline, you may use local machine or cloud service, involving multiple participants. Therefore, to ensure privacy, you need to understand the flow of data and the responsibility of each participant in the pipeline. Therefore, you need to:
Define data processing boundaries
Set trust level for each role in the pipeline (
🟩 Trusted
,🟢 Honest
,🔴 Curious
,🟥 Untrusted
)
Model Generation Phase#
Guarantee on model generation phase, there are three roles:
Data Producer
Local CF
Third-party CF
Vanilla Local Privacy Guarantee#
🟢 third-party CF
Primary Local Privacy Guarantee#
🔴 third-party CF
Enhanced Local Privacy Guarantee#
🟥 third-party CF
Model Serving Phase#
Guarantee on model serving phase, there are four roles:
Data Producer
Local CF
Third-party CF
Model Consumer