Leakage of Dataset Properties in Multi-Party Machine Learning

Authors: Wanrong Zhang, Georgia Institute of Technology; Shruti Tople, Microsoft Research; Olga Ohrimenko, The University of Melbourne

Abstract: 

Secure multi-party machine learning allows several parties to build a model on their pooled data to increase utility while not explicitly sharing data with each other. We show that such multi-party computation can cause leakage of global dataset properties between the parties even when parties obtain only black-box access to the final model. In particular, a "curious" party can infer the distribution of sensitive attributes in other parties' data with high accuracy. This raises concerns regarding the confidentiality of properties pertaining to the whole dataset as opposed to individual data records. We show that our attack can leak population-level properties in datasets of different types, including tabular, text, and graph data. To understand and measure the source of leakage, we consider several models of correlation between a sensitive attribute and the rest of the data. Using multiple machine learning models, we show that leakage occurs even if the sensitive attribute is not included in the training data and has a low correlation with other attributes or the target variable.
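To make the attack concrete, the sketch below follows the standard shadow-model/meta-classifier recipe for property inference, which matches the black-box threat model in the abstract: the attacker queries models only at fixed probe points and never sees the other parties' data. This is an illustrative sketch, not the authors' exact construction; all names here (make_party_data, black_box_signature, PROBE, the 0.2/0.8 property fractions) are hypothetical, and a logistic regression stands in for the jointly trained model. As in the paper's setting, the sensitive attribute shapes the data distribution but is never included as a training feature.

import numpy as np
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(0)
N_FEATURES = 8
W = rng.normal(size=N_FEATURES)  # shared "true" task, same for every party

def make_party_data(n, frac_sensitive):
    # Binary sensitive attribute s: weakly shifts the features and the label,
    # but is NOT included as a training feature (leakage persists regardless).
    s = (rng.random(n) < frac_sensitive).astype(float)
    X = rng.normal(size=(n, N_FEATURES)) + 0.3 * s[:, None]
    y = (X @ W + 0.8 * s + 0.3 * rng.normal(size=n) > 0).astype(int)
    return X, y

# Fixed probe points: the attacker queries models only here (black-box access).
PROBE = rng.normal(size=(32, N_FEATURES))

def black_box_signature(model):
    # Attack features: the model's posterior probabilities on the probe set.
    return model.predict_proba(PROBE)[:, 1]

def train_model(X, y):
    return LogisticRegression(max_iter=1000).fit(X, y)

# Shadow phase: train many models on datasets whose sensitive-attribute
# fraction is LOW (0.2) or HIGH (0.8), and record their black-box signatures.
sigs, props = [], []
for _ in range(100):
    frac = rng.choice([0.2, 0.8])  # the global property to be inferred
    Xs, ys = make_party_data(500, frac)
    sigs.append(black_box_signature(train_model(Xs, ys)))
    props.append(int(frac > 0.5))

# Meta-classifier: maps a model's black-box signature to the dataset property.
meta = LogisticRegression(max_iter=1000).fit(np.array(sigs), np.array(props))

# Attack phase: the victim model was trained on pooled data whose hidden
# sensitive-attribute fraction the curious party wants to recover.
X_v, y_v = make_party_data(500, frac_sensitive=0.8)
victim = train_model(X_v, y_v)
guess = meta.predict(black_box_signature(victim).reshape(1, -1))[0]
print("inferred sensitive-attribute fraction:", "high" if guess else "low")

The key point the sketch illustrates is that the meta-classifier is trained purely on query outputs, so the inference succeeds with nothing more than the black-box access each party legitimately receives to the final model.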

Open Access Media

USENIX is committed to Open Access to the research presented at our events. Papers and proceedings are freely available to everyone once the event begins. Any video, audio, and/or slides that are posted after the event are also free and open to everyone.
