Shrinkwrap

Efficient SQL query processing in differentially private data federations

Johes Bater, Xi He, William Ehrich, Ashwin Machanavajjhala, Jennie M Rogers

Research output: Contribution to journalConference article

Abstract

A private data federation is a set of autonomous databases that share a unified query interface offering in-situ evaluation of SQL queries over the union of the sensitive data of its members. Owing to privacy concerns, these systems do not have a trusted data collector that can see all their data and their member databases cannot learn about individual records of other engines. Federations currently achieve this goal by evaluating queries obliviously using secure multiparty computation. This hides the intermediate result cardinality of each query operator by exhaustively padding it. With cascades of such operators, this padding accumulates to a blow-up in the output size of each operator and a proportional loss in query performance. Hence, existing private data federations do not scale well to complex SQL queries over large datasets. We introduce Shrinkwrap, a private data federation that offers data owners a differentially private view of the data held by others to improve their performance over oblivious query processing. Shrinkwrap uses computational differential privacy to minimize the padding of intermediate query results, achieving up to a 35X performance improvement over oblivious query processing. When the query needs differentially private output, Shrinkwrap provides a trade-off between result accuracy and query evaluation performance.

Original languageEnglish (US)
Pages (from-to)307-320
Number of pages14
JournalProceedings of the VLDB Endowment
Volume12
Issue number3
DOIs
StatePublished - Jan 1 2018
Event45th International Conference on Very Large Data Bases, VLDB 2019 - Los Angeles, United States
Duration: Aug 26 2017Aug 30 2017

Fingerprint

Query processing
Engines

ASJC Scopus subject areas

  • Computer Science (miscellaneous)
  • Computer Science(all)

Cite this

Bater, Johes ; He, Xi ; Ehrich, William ; Machanavajjhala, Ashwin ; Rogers, Jennie M. / Shrinkwrap : Efficient SQL query processing in differentially private data federations. In: Proceedings of the VLDB Endowment. 2018 ; Vol. 12, No. 3. pp. 307-320.
@article{1bb68c8aa73646d79584b36cc4e49bf6,
title = "Shrinkwrap: Efficient SQL query processing in differentially private data federations",
abstract = "A private data federation is a set of autonomous databases that share a unified query interface offering in-situ evaluation of SQL queries over the union of the sensitive data of its members. Owing to privacy concerns, these systems do not have a trusted data collector that can see all their data and their member databases cannot learn about individual records of other engines. Federations currently achieve this goal by evaluating queries obliviously using secure multiparty computation. This hides the intermediate result cardinality of each query operator by exhaustively padding it. With cascades of such operators, this padding accumulates to a blow-up in the output size of each operator and a proportional loss in query performance. Hence, existing private data federations do not scale well to complex SQL queries over large datasets. We introduce Shrinkwrap, a private data federation that offers data owners a differentially private view of the data held by others to improve their performance over oblivious query processing. Shrinkwrap uses computational differential privacy to minimize the padding of intermediate query results, achieving up to a 35X performance improvement over oblivious query processing. When the query needs differentially private output, Shrinkwrap provides a trade-off between result accuracy and query evaluation performance.",
author = "Johes Bater and Xi He and William Ehrich and Ashwin Machanavajjhala and Rogers, {Jennie M}",
year = "2018",
month = "1",
day = "1",
doi = "10.14778/3291264.3291274",
language = "English (US)",
volume = "12",
pages = "307--320",
journal = "Proceedings of the VLDB Endowment",
issn = "2150-8097",
publisher = "Very Large Data Base Endowment Inc.",
number = "3",

}

Shrinkwrap : Efficient SQL query processing in differentially private data federations. / Bater, Johes; He, Xi; Ehrich, William; Machanavajjhala, Ashwin; Rogers, Jennie M.

In: Proceedings of the VLDB Endowment, Vol. 12, No. 3, 01.01.2018, p. 307-320.

Research output: Contribution to journalConference article

TY - JOUR

T1 - Shrinkwrap

T2 - Efficient SQL query processing in differentially private data federations

AU - Bater, Johes

AU - He, Xi

AU - Ehrich, William

AU - Machanavajjhala, Ashwin

AU - Rogers, Jennie M

PY - 2018/1/1

Y1 - 2018/1/1

N2 - A private data federation is a set of autonomous databases that share a unified query interface offering in-situ evaluation of SQL queries over the union of the sensitive data of its members. Owing to privacy concerns, these systems do not have a trusted data collector that can see all their data and their member databases cannot learn about individual records of other engines. Federations currently achieve this goal by evaluating queries obliviously using secure multiparty computation. This hides the intermediate result cardinality of each query operator by exhaustively padding it. With cascades of such operators, this padding accumulates to a blow-up in the output size of each operator and a proportional loss in query performance. Hence, existing private data federations do not scale well to complex SQL queries over large datasets. We introduce Shrinkwrap, a private data federation that offers data owners a differentially private view of the data held by others to improve their performance over oblivious query processing. Shrinkwrap uses computational differential privacy to minimize the padding of intermediate query results, achieving up to a 35X performance improvement over oblivious query processing. When the query needs differentially private output, Shrinkwrap provides a trade-off between result accuracy and query evaluation performance.

AB - A private data federation is a set of autonomous databases that share a unified query interface offering in-situ evaluation of SQL queries over the union of the sensitive data of its members. Owing to privacy concerns, these systems do not have a trusted data collector that can see all their data and their member databases cannot learn about individual records of other engines. Federations currently achieve this goal by evaluating queries obliviously using secure multiparty computation. This hides the intermediate result cardinality of each query operator by exhaustively padding it. With cascades of such operators, this padding accumulates to a blow-up in the output size of each operator and a proportional loss in query performance. Hence, existing private data federations do not scale well to complex SQL queries over large datasets. We introduce Shrinkwrap, a private data federation that offers data owners a differentially private view of the data held by others to improve their performance over oblivious query processing. Shrinkwrap uses computational differential privacy to minimize the padding of intermediate query results, achieving up to a 35X performance improvement over oblivious query processing. When the query needs differentially private output, Shrinkwrap provides a trade-off between result accuracy and query evaluation performance.

UR - http://www.scopus.com/inward/record.url?scp=85061741912&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85061741912&partnerID=8YFLogxK

U2 - 10.14778/3291264.3291274

DO - 10.14778/3291264.3291274

M3 - Conference article

VL - 12

SP - 307

EP - 320

JO - Proceedings of the VLDB Endowment

JF - Proceedings of the VLDB Endowment

SN - 2150-8097

IS - 3

ER -