Privacy-preserving similarity coefficients for binary data

Kok Seng Wong, Myung Ho Kim

Research output: Contribution to journalArticle

9 Citations (Scopus)

Abstract

Similarity coefficients (also known as coefficients of association) are important measurement techniques used to quantify the extent to which objects resemble one another. Due to privacy concerns, the data owner might not want to participate in any similarity measurement if the original dataset will be revealed or could be derived from the final output. There are many different measurements used for numerical, structural and binary data. In this paper, we particularly consider the computation of similarity coefficients for binary data. A large number of studies related to similarity coefficients have been performed. Our objective in this paper is not to design a specific similarity coefficient. Rather, we are demonstrating how to compute similarity coefficients in a secure and privacy preserved environment. In our protocol, a client and a server jointly participate in the computation. At the end of the protocol, the client will obtain all summation variables needed for the computation while the server learns nothing. We incorporate cryptographic methods in our protocol to protect the original dataset and all other intermediate results. Note that our protocol also supports dissimilarity coefficients.

Original languageEnglish
Pages (from-to)1280-1290
Number of pages11
JournalComputers and Mathematics with Applications
Volume65
Issue number9
DOIs
Publication statusPublished - May 1 2013
Externally publishedYes

Fingerprint

Similarity Coefficient
Binary Data
Privacy Preserving
Network protocols
Servers
Privacy
Server
Measurement Techniques
Dissimilarity
Coefficient
Summation
Quantify
Output

Keywords

  • Association coefficients
  • Binary data similarity measures
  • Privacy preserving similarity test
  • Similarity coefficients
  • Similarity computation

ASJC Scopus subject areas

  • Modelling and Simulation
  • Computational Theory and Mathematics
  • Computational Mathematics

Cite this

Privacy-preserving similarity coefficients for binary data. / Wong, Kok Seng; Kim, Myung Ho.

In: Computers and Mathematics with Applications, Vol. 65, No. 9, 01.05.2013, p. 1280-1290.

Research output: Contribution to journalArticle

@article{84d058ff39f7470e866ed9e7b5c62d90,
title = "Privacy-preserving similarity coefficients for binary data",
abstract = "Similarity coefficients (also known as coefficients of association) are important measurement techniques used to quantify the extent to which objects resemble one another. Due to privacy concerns, the data owner might not want to participate in any similarity measurement if the original dataset will be revealed or could be derived from the final output. There are many different measurements used for numerical, structural and binary data. In this paper, we particularly consider the computation of similarity coefficients for binary data. A large number of studies related to similarity coefficients have been performed. Our objective in this paper is not to design a specific similarity coefficient. Rather, we are demonstrating how to compute similarity coefficients in a secure and privacy preserved environment. In our protocol, a client and a server jointly participate in the computation. At the end of the protocol, the client will obtain all summation variables needed for the computation while the server learns nothing. We incorporate cryptographic methods in our protocol to protect the original dataset and all other intermediate results. Note that our protocol also supports dissimilarity coefficients.",
keywords = "Association coefficients, Binary data similarity measures, Privacy preserving similarity test, Similarity coefficients, Similarity computation",
author = "Wong, {Kok Seng} and Kim, {Myung Ho}",
year = "2013",
month = "5",
day = "1",
doi = "10.1016/j.camwa.2012.02.028",
language = "English",
volume = "65",
pages = "1280--1290",
journal = "Computers and Mathematics with Applications",
issn = "0898-1221",
publisher = "Elsevier",
number = "9",

}

TY - JOUR

T1 - Privacy-preserving similarity coefficients for binary data

AU - Wong, Kok Seng

AU - Kim, Myung Ho

PY - 2013/5/1

Y1 - 2013/5/1

N2 - Similarity coefficients (also known as coefficients of association) are important measurement techniques used to quantify the extent to which objects resemble one another. Due to privacy concerns, the data owner might not want to participate in any similarity measurement if the original dataset will be revealed or could be derived from the final output. There are many different measurements used for numerical, structural and binary data. In this paper, we particularly consider the computation of similarity coefficients for binary data. A large number of studies related to similarity coefficients have been performed. Our objective in this paper is not to design a specific similarity coefficient. Rather, we are demonstrating how to compute similarity coefficients in a secure and privacy preserved environment. In our protocol, a client and a server jointly participate in the computation. At the end of the protocol, the client will obtain all summation variables needed for the computation while the server learns nothing. We incorporate cryptographic methods in our protocol to protect the original dataset and all other intermediate results. Note that our protocol also supports dissimilarity coefficients.

AB - Similarity coefficients (also known as coefficients of association) are important measurement techniques used to quantify the extent to which objects resemble one another. Due to privacy concerns, the data owner might not want to participate in any similarity measurement if the original dataset will be revealed or could be derived from the final output. There are many different measurements used for numerical, structural and binary data. In this paper, we particularly consider the computation of similarity coefficients for binary data. A large number of studies related to similarity coefficients have been performed. Our objective in this paper is not to design a specific similarity coefficient. Rather, we are demonstrating how to compute similarity coefficients in a secure and privacy preserved environment. In our protocol, a client and a server jointly participate in the computation. At the end of the protocol, the client will obtain all summation variables needed for the computation while the server learns nothing. We incorporate cryptographic methods in our protocol to protect the original dataset and all other intermediate results. Note that our protocol also supports dissimilarity coefficients.

KW - Association coefficients

KW - Binary data similarity measures

KW - Privacy preserving similarity test

KW - Similarity coefficients

KW - Similarity computation

UR - http://www.scopus.com/inward/record.url?scp=84877756060&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84877756060&partnerID=8YFLogxK

U2 - 10.1016/j.camwa.2012.02.028

DO - 10.1016/j.camwa.2012.02.028

M3 - Article

VL - 65

SP - 1280

EP - 1290

JO - Computers and Mathematics with Applications

JF - Computers and Mathematics with Applications

SN - 0898-1221

IS - 9

ER -