Secure Construction of Contingency Tables from Distributed Data

Lu, Haibing; He, Xiaoyun; Vaidya, Jaideep; Adam, Nabil

doi:10.1007/978-3-540-70567-3_11

Haibing Lu¹,
Xiaoyun He¹,
Jaideep Vaidya¹ &
…
Nabil Adam¹

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 5094))

Included in the following conference series:

IFIP Annual Conference on Data and Applications Security and Privacy

1418 Accesses
2 Citations

Abstract

Contingency tables are widely used in many fields to analyze the relationship or infer the association between two or more variables. Indeed, due to their simplicity and ease, they are one of the first methods used to analyze gathered data. Typically, the construction of contingency tables from source data is considered straightforward since all data is supposed to be aggregated at a single party. However, in many cases, the collected data may actually be federated among different parties. Privacy and security concerns may restrict the data owners from free sharing of the raw data. However, construction of the global contingency tables would still be of immense interest. In this paper, we propose techniques for enabling secure construction of contingency tables from both horizontally and vertically partitioned data. Our methods are efficient and secure. We also examine cases where the constructed contingency table may itself leak too much information and discuss potential solutions.

Download to read the full chapter text

Chapter PDF

Privacy Issues in Association Rule Mining

Privacy Preserving Collaborative Agglomerative Hierarchical Clustering Construction

Information-Theoretically Secure Privacy Preserving Approaches for Collaborative Association Rule Mining

References

Agrawal, D., Aggarwal, C.C.: On the design and quantification of privacy preserving data mining algorithms. In: Proceedings of the Twentieth ACM SIGACT-SIGMOD-SIGART Symposium on Principles of Database Systems, pp. 247–255 (2001)
Google Scholar
Agrawal, R., Srikant, R.: Privacy-preserving data mining. In: Proceedings of the 2000 ACM SIGMOD Conference on Management of Data, pp. 439–450 (2000)
Google Scholar
Bedi, R.: Money Laundering - Controls and Prevention, 1st edn. ISI Publications (2004)
Google Scholar
Benaloh, J.C.: Secret Sharing Homomorphisms: Keeping Shares of a Secret Secret. In: Odlyzko, A.M. (ed.) CRYPTO 1986. LNCS, vol. 263, pp. 251–260. Springer, Heidelberg (1987)
Chapter Google Scholar
Blum, M., Goldwasser, S.: An efficient probabilistic public-key encryption that hides all partial information. In: Blakely, R. (ed.) Advances in Cryptology – Crypto 84 Proceedings. Springer, Heidelberg (1984)
Google Scholar
L. Cauley.: Nsa has massive database of americans’ phone calls (May 2006) (USA Today).
Google Scholar
Clifton, C., Kantarcioglu, M., Lin, X., Vaidya, J., Zhu, M.: Tools for privacy preserving distributed data mining. SIGKDD Explorations 4(2), 28–34 (2003)
Article Google Scholar
Cornfield, J.: A method of estimating comparative rates from clinical data: Applications to cancer of the lung, breast, and cervix. Journal of the National Cancer Institute 11, 1269–1275 (1951)
Google Scholar
Dellaportas, P., Tarantola, C.: Model determination for categorical data with factor level merging. Journal of the Royal Statistical Society 67, 269–283 (2005)
Article MathSciNet MATH Google Scholar
Evfimievski, A., Srikant, R., Agrawal, R., Gehrke, J.: Privacy preserving mining of association rules. In: The Eighth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 217–228 (2002)
Google Scholar
Fienberg, S.E.: The Analysis of Cross-classified Categorical Data, 2nd edn. M.I.T. Press, Cambridge (1980)
MATH Google Scholar
Goethals, B., Laur, S., Lipmaa, H., Mielikäinen, T.: On Secure Scalar Product Computation for Privacy-Preserving Data Mining. In: Park, C.-s., Chee, S. (eds.) ICISC 2004. LNCS, vol. 3506, pp. 104–120. Springer, Heidelberg (2005)
Chapter Google Scholar
Goldreich, O.: General Cryptographic Protocols. In: The Foundations of Cryptography, vol. 2. Cambridge University Press, Cambridge (2004)
Chapter Google Scholar
Goldreich, O., Micali, S., Wigderson, A.: How to play any mental game - a completeness theorem for protocols with honest majority. In: 19th ACM Symposium on the Theory of Computing, pp. 218–229 (1987)
Google Scholar
Huang, Z., Du, W., Chen, B.: Deriving private information from randomized data. In: Proceedings of the 2005 ACM SIGMOD International Conference on Management of Data, Baltimore, MD, June 13-16 (2005)
Google Scholar
Jagannathan, G., Wright, R.N.: Privacy-preserving distributed k-means clustering over arbitrarily partitioned data. In: KDD 2005: Proceeding of the eleventh ACM SIGKDD international conference on Knowledge discovery in data mining, pp. 593–599. ACM Press, New York (2005)
Chapter Google Scholar
Kargupta, H., Datta, S., Wang, Q., Sivakumar, K.: On the privacy preserving properties of random data perturbation techniques. In: Proceedings of the Third IEEE International Conference on Data Mining (ICDM 2003) (2003)
Google Scholar
Lindell, Y., Pinkas, B.: Privacy Preserving Data Mining. In: Bellare, M. (ed.) CRYPTO 2000. LNCS, vol. 1880, pp. 36–54. Springer, Heidelberg (2000)
Chapter Google Scholar
Naccache, D., Stern, J.: A new public key cryptosystem based on higher residues. In: Proceedings of the 5th ACM conference on Computer and communications security, pp. 59–66. ACM Press, San Francisco (1998)
Chapter Google Scholar
Okamoto, T., Uchiyama, S.: A New Public-Key Cryptosystem as Secure as Factoring. In: Nyberg, K. (ed.) EUROCRYPT 1998. LNCS, vol. 1403, pp. 308–318. Springer, Heidelberg (1998)
Chapter Google Scholar
Paillier, P.: Public-Key Cryptosystems Based on Composite Degree Residuosity Classes. In: Stern, J. (ed.) EUROCRYPT 1999. LNCS, vol. 1592, pp. 223–238. Springer, Heidelberg (1999)
Chapter Google Scholar
Rizvi, S.J., Haritsa, J.R.: Maintaining data privacy in association rule mining. In: Proceedings of 28th International Conference on Very Large Data Bases. VLDB, Hong Kong, August 20-23, pp. 682–693 (2002)
Google Scholar
Vaidya, J., Clifton, C.: Secure set intersection cardinality with application to association rule mining. Journal of Computer Security 13(4) (November 2005)
Google Scholar
Vaidya, J., Clifton, C., Zhu, M.: Privacy-Preserving Data Mining, 1st edn. Advances in Information Security. Springer, Heidelberg (2005)
MATH Google Scholar
Yao, A.C.: Protocols for secure computation (extended abstract). In: Proceedings of the 23th IEEE Symposium on Foundations of Computer Science, pp. 160–164. IEEE, Los Alamitos (1982)
Google Scholar
Yu, H., Vaidya, J.: Secure matrix addition (UIOWA Technical Report) (2004)
Google Scholar

Download references

Author information

Authors and Affiliations

MSIS Department and CIMIC, Rutgers University, USA
Haibing Lu, Xiaoyun He, Jaideep Vaidya & Nabil Adam

Authors

Haibing Lu
View author publications
You can also search for this author in PubMed Google Scholar
Xiaoyun He
View author publications
You can also search for this author in PubMed Google Scholar
Jaideep Vaidya
View author publications
You can also search for this author in PubMed Google Scholar
Nabil Adam
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Vijay Atluri

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Lu, H., He, X., Vaidya, J., Adam, N. (2008). Secure Construction of Contingency Tables from Distributed Data. In: Atluri, V. (eds) Data and Applications Security XXII. DBSec 2008. Lecture Notes in Computer Science, vol 5094. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-70567-3_11

Download citation

DOI: https://doi.org/10.1007/978-3-540-70567-3_11
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-70566-6
Online ISBN: 978-3-540-70567-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Secure Construction of Contingency Tables from Distributed Data

Abstract

Chapter PDF

Similar content being viewed by others

Privacy Issues in Association Rule Mining

Privacy Preserving Collaborative Agglomerative Hierarchical Clustering Construction

Information-Theoretically Secure Privacy Preserving Approaches for Collaborative Association Rule Mining

References

Author information

Authors and Affiliations

Editor information

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Secure Construction of Contingency Tables from Distributed Data

Abstract

Chapter PDF

Similar content being viewed by others

Privacy Issues in Association Rule Mining

Privacy Preserving Collaborative Agglomerative Hierarchical Clustering Construction

Information-Theoretically Secure Privacy Preserving Approaches for Collaborative Association Rule Mining

References

Author information

Authors and Affiliations

Editor information

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation