Cardiff University | Prifysgol Caerdydd ORCA
Online Research @ Cardiff 
WelshClear Cookie - decide language by browser settings

A parallel method for scalable anonymization of transaction data

Memon, Neelam, Loukides, Grigorios and Shao, Jianhua 2015. A parallel method for scalable anonymization of transaction data. Presented at: 14th International Symposium on Parallel and Distributed Computing, ISPDC 2015, Limassol, Cyprus, 29 June - 2 July 2015. 2015 14th International Symposium on Parallel and Distributed Computing (ISPDC). IEEE, pp. 235-241. 10.1109/ISPDC.2015.34

[thumbnail of ispdc.pdf]
PDF - Accepted Post-Print Version
Download (960kB) | Preview


Transaction data, such as market basket or diagnostic data, contain sensitive information about individuals. Such data are often disseminated widely to support analytic studies. This raises privacy concerns, as the confidentiality of individuals must be protected. Economization is an established methodology to protect transaction data, which can be applied using different algorithms. RBAT is an algorithm for anonymzitng transaction data that has many desirable features. These include flexible specification of privacy requirements and the ability to preserve data utility well. However, like most economization methods, RBAT is a sequential algorithm that is not scalable to large datasets. This limits the applicability of RBAT in practice. To address this issue, in this paper, we develop a parallel version of RBAT using MapReduce. We partition the data across cluster of computing nodes and implement the key operations of RBAT in parallel. Our experimental results show that scalable economization of large transaction datasets can be achieved using MapReduce and our method can scale nearly linear to the number of processing nodes.

Item Type: Conference or Workshop Item (Paper)
Date Type: Publication
Status: Published
Schools: Computer Science & Informatics
Subjects: Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Publisher: IEEE
ISBN: 9781467371476
Date of First Compliant Deposit: 30 March 2016
Last Modified: 11 Dec 2020 02:59

Citation Data

Cited 4 times in Scopus. View in Scopus. Powered By Scopus® Data

Actions (repository staff only)

Edit Item Edit Item


Downloads per month over past year

View more statistics