Memon, Neelam, Loukides, Grigorios ORCID: https://orcid.org/0000-0003-0888-5061 and Shao, Jianhua ORCID: https://orcid.org/0000-0001-8461-1471 2015. A parallel method for scalable anonymization of transaction data. Presented at: 14th International Symposium on Parallel and Distributed Computing, ISPDC 2015, Limassol, Cyprus, 29 June - 2 July 2015. 2015 14th International Symposium on Parallel and Distributed Computing (ISPDC). IEEE, pp. 235-241. 10.1109/ISPDC.2015.34 |
Preview |
PDF
- Accepted Post-Print Version
Download (960kB) | Preview |
Abstract
Transaction data, such as market basket or diagnostic data, contain sensitive information about individuals. Such data are often disseminated widely to support analytic studies. This raises privacy concerns, as the confidentiality of individuals must be protected. Economization is an established methodology to protect transaction data, which can be applied using different algorithms. RBAT is an algorithm for anonymzitng transaction data that has many desirable features. These include flexible specification of privacy requirements and the ability to preserve data utility well. However, like most economization methods, RBAT is a sequential algorithm that is not scalable to large datasets. This limits the applicability of RBAT in practice. To address this issue, in this paper, we develop a parallel version of RBAT using MapReduce. We partition the data across cluster of computing nodes and implement the key operations of RBAT in parallel. Our experimental results show that scalable economization of large transaction datasets can be achieved using MapReduce and our method can scale nearly linear to the number of processing nodes.
Item Type: | Conference or Workshop Item (Paper) |
---|---|
Date Type: | Publication |
Status: | Published |
Schools: | Computer Science & Informatics |
Subjects: | Q Science > QA Mathematics > QA75 Electronic computers. Computer science |
Publisher: | IEEE |
ISBN: | 9781467371476 |
Date of First Compliant Deposit: | 30 March 2016 |
Last Modified: | 28 Oct 2022 10:01 |
URI: | https://orca.cardiff.ac.uk/id/eprint/76413 |
Citation Data
Cited 4 times in Scopus. View in Scopus. Powered By Scopus® Data
Actions (repository staff only)
Edit Item |