Loukides, Grigorios ORCID: https://orcid.org/0000-0003-0888-5061, Gkoulalas-Divanis, Aris and Shao, Jianhua ORCID: https://orcid.org/0000-0001-8461-1471 2013. Efficient and flexible anonymization of transaction data. Knowledge and Information Systems 36 (1) , pp. 153-210. 10.1007/s10115-012-0544-3 |
Abstract
Transaction data are increasingly used in applications, such asmarketing research and biomedical studies. Publishing these data, however, may risk privacy breaches, as they often contain personal information about individuals. Approaches to anonymizing transaction data have been proposed recently, but they may produce excessively distorted and inadequately protected solutions. This is because these approaches do not consider privacy requirements that are common in real-world applications in a realistic and flexible manner, and attempt to safeguard the data only against either identity disclosure or sensitive information inference. In this paper, we propose a new approach that overcomes these limitations. We introduce a rule-based privacy model that allows data publishers to express fine-grained protection requirements for both identity and sensitive information disclosure. Based on this model, we also develop two anonymization algorithms. Our first algorithm works in a top-down fashion, employing an efficient strategy to recursively generalize data with low information loss. Our second algorithm uses sampling and a combination of top-down and bottom-up generalization heuristics, which greatly improves scalability while maintaining low information loss. Extensive experiments show that our algorithms significantly outperform the state-of-the-art in terms of retaining data utility, while achieving good protection and scalability.
Item Type: | Article |
---|---|
Date Type: | Publication |
Status: | Published |
Schools: | Computer Science & Informatics |
Subjects: | Q Science > QA Mathematics > QA76 Computer software |
Uncontrolled Keywords: | Anonymity; Privacy; Transaction data; Privacy requirements; Identity disclosure; Sensitive information disclosure; Efficiency; Scalability |
Publisher: | Springer |
ISSN: | 0219-1377 |
Last Modified: | 21 Oct 2022 10:01 |
URI: | https://orca.cardiff.ac.uk/id/eprint/38707 |
Citation Data
Cited 42 times in Scopus. View in Scopus. Powered By Scopus® Data
Actions (repository staff only)
Edit Item |