Cardiff University | Prifysgol Caerdydd ORCA
Online Research @ Cardiff 
WelshClear Cookie - decide language by browser settings

Efficient and flexible anonymization of transaction data

Loukides, Grigorios ORCID:, Gkoulalas-Divanis, Aris and Shao, Jianhua ORCID: 2013. Efficient and flexible anonymization of transaction data. Knowledge and Information Systems 36 (1) , pp. 153-210. 10.1007/s10115-012-0544-3

Full text not available from this repository.


Transaction data are increasingly used in applications, such asmarketing research and biomedical studies. Publishing these data, however, may risk privacy breaches, as they often contain personal information about individuals. Approaches to anonymizing transaction data have been proposed recently, but they may produce excessively distorted and inadequately protected solutions. This is because these approaches do not consider privacy requirements that are common in real-world applications in a realistic and flexible manner, and attempt to safeguard the data only against either identity disclosure or sensitive information inference. In this paper, we propose a new approach that overcomes these limitations. We introduce a rule-based privacy model that allows data publishers to express fine-grained protection requirements for both identity and sensitive information disclosure. Based on this model, we also develop two anonymization algorithms. Our first algorithm works in a top-down fashion, employing an efficient strategy to recursively generalize data with low information loss. Our second algorithm uses sampling and a combination of top-down and bottom-up generalization heuristics, which greatly improves scalability while maintaining low information loss. Extensive experiments show that our algorithms significantly outperform the state-of-the-art in terms of retaining data utility, while achieving good protection and scalability.

Item Type: Article
Date Type: Publication
Status: Published
Schools: Computer Science & Informatics
Subjects: Q Science > QA Mathematics > QA76 Computer software
Uncontrolled Keywords: Anonymity; Privacy; Transaction data; Privacy requirements; Identity disclosure; Sensitive information disclosure; Efficiency; Scalability
Publisher: Springer
ISSN: 0219-1377
Last Modified: 21 Oct 2022 10:01

Citation Data

Cited 41 times in Scopus. View in Scopus. Powered By Scopus® Data

Actions (repository staff only)

Edit Item Edit Item