Cardiff University | Prifysgol Caerdydd ORCA
Online Research @ Cardiff 
WelshClear Cookie - decide language by browser settings

(r, k, ε)-anonymization: Privacy-preserving data publishing algorithm based on multi-dimensional outlier detection, k-anonymity, and ε-differential privacy

Kara, Burak Cem, Eyupoglu, Can and Karakus, Oktay ORCID: https://orcid.org/0000-0001-8009-9319 2025. (r, k, ε)-anonymization: Privacy-preserving data publishing algorithm based on multi-dimensional outlier detection, k-anonymity, and ε-differential privacy. IEEE Access 13 , pp. 70422-70435. 10.1109/access.2025.3559410

[thumbnail of r_k_-Anonymization_Privacy-Preserving_Data_Publishing_Algorithm_Based_on_Multi-Dimensional_Outlier_Detection_k-Anonymity_and_-Differential_Privacy.pdf] PDF - Published Version
Download (1MB)
License URL: https://creativecommons.org/licenses/by/4.0/legalcode
License Start date: 1 January 2025

Abstract

In recent years, there has been a tremendous rise in both the volume and variety of big data, providing enormous potential benefits to businesses that seek to utilize consumer experiences for research or commercial purposes. The general data protection regulation (GDPR) implementation, on the other hand, has introduced extensive control over the use of individuals’ personal information and placed many limits. Data anonymization technologies have become an important solution for businesses trying to generate value from data while adhering to GDPR limitations. To address these challenges, researchers have developed various methods, including k-anonymity and ε -differential privacy, offering solutions for both industry and academia. However, protecting individuals’ privacy against diverse attack attempts presents significant challenges for anonymization models that rely solely on a single technique, highlighting the need for more adaptable and hybrid approaches. In this study, a new hybrid anonymization algorithm called (r, k, ε )-anonymization has been proposed, which combines k-anonymity and ε -differential privacy models in a consistent framework and provides stronger privacy guarantees compared to existing privacy-preserving models. The proposed algorithm is capable of overcoming well-known shortcomings of the k-anonymity and ε -differential privacy models, and it has been confirmed by extensive tests on real-world datasets. The proposed (r, k, ε )-anonymization algorithm outperforms k-anonymity and ε -differential privacy in terms of the average error rate measure, achieving data utility increases of 31.74% and 26.99%, respectively.

Item Type: Article
Date Type: Published Online
Status: Published
Schools: Schools > Computer Science & Informatics
Additional Information: License information from Publisher: LICENSE 1: URL: https://creativecommons.org/licenses/by/4.0/legalcode, Start Date: 2025-01-01
Publisher: Institute of Electrical and Electronics Engineers
ISSN: 2169-3536
Date of First Compliant Deposit: 29 April 2025
Date of Acceptance: 2 April 2025
Last Modified: 29 Apr 2025 10:30
URI: https://orca.cardiff.ac.uk/id/eprint/177958

Actions (repository staff only)

Edit Item Edit Item

Downloads

Downloads per month over past year

View more statistics