Cardiff University | Prifysgol Caerdydd ORCA
Online Research @ Cardiff 
WelshClear Cookie - decide language by browser settings

Revisiting sequential pattern hiding to enhance utility

Gkoulalas-Divanis, Aris and Loukides, Grigorios ORCID: https://orcid.org/0000-0003-0888-5061 2011. Revisiting sequential pattern hiding to enhance utility. Presented at: KDD '11: 17th ACM SIGKDD international conference on Knowledge discovery and data mining, San Diego, CA, USA, 21-24 August 2011. Published in: Ghosh, J. and Smyth, P. eds. Proceedings of the 17th ACM SIGKDD international conference on Knowledge discovery and data mining (KDD '11). New York: ACM, pp. 1316-1324. 10.1145/2020408.2020605

Full text not available from this repository.

Abstract

Sequence datasets are encountered in a plethora of applications spanning from web usage analysis to healthcare studies and ubiquitous computing. Disseminating such datasets offers remarkable opportunities for discovering interesting knowledge patterns, but may lead to serious privacy violations if sensitive patterns, such as business secrets, are disclosed. In this work, we consider how to sanitize data to prevent the disclosure of sensitive patterns during sequential pattern mining, while ensuring that the nonsensitive patterns can still be discovered. First, we re-define the problem of sequential pattern hiding to capture the information loss incurred by sanitization in terms of both events' modification (distortion) and lost nonsensitive knowledge patterns (side-effects). Second, we model sequences as graphs and propose two algorithms to solve the problem by operating on the graphs. The first algorithm attempts to sanitize data with minimal distortion, whereas the second focuses on reducing the side-effects. Extensive experiments show that our algorithms outperform the existing solution in terms of data distortion and side-effects and are more efficient.

Item Type: Conference or Workshop Item (Poster)
Date Type: Publication
Status: Published
Schools: Computer Science & Informatics
Subjects: Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Publisher: ACM
ISBN: 9781450308137
Last Modified: 20 Oct 2022 09:23
URI: https://orca.cardiff.ac.uk/id/eprint/31762

Citation Data

Cited 33 times in Scopus. View in Scopus. Powered By Scopus® Data

Actions (repository staff only)

Edit Item Edit Item