Cardiff University | Prifysgol Caerdydd ORCA
Online Research @ Cardiff 
WelshClear Cookie - decide language by browser settings

Optimal event sequence sanitization

Loukides, Grigorios ORCID: https://orcid.org/0000-0003-0888-5061 and Gwadera, Robert 2015. Optimal event sequence sanitization. Presented at: SIAM International Conference on Data Mining, Vancouver, Canada, 30 April - 2 May 2015. Published in: Ventakatasubramanian, Suresh and Ye, Jieping eds. Proceedings of the 2015 SIAM International Conference on Data Mining. Society for Industrial and Applied Mathematics, pp. 775-784. 10.1137/1.9781611974010.87

[thumbnail of sdm_odesa_with_supplement.pdf]
Preview
PDF - Submitted Pre-Print Version
Download (528kB) | Preview

Abstract

Frequent event mining is a fundamental task to extract insight from an event sequence (long sequence of events that are associated with time points). However, it may expose sensitive events that leak confidential business knowledge or lead to intrusive inferences about groups of individuals. In this work, we aim to prevent this threat, by deleting occurrences of sensitive events, while preserving the utility of the event sequence. To quantify utility, we propose a model that captures changes, caused by deletion, to the probability distribution of events across the sequence. Based on the model, we define the problem of sanitizing an event sequence as an optimization problem. Solving the problem is important to preserve the output of many mining tasks, including frequent pattern mining and sequence segmentation. However, this is also challenging, due to the exponential number of ways to apply deletion to the sequence. To optimally solve the problem when there is one sensitive event, we develop an efficient algorithm based on dynamic programming. The algorithm also forms the basis of a simple, iterative method that optimally sanitizes an event sequence, when there are multiple sensitive events. Experiments on real and synthetic datasets show the effectiveness and efficiency of our method.

Item Type: Conference or Workshop Item (Paper)
Date Type: Publication
Status: Published
Schools: Computer Science & Informatics
Subjects: Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Publisher: Society for Industrial and Applied Mathematics
ISBN: 9781611974010
Date of Acceptance: 22 January 2015
Last Modified: 07 Nov 2023 00:38
URI: https://orca.cardiff.ac.uk/id/eprint/89640

Actions (repository staff only)

Edit Item Edit Item

Downloads

Downloads per month over past year

View more statistics