Xiang, Zhiliang ![]() ![]() ![]() |
![]() |
PDF
- Accepted Post-Print Version
Download (415kB) |
Abstract
We present ASPEN+, which extends an existing ASP-based system, ASPEN, for collective entity resolution with two important functionalities: support for local merges and new optimality criteria for preferred solutions. Indeed, ASPEN only supports so-called global merges of entity-referring constants (e.g. author ids), in which all occurrences of matched constants are treated as equivalent and merged accordingly. However, it has been argued that when resolving data values, local merges are often more appropriate, as e.g. some instances of ‘J. Lee’ may refer to ‘Joy Lee’, while others should be matched with ‘Jake Lee’. In addition to allowing such local merges, ASPEN+ offers new optimality criteria for selecting solutions, such as minimizing rule violations or maximizing the number of rules supporting a merge. Our main contributions are thus: The formalization and computational analysis of various notions of optimal solution, and An extensive experimental evaluation on real-world datasets, demonstrating the effect of local merges and the new optimality criteria on both accuracy and runtime.
Item Type: | Conference or Workshop Item (Paper) |
---|---|
Status: | Submitted |
Schools: | Schools > Computer Science & Informatics |
Date of First Compliant Deposit: | 1 September 2025 |
Date of Acceptance: | 16 July 2025 |
Last Modified: | 11 Sep 2025 09:00 |
URI: | https://orca.cardiff.ac.uk/id/eprint/180758 |
Actions (repository staff only)
![]() |
Edit Item |