Cardiff University | Prifysgol Caerdydd ORCA
Online Research @ Cardiff 
WelshClear Cookie - decide language by browser settings

Advances in Logic-Based Entity Resolution: Enhancing ASPEN with Local Merges and Optimality Criteria

Xiang, Zhiliang ORCID: https://orcid.org/0000-0002-0263-7289, Bienvenu, Meghyn, Cima, Gianluca, Gutierrez Basulto, Victor ORCID: https://orcid.org/0000-0002-6117-5459 and Ibanez Garcia, Yazmin ORCID: https://orcid.org/0000-0002-1276-904X 2025. Advances in Logic-Based Entity Resolution: Enhancing ASPEN with Local Merges and Optimality Criteria. Presented at: 22nd International Conference on Principles of Knowledge Representation and Reasoning, Melbourne, Australia, November 11-17, 2025.

[thumbnail of ASPEn_KR2025_CR.pdf] PDF - Accepted Post-Print Version
Download (415kB)
Official URL: https://kr.org/KR2025/

Abstract

We present ASPEN+, which extends an existing ASP-based system, ASPEN, for collective entity resolution with two important functionalities: support for local merges and new optimality criteria for preferred solutions. Indeed, ASPEN only supports so-called global merges of entity-referring constants (e.g. author ids), in which all occurrences of matched constants are treated as equivalent and merged accordingly. However, it has been argued that when resolving data values, local merges are often more appropriate, as e.g. some instances of ‘J. Lee’ may refer to ‘Joy Lee’, while others should be matched with ‘Jake Lee’. In addition to allowing such local merges, ASPEN+ offers new optimality criteria for selecting solutions, such as minimizing rule violations or maximizing the number of rules supporting a merge. Our main contributions are thus: The formalization and computational analysis of various notions of optimal solution, and An extensive experimental evaluation on real-world datasets, demonstrating the effect of local merges and the new optimality criteria on both accuracy and runtime.

Item Type: Conference or Workshop Item (Paper)
Status: Submitted
Schools: Schools > Computer Science & Informatics
Date of First Compliant Deposit: 1 September 2025
Date of Acceptance: 16 July 2025
Last Modified: 11 Sep 2025 09:00
URI: https://orca.cardiff.ac.uk/id/eprint/180758

Actions (repository staff only)

Edit Item Edit Item

Downloads

Downloads per month over past year

View more statistics