Cardiff University | Prifysgol Caerdydd ORCA
Online Research @ Cardiff 
WelshClear Cookie - decide language by browser settings

On helmholtz's principle for documents processing

Balinsky, Alexander ORCID:, Balinsky, Helen Y. and Simske, Steven J. 2010. On helmholtz's principle for documents processing. Presented at: 10th ACM Symposium on Document Engineering (DocEng2010), Manchester, UK, 21-24 September 2010. Published in: Antonacopoulos, Apostolos, Grmish, Michael and Ingold, Rolf eds. Proceedings of the 10th ACM symposium on Document engineering. New York: ACM Press, pp. 283-286. 10.1145/1860559.1860624

Full text not available from this repository.


Keyword extraction is a fundamental problem in text data mining and document processing. A large number of document processing applications directly depend on the quality and speed of keyword extraction algorithms. In this article, a novel approach to rapid change detection in data stream and documents is developed. It is based on ideas from image processing and especially on the Helmholtz Principle from the Gestalt Theory of human perception. Applied to the problem of keywords extraction, it delivers fast and effective tools to identify meaningful keywords using parameter-free methods. We also define a level of meaningfulness of the keywords which can be used to modify the set of keywords depending on application needs.

Item Type: Conference or Workshop Item (Paper)
Date Type: Publication
Status: Published
Schools: Mathematics
Subjects: Q Science > QA Mathematics
Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Publisher: ACM Press
ISBN: 9781450302319
Related URLs:
Last Modified: 19 Oct 2022 10:27

Citation Data

Cited 23 times in Scopus. View in Scopus. Powered By Scopus® Data

Actions (repository staff only)

Edit Item Edit Item