Cardiff University | Prifysgol Caerdydd ORCA
Online Research @ Cardiff 
WelshClear Cookie - decide language by browser settings

Deriving taxonomy from documents at sentence level

Liu, Ying ORCID:, Loh, Han Tong and Lu, Wen Feng 2008. Deriving taxonomy from documents at sentence level. Prado, Hercules Antonio do and Ferneda, Edilson, eds. Emerging Technologies of Text Mining: Techniques and Applications, Hershey, PA, USA: Information Science Reference, pp. 99-119. (10.4018/978-1-59904-373-9.ch005)

Full text not available from this repository.


This chapter introduces an approach of deriving taxonomy from documents using a novel document profile model that enables document representations with the semantic information systematically generated at the document sentence level. A frequent word sequence method is proposed to search for the salient semantic information and has been integrated into the document profile model. The experimental study of taxonomy generation using hierarchical agglomerative clustering has shown a significant improvement in terms of Fscore based on the document profile model. A close examination reveals that the integration of semantic information has a clear contribution compared to the classic bag-of-words approach. This study encourages us to further investigate the possibility of applying document profile model over a wide range of text based mining tasks.

Item Type: Book Section
Date Type: Publication
Status: Published
Schools: Engineering
Subjects: T Technology > TA Engineering (General). Civil engineering (General)
Publisher: Information Science Reference
ISBN: 9781599043739
Related URLs:
Last Modified: 25 Oct 2022 08:03

Citation Data

Cited 17 times in Scopus. View in Scopus. Powered By Scopus® Data

Actions (repository staff only)

Edit Item Edit Item