Liu, Ying ![]() |
Abstract
This chapter introduces an approach of deriving taxonomy from documents using a novel document profile model that enables document representations with the semantic information systematically generated at the document sentence level. A frequent word sequence method is proposed to search for the salient semantic information and has been integrated into the document profile model. The experimental study of taxonomy generation using hierarchical agglomerative clustering has shown a significant improvement in terms of Fscore based on the document profile model. A close examination reveals that the integration of semantic information has a clear contribution compared to the classic bag-of-words approach. This study encourages us to further investigate the possibility of applying document profile model over a wide range of text based mining tasks.
Item Type: | Book Section |
---|---|
Date Type: | Publication |
Status: | Published |
Schools: | Engineering |
Subjects: | T Technology > TA Engineering (General). Civil engineering (General) |
Publisher: | Information Science Reference |
ISBN: | 9781599043739 |
Related URLs: | |
Last Modified: | 25 Oct 2022 08:03 |
URI: | https://orca.cardiff.ac.uk/id/eprint/51239 |
Citation Data
Cited 17 times in Scopus. View in Scopus. Powered By Scopus® Data
Actions (repository staff only)
![]() |
Edit Item |