Cardiff University | Prifysgol Caerdydd ORCA
Online Research @ Cardiff 
WelshClear Cookie - decide language by browser settings

A Modified Fuzzy Clustering for Documents Retrieval: Application to Document Categorization

Nefti-Meziani, S., Oussalah, M. and Rezgui, Yacine ORCID: https://orcid.org/0000-0002-5711-8400 2009. A Modified Fuzzy Clustering for Documents Retrieval: Application to Document Categorization. Journal of the Operational Research Society 60 (3) , pp. 384-394. 10.1057/palgrave.jors.2602555

Full text not available from this repository.

Abstract

The paper advocates the use of a new fuzzy-based clustering algorithm for document categorization. Each document/datum will be represented as a fuzzy set. In this respect, the fuzzy clustering algorithm, will be constrained additionally in order to cluster fuzzy sets. Then, one needs to find a metric measure in order to detect the overlapping between documents and the cluster prototype (category). In this respect, we use one of the interclass probabilistic reparability measures known as Bhattacharyya distance, which will be incorporated in the general scheme of the fuzzy c-means algorithm for measuring the overlapping between fuzzy sets. This enables the introduction of fuzziness in the document clustering in the sense that it allows a single document to belong to more than one category. This is in line with semantic multiple interpretations conveyed by single words, which support multiple membership to several classes. Performances of the algorithms will be illustrated using a case study from the construction sector.

Item Type: Article
Date Type: Publication
Status: Published
Schools: Engineering
Subjects: Q Science > QA Mathematics > QA75 Electronic computers. Computer science
T Technology > TA Engineering (General). Civil engineering (General)
Uncontrolled Keywords: fuzzy clustering, document clustering, fuzzy decision making
Publisher: Palgrave Macmillan
ISSN: 0160-5682
Last Modified: 17 Oct 2022 09:44
URI: https://orca.cardiff.ac.uk/id/eprint/5453

Citation Data

Cited 10 times in Scopus. View in Scopus. Powered By Scopus® Data

Actions (repository staff only)

Edit Item Edit Item