Heuristic target class selection for advancing performance of coverage-based rule learning

Liu, Han

, Chen, Shyi-Ming and Cocea, Mihaela 2019. Heuristic target class selection for advancing performance of coverage-based rule learning. Information Sciences 479 , pp. 164-179. 10.1016/j.ins.2018.12.001

Preview

PDF - Accepted Post-Print Version
Available under License Creative Commons Attribution Non-commercial No Derivatives.
Download (593kB) | Preview

Official URL: https://doi.org/10.1016/j.ins.2018.12.001

Abstract

Rule learning is a popular branch of machine learning, which can provide accurate and interpretable classification results. In general, two main strategies of rule learning are referred to as 'divide and conquer' and 'separate and con-quer'. Decision tree generation that follows the former strategy has a serious drawback, which is known as the replicated sub-tree problem, resulting from the constraint that all branches of a decision tree must have one or more common attributes. The above problem is likely to result in high computational complexity and the risk of overfitting, which leads to the necessity to develop rule learning algorithms (e.g., Prism) that follow the separate and conquer strategy. The replicated sub-tree problem can be effectively solved using the Prism algorithm , but the trained models are still complex due to the need of training an independent rule set for each selected target class. In order to reduce the risk of overfitting and the model complexity, we propose in this paper a variant of the Prism algorithm referred to as PrismCTC. The experimental results show that the PrismCTC algorithm leads to advances in classification performance and reduction of model complexity, in comparison with the C4.5 and Prism algorithms.

Item Type:	Article
Date Type:	Publication
Status:	Published
Schools:	Schools > Computer Science & Informatics
Publisher:	Elsevier
ISSN:	0020-0255
Date of First Compliant Deposit:	4 December 2018
Date of Acceptance:	1 December 2018
Last Modified:	15 Nov 2024 15:05
URI:	https://orca.cardiff.ac.uk/id/eprint/117342

Citation Data

Cited 8 times in Scopus. View in Scopus. Powered By Scopus® Data

Actions (repository staff only)

Edit Item

Altmetric

Dimensions

Download Statistics

Downloads

Downloads per month over past year

View more statistics

CORE (COnnecting REpositories)