Sidorowicz, Tomas, Peres, Pedro and Li, Yuhua ORCID: https://orcid.org/0000-0003-2913-4478 2022. A novel approach for cross-selling insurance products using positive unlabelled learning. Presented at: International Joint Conference on Neural Networks, Padua - Italy, 18-23 July 2022. 2022 International Joint Conference on Neural Networks (IJCNN). IEEE, 10.1109/IJCNN55064.2022.9892762 |
Preview |
PDF
- Accepted Post-Print Version
Download (626kB) | Preview |
Abstract
Successful cross-selling of products is a key goal of companies operating within the insurance industry. Choosing the right customer to approach for cross-purchase opportunities has a direct effect on both decreasing customer churn rate and increasing revenue. Unlike sales data of general products, insurance sales data typically contains only a few products (e.g., private medical insurance, life insurance, etc), it is highly imbalanced with a vast majority of customers with no cross-purchasing information, highly noisy due to varying purchase behaviour between different customers, and has no ground truth for knowing if the majority customers are truly non-cross-sell customers or they are missed opportunities. These data challenges render the building of machine learning models for accurately identifying potential cross-sell customers extremely difficult. This paper proposes a novel approach to solve this challenging problem of cross-sell customer identification using Positive Unlabelled (PU) learning in conjunction with advanced feature engineering on customer demographic data and unstructured customer question-response texts through topic modelling. We implement a bagging approach to iteratively learn the positive samples (the confirmed cross-sells) alongside random sub-samples of the unlabelled set. The proposed approach is extensively evaluated on real insurance data that has been newly collected from a leading insurance company for this study. Evaluation results demonstrate that our approach can successfully identify new potential opportunities for likely cross-sell customers.
Item Type: | Conference or Workshop Item (Paper) |
---|---|
Date Type: | Published Online |
Status: | In Press |
Schools: | Computer Science & Informatics Data Innovation Research Institute (DIURI) |
Subjects: | Q Science > QA Mathematics > QA75 Electronic computers. Computer science |
Uncontrolled Keywords: | positive unlabelled learning, cross-sell, bagging classification, topic modelling, text similarity |
Publisher: | IEEE |
ISBN: | 9781728186719 |
ISSN: | 2161-4407 |
Date of First Compliant Deposit: | 10 June 2022 |
Date of Acceptance: | 26 April 2022 |
Last Modified: | 25 Nov 2022 15:10 |
URI: | https://orca.cardiff.ac.uk/id/eprint/149953 |
Actions (repository staff only)
Edit Item |