
Research at St Andrews

Combining clustering and classification ensembles: a novel pipeline to identify breast cancer profiles

Research output: Contribution to journal › Article

Open Access permissions: Open

Author(s)

Utkarsh Agrawal, Daniele Soria, Christian Wagner, Jonathan Garibaldi, Ian O. Ellis, John M.S. Bartlett, Emad A. Rakha, Andrew R. Green

Abstract

Breast cancer is one of the most common causes of cancer death in women and is a very complex disease with varied molecular alterations. To assist breast cancer prognosis, classifying patients into biological groups is of great significance for treatment strategies. Recent studies have used an ensemble of multiple clustering algorithms to elucidate the most characteristic biological groups of breast cancer. However, combining various clustering methods left a number of patients unclustered. A framework is therefore still needed that can assign as many unclustered (i.e. biologically diverse) patients as possible to one of the identified groups in order to improve classification. In this paper we develop a novel classification framework which introduces a new ensemble classification stage after the ensemble clustering stage to target the unclustered patients. The result is a step-by-step pipeline that couples ensemble clustering with ensemble classification to identify core groups, characterise the distribution of data within them, and improve the final classification results by targeting the unclustered data. The proposed pipeline is applied to a novel real-world breast cancer dataset, and its robustness and stability are then examined by testing it on standard datasets. The results show that the presented framework yields an improved classification. Finally, the results have been verified using statistical tests, visualisation techniques, cluster quality assessment and interpretation from clinical experts.
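The two-stage idea described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes that cluster labels from the different algorithms have already been aligned, that a "core" patient is one on which all clusterings agree, and that the ensemble classifier combines its members by simple majority vote. The function names (`consensus`, `nearest_centroid`, `knn`, `assign_unclustered`) and the toy data are hypothetical.

```python
from collections import Counter
from math import dist


def consensus(labelings):
    """Keep a label only where every clustering agrees; otherwise None (unclustered)."""
    return [row[0] if len(set(row)) == 1 else None for row in zip(*labelings)]


def nearest_centroid(train_X, train_y, x):
    """Predict the group whose mean point is closest to x."""
    groups = {}
    for xi, yi in zip(train_X, train_y):
        groups.setdefault(yi, []).append(xi)
    centroids = {y: tuple(sum(c) / len(c) for c in zip(*pts))
                 for y, pts in groups.items()}
    return min(centroids, key=lambda y: dist(centroids[y], x))


def knn(k):
    """Build a k-nearest-neighbour predictor with the same call signature."""
    def predict(train_X, train_y, x):
        nearest = sorted(zip(train_X, train_y), key=lambda p: dist(p[0], x))[:k]
        return Counter(y for _, y in nearest).most_common(1)[0][0]
    return predict


def assign_unclustered(X, labelings, classifiers):
    """Stage 1: find core groups. Stage 2: classify the rest by majority vote."""
    core = consensus(labelings)
    train_X = [x for x, c in zip(X, core) if c is not None]
    train_y = [c for c in core if c is not None]
    final = []
    for x, c in zip(X, core):
        if c is None:
            votes = Counter(clf(train_X, train_y, x) for clf in classifiers)
            c = votes.most_common(1)[0][0]
        final.append(c)
    return core, final


# Toy data: two well-separated groups plus one point the clusterings disagree on.
X = [(0.0, 0.0), (0.2, 0.1), (0.1, 0.3),
     (5.0, 5.0), (5.2, 4.9), (4.8, 5.1),
     (1.0, 1.0)]
labels_a = [0, 0, 0, 1, 1, 1, 0]  # hypothetical output of clustering algorithm A
labels_b = [0, 0, 0, 1, 1, 1, 1]  # hypothetical output of clustering algorithm B

core, final = assign_unclustered(X, [labels_a, labels_b],
                                 [nearest_centroid, knn(1), knn(3)])
print(core)
print(final)
```

In this toy run the last point is left unclustered by the consensus step and is then assigned to a core group by the classifier vote. A real study would replace the hand-written labels with the outputs of actual clustering algorithms and would need a label-matching step before the consensus.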

Details

Original language: English
Pages (from-to): 27-37
Journal: Artificial Intelligence in Medicine
Volume: 97
Early online date: 15 May 2019
Publication status: Published - Jun 2019

    Research areas

  • Ensemble clustering, Ensemble classification, Class level fusion, Refining cluster results, Breast cancer, Pipeline

Related by author

  1. Towards real-time heavy goods vehicle driving behaviour classification in the United Kingdom

    Agrawal, U., Mase, J. M., Figueredo, G. P., Wagner, C., Mesgarpour, M. & John, R. I., 27 Oct 2019, 2019 IEEE International Intelligent Transportation Systems Conference (ITSC). IEEE, p. 2330-2336

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  2. Fuzzy integral driven ensemble classification using a priori fuzzy measures

    Agrawal, U., Wagner, C., Garibaldi, J. & Soria, D., 10 Oct 2019, 2019 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE). IEEE, p. 1-7, 7 p., 8858821. (IEEE International Conference on Fuzzy Systems (FUZZ-IEEE)).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

ID: 263044195
