Skip to content

Research at St Andrews

PFClust: an optimised implementation of a parameter-free clustering algorithm

Research output: Contribution to journalArticlepeer-review

DOI

Open Access permissions

Open

Abstract

Background: A well-known problem in cluster analysis is finding an optimal number of clusters reflecting the inherent structure of the data. PFClust is a partitioning-based clustering algorithm capable, unlike many widely-used clustering algorithms, of automatically proposing an optimal number of clusters for the data. Results: The results of tests on various types of data showed that PFClust can discover clusters of arbitrary shapes, sizes and densities. The previous implementation of the algorithm had already been successfully used to cluster large macromolecular structures and small druglike compounds. We have greatly improved the algorithm by a more efficient implementation, which enables PFClust to process large data sets acceptably fast. Conclusions: In this paper we present a new optimized implementation of the PFClust algorithm that runs considerably faster than the original.
Close

Details

Original languageEnglish
JournalSource Code for Biology and Medicine
Volume9
Issue number5
DOIs
Publication statusPublished - Feb 2014

    Research areas

  • Clustering, Cluster analysis, Number of clusters

Discover related content
Find related publications, people, projects and more using interactive charts.

View graph of relations

Related by author

  1. Cooperative coinfection dynamics on clustered networks

    Mann, P., Smith, V. A., Mitchell, J. B. O. & Dobson, S., 9 Apr 2021, In: Physical Review E. 103, 4, 042307.

    Research output: Contribution to journalArticlepeer-review

  2. Percolation in random graphs with higher-order clustering

    Mann, P. S., Smith, V. A., Mitchell, J. B. O. & Dobson, S. A., 25 Jan 2021, In: Physical Review. E, Statistical, nonlinear, and soft matter physics. 103, 1, 11 p., 012313 .

    Research output: Contribution to journalArticlepeer-review

  3. Random graphs with arbitrary clustering and their applications

    Mann, P. S., Smith, V. A., Mitchell, J. B. O. & Dobson, S. A., 22 Jan 2021, In: Physical Review. E, Statistical, nonlinear, and soft matter physics. 103, 1, 10 p., 012309 .

    Research output: Contribution to journalArticlepeer-review

  4. Three machine learning models for the 2019 Solubility Challenge

    Mitchell, J. B. O., 27 Sep 2020, In: ADMET & DMPK. 8, 3, p. 215-251 37 p.

    Research output: Contribution to journalArticlepeer-review

  5. Data protection for the common good: developing a framework for a data protection-focused data commons

    Wong, J., Henderson, T. & Ball, K., 15 Sep 2020, Data for Policy Conference 2020. Data for Policy, 6 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

Related by journal

  1. MOSAL: software tools for multiobjective sequence alignment

    Paquete, L., Matias, P., Abbasi, M. & Monsanto Pinheiro, M., 8 Jan 2014, In: Source Code for Biology and Medicine. 9, 2.

    Research output: Contribution to journalArticlepeer-review

ID: 95083033

Top