genieclust

Fast and Robust Hierarchical Clustering with Noise Points Detection

About

A retake on the Genie algorithm (Gagolewski, 2021 doi:10.1016/j.softx.2021.100722) - a robust hierarchical clustering method (Gagolewski, Bartoszuk, Cena, 2016 doi:10.1016/j.ins.2016.05.003). Now faster and more memory efficient; determining the whole hierarchy for datasets of 10M points in low-dimensional Euclidean spaces or 100K points in high-dimensional ones takes only 1-2 minutes. Allows clustering with respect to mutual reachability distances so that it can act as a noise point detector or a robustified version of 'HDBSCAN*' (one that is able to detect a predefined number of clusters and hence does not depend on the somewhat fragile 'eps' parameter). The package also features an implementation of inequality indices (the Gini and Bonferroni indices), external cluster validity measures (e.g., the normalised clustering accuracy and partition similarity scores such as the adjusted Rand, Fowlkes-Mallows, adjusted mutual information, and the pair sets index), and internal cluster validity indices (e.g., the Calinski-Harabasz, Davies-Bouldin, Ball-Hall, Silhouette, and generalised Dunn indices). See also the 'Python' version of 'genieclust' available on 'PyPI', which supports sparse data, more metrics, and even larger datasets.
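To make the Gini index mentioned above concrete, here is a minimal pure-Python sketch. It assumes the usual normalised form, the sum of absolute pairwise differences divided by (n - 1) times the total, which is 0 when all values are equal and 1 when a single element holds everything. The function name normalised_gini is hypothetical and this is not the package's own implementation.

```python
from itertools import combinations


def normalised_gini(values):
    """Normalised Gini index of a sequence of non-negative numbers.

    Illustrative sketch only; the name and the error handling are
    assumptions, not part of the genieclust API.
    """
    xs = [float(v) for v in values]
    n = len(xs)
    total = sum(xs)
    if n < 2 or total <= 0:
        raise ValueError("need at least two values with a positive sum")
    # Sum of absolute differences over all unordered pairs (O(n^2)).
    pairwise = sum(abs(a - b) for a, b in combinations(xs, 2))
    # Dividing by (n - 1) * total maps the result onto [0, 1].
    return pairwise / ((n - 1) * total)


print(normalised_gini([2, 2, 2, 2, 2]))   # perfectly equal values
print(normalised_gini([0, 0, 10, 0, 0]))  # one element holds everything
print(normalised_gini([7, 0, 3, 0, 0]))   # something in between
```

The quadratic pairwise loop is chosen for readability; a sorted, linear-time formulation would be preferable for long vectors.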

genieclust.gagolewski.com/
clustering-benchmarks.gagolewski.com/
github.com/gagolews/genieclust
System requirements OpenMP

Key Metrics

Version 1.1.5-2
Published 2023-10-18
Needs compilation? yes
License AGPL-3
CRAN checks genieclust results

Downloads

Period          Downloads  Change
Yesterday       15         0%
Last 7 days     82         -43%
Last 30 days    554        -8%
Last 90 days    1,924      +0%
Last 365 days   6,992      -0%

Maintainer

Marek Gagolewski

marek@gagolewski.com

Authors

Marek Gagolewski (aut, cre, cph)
Maciej Bartoszuk (ctb)
Anna Cena (ctb)
Peter M. Larsen (ctb)

Material

NEWS
Reference manual
Package source

In Views

Cluster

macOS

r-release (arm64)
r-oldrel (arm64)
r-release (x86_64)
r-oldrel (x86_64)

Windows

r-devel (x86_64)
r-release (x86_64)
r-oldrel (x86_64)

Old Sources

genieclust archive

Imports

Rcpp ≥ 1.0.4
stats
utils

Suggests

datasets
mlpack

LinkingTo

Rcpp

Reverse Depends

genie

Reverse Imports

Kmedians
RGMM