Sergio Gómez   
Departament d'Enginyeria Informàtica i Matemàtiques
Universitat Rovira i Virgili


A hierarchical clustering tool




MultiDendrograms is a simple yet powerful program for the Hierarchical Clustering of real data, distributed under an Open Source license. Starting from a distance (or similarity) matrix, MultiDendrograms calculates its dendrogram using the most common Agglomerative Hierarchical Clustering algorithms, allows the tuning of many of the graphical representation parameters, and lets the results be easily exported to file. A summary of characteristics:

MultiDendrograms implements the variable-group algorithms in [1] to solve the non-uniqueness problem found in the standard pair-group algorithms and implementations. This problem arises when two or more minimum distances between different clusters are equal during the amalgamation process. The standard approach consists of choosing a single pair, thereby breaking the ties between distances, and proceeding in the same way until the final hierarchical classification is obtained. However, different clusterings are possible depending on the criterion used to break the ties (usually a pair is just chosen at random!), and the user is typically unaware of this problem.
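To make the tie problem concrete, the following Python sketch looks for ties at one amalgamation step. It is an illustration only, not the MultiDendrograms implementation: the function names `tied_minimum_pairs` and `tied_groups`, the tolerance `tol`, and the small example matrix are all assumptions introduced here. The first function lists every pair of clusters at the minimum distance; the second collects the clusters connected through those tied pairs, which is the set a variable-group step would plausibly merge at once instead of choosing one pair.

```python
from itertools import combinations
import math


def tied_minimum_pairs(dist, tol=1e-12):
    """Return (minimum distance, list of index pairs that attain it).

    `dist` is a symmetric matrix given as a list of lists; `tol` is a
    hypothetical absolute tolerance used to treat two distances as equal.
    """
    pairs = list(combinations(range(len(dist)), 2))
    d_min = min(dist[i][j] for i, j in pairs)
    tied = [(i, j) for i, j in pairs
            if math.isclose(dist[i][j], d_min, abs_tol=tol)]
    return d_min, tied


def tied_groups(tied):
    """Group clusters linked by tied pairs (connected components)."""
    parent = {}

    def find(x):
        # Union-find lookup with path halving.
        parent.setdefault(x, x)
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for i, j in tied:
        parent[find(i)] = find(j)

    groups = {}
    for x in list(parent):
        groups.setdefault(find(x), []).append(x)
    return sorted(sorted(g) for g in groups.values())


# Hypothetical 4x4 distance matrix: pairs (0, 1) and (0, 2) tie at distance 2.
example = [
    [0, 2, 2, 5],
    [2, 0, 3, 5],
    [2, 3, 0, 4],
    [5, 5, 4, 0],
]

d_min, tied = tied_minimum_pairs(example)
print(d_min, tied)        # minimum distance and the pairs that share it
print(tied_groups(tied))  # clusters that would be merged together
```

With more than one tied pair, a pair-group algorithm has to pick one of them, and the resulting dendrogram may depend on that choice. An empty tie list beyond the first pair means the step is unambiguous.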

The variable-group algorithms group more than two clusters at the same time when ties occur, giving rise to a graphical representation called a multidendrogram. Their main properties are:

MultiDendrograms also introduces a new parameterized type of hierarchical clustering algorithm called Versatile Linkage, which includes Single Linkage, Complete Linkage and Arithmetic Linkage as particular cases, and which naturally defines two new algorithms, Geometric Linkage and Harmonic Linkage (hence the convenience of renaming UPGMA as Arithmetic Linkage, to emphasize the existence of different types of averages).
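One way to picture such a parameterized family is through the generalized (power) mean of the distances between the members of two clusters. The sketch below assumes that parameterization, which this page does not state explicitly; the names `power_mean`, `versatile_distance` and the exponent `p` are hypothetical, and the unweighted average over all member pairs is a simplification. Under that assumption, the exponent selects the linkage: minus infinity gives the minimum (Single), -1 the harmonic mean, 0 the geometric mean, 1 the arithmetic mean, and plus infinity the maximum (Complete).

```python
import math


def power_mean(values, p):
    """Generalized mean of strictly positive values with exponent p."""
    if p == -math.inf:
        return min(values)   # limit p -> -infinity
    if p == math.inf:
        return max(values)   # limit p -> +infinity
    if p == 0:
        # Limit p -> 0 is the geometric mean.
        return math.exp(sum(math.log(v) for v in values) / len(values))
    return (sum(v ** p for v in values) / len(values)) ** (1.0 / p)


def versatile_distance(dist, cluster_a, cluster_b, p):
    """Distance between two clusters as a power mean of member distances.

    `dist` is a symmetric matrix (list of lists); the clusters are lists
    of row indices. This unweighted form is an assumption of the sketch.
    """
    values = [dist[i][j] for i in cluster_a for j in cluster_b]
    return power_mean(values, p)


# Hypothetical distances from item 0 to items 1, 2 and 3.
example = [
    [0, 1, 2, 4],
    [1, 0, 3, 3],
    [2, 3, 0, 3],
    [4, 3, 3, 0],
]

for name, p in [("single", -math.inf), ("harmonic", -1), ("geometric", 0),
                ("arithmetic", 1), ("complete", math.inf)]:
    print(name, versatile_distance(example, [0], [1, 2, 3], p))
```

Because the power mean is non-decreasing in its exponent, the five values printed above are ordered from the Single Linkage distance up to the Complete Linkage distance.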

Comparison with other applications

How do other applications deal with ties?

How do I know if there are ties in my data?

How many binary dendrograms may correspond to one MultiDendrogram?


[1] Solving Non-uniqueness in Agglomerative Hierarchical Clustering Using Multidendrograms
Alberto Fernández and Sergio Gómez
Journal of Classification 25 (2008) 43-65
(pdf) (doi) (Springer)


Please cite [1] if you use MultiDendrograms in your publications:

Alternatively, you may use the Hierarchical_Clustering program in Radatools, which is able to calculate MultiDendrograms and also to enumerate or count the corresponding Binary Dendrograms.


No installation is needed: just unzip and run multidendrograms.bat (Windows), (Linux) or multidendrograms.jar (all OS). Java version 6 (also known as Java 1.6) or higher is required.


You may contribute to the development of MultiDendrograms on GitHub:


