g03ejf computes a cluster indicator variable from the results of
g03ecf.
Given a distance or dissimilarity matrix for
objects, cluster analysis aims to group the
objects into a number of more or less homogeneous groups or clusters. With agglomerative clustering methods (see
g03ecf), a hierarchical tree is produced by starting with
clusters each with a single object and then at each of
stages, merging two clusters to form a larger cluster until all objects are in a single cluster.
g03ejf takes the information from the tree and produces the clusters that exist at a given distance. This is equivalent to taking the dendrogram (see
g03ehf) and drawing a line across at a given distance to produce clusters.
As an alternative to giving the distance at which clusters are required, you can specify the number of clusters required and g03ejf will compute the corresponding distance. However, it may not be possible to compute the number of clusters required due to ties in the distance matrix.
If on entry
or
, explanatory error messages are output on the current error message unit (as defined by
x04aaf).
The accuracy will depend upon the accuracy of the distances in
cd and
dord (see
g03ecf).
Background information to multithreading can be found in the
Multithreading documentation.
A fixed number of clusters can be found using the non-hierarchical method used in
g03eff.
Data consisting of three variables on five objects are input. Euclidean squared distances are computed using
g03eaf and median clustering performed using
g03ecf. A dendrogram is produced by
g03ehf and printed.
g03ejf finds two clusters and the results are printed.