2. • CLUTO is a software package which is used for
clustering high dimensional datasets and for
analyzing the characteristics of the various
clusters.
3. Algorithms of CLUTO
• Vcluster
• Scluster
Major difference: Input
Vcluster: actual mutidimensional representation
of the objects to be clustered.
Scluster: The similarity matrix (or graph)
between these objects.
7. Reporting and Analysis Parameters
• Control the amount of information that vcluster
and scluster report about the clusters as well as
the analysis performed on discovered clusters.
• Examples
1. -clustfile = string. ( Default is
MatrixFile.clustering.Nclusters( or GraphFile))
2. -clabelfile = string (name of the file that’s stores the
labels of the columns. Used when –showfeatues, -
showsummaries or –labeltree are used)
9. Cluster Visualization Parameters
• Simple plots of the original input matrix which
show how the different objects (rows) and
features (columns) are clustered together.
• Examples
1. -plottree = string; gives graphic representation of
the entire hierarchical tree
2. -plotmatrix = string; shows how the rows of the
original matrix are clustered together.
12. Classfile and rlabelfile
0
Evo 1
Sem
Imp 2
Imp 3
Deo
Deo 4
Imp 5
Imp
Deo 6
Deo 7
Imp
Deo 8
Deo 9
Imp
Sem 10
Deo 11
Sem
Imp 12
Imp 13
Evo
14
15
13.
14.
15. The plot uses red to
denote positive values
and green to denote
negative values. Bright
red/green indicate
large
positive/negative
values, whereas colors
close to white indicate
values close to zero.
16. Sem 0
Imp 1
Deo 2
Deo 3
Sem 4
Evo 5
Evo 6
Sem 7
Imp 8
Imp 9
Deo 10
Deo 11
Imp 12
Imp 13
Deo 14
Deo 15
Imp 16
Deo 17
Deo 18
Imp 19
Sem 20