SlideShare una empresa de Scribd logo
1 de 16
CLUTO
A Clustering Toolkit
          By
    Roseline Antai
• CLUTO is a software package which is used for
  clustering high dimensional datasets and for
  analyzing the characteristics of the various
  clusters.
Algorithms of CLUTO
• Vcluster
• Scluster

Major difference: Input
Vcluster: actual mutidimensional representation
  of the objects to be clustered.
Scluster: The similarity matrix (or graph)
  between these objects.
Calling Sequence

vcluster [optional parameters] MatrixFile
  Nclusters
scluster [optional parameters] MatrixFile
  NClusters
Optional Parameters
• Standard specification
      -paramname or –paramname = value

• Three categories:
  – Clustering algorithm parameters
  – Reporting and Analysis parameters
  – Cluster Visualization parameters
Clustering algorithm parameters
• Control how CLUTO computes the clustering
  solution.
• Examples

  1.   -clmethod=string ( rb, agglo,direct,graph, etc)
  2.   -sim = string (cos,corr,dist,jacc)
  3.   -crfun = string (i1,i2 etc)
  4.   -fulltree
Reporting and Analysis Parameters
• Control the amount of information that vcluster
  and scluster report about the clusters as well as
  the analysis performed on discovered clusters.
• Examples
  1. -clustfile = string. ( Default is
     MatrixFile.clustering.Nclusters( or GraphFile))
  2. -clabelfile = string (name of the file that’s stores the
     labels of the columns. Used when –showfeatues, -
     showsummaries or –labeltree are used)
3. -rlabelfile=string
4. -rclassfile=string (Stores the labels of the rows –
   objects to be clustered).
5. -showtree
6. -showfeatures (descriptive and discriminating)
Cluster Visualization Parameters
• Simple plots of the original input matrix which
  show how the different objects (rows) and
  features (columns) are clustered together.
• Examples
  1. -plottree = string; gives graphic representation of
     the entire hierarchical tree
  2. -plotmatrix = string; shows how the rows of the
     original matrix are clustered together.
A practical example
– ../cluto/Linux/vcluster -clmethod=rb -sim=cos -
  fulltree -rlabelfile=Final_Results/rlabelfile -
  rclassfile=Final_Results/classfile -showtree -
  plotformat=gif -plottree=Final_Results/Images/PT-
  Final10d -plotmatrix=Final_Results/Images/PM-
  Final10d -plotclusters=Final_Results/Images/PC-
  Final10d -showfeatures
  Final_Results/FinalOutput10d-Vt.mat 4
roselineantai@ubuntu:~/JLSI/jlsi$ ./clusterscript2.sh
********************************************************************************
vcluster (CLUTO 2.1.1) Copyright 2001-03, Regents of the University of Minnesota
Matrix Information -----------------------------------------------------------
  Name: Final_Results2/FinalOutput50dFinal.mat, #Rows: 59, #Columns: 59, #NonZeros:   3481
Options ----------------------------------------------------------------------
  CLMethod=RB, CRfun=I2, SimFun=Cosine, #Clusters: 4
  RowModel=None, ColModel=None, GrModel=SY-DIR, NNbrs=40
  Colprune=1.00, EdgePrune=-1.00, VtxPrune=-1.00, MinComponent=5
  CSType=Best, AggloFrom=0, AggloCRFun=I2, NTrials=10, NIter=10
Solution ---------------------------------------------------------------------
------------------------------------------------------------------------
4-way clustering: [I2=5.65e+01] [59 of 59], Entropy: 0.438, Purity: 0.729
------------------------------------------------------------------------
cid Size ISim ISdev      ESim ESdev Entpy Purty | Sem Imp Deo Evo
------------------------------------------------------------------------
  0     2 +0.779 +0.000 +0.638 +0.222 0.000 1.000 |    0    2    0    0
  1    15 +0.918 +0.024 +0.853 +0.021 0.453 0.800 |    1    0    2   12
  2    22 +0.923 +0.015 +0.866 +0.019 0.416 0.818 |    1   18    3    0
  3    20 +0.923 +0.015 +0.869 +0.034 0.496 0.550 |    9    0   11    0
------------------------------------------------------------------------
--------------------------------------------------------------------------------
4-way clustering solution - Descriptive & Discriminating Features...
--------------------------------------------------------------------------------
Cluster   0, Size:     2, ISim: 0.779, ESim: 0.638
      Descriptive: col00010 32.4%, col00039 22.7%, col00013 5.6%, col00019 3.9%,      col00047   2.2%
   Discriminating: col00010 53.5%, col00039 22.3%, col00013 2.5%, col00020 2.1%,      col00012   1.4%
Cluster   1, Size:    15, ISim: 0.918, ESim: 0.853
      Descriptive: col00013 11.3%, col00006 8.2%, col00019 7.9%, col00058 4.4%,       col00047   4.2%
   Discriminating: col00006 14.8%, col00007 7.4%, col00031 7.1%, col00029 6.6%,       col00053   6.2%
Cluster   2, Size:    22, ISim: 0.923, ESim: 0.866
      Descriptive: col00013 14.3%, col00020 7.1%, col00014 6.4%, col00019 4.4%,       col00047   4.4%
   Discriminating: col00020 7.0%, col00014 5.9%, col00013 5.5%, col00006 5.1%,        col00010   4.3%
Cluster   3, Size:    20, ISim: 0.923, ESim: 0.869
      Descriptive: col00013 8.9%, col00047 6.6%, col00019 5.2%, col00020 4.4%,        col00014   3.8%
   Discriminating: col00015 8.0%, col00013 7.4%, col00046 5.2%, col00006 5.0%,        col00042   4.3%
--------------------------------------------------------------------------------
------------------------------------------------------------------------------
Hierarchical Tree that optimizes the I2 criterion function...
------------------------------------------------------------------------------
               Sem Imp Deo Evo
------------------------------------
6
|---4
|   |---2        1   18    3    0
|   |---3        9    0   11    0
|-5
  |-----1        1    0    2   12
  |-----0        0    2    0    0
------------------------------------
------------------------------------------------------------------------------
Timing Information -----------------------------------------------------------
   I/O:                                   0.004 sec
   Clustering:                            0.008 sec
   Reporting:                             0.316 sec
********************************************************************************
Classfile and rlabelfile
         0
Evo      1
Sem
Imp      2
Imp      3
Deo
Deo      4
Imp      5
Imp
Deo      6
Deo      7
Imp
Deo      8
Deo      9
Imp
Sem      10
Deo      11
Sem
Imp      12
Imp      13
Evo
         14
         15
The plot uses red to
denote positive values
and green to denote
negative values. Bright
red/green indicate
large
positive/negative
values, whereas colors
close to white indicate
values close to zero.
Sem   0
Imp   1
Deo   2
Deo   3
Sem   4
Evo   5
Evo   6
Sem   7
Imp   8
Imp   9
Deo   10
Deo   11
Imp   12
Imp   13
Deo   14
Deo   15
Imp   16
Deo   17
Deo   18
Imp   19
Sem   20

Más contenido relacionado

La actualidad más candente

Running Free with the Monads
Running Free with the MonadsRunning Free with the Monads
Running Free with the Monadskenbot
 
Extreme querying with_analytics
Extreme querying with_analyticsExtreme querying with_analytics
Extreme querying with_analyticsGary Myers
 
Query optimization techniques for partitioned tables.
Query optimization techniques for partitioned tables.Query optimization techniques for partitioned tables.
Query optimization techniques for partitioned tables.Ashutosh Bapat
 
Partition and conquer large data in PostgreSQL 10
Partition and conquer large data in PostgreSQL 10Partition and conquer large data in PostgreSQL 10
Partition and conquer large data in PostgreSQL 10Ashutosh Bapat
 
Window functions in MariaDB 10.2
Window functions in MariaDB 10.2Window functions in MariaDB 10.2
Window functions in MariaDB 10.2Sergey Petrunya
 
peRm R group. Review of packages for r for market data downloading and analysis
peRm R group. Review of packages for r for market data downloading and analysispeRm R group. Review of packages for r for market data downloading and analysis
peRm R group. Review of packages for r for market data downloading and analysisVyacheslav Arbuzov
 
Monads from Definition
Monads from DefinitionMonads from Definition
Monads from DefinitionDierk König
 
Polymorphic Table Functions in 18c
Polymorphic Table Functions in 18cPolymorphic Table Functions in 18c
Polymorphic Table Functions in 18cAndrej Pashchenko
 
Common Table Expressions in MariaDB 10.2 (Percona Live Amsterdam 2016)
Common Table Expressions in MariaDB 10.2 (Percona Live Amsterdam 2016)Common Table Expressions in MariaDB 10.2 (Percona Live Amsterdam 2016)
Common Table Expressions in MariaDB 10.2 (Percona Live Amsterdam 2016)Sergey Petrunya
 
High Performance Mysql - Friday Tech Talks at Squareboat
High Performance Mysql - Friday Tech Talks at SquareboatHigh Performance Mysql - Friday Tech Talks at Squareboat
High Performance Mysql - Friday Tech Talks at SquareboatSquareboat
 
Using Optimizer Hints to Improve MySQL Query Performance
Using Optimizer Hints to Improve MySQL Query PerformanceUsing Optimizer Hints to Improve MySQL Query Performance
Using Optimizer Hints to Improve MySQL Query Performanceoysteing
 
Data Visualization With R: Learn To Modify Color Of Plots
Data Visualization With R: Learn To Modify Color Of PlotsData Visualization With R: Learn To Modify Color Of Plots
Data Visualization With R: Learn To Modify Color Of PlotsRsquared Academy
 
Oracle Database 12c - Introducing SQL Pattern Recognition through MATCH_RECOG...
Oracle Database 12c - Introducing SQL Pattern Recognition through MATCH_RECOG...Oracle Database 12c - Introducing SQL Pattern Recognition through MATCH_RECOG...
Oracle Database 12c - Introducing SQL Pattern Recognition through MATCH_RECOG...Lucas Jellema
 
Function Operations
Function OperationsFunction Operations
Function Operationsswartzje
 
Pandas Cheat Sheet
Pandas Cheat SheetPandas Cheat Sheet
Pandas Cheat SheetACASH1011
 

La actualidad más candente (20)

Resoluçãohaskell2
Resoluçãohaskell2Resoluçãohaskell2
Resoluçãohaskell2
 
Running Free with the Monads
Running Free with the MonadsRunning Free with the Monads
Running Free with the Monads
 
Extreme querying with_analytics
Extreme querying with_analyticsExtreme querying with_analytics
Extreme querying with_analytics
 
Seminar PSU 10.10.2014 mme
Seminar PSU 10.10.2014 mmeSeminar PSU 10.10.2014 mme
Seminar PSU 10.10.2014 mme
 
Query optimization techniques for partitioned tables.
Query optimization techniques for partitioned tables.Query optimization techniques for partitioned tables.
Query optimization techniques for partitioned tables.
 
Partition and conquer large data in PostgreSQL 10
Partition and conquer large data in PostgreSQL 10Partition and conquer large data in PostgreSQL 10
Partition and conquer large data in PostgreSQL 10
 
Window functions in MariaDB 10.2
Window functions in MariaDB 10.2Window functions in MariaDB 10.2
Window functions in MariaDB 10.2
 
peRm R group. Review of packages for r for market data downloading and analysis
peRm R group. Review of packages for r for market data downloading and analysispeRm R group. Review of packages for r for market data downloading and analysis
peRm R group. Review of packages for r for market data downloading and analysis
 
Monads from Definition
Monads from DefinitionMonads from Definition
Monads from Definition
 
Perm winter school 2014.01.31
Perm winter school 2014.01.31Perm winter school 2014.01.31
Perm winter school 2014.01.31
 
Polymorphic Table Functions in 18c
Polymorphic Table Functions in 18cPolymorphic Table Functions in 18c
Polymorphic Table Functions in 18c
 
Common Table Expressions in MariaDB 10.2 (Percona Live Amsterdam 2016)
Common Table Expressions in MariaDB 10.2 (Percona Live Amsterdam 2016)Common Table Expressions in MariaDB 10.2 (Percona Live Amsterdam 2016)
Common Table Expressions in MariaDB 10.2 (Percona Live Amsterdam 2016)
 
High Performance Mysql - Friday Tech Talks at Squareboat
High Performance Mysql - Friday Tech Talks at SquareboatHigh Performance Mysql - Friday Tech Talks at Squareboat
High Performance Mysql - Friday Tech Talks at Squareboat
 
Using Optimizer Hints to Improve MySQL Query Performance
Using Optimizer Hints to Improve MySQL Query PerformanceUsing Optimizer Hints to Improve MySQL Query Performance
Using Optimizer Hints to Improve MySQL Query Performance
 
Data Visualization With R: Learn To Modify Color Of Plots
Data Visualization With R: Learn To Modify Color Of PlotsData Visualization With R: Learn To Modify Color Of Plots
Data Visualization With R: Learn To Modify Color Of Plots
 
Oracle Database 12c - Introducing SQL Pattern Recognition through MATCH_RECOG...
Oracle Database 12c - Introducing SQL Pattern Recognition through MATCH_RECOG...Oracle Database 12c - Introducing SQL Pattern Recognition through MATCH_RECOG...
Oracle Database 12c - Introducing SQL Pattern Recognition through MATCH_RECOG...
 
Mat lab day 1
Mat lab day 1Mat lab day 1
Mat lab day 1
 
Function Operations
Function OperationsFunction Operations
Function Operations
 
Numpy python cheat_sheet
Numpy python cheat_sheetNumpy python cheat_sheet
Numpy python cheat_sheet
 
Pandas Cheat Sheet
Pandas Cheat SheetPandas Cheat Sheet
Pandas Cheat Sheet
 

Similar a Cluto presentation

Hailey_Database_Performance_Made_Easy_through_Graphics.pdf
Hailey_Database_Performance_Made_Easy_through_Graphics.pdfHailey_Database_Performance_Made_Easy_through_Graphics.pdf
Hailey_Database_Performance_Made_Easy_through_Graphics.pdfcookie1969
 
Demystifying cost based optimization
Demystifying cost based optimizationDemystifying cost based optimization
Demystifying cost based optimizationRiyaj Shamsudeen
 
Riyaj: why optimizer_hates_my_sql_2010
Riyaj: why optimizer_hates_my_sql_2010Riyaj: why optimizer_hates_my_sql_2010
Riyaj: why optimizer_hates_my_sql_2010Riyaj Shamsudeen
 
Histograms in 12c era
Histograms in 12c eraHistograms in 12c era
Histograms in 12c eraMauro Pagano
 
Regression and Classification with R
Regression and Classification with RRegression and Classification with R
Regression and Classification with RYanchang Zhao
 
Latin America Tour 2019 - 10 great sql features
Latin America Tour 2019  - 10 great sql featuresLatin America Tour 2019  - 10 great sql features
Latin America Tour 2019 - 10 great sql featuresConnor McDonald
 
JF608: Quality Control - Unit 3
JF608: Quality Control - Unit 3JF608: Quality Control - Unit 3
JF608: Quality Control - Unit 3Asraf Malik
 
A few things about the Oracle optimizer - 2013
A few things about the Oracle optimizer - 2013A few things about the Oracle optimizer - 2013
A few things about the Oracle optimizer - 2013Connor McDonald
 
R Programming: Transform/Reshape Data In R
R Programming: Transform/Reshape Data In RR Programming: Transform/Reshape Data In R
R Programming: Transform/Reshape Data In RRsquared Academy
 
The ultimate-guide-to-sql
The ultimate-guide-to-sqlThe ultimate-guide-to-sql
The ultimate-guide-to-sqlMcNamaraChiwaye
 
Metadata Matters
Metadata MattersMetadata Matters
Metadata Mattersafa reg
 
Wellington APAC Groundbreakers tour - SQL Pattern Matching
Wellington APAC Groundbreakers tour - SQL Pattern MatchingWellington APAC Groundbreakers tour - SQL Pattern Matching
Wellington APAC Groundbreakers tour - SQL Pattern MatchingConnor McDonald
 
2013.11.14 Big Data Workshop Bruno Voisin
2013.11.14 Big Data Workshop Bruno Voisin 2013.11.14 Big Data Workshop Bruno Voisin
2013.11.14 Big Data Workshop Bruno Voisin NUI Galway
 
Perth APAC Groundbreakers tour - SQL Techniques
Perth APAC Groundbreakers tour - SQL TechniquesPerth APAC Groundbreakers tour - SQL Techniques
Perth APAC Groundbreakers tour - SQL TechniquesConnor McDonald
 
【Maclean liu技术分享】拨开oracle cbo优化器迷雾,探究histogram直方图之秘 0321
【Maclean liu技术分享】拨开oracle cbo优化器迷雾,探究histogram直方图之秘 0321【Maclean liu技术分享】拨开oracle cbo优化器迷雾,探究histogram直方图之秘 0321
【Maclean liu技术分享】拨开oracle cbo优化器迷雾,探究histogram直方图之秘 0321maclean liu
 

Similar a Cluto presentation (20)

Hailey_Database_Performance_Made_Easy_through_Graphics.pdf
Hailey_Database_Performance_Made_Easy_through_Graphics.pdfHailey_Database_Performance_Made_Easy_through_Graphics.pdf
Hailey_Database_Performance_Made_Easy_through_Graphics.pdf
 
Demystifying cost based optimization
Demystifying cost based optimizationDemystifying cost based optimization
Demystifying cost based optimization
 
Riyaj: why optimizer_hates_my_sql_2010
Riyaj: why optimizer_hates_my_sql_2010Riyaj: why optimizer_hates_my_sql_2010
Riyaj: why optimizer_hates_my_sql_2010
 
Histograms in 12c era
Histograms in 12c eraHistograms in 12c era
Histograms in 12c era
 
Regression and Classification with R
Regression and Classification with RRegression and Classification with R
Regression and Classification with R
 
Latin America Tour 2019 - 10 great sql features
Latin America Tour 2019  - 10 great sql featuresLatin America Tour 2019  - 10 great sql features
Latin America Tour 2019 - 10 great sql features
 
X Bar R Charts
X Bar R ChartsX Bar R Charts
X Bar R Charts
 
X Bar R Charts
X Bar R ChartsX Bar R Charts
X Bar R Charts
 
X Bar R Charts
X Bar R ChartsX Bar R Charts
X Bar R Charts
 
JF608: Quality Control - Unit 3
JF608: Quality Control - Unit 3JF608: Quality Control - Unit 3
JF608: Quality Control - Unit 3
 
A few things about the Oracle optimizer - 2013
A few things about the Oracle optimizer - 2013A few things about the Oracle optimizer - 2013
A few things about the Oracle optimizer - 2013
 
R Programming: Transform/Reshape Data In R
R Programming: Transform/Reshape Data In RR Programming: Transform/Reshape Data In R
R Programming: Transform/Reshape Data In R
 
The ultimate-guide-to-sql
The ultimate-guide-to-sqlThe ultimate-guide-to-sql
The ultimate-guide-to-sql
 
Metadata Matters
Metadata MattersMetadata Matters
Metadata Matters
 
Wellington APAC Groundbreakers tour - SQL Pattern Matching
Wellington APAC Groundbreakers tour - SQL Pattern MatchingWellington APAC Groundbreakers tour - SQL Pattern Matching
Wellington APAC Groundbreakers tour - SQL Pattern Matching
 
2013.11.14 Big Data Workshop Bruno Voisin
2013.11.14 Big Data Workshop Bruno Voisin 2013.11.14 Big Data Workshop Bruno Voisin
2013.11.14 Big Data Workshop Bruno Voisin
 
Oracle 12c SPM
Oracle 12c SPMOracle 12c SPM
Oracle 12c SPM
 
Perth APAC Groundbreakers tour - SQL Techniques
Perth APAC Groundbreakers tour - SQL TechniquesPerth APAC Groundbreakers tour - SQL Techniques
Perth APAC Groundbreakers tour - SQL Techniques
 
【Maclean liu技术分享】拨开oracle cbo优化器迷雾,探究histogram直方图之秘 0321
【Maclean liu技术分享】拨开oracle cbo优化器迷雾,探究histogram直方图之秘 0321【Maclean liu技术分享】拨开oracle cbo优化器迷雾,探究histogram直方图之秘 0321
【Maclean liu技术分享】拨开oracle cbo优化器迷雾,探究histogram直方图之秘 0321
 
SQLQueries
SQLQueriesSQLQueries
SQLQueries
 

Cluto presentation

  • 1. CLUTO A Clustering Toolkit By Roseline Antai
  • 2. • CLUTO is a software package which is used for clustering high dimensional datasets and for analyzing the characteristics of the various clusters.
  • 3. Algorithms of CLUTO • Vcluster • Scluster Major difference: Input Vcluster: actual mutidimensional representation of the objects to be clustered. Scluster: The similarity matrix (or graph) between these objects.
  • 4. Calling Sequence vcluster [optional parameters] MatrixFile Nclusters scluster [optional parameters] MatrixFile NClusters
  • 5. Optional Parameters • Standard specification -paramname or –paramname = value • Three categories: – Clustering algorithm parameters – Reporting and Analysis parameters – Cluster Visualization parameters
  • 6. Clustering algorithm parameters • Control how CLUTO computes the clustering solution. • Examples 1. -clmethod=string ( rb, agglo,direct,graph, etc) 2. -sim = string (cos,corr,dist,jacc) 3. -crfun = string (i1,i2 etc) 4. -fulltree
  • 7. Reporting and Analysis Parameters • Control the amount of information that vcluster and scluster report about the clusters as well as the analysis performed on discovered clusters. • Examples 1. -clustfile = string. ( Default is MatrixFile.clustering.Nclusters( or GraphFile)) 2. -clabelfile = string (name of the file that’s stores the labels of the columns. Used when –showfeatues, - showsummaries or –labeltree are used)
  • 8. 3. -rlabelfile=string 4. -rclassfile=string (Stores the labels of the rows – objects to be clustered). 5. -showtree 6. -showfeatures (descriptive and discriminating)
  • 9. Cluster Visualization Parameters • Simple plots of the original input matrix which show how the different objects (rows) and features (columns) are clustered together. • Examples 1. -plottree = string; gives graphic representation of the entire hierarchical tree 2. -plotmatrix = string; shows how the rows of the original matrix are clustered together.
  • 10. A practical example – ../cluto/Linux/vcluster -clmethod=rb -sim=cos - fulltree -rlabelfile=Final_Results/rlabelfile - rclassfile=Final_Results/classfile -showtree - plotformat=gif -plottree=Final_Results/Images/PT- Final10d -plotmatrix=Final_Results/Images/PM- Final10d -plotclusters=Final_Results/Images/PC- Final10d -showfeatures Final_Results/FinalOutput10d-Vt.mat 4
  • 11. roselineantai@ubuntu:~/JLSI/jlsi$ ./clusterscript2.sh ******************************************************************************** vcluster (CLUTO 2.1.1) Copyright 2001-03, Regents of the University of Minnesota Matrix Information ----------------------------------------------------------- Name: Final_Results2/FinalOutput50dFinal.mat, #Rows: 59, #Columns: 59, #NonZeros: 3481 Options ---------------------------------------------------------------------- CLMethod=RB, CRfun=I2, SimFun=Cosine, #Clusters: 4 RowModel=None, ColModel=None, GrModel=SY-DIR, NNbrs=40 Colprune=1.00, EdgePrune=-1.00, VtxPrune=-1.00, MinComponent=5 CSType=Best, AggloFrom=0, AggloCRFun=I2, NTrials=10, NIter=10 Solution --------------------------------------------------------------------- ------------------------------------------------------------------------ 4-way clustering: [I2=5.65e+01] [59 of 59], Entropy: 0.438, Purity: 0.729 ------------------------------------------------------------------------ cid Size ISim ISdev ESim ESdev Entpy Purty | Sem Imp Deo Evo ------------------------------------------------------------------------ 0 2 +0.779 +0.000 +0.638 +0.222 0.000 1.000 | 0 2 0 0 1 15 +0.918 +0.024 +0.853 +0.021 0.453 0.800 | 1 0 2 12 2 22 +0.923 +0.015 +0.866 +0.019 0.416 0.818 | 1 18 3 0 3 20 +0.923 +0.015 +0.869 +0.034 0.496 0.550 | 9 0 11 0 ------------------------------------------------------------------------ -------------------------------------------------------------------------------- 4-way clustering solution - Descriptive & Discriminating Features... -------------------------------------------------------------------------------- Cluster 0, Size: 2, ISim: 0.779, ESim: 0.638 Descriptive: col00010 32.4%, col00039 22.7%, col00013 5.6%, col00019 3.9%, col00047 2.2% Discriminating: col00010 53.5%, col00039 22.3%, col00013 2.5%, col00020 2.1%, col00012 1.4% Cluster 1, Size: 15, ISim: 0.918, ESim: 0.853 Descriptive: col00013 11.3%, col00006 8.2%, col00019 7.9%, col00058 4.4%, col00047 4.2% Discriminating: col00006 14.8%, col00007 7.4%, col00031 7.1%, col00029 6.6%, col00053 6.2% Cluster 2, Size: 22, ISim: 0.923, ESim: 0.866 Descriptive: col00013 14.3%, col00020 7.1%, col00014 6.4%, col00019 4.4%, col00047 4.4% Discriminating: col00020 7.0%, col00014 5.9%, col00013 5.5%, col00006 5.1%, col00010 4.3% Cluster 3, Size: 20, ISim: 0.923, ESim: 0.869 Descriptive: col00013 8.9%, col00047 6.6%, col00019 5.2%, col00020 4.4%, col00014 3.8% Discriminating: col00015 8.0%, col00013 7.4%, col00046 5.2%, col00006 5.0%, col00042 4.3% -------------------------------------------------------------------------------- ------------------------------------------------------------------------------ Hierarchical Tree that optimizes the I2 criterion function... ------------------------------------------------------------------------------ Sem Imp Deo Evo ------------------------------------ 6 |---4 | |---2 1 18 3 0 | |---3 9 0 11 0 |-5 |-----1 1 0 2 12 |-----0 0 2 0 0 ------------------------------------ ------------------------------------------------------------------------------ Timing Information ----------------------------------------------------------- I/O: 0.004 sec Clustering: 0.008 sec Reporting: 0.316 sec ********************************************************************************
  • 12. Classfile and rlabelfile 0 Evo 1 Sem Imp 2 Imp 3 Deo Deo 4 Imp 5 Imp Deo 6 Deo 7 Imp Deo 8 Deo 9 Imp Sem 10 Deo 11 Sem Imp 12 Imp 13 Evo 14 15
  • 13.
  • 14.
  • 15. The plot uses red to denote positive values and green to denote negative values. Bright red/green indicate large positive/negative values, whereas colors close to white indicate values close to zero.
  • 16. Sem 0 Imp 1 Deo 2 Deo 3 Sem 4 Evo 5 Evo 6 Sem 7 Imp 8 Imp 9 Deo 10 Deo 11 Imp 12 Imp 13 Deo 14 Deo 15 Imp 16 Deo 17 Deo 18 Imp 19 Sem 20