20. bisecting k-means
1. Pick a cluster to split.
2. Find 2 sub-clusters using the basic K-means
algorithm. (Bisecting step)
3. Repeat step 2, the bisecting step, for ITER
times and take the split that produces the
clustering with the highest overall similarity.
4. Repeat steps 1, 2 and 3 until the desired
number of clusters is reached.
21. bisecting k-means
algorithm:
1. 把所有數據作為⼀一個cluster加⼊入cluster list
2. Repeat
3. 从cluster list中挑選⼀一個較⼤大cost function(J)的cluster出来
4. for i=1 to 預設的疊代次数
5. ⽤用k-means算法把挑出来的cluster分成兩個⼦子cluster
6. 计算兩個⼦子cluster的J
7. end for
8. 把for循環中最⼩小J的那兩個⼦子cluster加⼊入cluster list
9. until cluster list 擁有k 個cluster