input data,k randomly select k points in data as initial center
compute the distance of each point to each center label the point by the nearest center then get the inital cluster compute the gravity center by the current cluster
repeat the process by num_iters
EM-example by coins
1 2 3 4 5 6 7 8 9 10
there are two biased coins, with probability of head A,B.
if you know the hidden_vector, you can initialize the parameter sequence1, 0.9 1 sequence2, 0.4 0 sequence3, 0.8. 1
A = 0.85, B = 0.4
if you don't know, you can guess the parameter, and then