clusterID of the original samples #137

isaac-you · 2018-10-17T00:49:13Z

SOM clustering is a good customer segment method, and your somoclu make the method strong enough to deal with big data. Thank you so much.
But when I have done the train process, I can only find clusterID for nodes or neurons, but there is no clusterID for the original samples. Besides your default cluster number is 8 for kmeans, so how can I set another cluster number? Thank you so much for your help.

isaac-you · 2018-10-17T01:51:25Z

best matching units array do not tell me the ClusterID directly. when I do the experiment from https://somoclu.readthedocs.io/en/stable/example.html for the 150 random samples, the best matching units array just give me the a matrix of shape (150,2) , but no ClusterID, it is more like a coordinate for 150 samples in 2-D space.
So how can I find the ClusterID for original 150 samples, thank you.

deepwindlee · 2019-07-04T12:47:46Z

请问我要怎么知道样本聚类后所属的具体种类呢

Sitin · 2021-01-17T19:04:59Z

Hi, @isaac-you, you can use best matching units as suggested in documentation.

bmus = som.get_bmus(som.get_surface_state(X))
cluster_labels = [som.clusters[bmu[0]][bmu[1]] for bmu in bmus]

However, I am still wondering why there is no such method in the library itself given that it already have clustering support.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

clusterID of the original samples #137

clusterID of the original samples #137

isaac-you commented Oct 17, 2018

isaac-you commented Oct 17, 2018

deepwindlee commented Jul 4, 2019

Sitin commented Jan 17, 2021 •

edited

Loading

clusterID of the original samples #137

clusterID of the original samples #137

Comments

isaac-you commented Oct 17, 2018

isaac-you commented Oct 17, 2018

deepwindlee commented Jul 4, 2019

Sitin commented Jan 17, 2021 • edited Loading

Sitin commented Jan 17, 2021 •

edited

Loading