Details
-
Improvement
-
Status: Resolved
-
Minor
-
Resolution: Fixed
-
1.0.2
Description
This is a bad design choice. I think that it is preferable to produce no duplicate cluster centers. So instead of forcing the number of clusters to be K, return at most K clusters.
Attachments
Issue Links
- is related to
-
SPARK-17389 KMeans speedup with better choice of k-means|| init steps = 2
- Resolved
-
SPARK-18427 Update docs of mllib.KMeans
- Resolved
- relates to
-
SPARK-19319 SparkR Kmeans summary returns error when the cluster size doesn't equal to k
- Resolved
- links to
(1 links to)