Details
-
Umbrella
-
Status: Closed
-
Major
-
Resolution: Done
-
None
-
None
-
None
Description
Here is the todo list of SPARK-11604 found issues:
Note: I did not list the SparkR related features (such as ml.feature.Interaction). We have supported RFormula as a wrapper at Python side, I think we should discuss the necessary to support other R related features at Python side.
- Missing classes
- ml.attribute
SPARK-8516 - ml.feature
- QuantileDiscretizer
SPARK-11922 - ChiSqSelector
SPARK-11923
- QuantileDiscretizer
- ml.classification
- OneVsRest
SPARK-7861
- OneVsRest
- ml.clustering
- LDA
SPARK-11940
- LDA
- mllib.clustering
- BisectingKMeans
SPARK-11944
- BisectingKMeans
- ml.attribute
- Missing methods/parameters
SPARK-11938- ml.classification
SPARK-11815SPARK-11820 - ml.feature
SPARK-11925 - ml.clustering
SPARK-11945 - mllib.linalg
SPARK-12040SPARK-12041 - mllib.stat.test.StreamingTest
SPARK-12042
- ml.classification
- Docs:
- ml.classification
SPARK-11875
- ml.classification
Attachments
Issue Links
- is related to
-
SPARK-11604 ML 1.6 QA: API: Python API coverage
- Resolved
- relates to
-
SPARK-12810 PySpark CrossValidatorModel should support avgMetrics
- Resolved
1.
|
Python API for ml.feature.QuantileDiscretizer | Resolved | Holden Karau | |
2.
|
Update doc for PySpark HasCheckpointInterval | Resolved | Yanbo Liang | |
3.
|
PySpark LiR & LoR should support weightCol | Resolved | Yanbo Liang | |
4.
|
Add PySpark missing methods for ml.feature during Spark 1.6 QA | Resolved | Yanbo Liang | |
5.
|
Python API for ml.feature.ChiSqSelector | Resolved | Xusen Yin | |
6.
|
PySpark DecisionTreeClassifier & DecisionTreeRegressor should support setSeed | Resolved | Yanbo Liang | |
7.
|
Expose numFeatures in all ML PredictionModel for PySpark | Resolved | Kai | |
8.
|
Python API for ml.clustering.LDA | Resolved | Jeff Zhang | |
9.
|
Python API for mllib.clustering.BisectingKMeans | Resolved | Holden Karau | |
10.
|
Add computeCost to KMeansModel for PySpark spark.ml | Resolved | Yanbo Liang | |
11.
|
Add toJson/fromJson to Vector/Vectors for PySpark | Closed | Unassigned | |
12.
|
Add columnSimilarities to IndexedRowMatrix for PySpark | Resolved | Kai Jiang | |
13.
|
Feature parity for pyspark.mllib StandardScalerModel | Resolved | Holden Karau |