Details
-
Umbrella
-
Status: Resolved
-
Major
-
Resolution: Incomplete
-
None
-
None
Description
This is an umbrella for discussing and creating ML abstractions. This was originally handled under SPARK-1856 and SPARK-3702, under which we created the Pipelines API and some Developer APIs for classification and regression.
This umbrella is for future work, including:
- Stabilizing the classification and regression APIs
- Discussing traits vs. abstract classes for abstraction APIs
- Creating other abstractions not yet covered (clustering, multilabel prediction, etc.)
Note that SPARK-3702 still has useful discussion and design docs.
Attachments
Issue Links
- contains
-
SPARK-7409 Designing multilabel abstractions for spark.ml
- Resolved
-
SPARK-7412 Designing distributed prediction model abstractions for spark.ml
- Resolved
-
SPARK-7610 Design clustering abstractions for Pipelines API
- Resolved
- is related to
-
SPARK-3702 Standardize MLlib classes for learners, models
- Closed
-
SPARK-3251 Clarify learning interfaces
- Resolved
- relates to
-
SPARK-19498 Discussion: Making MLlib APIs extensible for 3rd party libraries
- Resolved
- supercedes
-
SPARK-1856 Standardize MLlib interfaces
- Resolved