Details
-
Brainstorming
-
Status: Resolved
-
Major
-
Resolution: Incomplete
-
None
-
None
Description
This JIRA is for discussing how to support multi-label prediction in the Pipelines API (spark.ml package). Some issues to figure out are:
- Should there be abstractions?
- How should they relate to the existing single-label abstractions: Predictor, Classifier, Regressor?
- How much code sharing can the abstractions provide?
- How should we support a mix of categorical and real-valued labels?
- How do we support structure among the labels? There could be no known structure, a graphical structure, a chain structure, etc., depending on the application/model.
Attachments
Issue Links
- Is contained by
-
SPARK-10817 ML abstraction umbrella
- Resolved
- is related to
-
SPARK-7129 Add generic boosting algorithm to spark.ml
- Resolved
-
SPARK-7128 Add generic bagging algorithm to spark.ml
- Closed