Details
-
Improvement
-
Status: Resolved
-
Major
-
Resolution: Incomplete
-
1.2.0
-
None
Description
Currently, Gradient and Optimizer interfaces support data in form of RDD[Double, Vector] which refers to label and features. This limits its application to classification problems. For example, artificial neural network demands Vector as output (instead of label: Double). Moreover, current interface does not support data batches. I propose to replace label: Double with output: Vector. It enables passing generic output instead of label and also passing data and output batches stored in corresponding vectors.
Attachments
Issue Links
- relates to
-
SPARK-9120 Add multivariate regression (or prediction) interface
- Resolved
-
SPARK-5256 Improving MLlib optimization APIs
- Resolved
-
SPARK-18303 CLONE - Improving MLlib optimization APIs
- Resolved
- links to