Step 1 of 4: Choose Issues

Cancel

T Patch Info Key Summary Assignee Reporter P Status Resolution Created Updated Due Development
Sub-task SPARK-45397

SPARK-42471 Add vector assembler feature transformer

Weichen Xu Weichen Xu Major Resolved Fixed  
Sub-task SPARK-45396

SPARK-42471 Add doc entry for `pyspark.ml.connect` module

Weichen Xu Weichen Xu Major Resolved Fixed  
Sub-task SPARK-45130

SPARK-42471 Avoid Spark connect ML model to change input pandas dataframe

Weichen Xu Weichen Xu Major Resolved Fixed  
Sub-task SPARK-45129

SPARK-42471 Add pyspark "ml-connect" extras dependencies

Unassigned Weichen Xu Major Open Unresolved  
Sub-task SPARK-44374

SPARK-42471 Add example code

Weichen Xu Weichen Xu Major Resolved Done  
Sub-task SPARK-44250

SPARK-42471 Implement classification evaluator

Weichen Xu Weichen Xu Major Resolved Done  
Sub-task SPARK-44100

SPARK-42471 Move namespace from `pyspark.mlv2` to `pyspark.ml.connect`

Weichen Xu Weichen Xu Major Resolved Fixed  
Sub-task SPARK-43983

SPARK-42471 Implement cross validator estimator

Weichen Xu Weichen Xu Major Resolved Fixed  
Sub-task SPARK-43982

SPARK-42471 Implement pipeline estimator

Weichen Xu Weichen Xu Major Resolved Fixed  
Sub-task SPARK-43981

SPARK-42471 Basic saving / loading implementation

Weichen Xu Weichen Xu Major Resolved Fixed  
Sub-task SPARK-43790

SPARK-42471 Add API `copyLocalFileToHadoopFS`

Weichen Xu Weichen Xu Major Resolved Fixed  
Sub-task SPARK-43715

SPARK-42471 Add spark DataFrame binary file format writer

Weichen Xu Weichen Xu Major Resolved Won't Do  
Sub-task SPARK-43516

SPARK-42471 Basic estimator / transformer / model / evaluator interfaces and basic transformer / evaluator implementation

Weichen Xu Weichen Xu Major Resolved Fixed  
Sub-task SPARK-43097

SPARK-42471 Implement pyspark ML logistic regression estimator on top of torch distributor

Weichen Xu Weichen Xu Major Resolved Fixed  
Sub-task SPARK-43081

SPARK-42471 Add torch distributor data loader that loads data from spark partition data

Weichen Xu Weichen Xu Major Resolved Done  
Sub-task SPARK-42994

SPARK-42471 Torch Distributor support Local Mode

Ruifeng Zheng Ruifeng Zheng Major Resolved Fixed  
Sub-task SPARK-42993

SPARK-42471 Make Torch Distributor compatible with Spark Connect

Ruifeng Zheng Ruifeng Zheng Major Resolved Fixed  
Sub-task SPARK-42870

SPARK-42471 Move `toCatalystValue` to connect-common

Ruifeng Zheng Ruifeng Zheng Major Resolved Fixed  
Sub-task SPARK-42800

SPARK-42471 Implement ml function {array_to_vector, vector_to_array}

Ruifeng Zheng Ruifeng Zheng Major Resolved Fixed  
Sub-task SPARK-42756

SPARK-42471 Helper function to convert proto literal to value in Python Client

Ruifeng Zheng Ruifeng Zheng Major Resolved Fixed  
Sub-task SPARK-42755

SPARK-42471 Factor literal value conversion out to connect-common

Ruifeng Zheng Ruifeng Zheng Major Resolved Fixed  
Sub-task SPARK-42725

SPARK-42471 Make LiteralExpression support array

Ruifeng Zheng Ruifeng Zheng Major Resolved Fixed  
Sub-task SPARK-42508

SPARK-42471 Extract the common .ml classes to `mllib-common`

Ruifeng Zheng Ruifeng Zheng Major Resolved Fixed  
Sub-task SPARK-42501

SPARK-42471 High level design doc for Distributed ML <> spark connect

Weichen Xu Weichen Xu Major Resolved Done  
Sub-task SPARK-42472

SPARK-42471 Make spark connect supporting canceling job group

Unassigned Ruifeng Zheng Major Open Unresolved  
Sub-task SPARK-42412

SPARK-42471 Initial prototype implementation for PySparkML

Weichen Xu Weichen Xu Major Resolved Done  

Cancel