Classification and Regression - RDD-based API(分类和回归)

spark.mllib包支持binary classification(二分类)multiclass classification(多分类)regression analysis(回归分析)的各种方法。

下表列出了每种类型问题支持的算法。

问题类型 支持方法
Binary Classification(二分类) linear SVMs, logistic regression, decision trees, random forests, gradient-boosted trees, naive Bayes
Multiclass Classification(多分类) logistic regression, decision trees, random forests, naive Bayes
Regression(回归) linear least squares, Lasso, ridge regression, decision trees, random forests, gradient-boosted trees, isotonic regression