MLlib(机器学习)
机器学习库(MLlib)指南
ML Pipelines(ML管道)
Extracting, transforming and selecting features(特征的提取,转换和选择)
Classification and regression(分类和回归)
Clustering(聚类)
Collaborative Filtering(协同过滤)
ML Tuning: model selection and hyperparameter tuning(ML调优:模型选择和超参数调整)
Advanced topics(高级主题)
MLlib:基于RDD的API
Data Types - RDD-based API(数据类型)
Basic Statistics - RDD-based API(基本统计)
Classification and Regression - RDD-based API(分类和回归)
Linear Methods - RDD-based API(线性方法)
Naive Bayes - RDD-based API(朴素贝叶斯)
Decision Trees - RDD-based API(决策树)
Ensembles - RDD-based API(集成方法)
Regression - RDD-based API(回归)
Collaborative Filtering - RDD-based API(协同过滤)
Clustering - RDD-based API(聚类 - 基于RDD的API)
Dimensionality Reduction - RDD-based API(降维)
Feature Extraction and Transformation - RDD-based API(特征的提取和转换)
Frequent Pattern Mining - RDD-based API(频繁模式挖掘)
Evaluation metrics - RDD-based API(评估指标)
PMML model export - RDD-based API(PMML模型导出)
Optimization - RDD-based API(最优化)