Use different clusters
======================

Cluster environments differ from user to user. To make deployment easier, we provide several cluster deployment methods for submitting cluster training tasks, introduced below:

`Kubernetes <http://kubernetes.io>`_ is a container cluster scheduling framework open-sourced by Google that provides a complete solution for large-scale production clusters. The following guides show how PaddlePaddle supports Kubernetes:

..  toctree::
  :maxdepth: 1

  k8s_cn.md
  k8s_distributed_cn.md

`OpenMPI <https://www.open-mpi.org>`_ is a mature high-performance parallel computing framework that is widely used in HPC. The following guide describes how to use OpenMPI to set up PaddlePaddle cluster training tasks:

..  toctree::
  :maxdepth: 1

  openmpi_cn.md

`Fabric <http://www.fabfile.org>`_ is a convenient tool for program deployment and management. We provide a way to deploy and manage cluster training with Fabric. For details, see the following guide:

..  toctree::
  :maxdepth: 1

  fabric_cn.md

We also support deploying PaddlePaddle on AWS. To learn more, see:

..  toctree::
  :maxdepth: 1

  k8s_aws_cn.md

Examples can be found under `cluster_train_v2 <https://github.com/PaddlePaddle/Paddle/tree/develop/paddle/scripts/cluster_train_v2>`_.