index_en.rst 1.4 KB
Newer Older
1 2 3
Use different clusters
======================

4
The user's cluster environment is not the same. To facilitate everyone's deployment, we provide a variety of cluster deployment methods to facilitate the submission of cluster training tasks, which will be introduced as follows:
5

6
`Kubernetes <http://kubernetes.io>`_ is a scheduling framework of Google open source container cluster, supporting a complete cluster solution for large-scale cluster production environment. The following guidelines show PaddlePaddle's support for Kubernetes:
7

8 9 10
..  toctree::
  :maxdepth: 1

11 12
  k8s_en.md
  k8s_distributed_en.md
13 14

`OpenMPI <https://www.open-mpi.org>`_ is a mature high-performance parallel computing framework, which is widely used in the field of HPC. The following guide describes how to use OpenMPI to build PaddlePaddle's cluster training task:
15 16 17 18

..  toctree::
  :maxdepth: 1

19
  openmpi_en.md
20 21 22 23 24 25

`Fabric <http://www.fabfile.org>`_ is a convenient tool for program deployment and management. We provide a way to deploy and manage with Fabric. If you want to know more about it, please read the following guidelines:

..  toctree::
  :maxdepth: 1

26
  fabric_en.md
27 28 29 30 31 32

We also support the deployment of PaddlePaddle on AWS. Learn more about:

..  toctree::
  :maxdepth: 1

33
  k8s_aws_en.md
34

35
The examples can be found under `cluster_train_v2 <https://github.com/PaddlePaddle/Paddle/tree/develop/paddle/scripts/cluster_train_v2>`_ .