build_en.md 5.4 KB
Newer Older
1
# Build using Docker
2 3 4 5 6 7 8 9

## What Developers Need

To contribute to PaddlePaddle, you need

1. A computer -- Linux, BSD, Windows, MacOS, and
1. Docker.

Y
Yi Wang 已提交
10
Nothing else.  Not even Python and GCC, because you can install all build tools into a Docker image.  We run all the tools by running this image.
11 12 13 14 15 16 17 18 19

## General Process

1. Retrieve source code.

   ```bash
   git clone https://github.com/paddlepaddle/paddle
   ```

Y
Yi Wang 已提交
20
2. Install build tools into a Docker image.
21 22 23 24 25

   ```bash
   cd paddle; docker build -t paddle:dev .
   ```

Y
Yi Wang 已提交
26 27
   Please be aware of the `.` at the end of the command, which refers to the [`./Dockerfile` file](https://github.com/PaddlePaddle/Paddle/blob/develop/Dockerfile).  `docker build` follows instructions in this file to create a Docker image named `paddle:dev`, and installs building tools into it.

28 29
3. Build from source.

Y
Yi Wang 已提交
30 31
   This following command starts a Docker container that executes the Docker image `paddle:dev`, mapping the current directory to `/paddle/` in the container, and runs the default entry-point [`build.sh`](https://github.com/PaddlePaddle/Paddle/blob/develop/paddle/scripts/docker/build.sh) as specified in the Dockefile.  `build.sh` invokes `cmake` and `make` to build PaddlePaddle source code, which had been mapped to `/paddle`, and writes outputs to `/paddle/build`, which maps to `build` in the current source directory on the computer.

32 33 34 35
   ```bash
   docker run -v $PWD:/paddle paddle:dev
   ```

Y
Yi Wang 已提交
36
   Above command builds a CUDA-enabled version.  If we want to build a CPU-only version, we can type
Y
Yi Wang 已提交
37 38 39 40 41

   ```bash
   docker run -e WITH_GPU=OFF -v $PWD:/paddle paddle:dev
   ```

42 43
4. Run unit tests.

Y
Yi Wang 已提交
44 45
   To run all unit tests using the first GPU of a node:

46
   ```bash
Y
Yi Wang 已提交
47
   NV_GPU=0 nvidia-docker run -v $PWD:/paddle paddle:dev bash -c "cd /paddle/build; ctest"
48 49
   ```

Y
Yi Wang 已提交
50 51 52 53 54
   If we used `WITH_GPU=OFF` at build time, it generates only CPU-based unit tests, and we don't need nvidia-docker to run them.  We can just run

   ```bash
   docker run -v $PWD:/paddle paddle:dev bash -c "cd /paddle/build; ctest"
   ```
55

Y
Yi Wang 已提交
56 57 58
   Sometimes we want to run a specific unit test, say `memory_test`, we can run

   ```bash
Y
Yi Wang 已提交
59 60 61 62 63 64 65 66 67
   nvidia-docker run -v $PWD:/paddle paddle:dev bash -c "cd /paddle/build; ctest -V -R memory_test"
   ```

5. Clean Build.

   Sometimes, we might want to clean all thirt-party dependents and built binaries.  To do so, just

   ```bash
   rm -rf build
Y
Yi Wang 已提交
68 69
   ```

70 71 72 73 74 75 76 77
## Docker, Or Not?

- What is Docker?

  If you haven't heard of it, consider it something like Python's virtualenv.

- Docker or virtual machine?

Y
Yi Wang 已提交
78
  Some people compare Docker with VMs, but Docker doesn't virtualize any hardware nor running a guest OS, which means there is no compromise on the performance.
79 80 81

- Why Docker?

Y
Yi Wang 已提交
82
  Using a Docker image of build tools standardizes the building environment, which makes it easier for others to reproduce your problems and to help.
83 84 85

  Also, some build tools don't run on Windows or Mac or BSD, but Docker runs almost everywhere, so developers can use whatever computer they want.

Y
Yi Wang 已提交
86
- Can I choose not to use Docker?
87

Y
Yi Wang 已提交
88
  Sure, you don't have to install build tools into a Docker image; instead, you can install them in your local computer.  This document exists because Docker would make the development way easier.
89 90 91

- How difficult is it to learn Docker?

Y
Yi Wang 已提交
92
    It takes you ten minutes to read [an introductory article](https://docs.docker.com/get-started) and saves you more than one hour to install all required build tools, configure them, especially when new versions of PaddlePaddle require some new tools.  Not even to mention the time saved when other people trying to reproduce the issue you have.
93 94 95 96 97 98 99 100 101 102 103 104 105 106 107

- Can I use my favorite IDE?

  Yes, of course.  The source code resides on your local computer, and you can edit it using whatever editor you like.

  Many PaddlePaddle developers are using Emacs.  They add the following few lines into their `~/.emacs` configure file:

  ```emacs
  (global-set-key "\C-cc" 'compile)
  (setq compile-command
   "docker run --rm -it -v $(git rev-parse --show-toplevel):/paddle paddle:dev")
  ```

  so they could type `Ctrl-C` and `c` to build PaddlePaddle from source.

Y
Yi Wang 已提交
108
- Does Docker do parallel building?
109

Y
Yi Wang 已提交
110 111 112 113 114 115 116
  Our building Docker image runs a [Bash script](https://github.com/PaddlePaddle/Paddle/blob/develop/paddle/scripts/docker/build.sh), which calls `make -j$(nproc)` to starts as many processes as the number of your CPU cores.

## Some Gotchas

- Docker requires sudo

  An owner of a computer has the administrative privilege, a.k.a., sudo, and Docker requires this privilege to work properly.  If you use a shared computer for development, please ask the administrator to install and configure Docker.  We will do our best to support rkt, another container technology that doesn't require sudo.
Y
Yi Wang 已提交
117

Y
Yi Wang 已提交
118
- Docker on Windows/MacOS builds slowly
Y
Yi Wang 已提交
119

Y
Yi Wang 已提交
120
  On Windows and MacOS, Docker containers run in a Linux VM.  You might want to give this VM some more memory and CPUs so to make the building efficient.  Please refer to [this issue](https://github.com/PaddlePaddle/Paddle/issues/627) for details.
121 122 123 124

- Not enough disk space

  Examples in this article uses option `--rm` with the `docker run` command.  This option ensures that stopped containers do not exist on hard disks.  We can use `docker ps -a` to list all containers, including stopped.  Sometimes `docker build` generates some intermediate dangling images, which also take disk space.  To clean them, please refer to [this article](https://zaiste.net/posts/removing_docker_containers/).