# How to run PaddleServing in Docker

## Requirements

Docker must be installed. The GPU version also requires nvidia-docker on the GPU machine.

## CPU

### Get docker image

You can get the image in two ways:

1. Pull the image directly

   docker pull

2. Build the image from the Dockerfile

   Create a new folder, copy [Dockerfile](../Dockerfile) into it, and run the following command in that folder:

   docker build -t .

### Create container

docker run -p 9292:9292 --name test -dit
docker exec -it test bash

The `-p` option publishes port `9292` of the container on port `9292` of the host, so the service is reachable from outside the container.
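
Once a service is running inside the container, a quick TCP probe from the host is one way to check the mapping. The sketch below uses only the Python standard library; the helper name `port_is_open` and the `127.0.0.1` address are illustrative assumptions, not part of PaddleServing.

```python
import socket


def port_is_open(host: str, port: int, timeout: float = 1.0) -> bool:
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and unreachable hosts.
        return False


if __name__ == "__main__":
    # 9292 is the host port published by `-p 9292:9292`.
    print(port_is_open("127.0.0.1", 9292))
```

A successful probe only shows that something accepts connections on the host port. Depending on the Docker configuration, Docker itself may accept the connection before the server inside the container is ready, so treat the result as a hint rather than a health check.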

### Install PaddleServing

To keep the image small, the PaddleServing package is not preinstalled. Run the following command inside the container to install it:

pip install paddle-serving-server
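
To confirm that the package is importable after installation, a check along the following lines may help. It is a sketch using only the standard library; `is_installed` is a hypothetical helper, and the module name `paddle_serving_server` is taken from the server commands used later on this page.

```python
import importlib.util


def is_installed(module_name: str) -> bool:
    """Return True if a top-level module can be located without importing it."""
    return importlib.util.find_spec(module_name) is not None


# Prints True only if the server package is visible to this interpreter.
print(is_installed("paddle_serving_server"))
```

Run it with the same Python interpreter that `pip` installed into; otherwise the result can be misleading.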

### Test example

Download and unpack the trained Boston house price prediction model with the following commands:

wget --no-check-certificate
tar -xzf uci_housing.tar.gz

- Test HTTP service

  Run on the server side (inside the container):

  python -m paddle_serving_server.web_serve --model uci_housing_model --thread 10 --port 9292 --name uci >std.log 2>err.log &

  Run on the client side (inside or outside the container):

  curl -H "Content-Type:application/json" -X POST -d '{"x": [0.0137, -0.1136, 0.2553, -0.0692, 0.0582, -0.0727, -0.1583, -0.0584, 0.6283, 0.4919, 0.1856, 0.0795, -0.0332], "fetch":["price"]}'

- Test RPC service

  Run on the server side (inside the container):

  python -m paddle_serving_server.serve --model uci_housing_model --thread 10 --port 9292 >std.log 2>err.log &

  Run the following Python code on the client side (inside or outside the container; the `paddle-serving-client` package must be installed):

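  from paddle_serving_client import Client
  client = Client()
  # Assumed config path and endpoint; adjust to your extracted files and published port.
  client.load_client_config("uci_housing_client/serving_client_conf.prototxt")
  client.connect(["127.0.0.1:9292"])
  data = [0.0137, -0.1136, 0.2553, -0.0692, 0.0582, -0.0727,
          -0.1583, -0.0584, 0.6283, 0.4919, 0.1856, 0.0795, -0.0332]
  fetch_map = client.predict(feed={"x": data}, fetch=["price"])
  print(fetch_map)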
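
The curl request from the HTTP test can also be issued from Python. The following is a minimal sketch using only the standard library. The URL is a placeholder assumption, since the endpoint address is not given above, and `build_request` and `post_prediction` are hypothetical helper names.

```python
import json
import urllib.request

# Placeholder: substitute the address of your running HTTP service.
SERVICE_URL = "http://127.0.0.1:9292/<service-path>"

# The 13 input features used in the curl example.
FEATURES = [0.0137, -0.1136, 0.2553, -0.0692, 0.0582, -0.0727,
            -0.1583, -0.0584, 0.6283, 0.4919, 0.1856, 0.0795, -0.0332]


def build_request(url, features, fetch=("price",)):
    """Build the same JSON POST request that the curl example sends."""
    body = json.dumps({"x": list(features), "fetch": list(fetch)}).encode("utf-8")
    return urllib.request.Request(
        url,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def post_prediction(url, features, timeout=5.0):
    """Send the request and return the raw response body as text."""
    request = build_request(url, features)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return response.read().decode("utf-8")
```

Calling `post_prediction(SERVICE_URL, FEATURES)` with the real service address returns the response body as text. The response is left undecoded here because its exact layout is not described on this page.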


## GPU

The GPU workflow mirrors the CPU workflow; only some package, module, and command names differ. The GPU version requires nvidia-docker to be installed on the GPU machine.
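
Judging from the commands on this page, the naming difference amounts to a `-gpu` suffix on the pip package and a `_gpu` suffix on the Python module. The sketch below captures that pattern in a small helper that assembles the server command. `serve_command` is a hypothetical name used only for illustration, and the flags are limited to those shown in the examples here.

```python
def serve_command(model_dir, port=9292, threads=10, gpu=False, http_name=None):
    """Assemble the server launch command used in the examples on this page."""
    module = "paddle_serving_server_gpu" if gpu else "paddle_serving_server"
    # The examples use `web_serve` for the HTTP service and `serve` for RPC.
    entry = "web_serve" if http_name else "serve"
    cmd = ["python", "-m", f"{module}.{entry}",
           "--model", model_dir, "--thread", str(threads), "--port", str(port)]
    if http_name:
        # The HTTP examples pass the service name with `--name`.
        cmd += ["--name", http_name]
    return cmd


print(" ".join(serve_command("uci_housing_model", gpu=True, http_name="uci")))
```

This prints `python -m paddle_serving_server_gpu.web_serve --model uci_housing_model --thread 10 --port 9292 --name uci`, the GPU HTTP command from the test example. Returning a list rather than a string keeps the result suitable for `subprocess.run` without shell quoting.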

### Get docker image

As with the CPU version, you can get the image in two ways:

1. Pull the image directly

   nvidia-docker pull

2. Build the image from the Dockerfile

   Create a new folder, copy [Dockerfile.gpu](../Dockerfile.gpu) into it, and run the following command in that folder:

   nvidia-docker build -t .

### Create container

nvidia-docker run -p 9292:9292 --name test -dit
nvidia-docker exec -it test bash

The `-p` option publishes port `9292` of the container on port `9292` of the host, so the service is reachable from outside the container.

### Install PaddleServing

To keep the image small, the PaddleServing package is not preinstalled. Run the following command inside the container to install it:

pip install paddle-serving-server-gpu

### Test example

Download and unpack the trained Boston house price prediction model with the following commands:

wget --no-check-certificate
tar -xzf uci_housing.tar.gz

- Test HTTP service

  Run on the server side (inside the container):

  python -m paddle_serving_server_gpu.web_serve --model uci_housing_model --thread 10 --port 9292 --name uci

  Run on the client side (inside or outside the container):

  curl -H "Content-Type:application/json" -X POST -d '{"x": [0.0137, -0.1136, 0.2553, -0.0692, 0.0582, -0.0727, -0.1583, -0.0584, 0.6283, 0.4919, 0.1856, 0.0795, -0.0332], "fetch":["price"]}'

- Test RPC service

  Run on the server side (inside the container):

  python -m paddle_serving_server_gpu.serve --model uci_housing_model --thread 10 --port 9292

  Run the following Python code on the client side (inside or outside the container; the `paddle-serving-client` package must be installed):

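  from paddle_serving_client import Client
  client = Client()
  # Assumed config path and endpoint; adjust to your extracted files and published port.
  client.load_client_config("uci_housing_client/serving_client_conf.prototxt")
  client.connect(["127.0.0.1:9292"])
  data = [0.0137, -0.1136, 0.2553, -0.0692, 0.0582, -0.0727,
          -0.1583, -0.0584, 0.6283, 0.4919, 0.1856, 0.0795, -0.0332]
  fetch_map = client.predict(feed={"x": data}, fetch=["price"])
  print(fetch_map)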