Created by: barrierye
- update test script to use the compiled binary
- add GPU compile dockerfile and CI test
- fix the bug that "even if HTTP status code is not 200, it can pass the CI"
- add description of SERVING_BIN to COMPILE.md
- fix import error and fix multi-card parse in gpu web part