1. 03 1月, 2020 1 次提交
    • Y
      Add the first implememtation of fusion_group op (#19621) · d4832077
      Yiqun Liu 提交于
      * Add the dynamic load of nvrtc, and support runtime compiling of CUDA kernel using nvrtc.
      test=develop
      
      * Call CUDA driver api to launch the kernel compiled by nvrtc.
      test=develop
      
      * Disable for mac and windows.
      test=develop
      
      * Refine the codes to support manually specified num_threads and workload_per_thread.
      test=develop
      
      * Refine the CUDA kernel to support large dims.
      test=develop
      
      * Add DeviceCodePool to manage all device codes.
      
      * Add the first implementation fusion_group op.
      
      * Add unit-test for fusion_group op.
      
      * Add the check of result.
      
      * Add the check of nvrtc in unit-test.
      test=develop
      
      * Add comment to explain the inputs, outputs and features of fusion_group op.
      test=develop
      
      * Disable fusion_group op for mac and windows.
      test=develop
      
      * Make the compiling of device code return status instead of hanging up.
      test=develop
      
      * Add the check of whether there is CUDA driver library, and do not core dump when failing to call the CUDA driver API.
      
      * Unify fusion_group_op's input and output names.
      test=develop
      
      * Add the check of CUDA driver library in unittest.
      test=develop
      
      * Refine the calling of PADDLE_ENFORCE.
      test=develop
      d4832077
  2. 01 1月, 2020 1 次提交
  3. 30 12月, 2019 2 次提交
  4. 15 12月, 2019 1 次提交
  5. 10 12月, 2019 1 次提交
    • A
      MKL-DNN 1.0 Update (#20162) · e81f0228
      Adam 提交于
      * MKLDNN v1.0 rebase to Paddle 1.6
      test=develop
      
      * Add hacky paddle::string::to_string() implementation
      
      * vectorize<int64-t>() -> vectorize() cleanup
      test=develop
      
      * PADDLE_ENFORCE and void_cast fixes
      test=develop
      
      * Rebase changes
      test=develop
      
      * Cosmetics
      test=develop
      
      * Delete MKL from mkldnn.cmake
      test=develop
      
      * CMake debug commands
      test=develop
      
      * Delete MKLDNN_VERBOSE and rebase fixes
      test=develop
      
      * Rebase fixes
      test=develop
      
      * Temporarily disable int8 resnet101 vgg16 and vgg19 tests
      test=develop
      
      * Add libmkldnn.so.1 to python setup
      test=develop
      
      * Add libmkldnn.so.1 to inference_lib cmake after rebase
      test=develop
      
      * Post rebase fixes + FC int8 changes
      test=develop
      
      * Fix LRN NHWC
      test=develop
      
      * Fix NHWC conv3d
      test=develop
      
      * Windows build fix + next conv3d fix
      test=develop
      
      * Fix conv2d on AVX2 machines
      test=develop
      e81f0228
  6. 06 12月, 2019 1 次提交
  7. 05 12月, 2019 2 次提交
  8. 04 12月, 2019 1 次提交
  9. 03 12月, 2019 2 次提交
  10. 02 12月, 2019 1 次提交
  11. 01 12月, 2019 1 次提交
  12. 29 11月, 2019 1 次提交
  13. 28 11月, 2019 2 次提交
  14. 25 11月, 2019 1 次提交
  15. 24 11月, 2019 1 次提交
  16. 18 11月, 2019 2 次提交
  17. 14 11月, 2019 2 次提交
  18. 13 11月, 2019 1 次提交
  19. 12 11月, 2019 1 次提交
  20. 08 11月, 2019 2 次提交
    • J
      Add transpose2 INT8 for mkl-dnn (#19424) · 77c20835
      joanna.wozna.intel 提交于
      * Add transpose2 INT8 for mkl-dnn
      
      test=develop
      
      * Fix test_transpose_int8_mkldnn
      
      test=develop
      
      * Revert "Merge branch 'develop' into transpose_int8_mkldnn_2"
      
      This reverts commit 34011bdb, reversing
      changes made to 2ce6473f.
      
      * Revert "Revert "Merge branch 'develop' into transpose_int8_mkldnn_2""
      
      This reverts commit 23754dd7.
      
      * Add template to TransposeMKLDNNHandler
      
      test=develop
      
      * Resolve conflict
      
      test=develop
      
      * Restore get_size and refactor
      
      test=develop
      77c20835
    • C
      Enrich the type of error and declare the error type interfaces (#21024) · 7ee25189
      Chen Weihang 提交于
      * Enrich the type of error and declare the error type interfaces, test=develop
      
      * adjust tests to adapt new form, test=develop
      
      * add inference deps with error_codes.pb.h, test=develop
      
      * restore stack iter start pos, test=develop
      
      * polish code based review comments, test=develop
      7ee25189
  21. 07 11月, 2019 1 次提交
  22. 06 11月, 2019 1 次提交
  23. 01 11月, 2019 1 次提交
  24. 31 10月, 2019 1 次提交
  25. 28 10月, 2019 1 次提交
  26. 25 10月, 2019 1 次提交
  27. 22 10月, 2019 1 次提交
  28. 20 10月, 2019 1 次提交
  29. 18 10月, 2019 3 次提交
  30. 17 10月, 2019 1 次提交
    • J
      [MKL-DNN] Added mkl-dnn cache clearing when creating Executor instance (#20241) · a1cd27f1
      Jacek Czaja 提交于
      * - Flushing mkl-dnn cache
      
      test=develop
      
      - Disabled clearing cache for LoadModel
      
      - Added clearing of mkl-dnn cache when Executor is created
      
      test=develop
      
      - Do not clear for GPU places
      
      test=develop
      
      - compilation fix
      
      test=develop
      
      * - Moved clearing of mkl-dnn cache in destructor of executor
      
      test=develop
      
      * - Compilation fix
      
      test=develop
      
      - Reverted conditional clearing of mkl-dnn cache in Executors's
        destructor
      
      test=develop
      
      - compilation fix
      a1cd27f1
  31. 16 10月, 2019 1 次提交