1. 24 Sep, 2019 1 commit
    • ReImplemented pooling fwd mkldnn (#19911) · 5b07ca9c
      Jacek Czaja committed
      - First implementation of BWD and FWD of pooling mkl-dnn
      
      - Compilation fix
      
      - Fix
      
      - Fix
      
      - Fix
      
      - Fix to crash
      
      - Compilation fix
      
      - Combined AcquireBacward with Fwd
      
      test=develop
  2. 23 Sep, 2019 1 commit
  3. 22 Sep, 2019 1 commit
  4. 20 Sep, 2019 2 commits
    • remove enforce.h file written, test=develop (#19897) · b25d1e75
      Zeng Jinle committed
    • [MKL-DNN] LRN refactoring (#19798) · 619c797a
      Jacek Czaja committed
      - LRN mkl-dnn kernel refactor
      
      test=develop
      
      - compilation fix
      
      - Another compilation fix
      
      - Compilation fix
      
      - another compilation fix
      
      - compilation fix
      
      - Crash fix
      
      - optional LRN mkldnn workspace
      
      - Added mid allocation
      
      - Workaround for tests
      
      - Removed gradient from is_test ut
      
      - Removed mid for inference
      
      - Reverted LRN mid removal for is_test
      
      - PADDLE_ENFORCE adjusted
      
      - Rebase to templatization commit
      
      - Compilation fix
      
      - compilation fix
      
      test=develop
      
      - lint
      
      test=develop
      
      - Fix to crash
      
      - Rebase to recent codebase
      
      - lint
      
      - lint
      
      - compilation fix
  5. 19 Sep, 2019 2 commits
  6. 18 Sep, 2019 2 commits
  7. 17 Sep, 2019 1 commit
  8. 16 Sep, 2019 1 commit
  9. 14 Sep, 2019 2 commits
  10. 12 Sep, 2019 1 commit
  11. 11 Sep, 2019 1 commit
    • Replace TemporaryAllocator by CUDADeviceContextAllocator (#18989) · 12542320
      Huihuang Zheng committed
      TemporaryAllocator is a singleton used to allocate memory for cuDNN. Removing the singleton is intended to improve memory performance.
      
      TemporaryAllocator is replaced by CUDADeviceContextAllocator and CUDADeviceContextAllocation, which use a stream callback to free the memory allocated for a stream, so no singleton is needed.
      
      Also added data_feed_proto to operator to fix CI in CPU compilation
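
The stream-callback idea described in this commit message can be illustrated with a minimal, CUDA-free sketch. Everything here is hypothetical and is not Paddle's code: `FakeStream` stands in for a CUDA stream (a FIFO queue of tasks), `StreamAllocation` stands in for an allocation tied to that stream, and `LiveBuffers` is only a counter for checking the behaviour. With a real CUDA stream the deferred free would presumably be registered through something like `cudaStreamAddCallback`; that mapping is an assumption, not something the commit message states.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdlib>
#include <functional>
#include <queue>
#include <utility>

// Hypothetical stand-in for a CUDA stream: tasks run in FIFO order.
class FakeStream {
 public:
  void Enqueue(std::function<void()> task) { tasks_.push(std::move(task)); }

  // Runs everything queued so far, similar to synchronizing a stream.
  void Synchronize() {
    while (!tasks_.empty()) {
      std::function<void()> task = std::move(tasks_.front());
      tasks_.pop();
      task();
    }
  }

 private:
  std::queue<std::function<void()>> tasks_;
};

// Number of buffers that have been allocated but not yet freed.
inline int& LiveBuffers() {
  static int count = 0;
  return count;
}

// Hypothetical allocation bound to a stream. Its destructor does not free
// the buffer directly; it enqueues a callback so the free happens only after
// all work queued earlier on the same stream has run.
class StreamAllocation {
 public:
  StreamAllocation(FakeStream* stream, std::size_t bytes)
      : stream_(stream), ptr_(std::malloc(bytes)) {
    assert(ptr_ != nullptr);
    ++LiveBuffers();
  }
  StreamAllocation(const StreamAllocation&) = delete;
  StreamAllocation& operator=(const StreamAllocation&) = delete;

  ~StreamAllocation() {
    void* p = ptr_;
    stream_->Enqueue([p] {
      std::free(p);
      --LiveBuffers();
    });
  }

  void* ptr() const { return ptr_; }

 private:
  FakeStream* stream_;
  void* ptr_;
};

// Returns true if the buffer outlives its owner until the stream has drained.
inline bool DeferredFreeDemo() {
  FakeStream stream;
  const int before = LiveBuffers();
  {
    StreamAllocation workspace(&stream, 1024);
    void* p = workspace.ptr();
    // A queued "kernel" that still uses the buffer after the owner is gone.
    stream.Enqueue([p] { static_cast<char*>(p)[0] = 1; });
  }  // Owner destroyed here: the free is queued behind the kernel.
  const bool still_alive = (LiveBuffers() == before + 1);
  stream.Synchronize();  // Kernel runs first, then the free callback.
  return still_alive && LiveBuffers() == before;
}
```

Because the free is queued on the same stream as the work that uses the buffer, ordering alone keeps the memory valid, and no process-wide singleton has to track pending deallocations.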
  12. 10 Sep, 2019 2 commits
  13. 09 Sep, 2019 1 commit
  14. 05 Sep, 2019 2 commits
  15. 03 Sep, 2019 3 commits
  16. 02 Sep, 2019 1 commit
  17. 01 Sep, 2019 2 commits
    • [MKL-DNN] Refactoring Softmax (#19312) · cef95ee3
      Jacek Czaja committed
      * - First set of modifications
      
      - Compilation fixes
      
      - compilation fix
      
      - Another compilation fix
      
      - Moved AcquireSoftmaxPrimitiveDescriptor call into handler
      
      - MKL-DNN Softmax PD refactor
      
      test=develop
      
      - Compilation fix
      
      test=develop
      
      - another compilation fix
      
      - cosmetics
      
      test=develop
      
      - Compilation fix
      
      - Fix to crash when softmax backward is created
      
      * - Fixes after review of softmax refactoring
      
      test=develop
    • Add retry_allocator for gpu (#19409) · 0a73f720
      Zeng Jinle committed
      * add retry_allocator for gpu, test=develop
      
      * follow chengduoZH's comments, test=develop
      
      * follow huihuang's comments,test=develop
      
      * change f,l in enforce.h to be file,line, test=develop
      
      * increase code coverage by adding unittests, test=develop
      
      * fix CMakeLists.txt, test=develop
  18. 30 Aug, 2019 3 commits
    • [MKL-DNN] Fix to face model on AVX512 platforms (#19282) · ecd9f330
      Jacek Czaja committed
      - Refactor step 1
      
      - Compilation fix
      
      - Yet another compilation fix
      
      - Even more compilation fix
      
      - Lint fixes
      
      test=develop
      
      - Removed deprecated PADDLE_ENFORCE occurrence
      
      test=develop
      
      - Candidate fix to BN forward
      
      - Lint fixes
      
      test=develop
      
      - Refactoring in data_layout_transform
      
      - compilation fix
      
      - Another compilation fix
      
      - Step further into darkness
      
      - Yet another compilation fix
      
      - Yet another compilation fix
      
      - missing header
      
      - compilation fix
      
      - Added MKLDNN -> Paddle conversion in fetch op
      
      test=develop
      
      - Compilation fix
      
      test=develop
      
      - Lint
      
      test=develop
      
      - Mul fix
      
      - Fix to MKLDNN MUL op and Elementwise MUL UT
      
      test=develop
      
      - Workaround for the different representation of weights with groups
        in Paddle vs MKL-DNN.
      
      test=develop
      
      - Candidate fix for 5D convolution with groups
      
      - Refactor of fix for conv3d and conv2d in fetch op
      
      test=develop
      
      - Compilation fix
      
      - Still same compilation fix
      
      - Compilation fix
      
      - Compilation fix
      
      - Reverted refactoring of fixes
      
      - Adapted test_conv2d_int8_mkldnn so it expects data in NCHW format,
        not NHWC
      
      test=develop
      
      - minor fix in UT
      
      test=develop
      
      - Lint fixes
      
      test=develop
    • L
      d6cb1a41
    • remove signal raise msg, test=develop (#19527) · c2c5b1b9
      Zeng Jinle committed
  19. 28 Aug, 2019 1 commit
  20. 27 Aug, 2019 2 commits
  21. 20 Aug, 2019 2 commits
  22. 19 Aug, 2019 1 commit
  23. 16 Aug, 2019 2 commits
  24. 15 Aug, 2019 1 commit
  25. 12 Aug, 2019 2 commits