1. 30 Aug, 2019 1 commit
    • [Cherry-pick] Support memory eager deletion on recurrent OP (#19411) · cb74dac3
      Huihuang Zheng committed
      * Support memory eager deletion on recurrent OP (#17710)
      
      Test PaddingRNN on V100 GPU device.
      
      Test configuration: large model, padding mode (which is the mode using recurrentOp), one GPU.
                         
      GPU memory (MiB):  6414 (this PR)   vs  6837 (without this PR)
      Speed (steps/s):   10.28 (this PR)  vs  9.89 (without this PR)
      
      * Fix random test_recurrent_op failure (#18718)
      
      The change includes 3 things:
      
      1. Set CPU_NUM to 1 in the tests; otherwise ParallelExecutor prints a warning that CPU_NUM is not set and falls back to the default of 1.
      
      2. The old tests compared two RNNs, a hand-written simple RNN and the same RNN built by Paddle, but initialized their weights separately with numpy random and Paddle random. Fixed by setting the weight and bias values explicitly.
      
      3. Also set the numpy random seed in the tests. The allowed difference between the two RNNs can now be smaller (rtol reduced from 0.1 and 0.2 to 0.01).
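
The three changes above can be sketched in plain NumPy. This is an illustrative sketch only, not the code from the PR: the function names, tensor shapes, seed value, and the tanh activation are all assumptions, and the second function merely stands in for the RNN built by Paddle. The point it shows is that both implementations receive the same explicitly created weights, so a tight rtol is safe.

```python
import os
import numpy as np

# Assumption: setting CPU_NUM up front mirrors change 1 (avoids the warning).
os.environ.setdefault("CPU_NUM", "1")

# Change 3: fix the NumPy seed so every run sees the same inputs and weights.
np.random.seed(123)


def reference_rnn(x, w, u, b, h0):
    """Hand-written simple RNN: h_t = tanh(x_t W + h_{t-1} U + b)."""
    h, outs = h0, []
    for t in range(x.shape[0]):
        h = np.tanh(x[t] @ w + h @ u + b)
        outs.append(h)
    return np.stack(outs)


def rnn_under_test(x, w, u, b, h0):
    """Hypothetical stand-in for the framework-built RNN (same math,
    but the input projection is precomputed for all time steps)."""
    xw = np.einsum("tbi,ih->tbh", x, w) + b
    h, outs = h0, np.empty_like(xw)
    for t in range(x.shape[0]):
        h = np.tanh(xw[t] + h @ u)
        outs[t] = h
    return outs


seq_len, batch, input_dim, hidden_dim = 5, 3, 4, 6
x = np.random.rand(seq_len, batch, input_dim).astype("float32")
# Change 2: create the weights and bias once and hand them to both RNNs.
w = np.random.rand(input_dim, hidden_dim).astype("float32")
u = np.random.rand(hidden_dim, hidden_dim).astype("float32")
b = np.random.rand(hidden_dim).astype("float32")
h0 = np.zeros((batch, hidden_dim), dtype="float32")

expected = reference_rnn(x, w, u, b, h0)
actual = rnn_under_test(x, w, u, b, h0)
np.testing.assert_allclose(actual, expected, rtol=0.01)
```

If each implementation drew its own random weights instead, the outputs would differ by far more than rounding error, so no small rtol could hold reliably.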
  2. 29 Aug, 2019 2 commits
  3. 26 Aug, 2019 2 commits
  4. 21 Aug, 2019 2 commits
  5. 20 Aug, 2019 1 commit
  6. 29 Jul, 2019 1 commit
  7. 08 Jul, 2019 1 commit
  8. 05 Jul, 2019 1 commit
  9. 28 Jun, 2019 1 commit
    • Update the Anakin interfaces for content-dnn and MLU, test=release/1.5 (#18028) · 924e53b7
      石晓伟 committed
      * Update the Anakin interfaces for content-dnn and MLU (#17890)
      
      * update anakin-engine interfaces for content-dnn
      
      test=develop
      
      * support only-gpu mode of Anakin
      
      modify eltwise parse
      
      test=develop
      
      * modification for thread-safe
      
      test=develop
      
      * Integrated template instance
      
      test=develop
      
      * increase template parameters
      
      test=develop
      
      * support MLU predictor
      
      test=develop
      
      * update anakin cmake files
      
      test=develop
      
      * update TargetWrapper::set_device
      
      * update the initialization of anakin subgraph
      
      test=develop
      
      * use the default constructor of base class
      
      test=develop
      
      * modify the access level of anakin engine (#18015)
      
      test=develop
      
      * fix ci test cmake test=develop
  10. 27 Jun, 2019 1 commit
  11. 26 Jun, 2019 1 commit
  12. 24 Jun, 2019 1 commit
  13. 19 Jun, 2019 3 commits
  14. 17 Jun, 2019 1 commit
  15. 15 Jun, 2019 1 commit
  16. 14 Jun, 2019 1 commit
  17. 13 Jun, 2019 1 commit
  18. 10 Jun, 2019 2 commits
  19. 08 Jun, 2019 1 commit
  20. 06 Jun, 2019 2 commits
  21. 05 Jun, 2019 1 commit
    • [NGraph] some ngraph updates to enable bert (#17739) · a4c528a3
      baojun committed
      * delay infershape test=develop
      
      * fall back subblock to paddle test=develop
      
      * fix edge cases test=develop
      
      * remove output duplicates test=develop
      
      * handle reshape2_grad infershape test=develop
  22. 04 Jun, 2019 2 commits
  23. 03 Jun, 2019 1 commit
  24. 31 May, 2019 1 commit
  25. 30 May, 2019 2 commits
  26. 29 May, 2019 2 commits
  27. 28 May, 2019 1 commit
    • [MKL-DNN] conv_transpose mkldnn bias pass (#17644) · 6d8075ec
      Jacek Czaja committed
      * - changes to graph detector
      
      - Changes to pass
      
      - Added ut for new pass
      
      - use_pass
      
      - Added pass to mkldnn passes
      
      - fix to registration
      
      - improved verbose messaging for conv bias passes
      
      - Lint fixes
      
      test=develop
      
      * - Lint fixes
      
      test=develop
  28. 27 May, 2019 3 commits
    • add Concat quantization (#17448) · 96845d21
      Sylwester Fraczek committed
      * add Concat quantization
      add unit test for quantizing concat
      fix for wrong value when the input is not in map of calculated scales
      add use_quantizer to concat_op.cc
      add scale_algo rules for concat
      
      test=develop
      
      * missing fix for multiple inputs quantize-squash
      
      * wojtuss review fix: adding comment
      
      test=develop
    • G
      65bbf950
    • Code clean of Allocator (#17602) · 4aa931dd
      Zeng Jinle committed
      * Revert "Revert "Fix allocator bug""
      
      This reverts commit 174d0d0b.
      
      * Revert "fix travis ci"
      
      This reverts commit 5656fa9f.
      
      test=develop
      
      * add inlined_vector.h, test=develop
      
      * add inlined_vector_test,test=develop
      
      * clean code of allocator,test=develop
      
      * delete zero_size_allocator.h,test=develop
      
      * fix failed unittest,test=develop