1. 25 Dec, 2019 3 commits
  2. 24 Dec, 2019 3 commits
    • A
      Optimize adam speed (#21777) · 51a86d2b
      Aurelius84 committed
      * optimize adam speed by removing _finish_update test=develop
      
      * fix SparseAdamFunctor param list test=develop
      
      * Remove scale_op in expect_list of adam_op test=develop
      
      * fix test optimizer loss assert error test=develop
      
      * fix test optimizer loss assert error test=develop
      
      * modify PADDLE_ENFORCE usage test=develop
      
      * fix op_type in lamb_op.cc test=develop
      
      * fix errors ostream format bug test=develop
      
      * add betaPowOut in ngraph op test=develop
      
      * fix ngraph::op api for gcc8 test=develop
      
      * clean code test=develop
      
      * modify struct into class test=develop
      
      * remove code of beta1Tensor in lamb_op test=develop
      51a86d2b
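The commit notes above mention removing `_finish_update`, dropping `scale_op` from the expected op list, and adding a `betaPowOut` output. One way to read this is that the running powers of beta1 and beta2 are advanced inside the Adam step itself rather than by separate ops afterwards. The following is a minimal scalar sketch under that assumption; the function name `adam_step` and its signature are hypothetical and are not Paddle's API.

```python
import math


def adam_step(param, grad, m, v, beta1_pow, beta2_pow,
              lr=0.001, beta1=0.9, beta2=0.999, eps=1e-8):
    """One scalar Adam update that also returns the advanced beta powers.

    Hypothetical helper for illustration; beta1_pow and beta2_pow are
    assumed to hold beta1**t and beta2**t for the current step t.
    """
    # Update the first and second moment estimates.
    m = beta1 * m + (1 - beta1) * grad
    v = beta2 * v + (1 - beta2) * grad * grad
    # Fold the bias correction into the learning rate.
    lr_t = lr * math.sqrt(1 - beta2_pow) / (1 - beta1_pow)
    param = param - lr_t * m / (math.sqrt(v) + eps)
    # Advance the beta powers in the same step, so no separate
    # scaling pass is needed after the update.
    return param, m, v, beta1_pow * beta1, beta2_pow * beta2
```

A first step would be called as `adam_step(1.0, 0.5, 0.0, 0.0, 0.9, 0.999)`, feeding the returned powers back in on the next call.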
    • F
      Update iou_similarity op to support non-normalized bbox (#21671) · 6b9fbcf3
      FDInSky committed
      Update iou_similarity op to support non-normalized bbox
      6b9fbcf3
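To illustrate what supporting non-normalized boxes can mean for an IoU computation, here is a minimal sketch. It assumes the common convention that boxes in pixel coordinates are inclusive, so widths and heights get a +1 offset, while normalized boxes in [0, 1] do not. The function `iou` and its `box_normalized` flag are illustrative names, not taken from the commit.

```python
def iou(box_a, box_b, box_normalized=True):
    """IoU of two boxes given as (xmin, ymin, xmax, ymax).

    Assumption: non-normalized (pixel) boxes are inclusive, so every
    extent is widened by 1; normalized boxes use no offset.
    """
    offset = 0.0 if box_normalized else 1.0
    area_a = (box_a[2] - box_a[0] + offset) * (box_a[3] - box_a[1] + offset)
    area_b = (box_b[2] - box_b[0] + offset) * (box_b[3] - box_b[1] + offset)
    # Width and height of the overlap; negative means no overlap.
    inter_w = min(box_a[2], box_b[2]) - max(box_a[0], box_b[0]) + offset
    inter_h = min(box_a[3], box_b[3]) - max(box_a[1], box_b[1]) + offset
    inter = max(inter_w, 0.0) * max(inter_h, 0.0)
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0
```

For example, `iou((0, 0, 9, 9), (5, 5, 14, 14), box_normalized=False)` treats each box as 10 x 10 pixels with a 5 x 5 overlap.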
    • G
      Modify the while_loop API (#21844) · 46f9184a
      guofei committed
      46f9184a
  3. 23 Dec, 2019 2 commits
  4. 20 Dec, 2019 1 commit
  5. 19 Dec, 2019 4 commits
  6. 17 Dec, 2019 1 commit
  7. 16 Dec, 2019 3 commits
  8. 15 Dec, 2019 1 commit
  9. 12 Dec, 2019 2 commits
    • J
      Add reshape int8 mkldnn op (#21428) · d419b859
      joanna.wozna.intel committed
      * Add reshape int8 op
      
      test=develop
      
      * Change test to CPUPlace
      
      test=develop
      
      * Correct tests
      
      test=develop
      d419b859
    • T
      memory leak for cpu (#21174) · 9ad940fd
      tangwei12 committed
      * add fake init for the trainer, fix large memory hold in the trainer
      * do not merge recv vars from a remote endpoint, test=develop
      * add recv and save op, merge slice var in one op, save memory
      * remove hsigmoid with pull sparse, test=develop
      9ad940fd
  10. 11 Dec, 2019 1 commit
  11. 10 Dec, 2019 5 commits
    • W
    • Z
      refine some grad op makers, test=develop (#21629) · 29f64c8c
      Zeng Jinle committed
      29f64c8c
    • M
      Dropout with seed (#21590) · e2d849b9
      mapingshuo committed
      * add seed op
      e2d849b9
    • A
      MKL-DNN 1.0 Update (#20162) · e81f0228
      Adam committed
      * MKLDNN v1.0 rebase to Paddle 1.6
      test=develop
      
      * Add hacky paddle::string::to_string() implementation
      
      * vectorize<int64_t>() -> vectorize() cleanup
      test=develop
      
      * PADDLE_ENFORCE and void_cast fixes
      test=develop
      
      * Rebase changes
      test=develop
      
      * Cosmetics
      test=develop
      
      * Delete MKL from mkldnn.cmake
      test=develop
      
      * CMake debug commands
      test=develop
      
      * Delete MKLDNN_VERBOSE and rebase fixes
      test=develop
      
      * Rebase fixes
      test=develop
      
      * Temporarily disable int8 resnet101 vgg16 and vgg19 tests
      test=develop
      
      * Add libmkldnn.so.1 to python setup
      test=develop
      
      * Add libmkldnn.so.1 to inference_lib cmake after rebase
      test=develop
      
      * Post rebase fixes + FC int8 changes
      test=develop
      
      * Fix LRN NHWC
      test=develop
      
      * Fix NHWC conv3d
      test=develop
      
      * Windows build fix + next conv3d fix
      test=develop
      
      * Fix conv2d on AVX2 machines
      test=develop
      e81f0228
    • W
      Mean gpu optimize (#21643) · 95b95a28
      wangchaochaohu committed
      * accelerate mean op test=develop
      95b95a28
  12. 06 Dec, 2019 5 commits
    • Z
      Polish op registry codes (#21561) · 0f888836
      Zeng Jinle committed
      * polish infer shape registry, test=develop
      
      * modify some operators registry, test=develop
      0f888836
    • A
      3d9dee57
    • H
      Add Much Complex Test and Fix Bugs for Control Flow cond API (#21532) · 1dcf6a72
      Huihuang Zheng committed
      Add tests that use dy/dx to check that the gradient values computed by the control flow backward pass are correct. Also fix the bugs those tests detected.
      
      Bug fixes:
      
      1. Unlike sum_op, optimizer ops do not accept uninitialized input tensors. In conditional_block_grad_op, the conditional_block may not run, so the output gradient tensor can be left uninitialized, which makes the optimizer op fail. There are two possible fixes: let optimizer ops accept uninitialized input as sum_op does, or set the gradient to 0 when conditional_block_grad_op does not run. Since there are more than 10 optimizer ops, **to keep things simple, this PR sets the output gradient of conditional_block_grad_op to 0**. Whether optimizer ops can support uninitialized input tensors like sum_op is worth exploring further, because in theory skipping the zero assignment in conditional_block_grad_op would be faster.
      
      2. Infer parameter shapes during append_backward. All parameters live in the global block, which I had not accounted for. When op_desc infers shapes in a sub-block, it may not know the shapes of parameter gradients, because that shape information is held in the global block. I fixed this by inferring the gradient shapes from the forward variables.
      
      This PR also includes some code cleanup:
      1. Print the variable name when sgd_op catches a shape error, to make debugging easier
      2. Fix a typo: dicta -> dict
      1dcf6a72
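The first fix described in this commit can be pictured with a small sketch: when the conditional block is skipped, its gradient op still hands the optimizer an initialized, all-zero gradient instead of an uninitialized tensor. The functions below are hypothetical stand-ins written with plain Python lists; they are not the Paddle operators themselves.

```python
def conditional_block_grad(cond, x, grad_fn):
    """Gradient with respect to x of a block that only runs when cond is true.

    Hypothetical stand-in for the idea in the commit message.
    """
    if cond:
        return grad_fn(x)
    # The block did not run: emit zeros rather than leaving the
    # gradient uninitialized, so the optimizer can still consume it.
    return [0.0 for _ in x]


def sgd_step(param, grad, lr=0.1):
    """Toy optimizer that, like the ops described, rejects missing gradients."""
    if grad is None:
        raise ValueError("gradient is uninitialized")
    return [p - lr * g for p, g in zip(param, grad)]
```

With the zero gradient, `sgd_step` leaves the parameters unchanged when the branch is skipped, which matches the expected behavior of a branch that contributed nothing to the loss.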
    • J
      - Fix to regression in performance of ResNet-50 training (#21588) · 8f5a93a0
      Jacek Czaja committed
      test=develop
      8f5a93a0
    • J
      [MKL-DNN] Batch norm mkl-dnn NHWC support (#21553) · 9ce0e29d
      Jacek Czaja committed
      * - Batch norm mkl-dnn NHWC
      
      test=develop
      
      - compilation fix
      
      test=develop
      
      - UT fix
      
      - cosmetics
      
      test=develop
      
      - Fix to Batch Norm MKL-DNN NHWC UT
      
      test=develop
      
      Conflicts:
      	paddle/fluid/operators/batch_norm_op.h
      
      * - Lint fixes
      
      test=develop
      9ce0e29d
  13. 04 Dec, 2019 3 commits
  14. 03 Dec, 2019 6 commits