1. 10 12月, 2019 2 次提交
    • A
      MKL-DNN 1.0 Update (#20162) · e81f0228
      Adam 提交于
      * MKLDNN v1.0 rebase to Paddle 1.6
      test=develop
      
      * Add hacky paddle::string::to_string() implementation
      
      * vectorize<int64-t>() -> vectorize() cleanup
      test=develop
      
      * PADDLE_ENFORCE and void_cast fixes
      test=develop
      
      * Rebase changes
      test=develop
      
      * Cosmetics
      test=develop
      
      * Delete MKL from mkldnn.cmake
      test=develop
      
      * CMake debug commands
      test=develop
      
      * Delete MKLDNN_VERBOSE and rebase fixes
      test=develop
      
      * Rebase fixes
      test=develop
      
      * Temporarily disable int8 resnet101 vgg16 and vgg19 tests
      test=develop
      
      * Add libmkldnn.so.1 to python setup
      test=develop
      
      * Add libmkldnn.so.1 to inference_lib cmake after rebase
      test=develop
      
      * Post rebase fixes + FC int8 changes
      test=develop
      
      * Fix LRN NHWC
      test=develop
      
      * Fix NHWC conv3d
      test=develop
      
      * Windows build fix + next conv3d fix
      test=develop
      
      * Fix conv2d on AVX2 machines
      test=develop
      e81f0228
    • W
      Mean gpu optimize (#21643) · 95b95a28
      wangchaochaohu 提交于
      * accelerate mean op test=develop
      95b95a28
  2. 06 12月, 2019 5 次提交
    • Z
      Polish op registry codes (#21561) · 0f888836
      Zeng Jinle 提交于
      * polish infer shape registry, test=develop
      
      * modify some operators registry, test=develop
      0f888836
    • A
      3d9dee57
    • H
      Add Much Complex Test and Fix Bugs for Control Flow cond API (#21532) · 1dcf6a72
      Huihuang Zheng 提交于
      Add tests to use dy/dx to make sure the gradient values calculated by the control flow backward is correct. Also fixed bugs detected by those tests.
      
      Fix bugs:
      
      1. Unlike sum_op, optimizer ops don't allow uninitialized input tensor. But in conditional_block_grad_op, since the conditional_block may not run, the output gradient tensor may be uninitialized, which will cause the optimizer op error. To fix it, we should let optimizer ops support uninitialized input like sum_op or assign the uninitialized gradient to 0 when the conditional_block_grad_op doesn't run. I found there are about 10+ optimizer ops. **To be simpler, I just assign output gradient of the conditional_block_grad_op to 0 in this PR**. But it can be further explored whether we can make optimizer ops like sum_op to support uninitialized input tensor because theoretically we can speed up without the assigning in conditional_block_grad_op.
      
      2. Infer parameter shapes during append_backward. I didn't know that all our parameters are in global block. When op_desc is inferring shapes at the sub-block, it may not know the shape of gradients of parameters whose shape information is at global block. I fixed it by inferring shapes of gradients from forward var.
      
      This PR also did some code clean up:
      1. Print the var name when sgd_op catches shape error so that it is easier to debug
      2. Fix a typo: dicta -> dict
      1dcf6a72
    • J
      - Fix to regression in performance of ResNet-50 training (#21588) · 8f5a93a0
      Jacek Czaja 提交于
      test=develop
      8f5a93a0
    • J
      [MKL-DNN] Batch norm mkl-dnn NHWC support (#21553) · 9ce0e29d
      Jacek Czaja 提交于
      * - BAtch norm mkl-dnn NHWC
      
      test=develop
      
      - compilation fix
      
      test=develop
      
      - UT fix
      
      - cosmetics
      
      test=develop
      
      - Fix to Batch Norm MKL-DNN NHWC UT
      
      test=develop
      
      Conflicts:
      	paddle/fluid/operators/batch_norm_op.h
      
      * - Lint fixes
      
      test=develop
      9ce0e29d
  3. 04 12月, 2019 3 次提交
  4. 03 12月, 2019 6 次提交
  5. 02 12月, 2019 7 次提交
  6. 01 12月, 2019 1 次提交
  7. 29 11月, 2019 9 次提交
    • L
      add unused input vars check for OpWithKernel, test=develop (#21169) · e0c9d856
      Leo Chen 提交于
      * add unused input vars check for OpWithKernel, test=develop
      
      * remove unused vars in some ops, test=develop
      
      * fix batch_norm, test=develop
      
      * add white list, test=develop
      
      * add CI check for white list, test=develop
      
      * :ove white list to c++, test=develop
      
      * solve failure of CI, test=develop
      
      * add unittest for unused_var_check, test=develop
      
      * refine code, enable check in operator_test, test=develop
      
      * skip mkldnn, test=develop
      
      * extend white list, test=develop
      
      * refine condition of mkldnn, test=develop
      
      * fix paddle_build, test=develop
      
      * follow comments, test=develop
      
      * fix GetExpectedKernelType
      
      * add wiki ref to err_msg, test=develop
      
      * follow comment, test=develop
      e0c9d856
    • C
      Fix optimizer op infershape failed in dygraph multi-cards mode (#21374) · 664f958a
      Chen Weihang 提交于
      * add param & grad shape check for sgd op
      
      * add _reshape_inplece interface for dygraph parallel
      
      * refine unittest based paddle/models scripts, test=develop
      
      * add unittest for parallel grad fuse, test=develop
      664f958a
    • H
      Fix Cond Bug for Nested Control Flow (#21340) · 630be319
      Huihuang Zheng 提交于
      * Commit before merging develop
      
      test=develop
      
      * Backup after working with Huihuang logs
      
      * Commit before deleting Huihuang debug loggings
      
      * Commit before debug
      
      test=develop
      
      * Fix bug commit
      
      test=develop
      
      * Backup of fixing bugs
      
      test=develop
      
      * Clean up code
      
      test=develop
      
      * Fix a bug in sum_op
      
      test=develop
      630be319
    • J
      [MKL-DNN] LRN and Pool2d (FWD) NHWC support (#21375) · cd43c444
      Jacek Czaja 提交于
      cd43c444
    • L
      remove kDepXOut for abs_grad op, test=develop (#21407) · add62acf
      Leo Chen 提交于
      add62acf
    • A
      Add template version of UpdatePadding (#21426) · 9107bf20
      Adam 提交于
      test=develop
      9107bf20
    • Z
      Add dscending for argsort (#21400) · b1627455
      zhaoyuchen2018 提交于
      * Add ascending for argsort
      
      * Refine api doc description.
      
      * Refine descending description
      
      * Add int32 logic to speedup when data is small size.
      
      * Remove int32 opt as not support in python
      b1627455
    • H
      Add dygraph execution context (#20157) · ac854670
      hong 提交于
      * add_dygraph_execution_context
      
      * add dygraph infershape context and execution context; test=develop
      
      * fix imperative bug; test=develop
      
      * remove inputs outputs interface from execution context,
      because it have same function with inputNames;
      test=develop
      
      * remove tracer_test ctest; test=develop
      
      * fix split op bug; test=develop
      
      * fix unitests bug; test=develop
      
      * fix distribute test bug; test=develop
      
      * fix ngraph compile bug; test=develop
      
      * fix grad maker bug; test=develop
      
      * fix load op bugs; test=develop
      
      * fix operator.cc construct bug; test=develop
      
      * remove useless name find in operator; test=develop
      
      * add tracer_test; test=develop
      
      * fix concat, split bug; test=develop
      
      * remove tracer_test unitest; test=develop
      
      * fix attribute check bug; test=develop
      
      * add test code to fix converage; test=develop
      
      * remove useless code, change check backward input in engin; test=develop
      
      * unlock var type infer shape;test=develop
      
      * add ShareAllLoD api; test=develop
      
      * add dygraph infershape context unitest; test=develop
      
      * remove increase and decrease lod in dygraph; test=develop
      
      * addd override; test=develop
      
      * fix increase descrease lod; test=develop
      
      * fix paddle_enforce; test=develop
      
      * disable lod op dygraph check; test=develop
      
      * fix paddle enforce error; test=develop
      
      * add comment for op_registry and OperatorBase; test=develop
      
      * optimize the comment of op_registry; test=develop
      
      * fix format of comment; test=develop
      
      * fix format of comment; test=develop
      
      * optimize the format of comment; test=develop
      
      * optimize the format of the comment; test=develop
      
      * optimize comment of op_registry; test=develop
      ac854670
    • H
      add macro to ban windows (#21422) · a6b089c6
      hutuxian 提交于
      remove nccl related code in windows
      a6b089c6
  8. 28 11月, 2019 7 次提交