1. 12 Dec, 2019 1 commit
    • T
      memory leak for cpu (#21174) · 9ad940fd
      tangwei12 committed
      * add fake init for the trainer, fix large memory hold in the trainer
      * do not merge recv vars from a remote endpoint, test=develop
      * add recv and save op, merge slice var in one op, save memory
      * remove hsigmoid with pull sparse, test=develop
      9ad940fd
  2. 11 Dec, 2019 2 commits
  3. 10 Dec, 2019 3 commits
    • A
      MKL-DNN 1.0 Update (#20162) · e81f0228
      Adam committed
      * MKLDNN v1.0 rebase to Paddle 1.6
      test=develop
      
      * Add hacky paddle::string::to_string() implementation
      
      * vectorize<int64_t>() -> vectorize() cleanup
      test=develop
      
      * PADDLE_ENFORCE and void_cast fixes
      test=develop
      
      * Rebase changes
      test=develop
      
      * Cosmetics
      test=develop
      
      * Delete MKL from mkldnn.cmake
      test=develop
      
      * CMake debug commands
      test=develop
      
      * Delete MKLDNN_VERBOSE and rebase fixes
      test=develop
      
      * Rebase fixes
      test=develop
      
      * Temporarily disable int8 resnet101 vgg16 and vgg19 tests
      test=develop
      
      * Add libmkldnn.so.1 to python setup
      test=develop
      
      * Add libmkldnn.so.1 to inference_lib cmake after rebase
      test=develop
      
      * Post rebase fixes + FC int8 changes
      test=develop
      
      * Fix LRN NHWC
      test=develop
      
      * Fix NHWC conv3d
      test=develop
      
      * Windows build fix + next conv3d fix
      test=develop
      
      * Fix conv2d on AVX2 machines
      test=develop
      e81f0228
    • X
      fix master patch when slot is dense (#21580) · f4041572
      xujiaqi01 committed
      * fix master patch when slot is dense
      * test=develop
      f4041572
    • X
      fix code style of fleet_wrapper (#21639) · c05706fe
      xujiaqi01 committed
      * fix code style of fleet_wrapper
      * test=develop
      c05706fe
  4. 07 Dec, 2019 1 commit
  5. 06 Dec, 2019 3 commits
  6. 04 Dec, 2019 1 commit
  7. 03 Dec, 2019 2 commits
  8. 02 Dec, 2019 2 commits
  9. 01 Dec, 2019 1 commit
  10. 29 Nov, 2019 5 commits
    • L
      add unused input vars check for OpWithKernel, test=develop (#21169) · e0c9d856
      Leo Chen committed
      * add unused input vars check for OpWithKernel, test=develop
      
      * remove unused vars in some ops, test=develop
      
      * fix batch_norm, test=develop
      
      * add white list, test=develop
      
      * add CI check for white list, test=develop
      
      * move white list to c++, test=develop
      
      * solve failure of CI, test=develop
      
      * add unittest for unused_var_check, test=develop
      
      * refine code, enable check in operator_test, test=develop
      
      * skip mkldnn, test=develop
      
      * extend white list, test=develop
      
      * refine condition of mkldnn, test=develop
      
      * fix paddle_build, test=develop
      
      * follow comments, test=develop
      
      * fix GetExpectedKernelType
      
      * add wiki ref to err_msg, test=develop
      
      * follow comment, test=develop
      e0c9d856
    • H
      Fix Cond Bug for Nested Control Flow (#21340) · 630be319
      Huihuang Zheng committed
      * Commit before merging develop
      
      test=develop
      
      * Backup after working with Huihuang logs
      
      * Commit before deleting Huihuang debug loggings
      
      * Commit before debug
      
      test=develop
      
      * Fix bug commit
      
      test=develop
      
      * Backup of fixing bugs
      
      test=develop
      
      * Clean up code
      
      test=develop
      
      * Fix a bug in sum_op
      
      test=develop
      630be319
    • J
      [MKL-DNN] LRN and Pool2d (FWD) NHWC support (#21375) · cd43c444
      Jacek Czaja committed
      cd43c444
    • Z
    • H
      Add dygraph execution context (#20157) · ac854670
      hong committed
      * add_dygraph_execution_context
      
      * add dygraph infershape context and execution context; test=develop
      
      * fix imperative bug; test=develop
      
      * remove inputs outputs interface from execution context,
      because it has the same function as inputNames;
      test=develop
      
      * remove tracer_test ctest; test=develop
      
      * fix split op bug; test=develop
      
      * fix unittests bug; test=develop
      
      * fix distribute test bug; test=develop
      
      * fix ngraph compile bug; test=develop
      
      * fix grad maker bug; test=develop
      
      * fix load op bugs; test=develop
      
      * fix operator.cc construct bug; test=develop
      
      * remove useless name find in operator; test=develop
      
      * add tracer_test; test=develop
      
      * fix concat, split bug; test=develop
      
      * remove tracer_test unittest; test=develop
      
      * fix attribute check bug; test=develop
      
      * add test code to fix coverage; test=develop
      
      * remove useless code, change check backward input in engine; test=develop
      
      * unlock var type infer shape;test=develop
      
      * add ShareAllLoD api; test=develop
      
      * add dygraph infershape context unittest; test=develop
      
      * remove increase and decrease lod in dygraph; test=develop
      
      * add override; test=develop
      
      * fix increase decrease lod; test=develop
      
      * fix paddle_enforce; test=develop
      
      * disable lod op dygraph check; test=develop
      
      * fix paddle enforce error; test=develop
      
      * add comment for op_registry and OperatorBase; test=develop
      
      * optimize the comment of op_registry; test=develop
      
      * fix format of comment; test=develop
      
      * fix format of comment; test=develop
      
      * optimize the format of comment; test=develop
      
      * optimize the format of the comment; test=develop
      
      * optimize comment of op_registry; test=develop
      ac854670
  11. 28 Nov, 2019 4 commits
  12. 27 Nov, 2019 3 commits
    • Y
      Support numpy bridge (enabled by default in dygraph mode) (#20983) · d5ff79e5
      Youwei Song committed
      * add numpy bridge
      
      * fix template compile
      
      * add unittest, add default
      test=develop
      
      * fix unittest
      test=develop
      
      * fix unittest
      test=develop
      
      * zero_copy=True for to_variable,
      test=develop
      
      * bug fix
      test=develop
      
      * disable deprecated NumPy API
      test=develop
      
      * use better design of NumpyAllocator
      test=develop
      
      * fix Py_None check
      test=develop
      
      * reset c++ tracer when jump out dygraph guard
      test=develop
      
      * refine PADDLE_ENFORCE_xx format
      test=develop
      
      * bug fix of tracer switch
      test=develop
      
      * update decref
      test=develop
      d5ff79e5
    • G
      Polish the codes of fc when needs padding (#21378) · 8493f20e
      GaoWei8 committed
      test=develop
      8493f20e
    • M
      INT8 Fully-connected (#17641) · 5d7d5482
      Michał Gallus committed
      * Implement Int8 FC
      
      * Integrate FC into INT8v2
      
      test=develop
      
      * int8 FC: transpose weights before computing scales
      
      test=develop
      
      * Add support for activation_type string in FC
      
      test=develop
      
      * Disable MKL-DNN's FC in VGG16 and 19
      
      test=develop
      
      * Disable FC quantization when mkldnn FC is disabled
      
      test=develop
      
      * Solve PADDLE_ENFORCES in FC int8
      
      * Fix Paddle enforces and remove const cast
      
      test=develop
      
      * Fix style changes
      
      test=develop
      
      * Fix quantizer_tester test and add fc quantization
      
      test=develop
      
      * Fix FC test fail on CUDA
      
      * Remove unnecessary log from quantize placement pass
      
      test=develop
      
      * Add Thread ID to FC hash key
      
      test=develop
      
      * Add comments to MKL-DNN FC Kernel
      
      test=develop
      
      * Refactor quantizer
      
      test=develop
      
      * Fix linter issues
      
      test=develop
      
      * Fix crash in slim googlenet
      
      test=develop
      
      * Fix PADDLE_ENFORCE messages
      
      test=develop
      5d7d5482
  13. 26 Nov, 2019 1 commit
    • G
      Add fc padding to improve mkl GEMM's performance when N and K are multiples of 128. (#20972) · 234060f8
      GaoWei8 committed
      * Add fc padding to solve mkl performance
      test=develop
      
      * fix gpu pass and error information
      test=develop
      
      * fix fc_fuse_pass_test
      test=develop
      
      * fix error information
      test=develop
      
      * fix error information
      test=develop
      
      * fix name and add fc op padding test
      test=develop
      
      * fix attributes
      test=develop
      
      * optimize fc padding
      test=develop
      
      * fix test
      test=develop
      234060f8
  14. 25 Nov, 2019 2 commits
  15. 24 Nov, 2019 2 commits
    • D
      Refactor fetch handler (#21264) · 691ced87
      Dong Daxiang committed
      * fix fetch handler problem and refactor
      When a user defines a FetchHandler class, the user should initialize the handler
      with a variable dict. Each key of the variable dict is a user-defined name, and
      each value is a Variable generated from the Python API.
      
      For each fetch, the user should implement a handler function in which
      fetched_result_dict is available, so the user can access the fetched values
      with the user-defined keys.
      691ced87
    • Y
      Disable fusion_group pass for windows and mac. We will do some experiments on Linux first. (#21310) · c918788b
      Yiqun Liu committed
      * Disable fusion_group pass for windows and mac. We will do some experiments on Linux first.
      test=develop
      
      * Print the subgraph when the check fails.
      test=develop
      c918788b
  16. 22 Nov, 2019 1 commit
  17. 21 Nov, 2019 1 commit
    • T
      solve pslib core in stop worker (#21263) · 0d17c1b8
      Thunderbrook committed
      * general table
      
      * add sparse table
      test=develop
      
      * no cvm
      test=develop
      
      * add no_cvm
      test=develop
      
      * add note
      test=develop
      
      * code style
      test=develop
      
      * code style
      test=develop
      
      * code style
      test=develop
      
      * code style
      test=develop
      
      * code style
      test=develop
      
      * add key of optimizer
      test=develop
      
      * solve pslib stop core
      test=develop
      
      * barrier
      test=develop
      
      * add notes
      test=develop
      0d17c1b8
  18. 20 Nov, 2019 2 commits
    • T
      support general embedding params (#21217) · 349e82d6
      Thunderbrook committed
      * general table
      
      * add sparse table
      test=develop
      
      * no cvm
      test=develop
      
      * add no_cvm
      test=develop
      
      * add note
      test=develop
      
      * code style
      test=develop
      
      * code style
      test=develop
      
      * code style
      test=develop
      
      * code style
      test=develop
      
      * code style
      test=develop
      
      * add key of optimizer
      test=develop
      349e82d6
    • Y
      Enable generating code for a given subgraph. (#21126) · 6b1e1f0d
      Yiqun Liu committed
      * Enable generating code for a given subgraph.
      
      * Support sorting the subgraph.
      
      * Remove the rearrangement of expressions because we use the sorted subgraph directly.
      
      * Enable generating code for a subgraph which is composed of grad ops.
      
      * Use expression information to check the accuracy in unittest.
      
      * Separate load and store from computation expressions.
      test=develop
      
      * Improve the loading statements in generated codes.
      test=develop
      
      * Remove unused arguments from formal list.
      test=develop
      6b1e1f0d
  19. 19 Nov, 2019 1 commit
  20. 18 Nov, 2019 2 commits
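
The message of commit 691ced87 (Refactor fetch handler) describes a usage pattern: a handler is initialized with a dict that maps user-defined names to variables, and on each fetch a handler function receives a fetched_result_dict keyed by those same names. The sketch below illustrates only that pattern. The classes `FetchHandler` and `LossRecorder`, the helper `run_one_fetch`, and the placeholder variable `"loss_var"` are hypothetical stand-ins written for this example and are not Paddle's actual API or signatures.

```python
class FetchHandler:
    """Hypothetical stand-in for the base class the commit message describes."""

    def __init__(self, var_dict):
        # var_dict maps a user-defined name to a variable object.
        self.var_dict = dict(var_dict)

    def handler(self, fetched_result_dict):
        # Subclasses override this to consume the fetched values.
        raise NotImplementedError


class LossRecorder(FetchHandler):
    """Example user-defined handler that records every fetched value."""

    def __init__(self, var_dict):
        super().__init__(var_dict)
        self.seen = []

    def handler(self, fetched_result_dict):
        # Values are looked up with the same keys passed at construction.
        for name in self.var_dict:
            self.seen.append((name, fetched_result_dict[name]))


def run_one_fetch(fetch_handler, fetch_fn):
    """Simulate one fetch (assumed driver, not part of the commit).

    fetch_fn resolves a variable to its current value; the results are
    passed to the handler keyed by the user-defined names.
    """
    results = {name: fetch_fn(var) for name, var in fetch_handler.var_dict.items()}
    fetch_handler.handler(results)
    return results


recorder = LossRecorder({"loss": "loss_var"})
fetched = run_one_fetch(recorder, lambda var: 0.5)
```

Here the lambda stands in for whatever mechanism actually reads the variable's value. The point of the design, as the commit message states it, is that user code only ever deals with its own key names.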