1. 11 Oct, 2022 1 commit
  2. 08 Sep, 2022 1 commit
    • [PHI] Migrate cast, clip+grad and pool+grad oneDNN kernels (#45775) · 1a929c31
      piotrekobi committed
      * gaussian random
      
      * mkldnn to onednn renaming
      
      * fix merge conflicts
      
      * remove fluid code
      
      * onednn renaming
      
      * Move classes from mkldnn_reuse.h to onednn_reuse.h
      
      * Migrate pool+grad, clip+grad and cast oneDNN kernels to PHI
      
      * Refactor grad kernels into separate files
      
      * Fix CI failures
      
      * Fix Codestyle
      
      * Implement reviewer suggestions
      
      * Add new lines after includes for readability
      Co-authored-by: Silv3S <slawomir.siwek@intel.com>
  3. 26 Jun, 2022 1 commit
  4. 05 Jun, 2022 1 commit
  5. 21 Apr, 2022 1 commit
  6. 14 Mar, 2022 1 commit
  7. 20 Feb, 2022 1 commit
  8. 19 Feb, 2022 1 commit
    • [Pten]Unify paddle/pten::framework::ddim into pten::ddim (#39614) · 2fe04264
      Aurelius84 committed
      * Unify paddle/pten::framework::ddim into pten::ddim
      
      * fix paddle namespace
      
      * compile successfully
      
      * fix npu src file
      
      * fix conflict
      
      * fix conflict
      
      * fix tensorrt compiler error
      
      * fix conflict
      
      * fix conflict
      
      * fix test file conflict
      
      * fix conflict
      
      * fix mlu file conflict
      
      * fix mlu file conflict
      
      * fix cinn header file conflict
      
      * fix conflict
      
      * fix conflict
      
      * fix conflict
      
      * fix conflict
  9. 15 Feb, 2022 1 commit
    • [PTen]Migrate proto::VarType outside of Pten (#39411) · 7e7e9404
      Aurelius84 committed
      * #1 migrate dist-related type()-> dtype()
      
      * move datatype function from pten -> fluid/framework
      
      * change type() in imperative into convert(dtype())
      
      * modify xx_tensor->type into xx_tensor->dtype
      
      * change the set_type interface and the caller
      
      * modify xx_tensor.type into xx_tensor.dtype
      
      * fix mutable_data(place, dtype())
      
      * change caller of mutable_data in pten and distributed
      
      * change the caller of mutable_data in fluid/framework
      
      * change the caller of mutable_data in imperative directory
      
      * mutable_data: inference
      
      * update the call of mutable_data
      
      * transfer MakePenScalarArray MakePtenScalar ResetHolderWithType
      
      * pass the compile. the next step is remove VarType in Pten
      
      * fix all and remove VarType from pten. success in linux. Next task is other platform
      
      * fix conflict with develop
      
      * fix compiled error
      
      * Fix reset conversion
      
      * fix conflict
      
      * fix compiled problem
      
      * fix typo
      
      * Fix << in tensor_utils.cc
      
      * fix type->dtype
      
      * fix unittest
      
      * fix tensor init constructor
      
      * fix DataTypeSize for BFloat16
      
      * fix code style
      
      * fix npu compiled error
      
      * fix npu
      
      * compile npu successfully
      
      * fix conflict
      
      * fix conflict
      Co-authored-by: xiongkun <xiongkun03@baidu.com>
  10. 24 Nov, 2021 1 commit
  11. 16 Nov, 2021 1 commit
  12. 05 Nov, 2021 1 commit
    • Disable pool&conv_transpose&quantize caching (#36695) · db6c00c4
      Jacek Czaja committed
      * - WIP
      
      - compilation fix
      
      - fix
      
      - fixes
      
      - fix
      
      - fix
      
      - fix again
      
      - fix
      
      - another fix
      
      - another compilation fix
      
      - fix
      
      - fix
      
      - fix
      
      - lint
      
      * - pool2d partially stripped from cache
      
      - pool2d partially stripped of caching
      
      * - compilation fix
      
      * - compilation fix
      
      * - Fix to UT of caching
      
      * - Enabling test_conv3d_mkldnn
      
      * - conv_transpose stripped of cache
      
      * - compilation fix
      
      * - fix
      
      * - fix
      
      * - compilation fix
      
      * - fix
      
      * Reverted disabling caching of conv2d
      
      * - compilation fix
      
      * - ut reverted
  13. 16 Jun, 2021 1 commit
  14. 19 May, 2021 1 commit
  15. 04 Feb, 2021 1 commit
  16. 25 Jan, 2021 1 commit
  17. 14 Dec, 2020 1 commit
  18. 26 Nov, 2020 1 commit
  19. 12 Oct, 2020 1 commit
  20. 24 Sep, 2020 1 commit
    • use iwyu clean include (#27267) · df43905f
      wanghuancoder committed
      * use iwyu clean include, test=develop, test=win
      
      * compilation error, test=develop
      
      * fix compilation error2, test=develop
      
      * fix compilation error3, test=develop
      
      * fix compilation error4, test=develop
      
      * fix compilation error5, test=develop
      
      * fix compilation error6, test=develop
      
      * fix compilation error7, test=develop
      
      * fix compilation error8, test=develop
      
      * fix compilation error8, test=develop
      
      * fix compilation error10, test=develop
      
      * fix compilation error11, test=develop
  21. 02 Sep, 2020 1 commit
  22. 26 May, 2020 2 commits
  23. 10 Dec, 2019 1 commit
    • MKL-DNN 1.0 Update (#20162) · e81f0228
      Adam committed
      * MKLDNN v1.0 rebase to Paddle 1.6
      test=develop
      
      * Add hacky paddle::string::to_string() implementation
      
      * vectorize<int64_t>() -> vectorize() cleanup
      test=develop
      
      * PADDLE_ENFORCE and void_cast fixes
      test=develop
      
      * Rebase changes
      test=develop
      
      * Cosmetics
      test=develop
      
      * Delete MKL from mkldnn.cmake
      test=develop
      
      * CMake debug commands
      test=develop
      
      * Delete MKLDNN_VERBOSE and rebase fixes
      test=develop
      
      * Rebase fixes
      test=develop
      
      * Temporarily disable int8 resnet101 vgg16 and vgg19 tests
      test=develop
      
      * Add libmkldnn.so.1 to python setup
      test=develop
      
      * Add libmkldnn.so.1 to inference_lib cmake after rebase
      test=develop
      
      * Post rebase fixes + FC int8 changes
      test=develop
      
      * Fix LRN NHWC
      test=develop
      
      * Fix NHWC conv3d
      test=develop
      
      * Windows build fix + next conv3d fix
      test=develop
      
      * Fix conv2d on AVX2 machines
      test=develop
  24. 29 Nov, 2019 1 commit
    • Add dygraph execution context (#20157) · ac854670
      hong committed
      * add_dygraph_execution_context
      
      * add dygraph infershape context and execution context; test=develop
      
      * fix imperative bug; test=develop
      
      * remove inputs outputs interface from execution context,
      because it has the same function as inputNames;
      test=develop
      
      * remove tracer_test ctest; test=develop
      
      * fix split op bug; test=develop
      
      * fix unitests bug; test=develop
      
      * fix distribute test bug; test=develop
      
      * fix ngraph compile bug; test=develop
      
      * fix grad maker bug; test=develop
      
      * fix load op bugs; test=develop
      
      * fix operator.cc construct bug; test=develop
      
      * remove useless name find in operator; test=develop
      
      * add tracer_test; test=develop
      
      * fix concat, split bug; test=develop
      
      * remove tracer_test unitest; test=develop
      
      * fix attribute check bug; test=develop
      
      * add test code to fix coverage; test=develop
      
      * remove useless code, change check backward input in engine; test=develop
      
      * unlock var type infer shape;test=develop
      
      * add ShareAllLoD api; test=develop
      
      * add dygraph infershape context unitest; test=develop
      
      * remove increase and decrease lod in dygraph; test=develop
      
      * add override; test=develop
      
      * fix increase decrease lod; test=develop
      
      * fix paddle_enforce; test=develop
      
      * disable lod op dygraph check; test=develop
      
      * fix paddle enforce error; test=develop
      
      * add comment for op_registry and OperatorBase; test=develop
      
      * optimize the comment of op_registry; test=develop
      
      * fix format of comment; test=develop
      
      * fix format of comment; test=develop
      
      * optimize the format of comment; test=develop
      
      * optimize the format of the comment; test=develop
      
      * optimize comment of op_registry; test=develop
  25. 07 Nov, 2019 1 commit
  26. 28 Sep, 2019 1 commit
  27. 26 Sep, 2019 1 commit
  28. 24 Sep, 2019 1 commit
    • - ReImplemented pooling fwd mkldnn (#19911) · 5b07ca9c
      Jacek Czaja committed
      - First implementation of BWD and FWD of pooling mkl-dnn
      
      - Compilation fix
      
      - Fix
      
      - Fix
      
       - Fix
      
      - Fix to crash
      
      - Compilation fix
      
      - Combined AcquireBackward with Fwd
      
      test=develop
  29. 14 Sep, 2019 1 commit
  30. 04 Sep, 2019 1 commit
  31. 03 Sep, 2019 1 commit
  32. 16 Jul, 2019 1 commit
    • [MKL-DNN] Reimplemented pool2d mkl-dnn to use Acquire API (#18585) · 71d883b8
      Jacek Czaja committed
      * - Added partial draft of pooling acquire
      
      - Workspace support
      
      - compilation fix
      
      - Added draft of pooling backward reimplementation
      
      - Segfault fix
      
      - reverted 'any' for diff_dst creation in pooling
      
      - Lint fixes
      
      test=develop
      
      - lint fixes
      
      test=develop
      
      - Further lint fixes
      
      test=develop
      
      * - Fixes after review
      
      test=develop
      
      * - Lint fixes
      
      test=develop
      
      * - Even more lint fixes
      
      test=develop
  33. 02 Jul, 2019 1 commit
  34. 28 Jun, 2019 1 commit
  35. 13 Jun, 2019 1 commit
    • Added unit test for QAT FP32 & INT8 comparison (#17814) · 78e93286
      Wojciech Uss committed
      * added unit test for QAT FP32 & INT8 comparison
      
      test=develop
      
      * enabled other models and updated filenames
      
      test=develop
      
      * added accuracy check and multiple batch handling
      
      test=develop
      
      * removed quantization_mkldnn_pass.py
      
      test=develop
      
      * cleanup
      
      test=develop
      
      * updated model paths
      
      test=develop
      
      * renamed tests without MKL-DNN
      
      test=develop
      
      * fix reusing mkldnn pool2d primitive
      
      test=develop
      
      * add performance measuring
      
      test=develop
      
      * fix accuracy statistics
      
      test=develop
      
      * removed non-mkldnn tests
      
      test=develop
      
      * added conv2d_depthwise->conv2d mkldnn transformation
      
      test=develop
      
      * format update
      
      test=develop
      
      * fixed creating key for pool2d grad
      
      test=develop
      
      * added pass
      
      * Fix the accuracy issue while using float precision to get the scale.
      
      test=develop
      
      * Fix the format issue when 'X' is not nchw.
      
      test=develop
      
      * removed output comparing and changed number of images
      
      test=develop
      
      * cmake and comment fix
      
      test=develop
      
      * updated acc threshold for QAT comparison tests
      
      test=develop
      
      * added OMP_NUM_THREADS setting
      
      test=develop
      
      * enable all QAT INT8 tests
      
      test=develop
      
      * restored upstream version of a file
      
      test=develop
      
      * modified directory names
      
      test=develop
  36. 27 Feb, 2019 1 commit
    • INT8 Pool kernel Key Creation Optimization. (#15883) · 6724be2b
      xiaolil1 committed
      * Optimize key creation of INT8 pool kernel to improve the performance of ResNet-50 and MobileNet, especially for latency.
      test=develop
      
      * Optimize key creation of pool fp32 grad.
      test=develop
  37. 22 Feb, 2019 1 commit
  38. 29 Jan, 2019 1 commit
  39. 10 Jan, 2019 1 commit