1. 30 11月, 2022 1 次提交
  2. 01 11月, 2022 1 次提交
    • L
      Fix bugs in tranpose kernel (#47212) · ec7fe888
      limingshu 提交于
      * first commit
      
      * transpose_kernel_optimization
      
      * first complishment of transpose op
      
      * second commit
      
      * refine code logics of tranpose_kernel
      
      * refine transpose kernel
      
      * first commit
      
      * fix DtoD copy bugs for hip
      
      * refine code according to the PR advice
      
      * change dim to int64_t type.
      
      * fix some type error
      ec7fe888
  3. 13 10月, 2022 1 次提交
  4. 28 9月, 2022 1 次提交
    • C
      Remove the declaration of using Tensor in framework/tensor.h (#46432) · e12a905e
      Chen Weihang 提交于
      * remove needless using tensor
      
      * remove needless using tensor
      
      * resolve conflict
      
      * replace tensor using
      
      * fix format error
      
      * revert needless changing
      
      * fix rocm and npu compile error
      
      * fix cinn compile error
      
      * fix format error
      
      * fix mkldnn format error
      
      * fix mkldnn format error
      
      * fix cinn compile error
      
      * fix cinn compile error
      
      * fix cinn compile error
      
      * resolve conflict
      e12a905e
  5. 26 6月, 2022 1 次提交
  6. 07 6月, 2022 1 次提交
  7. 05 6月, 2022 1 次提交
  8. 02 3月, 2022 1 次提交
    • H
      Move transpose to pten (#39327) · 7a857924
      hong 提交于
      * immigrate_transpose_to_pten cpu kernel only; test=develop
      
      * fix bug; test=develop
      
      * add transpose cuda api
      
      * bug fix;
      
      * fix bugs
      
      * fix bugs; test=develop
      
      * bug fix;
      
      * move transepose to pten; test=develop
      
      * fix bug; test=develop
      
      * fix bugs; test=develop
      
      * add transpose grad fp16 support; test=develop
      
      * fix bug; test=develop
      
      * fix npu bug; test=develop
      
      * fix nemul = 0 bug; test=develop
      
      * add fp16 support; test=develop
      
      * fix data type register bug; test=develop
      
      * fix transpose bug; test=develop
      
      * update transpose
      
      * fix transpose bug; test=develop
      
      * remove useless code; test=develop
      
      * remove useless code; test=develop
      
      * fix transpose alias bug; test=develop
      
      * polish code; test=develop
      
      * resolve confict; test=develop
      
      * resolve confilct; test=develop
      
      * recover prepared operator; test=develop
      
      * fix bug; test=develop
      
      * polish code; test=develop
      
      * fix bug; test=develop
      
      * fix bug; test=develop
      7a857924
  9. 20 2月, 2022 1 次提交
  10. 11 2月, 2022 1 次提交
  11. 17 9月, 2020 1 次提交
  12. 16 7月, 2020 1 次提交
    • H
      Add Support for SelectedRows for Transpose OP and Fix a Bug That SelectedRows... · d8fe517b
      Huihuang Zheng 提交于
      Add Support for SelectedRows for Transpose OP and Fix a Bug That SelectedRows Cannot be Supported in SimNet (#25536)
      
      This PR fixes a bug that SelectedRows cannot be supported in SimNet. The reason of this bug is that dygraph basic_engine didn't copy var's type when the var needs to be accumulated during backward. So when a var is SelectedRows and needs to be accumulated, like SimNet which calls net for two times, the var's type will be changed to default LoDTensor thus bug happens. To fix it, we just also copy the type.
      
      Without this PR, the accumulated SelectedRows parameters in dygraph will be changed into LoDTensor. So when we fixed the bug of supporting SelectedRows in SimNet, we found `test_imperative_lod_tensor_to_selected_rows` failed and threw the error that SelectedRows was not supported for Transpose OP. To fix it, too, this PR also added support for SelectedRows for Transpose OP.
      d8fe517b
  13. 13 5月, 2020 1 次提交
  14. 02 4月, 2020 1 次提交
  15. 08 11月, 2019 1 次提交
    • J
      Add transpose2 INT8 for mkl-dnn (#19424) · 77c20835
      joanna.wozna.intel 提交于
      * Add transpose2 INT8 for mkl-dnn
      
      test=develop
      
      * Fix test_transpose_int8_mkldnn
      
      test=develop
      
      * Revert "Merge branch 'develop' into transpose_int8_mkldnn_2"
      
      This reverts commit 34011bdb, reversing
      changes made to 2ce6473f.
      
      * Revert "Revert "Merge branch 'develop' into transpose_int8_mkldnn_2""
      
      This reverts commit 23754dd7.
      
      * Add template to TransposeMKLDNNHandler
      
      test=develop
      
      * Resolve conflict
      
      test=develop
      
      * Restore get_size and refactor
      
      test=develop
      77c20835
  16. 09 4月, 2018 1 次提交
  17. 12 2月, 2018 1 次提交
  18. 10 2月, 2018 2 次提交
  19. 26 12月, 2017 1 次提交
  20. 12 12月, 2017 1 次提交
    • Q
      Refine device context (#6433) · 61ec0b95
      QI JUN 提交于
      There are mainly following fixes:
      
      - take `DeviceContext` as the template parameter of math functors and OpKernel instead of `Place`
      - remove `eigen_device` interface in base class  `DeviceContext`
      - remove `GetEigenDevice` interface in `ExecutionContext` and base class `DeviceContext`
      - remove unused `platform::EigenDeviceConverter`
      - rename `REGISTER_OP_GPU_KERNEL` to `REGISTER_OP_CUDA_KERNEL`
      - rename `USE_GPU_ONLY_OP` to `USE_CUDA_ONLY_OP`
      61ec0b95
  21. 11 11月, 2017 1 次提交
  22. 28 9月, 2017 1 次提交
  23. 20 9月, 2017 1 次提交
  24. 19 9月, 2017 1 次提交
  25. 18 9月, 2017 1 次提交
  26. 11 9月, 2017 1 次提交
  27. 08 9月, 2017 2 次提交
  28. 06 9月, 2017 1 次提交