1. 07 11月, 2022 1 次提交
    • H
      [Restore PR] Remove hard code of PADDLE_WITH_CUDA (#47630) · 908a381d
      HongyuJia 提交于
      * move cudnn hardcode outside GetExpectedKernelType
      
      * add header file
      
      * debug
      
      * update interpreter_util with hardcode
      
      * update interpreter_util headerfile
      
      * solve activation hardcode
      
      * debug with CI
      
      * add mkldnn_op_list header file
      
      * temporarily uncomment mkldnn
      
      * temporarily uncomment mkldnn
      
      * delete sequence_softmax cudnn hardcode
      
      * add hardcode to data_transfer.cc
      
      * update data_transfer headerfile
      
      * try fix segment fault
      
      * update cudnn&miopen_helper
      
      * reset HasAttr of DygraphExctnCtx
      
      * debug, this commit should pass all CI
      
      * debug should pass CI, temporarily disable activation
      
      * debug should pass CI
      
      * fix default_attr=nullptr bug
      
      * clean debug code
      
      * Call SetDnnFallback function in the base class
      
      * activation fallback to plain kernel
      
      * fix default GetExpectedKernelType find wrong kernel
      
      * search cudnn kernel instead of fallback
      
      * fix cudnn_handle bug
      
      * remove tanh use_cudnn
      
      * restore tanh use_cudnn
      
      * debug tanh
      
      * fix tanh bug
      
      * delete activation cudnn kernel
      
      * polish code
      908a381d
  2. 02 11月, 2022 1 次提交
  3. 01 11月, 2022 1 次提交
    • H
      [Kernel Selection] Remove hard code of PADDLE_WITH_CUDA (#47325) · f9134045
      HongyuJia 提交于
      * move cudnn hardcode outside GetExpectedKernelType
      
      * add header file
      
      * debug
      
      * update interpreter_util with hardcode
      
      * update interpreter_util headerfile
      
      * solve activation hardcode
      
      * debug with CI
      
      * add mkldnn_op_list header file
      
      * temporarily uncomment mkldnn
      
      * temporarily uncomment mkldnn
      
      * delete sequence_softmax cudnn hardcode
      
      * add hardcode to data_transfer.cc
      
      * update data_transfer headerfile
      
      * try fix segment fault
      
      * update cudnn&miopen_helper
      
      * reset HasAttr of DygraphExctnCtx
      
      * debug, this commit should pass all CI
      
      * debug should pass CI, temporarily disable activation
      
      * debug should pass CI
      
      * fix default_attr=nullptr bug
      
      * clean debug code
      f9134045
  4. 25 10月, 2022 1 次提交
  5. 28 9月, 2022 1 次提交
    • C
      Remove the declaration of using Tensor in framework/tensor.h (#46432) · e12a905e
      Chen Weihang 提交于
      * remove needless using tensor
      
      * remove needless using tensor
      
      * resolve conflict
      
      * replace tensor using
      
      * fix format error
      
      * revert needless changing
      
      * fix rocm and npu compile error
      
      * fix cinn compile error
      
      * fix format error
      
      * fix mkldnn format error
      
      * fix mkldnn format error
      
      * fix cinn compile error
      
      * fix cinn compile error
      
      * fix cinn compile error
      
      * resolve conflict
      e12a905e
  6. 01 8月, 2022 1 次提交
    • L
      unify gpu context (#44740) · 86763023
      Leo Chen 提交于
      * remove cudaDeviceContext
      
      * remove more template
      
      * fix rocm compile
      
      * remove alias name CUDADeviceContext
      
      * fix compile
      
      * fix tests
      
      * revert changes
      86763023
  7. 26 6月, 2022 1 次提交
  8. 20 1月, 2022 1 次提交
  9. 03 12月, 2021 1 次提交
  10. 02 6月, 2021 1 次提交
  11. 27 5月, 2021 1 次提交
  12. 29 3月, 2021 1 次提交
  13. 07 12月, 2020 1 次提交
  14. 03 11月, 2020 1 次提交
    • G
      Add rnn_op (#28197) · 9a600df3
      Guo Sheng 提交于
      * Add rnn_op.
      test=develop
      
      * Fix rnn_op grad maker's drop_empty_grad.
      test=develop
      9a600df3
  15. 24 9月, 2020 1 次提交
    • W
      use iwyu clean include (#27267) · df43905f
      wanghuancoder 提交于
      * use iwyu clean include, test=develop, test=win
      
      * compilation error, test=develop
      
      * fix compilation error2, test=develop
      
      * fix compilation error3, test=develop
      
      * fix compilation error4, test=develop
      
      * fix compilation error5, test=develop
      
      * fix compilation error6, test=develop
      
      * fix compilation error7, test=develop
      
      * fix compilation error8, test=develop
      
      * fix compilation error8, test=develop
      
      * fix compilation error10, test=develop
      
      * fix compilation error11, test=develop
      df43905f
  16. 18 9月, 2020 1 次提交
  17. 15 9月, 2020 1 次提交
  18. 07 9月, 2020 1 次提交
  19. 15 7月, 2020 1 次提交
  20. 07 7月, 2020 1 次提交
  21. 06 2月, 2020 1 次提交
  22. 07 1月, 2020 1 次提交
  23. 18 11月, 2019 1 次提交
    • Z
      Fix warn of gcc8 (#21205) · cdb3d279
      Zeng Jinle 提交于
      * fix warnings oof gcc 8 compilation, test=develop
      
      * fix boost::bad_get, test=develop
      
      * refine PADDLE_ENFORCE, test=develop
      cdb3d279
  24. 28 9月, 2019 1 次提交
    • L
      fix pool2d pool3d,support asymmetric padding and channel_last (#19739) · 24010472
      liym27 提交于
      * fix pool2d pool3d:
      1. support asymmetric padding;
      2. support padding algorithm:"SAME" and "VALID";
      3. support channel_last: data_format NHWC and NDHWC;
      4. support inferring shape when input with negative dims in compile time;
      5. change doc of python API and c++;
      6. fix bug in cuda kernel when Attr(adaptive) is true.
      
      test=develop,test=document_preview
      
      * fix 'tensors' to 'Tensors'. test=develop,test=document_preview
      
      * add test for converage ValueError.test=develop,test=document_preview
      
      * resolve conflict in test_pool2d. test=develop
      24010472
  25. 03 9月, 2019 1 次提交
  26. 25 12月, 2018 1 次提交
  27. 19 12月, 2018 1 次提交
  28. 22 11月, 2018 1 次提交
  29. 19 11月, 2018 1 次提交
  30. 16 11月, 2018 1 次提交
    • W
      Add cudnn ctc loss (#12366) · b32c13dc
      Wu Yi 提交于
      * add cudnn ctc loss
      
      * wip add test test=develop
      
      * wip
      
      * wip
      
      * done test=develop
      
      * move include cudnn test=develop
      
      * test test=develop
      
      * fix build test=develop
      
      * fix build test=develop
      
      * fix build on cudnn5 test=develop
      
      * fix cudnn5 build test=develop
      
      * fix cudnn5 build test=develop
      
      * merge develop softmax functor change test=develop
      b32c13dc
  31. 08 11月, 2018 1 次提交
  32. 02 11月, 2018 1 次提交
    • W
      Add affine grid generator op (#12238) · 0c319e0b
      whs 提交于
      * Add affine grid generator.
      
      * fix ffine grid.
      
      * Add unitest.
      
      * Add CPU kernel and fix unitest.
      
      * Fix CPU kernel.
      
      * Refine code.
      test=develop
      
      * Fix python api.
      test=develop
      
      * Update python api.
      test=develop
      
      * Fix comment.
      test=develop
      
      * Rename affine_grid_generator to affine_grid and enhence unitest.
      test=develop
      
      * Fix unitest.
      test=develop
      0c319e0b
  33. 29 10月, 2018 3 次提交
  34. 26 10月, 2018 1 次提交
  35. 16 10月, 2018 1 次提交
  36. 26 7月, 2018 1 次提交
  37. 06 6月, 2018 2 次提交