1. 14 4月, 2023 1 次提交
    • G
      [phi] move sequence_pool to phi - Step 2 : sequence_pool_op (#52750) · b281b221
      gouzil 提交于
      * [phi] move sequence_pool kernel to phi
      
      * [phi] mv sequence_pooling to phi funcs
      
      * [phi] mv sequence_pooling_test
      
      * [phi] RollBACK `paddle/fluid/operators/sequence_ops/sequence_pool_op.cc`
      
      * [phi][funcs] fix mutable_data
      
      * [phi][funcs] fix mutable_data
      b281b221
  2. 08 2月, 2023 1 次提交
  3. 18 11月, 2022 1 次提交
  4. 11 10月, 2022 1 次提交
  5. 28 9月, 2022 1 次提交
    • C
      Remove the declaration of using Tensor in framework/tensor.h (#46432) · e12a905e
      Chen Weihang 提交于
      * remove needless using tensor
      
      * remove needless using tensor
      
      * resolve conflict
      
      * replace tensor using
      
      * fix format error
      
      * revert needless changing
      
      * fix rocm and npu compile error
      
      * fix cinn compile error
      
      * fix format error
      
      * fix mkldnn format error
      
      * fix mkldnn format error
      
      * fix cinn compile error
      
      * fix cinn compile error
      
      * fix cinn compile error
      
      * resolve conflict
      e12a905e
  6. 01 8月, 2022 1 次提交
    • L
      unify gpu context (#44740) · 86763023
      Leo Chen 提交于
      * remove cudaDeviceContext
      
      * remove more template
      
      * fix rocm compile
      
      * remove alias name CUDADeviceContext
      
      * fix compile
      
      * fix tests
      
      * revert changes
      86763023
  7. 26 6月, 2022 1 次提交
  8. 05 6月, 2022 1 次提交
  9. 22 2月, 2022 1 次提交
    • X
      change Vector to std::vector and provide MixVector class as a helper … (#39559) · 728c0624
      xiongkun 提交于
      * change Vector to std::vector and provide MixVector class as a helper wrapper class
      
      * solve the multi-gpu hang problem
      
      * remove the duplicate template instantialize
      
      * Copy vector to cpu
      
      * add CopyToCPU
      
      * xxx
      
      * final version: fix the problem of all reduce
      
      * remove mixvector dependence
      
      * fix
      
      * merge
      
      * fix code
      
      * fix by CI
      728c0624
  10. 20 2月, 2022 1 次提交
  11. 11 2月, 2022 1 次提交
  12. 03 12月, 2021 1 次提交
  13. 09 9月, 2020 1 次提交
  14. 10 2月, 2020 1 次提交
  15. 06 2月, 2020 1 次提交
    • Y
      Correct the use of DeviceContext in unittest sequence_pooling_test and... · 44b45b9f
      Yiqun Liu 提交于
      Correct the use of DeviceContext in unittest sequence_pooling_test and sequence_padding_test (#22456)
      
      * Add log in memory::Copy for debug purpose.
      
      * Change to use context in DeviceContextPool directly in sequence_pooling_test, instead to new one.
      
      * Change to use context in DeviceContextPool directly in sequence_padding_test, instead to new one.
      test=develop
      
      * Change the type of second_dim from size_t to int64_t.
      test=develop
      44b45b9f
  16. 20 9月, 2019 1 次提交
  17. 10 6月, 2019 1 次提交
    • Y
      Enable seq_pool op to accept len 0 input (#17284) · 33d1e565
      Yibing Liu 提交于
      * Enable seq_pool op to accept len 0 input
      
      test=develop
      
      * Update sequence_pool's api
      
      test=develop
      
      * Add more unittest cases for seq_pool op
      
      test=develop
      
      * Remove legacy comments
      
      test=develop
      
      * Don't use template in op maker
      
      test=develop
      33d1e565
  18. 27 11月, 2018 1 次提交
  19. 08 11月, 2018 1 次提交
  20. 05 11月, 2018 1 次提交
  21. 29 10月, 2018 1 次提交
    • J
      Rebase of seqpool-max optimization · 458b16f4
      Jacek Czaja 提交于
      test=develop
      
      - Added rough profiling
      
      - Profiled maxpool itself
      
      - First draft of max seqpool optimization (is_test added)
      
      - Added unit tests to seqpool
      
      - Cosmetic fixes
      
      - Fix to UT of Seq pool
      
      Disabled grad checking for sequence max pool when is_test is set to True
      
      -Cosmetic fix to comment
      
      test=develop
      
      - Fix to GPU build
      
      test=develop
      
      - yet another GPU fix for sequence max pool
      
      - Fix to comment
      
      test=develop
      
      - Change to API of sequence_pool
      
      test=develop
      
      - Yet another API spec change
      
      test=develop
      458b16f4
  22. 17 9月, 2018 1 次提交
  23. 03 9月, 2018 1 次提交
  24. 30 4月, 2018 1 次提交
  25. 27 4月, 2018 1 次提交
  26. 29 3月, 2018 1 次提交
    • D
      Speed/sequence op1 (#9217) · 8425c2c8
      dzhwinter 提交于
      * "add functors"
      
      * "remove old code"
      
      * "fix"
      
      * "fix ci"
      
      * "add details"
      
      * "fix ci"
      
      * "fix ci"
      
      * "fix ci"
      
      * "fix ci"
      
      * "remove unused code"
      8425c2c8
  27. 12 2月, 2018 1 次提交
  28. 10 2月, 2018 2 次提交
  29. 08 2月, 2018 1 次提交
  30. 31 1月, 2018 1 次提交
    • D
      Fix/lod (#7714) · ae7d1c1f
      dzhwinter 提交于
      * "Need to re-design LoD "
      
      * "add lod design"
      
      * "fix lod gpu ptr pointer"
      
      * "removed commented code"
      
      * "fix CI"
      
      * "remove set lod in pybind"
      
      * "fix style check"
      
      * "fix CI"
      
      * "fix long type template error"
      
      * "pybind reorder to use Place"
      
      * "fix ci"
      
      * "fix ci"
      
      * fix ci
      
      * "sperate as a new file"
      
      * "fix CI"
      
      * "fix ci"
      
      * small fix
      
      * "add test"
      
      * "fix adam op"
      
      * "fix lstmp op"
      
      * "fix adam op"
      
      * "follow comments"
      
      * "fix ci"
      ae7d1c1f
  31. 12 12月, 2017 1 次提交
    • Q
      Refine device context (#6433) · 61ec0b95
      QI JUN 提交于
      There are mainly following fixes:
      
      - take `DeviceContext` as the template parameter of math functors and OpKernel instead of `Place`
      - remove `eigen_device` interface in base class  `DeviceContext`
      - remove `GetEigenDevice` interface in `ExecutionContext` and base class `DeviceContext`
      - remove unused `platform::EigenDeviceConverter`
      - rename `REGISTER_OP_GPU_KERNEL` to `REGISTER_OP_CUDA_KERNEL`
      - rename `USE_GPU_ONLY_OP` to `USE_CUDA_ONLY_OP`
      61ec0b95
  32. 03 11月, 2017 1 次提交