1. 05 12月, 2022 1 次提交
  2. 18 11月, 2022 1 次提交
  3. 11 11月, 2022 1 次提交
  4. 10 11月, 2022 1 次提交
  5. 02 11月, 2022 1 次提交
  6. 01 9月, 2022 1 次提交
  7. 05 6月, 2022 1 次提交
  8. 28 4月, 2022 1 次提交
  9. 03 4月, 2022 1 次提交
    • F
      add maximum limit for grid of index_select (#41127) · af8d2482
      FlyingQianMM 提交于
      * limit grid dim for index select
      
      * mv LimitGridDim into gpu_launch_config.h
      
      * fix conflicts
      
      * fix conflicts
      
      * fix code style
      
      * set block to 256
      
      * fix grid setting
      
      * set dtype of block_dim to unsigned int
      af8d2482
  10. 25 3月, 2022 1 次提交
  11. 07 3月, 2022 1 次提交
  12. 02 3月, 2022 1 次提交
  13. 26 2月, 2022 1 次提交
    • Z
      [Pten] Refactor the copy kernel (#39731) · 9a7b9eda
      zyfncg 提交于
      * remove SetAllocationForOutputTenosr
      
      * add place param for copy kernel
      
      * recover SetAllocationForOutputTenosr
      
      * polish code
      
      * fix empty_dev api bug
      
      * test=allcases
      
      * test=allcases
      
      * fix bug
      
      * recover empty
      
      * recover modify
      9a7b9eda
  14. 23 2月, 2022 1 次提交
  15. 22 2月, 2022 1 次提交
    • X
      change Vector to std::vector and provide MixVector class as a helper … (#39559) · 728c0624
      xiongkun 提交于
      * change Vector to std::vector and provide MixVector class as a helper wrapper class
      
      * solve the multi-gpu hang problem
      
      * remove the duplicate template instantialize
      
      * Copy vector to cpu
      
      * add CopyToCPU
      
      * xxx
      
      * final version: fix the problem of all reduce
      
      * remove mixvector dependence
      
      * fix
      
      * merge
      
      * fix code
      
      * fix by CI
      728c0624
  16. 20 2月, 2022 2 次提交
  17. 19 2月, 2022 1 次提交
    • A
      [Pten]Unify paddle/pten::framework::ddim into pten::ddim (#39614) · 2fe04264
      Aurelius84 提交于
      * Unify paddle/pten::framework::ddim into pten::ddim
      
      * fix paddle namespace
      
      * compile sucessfully
      
      * fix npu src file
      
      * fix conflict
      
      * fix conflict
      
      * fix tensorrt compiler error
      
      * fix conflict
      
      * fix conflict
      
      * fix tesst file conflict
      
      * fix conflict
      
      * fix mlu file conflict
      
      * fix mlu file conflict
      
      * fix cinn header file conflict
      
      * fix conflict
      
      * fix conflict
      
      * fix conflict
      
      * fix conflict
      2fe04264
  18. 11 2月, 2022 1 次提交
  19. 09 2月, 2022 2 次提交
  20. 06 2月, 2022 1 次提交
  21. 27 1月, 2022 2 次提交
  22. 25 1月, 2022 2 次提交
  23. 24 1月, 2022 2 次提交
    • Y
      [Pten]Refactor elementwise_add grad / double grad / triple grad Kernel and... · 3bf3a6ee
      YuanRisheng 提交于
      [Pten]Refactor elementwise_add grad / double grad / triple grad Kernel and move them to pten (#39048)
      
      * refactor elementwise add grad
      
      * fix compile bugs
      
      * fix unit test bugs
      
      * fix file conflicts
      
      * fix bugs when buildPtenContext
      3bf3a6ee
    • [Refactoring Tensor PR #5] replace storage with pten allocation (#39085) · a56e16a7
      石晓伟 提交于
      * updates callers, test=develop
      
      * updates tensor, test=develop
      
      * fixes errors, test=develop
      
      * remove some dtypes, test=develop
      
      * fix errors in the base storage modification, test=develop
      
      * fixes a bug, test=develop
      
      * fixes the bugs in push the whole, test=develop
      
      * updates, test=develop
      
      * update
      
      * update, test=develop
      
      * fixes the mac-py3 CI, test=develop
      
      * remove the storage impl, test=develop
      
      * updates some codes, test=develop
      
      * update, test=develop
      
      * updates pten allocation, test=develop
      a56e16a7
  24. 21 1月, 2022 1 次提交
  25. 18 1月, 2022 2 次提交
  26. 17 1月, 2022 1 次提交
    • W
      [Pten] Replace platform::Place to pten::Place. (#38899) · c48a9ad5
      Wilber 提交于
      * add pten::Place data structure.
      
      * update ci problem
      
      * fix ci problem
      
      * update
      
      * using platform::Place=pten::Place
      
      * remove BOOST_GET_CONST for CPUPlace and GPUPlace
      
      * compile pass 25%.
      
      * compile pass 45%
      
      * compile pass 60%
      
      * remove boost_get for xpu npu mlu and ipu
      
      * compile pass on cpu and gpu.
      
      * fix compile problem
      
      * fix compile error.
      
      * update
      
      * fix ci problem
      
      * update
      
      * ci approve
      
      * fix ci problem
      
      * fix ci eager test problem
      
      * remove BOOST_GET_CONST
      
      * fix npu compile
      c48a9ad5
  27. 14 1月, 2022 1 次提交
  28. 12 1月, 2022 2 次提交
  29. 11 1月, 2022 1 次提交
  30. 06 1月, 2022 1 次提交
  31. 29 12月, 2021 1 次提交
  32. 28 12月, 2021 1 次提交
  33. 27 12月, 2021 1 次提交
    • L
      Support multi-outputs feature for broadcast ops (#38329) · 89d38f55
      limingshu 提交于
      * No harm to KP
      
      * Pass the compile stage
      
      * change the WriteData function
      
      * fix template bugs and pass ctest of current elementwise
      
      * for passing partial template specialization of tempalte function in CI-ROCm
      
      * To make 'WriteData' funtion flexible.
      
      * a less harmful way to support multi-output
      
      * a less harmful way to support multi-output
      89d38f55