1. 02 Dec, 2021 1 commit
  2. 30 Nov, 2021 3 commits
  3. 29 Nov, 2021 2 commits
    • [Pten] Add reduce mean kernel, replace with mean API (#37559) · f9e9fd19
      chentianyu03 committed
      * add pten reduce kernel
      
      * add reduce_sum kernel
      
      * update attribute args and order
      
      * make out dtype undefined
      
      * fix empty input error
      
      * merge develop branch
      
      * rename sum as reduce function
      
      * rename sum as reduce function
      
      * fix reducekernelImpl args error
      
      * add reduce cuda kernel
      
      * modify dims type to const &
      
      * remove unused log
      
      * fix reduce_all out eigen function error
      
      * remove unused codes
      
      * add the missing sum api define and testcase
      
      * merge develop branch
      
      * fix sum test axis value error
      
      * replace pten mean kernel with reduce_mean
      
      * recover mean cuda to original implementation
    • [Pten] add cuda implement of cast kernel (#37610) · 9956763e
      chentianyu03 committed
      * add cuda implement of cast kernel
      
      * remove bfloat16 when defined paddle_with_hip
  4. 26 Nov, 2021 2 commits
  5. 25 Nov, 2021 2 commits
  6. 24 Nov, 2021 3 commits
  7. 23 Nov, 2021 5 commits
  8. 22 Nov, 2021 4 commits
    • e87545ce
    • Refactor dygraph to eager (#37405) · a258badb
      Jiabin Yang committed
      * Add EagerTensor and tests
      
      * remove useless enforce
      
      * remove comment in cmake
      
      * fix test_error
      
      * add depends on python
      
      * Remove python.h
      
      * Merge develop and add Eager tensor with test back
    • [PTen] Add variable transform to/from ptenTensor and add cast kernel (#36916) · 5caa6fc5
      chentianyu03 committed
      * add cast kernel
      
      * add cast cuda kernel
      
      * add cast kernel
      
      * make cast kernel output dtype undefined
      
      * get cast dtype from vardesc
      
      * move cast to manipulation and add test case
      
      * add castinfershape
      
      * avoid reinitializing variable
      
      * InitializeVariable support datatype
      
      * merge develop branch
      
      * fix merge bug
      
      * revert modify initializeVariable
      
      * revert modify on InitializeVariable
      
      * revert modify on InitializeVariable
      
      * mutable support reset dtype
      
      * enable make pten tensor from variable when def_arg.type is undefined
      
      * fix build pten ctx start_idx error
      
      * copy pten out tensor to variable
      
      * merge develop branch
      
      * fix non pten kernel cast failed
      
      * add reset allocation place for remake tensor
      
      * fix inplace realloc error
      
      * add mutable on pten kernels and remove unused cast files
      
      * rename function names
      
      * fix output type error
      
      * fix conflict with develop branch
      
      * set data type to variable with pten's dtype
      
      * fix test_cast_api type mismatch
      
      * DenseTensor mutable_data support 0 bytes value
      
      * fix the inplace bug of reshape kernel
      
      * fix pten.backend != variable.place when moving storage, place mismatch bug
      
      * fix conflict with develop branch
      
      * Fix bug of paddle::experimental::MovesStorage
      
      * fix ReMakePtenDenseTensor place mismatch bug
      
      * Revert "fix ReMakePtenDenseTensor place mismatch bug"
      
      This reverts commit 86336032f60b8a15eacd2c1ff2fa513f5d8dfd1a.
      
      * fix ReMakePtenDenseTensor place mismatch bug
      
      * reverts the set_lod interface, test=develop
      
      * modify according to the review opinions
      
      * modify error message
      
      * add & for const input arguments
      
      * add reference in params
      
      * elementwise_sub add mutable_data
      
      * fix ResetHolderWithType check size bug
      
      * add dependence pten_tensor to test_cast_api object
      
      * remove unused code to pass ci coverage
      Co-authored-by: Chen Weihang <chenweihang@baidu.com>
      Co-authored-by: YuanRisheng <yuanrisheng@baidu.com>
      Co-authored-by: shixiaowei02 <39303645+Shixiaowei02@users.noreply.github.com>
    • modified the elementwise_op_broadcast and elementwise_op_impl for xpu2 (#37226) · 79ee6d63
      niuliling123 committed
      * modified the elementwise_op_broadcast and elementwise_op_impl for xpu2
  9. 20 Nov, 2021 1 commit
  10. 19 Nov, 2021 5 commits
  11. 18 Nov, 2021 1 commit
    • [PTen]elementwise_sub kernel refactor (#37260) · 36a95654
      YuanRisheng committed
      * elementwise_add kernel refactor
      
      * fix compile bugs in elementwise_add refactor
      
      * fix compile bugs when run in npu/xpu
      
      * fix bugs when run unit test
      
      * fix bugs when run ci-windows
      
      * modify code as recommended
      
      * code format adjust
      
      * fix bugs when run ci
      
      * fix compile bug when run in ci-windows
      
      * elementwise_sub refactor
      
      * add PD_DLL_DECL for elementwise_sub
      
      * fix bugs when compile
  12. 17 Nov, 2021 3 commits
  13. 16 Nov, 2021 2 commits
  14. 15 Nov, 2021 4 commits
    • [Pten] Refactor the implementation of custom operator (#37122) · 1e598f1a
      Chen Weihang committed
      * move extension into pten [no-verify]
      
      * append tensor methods by ext_tensor [no-verify]
      
      * append other tensor methods [no-verify]
      
      * ext related files tidy [no-verify]
      
      * include relation tidy [no-verify]
      
      * add pten tensor test [no-verify]
      
      * replace tensor in custom op & compile success
      
      * refine tensor constructor for unittest
      
      * custom relu jit run success
      
      * fix all custom op unittests
      
      * add inference cmake adapt [no-verify]
      
      * fix failed unittests
      
      * fix windows failed unittests
      
      * try to fix kunlun and inference failed
      
      * fix test_elementwise_api error
      
      * try to fix win compile failed
      
      * fix kunlun fp16 type error
      
      * remove useless handle error macro
      
      * add custom linear op test
      
      * fix compile failed & add win symbols
      
      * fix non pten kernel cast failed
      
      * add dll decl for api
      
      * polish several details
      
      * polish details by review comment
      
      * add dll_decl for register
    • remove needless declare (#37195) · 9c591703
      Chen Weihang committed
    • Added BF16 to mean op (#37104) · df7cc457
      arlesniak committed
      * Added BF16 to mean op
      
      * fix for CI
      
      * fix for CI
      
      * fix for CI
    • Optimize Matmul_v2 (#37037) · 444a7358
      Linjie Chen committed
      Optimize dot product of Matmul_v2 
  15. 14 Nov, 2021 1 commit
    • [PTen]Reshape Kernel Refactor (#37164) · 895692e3
      YuanRisheng committed
      * reshape kernel refactor
      
      * fix compile bugs when run ci
      
      * support xpu for reshape
      
      * fix bugs when run unittest in kunlun ci
      
      * fix compile bugs when run kunlun
      
      * improve code according to suggestions
  16. 12 Nov, 2021 1 commit