1. 11 1月, 2022 1 次提交
    • Z
      【PTen】Add dot and matmul grad kernel in pten (#38713) · be817719
      zyfncg 提交于
      * refactor matmul directory in pten
      
      * fix merge conflict
      
      * add dot_grad kernel
      
      * add dot_grad kernel in pten
      
      * add matmul_grad kernel
      
      * update the code
      
      * delete useless code in fluid
      
      * fix some bug of running matmul grad kernel
      
      * fix merge conflict
      
      * refactor some code
      
      * refactor code
      be817719
  2. 10 1月, 2022 3 次提交
    • C
      move get expected kernel args into pten (#38825) · 3a23c1a2
      Chen Weihang 提交于
      3a23c1a2
    • Z
      [Unify Tensors PR #5] framework::Tensor inherits from DenseTensor,test=allcases (#38632) · 5c73a6ea
      Zhanlue Yang 提交于
      * Added shared_ptr<Allocation> member & corresponding interfaces to Storage
      
      * Removed original pten::Allocation from Storage and adjusted the interfaces accordingly
      
      * Fixed issues with storage offset
      
      * Used place to malloc allocation for TensorStorage
      
      * [Unify Tensors PR #3]Ported framework::Tensor interfaces to pten::DenseTensor
      
      * Fixed issues with place
      
      * Added comments
      
      * Moved mutable_data with stream argument to DenseTensor
      
      * Added set_offset interface
      
      * Fixed CI issues,test=allcases
      
      * [Unify Tensors PR #4] Port LoDTensor interfaces to DenseTensor
      
      * Removed friend class EigenTensor/EigenMatrix/EigenVector from Tensor
      
      * Modified framework::Tensor to inherit from DenseTensor
      
      * Reverted changes too pten_layout() interface
      
      * Removed friend classes
      
      * Rearranged cfunction calls from tensor.data<void>() to tensor.data()
      
      * Fixed CI issues
      
      * Fixed lite issues
      
      * Fixed data() interface issues,test=allcases
      
      * Resolved IsInitialized() issues
      
      * Fixed ResetHolder() issues
      
      * Fixed MKLDNN & Storage issues
      
      * Resolved ShareBufferWith() issues
      
      * Fixed LoD issues
      5c73a6ea
    • C
      Support setting infershape function for custom grad op (#38776) · 046553c7
      Chen Weihang 提交于
      * unify infer_shape func calling
      
      * support set grad infer shape fn for custom op
      
      * unify infershape in new executor and eager
      
      * remove todo comment
      
      * revert infershape in operator
      046553c7
  3. 31 12月, 2021 1 次提交
  4. 30 12月, 2021 1 次提交
    • F
      flags to choose kp kernel (#38455) · ed2cfecf
      Feng Xing 提交于
      This PR adds runtime flags run_kp_kernel, which choose which op to run for xpu2. There are two: dynamic linked and built from kp.
      ed2cfecf
  5. 27 12月, 2021 2 次提交
  6. 23 12月, 2021 1 次提交
  7. 20 12月, 2021 1 次提交
  8. 16 12月, 2021 2 次提交
  9. 14 12月, 2021 3 次提交
  10. 10 12月, 2021 3 次提交
  11. 09 12月, 2021 1 次提交
  12. 08 12月, 2021 2 次提交
  13. 07 12月, 2021 2 次提交
  14. 06 12月, 2021 3 次提交
  15. 03 12月, 2021 1 次提交
  16. 02 12月, 2021 1 次提交
  17. 01 12月, 2021 1 次提交
  18. 27 11月, 2021 1 次提交
    • A
      [NPU] reorganization for device API abstraction (#37110) · 72241a6a
      Aganlengzi 提交于
      * [NPU] reorganization for device API abstraction
      
      * [NPU] delete old files
      
      * [NPU] fix npu_collective_helper
      
      * [NPU] fix collective_helper
      
      * [NPU] fix ut
      
      * [NPU] mod memory allocation and hccl_helper
      
      * [NPU] fix place_type
      
      * [NPU] split enfoce.h
      
      * move acl* call into npu_info
      
      * merge conflict
      
      * fix merge
      
      * merge conflict
      
      * merge conflict
      72241a6a
  19. 26 11月, 2021 1 次提交
    • Z
      Added interface reset_grad_inplace_version (#37573) · dcb91fd7
      Zhanlue Yang 提交于
      reset_inplace_version removes all inplace related records to VarBase/VariableWrapper, the essential purpose of which is to let you use inplace operations as if using its non-inplaced version, which of course will cause unexpected consequences if not used with care.
      
      This is essentially a hack interface to satisfy one specific request
      dcb91fd7
  20. 25 11月, 2021 1 次提交
    • Z
      【PTen】Add fill_constant kernel using ScalarArray in pten (#37481) · a0d465f8
      zyfncg 提交于
      * add scalar and scalar_array
      
      * remove DenseTensor include from Scalar and ScalarArray
      
      * remove inner header from scalar_array
      
      * refactor the method of fill_constant and add some comment
      
      * add fill_constant kernel using ScalarArray
      
      * modify some prompt
      
      * remove fill_constant kernel with no shape
      a0d465f8
  21. 24 11月, 2021 1 次提交
    • 0
      [Dy2stat]support pure fp16 for dy2stat (#36944) · 52edad6a
      0x45f 提交于
      * run dy2stat pure fp16 in Linear model
      
      * no use self._pure_fp16_inputs
      
      * add test and fix Adam error in dy2stat pure fp16 training
      
      * use paddle.optimizer.Adam
      
      * run test in gpu
      
      * change test time for CI
      
      * enlarge atol for test_resnet_pure_fp16
      
      * refine code and enlarge atol
      
      * make custom_white_list and custom_black_list take effect for AMP and pure fp16
      
      * check tracer is not None
      
      * use default atol
      
      * change filter_size
      
      * change atol and add some NOTE
      52edad6a
  22. 23 11月, 2021 4 次提交
  23. 22 11月, 2021 3 次提交
    • Z
      Add backward function hook to dygraph (#37141) · 31344ab7
      Zhanlue Yang 提交于
      31344ab7
    • W
      Renamed Func and removed ENFORCE statement (#37348) · 2702af21
      Weilong Wu 提交于
      * Removed one ENFORCE statement
      
      * Changed func name to _share_buffer_to
      
      * Improve error reporting information
      
      * Updated the logic of _is_share_buffer_to func
      2702af21
    • C
      [PTen] Add variable transform to/from ptenTensor and add cast kernel (#36916) · 5caa6fc5
      chentianyu03 提交于
      * add cast kernel
      
      * add cast cuda kernel
      
      * add cast kernel
      
      * make cast kernel output dtype undefined
      
      * get cast dtype from vardesc
      
      * move cast to manipulation and add test case
      
      * add castinfershape
      
      * avoid reinitilaze variable
      
      * InitializeVariable support datatype
      
      * merge develop branch
      
      * fix merge bug
      
      * revert modify initializeVariable
      
      * revert modify on InitializeVariable
      
      * revert modify on InitializeVariable
      
      * mutable support reset dtype
      
      * enable make pten tensor from variable when def_arg.type is undefined
      
      * fix build pten ctx start_idx error
      
      * copy pten out tensor to variable
      
      * merge develop branch
      
      * fix non pten kernel cast failed
      
      * add reset allocation place for remake tensor
      
      * fix inplace realloc error
      
      * add mutable on pten kernles and remove unused cast files
      
      * rename function names
      
      * fix output type error
      
      * fix conflict with develop branch
      
      * set data type to variable with pten's dtype
      
      * fix test_cast_api type mismatch
      
      * densorTensro mutable_data support 0 bytes value
      
      * fix the inplace bug of reshape kernel
      
      * fix pten.backend != variable.place when moving storage, palce mismatch bug
      
      * fix conflict with develop branch
      
      * Fix bug of paddle::experimental::MovesStorage
      
      * fix ReMakePtenDenseTensor place mismatch bug
      
      * Revert "fix ReMakePtenDenseTensor place mismatch bug"
      
      This reverts commit 86336032f60b8a15eacd2c1ff2fa513f5d8dfd1a.
      
      * fix ReMakePtenDenseTensor place mismatch bug
      
      * reverts the set_lod interface, test=develop
      
      * modify by the review options
      
      * modify error message
      
      * add & for const input arguments
      
      * add reference in params
      
      * elementwise_sub add mutable_data
      
      * fix ResetHolderWithType check size bug
      
      * add dependence pten_tensor to test_cast_api object
      
      * remove unused code to pass ci coverage
      Co-authored-by: NChen Weihang <chenweihang@baidu.com>
      Co-authored-by: NYuanRisheng <yuanrisheng@baidu.com>
      Co-authored-by: Nshixiaowei02 <39303645+Shixiaowei02@users.noreply.github.com>
      5caa6fc5