1. 31 12月, 2021 1 次提交
  2. 30 12月, 2021 1 次提交
    • F
      flags to choose kp kernel (#38455) · ed2cfecf
      Feng Xing 提交于
      This PR adds runtime flags run_kp_kernel, which choose which op to run for xpu2. There are two: dynamic linked and built from kp.
      ed2cfecf
  3. 27 12月, 2021 2 次提交
  4. 23 12月, 2021 1 次提交
  5. 20 12月, 2021 1 次提交
  6. 16 12月, 2021 2 次提交
  7. 14 12月, 2021 3 次提交
  8. 10 12月, 2021 3 次提交
  9. 09 12月, 2021 1 次提交
  10. 08 12月, 2021 2 次提交
  11. 07 12月, 2021 2 次提交
  12. 06 12月, 2021 3 次提交
  13. 03 12月, 2021 1 次提交
  14. 02 12月, 2021 1 次提交
  15. 01 12月, 2021 1 次提交
  16. 27 11月, 2021 1 次提交
    • A
      [NPU] reorganization for device API abstraction (#37110) · 72241a6a
      Aganlengzi 提交于
      * [NPU] reorganization for device API abstraction
      
      * [NPU] delete old files
      
      * [NPU] fix npu_collective_helper
      
      * [NPU] fix collective_helper
      
      * [NPU] fix ut
      
      * [NPU] mod memory allocation and hccl_helper
      
      * [NPU] fix place_type
      
      * [NPU] split enfoce.h
      
      * move acl* call into npu_info
      
      * merge conflict
      
      * fix merge
      
      * merge conflict
      
      * merge conflict
      72241a6a
  17. 26 11月, 2021 1 次提交
    • Z
      Added interface reset_grad_inplace_version (#37573) · dcb91fd7
      Zhanlue Yang 提交于
      reset_inplace_version removes all inplace related records to VarBase/VariableWrapper, the essential purpose of which is to let you use inplace operations as if using its non-inplaced version, which of course will cause unexpected consequences if not used with care.
      
      This is essentially a hack interface to satisfy one specific request
      dcb91fd7
  18. 25 11月, 2021 1 次提交
    • Z
      【PTen】Add fill_constant kernel using ScalarArray in pten (#37481) · a0d465f8
      zyfncg 提交于
      * add scalar and scalar_array
      
      * remove DenseTensor include from Scalar and ScalarArray
      
      * remove inner header from scalar_array
      
      * refactor the method of fill_constant and add some comment
      
      * add fill_constant kernel using ScalarArray
      
      * modify some prompt
      
      * remove fill_constant kernel with no shape
      a0d465f8
  19. 24 11月, 2021 1 次提交
    • 0
      [Dy2stat]support pure fp16 for dy2stat (#36944) · 52edad6a
      0x45f 提交于
      * run dy2stat pure fp16 in Linear model
      
      * no use self._pure_fp16_inputs
      
      * add test and fix Adam error in dy2stat pure fp16 training
      
      * use paddle.optimizer.Adam
      
      * run test in gpu
      
      * change test time for CI
      
      * enlarge atol for test_resnet_pure_fp16
      
      * refine code and enlarge atol
      
      * make custom_white_list and custom_black_list take effect for AMP and pure fp16
      
      * check tracer is not None
      
      * use default atol
      
      * change filter_size
      
      * change atol and add some NOTE
      52edad6a
  20. 23 11月, 2021 4 次提交
  21. 22 11月, 2021 3 次提交
    • Z
      Add backward function hook to dygraph (#37141) · 31344ab7
      Zhanlue Yang 提交于
      31344ab7
    • W
      Renamed Func and removed ENFORCE statement (#37348) · 2702af21
      Weilong Wu 提交于
      * Removed one ENFORCE statement
      
      * Changed func name to _share_buffer_to
      
      * Improve error reporting information
      
      * Updated the logic of _is_share_buffer_to func
      2702af21
    • C
      [PTen] Add variable transform to/from ptenTensor and add cast kernel (#36916) · 5caa6fc5
      chentianyu03 提交于
      * add cast kernel
      
      * add cast cuda kernel
      
      * add cast kernel
      
      * make cast kernel output dtype undefined
      
      * get cast dtype from vardesc
      
      * move cast to manipulation and add test case
      
      * add castinfershape
      
      * avoid reinitilaze variable
      
      * InitializeVariable support datatype
      
      * merge develop branch
      
      * fix merge bug
      
      * revert modify initializeVariable
      
      * revert modify on InitializeVariable
      
      * revert modify on InitializeVariable
      
      * mutable support reset dtype
      
      * enable make pten tensor from variable when def_arg.type is undefined
      
      * fix build pten ctx start_idx error
      
      * copy pten out tensor to variable
      
      * merge develop branch
      
      * fix non pten kernel cast failed
      
      * add reset allocation place for remake tensor
      
      * fix inplace realloc error
      
      * add mutable on pten kernles and remove unused cast files
      
      * rename function names
      
      * fix output type error
      
      * fix conflict with develop branch
      
      * set data type to variable with pten's dtype
      
      * fix test_cast_api type mismatch
      
      * densorTensro mutable_data support 0 bytes value
      
      * fix the inplace bug of reshape kernel
      
      * fix pten.backend != variable.place when moving storage, palce mismatch bug
      
      * fix conflict with develop branch
      
      * Fix bug of paddle::experimental::MovesStorage
      
      * fix ReMakePtenDenseTensor place mismatch bug
      
      * Revert "fix ReMakePtenDenseTensor place mismatch bug"
      
      This reverts commit 86336032f60b8a15eacd2c1ff2fa513f5d8dfd1a.
      
      * fix ReMakePtenDenseTensor place mismatch bug
      
      * reverts the set_lod interface, test=develop
      
      * modify by the review options
      
      * modify error message
      
      * add & for const input arguments
      
      * add reference in params
      
      * elementwise_sub add mutable_data
      
      * fix ResetHolderWithType check size bug
      
      * add dependence pten_tensor to test_cast_api object
      
      * remove unused code to pass ci coverage
      Co-authored-by: NChen Weihang <chenweihang@baidu.com>
      Co-authored-by: NYuanRisheng <yuanrisheng@baidu.com>
      Co-authored-by: Nshixiaowei02 <39303645+Shixiaowei02@users.noreply.github.com>
      5caa6fc5
  22. 16 11月, 2021 4 次提交