提交 · cd2855b0626a4cce979ce58587c942b0b0304691 · 机器未来 / Paddle

31 12月, 2021 1 次提交

[MLU]support calling mlu op from python interface (#38292) · b6bf650a

由 fwenguang 提交于 12月 31, 2021

* [MLU]support calling mlu op from python interface

* [MLU]fix

* fix

* [mlu]fix mlu_places

* [mlu]fix required mlu

* fix

* [MLU]fix tensor copy

* [mlu] fix MLUPlace call path

b6bf650a

30 12月, 2021 1 次提交

flags to choose kp kernel (#38455) · ed2cfecf

由 Feng Xing 提交于 12月 30, 2021

This PR adds runtime flags run_kp_kernel, which choose which op to run for xpu2. There are two: dynamic linked and built from kp.

ed2cfecf

27 12月, 2021 2 次提交
- P
  fix accumulator bug when multiple inplace OPs are executed continuously (#38406) · 113c8b93
  由 pangyoki 提交于 12月 27, 2021
```
* fix accumulator bug

* fix unittest
```
  113c8b93
- S
  [BugFix]Fix bug in pfp16 in DataParallel (#38378) · e8e47581
  由 ShenLiang 提交于 12月 27, 2021
```
* fix bug in pfp16

* fix hip

* fix hip
```
  e8e47581
23 12月, 2021 1 次提交
- add new API: paddle.clone;Tensor.element_size;nn.utils.parameters_to_vector (#38020) · 0eb03ed7
  由 zhouweiwei2014 提交于 12月 23, 2021
```
* add new API: paddle.clone;Tensor.element_size;nn.utils.parameters_to_vector

* fix comment
```
  0eb03ed7
20 12月, 2021 1 次提交
- F
  
  [MLU]add mlu backend (#38207) · 76514a1f
  由 fwenguang 提交于 12月 20, 2021
  
  76514a1f
16 12月, 2021 2 次提交
- C
  
  add grad maker debug log (#38183) · a43d8e59
  由 chentianyu03 提交于 12月 16, 2021
  
  a43d8e59
- C
  pylayer support tuple/list type args and fix check args bug (#38146) · 861053eb
  由 chentianyu03 提交于 12月 16, 2021
```
* Revert "Revert "pylayer support tuple/list type args (#37727)" (#37956)"

This reverts commit d848ff04.

* move check args,kwargs before forward execute
```
  861053eb
14 12月, 2021 3 次提交
- A
  
  Add const in GetInput/OutputVarPtrs in InferShapeContext (#38066) · 22f14e74
  由 Aurelius84 提交于 12月 14, 2021
  
  22f14e74
- Y
  
  remove KernelName (#38082) · 8198cad7
  由 YuanRisheng 提交于 12月 14, 2021
  
  8198cad7
- Y
  [PTen] Reduce reshape kernel functions in pten (#38055) · a3c8abc7
  由 YuanRisheng 提交于 12月 14, 2021
```
* Reduce reshape kernel functions in pten

* delete notes

* fix bugs when compile
```
  a3c8abc7
10 12月, 2021 3 次提交
- P
  
  fix dygraph_grad_maker to support set_value (#38014) · dabf8152
  由 pangyoki 提交于 12月 10, 2021
  
  dabf8152
- K
  
  fix ndiv for npu (#37998) · 11c785a4
  由 kuizhiqing 提交于 12月 10, 2021
  
  11c785a4
- L
  
  revert flags_benchmark (#38005) · 26c44a86
  由 Leo Chen 提交于 12月 10, 2021
  
  26c44a86
09 12月, 2021 1 次提交
- J
  
  add ipu device p2 (#37840) · cb636a48
  由 jianghaicheng 提交于 12月 09, 2021
  
  cb636a48
08 12月, 2021 2 次提交
- C
  Revert "pylayer support tuple/list type args (#37727)" (#37956) · d848ff04
  由 chentianyu03 提交于 12月 08, 2021
```
This reverts commit a73064f2.
```
  d848ff04
- C
  
  add check whether tensor is inplace and leaf when calcute gradient (#37931) · 2c02a580
  由 chentianyu03 提交于 12月 08, 2021
  
  2c02a580
07 12月, 2021 2 次提交
- Z
  Buf fix for reset grad inplace version (#37811) · cf586021
  由 Zhanlue Yang 提交于 12月 07, 2021
```
* Debug

* Fixed issue with reset_grad_inplace_version when used with clear_gradient & cross-batch accumulation

* Rearranged interfaces

* Fixed ci issues
```
  cf586021
- L
  
  fix import error of GlooParallelContext (#37892) · 2b479e17
  由 Leo Chen 提交于 12月 07, 2021
  
  2b479e17
06 12月, 2021 3 次提交
- C
  
  pylayer support tuple/list type args (#37727) · a73064f2
  由 chentianyu03 提交于 12月 06, 2021
  
  a73064f2
- R
  
  Fix bug (#37868) · 1432e3d2
  由 ronnywang 提交于 12月 06, 2021
  
  1432e3d2
- K
  
  heter for collective (#37613) · 1bdb8578
  由 kuizhiqing 提交于 12月 06, 2021
  
  1bdb8578
03 12月, 2021 1 次提交
- R
  refine structure for cuda and rocm (#37202) · a6d2fddb
  由 ronnywang 提交于 12月 03, 2021
```
* refine structure for cuda and rocm

* update

* update

* update

* update
```
  a6d2fddb
02 12月, 2021 1 次提交

[PTen]Make inplace_op and vector<DenseTensor> input compatible with old architecture (#37674) · c1fd1b1c

由 YuanRisheng 提交于 12月 02, 2021

* add inplace op adaptation

* optimize inplace logic and fix bugs when run kernel that has args of vector<DenseTensor>

* refactor logic that transform variable to densetensor

* update func name

c1fd1b1c

01 12月, 2021 1 次提交

Remove cpp layer (#37730) · 44def66a

由 Jiabin Yang 提交于 12月 01, 2021

* optimizer __call__ to make dygraph faster

* fix return type

* remove cpp Layer

44def66a

27 11月, 2021 1 次提交

[NPU] reorganization for device API abstraction (#37110) · 72241a6a

由 Aganlengzi 提交于 11月 27, 2021

* [NPU] reorganization for device API abstraction

* [NPU] delete old files

* [NPU] fix npu_collective_helper

* [NPU] fix collective_helper

* [NPU] fix ut

* [NPU] mod memory allocation and hccl_helper

* [NPU] fix place_type

* [NPU] split enfoce.h

* move acl* call into npu_info

* merge conflict

* fix merge

* merge conflict

* merge conflict

72241a6a

26 11月, 2021 1 次提交

Added interface reset_grad_inplace_version (#37573) · dcb91fd7

由 Zhanlue Yang 提交于 11月 26, 2021

reset_inplace_version removes all inplace related records to VarBase/VariableWrapper, the essential purpose of which is to let you use inplace operations as if using its non-inplaced version, which of course will cause unexpected consequences if not used with care.

This is essentially a hack interface to satisfy one specific request

dcb91fd7

25 11月, 2021 1 次提交

【PTen】Add fill_constant kernel using ScalarArray in pten (#37481) · a0d465f8

由 zyfncg 提交于 11月 25, 2021

* add scalar and scalar_array

* remove DenseTensor include from Scalar and ScalarArray

* remove inner header from scalar_array

* refactor the method of fill_constant and add some comment

* add fill_constant kernel using ScalarArray

* modify some prompt

* remove fill_constant kernel with no shape

a0d465f8

24 11月, 2021 1 次提交

[Dy2stat]support pure fp16 for dy2stat (#36944) · 52edad6a

由 0x45f 提交于 11月 24, 2021

* run dy2stat pure fp16 in Linear model

* no use self._pure_fp16_inputs

* add test and fix Adam error in dy2stat pure fp16 training

* use paddle.optimizer.Adam

* run test in gpu

* change test time for CI

* enlarge atol for test_resnet_pure_fp16

* refine code and enlarge atol

* make custom_white_list and custom_black_list take effect for AMP and pure fp16

* check tracer is not None

* use default atol

* change filter_size

* change atol and add some NOTE

52edad6a

23 11月, 2021 4 次提交
- P
  fix inplace bug when the first grad_var(loss_grad) is inplace var (#37420) · ee1e1642
  由 pangyoki 提交于 11月 23, 2021
```
* fix inplace bug

* fix custom grad input error

* add unittest

* fix inplace bug
```
  ee1e1642
- Q
  [XPU] Reorganize xpu device codes in platform, test=develop (#37428) · 79800978
  由 Qi Li 提交于 11月 23, 2021
```
* [XPU] Reorganize xpu device codes in platform, test=develop

* fix xpu_header.h, test=develop
```
  79800978
- R
  [NPU] Added HCCL backend support in dygraph mode (#36285) · 83e55cff
  由 ronnywang 提交于 11月 23, 2021
```
* Added HCCL backend support in dynamic graph mode

* fix segmentation fault

* add ut
```
  83e55cff
- Z
  Bug fix for snapshotting VariableWrapper with initialized tensor but e… (#37410) · e58ac121
  由 Zhanlue Yang 提交于 11月 23, 2021
```
* Bug fix for snapshoting VariableWrapper with initialized tensor but empty allocation

* Added unittest for inplace&clear_gradient
```
  e58ac121
22 11月, 2021 3 次提交

Z

Add backward function hook to dygraph (#37141) · 31344ab7
由 Zhanlue Yang 提交于 11月 22, 2021

31344ab7

Renamed Func and removed ENFORCE statement (#37348) · 2702af21

由 Weilong Wu 提交于 11月 22, 2021

* Removed one ENFORCE statement

* Changed func name to _share_buffer_to

* Improve error reporting information

* Updated the logic of _is_share_buffer_to func

2702af21

[PTen] Add variable transform to/from ptenTensor and add cast kernel (#36916) · 5caa6fc5

由 chentianyu03 提交于 11月 22, 2021

* add cast kernel

* add cast cuda kernel

* add cast kernel

* make cast kernel output dtype undefined

* get cast dtype from vardesc

* move cast to manipulation and add test case

* add castinfershape

* avoid reinitilaze variable

* InitializeVariable support datatype

* merge develop branch

* fix merge bug

* revert modify initializeVariable

* revert modify on InitializeVariable

* revert modify on InitializeVariable

* mutable support reset dtype

* enable make pten tensor from variable when def_arg.type is undefined

* fix build pten ctx start_idx error

* copy pten out tensor to variable

* merge develop branch

* fix non pten kernel cast failed

* add reset allocation place for remake tensor

* fix inplace realloc error

* add mutable on pten kernles and remove unused cast files

* rename function names

* fix output type error

* fix conflict with develop branch

* set data type to variable with pten's dtype

* fix test_cast_api type mismatch

* densorTensro mutable_data support 0 bytes value

* fix the inplace bug of reshape kernel

* fix pten.backend != variable.place when moving storage, palce mismatch bug

* fix conflict with develop branch

* Fix bug of paddle::experimental::MovesStorage

* fix ReMakePtenDenseTensor place mismatch bug

* Revert "fix ReMakePtenDenseTensor place mismatch bug"

This reverts commit 86336032f60b8a15eacd2c1ff2fa513f5d8dfd1a.

* fix ReMakePtenDenseTensor place mismatch bug

* reverts the set_lod interface, test=develop

* modify by the review options

* modify error message

* add & for const input arguments

* add reference in params

* elementwise_sub add mutable_data

* fix ResetHolderWithType check size bug

* add dependence pten_tensor to test_cast_api object

* remove unused code to pass ci coverage
Co-authored-by: NChen Weihang <chenweihang@baidu.com>
Co-authored-by: NYuanRisheng <yuanrisheng@baidu.com>
Co-authored-by: Nshixiaowei02 <39303645+Shixiaowei02@users.noreply.github.com>

5caa6fc5

16 11月, 2021 4 次提交
- C
  
  decrease pten log level (#37239) · d8982c52
  由 Chen Weihang 提交于 11月 16, 2021
  
  d8982c52
- W
  
  Removed unnecessary ENFORCE statement (#37219) · 70b7c7ed
  由 Weilong Wu 提交于 11月 16, 2021
  
  70b7c7ed
- Y
  Add API and unit test for reshape (#37232) · 79b49c20
  由 YuanRisheng 提交于 11月 16, 2021
```
* reshape kernel refactor

* fix compile bugs when run ci

* support xpu for reshape

* fix bugs when run unittest in kunlun ci

* fix compile bugs when run kunlun

* perfect code according to suggestion

* add api and unit test for reshape
```
  79b49c20
- Z
  for pure fp16 (#37230) · 6ebc318e
  由 zhangkaihuo 提交于 11月 16, 2021
```
Add pure fp16 support for fused transformer.
```
  6ebc318e

机器未来 / Paddle 与 Fork 源项目一致

机器未来 / Paddle
与 Fork 源项目一致