Commits · 4383494f012b6613aa65496e7892ae3f0052ddd9 · PaddlePaddle / Paddle

04 Jan, 2023 1 commit

[Unify KernelKey] change OpKernelType->KernelKey (#49138) · 4383494f

Committed by HongyuJia on Jan 04, 2023

* execute use kernel_key first

* change OpKernelType->KernelKey

* fix py3 compile error, remove redundant header files

* fix build_strategy_test

* fix DataType::RAW

* fix custom_type test: operator_test.cc

* fix transform place

* fix backends_are_same_class

* try fix place TransDataDevice

* support all KernelKey

* fix TransformData

* fix place_are_same_class

* fix merge

* fix test_params_no_grad

* fix specific place of GetExpectedKernelType

* fix specific place of GetExpectedKernelType

* fix GetKernelTypeForVar

* fix dtype error

* fix fetch_v2

* change GetKernelTypeForVar

* fix interpreter

* fix typo error

* polish codes

* polish codes

* polish codes

* fix conflict

4383494f

08 Dec, 2022 1 commit
- first commit (#38143) · 2e7c172c
  Committed by limingshu on Dec 08, 2022
  
  2e7c172c
07 Dec, 2022 1 commit
- [phi::DenseTensor] Replace Tensor with phi::DenseTensor (#48682) · 65420271
  Committed by 张春乔 on Dec 07, 2022
  
  65420271
28 Nov, 2022 1 commit
- replace LoDTensor with phi::DenseTensor in fluid\operators\*\ except sequence_ops (#48418) · 30a31a53
  Committed by 张春乔 on Nov 28, 2022
  
  30a31a53
18 Nov, 2022 1 commit
- [PHI decoupling] remove "gpu_primitives.h" in fluid (#48063) · 9918bf9c
  Committed by Wang Xin on Nov 18, 2022
```
* remove "gpu_primitives.h" in fluid namespace

* fix PR-CI-GpuPS fail

* fix PR-CI-GpuPS fail
```
  9918bf9c
31 Oct, 2022 1 commit
- remove boost compiler flags in flags.cmake (#47468) · 91096ae2
  Committed by Wang Xin on Oct 31, 2022
  
  91096ae2
26 Oct, 2022 1 commit
- clean mkldnn headerfile (#47362) · 436115cf
  Committed by HongyuJia on Oct 26, 2022
  
  436115cf
25 Oct, 2022 1 commit

[Kernel Selection] Remove hard code of PADDLE_WITH_MKLDNN (Part2 add dnn_fallback flag) (#47200) · 6f5e7826

Committed by HongyuJia on Oct 25, 2022

* use dnn_fallback flag to delete mkldnn hardcode

* polish code style

* fix protected error

* fix const error

* fix reduce_op fallback

* fix pool_op fallback

* add Set function of dnn_fallback_

6f5e7826

24 Oct, 2022 1 commit
- [CodeStyle] fix macos inconsistent-missing-override warnings and add -Werror (#47264) · c5fe109b
  Committed by Wang Xin on Oct 24, 2022
```
* fix macos inconsistent-missing-override warnings

* fix inconsistent-missing-override error in test
```
  c5fe109b
17 Oct, 2022 1 commit
- [PHI]Modify DataLayout's namespace from paddle::experimental to phi (#46869) · ec749398
  Committed by YuanRisheng on Oct 17, 2022
```
* namespace modify

* update by comment
```
  ec749398
11 Oct, 2022 2 commits
- [MLU] add masterparam support for mlu adamw. (#46804) · 7541579a
  Committed by Chenxiao Niu on Oct 11, 2022
  
  7541579a
- Remove LoDTensor using in fluid (Part 1) (#46663) · 940d8f25
  Committed by Chen Weihang on Oct 11, 2022
```
* remove using lodtensor part1

* polish history code format
```
  940d8f25
28 Sep, 2022 1 commit

Remove the declaration of using Tensor in framework/tensor.h (#46432) · e12a905e

Committed by Chen Weihang on Sep 28, 2022

* remove needless using tensor

* remove needless using tensor

* resolve conflict

* replace tensor using

* fix format error

* revert needless changing

* fix rocm and npu compile error

* fix cinn compile error

* fix format error

* fix mkldnn format error

* fix mkldnn format error

* fix cinn compile error

* fix cinn compile error

* fix cinn compile error

* resolve conflict

e12a905e

23 Sep, 2022 1 commit
- move selected_rows_functor (#46373) · b6c6f4f9
  Committed by YuanRisheng on Sep 23, 2022
  
  b6c6f4f9
22 Sep, 2022 1 commit
- [PHI] Migrate sgd and stack oneDNN kernels (#46374) · 4ae37aee
  Committed by Piotr Paturej on Sep 22, 2022
```
* Convert slice+grad oneDNN fluid kernels to PHI

* Change mutable_data to Alloc

* Refactor licences
```
  4ae37aee
15 Sep, 2022 1 commit
- [CodeStyle] trim trailing whitespace in .h, .cc, .cu, etc. (#46006) · 8dde7aea
  Committed by Nyakku Shigure on Sep 15, 2022
  
  8dde7aea
14 Sep, 2022 2 commits
- Fix DistributedFusedLAMB NaN problem (#46011) · 6833ecfe
  Committed by sneaxiy on Sep 14, 2022
```
* fix distributed_fused_lamb nan

* remove CUDA_ASSERT
```
  6833ecfe
- [MLU] add mergedAdam kernel. (#45965) · bf6ec262
  Committed by Chenxiao Niu on Sep 14, 2022
  
  bf6ec262
06 Sep, 2022 2 commits
- migrate deformable_conv and merged momentum kernels to phi, test=kunlun (#45691) · 7f3c7aeb
  Committed by ykkk2333 on Sep 06, 2022
  
  7f3c7aeb
- [XPU] rmsprop to phi. (#45734) · 1137677a
  Committed by houj04 on Sep 06, 2022
  
  1137677a
02 Sep, 2022 2 commits
- migrate shaple sgd, split,sign xpu kernels to phi, test=kunlun (#45607) · 3b9b4c34
  Committed by ykkk2333 on Sep 02, 2022
  
  3b9b4c34
- [XPU]Migrate Adam XPU kernel into Phi (#45572) · cbabbe2e
  Committed by Aurelius84 on Sep 02, 2022
```
* [XPU]Migrate Adam XPU kernel into Phi

* test=kunlun
```
  cbabbe2e
01 Sep, 2022 2 commits
- xpu-paddlepaddle-37 [Task] Migrate lamb to phi (#45520) · 1a0ef45e
  Committed by taixiurong on Sep 01, 2022
```
test=kunlun
```
  1a0ef45e
- [XPU]Migrate adamw XPU kernel into Phi (#45609) · f5a041e6
  Committed by Aurelius84 on Sep 01, 2022
```
* [XPU]Migrate adamw XPU kernel into Phi

* test=kunlun

* test=kunlun
```
  f5a041e6
31 Aug, 2022 1 commit
- Move XPU momentum to phi (#45565) · d7807806
  Committed by WangZhen on Aug 31, 2022
```
* Move XPU momentum to phi, test=kunlun

* Fix mu type, test=kunlun
```
  d7807806
24 Aug, 2022 1 commit

Support fp16 of adam operator in xpu environment (#45292) · a012d426

Committed by mengqingchun02 on Aug 24, 2022

* support beam_search operator on xpu. test=kunlun

* support beam_search operator on xpu. test=kunlun

* support beam_search operator on xpu. test=kunlun

* support beam_search operator on xpu. test=kunlun

* support beam_search operator on xpu. test=kunlun

* support fp16 of adam operator in xpu environment. test=kunlun

* support fp16 of adam operator in xpu environment. test=kunlun

* support fp16 of adam operator in xpu environment. test=kunlun

a012d426

19 Aug, 2022 1 commit

[XPU] add merged_momentum unittest and change momentum (#45241) · e0f1c9f2

Committed by dongfangshenzhu on Aug 19, 2022

* add merged_momentum *test=kunlun

* add merged_momentum *test=kunlun

* add fp16 to merged_momentum,*test=kunlun

* change dist_model.cc

* add merged_momentum unittest and  change momentum,test=kunlun

* add merged_momentum unittest and  change momentum,test=kunlun

* add merged_momentum unittest and  change momentum,test=kunlun

* add merged_momentum unittest and  change momentum,test=kunlun

e0f1c9f2

17 Aug, 2022 1 commit
- [MLU] fix copy error (#45194) · 75690584
  Committed by fwenguang on Aug 17, 2022
  
  75690584
08 Aug, 2022 1 commit
- move lamb_op to phi (#44899) · 4a7aa7c3
  Committed by Thomas Young on Aug 08, 2022
  
  4a7aa7c3
04 Aug, 2022 2 commits
- [XPU] add merged_momentum including fp32 and fp16 (#44824) · 4922376c
  Committed by dongfangshenzhu on Aug 04, 2022
```
* add merged_momentum *test=kunlun

* add merged_momentum *test=kunlun

* add fp16 to merged_momentum,*test=kunlun
```
  4922376c
- opt allreduce (#44843) · 1f9e2742
  Committed by sneaxiy on Aug 04, 2022
  
  1f9e2742
03 Aug, 2022 1 commit
- Add use_hierarchical_allreduce for DistributedFusedLAMB (#44821) · c770053c
  Committed by sneaxiy on Aug 03, 2022
```
* add use_hierarchical_allreduce

* support hierarchical allreduce for more cases
```
  c770053c
01 Aug, 2022 1 commit

unify gpu context (#44740) · 86763023

Committed by Leo Chen on Aug 01, 2022

* remove cudaDeviceContext

* remove more template

* fix rocm compile

* remove alias name CUDADeviceContext

* fix compile

* fix tests

* revert changes

86763023

29 Jul, 2022 1 commit
- add some fp16 op for kunlun resnet50 model (#44672) · fecbc958
  Committed by QingshuChen on Jul 29, 2022
```
* add some fp16 op for kunlun resnet50 model
*test=kunlun

* tmp
*test=kunlun
```
  fecbc958
27 Jul, 2022 1 commit
- [DCU] Fix NAN problem when training BERT on DCU platform (#44643) · 28aa0c61
  Committed by Yuang Liu on Jul 27, 2022
  
  28aa0c61
25 Jul, 2022 1 commit
- [Phi] Migrate squared_l2_norm_op to phi (#44492) · 3e170163
  Committed by lyq on Jul 25, 2022
  
  3e170163
22 Jul, 2022 1 commit
- add xpu lars_momentum/pow2_decay (#44448) · 8ccbb863
  Committed by QingshuChen on Jul 22, 2022
```
*test=kunlun
```
  8ccbb863
14 Jul, 2022 2 commits
- [operator migration] Migrate infer shape for merged momentum (#44338) · 246ac976
  Committed by Yuang Liu on Jul 14, 2022
  
  246ac976
- [operator migration] Migrate merged momentum cpu/gpu kernels (#44300) · d15b490a
  Committed by Yuang Liu on Jul 14, 2022
  
  d15b490a
13 Jul, 2022 1 commit
- fix cpu lars_momentum bug & add xpu grad_add/log_softmax/log_softmax_… (#44260) · d6d60cbc
  Committed by QingshuChen on Jul 13, 2022
```
* fix cpu lars_momentum bug & add xpu grad_add/log_softmax/log_softmax_grad
*test=kunlun

* minor
*test=kunlun
```
  d6d60cbc

PaddlePaddle / Paddle synced successfully about 1 year ago
