提交 · 3d1981ad87d91bb88c46d87e2c4df40812ce291f · BaiXuePrincess / Paddle

25 11月, 2022 1 次提交

[PROFILER] add flops for Profiler (#47766) · 3d1981ad

由 Chitsing KUI 提交于 11月 25, 2022

* attr ready

* op ip ready

* start dynamic

* end2end ok

* input shape to map, stat by op

* layer wip

* first version ready

* fix proto depds

* fix profiler deps

* fix flops typo, rm tuple shape

3d1981ad

23 11月, 2022 1 次提交
- Z
  
  add warpctc kernel and change cast_v2 to cast for xpu, test=kunlun (#48134) · 25ffe9c2
  由 zhangyikun02 提交于 11月 23, 2022
  
  25ffe9c2
18 11月, 2022 1 次提交
- Z
  
  cast and gradient_accumulator support double for xpu, test=kunlun (#47800) · 982d5ff7
  由 zhangyikun02 提交于 11月 18, 2022
  
  982d5ff7
16 11月, 2022 1 次提交
- L
  
  increase the level of some log (#47990) · 2f8901cb
  由 Leo Chen 提交于 11月 16, 2022
  
  2f8901cb
15 11月, 2022 1 次提交

mkldnn directory cleanup (#47779) · 8a339d24

由 Sławomir Siwek 提交于 11月 15, 2022

* cleanup unused code

* unify is_int8 is_bfloat16

* Simplify matmul_v2 FWD kernel

* remove RunKernel methods

* remove import namespace

* remove headers

* clean fluid/phi cross imports

* remove fluid axpy_handler

* delete fluid methods

* activations

* OneDNNMemDesc

* MKLDNNFormatForSize

* MatchShapeToLayout

* MKLDNNMemoryFormat

* MKLDNNFormat

* ReorderMKLDNNHandler

* to_void_cast

* review suggestions

* interpolate

* remove fluid depedency

8a339d24

10 11月, 2022 1 次提交
- Z
  
  fix amp cast bug for bn (#47802) · 5004c33a
  由 zhangbo9674 提交于 11月 10, 2022
  
  5004c33a
08 11月, 2022 1 次提交
- J
  removing dependent to fluid/framework/eigen.h in phi (#47675) · c7cd8d98
  由 jzhang533 提交于 11月 08, 2022
```
* removing dependent to fluid/framework/eigen.h in phi

* more fix according to PR-CI-Py3 fail
```
  c7cd8d98
07 11月, 2022 1 次提交

[Restore PR] Remove hard code of PADDLE_WITH_CUDA (#47630) · 908a381d

由 HongyuJia 提交于 11月 07, 2022

* move cudnn hardcode outside GetExpectedKernelType

* add header file

* debug

* update interpreter_util with hardcode

* update interpreter_util headerfile

* solve activation hardcode

* debug with CI

* add mkldnn_op_list header file

* temporarily uncomment mkldnn

* temporarily uncomment mkldnn

* delete sequence_softmax cudnn hardcode

* add hardcode to data_transfer.cc

* update data_transfer headerfile

* try fix segment fault

* update cudnn&miopen_helper

* reset HasAttr of DygraphExctnCtx

* debug, this commit should pass all CI

* debug should pass CI, temporarily disable activation

* debug should pass CI

* fix default_attr=nullptr bug

* clean debug code

* Call SetDnnFallback function in the base class

* activation fallback to plain kernel

* fix default GetExpectedKernelType find wrong kernel

* search cudnn kernel instead of fallback

* fix cudnn_handle bug

* remove tanh use_cudnn

* restore tanh use_cudnn

* debug tanh

* fix tanh bug

* delete activation cudnn kernel

* polish code

908a381d

04 11月, 2022 1 次提交
- Y
  
  fix deepfm and deep_wide bug, add embedding_sparse_grad kernel, test=kunlun (#47365) · f53e920d
  由 ykkk2333 提交于 11月 04, 2022
  
  f53e920d
03 11月, 2022 1 次提交

[Opt Kernel Selection] Opt CanMKLDNNBeUsed performance (#47563) · 9adad42d

由 HongyuJia 提交于 11月 03, 2022

* opt CanMKLDNNBeUsed performance

* fix nullptr bug

* fix OpBase default_attrs=nullptr bug

* fix OpBase default_attrs=nullptr bug

* fix OpBase default_attrs=nullptr bug

9adad42d

02 11月, 2022 1 次提交
- H
  Revert "[Kernel Selection] Remove hard code of PADDLE_WITH_CUDA (#47325)" (#47582) · a57a19ea
  由 HongyuJia 提交于 11月 02, 2022
```
This reverts commit f9134045.
```
  a57a19ea
01 11月, 2022 1 次提交

[Kernel Selection] Remove hard code of PADDLE_WITH_CUDA (#47325) · f9134045

由 HongyuJia 提交于 11月 01, 2022

* move cudnn hardcode outside GetExpectedKernelType

* add header file

* debug

* update interpreter_util with hardcode

* update interpreter_util headerfile

* solve activation hardcode

* debug with CI

* add mkldnn_op_list header file

* temporarily uncomment mkldnn

* temporarily uncomment mkldnn

* delete sequence_softmax cudnn hardcode

* add hardcode to data_transfer.cc

* update data_transfer headerfile

* try fix segment fault

* update cudnn&miopen_helper

* reset HasAttr of DygraphExctnCtx

* debug, this commit should pass all CI

* debug should pass CI, temporarily disable activation

* debug should pass CI

* fix default_attr=nullptr bug

* clean debug code

f9134045

31 10月, 2022 1 次提交
- C
  
  [MLU] fix compile error & add mlu blacklist function. (#47439) · bb6356e8
  由 Chenxiao Niu 提交于 10月 31, 2022
  
  bb6356e8
28 10月, 2022 2 次提交
- H
  
  [Dygraph] Finish fixing mem bugs of no sync in DataParallel (#47444) · e77c062e
  由 Haohongxiang 提交于 10月 28, 2022
  
  e77c062e
- H
  [Dygraph] Fix memory bugs of no sync and SplitTensors in DataParallel (#47369) · 57d5ffa5
  由 Haohongxiang 提交于 10月 28, 2022
```
* fix no sync bugs

* update

* update task chain

fix: update wait chain

feat: add `GetDeviceContext` for gloo

* fix oom

* fix dev

* update

* update
Co-authored-by: NLiYuRio <liyuruijx@163.com>
Co-authored-by: NForFishes <2282912238@qq.com>
```
  57d5ffa5
26 10月, 2022 1 次提交
- W
  fix uninitialized, tautological-constant-out-of-range-compare and... · 076c41ef
  由 Wang Xin 提交于 10月 26, 2022
```
fix uninitialized, tautological-constant-out-of-range-compare and literal-conversion warning on macos (#47341)
```
  076c41ef
25 10月, 2022 2 次提交
- W
  
  fix braced-scalar-init warnings on macos (#47309) · d8690564
  由 Wang Xin 提交于 10月 25, 2022
  
  d8690564
- H
  [Kernel Selection] Remove hard code of PADDLE_WITH_MKLDNN (Part2 add dnn_fallback flag) (#47200) · 6f5e7826
  由 HongyuJia 提交于 10月 25, 2022
```
* use dnn_fallback flag to delete mkldnn hardcode

* polish code style

* fix protected error

* fix const error

* fix reduce_op fallback

* fix pool_op fallback

* add Set function of dnn_fallback_
```
  6f5e7826
24 10月, 2022 1 次提交
- W
  [CodeStyle] fix macos inconsistent-missing-override warnings and add -Werror (#47264) · c5fe109b
  由 Wang Xin 提交于 10月 24, 2022
```
* fix macos inconsistent-missing-override warnings

* fix inconsistent-missing-override error in test
```
  c5fe109b
20 10月, 2022 1 次提交
- H
  
  opt mkldnn selection judgement (#47217) · 1ba592d6
  由 HongyuJia 提交于 10月 20, 2022
  
  1ba592d6
19 10月, 2022 1 次提交
- W
  
  fix old dygraph a vlog bug (#47115) · 3c39475d
  由 wanghuancoder 提交于 10月 19, 2022
  
  3c39475d
17 10月, 2022 1 次提交
- Y
  [PHI]Modify DataLayout's namespace from paddle::experimental to phi (#46869) · ec749398
  由 YuanRisheng 提交于 10月 17, 2022
```
* namespace modify

* update by comment
```
  ec749398
13 10月, 2022 1 次提交

[Kernel Selection] Remove hard code of PADDLE_WITH_MKLDNN (#46606) · ef1c8759

由 HongyuJia 提交于 10月 13, 2022

* remove PADDLE_WITH_MKLDNN, test white_list=abs

* fix unique_ptr

* fix op.Type()

* remove TODO in kernel_dispatch.h

* remove IndicateVarDataType function, update white_list

* remove mkldnn hard code

* add comments

* fix ==

* update mkldnn_op_list

* delete hard code of OPs

* update mkldnn_op_list

* update mkldnn_op_list, remove interp

* add error check for ExecutionContext

* update mkldnn_op_list, remove transpose2_grad

* remove interpolate mkldnn

* remove fill_constant mkldnn

* opt HasAttr in DygraphExecutionContext

* deprecated commit, test mkldnn_white_list

* deprecated commit, test mkldnn_white_list

* deprecated commit, test mkldnn_black_list

* update mkldnn_op_list, add assert error op

* solve cudnn related op

* fix error

* add mkldnn fallback in phi_utils.cc

* remove mkldnn fallback in phi_utils.cc

* opt code implementation

* polish Copyright License

ef1c8759

11 10月, 2022 2 次提交
- C
  Remove LoDTensor using in fluid (Part 1) (#46663) · 940d8f25
  由 Chen Weihang 提交于 10月 11, 2022
```
* remove using lodtensor part1

* polish history code format
```
  940d8f25
- N
  
  Update layout autotune for module with no modified (#46541) · 3da3462f
  由 niuliling123 提交于 10月 11, 2022
  
  3da3462f
10 10月, 2022 1 次提交

[PHI]Add RNN yaml (#46812) · ab60fd8b

由 YuanRisheng 提交于 10月 10, 2022

* add yaml entry for rnn and rrnn_grad, move infershape function for rnn_grad to phi infer meta

* WIP: move rnn kernrl to phi

* Change the code generation to avoid converting from intializer list to tuple of heterogeneous types.
This is only triggered when an api has intermediate outputs, and the result of the outputs are of heterogeneous types.

* fix the bug that when none in a vector of tensors requires gradient, the conversion to InferShapeContext to InferMetaContext (a.k.a. BuildInferMetaContext) produces errorous results.

* fix ci bugs

* fix ci bugs

* fix ci bugs

* modify code according comment
Co-authored-by: Nchenfeiyu <chenfeiyu@baidu.com>

ab60fd8b

30 9月, 2022 1 次提交
- H
  
  change mkldnn kernel layout, ALL_LAYOUT->ONEDNN (#46629) · abee2210
  由 HongyuJia 提交于 9月 30, 2022
  
  abee2210
28 9月, 2022 1 次提交

Remove the declaration of using Tensor in framework/tensor.h (#46432) · e12a905e

由 Chen Weihang 提交于 9月 28, 2022

* remove needless using tensor

* remove needless using tensor

* resolve conflict

* replace tensor using

* fix format error

* revert needless changing

* fix rocm and npu compile error

* fix cinn compile error

* fix format error

* fix mkldnn format error

* fix mkldnn format error

* fix cinn compile error

* fix cinn compile error

* fix cinn compile error

* resolve conflict

e12a905e

27 9月, 2022 1 次提交
- N
  
  Update for SetOutDataLayout when VarType is EagerVariable (#46515) · 52009d19
  由 niuliling123 提交于 9月 27, 2022
  
  52009d19
23 9月, 2022 1 次提交
- Y
  
  move selected_rows_functor (#46373) · b6c6f4f9
  由 YuanRisheng 提交于 9月 23, 2022
  
  b6c6f4f9
20 9月, 2022 1 次提交
- H
  [PolishComments] Polish some code comments (#46032) · 56f9452c
  由 HongyuJia 提交于 9月 20, 2022
```
* polish code comments

* polish data_device_transform.cc
```
  56f9452c
19 9月, 2022 1 次提交
- N
  
  Update layoutautotune for inplace (#45826) · 16439bb9
  由 niuliling123 提交于 9月 19, 2022
  
  16439bb9
15 9月, 2022 1 次提交
- N
  
  Revert "Fix argsort in XPU black list for XPU KP (#45975)" (#46064) · f3206b09
  由 niuliling123 提交于 9月 15, 2022
  
  f3206b09
13 9月, 2022 1 次提交
- N
  
  Fix argsort in XPU black list for XPU KP (#45975) · 2d45f68f
  由 niuliling123 提交于 9月 13, 2022
  
  2d45f68f
08 9月, 2022 1 次提交
- H
  
  polish code comment, test=doc (#45859) · 447d79da
  由 HongyuJia 提交于 9月 08, 2022
  
  447d79da
06 9月, 2022 1 次提交

[PHI]Add TensorArray for PHI (#45479) · 68f99b78

由 YuanRisheng 提交于 9月 06, 2022

* add tensor array

* fix ci bugs

* fix ci bugs

* fix ci bugs

* fix ci bugs

* update by comment

* update code

68f99b78

05 9月, 2022 1 次提交
- N
  
  Add eager layout autotune (#45409) · d7d9807e
  由 niuliling123 提交于 9月 05, 2022
  
  d7d9807e
01 9月, 2022 1 次提交
- S
  Lazy initialize dense_contents_ in reducer (#45631) · 196b0187
  由 sneaxiy 提交于 9月 01, 2022
```
* make dense_contents_ lazy init

* update legacy dygraph

* fix legacy dygraph bug
```
  196b0187
30 8月, 2022 1 次提交

Remove extra attribute in OpMaker (#44310) · fe321f9a

由 zyfncg 提交于 8月 30, 2022

* add runtime config in phi

* add runtime attr for op desc and op

* fix no proto error

* adjust opdesc set_attr impl

* try to remove conv_op extra attrs

* add init runtime attr map

* change extra header path

* fix runtime_attr

* fix trace_op

* fix bug of pass

* fix merge conflict

* fix dygraph attrs

* fix bug of pass

* fix dygraph bug

* fix unittest module

* delete extra attr default

* fix dropout kernel

* polish code

* fix extra output of instance_norm

* fix merge confilct

* fix op_desc bug

* add extra attr in yaml for conv3d_transpose

* don't remove extra input and output

* fix save_inference_model

* fix bug of batch_norm

* revert some change

* polish log

* polish code

* add code comment
Co-authored-by: NChen Weihang <chenweihang@baidu.com>

fe321f9a

29 8月, 2022 1 次提交
- A
  [OpAttr]num_rows/num_colums of eye support Tensor type (#45427) · b93b710a
  由 Aurelius84 提交于 8月 29, 2022
```
* [OpAttr]num_rows/num_colums of eye support Tensor type

* fix attr cast with long type
```
  b93b710a

BaiXuePrincess / Paddle 与 Fork 源项目一致

BaiXuePrincess / Paddle
与 Fork 源项目一致