提交 · cd59c10ce768ea2e782350d9cfb7720a218bb071 · BaiXuePrincess / Paddle

03 11月, 2022 8 次提交

W

Weight and bias's stop_gradient of BatchNorm must be True or False at the same time (#47634) · 21277904
由 wanghuancoder 提交于 11月 03, 2022

21277904
sparse attention kernel is used from 11.8 (#47594) · 7648f429
由 zhouweiwei2014 提交于 11月 03, 2022

7648f429

[CodeStyle][py2][U008] remove unnecessary args in `super()` (#47549) · 3de3e45e

由 Nyakku Shigure 提交于 11月 03, 2022

* [CodeStyle][py2][U008] remove unnecessary args in `super()`

* remove remained args

* revert changes in test_pylayer_op

* Revert "revert changes in test_pylayer_op"

This reverts commit ff185a9ae738afac3b0264f61bde6c6b7f72e7c4.

* revert some changes in example code

3de3e45e

S

fix gemm compute_type (#47613) · 954be40d
由 sneaxiy 提交于 11月 03, 2022

954be40d

[PHI] Migrate softmax kernel (#47339) · b8ae3858

由 Sławomir Siwek 提交于 11月 03, 2022

* add extra attr property set

* add type_info for all context

* add onednn context to all context

* fix context compile error

* simplify conv kernel args

* pass runtime attr into dev_ctx

* fix marco error

* clear conv_grad_kernel extra args

* merge conv_grad_grad into conv_grad

* clear conv2d_grad_grad extra attrs

* remove redundant imports

* migrate softmax

* clear yaml and eager extra attr

* fix conv1d error

* change to thread local

* fix npu compile failed

* try to fix windows compile failed

* add conv2d onednn phi kernel

* fix ci bugs (#36)

* fix compile bugs (#38)

* fix extra input transform bug (#39)

* support dynamic created attr (#40)

* reset extra info gen code

* rm conv_grad_grad kernel

* reimpl pass attr adapting

* add int attr support

* remove vector inputnames creating

* merge dev

* fix map at error

* adjust attribute

* adapt funcs to PHI
Co-authored-by: NChen Weihang <chenweihang@baidu.com>
Co-authored-by: NYuanRisheng <yuanrisheng@baidu.com>

b8ae3858

Z

[Sparse] Unified api args name (#47529) · f9a0605d
由 zhangkaihuo 提交于 11月 03, 2022

f9a0605d
[Zero-Dim] support input 0D Tensor for min/max/amin/amax/prod/logsumexp/all/any (#47501) · a7509ce3
由 zhouweiwei2014 提交于 11月 03, 2022

a7509ce3
Y

fix xpu ci bugs, test=kunlun (#47581) · da083436
由 YuanRisheng 提交于 11月 03, 2022

da083436

02 11月, 2022 8 次提交
- Z
  fix ci bug (#47583) · 0967506e
  由 zhangbo9674 提交于 11月 02, 2022
```
* fix ci bug

* test
```
  0967506e
- T
  
  fix amax/amin/max/min write overflow (#47570) · 6f7a80c3
  由 Tao Luo 提交于 11月 02, 2022
  
  6f7a80c3
- C
  Add storage properties into DenseTensor for supporting extra device properties (#47527) · 246fb841
  由 Chen Weihang 提交于 11月 02, 2022
```
* add storage properties for npu

* fix compile failed

* fix api name mismatch

* polish design
```
  246fb841
- Y
  [PHI]Standardise some C++ API (Part3) (#47532) · fe8c6796
  由 YuanRisheng 提交于 11月 02, 2022
```
* Standardise batch norm

* standardize conv3d and depwise_conv2d

* fix ci bugs
```
  fe8c6796
- [Zero-Dim] support input 0D Tensor for some binary api (#46909) · cad2e68d
  由 zhouweiwei2014 提交于 11月 02, 2022
  
  cad2e68d
- Z
  Support generating static code of high order grad op by yaml (#47511) · bafa890a
  由 zyfncg 提交于 11月 02, 2022
```
* support generating static code of high order grad op by yaml

* polish code
```
  bafa890a
- H
  [XPU] add int64 support for slice and subtract. (#47409) · 77395619
  由 houj04 提交于 11月 02, 2022
```
* [XPU] add int64 support for slice and subtract. test=kunlun

* try to fix xpu compile. test=kunlun

* try to fix xpu compile. test=kunlun

* try to fix xpu compile. test=kunlun

* remove unnecessary modification. test=kunlun
```
  77395619
- T
  Add build option for CUDNN Frontend API (#47524) · eb100c7b
  由 Tian Zheng 提交于 11月 02, 2022
```
* Add build option for CUDNN Frontend API

* Fix review comments

* Change namespace for cudnn_frontend.h
```
  eb100c7b
01 11月, 2022 8 次提交

S

[geometric] Optimize graph sample speed (#47531) · 2a932e55
由 Siming Dai 提交于 11月 01, 2022

2a932e55

Fix bugs in tranpose kernel (#47212) · ec7fe888

由 limingshu 提交于 11月 01, 2022

* first commit

* transpose_kernel_optimization

* first complishment of transpose op

* second commit

* refine code logics of tranpose_kernel

* refine transpose kernel

* first commit

* fix DtoD copy bugs for hip

* refine code according to the PR advice

* change dim to int64_t type.

* fix some type error

ec7fe888

Y
[PHI]Standardise some C++ API (Part2) (#47510) · 399047d7
由 YuanRisheng 提交于 11月 01, 2022
```
* standard_api

* add hardtanh
```
399047d7

[EinsumOp] Einsum support complex grad (#47514) · e930c576

由 xiongkun 提交于 11月 01, 2022

* Einsum Support Complex

* code fix

* add unittest for complex grad with einsum

* set rtol=1e-4

* fix

e930c576

W

remove unused-local-typedefs warning on linux (#47513) · 96f36962
由 Wang Xin 提交于 11月 01, 2022

96f36962

Generate static graph code for some activation ops by Yaml (part2) (#47440) · c5d99138

由 zyfncg 提交于 11月 01, 2022

* gene static graph code for ceil, expm1 op

* gene static graph code for some activation op

* fix bug

* revert doc of silu and logsigmoid

c5d99138

Adapting device-specific Extra Attributes for the PHI kernel (#46342) · c923e6c9

由 Chen Weihang 提交于 10月 31, 2022

* add extra attr property set

* add type_info for all context

* add onednn context to all context

* fix context compile error

* simplify conv kernel args

* pass runtime attr into dev_ctx

* fix marco error

* clear conv_grad_kernel extra args

* merge conv_grad_grad into conv_grad

* clear conv2d_grad_grad extra attrs

* clear yaml and eager extra attr

* fix conv1d error

* change to thread local

* fix npu compile failed

* try to fix windows compile failed

* add conv2d onednn phi kernel

* fix ci bugs (#36)

* fix compile bugs (#38)

* fix extra input transform bug (#39)

* support dynamic created attr (#40)

* reset extra info gen code

* rm conv_grad_grad kernel

* reimpl pass attr adapting

* add int attr support

* remove vector inputnames creating

* fix map at error

* Update paddle/phi/kernels/onednn/conv_grad_kernel.cc
Co-authored-by: NSławomir Siwek <slawomir.siwek@intel.com>

* remove useless extra attrs

* replace mkldnn_engine by onednn_engine
Co-authored-by: NYuanRisheng <yuanrisheng@baidu.com>
Co-authored-by: NSławomir Siwek <slawomir.siwek@intel.com>

c923e6c9

U

summer-ospp 2022: 飞桨PaddlePaddle Sparse Conv开发和优化: gather-gemm-scatter fuse (#46679) · 5158fa4f
由 umiswing 提交于 11月 01, 2022

5158fa4f

31 10月, 2022 6 次提交

Y
[PHI]Standardise some C++ API (#47385) · 60e0c506
由 YuanRisheng 提交于 10月 31, 2022
```
* standard api

* fix ci bugs

* fix ci bugs

* fix ce bugs
```
60e0c506

[Einsum] Einsum support repeated labels. (#47290) · 6e1c14e3

由 xiongkun 提交于 10月 31, 2022

* add unittest for einsum-v2-trace and diagonal

* repeat labels.

* einsum support repeated labels.

* forward is ok for diagonal and undiagonalized.
TODO: check backward is ok by our theorem.

* backward is ok!

* fix by PR suggestions.

* fix ci error

* fix ci error

* fix ci warning

6e1c14e3

R
[CustomDevice] GetCCLComm add custom device support (#47168) · 34d13d6a
由 ronnywang 提交于 10月 31, 2022
```
* [CustomDevice] GetCCLComm add custom device support

* update

* update

* update
```
34d13d6a

[ControlFlow] replace executor in run method of control flow ops with standalone_executor (#45696) · 3b219e5e

由 kangguangli 提交于 10月 31, 2022

* replace executor in conditional_block_op.run with standalone_executor

* add block_id as the argument of standalone executor's method run; add print for program

* fix scope bug about conditional block op

* fix bug: unnecessary return of fetch value

* fix typo

* fix: quantization will set variable persistable, and these variables must exist in global scope

* add interpretercore cache for conditional block op but not activate in default

* fix bug: local scope reuse for conditional block op

* reset scope when conditional block op runs

* fix typo

* fix typo and code style

* add build scope for conditional block op

* add skip for transfer_layout kernel

* refind code

* fix reset_scope

* fix reset_scope

* refine code

* refine code

* refine code

1. remove flag use in conditional_block_op
2. pass execution_config to BuildOpFuncList instead of individual parameter

* refine code

* remove the use of FLAGS_control_flow_use_new_executor_cache

* change FLAGS_control_flow_use_new_executor to false

3b219e5e

[Zero-Dim] support input 0D Tensor for reduce_sum/reduce_mean (#47219) · c8fc3379
由 zhouweiwei2014 提交于 10月 31, 2022

c8fc3379
W

remove boost compiler flags in flags.cmake (#47468) · 91096ae2
由 Wang Xin 提交于 10月 31, 2022

91096ae2

28 10月, 2022 2 次提交
- Z
  
  generate static graph code for some ops by yaml (#47416) · 17fb92b3
  由 zyfncg 提交于 10月 28, 2022
  
  17fb92b3
- Z
  Generate static graph code for some activation ops by Yaml (#47382) · 6baeb2d1
  由 zyfncg 提交于 10月 28, 2022
```
* generate static graph code for some activation op

* fix example code of cosh
```
  6baeb2d1
27 10月, 2022 7 次提交
- Z
  
  support prepare_data for selected_rows in c++ api (#47380) · 8775545a
  由 zyfncg 提交于 10月 27, 2022
  
  8775545a
- L
  make all cpp tests dynamic linked to libpaddle.so [except windows] (#47088) · 2096448b
  由 Leo Chen 提交于 10月 27, 2022
```
* make all cpp tests dynamic linked to libpaddle.so

* add comments

* keep old cc_test for some tests

* fix some ut

* make some ut use cc_test_old

* fix typos and fit for win32

* fix lib path

* fix some tests

* skip lite test

* fit for rocm

* fit for cinn

* fit for mac

* fit for win32

* skip inference ut

* skip  windows

* fix coverage
```
  2096448b
- H
  
  clean gelu cudnn (#47378) · 539f3006
  由 HongyuJia 提交于 10月 27, 2022
  
  539f3006
- H
  
  clean angle cudnn (#47375) · 4d5c8a69
  由 HongyuJia 提交于 10月 27, 2022
  
  4d5c8a69
- H
  
  clean abs cudnn (#47374) · 8607a180
  由 HongyuJia 提交于 10月 27, 2022
  
  8607a180
- J
  Update of PHI transpose_grad (#47311) · 493fbfd7
  由 Jacek Czaja 提交于 10月 27, 2022
```
* - halfway transforming transpose grad

- Fixes

- buildable

* - lint

* rerunning the process
```
  493fbfd7
- B
  fix reduce_any kernel data race on sharedMem (#47233) · 77dbb318
  由 Bo Zhang 提交于 10月 27, 2022
```
* fix reduce_any kernel data race on sharedMem

* use bit operation instead of div & mod

* unbranch

* modified according to PR comments
```
  77dbb318
26 10月, 2022 1 次提交

[MKLDNN] Delete mkldnn hard code of prior_box (#47068) · d78dd7ea

由 HongyuJia 提交于 10月 26, 2022

* remove prior_box mkldnn hard code

* add header file

* simplify PD_VISIT_TYPE

* decouple dependency between prior_box and density_prior_box

* fix pragma omp parallel error

* bypass #pragma omp_parallel_for error

* polish code

* remove visit_type headerfile

* polish codestyle

* polish codestyle

* try fix CI error

* add testcase, datatype=float64

* reset test_prior_box testcase

* add datacheck to DenseTensor

* update template name

* call prior_box with macro expand

d78dd7ea

BaiXuePrincess / Paddle 与 Fork 源项目一致

BaiXuePrincess / Paddle
与 Fork 源项目一致