Commits · c90dd8435072889098a99a4d3f9606a8598f123c · PaddlePaddle / Paddle

03 Nov, 2022 2 commits
- Z
  
  int32/64 does not call backward in unittest (#47604) · c90dd843
  Committed by zhangkaihuo on Nov 03, 2022
  
  c90dd843
- Y
  
  fix xpu ci bugs, test=kunlun (#47581) · da083436
  Committed by YuanRisheng on Nov 03, 2022
  
  da083436
02 Nov, 2022 24 commits
- J
  
  remove functions not belong to public-api from __all__ (#47502) · 698128dd
  Committed by JYChen on Nov 02, 2022
  
  698128dd
- H
  Revert "[Kernel Selection] Remove hard code of PADDLE_WITH_CUDA (#47325)" (#47582) · a57a19ea
  Committed by HongyuJia on Nov 02, 2022
```
This reverts commit f9134045.
```
  a57a19ea
- Z
  [inference][trt] bilinear support OutSize input (#47495) · c061c082
  Committed by Zhang Jun on Nov 02, 2022
```
* add bilinear OutSize
```
  c061c082
- L
  
  fix link order (#47584) · 05a4be36
  Committed by Leo Chen on Nov 02, 2022
  
  05a4be36
- Z
  fix ci bug (#47583) · 0967506e
  Committed by zhangbo9674 on Nov 02, 2022
```
* fix ci bug

* test
```
  0967506e
- Z
  Support NHWC layout in GroupNorm (#47533) · 2dc3d40c
  Committed by Zhang Zheng on Nov 02, 2022
```
* Support NHWC layout in GroupNorm

* fix cteset
```
  2dc3d40c
- T
  add cuda117 dockerfile (#47412) · c79ae02b
  Committed by tianshuo78520a on Nov 02, 2022
```
* add cuda117 dockerfile; test=cuda117

* notest;test=cuda117

* test=cuda117

* test=document_fix
```
  c79ae02b
- 丁
  
  Logsigmoid and Tanhshrink ops convert to trt (#47322) · b045fdfb
  Committed by 丁一 on Nov 02, 2022
  
  b045fdfb
- H
  
  rename fw_bw func name of interleave pp (#47571) · dac1087e
  Committed by Haohongxiang on Nov 02, 2022
  
  dac1087e
- R
  Dispatch computation OPs before communication in standalone executor (#47471) · 5ed487bf
  Committed by Ruibiao Chen on Nov 02, 2022
```
* Dispath computation OPs before communication in standalone executor

* Update code

* Fix CI errors
```
  5ed487bf
- T
  
  fix amax/amin/max/min write overflow (#47570) · 6f7a80c3
  Committed by Tao Luo on Nov 02, 2022
  
  6f7a80c3
- C
  Add phi core file into ci checking list (#47564) · 2d058cce
  Committed by Chen Weihang on Nov 02, 2022
```
* add phi core file into ci list, test=document_fix

* remove repated file, test=document_fix
```
  2d058cce
- C
  Add storage properties into DenseTensor for supporting extra device properties (#47527) · 246fb841
  Committed by Chen Weihang on Nov 02, 2022
```
* add storage properties for npu

* fix compile failed

* fix api name mismatch

* polish design
```
  246fb841
- Y
  [PHI]Standardise some C++ API (Part3) (#47532) · fe8c6796
  Committed by YuanRisheng on Nov 02, 2022
```
* Standardise batch norm

* standardize conv3d and depwise_conv2d

* fix ci bugs
```
  fe8c6796
- [Zero-Dim] support input 0D Tensor for some binary api (#46909) · cad2e68d
  Committed by zhouweiwei2014 on Nov 02, 2022
  
  cad2e68d
- L
  
  Fix TRT UT failures (#47488) · 623dce83
  Committed by Leo Chen on Nov 02, 2022
  
  623dce83
- K
  
  Remove redundant numpy import (#47483) · 20db5221
  Committed by Kevin吴嘉文 on Nov 02, 2022
  
  20db5221
- R
  Modify test file (#47544) · 4325da39
  Committed by risemeup1 on Nov 02, 2022
```
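* Modify the .gitignore file to ignore the ljd_sh file

* Fix the issue where changes to unit test files did not trigger precise testing

* Revert the .gitignore change

* Fix the issue where changes to unit tests did not trigger precise testing

* Rename variables so their meaning is easier to understand, test=coverage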
```
  4325da39
- Y
  Improve the tool for checking nan and inf, and support to compute the max, min... · ad39043f
  Committed by Yiqun Liu on Nov 02, 2022
```
Improve the tool for checking nan and inf, and support to compute the max, min and mean of output tensor. (#47095)

* Improve the tool for checking nan and inf, and support to compute the max, min and mean of output tensor.

* Add a FLAGS to control whether abort when meets inf/nan and polish codes.

* Fix unittest.

* Change the computing of mean.
```
  ad39043f
- S
  support unbalanced data for pipeline (#47199) · 99f60188
  Committed by ShenLiang on Nov 02, 2022
```
* add unbalanced data

* fix utest
```
  99f60188
- Z
  Support generating static code of high order grad op by yaml (#47511) · bafa890a
  Committed by zyfncg on Nov 02, 2022
```
* support generating static code of high order grad op by yaml

* polish code
```
  bafa890a
- H
  [XPU] add int64 support for slice and subtract. (#47409) · 77395619
  Committed by houj04 on Nov 02, 2022
```
* [XPU] add int64 support for slice and subtract. test=kunlun

* try to fix xpu compile. test=kunlun

* try to fix xpu compile. test=kunlun

* try to fix xpu compile. test=kunlun

* remove unnecessary modification. test=kunlun
```
  77395619
- Z
  
  fix sparse_attention unittest (#47547) · 75b73400
  Committed by zhangkaihuo on Nov 02, 2022
  
  75b73400
- T
  Add build option for CUDNN Frontend API (#47524) · eb100c7b
  Committed by Tian Zheng on Nov 02, 2022
```
* Add build option for CUDNN Frontend API

* Fix review comments

* Change namespace for cudnn_frontend.h
```
  eb100c7b
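
The nan/inf checking commit listed above (#47095, ad39043f) describes counting nan and inf values, reporting the max, min and mean of an output tensor, and a flag that controls whether to abort. The following is a minimal pure-Python sketch of that idea, not Paddle's implementation: the function names, the `abort_on_nan_inf` parameter, and the choice to compute statistics over finite entries only are all assumptions made for illustration.

```python
import math


def summarize(values):
    """Count nan/inf entries and report max, min and mean of the finite ones."""
    finite = [v for v in values if math.isfinite(v)]
    return {
        "num_nan": sum(1 for v in values if math.isnan(v)),
        "num_inf": sum(1 for v in values if math.isinf(v)),
        "max": max(finite) if finite else None,
        "min": min(finite) if finite else None,
        # Assumption: the mean is taken over finite entries only.
        "mean": sum(finite) / len(finite) if finite else None,
    }


def check_nan_inf(values, abort_on_nan_inf=True):
    """Return the statistics; raise if nan/inf is found and aborting is enabled.

    `abort_on_nan_inf` is a hypothetical stand-in for the flag the commit mentions.
    """
    stats = summarize(values)
    if abort_on_nan_inf and (stats["num_nan"] or stats["num_inf"]):
        raise FloatingPointError(f"found nan/inf in output: {stats}")
    return stats
```

With aborting disabled, the statistics are still returned, so a run can be inspected instead of stopped at the first bad value.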
01 Nov, 2022 14 commits

[CodeStyle][E711][E712] update flake8 config (#47465) · d38010e8
Committed by Nyakku Shigure on Nov 01, 2022
```
* [CodeStyle][E711][E712] update flake8 config

* empty commit, test=document_fix
```
d38010e8

[CodeStyle][E711] use `is`/`is not` for comparison with `None` (#47452) · a35a4a53

Committed by Nyakku Shigure on Nov 01, 2022

* [CodeStyle][E711] use `is`/`is not` for comparison with `None`

* `self.assertTrue($A is None)` -> `self.assertIsNone($A)`

* `self.assertTrue($A is not None)` -> `self.assertIsNotNone($A)`

* `self.assertFalse($A is None)` -> `self.assertIsNotNone($A)`

* `self.assertEqual($A, None)` -> `self.assertIsNone($A)`

* `self.assertNotEqual($A, None)` -> `self.assertIsNotNone($A)`

a35a4a53
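
The rewrites listed in this commit can be illustrated with a small sketch. The classes and function below are hypothetical and do not come from the Paddle code base; they only show why an identity check against `None` is preferred over `==`, and what the `assertIsNone` / `assertIsNotNone` forms look like in a test.

```python
import unittest


class AlwaysEqual:
    """Hypothetical class whose __eq__ makes `== None` misleading."""

    def __eq__(self, other):
        return True


def describe(name=None):
    # E711: compare with None by identity (`is` / `is not`), not `==` / `!=`.
    if name is None:
        return "unnamed"
    return f"named {name}"


class DescribeTest(unittest.TestCase):
    def test_missing_name(self):
        value = None
        # Before: self.assertTrue(value is None) or self.assertEqual(value, None)
        self.assertIsNone(value)
        self.assertEqual(describe(value), "unnamed")

    def test_present_name(self):
        value = "conv"
        # Before: self.assertTrue(value is not None) or self.assertNotEqual(value, None)
        self.assertIsNotNone(value)
        self.assertEqual(describe(value), "named conv")
```

An `AlwaysEqual()` instance compares equal to `None` under `==` but is not `None` under `is`, which is the kind of surprise the E711 rule guards against.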

fix dynamic link of xpu library (#47434) · 9d801855

Committed by Leo Chen on Nov 01, 2022

* refine comments,test=kunlun

* link xpu lib, test=kunlun

* add sleep for test, test=kunlun

* merge develop, fix compile, test=kunlun

* remove debug code, test=kunlun

* add dependency to avoid potential concurrency error, test=kunlun

9d801855

[Kernel Selection] Remove hard code of PADDLE_WITH_CUDA (#47325) · f9134045

Committed by HongyuJia on Nov 01, 2022

* move cudnn hardcode outside GetExpectedKernelType

* add header file

* debug

* update interpreter_util with hardcode

* update interpreter_util headerfile

* solve activation hardcode

* debug with CI

* add mkldnn_op_list header file

* temporarily uncomment mkldnn

* temporarily uncomment mkldnn

* delete sequence_softmax cudnn hardcode

* add hardcode to data_transfer.cc

* update data_transfer headerfile

* try fix segment fault

* update cudnn&miopen_helper

* reset HasAttr of DygraphExctnCtx

* debug, this commit should pass all CI

* debug should pass CI, temporarily disable activation

* debug should pass CI

* fix default_attr=nullptr bug

* clean debug code

f9134045


[Paddle Inference] add RegisterOutputHook interface (#47050) · db323927
Committed by Yuanle Liu on Nov 01, 2022

db323927

clean mkldnn headerfile (#47507) · a341bb8c
Committed by HongyuJia on Nov 01, 2022

a341bb8c

[geometric] Optimize graph sample speed (#47531) · 2a932e55
Committed by Siming Dai on Nov 01, 2022

2a932e55

support no_sync attr for params in DataParallel (#47536) · 32efda3d
Committed by Haohongxiang on Nov 01, 2022

32efda3d

Fix bugs in tranpose kernel (#47212) · ec7fe888

Committed by limingshu on Nov 01, 2022

* first commit

* transpose_kernel_optimization

* first complishment of transpose op

* second commit

* refine code logics of tranpose_kernel

* refine transpose kernel

* first commit

* fix DtoD copy bugs for hip

* refine code according to the PR advice

* change dim to int64_t type.

* fix some type error

ec7fe888

[PHI]Standardise some C++ API (Part2) (#47510) · 399047d7
Committed by YuanRisheng on Nov 01, 2022
```
* standard_api

* add hardtanh
```
399047d7

fix (#47537) · 957fbb02
Committed by shentanyue on Nov 01, 2022

957fbb02

[CodeStyle][E712] use `if cond`/`if cond is True` for comparison with `True` (#47464) · 5a2ab683

Committed by Nyakku Shigure on Nov 01, 2022

* [CodeStyle][E712] use `if cond`/`if cond is True` for comparison with `True`

* revert changes in fluid

* revert unrelated file

* revert changes in norm

* revert changes in auto_parallel_amp

* fix norm and auto_parallel_amp

* revert a typo fix due to fixed at #47477

5a2ab683
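
The two replacement forms named in this commit title, `if cond` and `if cond is True`, are not interchangeable in Python. The sketch below uses hypothetical function names (nothing here is taken from the Paddle sources) to show the difference: a truthiness test accepts any truthy value, while an identity test accepts only the `True` singleton.

```python
def pick_device(use_gpu):
    # E712: instead of `if use_gpu == True`, test truthiness directly.
    if use_gpu:
        return "gpu"
    return "cpu"


def pick_device_strict(use_gpu):
    # `is True` matches only the bool singleton, not truthy values such as 1.
    if use_gpu is True:
        return "gpu"
    return "cpu"
```

For a flag that is always a `bool`, both forms behave the same; for inputs such as `1` or a non-empty string they differ, so a mechanical rewrite needs a per-site check of which behavior is intended.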

Support custom stream for standalone executor (#47411) · e12b6c04

Committed by Ruibiao Chen on Nov 01, 2022

* [Auto Parallel] Improve the c++ dist attr

* [Auto Parallel] Modify test_program.py

* Support custom stream for standalone executor

Co-authored-by: NYulong Ao <aoyulong@baidu.com>

e12b6c04

[EinsumOp] Einsum support complex grad (#47514) · e930c576

Committed by xiongkun on Nov 01, 2022

* Einsum Support Complex

* code fix

* add unittest for complex grad with einsum

* set rtol=1e-4

* fix

e930c576

PaddlePaddle / Paddle synced successfully about 1 year ago