提交 · 9900ed5278cd2e3190cc9697bbde30f1b2179cf9 · PaddlePaddle / Paddle

30 9月, 2022 17 次提交
- H
  
  remove MKLDNN hard code in addmm (#46660) · 9900ed52
  由 HongyuJia 提交于 9月 30, 2022
  
  9900ed52
- Fix undefined reference PD_IntArrayGetElementCount (#46662) · 2055a1d2
  由 engineer1109 提交于 9月 30, 2022
```
* Fix undefined reference PD_IntArrayGetElementCount

* Delete PD_IntArrayGetSize Unused
```
  2055a1d2
- A
  [IPU] paddle-inference support custom-ops (#45235) · a6b4bee3
  由 Allen Guo 提交于 9月 30, 2022
```
* paddle-inference support custom-ops
Co-authored-by: NZhixin Yao <zhixiny@graphcore.ai>

* fix tolower
Co-authored-by: NZhixin Yao <zhixiny@graphcore.ai>
```
  a6b4bee3
- Z
  Optimize performance of depthwise_conv_bwd of filter (#46490) · 04eb211a
  由 Zhang Zheng 提交于 9月 30, 2022
```
* Optimize performance of depthwise_conv_bwd of filter

* op-benchmark

* fix

* op benchmark

* merge bwd
```
  04eb211a
- Z
  Optimize performance of depthwise_conv_bwd (#46362) · f17a73e9
  由 Zhang Zheng 提交于 9月 30, 2022
```
* Optimize performance of depthwise_conv_bwd

* fix
```
  f17a73e9
- C
  
  [MLU] fix phi::Tensor compile error of mlu. (#46649) · 2e231402
  由 Chenxiao Niu 提交于 9月 30, 2022
  
  2e231402
- [MLU] add_fluid_mluop_yolo_box (#46573) · 832b0a15
  由光明和真理提交于 9月 30, 2022
  
  832b0a15
- Y
  fix bugs of tipc, test=kunlun (#46540) · d16360c8
  由 ykkk2333 提交于 9月 30, 2022
```
* migrate sigmoid with cross entropy, and tile xpu kernels to phi, test=kunlun

* migrate add_n kernep to phi, test=kunlun

* fix bugs of tipc, test=kunlun
```
  d16360c8
- A
  add ipu related authors (#46634) · 678c200b
  由 Allen Guo 提交于 9月 30, 2022
```
* add ipu related authors, test=document_fix

* add gc, test=document_fix
```
  678c200b
- H
  [Opt Code] Opt GetExpectedKernelType code of conv_transpose_op (#46666) · 8c067ec1
  由 HongyuJia 提交于 9月 30, 2022
```
* opt GetExpectedKernelType code of conv_transpose_op

* fix if error
```
  8c067ec1
- H
  
  remove_dequantize_mkldnn_headerfile (#46665) · 7a1e1f99
  由 HongyuJia 提交于 9月 30, 2022
  
  7a1e1f99
- H
  
  change mkldnn kernel layout, ALL_LAYOUT->ONEDNN (#46626) · 22e81907
  由 HongyuJia 提交于 9月 30, 2022
  
  22e81907
- H
  
  change mkldnn kernel layout, ALL_LAYOUT->ONEDNN (#46628) · 4744cbc7
  由 HongyuJia 提交于 9月 30, 2022
  
  4744cbc7
- 六
  
  【Hackathon No.21】为 Paddle 新增 paddle.incubate.sparse.transpose 稀疏 API (#45849) · 2b879a69
  由六个骨头提交于 9月 30, 2022
  
  2b879a69
- H
  
  change mkldnn kernel layout, ALL_LAYOUT->ONEDNN (#46627) · 4b9dae01
  由 HongyuJia 提交于 9月 30, 2022
  
  4b9dae01
- H
  
  change mkldnn kernel layout, ALL_LAYOUT->ONEDNN (#46629) · abee2210
  由 HongyuJia 提交于 9月 30, 2022
  
  abee2210
- S
  support pure bfloat16 for more ops (#46364) · b7b231a6
  由 sneaxiy 提交于 9月 30, 2022
```
* support pure bfloat16

* support bf16 linear

* update PR to pass CI

* tiny fix where_grad_kernel.cu

* add bfloat16 to selu_grad to pass CI

* fix selu grad compilation error
```
  b7b231a6
29 9月, 2022 23 次提交
- C
  
  Optimize softmax's performance when dim_size >= 100000. (#46535) · 9012787f
  由 carryyu 提交于 9月 29, 2022
  
  9012787f
- X
  
  fix mpi include bug (#46601) · 7057093e
  由 Xinger 提交于 9月 29, 2022
  
  7057093e
- Z
  
  [AutoParallel] fix amp when predict (#46637) · 6bc855d8
  由 zhaoyingli 提交于 9月 29, 2022
  
  6bc855d8
- Z
  
  update docs for ResNetBasicBlock, test=kunlun (#44607) · 09569323
  由 zhangyikun02 提交于 9月 29, 2022
  
  09569323
- Z
  Move valid check from python to kernel (#46412) · 37bc2d7b
  由 Zhang Zheng 提交于 9月 29, 2022
```
* Move valid check from python to kernel

* fix error throw

* fix

* invalid label check

* fix

* Revert "fix"

This reverts commit 79fad6799cfa4b30423dbc84e67d7d843d22b84a.

* Revert "invalid label check"

This reverts commit 402a9707390ad5386b3222e85844b92d2e9b9fa4.

* Revert "fix"

This reverts commit 09ba3080ee0587447f875c19cdf060485f15ae3b.

* Revert "fix error throw"

This reverts commit a901bfcc2179d5c120ec29af766f392b122dab52.

* Revert "Move valid check from python to kernel"

This reverts commit baa03cc4ef82d8d45516c30dfb52bf5aead30748.

* final fix

* fix

* fix
```
  37bc2d7b
- Z
  [GPUPS]add afs OpenWriter (#46611) · c7d60ce4
  由 zmxdream 提交于 9月 29, 2022
```
* add afs OpenWriter

* update
```
  c7d60ce4
- Z
  [Hackathon No.18] 为 Paddle 新增 frexp API (#46401) · 1e2af54c
  由 Zheng_Bicheng 提交于 9月 29, 2022
```
* 之前的pr合并了大量错误代码，重新提交一份

* 之前的pr合并了大量错误代码，重新提交一份

* 修正格式问题

* 改回原来的格式

* 按照要求修改

* 按照要求修改格式

* 修复注释的问题

* 更新格式

* 测试自动格式化

* 修正英文注释

* fix docs build error

* pre-commit

* for docs build

* for docs build

* 修复mantissa计算错误的bug

* 修复误判exponent可能存在负数，导致计算量增加的情况
Co-authored-by: NLigoml <39876205+Ligoml@users.noreply.github.com>
```
  1e2af54c
- L
  Add index_select, index_select_grad, reduce_min kernel and their unittests for... · 9a1855ff
  由 Leo Guo 提交于 9月 29, 2022
```
Add index_select, index_select_grad, reduce_min kernel and their unittests for kunlun. Add registers of index_select, index_select_grad, reduce_min, sqrt, sqrt_grad to xpu2_op_list.test=kunlun. (#46557)
```
  9a1855ff
- N
  
  [CodeStyle][F401] update flake8 F401 config (unittest/asp,interpreter,autograd,collective) (#46616) · 98deee29
  由 Nyakku Shigure 提交于 9月 29, 2022
  
  98deee29
- N
  [CodeStyle][F401] remove unused imports in unittests/collective (#46615) · 0ef7a02f
  由 Nyakku Shigure 提交于 9月 29, 2022
```
* [CodeStyle][F401] remove unused import in unittests/collective

* empty commit, test=document_fix

* empty commit
```
  0ef7a02f
- C
  fix P40 topk: Make the optimized topk compatible with P40. (#46547) · 667082c0
  由 carryyu 提交于 9月 29, 2022
```
* fix P40 topk: Make the optimized topk compatible with P40.

* fix P40 topk: Make the optimized topk compatible with P40.

* fix P40 topk: Make the optimized topk compatible with P40.
```
  667082c0
- Y
  Remove calibration file path when deploy quantize model (#46283) · d71f1b3f
  由 yeliang2258 提交于 9月 29, 2022
```
* remove calibration file path

* remove useless code
```
  d71f1b3f
- [MLU] add mlu kernel for add_reduce_max_grad (#45651) · 1ef1cace
  由光明和真理提交于 9月 29, 2022
```
Co-authored-by: Nliupeiyu <liupeiyu@cambricon.com>
```
  1ef1cace
- Z
  [AutoParallel] fix reshard when train with eval (#46605) · 8e9c719d
  由 zhaoyingli 提交于 9月 29, 2022
```
* [AutoParallel] fix reshard when train with eval

* fix mppp
```
  8e9c719d
- M
  
  add register for strided_slice_grad (#46549) · 40ab6faf
  由 ming1753 提交于 9月 29, 2022
  
  40ab6faf
- N
  
  [CodeStyle][F401] remove unused import in unittests/{asp,autograd,interpreter} (#46376) · f6039929
  由 Nyakku Shigure 提交于 9月 29, 2022
  
  f6039929
- 傅
  
  fix uniform_rand_kernel FP16 support in dygraph mode (#46212) · ccab0e2a
  由傅剑寒提交于 9月 29, 2022
  
  ccab0e2a
- H
  [OptLayoutSelect] Select the highest priority layout (#46598) · 596d8209
  由 HongyuJia 提交于 9月 29, 2022
```
* select highest priority layout

* opt performance, save virtual table find
```
  596d8209
- H
  [Fix KernelKeyParser] Unify the logic of `operator()` in `KernelKeyParser` (#46560) · 4140d7ec
  由 HongyuJia 提交于 9月 29, 2022
```
* add datatype check for ParseKernelKeyByInputArgs

* polish error message

* Actually, einsum has vector<Tensor> inpute with DataType::COMPLEX64, see test_einsum_v2.py

* headerfile remove enforce.h
```
  4140d7ec
- Z
  Improve the python file annotation check strategy for precise testing (#46559) · 3e0a1765
  由 zhangbo9674 提交于 9月 29, 2022
```
* test

* test

* refine check pr is_comment chanege

* test
```
  3e0a1765
- R
  [CustomDevice] add to_static, amp ut (#46536) · acf785b6
  由 ronnywang 提交于 9月 29, 2022
```
* [CustomDevice] add to_static, amp ut

* update

* fix failed ut

* update
```
  acf785b6
- W
  [Eager, Performance optimization] support mod / matmul ( % and @ operator) to... · 7d7444cc
  由 Weilong Wu 提交于 9月 29, 2022
```
[Eager, Performance optimization] support mod / matmul ( % and @ operator) to sink to Cpp layer (#46565)

* [Eager, Performance optimization] support mod ( % operator) to sink to Cpp layer

* fix mod logic

* support matmul math operator

* rm LOG(warning), use VLOG(6)

* fix conflicts mistake
```
  7d7444cc
- H
  [XPU] update xpu cmake to 0928. (#46437) · 58a478f8
  由 houj04 提交于 9月 29, 2022
```
* [XPU] update xpu cmake to 0923. test=kunlun

* [XPU] update xpu cmake to 0928. test=kunlun
```
  58a478f8

PaddlePaddle / Paddle 大约 1 年 前同步成功

PaddlePaddle / Paddle
大约 1 年前同步成功