Commits · ecae7b31f87fca92cacacb922f8715ce50568a97 · PaddlePaddle / Paddle

30 Sep, 2022 · 19 commits
- Support both use_calc_stream and sync_op in allgather API (#46295) · ecae7b31
  Authored by Wen Sun on Sep 30, 2022
- Release memory cache after build_op_func_list in interpretercore (#46670) · 255890ff
  Authored by Ruibiao Chen on Sep 30, 2022
- opt GetExpectedKernelType code of fill_constant_op (#46667) · 136b1f42
  Authored by HongyuJia on Sep 30, 2022
- remove MKLDNN hard code in addmm (#46660) · 9900ed52
  Authored by HongyuJia on Sep 30, 2022
- Fix undefined reference PD_IntArrayGetElementCount (#46662) · 2055a1d2
  Authored by engineer1109 on Sep 30, 2022
```
* Fix undefined reference PD_IntArrayGetElementCount

* Delete PD_IntArrayGetSize Unused
```
- [IPU] paddle-inference support custom-ops (#45235) · a6b4bee3
  Authored by Allen Guo on Sep 30, 2022
```
* paddle-inference support custom-ops
Co-authored-by: Zhixin Yao <zhixiny@graphcore.ai>

* fix tolower
Co-authored-by: Zhixin Yao <zhixiny@graphcore.ai>
```
- Optimize performance of depthwise_conv_bwd of filter (#46490) · 04eb211a
  Authored by Zhang Zheng on Sep 30, 2022
```
* Optimize performance of depthwise_conv_bwd of filter

* op-benchmark

* fix

* op benchmark

* merge bwd
```
- Optimize performance of depthwise_conv_bwd (#46362) · f17a73e9
  Authored by Zhang Zheng on Sep 30, 2022
```
* Optimize performance of depthwise_conv_bwd

* fix
```
- [MLU] fix phi::Tensor compile error of mlu. (#46649) · 2e231402
  Authored by Chenxiao Niu on Sep 30, 2022
- [MLU] add_fluid_mluop_yolo_box (#46573) · 832b0a15
  Authored by 光明和真理 on Sep 30, 2022
- fix bugs of tipc, test=kunlun (#46540) · d16360c8
  Authored by ykkk2333 on Sep 30, 2022
```
* migrate sigmoid with cross entropy, and tile xpu kernels to phi, test=kunlun

* migrate add_n kernep to phi, test=kunlun

* fix bugs of tipc, test=kunlun
```
- [Opt Code] Opt GetExpectedKernelType code of conv_transpose_op (#46666) · 8c067ec1
  Authored by HongyuJia on Sep 30, 2022
```
* opt GetExpectedKernelType code of conv_transpose_op

* fix if error
```
- remove_dequantize_mkldnn_headerfile (#46665) · 7a1e1f99
  Authored by HongyuJia on Sep 30, 2022
- change mkldnn kernel layout, ALL_LAYOUT->ONEDNN (#46626) · 22e81907
  Authored by HongyuJia on Sep 30, 2022
- change mkldnn kernel layout, ALL_LAYOUT->ONEDNN (#46628) · 4744cbc7
  Authored by HongyuJia on Sep 30, 2022
- [Hackathon No.21] Add the paddle.incubate.sparse.transpose sparse API to Paddle (#45849) · 2b879a69
  Authored by 六个骨头 on Sep 30, 2022
- change mkldnn kernel layout, ALL_LAYOUT->ONEDNN (#46627) · 4b9dae01
  Authored by HongyuJia on Sep 30, 2022
- change mkldnn kernel layout, ALL_LAYOUT->ONEDNN (#46629) · abee2210
  Authored by HongyuJia on Sep 30, 2022
- support pure bfloat16 for more ops (#46364) · b7b231a6
  Authored by sneaxiy on Sep 30, 2022
```
* support pure bfloat16

* support bf16 linear

* update PR to pass CI

* tiny fix where_grad_kernel.cu

* add bfloat16 to selu_grad to pass CI

* fix selu grad compilation error
```
29 Sep, 2022 · 15 commits

- Optimize softmax's performance when dim_size >= 100000. (#46535) · 9012787f
  Authored by carryyu on Sep 29, 2022
- fix mpi include bug (#46601) · 7057093e
  Authored by Xinger on Sep 29, 2022

- Move valid check from python to kernel (#46412) · 37bc2d7b
  Authored by Zhang Zheng on Sep 29, 2022

* Move valid check from python to kernel

* fix error throw

* fix

* invalid label check

* fix

* Revert "fix"

This reverts commit 79fad6799cfa4b30423dbc84e67d7d843d22b84a.

* Revert "invalid label check"

This reverts commit 402a9707390ad5386b3222e85844b92d2e9b9fa4.

* Revert "fix"

This reverts commit 09ba3080ee0587447f875c19cdf060485f15ae3b.

* Revert "fix error throw"

This reverts commit a901bfcc2179d5c120ec29af766f392b122dab52.

* Revert "Move valid check from python to kernel"

This reverts commit baa03cc4ef82d8d45516c30dfb52bf5aead30748.

* final fix

* fix

* fix

- [GPUPS]add afs OpenWriter (#46611) · c7d60ce4
  Authored by zmxdream on Sep 29, 2022
```
* add afs OpenWriter

* update
```

- Add index_select, index_select_grad, reduce_min kernel and their unittests for... · 9a1855ff
  Authored by Leo Guo on Sep 29, 2022

Add index_select, index_select_grad, reduce_min kernel and their unittests for kunlun. Add registers of index_select, index_select_grad, reduce_min, sqrt, sqrt_grad to xpu2_op_list.test=kunlun. (#46557)

- fix P40 topk: Make the optimized topk compatible with P40. (#46547) · 667082c0
  Authored by carryyu on Sep 29, 2022

* fix P40 topk: Make the optimized topk compatible with P40.

* fix P40 topk: Make the optimized topk compatible with P40.

* fix P40 topk: Make the optimized topk compatible with P40.

- Remove calibration file path when deploy quantize model (#46283) · d71f1b3f
  Authored by yeliang2258 on Sep 29, 2022
```
* remove calibration file path

* remove useless code
```
- [MLU] add mlu kernel for add_reduce_max_grad (#45651) · 1ef1cace
  Authored by 光明和真理 on Sep 29, 2022
```
Co-authored-by: liupeiyu <liupeiyu@cambricon.com>
```
- add register for strided_slice_grad (#46549) · 40ab6faf
  Authored by ming1753 on Sep 29, 2022
- fix uniform_rand_kernel FP16 support in dygraph mode (#46212) · ccab0e2a
  Authored by 傅剑寒 on Sep 29, 2022
- [OptLayoutSelect] Select the highest priority layout (#46598) · 596d8209
  Authored by HongyuJia on Sep 29, 2022
```
* select highest priority layout

* opt performance, save virtual table find
```

- [Fix KernelKeyParser] Unify the logic of `operator()` in `KernelKeyParser` (#46560) · 4140d7ec
  Authored by HongyuJia on Sep 29, 2022

* add datatype check for ParseKernelKeyByInputArgs

* polish error message

* Actually, einsum has vector<Tensor> inpute with DataType::COMPLEX64, see test_einsum_v2.py

* headerfile remove enforce.h

- [Eager, Performance optimization] support mod / matmul ( % and @ operator) to... · 7d7444cc
  Authored by Weilong Wu on Sep 29, 2022

[Eager, Performance optimization] support mod / matmul ( % and @ operator) to sink to Cpp layer (#46565)

* [Eager, Performance optimization] support mod ( % operator) to sink to Cpp layer

* fix mod logic

* support matmul math operator

* rm LOG(warning), use VLOG(6)

* fix conflicts mistake

- [XPU] update xpu cmake to 0928. (#46437) · 58a478f8
  Authored by houj04 on Sep 29, 2022

* [XPU] update xpu cmake to 0923. test=kunlun

* [XPU] update xpu cmake to 0928. test=kunlun

- check change of unittest before checking coverage rate,test=coverage (#46593) · 2f76ddd7
  Authored by risemeup1 on Sep 29, 2022
```
* check change of unittest before checking coverage rate,test=coverage

* modify paddle_build.sh

* adding test_list.py
```

28 Sep, 2022 · 6 commits

- fix collective helper (#46582) · bd10211c
  Authored by sneaxiy on Sep 28, 2022

- Remove the declaration of using Tensor in framework/tensor.h (#46432) · e12a905e
  Authored by Chen Weihang on Sep 28, 2022

* remove needless using tensor

* remove needless using tensor

* resolve conflict

* replace tensor using

* fix format error

* revert needless changing

* fix rocm and npu compile error

* fix cinn compile error

* fix format error

* fix mkldnn format error

* fix mkldnn format error

* fix cinn compile error

* fix cinn compile error

* fix cinn compile error

* resolve conflict

- Convert GradMergeAllReduceOpHandle in GraphToBlock (#46544) · 6a706e63
  Authored by Ruibiao Chen on Sep 28, 2022
```
* Convert GradMergeAllReduceOpHandle in GraphToBlock

* Set FLAGS_CONVERT_GRAPH_TO_PROGRAM to False
```
- rename filenames from pten to phi (#46579) · 3f8585a9
  Authored by HongyuJia on Sep 28, 2022

- [phi Backend] Change BackendSet from uint64_t to uint32_t (#46532) · f6f8c935
  Authored by HongyuJia on Sep 28, 2022

* change BackendSet from 64bits to 32bits

* fix _MSC_VER error, __lzcnt32->__lzcnt

* fix __GNUC__ error, __builtin_clzl->__builtin_clz

- [Eager, Performance optimization] support less_than & less_equal( < & <=... · 7d238139
  Authored by Weilong Wu on Sep 28, 2022
```
[Eager, Performance optimization] support less_than & less_equal( < & <= operator) to sink to Cpp layer (#46542)
```

PaddlePaddle / Paddle: last synced successfully about 1 year ago