Commits · 7a1e1f9992f696a04d3e0d4fe20b40b53e341267 · PaddlePaddle / Paddle

30 Sep, 2022 · 3 commits
- remove_dequantize_mkldnn_headerfile (#46665) · 7a1e1f99
  Committed by HongyuJia on Sep 30, 2022
- change mkldnn kernel layout, ALL_LAYOUT->ONEDNN (#46629) · abee2210
  Committed by HongyuJia on Sep 30, 2022
- support pure bfloat16 for more ops (#46364) · b7b231a6
  Committed by sneaxiy on Sep 30, 2022
```
* support pure bfloat16

* support bf16 linear

* update PR to pass CI

* tiny fix where_grad_kernel.cu

* add bfloat16 to selu_grad to pass CI

* fix selu grad compilation error
```

29 Sep, 2022 · 2 commits

fix P40 topk: Make the optimized topk compatible with P40. (#46547) · 667082c0

Committed by carryyu on Sep 29, 2022

* fix P40 topk: Make the optimized topk compatible with P40.

* fix P40 topk: Make the optimized topk compatible with P40.

* fix P40 topk: Make the optimized topk compatible with P40.

[MLU] add mlu kernel for add_reduce_max_grad (#45651) · 1ef1cace
Committed by 光明和真理 on Sep 29, 2022
```
Co-authored-by: liupeiyu <liupeiyu@cambricon.com>
```

28 Sep, 2022 · 5 commits

Remove the declaration of using Tensor in framework/tensor.h (#46432) · e12a905e

Committed by Chen Weihang on Sep 28, 2022

* remove needless using tensor

* remove needless using tensor

* resolve conflict

* replace tensor using

* fix format error

* revert needless changing

* fix rocm and npu compile error

* fix cinn compile error

* fix format error

* fix mkldnn format error

* fix mkldnn format error

* fix cinn compile error

* fix cinn compile error

* fix cinn compile error

* resolve conflict

Replacing set_format with set_mem_desc in FC onednn kernel (#46372) · 844d9855

Committed by Jacek Czaja on Sep 28, 2022

* added fc int8 tests

* CI fix

* added skipping UTs for GPUs

* fixes for CI

* added support for residual connections inside fc

* fix for quant int8 bias

* - lint

Co-authored-by: jakpiase <jakpia21@gmail.com>

first commit (#46525) · 806b252c

Committed by limingshu on Sep 28, 2022

[PHI] relu6_grad kernel (#46501) · cee2b12d

Committed by Sławomir Siwek on Sep 28, 2022

* Relu6

* remove fluid handler

* add individual kernel signature

* coding style

* replace bounded_relu with clip

* whitespace

* code style

Fix clip_extra logic in remove_training_info (#46534) · 7e2e2ee7

Committed by zyfncg on Sep 28, 2022
```
* fix clip_extra code in remove_training_info

* revert rnn opmaker clear
```

27 Sep, 2022 · 4 commits
- [MLU] add huber_loss kernel. (#46455) · f786fcf9
  Committed by Chenxiao Niu on Sep 27, 2022
- Add bernoulli primitive op and support dropout op in new AD. (#46238) · fee84e09
  Committed by levi131 on Sep 27, 2022
```
* init dropout

* small format fix

* fix pr comments

* add value test
```
- speedup ChannelClipAndQuantDequantKernelQuantAxis1 kernel (#46471) · 9c426728
  Committed by ceci3 on Sep 27, 2022
- [Sparse] Support static graph (#46245) · a02eb143
  Committed by zhangkaihuo on Sep 27, 2022

26 Sep, 2022 · 5 commits
- Conv grad to use set_mem_desc() (#46459) · f4a6c539
  Committed by Jacek Czaja on Sep 26, 2022
```
* - Conv grad changed for MD

* - lint

* - compilation fix

* yet another lint
```
- [MLU] fluid: add mluop (#46429) · 3e1e482b
  Committed by cifar10 on Sep 26, 2022
- clear extra atts of sequence_softmax in opmaker (#46457) · 159f10e3
  Committed by zyfncg on Sep 26, 2022
- Support rsqrt_p (#46369) · 4c438d30
  Committed by Jiabin Yang on Sep 26, 2022
```
* support rsqrt_p

* refine code and ut

* add_prim_rsqrt

* fix ut
```
- clear extra attrs of distribute op in opmaker (#46451) · 4f847433
  Committed by zyfncg on Sep 26, 2022

23 Sep, 2022 · 5 commits
- add phi reduce_sum test=kunlun (#46241) · 22fe4f03
  Committed by dongfangshenzhu on Sep 23, 2022
```
* add phi reduce_sum test=kunlun

* add fhi reduce_sum test=kunlun

* add fhi reduce_sum test=kunlun
```
- [MLU] add barrier_op kernel. (#46417) · ead54eab
  Committed by Chenxiao Niu on Sep 23, 2022
- move selected_rows_functor (#46373) · b6c6f4f9
  Committed by YuanRisheng on Sep 23, 2022
- clear extra attrs of quantize op in opmaker (#46418) · 62c05369
  Committed by zyfncg on Sep 23, 2022
- [BugFix]Fix reduce_mean/min/sum/prod, cumsum grad_op infershape bug (#46408) · 812e4b47
  Committed by Aurelius84 on Sep 23, 2022
```
* [BugFix]Fix reduce_mean/min/sum/prod, cumsum grad_op infershape bug

* fix typo

* fix typo
```

22 Sep, 2022 · 11 commits
- [PHI] Sum op migration (#46239) · 3448afc1
  Committed by Paulina Gacek on Sep 22, 2022
```
* Sum kernel migrated to phi

* Static cast added, file name changed

* OneDNNGetDataType to uppercase

* refactoring

* AddOneDNNHandler changed to SumOneDNNHandler
```
- Clear extra attrs of lookup_table_v2 in OpMaker (#46321) · ffc697ff
  Committed by zyfncg on Sep 22, 2022
```
* clear extra attrs of look_up_table_v2 in opmaker

* fix bug
```
- [PHI] Migrate sgd and stack oneDNN kernels (#46374) · 4ae37aee
  Committed by Piotr Paturej on Sep 22, 2022
```
* Convert slice+grad oneDNN fluid kernels to PHI

* Change mutable_data to Alloc

* Refactor licences
```
- [NPU] fix CI error in new executor. test=develop (#46292) · 497eb948
  Committed by 王明冬 on Sep 22, 2022
- [PHI] Migrate gelu kernels (#45596) · 567e2fc8
  Committed by Sławomir Siwek on Sep 22, 2022
```
* gaussian random

* mkldnn to onednn renaming

* fix merge conflicts

* remove fluid code

* onednn renaming

* gelu fwd

* sort activations

* gelu gradient

* remove unused macros

* merge conflicts

* fix merge conflicts

* remove extra contraint from gelu op
```
- convert grad_merge_all_reduce in graph to program (#46353) · 0a144ca1
  Committed by Leo Chen on Sep 22, 2022
- TensorRT engine context memory sharing (#45842) · 173b39bb
  Committed by Yuanle Liu on Sep 22, 2022
- [mkldnn] Fix elementwise_sub sign reverse for mkldnn (#46049) · ab97b760
  Committed by Hui Zhang on Sep 22, 2022
```
* fix sub sign reverse for mkldnn

* refactor code as comment

* remove useless

* format code
```
- [Dygraph] Fix bugs of mp in eager mode (#46303) · 11002430
  Committed by Haohongxiang on Sep 22, 2022
```
* fix bugs of mp

* fix bugs of mp

* update

* update

* fix bug
```
- Optimize topk's performance when k is small and input_width is large (#45312) · 2c687df0
  Committed by carryyu on Sep 22, 2022
```
* Optimize topk's performance when k is small and input_width is large

* Modify the blockdim setting logic

* Update top_k_function_cuda.h
```
- [MLU] add int64 support for mlu one_hot_v2 (#46313) · 9cc3b28d
  Committed by Chenxiao Niu on Sep 22, 2022

21 Sep, 2022 · 5 commits

add layer_norm trt fp16 support (#45043) · b7a1ae22

Committed by ccrrong on Sep 21, 2022

* add fp16 support

* update

* update half

* code format

* fix unittest

* fix rocm compile error

* code format

* code format

* fix rocm compile error

* fix rocm compile error

Revert pool+grad oneDNN kernel conversion (#45989) · dc31d2aa

Committed by Piotr Paturej on Sep 21, 2022

fix multihead_matmul nan error when seq len et 1024 (#46286) · face8f1f

Committed by RichardWooSJTU on Sep 21, 2022

migrate add_n kernel to phi (#46318) · 0f9dde43

Committed by ykkk2333 on Sep 21, 2022

* migrate sigmoid with cross entropy, and tile xpu kernels to phi, test=kunlun

* migrate add_n kernep to phi, test=kunlun

[PHI] Migrate concat+grad, expand+grad, fill_constant, nearest_interp and bilinear_interp oneDNN kernels (#45863) · 3d59fee5

Committed by Piotr Paturej on Sep 21, 2022

* Migrate concat+grad, expand+grad, fill_constant, nearest_interp_v2 and bilinear_interp_v2 oneDNN kernels to PHI

* Remove old namespace variable

* Fix invalid out dims error

* Add mutable_data method to concat output

* Add check for -1 dim before computing out_dims

* Capitalize oneDNNGetDataType function name

* Change fill_constant kernel to correct PHI kernel

* Attempt to fix dims error

* Fix fill_constant (full) kernel

PaddlePaddle / Paddle · synced successfully more than 1 year ago
