提交 · 667082c0adc72770bfb2ecbede8f726af35bc460 · PaddlePaddle / Paddle

29 9月, 2022 10 次提交
- C
  fix P40 topk: Make the optimized topk compatible with P40. (#46547) · 667082c0
  由 carryyu 提交于 9月 29, 2022
```
* fix P40 topk: Make the optimized topk compatible with P40.

* fix P40 topk: Make the optimized topk compatible with P40.

* fix P40 topk: Make the optimized topk compatible with P40.
```
  667082c0
- Y
  Remove calibration file path when deploy quantize model (#46283) · d71f1b3f
  由 yeliang2258 提交于 9月 29, 2022
```
* remove calibration file path

* remove useless code
```
  d71f1b3f
- [MLU] add mlu kernel for add_reduce_max_grad (#45651) · 1ef1cace
  由光明和真理提交于 9月 29, 2022
```
Co-authored-by: Nliupeiyu <liupeiyu@cambricon.com>
```
  1ef1cace
- M
  
  add register for strided_slice_grad (#46549) · 40ab6faf
  由 ming1753 提交于 9月 29, 2022
  
  40ab6faf
- 傅
  
  fix uniform_rand_kernel FP16 support in dygraph mode (#46212) · ccab0e2a
  由傅剑寒提交于 9月 29, 2022
  
  ccab0e2a
- H
  [OptLayoutSelect] Select the highest priority layout (#46598) · 596d8209
  由 HongyuJia 提交于 9月 29, 2022
```
* select highest priority layout

* opt performance, save virtual table find
```
  596d8209
- H
  [Fix KernelKeyParser] Unify the logic of `operator()` in `KernelKeyParser` (#46560) · 4140d7ec
  由 HongyuJia 提交于 9月 29, 2022
```
* add datatype check for ParseKernelKeyByInputArgs

* polish error message

* Actually, einsum has vector<Tensor> inpute with DataType::COMPLEX64, see test_einsum_v2.py

* headerfile remove enforce.h
```
  4140d7ec
- W
  [Eager, Performance optimization] support mod / matmul ( % and @ operator) to... · 7d7444cc
  由 Weilong Wu 提交于 9月 29, 2022
```
[Eager, Performance optimization] support mod / matmul ( % and @ operator) to sink to Cpp layer (#46565)

* [Eager, Performance optimization] support mod ( % operator) to sink to Cpp layer

* fix mod logic

* support matmul math operator

* rm LOG(warning), use VLOG(6)

* fix conflicts mistake
```
  7d7444cc
- H
  [XPU] update xpu cmake to 0928. (#46437) · 58a478f8
  由 houj04 提交于 9月 29, 2022
```
* [XPU] update xpu cmake to 0923. test=kunlun

* [XPU] update xpu cmake to 0928. test=kunlun
```
  58a478f8
- R
  check change of unittest before checking coverage rate,test=coverage (#46593) · 2f76ddd7
  由 risemeup1 提交于 9月 29, 2022
```
* check change of unittest before checking coverage rate,test=coverage

* modify paddle_build.sh

* adding test_list.py
```
  2f76ddd7
28 9月, 2022 17 次提交
- S
  
  fix collective helper (#46582) · bd10211c
  由 sneaxiy 提交于 9月 28, 2022
  
  bd10211c
- C
  Remove the declaration of using Tensor in framework/tensor.h (#46432) · e12a905e
  由 Chen Weihang 提交于 9月 28, 2022
```
* remove needless using tensor

* remove needless using tensor

* resolve conflict

* replace tensor using

* fix format error

* revert needless changing

* fix rocm and npu compile error

* fix cinn compile error

* fix format error

* fix mkldnn format error

* fix mkldnn format error

* fix cinn compile error

* fix cinn compile error

* fix cinn compile error

* resolve conflict
```
  e12a905e
- R
  Convert GradMergeAllReduceOpHandle in GraphToBlock (#46544) · 6a706e63
  由 Ruibiao Chen 提交于 9月 28, 2022
```
* Convert GradMergeAllReduceOpHandle in GraphToBlock

* Set FLAGS_CONVERT_GRAPH_TO_PROGRAM to False
```
  6a706e63
- H
  
  rename filenames from pten to phi (#46579) · 3f8585a9
  由 HongyuJia 提交于 9月 28, 2022
  
  3f8585a9
- H
  [phi Backend] Change BackendSet from uint64_t to uint32_t (#46532) · f6f8c935
  由 HongyuJia 提交于 9月 28, 2022
```
* change BackendSet from 64bits to 32bits

* fix _MSC_VER error, __lzcnt32->__lzcnt

* fix __GNUC__ error, __builtin_clzl->__builtin_clz
```
  f6f8c935
- W
  [Eager, Performance optimization] support less_than & less_equal( < & <=... · 7d238139
  由 Weilong Wu 提交于 9月 28, 2022
```
[Eager, Performance optimization] support less_than & less_equal( < & <= operator) to sink to Cpp layer (#46542)
```
  7d238139
- Z
  
  [GPUPS]fix ChannelReader (#46575) · 2aec65be
  由 zmxdream 提交于 9月 28, 2022
  
  2aec65be
- L
  
  remove const qualifier in function return (#46546) · 8c5b9cf8
  由 Leo Chen 提交于 9月 28, 2022
  
  8c5b9cf8
- J
  Replacing set_format with set_mem_desc in FC onednn kernel (#46372) · 844d9855
  由 Jacek Czaja 提交于 9月 28, 2022
```
* added fc int8 tests

* CI fix

* added skipping UTs for GPUs

* fixes for CI

* added support for residual connections inside fc

* fix for quant int8 bias

* - lint
Co-authored-by: Njakpiase <jakpia21@gmail.com>
```
  844d9855
- L
  
  first commit (#46525) · 806b252c
  由 limingshu 提交于 9月 28, 2022
  
  806b252c
- S
  [PHI] relu6_grad kernel (#46501) · cee2b12d
  由 Sławomir Siwek 提交于 9月 28, 2022
```
* Relu6

* remove fluid handler

* add individual kernel signature

* coding style

* replace bounded_relu with clip

* whitespace

* code style
```
  cee2b12d
- Y
  
  add decode_jpeg yaml (#46562) · c7da8602
  由 YuanRisheng 提交于 9月 28, 2022
  
  c7da8602
- Y
  [BugFix]Fix concat bugs when call onednn kernel (#46518) · 0ee6dfbe
  由 YuanRisheng 提交于 9月 28, 2022
```
* fix concat bug

* fix ci bugs

* fix ci bugs
```
  0ee6dfbe
- K
  [NPU] add gpu kernel for transfer layout (#46307) · 526d963e
  由 kangguangli 提交于 9月 28, 2022
```
* add gpu kernel for transfer layout

* comment error throw

* fix: flag setting in testcase; add condition check for raising error

* fix typo

* fix: add error type for PADDLE_THROW

* remove kernel fallback in data_transfer.cc

* remove useless variable definition
```
  526d963e
- W
  
  merge develop (#46520) · 1ecc39b4
  由 Weilong Wu 提交于 9月 28, 2022
  
  1ecc39b4
- W
  [PHI] phi support xpu black list (#46527) · 84f7835d
  由 wanghuancoder 提交于 9月 28, 2022
```
* phi support xpu black list
```
  84f7835d
- Z
  Fix clip_extra logic in remove_training_info (#46534) · 7e2e2ee7
  由 zyfncg 提交于 9月 28, 2022
```
* fix clip_extra code in remove_training_info

* revert rnn opmaker clear
```
  7e2e2ee7
27 9月, 2022 13 次提交
- J
  
  adjust backend priority, GPUDNN>GPU>ONEDNN>CPU · 7467221b
  由 jiahongyu 提交于 9月 27, 2022
  
  7467221b
- C
  
  [MLU] add huber_loss kernel. (#46455) · f786fcf9
  由 Chenxiao Niu 提交于 9月 27, 2022
  
  f786fcf9
- J
  
  polish typo, emum->enum, defalutly->defaultly · c82d1020
  由 jiahongyu 提交于 9月 27, 2022
  
  c82d1020
- W
  [Eager, Performance optimization] support divide( / operator) to sink to Cpp layer (#46329) · f20b361c
  由 Weilong Wu 提交于 9月 27, 2022
```
* [Eager] math op sink to Cpp level

* fix ci errors

* draft version

* support + and - operator under cpp directly

* add static test

* polish code

* promote types or unify right type to left

* recover static test case

* polish code and fix some ci errors

* support complex and polish code

* fix conflicts

* fix windows ci errors

* fix windows-inference-ci errors

* polish and fix tests

* fix test case

* polish code

* [Eager, Performance optimization] support multiply( * operator) to sink to Cpp layer

* rm useless glog

* [Eager, Performance optimization] support divide( / and // operator) to sink to Cpp layer

* polish code

* polish code and fix code-format

* polish code

* fix CI

* polish code

* update test

* support div operator under cpp

* fix scalar as input

* Polish div logic, fix ci test

* fix errors
```
  f20b361c
- L
  Add bernoulli primitive op and support dropout op in new AD. (#46238) · fee84e09
  由 levi131 提交于 9月 27, 2022
```
* init dropout

* small format fix

* fix pr comments

* add value test
```
  fee84e09
- L
  
  Delete int kernel type in Scatter Kernel.test=kunlun (#46030) · 403cd2b5
  由 Leo Guo 提交于 9月 27, 2022
  
  403cd2b5
- C
  
  speedup ChannelClipAndQuantDequantKernelQuantAxis1 kernel (#46471) · 9c426728
  由 ceci3 提交于 9月 27, 2022
  
  9c426728
- W
  preln_residual_bias optimization (#46496) · 55accdfc
  由 wenbin 提交于 9月 27, 2022
```
* half2

* add epsilon
```
  55accdfc
- W
  [Paddle Inference]support n lookup_tables fuse to embeddinglayernorm(3) (#46243) · 4d772144
  由 Wangzheee 提交于 9月 27, 2022
```
* [Paddle Inference]support n lookup_tables fuse to embeddinglayernorm(3)
```
  4d772144
- R
  
  Fix op_happen_before_ update bug for AddDownstreamOp (#46486) · ba1bbe8e
  由 Ruibiao Chen 提交于 9月 27, 2022
  
  ba1bbe8e
- Z
  
  Fix syntax errors of args name in int_array.h (#46521) · 38e82868
  由 zyfncg 提交于 9月 27, 2022
  
  38e82868
- C
  Add README.md for phi (#46506) · a7aefaea
  由 Chen Weihang 提交于 9月 27, 2022
```
* add readme for phi

* polish details, test=document_fix
```
  a7aefaea
- W
  [Eager] refine gil use (#46452) · b106c424
  由 wanghuancoder 提交于 9月 27, 2022
```
* refine gil use
```
  b106c424

PaddlePaddle / Paddle 大约 1 年 前同步成功

PaddlePaddle / Paddle
大约 1 年前同步成功