Commits · d7d35ff80f649838e3e2ddc585778b3f482c89a4 · PaddlePaddle / Paddle

Sep 14, 2022 · 22 commits
- delay tensorrt registry (#45824) · d7d35ff8
  Committed by JingZhuangzhuang on Sep 14, 2022
```
* Delay TensorRT registry
* Add unused define
* Fix TensorRT test
* fix function to reference
* Update trt_plugin.h
```
- normize yaml backward op label (#46028) · 6891a4fe
  Committed by Chen Weihang on Sep 14, 2022
- [PHI] Support bmm and bmm_grad in xpu (#45887) · 6bd2762c
  Committed by Jiabin Yang on Sep 14, 2022
```
* support bmm and bmm_grad in xpu

* add error removal

* test=kunlun

* refactor code for better structure

* test=kunlun

* add fp16 kernel for bmm

* test=kunlun
```
- CastPyArg2IntArray use int64_t (#45919) · c53e92fc
  Committed by wanghuancoder on Sep 14, 2022
- [Sparse]Remove unused code (#46021) · 0b82fb32
  Committed by zhangkaihuo on Sep 14, 2022
- Support fp16 for index_select and index_add (#45601) · 61012a76
  Committed by Li Min on Sep 14, 2022
- [CodeStyle] trim trailing whitespace in .md and .rst (#45990) · 3404ff67
  Committed by Nyakku Shigure on Sep 14, 2022
```
* [CodeStyle] trim trailing whitespace in .md and .rst

* empty commit, test=document_fix
```
- Migrate scale and scatter to phi, and modify the code style for... · 1349584e
  Committed by Leo Guo on Sep 14, 2022
```
Migrate scale and scatter to phi, and modify the code style for roi_align_kernel. test=kunlun (#45938)
```
- fix trt multiclass_nms3 (#45166) · f85f2e83
  Committed by Zhang Jun on Sep 14, 2022
```
* update

* update

* update
```
- support slice op backward refuse forward and add high level unit test (#45960) · d9fac780
  Committed by Charles-hit on Sep 14, 2022
- [XPU] migrate reduce kernels to phi, test=kunlun (#45973) · 5829069d
  Committed by ykkk2333 on Sep 14, 2022
- add check_memory_continue kernel (#45999) · a5021e89
  Committed by Leo Chen on Sep 14, 2022
- [AMP] Support AMP-O2 for bfloat16 (#45541) · e8809d99
  Committed by zhangbo9674 on Sep 14, 2022
```
* support bfloat16 for amp_decorate

* add check_finite for bf16

* fix bug

* add ut

* add ut

* refine code
```
- Fix arm fp16 compile error (#45991) · 9f4f18f2
  Committed by Chen Weihang on Sep 14, 2022
```
* fix arm fp16 compile error

* polish macro impl
```
- Fix DistributedFusedLAMB NaN problem (#46011) · 6833ecfe
  Committed by sneaxiy on Sep 14, 2022
```
* fix distributed_fused_lamb nan

* remove CUDA_ASSERT
```
- support assign op backward refuse forward (#45879) · 65dd828e
  Committed by Charles-hit on Sep 14, 2022
- fix compile (#45996) · 654eff5f
  Committed by wenbin on Sep 14, 2022
- Simplify the codes of conv. (#45966) · 3a5b5048
  Committed by Yiqun Liu on Sep 14, 2022
- add mean,sum,ge,gt,ne,abs primitive operators for supporting deepxde (#45888) · 62176f63
  Committed by Xiaoxu Chen on Sep 14, 2022
```
* add reduce_mean,reduce_sum primitive ops

* add ne_p gt_p primitive operators

* add ge_p abs_p primitive oparators
```
- [PHI] Normalize yaml op label (#45976) · e43e4825
  Committed by Chen Weihang on Sep 14, 2022
```
* normalize yaml op label

* revert op_compat yaml change

* fix prelu and rnn compat problem

* replace api by op
```
- [MLU] add mergedAdam kernel. (#45965) · bf6ec262
  Committed by Chenxiao Niu on Sep 14, 2022
- [Sparse]Sparse add support gpu (#45974) · da33f7b0
  Committed by zhangkaihuo on Sep 14, 2022
Sep 13, 2022 · 11 commits
- Fix argsort in XPU black list for XPU KP (#45975) · 2d45f68f
  Committed by niuliling123 on Sep 13, 2022
- set device id before op run (#45993) · a896b32b
  Committed by Leo Chen on Sep 13, 2022
- support concat backward refuse forward (#45940) · ff1da188
  Committed by Charles-hit on Sep 13, 2022
- support tile op backward refuse forward (#45942) · c6f173b0
  Committed by Charles-hit on Sep 13, 2022
- support expand_v2 op backward refuse forward (#45941) · 1eefd66a
  Committed by Charles-hit on Sep 13, 2022
- add log while running New Executor, Old Executor and ParallelExecutor and change log level (#45814) · f639bc69
  Committed by pangyoki on Sep 13, 2022
```
* optimize executor log

* delete log in new exe

* add log for old executor

* use LOG_FIRST_N(INFO, 1)
```
- [CustomDevice] register load_combine op (#45980) · b2122239
  Committed by ronnywang on Sep 13, 2022
- Clear extra attributes of activation op in OpMaker (#45772) · c7b373f2
  Committed by zyfncg on Sep 13, 2022
```
* clear extra attr of activation op in opmaker

* fix syntax bug

* fix mkldnn kernel

* fix merge conflict

* fix bug
```
- add softmax infer kernel (#45955) · 01888482
  Committed by JingZhuangzhuang on Sep 13, 2022
```
* add softmax infer kernel
```
- migrate squeeze kernel to phi, test=kunlun (#45968) · d3366853
  Committed by ykkk2333 on Sep 13, 2022
- fix transformer bug, test=kunlun (#45927) · e6d397e6
  Committed by ykkk2333 on Sep 13, 2022
Sep 10, 2022 · 1 commit
- [MLU] fix compute error of dropout op (#45923) · 36915474
  Committed by qipengh on Sep 10, 2022
Sep 09, 2022 · 6 commits
- Run_program_op add scope cache & reuse (#45813) · 369a235d
  Committed by zhangbo9674 on Sep 09, 2022
```
* add scope cache & reuse

* add gc scope for end of each train step

* del scope reuse for jit

* refine code

* test
```
- make memcpy op to support custom_device (#45918) · 1ed8e9b8
  Committed by duanyanhui on Sep 09, 2022
```
* make memcpy op to support custom device

* fix bug
```
- Fix namespace error (#45925) · a687b531
  Committed by engineer1109 on Sep 09, 2022
```
paddle::platform::CudaAtomicAdd
https://github.com/PaddlePaddle/Paddle/issues/45881
```
- Fix softmax op when the input shape is larger than INT32_MAX (#45897) · 38edea9a
  Committed by sneaxiy on Sep 09, 2022
```
* fix softmax int64

* follow comments
```
- Fix split bug in static mode (#45906) · bd8f998b
  Committed by Charles-hit on Sep 09, 2022
```
* fix split bug in static mode

* modify code style

* modify code style

* add unit test for split
```
- remove skip_quant in op_teller (#45872) · 8487f79c
  Committed by Wangzheee on Sep 09, 2022

PaddlePaddle / Paddle: synced successfully about 1 year ago