提交 · 173b39bb5703c297ae89c6ef442f634c56f2f2bf · BaiXuePrincess / Paddle

22 9月, 2022 8 次提交
- Y
  
  TensorRT engine context memory sharing (#45842) · 173b39bb
  由 Yuanle Liu 提交于 9月 22, 2022
  
  173b39bb
- S
  
  Fix UVA Tensor (#46371) · d772166c
  由 Siming Dai 提交于 9月 22, 2022
  
  d772166c
- H
  [mkldnn] Fix elementwise_sub sign reverse for mkldnn (#46049) · ab97b760
  由 Hui Zhang 提交于 9月 22, 2022
```
* fix sub sign reverse for mkldnn

* refactor code as comment

* remove useless

* format code
```
  ab97b760
- Z
  [Paddle-TRT]fix bug in fill_constant_batch_size_like op (#46334) · d8edf487
  由 zhoutianzi666 提交于 9月 22, 2022
```
* fix beta bug in fill_constant_batch_size_like
```
  d8edf487
- H
  [Dygraph] Fix bugs of mp in eager mode (#46303) · 11002430
  由 Haohongxiang 提交于 9月 22, 2022
```
* fix bugs of mp

* fix bugs of mp

* update

* update

* fix bug
```
  11002430
- C
  Optimize topk's performance when k is small and input_width is large (#45312) · 2c687df0
  由 carryyu 提交于 9月 22, 2022
```
* Optimize topk's performance when k is small and input_width is large

* 修改blockdim设置逻辑

* Update top_k_function_cuda.h
```
  2c687df0
- Z
  
  fix compile problem (#46354) · b1771368
  由 zyfncg 提交于 9月 22, 2022
  
  b1771368
- C
  
  [MLU] add int64 support for mlu one_hot_v2 (#46313) · 9cc3b28d
  由 Chenxiao Niu 提交于 9月 22, 2022
  
  9cc3b28d
21 9月, 2022 17 次提交
- C
  add layer_norm trt fp16 support (#45043) · b7a1ae22
  由 ccrrong 提交于 9月 21, 2022
```
* add fp16 support

* update

* update half

* code format

* fix unittest

* fix rocm compile error

* code format

* code format

* fix rocm compile error

* fix rocm compile error
```
  b7a1ae22
- P
  
  Revert pool+grad oneDNN kernel conversion (#45989) · dc31d2aa
  由 Piotr Paturej 提交于 9月 21, 2022
  
  dc31d2aa
- C
  
  recover get ci check and fix typo (#46340) · a93a95bf
  由 Chen Weihang 提交于 9月 21, 2022
  
  a93a95bf
- Z
  Enable PaddleInference to use CINN. (#45009) · 3aa6bd57
  由 Zhen Wang 提交于 9月 21, 2022
```
* use cinn in the paddle inference

* fix some cmake errors

* Avoid division by zero in the arange_kernel.

* Avoid dynamic ops.

* Remove some useless codes.

* Use OpTransInfo to encapsulate some codes used in the build_cinn_pass.
```
  3aa6bd57
- R
  
  fix multihead_matmul nan error when seq len et 1024 (#46286) · face8f1f
  由 RichardWooSJTU 提交于 9月 21, 2022
  
  face8f1f
- X
  
  fix error: control reaches end of non-void function (#46312) · 7c4efa5a
  由 Xinger 提交于 9月 21, 2022
  
  7c4efa5a
- Y
  migrate add_n kernel to phi (#46318) · 0f9dde43
  由 ykkk2333 提交于 9月 21, 2022
```
* migrate sigmoid with cross entropy, and tile xpu kernels to phi, test=kunlun

* migrate add_n kernep to phi, test=kunlun
```
  0f9dde43
- W
  
  Mpi final dev simple (#46247) · 9ce31e96
  由 wuhuachaocoding 提交于 9月 21, 2022
  
  9ce31e96
- W
  residual_no_bias (#46129) · aa0e84e3
  由 wenbin 提交于 9月 21, 2022
```
* residual_no_bias

* comments

* more ut

* fix input
```
  aa0e84e3
- P
  [PHI] Migrate concat+grad, expand+grad, fill_constant, nearest_interp and... · 3d59fee5
  由 Piotr Paturej 提交于 9月 21, 2022
```
[PHI] Migrate concat+grad, expand+grad, fill_constant, nearest_interp and bilinear_interp oneDNN kernels (#45863)

* Migrate concat+grad, expand+grad, fill_constant, nearest_interp_v2 and bilinear_interp_v2 oneDNN kernels to PHI

* Remove old namespace variable

* Fix invalid out dims error

* Add mutable_data method to concat output

* Add check for -1 dim before computing out_dims

* Capitalize oneDNNGetDataType function name

* Change fill_constant kernel to correct PHI kernel

* Attempt to fix dims error

* Fix fill_constant (full) kernel
```
  3d59fee5
- Z
  [Paddle-TRT] remove trt_reshape2_matmul_fuse_pass (#46090) · c9a7a3bc
  由 zhoutianzi666 提交于 9月 21, 2022
```
* Remove trt_reshape2_matmul_fuse_pass
```
  c9a7a3bc
- W
  
  inference python api fix miss return (#46300) · 44d72ceb
  由 Wilber 提交于 9月 21, 2022
  
  44d72ceb
- J
  
  refine mkldnn code · 4b8d4ade
  由 jiahongyu 提交于 9月 20, 2022
  
  4b8d4ade
- L
  
  share threadpool of executor in dy2static (#46281) · 8232da7c
  由 Leo Chen 提交于 9月 21, 2022
  
  8232da7c
- N
  [CodeStyle] remove tabs in cpp files (#46236) · 9e917a1e
  由 Nyakku Shigure 提交于 9月 21, 2022
```
* [CodeStyle] remove tabs in cpp files

* update comment format
```
  9e917a1e
- W
  [Eager, Performance Optimization] Optimize clone interface (#46190) · 669c7d51
  由 Weilong Wu 提交于 9月 21, 2022
```
* [Eager] polish clone interface

* rm clone in python, add clone in eager_method.cc
```
  669c7d51
- Y
  Remove audio ParameterError (#46316) · b28bff06
  由 YangZhou 提交于 9月 21, 2022
```
* unexpose audio ParameterError

* clean audio utils api
```
  b28bff06
20 9月, 2022 15 次提交
- W
  [Paddle Inference]support n lookup_tables fuse to embeddinglayernorm(1) (#46230) · 3441e5e8
  由 Wangzheee 提交于 9月 20, 2022
```
* [Paddle Inference]support n lookup_tables fuse to embeddinglayernorm(1): add some funtion for embedding
```
  3441e5e8
- S
  [PHI] migrate softmax_grad kernel (#46257) · 4dad95cc
  由 Sławomir Siwek 提交于 9月 20, 2022
```
* init

* remove softmaxop

* merge dev

* correct dir

* style
```
  4dad95cc
- P
  [PHI] Migrate slice, slice_grad, split, pad and pad3d oneDNN kernels (#46101) · b232b5e9
  由 Piotr Paturej 提交于 9月 20, 2022
```
* Convert split, pad and pad3d kernels

* Convert slice+grad oneDNN fluid kernels to PHI

* change out->mutable_data to dev_ctx.Alloc
```
  b232b5e9
- J
  
  reverted changes (#46254) · ec1376ae
  由 jakpiase 提交于 9月 20, 2022
  
  ec1376ae
- P
  [PHI] Shape op migration (#46051) · 27fe77bc
  由 Paulina Gacek 提交于 9月 20, 2022
```
* First approach

* Shape kernel corrected

* Compilation error fixed

* Resize corrected

* Registered types added

* Mistake corrected & types added

* sum kernel deleted
```
  27fe77bc
- Z
  [Paddle-TRT] matmul_v2 support (#44918) · aee4f8ab
  由 zhoutianzi666 提交于 9月 20, 2022
```
* Support matmul_v2 in PaddleTensorRT
```
  aee4f8ab
- W
  [Paddle Inference]support n lookup_tables fuse to embeddinglayernorm(2) (#46234) · 85c7be42
  由 Wangzheee 提交于 9月 20, 2022
```
* [Paddle Inference]support n lookup_tables fuse to embeddinglayernorm(2)
```
  85c7be42
- Z
  [Paddle-TRT] Full support for ops with persistable input (#45545) · 668ffd59
  由 zhoutianzi666 提交于 9月 20, 2022
```
* Move ITensor construction for Weight (persistable variable) from OpConvert to TensorRTEngine.
```
  668ffd59
- Y
  [PHI]Move merge_selected_rows kernel to PHI (#46004) · b1658232
  由 YuanRisheng 提交于 9月 20, 2022
```
* move_merge_selected_rows

* update code
```
  b1658232
- N
  
  [CodeStyle] remove crlf for cpp files (#46156) · 846c7e70
  由 Nyakku Shigure 提交于 9月 20, 2022
  
  846c7e70
- W
  [JitLayer]Erase out vars in scope to avoid data rewritinig (#46249) · 9941ec12
  由 WangZhen 提交于 9月 20, 2022
```
* [JitLayer]Erase out vars to avoid data rewrittinig

* Fix code comments
```
  9941ec12
- W
  
  Add symbolic shape deduction function for general Plugin mechanism (#46172) · 82399bdf
  由 weishengying 提交于 9月 20, 2022
  
  82399bdf
- J
  
  restore gaussian_random_op_npu.cc header · 3b89e7c0
  由 jiahongyu 提交于 9月 19, 2022
  
  3b89e7c0
- J
  
  refine mkldnn code · b2b9a1bb
  由 jiahongyu 提交于 9月 17, 2022
  
  b2b9a1bb
- R
  [NPU] fix run_program_op, test=develop (#46122) · db97773b
  由 ronnywang 提交于 9月 20, 2022
```
* [NPU] fix run_program_op, test=develop

* [NPU] fix matmul_v2 in cann502, test=develop
```
  db97773b

BaiXuePrincess / Paddle 与 Fork 源项目一致

BaiXuePrincess / Paddle
与 Fork 源项目一致