Commits · e4e94a889a7e172ca92b9d0c4aca8c3c08a39fea · PaddlePaddle / Paddle

01 Feb, 2023 · 25 commits

[Zero-Dim] Fix 0-dim tensor for arg_min_max op. (#49570) · e4e94a88

Committed by Zhong Hui on Feb 01, 2023

* fix 0-d tensor for arg_min_max op.

* fix xpu.

* fix zero dims

* fix

* Update arg_min_max_kernel.cc

* Update arg_min_max_kernel.cc

* Update arg_min_max_kernel.cc

* Update test_zero_dim_tensor.py

* Update test_zero_dim_tensor_xpu.py

* Update test_zero_dim_tensor.py

* Update arg_min_max_kernel.cc

* Update arg_min_max_kernel.cc

* Update arg_min_max_kernel.cc

e4e94a88

run infer ut in A10 (#48535) · 71f247b1

Committed by YUNSHEN XIE on Feb 01, 2023

* run infer ut in A10

* add cuda11.2-cudnn8-trt8.4 image

* add paddle_coverage_new.sh

71f247b1

[Numpy]Fix NumpyScaler2Tensor dtype error (#50018) · 6f0ae156
Committed by PuQing on Feb 01, 2023
```
* fix numpyScaler2Tensor type error

* fix to_tensor docs, test=document_fix
```
6f0ae156

Skip the int input operator when inserting a quant node & fix some bug (#49926) · 03619037
Committed by Guanghua Yu on Feb 01, 2023

03619037
fix the div 0 error of sparse_embedding (#49948) · 3a73d348
Committed by 张春乔 on Feb 01, 2023
```
* fix the div 0 error of sparse_embedding

* add unittest
```
3a73d348

fix the NullPointerError of matrix_power (#50015) · 776021c1
Committed by 张春乔 on Feb 01, 2023

776021c1

Preln fix (#49802) · e03718f5

Committed by Wang Bojun on Feb 01, 2023

* preln_residual 2 fused_bias_residual

* skip layernorm fix and ut

* code refine

* code style refine

* fix ut

* fix output

* add trt layer fall back info

* refine op teller and ut

* DropoutMaskOut output fix

e03718f5

jit layer support multi thread and fix predictor clone (#50095) · 9fa2eb38

Committed by Hui Zhang on Feb 01, 2023

* jit layer support multi thread

* fix bug

* clone predictor not do graph optimizer

* format

* fix comment and format

* fix override and format

* fix

* fix

9fa2eb38

Fix Python IndexError of case9: paddle.static.nn.deform_conv2d (#49990) · c62657b3
Committed by RedContritio on Feb 01, 2023
```
* add dimension check for deformable_conv

* add unittest
```
c62657b3
bump isort version to 5.11.5 (#50126) · 5349b9b9
Committed by MarDino on Feb 01, 2023

5349b9b9

support grid_sampler_grad op for XPU (#49857) · 520f48d6
Committed by zhangyikun02 on Feb 01, 2023

520f48d6
[Divide by 0 Error] add lu check (#49974) · f71796b6
Committed by gouzil on Feb 01, 2023
```
* [Divide by 0 Error] add lu check

* [Divide by 0 Error] lu check migrate to c++
```
f71796b6

Fix errors for test_standalone_custom_stream (#50103) · f0811bb7
Committed by Ruibiao Chen on Feb 01, 2023

f0811bb7

[Divide by 0 Error] add eig check (#49971) · 226a6567

Committed by gouzil on Feb 01, 2023

* [Divide by 0 Error] add eig check

* [Divide by 0 Error] eig check migrate to c++

* [Divide by 0 Error] Fix class name error

226a6567

[Divide by 0 Error] add norm check (#49966) · 5dfddaea

Committed by gouzil on Feb 01, 2023

* [Divide by 0 Error] add norm check

* [Divide by 0 Error] fix x AttributeError

* [Divide by 0 Error] norm check migrate to c++

5dfddaea

Combination of multiple paddle::memory::allocate operation into one for ops (#49126) · bdae5481

Committed by limingshu on Feb 01, 2023

* A leap of try for cudaLaunchCooperativeKernel

* fix bugs

* Totally replace the lar cuda kernel

* Fix bugs

* fix code according to comments

* fix codes according to  review comments

* adding some function overload

* relocate the power operation.

* add bf16 support for index select relevant ops

* revert bf16 type change.

* add changes for more op

* fix code writing bugs

bdae5481

add dynamic shape support for running paddle-trt in calib_mode (#50033) · af673090
Committed by zhoutianzi666 on Feb 01, 2023

af673090

clean ps_trainer_pass (#50117) · 73f3e676
Committed by wangxiaoning on Feb 01, 2023

73f3e676

nccl 2.7.8 to 2.10.3 (#50121) · 2b636166
Committed by zqw_1997 on Feb 01, 2023

2b636166

add clip_grad_norm_ API (#49935) · 0855d982

Committed by zxcd on Feb 01, 2023

* add clip_grad_norm_ api.

* fix docs and some details according to the comments.

* fix code style.

* fix no_grad problem, and fix doc.

* fix code style.

* fix doc and remove type information

0855d982

Fix UFA (illegal address access) of case4: paddle.unbind (#49995) · 9ce8cfcf

Committed by RedContritio on Feb 01, 2023

* add axis check for unbind

* add axis range check for unbind

* update unittest and axis validation for unbind

* add unittest invalid axis for unbind

* restore axis extract for unbind

9ce8cfcf

Fix Python IndexError of case1: paddle.linalg.lstsq (#49985) · 7f1a1570
Committed by RedContritio on Feb 01, 2023

7f1a1570

fix gc and infinite buffer size (#50122) · 3e9d8548
Committed by LiYuRio on Feb 01, 2023

3e9d8548
[PrimCinn]Fix some vars are wrongly gc in CINN+InterpreterCore (#50116) · 9f231147
Committed by Aurelius84 on Feb 01, 2023
```
* [PrimCinn]Fix some vars are wrongly gc in CINN+InterpreterCore

* fix baseline unittest config

* fix code style
```
9f231147

H2D data transfer optimization for split kernel (#49086) · 057ba778

Committed by limingshu on Feb 01, 2023

* profile reduce kernel for fp16 and reduceHigherdim

* use reinterpret_cast

* fix for CI on ROCm

* add Macro for ROCm

* ROCm CI config

* ROCm CI config

* unit test repair

* pull

* add common_funcs.h

* reduceType

* Update reduce_function.h

* not higher

* rename

* implement of matmul using cublasLt instead of cublas

* cublasLt bugfix

* Update matmul_kernel_impl.h

* Update matmul_kernel_impl_via_blasLt.h

* for-loop-algo

* PR comments changes

* add macro

* ci unused variable isCublasLt

* ci unused variable isCublasLt macro

* split matmul to autotune

* rewrite the split kernel with segmented_array

* rewrite the split kernel with segmented_array

* rewrite the split kernel with segmented_array

* add some method for cuda_graph

* fix bugs for rocm

* change for ci-error

* i dont know why ci-model-benchmark gives a shit error, so i recover codes with original one to see if original codes work.

* add some changes for passing mode_benchmark and coverage ci

* fix ci error

* fix ci-rocm error

* add some changes for header

---------
Co-authored-by: zhangbopd <1299246947@qq.com>
Co-authored-by: Bo Zhang <105368690+zhangbopd@users.noreply.github.com>

057ba778

31 Jan, 2023 · 15 commits
- support empty input for unique_consecutive (#49978) · dc1b6511
  Committed by RedContritio on Jan 31, 2023
  
  dc1b6511
- gn_silu (#49928) · 111075a3
  Committed by wenbin on Jan 31, 2023
```
* gn_silu

* add ut

* set TIMEOUT

* correct comments

* comments

* disable windows ut

* rename parameter
```
  111075a3
- migrating dot/sign/fill/norm from old dynamic graph to new dynamic graph (#49895) · b0ee022b
  Committed by 姜永久 on Jan 31, 2023
```
* check dygraph on for op tests

* reset eigh and modify prelu&sign

* update eager_op_test

* lint

* add more ops

* fix reduce

* modify reduce test

* reset reduce_op

* modify matmul test

* revert prelu
```
  b0ee022b
- update ops for new dynamic graph tests (#50061) · 47ddd36e
  Committed by 姜永久 on Jan 31, 2023
```
* update elementwise ops tests

* add more ops

* modify sum&split

* lint

* rm check_dygraph

* revert pow

* modify add for cpu test

* revert reshape

* modify min
```
  47ddd36e
- imigrating from old dynamic graph to new dynamic graph for argmin/argmax/adalta test (#50093) · 86a22ad4
  Committed by 姜永久 on Jan 31, 2023
```
* more ops

* revert some ops

* reset some ops
```
  86a22ad4
- bind pixel_shuffle & pixel_shuffle_grad op for xpu (#50090) · a5f2e1f7
  Committed by wangshengxiang on Jan 31, 2023
  
  a5f2e1f7
- Unary (#49914) · 0d9185b9
  Committed by wenbin on Jan 31, 2023
```
* disable integer

* disable integer

* add cast layer
```
  0d9185b9
- [pass] Upgrade Constant Folding Pass (#49908) · c3cd8502
  Committed by Zhang Jun on Jan 31, 2023
  
  c3cd8502
- Save nan log to file when output_dir is setted (#49200) · c18fddd3
  Committed by niuliling123 on Jan 31, 2023
  
  c18fddd3
- Integrate static code gen info (#49858) · 0e51f398
  Committed by Charles-hit on Jan 31, 2023
```
* polish static grad op maker gen

* fix some bugs

* fix static code gen

* solve conflict

* modify composite grad maker name

* integrate phi and fluid info in static code gen

* rename some composite maker

* modify static code gen format
```
  0e51f398
- [inference][trt] add elementwise input data type check (#49675) · 5822e15c
  Committed by Zhang Jun on Jan 31, 2023
  
  5822e15c
- [Numpy] Add FP16 dtype for CastNumpy2Scalar (#50002) · 86a23818
  Committed by PuQing on Jan 31, 2023
```
* add FP16 dtype for CastNumpy2Scalar

* fix throw message

* add test

* fix SyntaxWarning

* test skip for float16

* fix dtype mistakes
```
  86a23818
- fix div 0 error of NoamDecay (#49953) · 96a0ce60
  Committed by 张春乔 on Jan 31, 2023
```
* fix div 0 error of NoamDecay

* add unittest

* Update lr.py
```
  96a0ce60
- Add unified device management api (#48651) · 7aaaa1c6
  Committed by ronnywang on Jan 31, 2023
```
* [CustomDevice] add custom device api

* update

* update

* test=document_fix

* update

* update

* add  examples
```
  7aaaa1c6
- [KUNLUN] rename test_pool_max_op.py (#49945) · 5d110365
  Committed by jameszhang on Jan 31, 2023
```
* [KUNLUN] rename test_pool_max_op.py

* update xpu toolchain
```
  5d110365

PaddlePaddle / Paddle: synced successfully about 1 year ago
