提交 · 808be6574a46e552688acdd3066e271598c4f132 · PaddlePaddle / Paddle

15 10月, 2021 2 次提交

[New Feature] Support tanh triple grad (#36225) · 808be657

由 Jiabin Yang 提交于 10月 15, 2021

* native commit for triple grad of sigmod

* Updated unittests files

* init functional jacobian api

* Updated trible_test func

* Updated gradient_checker & test_script

* finish test with dtype float32

* add float64 test case

* polish code

* use atol=1e-5 with dtype float64

* fix for ci

* set timeout for test_jacobian

* fix dygraph grad to support high differential

* polish API docstring

* Updated gradient checker and some related files

* fix double grad strip error for high differential

* fix double grad strip error for high differential

* Add Sigmoid triple grad tests

* fix dygraph double grad dtype error when calling for high differential senario

* Updated triple grad teses func

* Use np.random to initialize ddx

* Updated triple_grad_check func

* add todo for gradient checker and refine some comments

* remove additional code

* add test for warnging in backward.py

* add tanh triple grad

* format python code

* refine code
Co-authored-by: Nveyron95 <veyron_wu@163.com>
Co-authored-by: Nlevi131 <limaolin01@baidu.com>

808be657

Z

fix momentum ops (#36452) · 4dda18a8
由 Zeng Jinle 提交于 10月 15, 2021

4dda18a8

14 10月, 2021 10 次提交
- Z
  
  fix lars (#36431) · 8256f6fa
  由 Zeng Jinle 提交于 10月 14, 2021
  
  8256f6fa
- Z
  
  refine merge lars (#36428) · 63fd7d66
  由 Zeng Jinle 提交于 10月 14, 2021
  
  63fd7d66
- W
  inference support bert when exists matmul_v2 (#36424) · 3e6d9dbb
  由 Wilber 提交于 10月 14, 2021
```
* support bert when exists matmul_v2

* update
```
  3e6d9dbb
- Z
  
  Add the complete code and related files of resnet_unit_op (#36366) · 12e6dbbc
  由 Zhang Zheng 提交于 10月 14, 2021
  
  12e6dbbc
- Z
  [NPU] Add density_prior_box (#36361) · bed4fb27
  由 zhulei 提交于 10月 14, 2021
```
* [NPU] Add density_prior_box op

* [NPU] Add density_prior_box op
```
  bed4fb27
- L
  Revert "Implemented LRU based cache clearing (#36290)" (#36426) · 5d18967b
  由 lidanqing 提交于 10月 14, 2021
```
This reverts commit bf748f24.
```
  5d18967b
- Z
  Merge momentum ops/kernels (#36380) · f4eda869
  由 Zeng Jinle 提交于 10月 14, 2021
```
* merge momentum ops

* update

* add ut to improve coverage

* remove optimizer change

* fix error msg

* update ut

* add __restrict__ for CUDA

* update ut

* move merged_momentum_op to optimizer dir

* fix coverage
```
  f4eda869
- Y
  
  [hybrid enhance] add flag to control the avg position for grad merge under pipeline mode (#36384) · 03d8304f
  由 Yuang Liu 提交于 10月 14, 2021
  
  03d8304f
- J
  Sparsity support (#36413) · b857d755
  由 JingZhuangzhuang 提交于 10月 13, 2021
```
* add pool2d convert test

* modify error

* modify error

* modify error

* modify error

* modify error

* modify error

* sparsity support
```
  b857d755
- P
  
  clean inference logs when config.DisableGlogInfo is triggered (#36356) · 7f5128f4
  由 Pei Yang 提交于 10月 14, 2021
  
  7f5128f4
13 10月, 2021 9 次提交

Y
[PaddlePaddle hackathon] + ADD CELU (#36088) · d7064f04
由 yujun 提交于 10月 13, 2021
```
* update

* update

* update

* try make CI pass

* doc typo

* update doc string
```
d7064f04

Merge lars op (#35476) · 0c31579c

由 limingshu 提交于 10月 13, 2021

* A leap of try for cudaLaunchCooperativeKernel

* fix bugs

* Totally replace the lar cuda kernel

* Fix bugs

* a test for lars merge

* Adding las_op_momentum infer_shape

* Fix codes

* use avg_numel instead of max_numel to acquire grid num

* modify unittest files about lars op

* Finally converge when merged-lars works

* fix ctest files

* add merged_operation kernel when cuda version is older than 11

* Fix code style

* fix ctest failure

* fix error

* fix all ctest error and change lars compute code of cpu

* fix bugs on v100.

* revert python modififation about lars

* revert python modification codes

0c31579c

W
Verify the correctness of graph rewrited by GeneratePass (#36116) · 24418479
由 wuhuanzhou 提交于 10月 13, 2021
```
Check detail PR description at https://github.com/PaddlePaddle/Paddle/pull/36116
```
24418479
W
pool fix (#36388) · 192e08cb
由 wenbin 提交于 10月 13, 2021
```
* pool fix

* comments
```
192e08cb
J
Implemented LRU based cache clearing (#36290) · bf748f24
由 Jacek Czaja 提交于 10月 13, 2021
```
- Lint

- Merge with develop

- lint
```
bf748f24
L
[Amp] refine code of amp level (#36362) · 59e425cd
由 Leo Chen 提交于 10月 13, 2021
```
* refine amp level

* fix typo

* update tracer._amp_level
```
59e425cd
H
Remove RunFromCinn in PE because We Will Call CinnRunner in Compute of SubgraphOp (#36385) · e051bba0
由 Huihuang Zheng 提交于 10月 13, 2021
```
Remove RunFromCinn method in PE because We Will Call CinnRunner in Compute method of SubgraphOp
```
e051bba0

[New Feature] Support triple grad in Paddle (#36187) · 2c44ee7e

由 Jiabin Yang 提交于 10月 13, 2021

* native commit for triple grad of sigmod

* Updated unittests files

* init functional jacobian api

* Updated trible_test func

* Updated gradient_checker & test_script

* finish test with dtype float32

* add float64 test case

* polish code

* use atol=1e-5 with dtype float64

* fix for ci

* set timeout for test_jacobian

* fix dygraph grad to support high differential

* polish API docstring

* Updated gradient checker and some related files

* fix double grad strip error for high differential

* fix double grad strip error for high differential

* Add Sigmoid triple grad tests

* fix dygraph double grad dtype error when calling for high differential senario

* Updated triple grad teses func

* Use np.random to initialize ddx

* Updated triple_grad_check func

* add todo for gradient checker and refine some comments

* remove additional code

* add test for warnging in backward.py

* format python code
Co-authored-by: Nveyron95 <veyron_wu@163.com>
Co-authored-by: Nlevi131 <limaolin01@baidu.com>

2c44ee7e

[PaddleInference] Pass: add int8 flag for op (#36042) · d7858c99

由 Wangzheee 提交于 10月 13, 2021

* add_int_pass

* add_int8_flag_pass

* add_int8_flag_pass

* fix CMakeLists.txt

* fix test_trt_fc_fuse_quant_dequant_pass.py

* fix python/paddle/fluid/tests/unittests/ir/inference/test_trt_fc_fuse_quant_dequant_pass.py

* fix test_trt_fc_fuse_quant_dequant_pass.py

d7858c99

12 10月, 2021 7 次提交
- Z
  
  Change the input param of fusion op interface from pointer to tensor (#36349) · 3e2dec5b
  由 Zhang Zheng 提交于 10月 12, 2021
  
  3e2dec5b
- A
  [NPU] concat supports dtype int64 for model deepfm (#36327) · 5f1eb839
  由 Aganlengzi 提交于 10月 12, 2021
```
* [NPU] modify for model deepfm

* [NPU] unit test delete precision control

* [NPU] add more unit test

* revert elementwise_mul related modification

* [NPU] add more unit tests for concat
```
  5f1eb839
- Q
  [NPU] fix elementwise_mul to support broadcast, test=develop (#36258) · 09778f46
  由 Qi Li 提交于 10月 12, 2021
```
* [NPU] fix elementwise_mul to support broadcast, test=develop

* remove debug files, test=develop

* add axis support, test=develop
```
  09778f46
- Q
  [NPU] add int64 kernel for slice, test=develop (#36328) · 8cc7146d
  由 Qi Li 提交于 10月 12, 2021
```
* [NPU] add int64 kernel for scale and slice, test=develop

* remove int64 for scale, test=develop
```
  8cc7146d
- J
  
  Add pool2d test convert (#36338) · e275e423
  由 JingZhuangzhuang 提交于 10月 11, 2021
  
  e275e423
- Z
  Revert "refine case when thread_num = 1 (#36201)" (#36347) · 0594d2a7
  由 Zeng Jinle 提交于 10月 12, 2021
```
This reverts commit 7e60cc63.
```
  0594d2a7
- A
  Fix stop_gradient in RunProgramOp (#36339) · 2a75b447
  由 Aurelius84 提交于 10月 12, 2021
```
* Fix stop_gradient in RunProgramOp

* fix reference
```
  2a75b447
11 10月, 2021 12 次提交
- L
  refine auto_growth allocator (#35732) · 6d353aa5
  由 Leo Chen 提交于 10月 11, 2021
```
* do not use alignedAllocator when cuda has alignment

* update test

* fix error during multiple process
```
  6d353aa5
- J
  
  fix for matmul_v2 6D x 2D (#36342) · 339cb191
  由 jakpiase 提交于 10月 11, 2021
  
  339cb191
- Z
  Add FLAGS_allreduce_record_one_event to remove event waiting number (#36263) · 7b45a46e
  由 Zeng Jinle 提交于 10月 11, 2021
```
* add FLAGS_allreduce_record_one_event

* add more comments

* fix ut

* improve coverage

* fix ut, improve coverage
```
  7b45a46e
- L
  Add nn.functional.sparse_attention and some test cases, test=develop (#35757) · 85b77232
  由 Liu-xiandong 提交于 10月 11, 2021
```
Add paddle.nn.functional.sparse_attention API

    本个PR主要将sparse_attention功能在python层进行了一层封装，OP的主体代码见：#PR35676

    此外，对于封装的python 接口，增加了相应的单测。
```
  85b77232
- J
  
  added missing bf16 ops (#36291) · 14393876
  由 jakpiase 提交于 10月 11, 2021
  
  14393876
- Z
  
  Add more tests and fix bugs for cudnn_norm_conv_test and cudnn_bn_and_relu_test (#36314) · a679fcbb
  由 Zhang Zheng 提交于 10月 11, 2021
  
  a679fcbb
- N
  Add functor_primitives.h for kernel primtive api (#36203) · 830debc2
  由 niuliling123 提交于 10月 11, 2021
```
* Add functor_primitives.h for kernel primtive api

* update

* move namespace kps

* subFunctor init_data

* delete InvalidArgumentError
```
  830debc2
- Y
  
  fix multi-node (#36329) · 7a724ddb
  由 yaoxuefeng 提交于 10月 11, 2021
  
  7a724ddb
- W
  enhance yolobox trt plugin (#34128) · 71cb3ff8
  由 wangxinxin08 提交于 10月 11, 2021
```
* enhance yolobox plugin
```
  71cb3ff8
- Q
  [NPU] fix matmul_v2 and utils.run_check, test=develop (#36164) · 7850f7ce
  由 Qi Li 提交于 10月 11, 2021
```
* [NPU] fix matmul_v2 and utils.run_check, test=develop

* remove debug files, test=develop

* fix install_check, test=develop

* fix doc, test=develop

* fix review comments, test=develop
```
  7850f7ce
- Q
  [NPU] fix set_value, test=develop (#36272) · 83541fd4
  由 Qi Li 提交于 10月 11, 2021
```
* [NPU] fix set_value, test=develop

* fix typo, test=develop

* fix typo, test=develop
```
  83541fd4
- Q
  
  [NPU] fix softmax_with_cross_entropy in dygraph, test=develop (#36297) · 11061325
  由 Qi Li 提交于 10月 11, 2021
  
  11061325

PaddlePaddle / Paddle 大约 1 年 前同步成功

PaddlePaddle / Paddle
大约 1 年前同步成功