提交 · 2c44ee7e8033d6abef02ed492c07caa154402193 · BaiXuePrincess / Paddle

13 10月, 2021 4 次提交

[New Feature] Support triple grad in Paddle (#36187) · 2c44ee7e

由 Jiabin Yang 提交于 10月 13, 2021

* native commit for triple grad of sigmod

* Updated unittests files

* init functional jacobian api

* Updated trible_test func

* Updated gradient_checker & test_script

* finish test with dtype float32

* add float64 test case

* polish code

* use atol=1e-5 with dtype float64

* fix for ci

* set timeout for test_jacobian

* fix dygraph grad to support high differential

* polish API docstring

* Updated gradient checker and some related files

* fix double grad strip error for high differential

* fix double grad strip error for high differential

* Add Sigmoid triple grad tests

* fix dygraph double grad dtype error when calling for high differential senario

* Updated triple grad teses func

* Use np.random to initialize ddx

* Updated triple_grad_check func

* add todo for gradient checker and refine some comments

* remove additional code

* add test for warnging in backward.py

* format python code
Co-authored-by: Nveyron95 <veyron_wu@163.com>
Co-authored-by: Nlevi131 <limaolin01@baidu.com>

2c44ee7e

[PaddleInference] Pass: add int8 flag for op (#36042) · d7858c99

由 Wangzheee 提交于 10月 13, 2021

* add_int_pass

* add_int8_flag_pass

* add_int8_flag_pass

* fix CMakeLists.txt

* fix test_trt_fc_fuse_quant_dequant_pass.py

* fix python/paddle/fluid/tests/unittests/ir/inference/test_trt_fc_fuse_quant_dequant_pass.py

* fix test_trt_fc_fuse_quant_dequant_pass.py

d7858c99

F

Set NIGHTLY tag for 'tensordot' UT (#36354) · 90457d8c
由 From00 提交于 10月 13, 2021

90457d8c
L
unify usage of tuple and list (#36368) · 3c2bdaa8
由 levi131 提交于 10月 13, 2021
```
* modify format

* modify format
```
3c2bdaa8

12 10月, 2021 13 次提交
- Z
  Revert "refine LarsOptimizer (#36351)" (#36369) · 033a73c3
  由 Zeng Jinle 提交于 10月 12, 2021
```
This reverts commit b3f6eedb.
```
  033a73c3
- A
  [NPU] concat supports dtype int64 for model deepfm (#36327) · 5f1eb839
  由 Aganlengzi 提交于 10月 12, 2021
```
* [NPU] modify for model deepfm

* [NPU] unit test delete precision control

* [NPU] add more unit test

* revert elementwise_mul related modification

* [NPU] add more unit tests for concat
```
  5f1eb839
- 0
  delete remove_static_file() function in error.py (#36153) · 40cfe7b2
  由 0x45f 提交于 10月 12, 2021
```
* change time to remove static tempfile

* delete remove_static_file() function
```
  40cfe7b2
- T
  [Autograd.functional] VJP and JVP (#36020) · 1e1aa197
  由 Tongxin Bai 提交于 10月 12, 2021
```
* autograd.functional passed pylint checker.

* autograd.functional: fix import errors.

* autograd.functional: fixed unit tests.

* autograd.functional minor format change
```
  1e1aa197
- Q
  [NPU] fix elementwise_mul to support broadcast, test=develop (#36258) · 09778f46
  由 Qi Li 提交于 10月 12, 2021
```
* [NPU] fix elementwise_mul to support broadcast, test=develop

* remove debug files, test=develop

* add axis support, test=develop
```
  09778f46
- Z
  
  refine LarsOptimizer (#36351) · b3f6eedb
  由 Zeng Jinle 提交于 10月 12, 2021
  
  b3f6eedb
- H
  
  Update test_cross_entropy_loss.py · 59841e6f
  由 HydrogenSulfate 提交于 10月 11, 2021
  
  59841e6f
- H
  
  Update test_cross_entropy_loss.py · a4246b90
  由 HydrogenSulfate 提交于 10月 11, 2021
  
  a4246b90
- H
  
  Fix the bug when axis is specified and weight is provided · 1d660eb6
  由 HydrogenSulfate 提交于 10月 11, 2021
  
  1d660eb6
- Q
  [NPU] add int64 kernel for slice, test=develop (#36328) · 8cc7146d
  由 Qi Li 提交于 10月 12, 2021
```
* [NPU] add int64 kernel for scale and slice, test=develop

* remove int64 for scale, test=develop
```
  8cc7146d
- J
  
  Add pool2d test convert (#36338) · e275e423
  由 JingZhuangzhuang 提交于 10月 11, 2021
  
  e275e423
- H
  fix bugs in mp_layers、pp_layers and HybridParallelClipGrad (#36144) · d247cf17
  由 Haohongxiang 提交于 10月 12, 2021
```
* fix calling bug of HybridParallelClipGrad

* fix bugs of HybridParallelClipGrad

* add unittest of pp with HybridParallelClipGrad

* fix bugs in mp_layers.py

* update

* fix bugs in pp_layers.py

* update
```
  d247cf17
- A
  Fix stop_gradient in RunProgramOp (#36339) · 2a75b447
  由 Aurelius84 提交于 10月 12, 2021
```
* Fix stop_gradient in RunProgramOp

* fix reference
```
  2a75b447
11 10月, 2021 15 次提交

[heterps] add fuse_allreduce (#35131) · e5b4dd73

由 danleifeng 提交于 10月 11, 2021

* heterps:add fuse_allreduce op; test=develop
* add program_mode in minimize for pslib mode;test=develop

e5b4dd73

J

fix for matmul_v2 6D x 2D (#36342) · 339cb191
由 jakpiase 提交于 10月 11, 2021

339cb191
Z
Add FLAGS_allreduce_record_one_event to remove event waiting number (#36263) · 7b45a46e
由 Zeng Jinle 提交于 10月 11, 2021
```
* add FLAGS_allreduce_record_one_event

* add more comments

* fix ut

* improve coverage

* fix ut, improve coverage
```
7b45a46e

Add nn.functional.sparse_attention and some test cases, test=develop (#35757) · 85b77232

由 Liu-xiandong 提交于 10月 11, 2021

Add paddle.nn.functional.sparse_attention API

本个PR主要将sparse_attention功能在python层进行了一层封装，OP的主体代码见：#PR35676

此外，对于封装的python 接口，增加了相应的单测。

85b77232

Y

fix_dp_grad_merge_with_grad_clip_by_global_norm (#36334) · 1026052c
由 Yuang Liu 提交于 10月 11, 2021

1026052c

[Paddle-ASP] Revise 4d tensor sparsity mask pattern for conv2d sparsity (#36054) · 00245cfd

由 zlsh80826 提交于 10月 11, 2021

Sparse tensor core for convolution requires the input channel dimension is 2:4 structed sparse.
So we have to mask the input channel dimension for using sparse tensor core

00245cfd

add reshard module (#35779) · c38b0488

由 caozhou 提交于 10月 11, 2021

* add reshard module

* fix conflict

* update reshard module

* update and add unitest

* update reshard module and unitest

* add more unitests

c38b0488

Y

fix multi-node (#36329) · 7a724ddb
由 yaoxuefeng 提交于 10月 11, 2021

7a724ddb
W
enhance yolobox trt plugin (#34128) · 71cb3ff8
由 wangxinxin08 提交于 10月 11, 2021
```
* enhance yolobox plugin
```
71cb3ff8

[NPU] fix matmul_v2 and utils.run_check, test=develop (#36164) · 7850f7ce

由 Qi Li 提交于 10月 11, 2021

* [NPU] fix matmul_v2 and utils.run_check, test=develop

* remove debug files, test=develop

* fix install_check, test=develop

* fix doc, test=develop

* fix review comments, test=develop

7850f7ce

Q
[NPU] fix set_value, test=develop (#36272) · 83541fd4
由 Qi Li 提交于 10月 11, 2021
```
* [NPU] fix set_value, test=develop

* fix typo, test=develop

* fix typo, test=develop
```
83541fd4

add mish trt plugin (#34123) · 2b7b752a

由 wangxinxin08 提交于 10月 11, 2021

* add mish trt plugin, compile & install success, run error. test=develop
* modify code according to review
* add TRT_NOEXCEPT for mish trt plugin
* add unittest for mish trt plugin
* remove unnecessary check of mish in op_teller.cc
* fix some problem of trt8
* add check and modify unittest while converting mish to trt plugin
Co-authored-by: Ndengkaipeng <dengkaipeng@baidu.com>

2b7b752a

B
add skip case in trt converter ut (#36287) · 34bd18ff
由 baoachun 提交于 10月 11, 2021
```
* add skip case in trt converter ut

* disable group_norm trt plugin
```
34bd18ff

Add use_cinn Flag and RunFromCinn in PE (#36107) · 5690666c

由 Huihuang Zheng 提交于 10月 11, 2021

Add use_cinn flag and use it to control whether we run PaddlePaddle using CINN.

Also add:

Replace PaddlePaddle graph with a CINN graph in a pass
PE Method to feed data and run the graph by CINN

5690666c

J

Add skip case for conv2d convert test (#36301) · 9b987b3d
由 JingZhuangzhuang 提交于 10月 10, 2021

9b987b3d

09 10月, 2021 5 次提交
- Y
  
  Enhance OpTest for bfloat16. (#36079) · 91119271
  由 Yiqun Liu 提交于 10月 09, 2021
  
  91119271
- F
  Add new API 'tensordot' (#36273) · 21dc7f40
  由 From00 提交于 10月 09, 2021
```
* Add new API tensordot

* Set timeout value 400 for UT; Fix format for EN docs

* Set timeout value 1000 for UT; Fix format for EN docs

* Remove some input check

* Coding style improve: don't compare boolean values to True or False
using ==
```
  21dc7f40
- Z
  
  fill_diagonal op fix border cross caused by offset (#36212) · 62e41150
  由 zhiboniu 提交于 10月 09, 2021
  
  62e41150
- Z
  support ClipGradByGlobalNorm in sharding (#36012) · 623df429
  由 zhaoyingli 提交于 10月 09, 2021
```
* support ClipGradByGlobalNorm in sharding

* support ClipGradByGlobalNorm in sharding

* test=allcase
```
  623df429
- W
  fix hasattr(paddle.fluid.ir.PassDesc.OP, '__name__') error (#36229) · d8887afa
  由 wuhuanzhou 提交于 10月 09, 2021
```
对于__getattr__重载后不满足条件的参数，全部抛出AttributeError异常，达到与未重载版本一致。
```
  d8887afa
08 10月, 2021 3 次提交
- Z
  Support CUDA Graph on ParallelExecutor (#36250) · f9591bb1
  由 Zeng Jinle 提交于 10月 08, 2021
```
* support CUDA Graph on PE

* add ut, fix CI compile

* reduce memory consumption

* fix CUDA 10 CI

* improve coverage

* improve python coverage
```
  f9591bb1
- Y
  
  add fs list_files_info (#36224) · ca16e8fd
  由 yaoxuefeng 提交于 10月 08, 2021
  
  ca16e8fd
- Q
  [NPU] BatchNorm support layout of NCL and NLC, test=develop (#35668) · 7cb19f57
  由 Qi Li 提交于 10月 08, 2021
```
* [NPU] support NCL and NCL for BatchNorm, test=develop

* [NPU] remove debug files, test=develop

* update, test=develop
```
  7cb19f57

BaiXuePrincess / Paddle 与 Fork 源项目一致

BaiXuePrincess / Paddle
与 Fork 源项目一致