提交 · b36fb036a5dd4b9f88c646390450db2564903f84 · BaiXuePrincess / Paddle

31 8月, 2021 5 次提交
- R
  [hybrid] Fix row parallel linear bias (#35186) (#35297) · b36fb036
  由 Roc 提交于 8月 31, 2021
```
Co-authored-by: NWangXi <wangxi16@baidu.com>
```
  b36fb036
- R
  [hybrid][npu] fix npu clear float status in pipeline (#35165) (#35295) · 167685e5
  由 Roc 提交于 8月 31, 2021
```
Co-authored-by: NWangXi <wangxi16@baidu.com>
```
  167685e5
- R
  [hybrid npu] fix npu found_finite in hybrid (#35134) (#35291) · e64105f6
  由 Roc 提交于 8月 31, 2021
```
Co-authored-by: NWangXi <wangxi16@baidu.com>
```
  e64105f6
- Y
  [cherry-pick][Hybrid Performance] Move the cast op of AMP which cast fp32... · 6fb58aef
  由 Yuang Liu 提交于 8月 31, 2021
```
[cherry-pick][Hybrid Performance] Move the cast op of AMP which cast fp32 param to fp16 param to the optimizer (#34965) (#35296)
Co-authored-by: NWangXi <wangxi16@baidu.com>
```
  6fb58aef
- Y
  [cherry-pick] NPU use squared_l2_norm in GradientClipByGlobalNorm (#34836) (#35289) · 38c27d55
  由 Yuang Liu 提交于 8月 31, 2021
```
Co-authored-by: NWangXi <wangxi16@baidu.com>
```
  38c27d55
18 8月, 2021 3 次提交
- L
  [NPU] add retry on HcclGetRootInfo to fix "bind fail" (#34977) · 52a7b0c4
  由 Leo Chen 提交于 8月 18, 2021
```
* add retry for HcclGetRootInfo

* refine code

* reduce retry interval
```
  52a7b0c4
- G
  support class center sample of PartialFC (#34106) · 100db44f
  由 Guoxia Wang 提交于 8月 18, 2021
```
* support class center sample of PartialFC
```
  100db44f
- W
  [Paddle-TRT] unitest_quant_dequant (#34929) · c7070cb8
  由 Wangzheee 提交于 8月 18, 2021
```
* unitest_quant_dequant

* fix

* fix

* deleted: test_trt_quant_conv2d_dequant_fuse_pass.py

* fix
```
  c7070cb8
17 8月, 2021 14 次提交

R

[NPU]Adamw skip update for npu (#34897) · b4474fb4
由 Roc 提交于 8月 17, 2021

b4474fb4
A

[NPU] add where_index op and tests (#34951) · 1ef21855
由 Aganlengzi 提交于 8月 17, 2021

1ef21855
T
Update op-benchmark CI (#34962) · 690f5831
由 tianshuo78520a 提交于 8月 17, 2021
```
* fix op-benchmark

* test=document_fix
```
690f5831

Copy boost optional to Paddle (#34780) · 9be41447

由 chentianyu03 提交于 8月 17, 2021

* copy boost optional.hpp to paddle

* copy boost optional.hpp to paddle

* move directions

* del fluid/utils

* modify .hpp to .h

* move directions

* modify to paddle::optional

* add modification description

* format code stype for the files in paddle/utils

* format code stype

9be41447

[oneDNN ] disabling more ops caching (#34830) · f1c1d9e0

由 Jacek Czaja 提交于 8月 17, 2021

* - disabled caching of layer norm

- fix in compilation

- compilation fix

- transpose caching disabled

- compilation fix

- more compilation fixes

- sum caching disabled

- compilation fix

* - LRN with disabled cache

* lint fixes

f1c1d9e0

add exclude rules of pre-commit for paddle/utils and third_party (#34880) · 7b3295a4

由 chentianyu03 提交于 8月 17, 2021

* add exclude rules of pre-commit to paddle/utils and third_party

* remove exclude direction distributed/third_party

* remove exclude of paddle/utils for format cpplint check

7b3295a4

W
Modify the name of class in unittest with the same name (#34952) · 01a3a2e0
由 WeiXin 提交于 8月 17, 2021
```
* polish unittest.

* polish code

* polish code
```
01a3a2e0
S
[bug fix] fix unfold negative_size_param (#34943) · 8ef1bf87
由 shangliang Xu 提交于 8月 17, 2021
```
* [bug fix] fix unfold negative_size_param
```
8ef1bf87
P
add mkl multi-thread test cases in PR-CI-INFERENCE (#34946) · 9d4f00bc
由 Peihan 提交于 8月 17, 2021
```
* add mkl multi-thread test cases

* fix codestyle

* fix codestyle & enable ernie mkl test
```
9d4f00bc

Align CTC grad scale same with ESPNet (#34729) · 10f9644c

由 Hui Zhang 提交于 8月 16, 2021

* dygraph support more ctc grad scale

* scale for 1.x

* fix unitest

* fix unitest

* format code

* fix unittest

* fix log info

* unittest cov

* fix format;notest,test=cpu,coverage

* skip ctc_loss egs;test=cpu

* warpctc grad cov;test=coverage

* add dygraph test;test=coverage

* format;test=cpu,coverage

* format;test=cpu

* add api compat;test=cpu

* add cpu test

* rename

* rename

* fix

* fix test

* format

* eigen cpu

* eigen gpu grad pass

* cuda gpu pass

* format

* fix ci

10f9644c

Add some passes which can be applied to Program (#34730) · 8046e33d

由 Zeng Jinle 提交于 8月 17, 2021

* add inplace passes and tests

* update

* fix use_cuda undefined
fix compile error of op compat

* add more ut

* fix CPU CI error

* check adam unique

* fix mac/windows ci, improve coverage

* fix ci error

* follow weihang's comment

* fix BlockDesc::MoveFrom

* follow qiuliang's comment

* update

* follow huihuang's comments

8046e33d

Z

add api fill_diagonal_inplace (#34460) · 5de576b0
由 zhiboniu 提交于 8月 17, 2021

5de576b0
K
fix drop_last not work on IterableDataset (#34801) · 16146088
由 Kaipeng Deng 提交于 8月 17, 2021
```
* fix drop_last not work in IterableDataset. test=develop
```
16146088
N
fix a bug in nlp: text_matching/sentence_transformers when last dim is 1 and... · 181f7cec
由 niuliling123 提交于 8月 17, 2021
```
fix a bug in nlp: text_matching/sentence_transformers when last dim is 1 and reduce mid dim (#34941)
```
181f7cec

16 8月, 2021 18 次提交

Z

concurrent (#34908) · ed6624ab
由 zhangchunle 提交于 8月 16, 2021

ed6624ab
L
Fix typos in English docs for diag and diagflat. (#34869) · 35ef4180
由 Li Min 提交于 8月 16, 2021
```
* Fix typos in english docs for diag and diagflat.
```
35ef4180

[NPU] Support npu op:(1)arg_min (2)arg_max (#34867) · b1cc4a46

由 veyron95 提交于 8月 16, 2021

* [NPU] Support npu op:(1)arg_min (2)arg_max

* Modify and add unit test cases

* Modify unit test cases

b1cc4a46

Jetson nano bilinear (#34751) · 2a4ed087

由 feng_shuai 提交于 8月 16, 2021

* change bilinear thread for nano and tx2

* change bilinear thread for nano and tx2

2a4ed087

B

hccl init sync (#34918) · 6b4b9fea
由 Baibaifan 提交于 8月 16, 2021

6b4b9fea

[NPU] Add size npu op (#34636) · 49818943

由 0x45f 提交于 8月 16, 2021

* add size npu op

* modify support data type

* no longer use NPU size OP

* remove useless comments, add test case

* fix copyright, remove useless include

49818943

Change the invoking method of settiem by Ellipsis and None index from numpy to... · 2e30134f

由 zyfncg 提交于 8月 16, 2021

Change the invoking method of settiem by Ellipsis and None index from numpy to set_value op (#34911)

* Change invoking mathod of the settiem by Ellipsis and None index from numpy to set_value op

* add none_axes into attr of set_value_op in dygraph mode

2e30134f

F

[CPU-PSLIB] Add config for scale_sparse_grad in config_fleet.py,test=develop (#34893) · d028214d
由 Fan Zhang 提交于 8月 16, 2021

d028214d

Fix elementwise_add quantization (#34820) · ae80df91

由 joanna.wozna.intel 提交于 8月 16, 2021

* Remove force_fp32_output from elementwise_add quantization

* Fix cpu_quantize_placement test

* Review related changes

ae80df91

[oneDNN] Fix to 34554 (same as previous PR but should build with GPU) (#34859) · 9cb65653

由 Jacek Czaja 提交于 8月 16, 2021

* - Added softmax without caching

* - Binary is no longer manually cached

* - Activation onednn caching removed

* - Removed manual caching of activation

* - modified UT

* - fix

* - fix

* - fixes to building

* - fix

* - fix

* - fix to UT

* - Faulty UT workaround

* - approval workaround

* - Fixes after review

* - compilation fixes

* - more lint fixes

* - more fixes after review

* - fixes after another round of review

* - hopefully compilation fix

- compilation fix

9cb65653

Z

fix iscan bug in test file (#34912) · f6d8ab54
由 zhangchunle 提交于 8月 16, 2021

f6d8ab54
Q

[NPU] add nearest_interp_v2 and nearest_interp_v2_grad, test=develop (#34769) · 3b9f040d
由 Qi Li 提交于 8月 16, 2021

3b9f040d

[NPU] Support NPU kernel for nearest_interp and nearest_interp_grad op (#34881) · e4e8cc9b

由 From00 提交于 8月 16, 2021

* Add NPU kernel for nearest_interp op

* Add grad op

* Modify codes according to the review comments

* Modify codes according to the review comments

e4e8cc9b

add unique_consecutive_op (#34334) · 875cfd57

由 duanboqiang 提交于 8月 16, 2021

* add unique_consecutive_op

* add unique_consecutive_op

* add unique_consecutive_op

* add unique_consecutive_op

* add unique_consecutive_op

* add unique_consecutive_op

* add unique_consecutive_op

* add unique_consecutive_op

* remove unity build

* add unique_consecutive op

* add unique_consecutive op

* add enable static

* add noqa

* add space line

* add default case.

* add comma

* add space line

* modify unique_consecutive unittest

* optimize ut coverage

* rebase develop

* improve coverage

* update en docs

* update en docs

* update en docs

* update en docs

* update en docs

* update en doc

875cfd57

L
[amp] dygraph amp support param_group (#34899) · e29c2d12
由 Leo Chen 提交于 8月 16, 2021
```
* dygraph amp support param_group

* remove unused code

* fix doc
```
e29c2d12
G
support margin loss (arcface, cosface, sphereface) for single GPU and cross GPUs (#34247) · b0cb4148
由 Guoxia Wang 提交于 8月 16, 2021
```
* support margin loss (arcface, cosface, sphereface)
```
b0cb4148
Z

Enhance tensor shape check for dist op. (#34915) · dc439a12
由 Zhong Hui 提交于 8月 16, 2021

dc439a12

Support npu op hard_swish and hard_swish_grad (#34608) · fd92d949

由 zyfncg 提交于 8月 16, 2021

* Support NPU OP hard_swish and hard_swish_grad

* Support NPU OP hard_swish and hard_swish_grad

* add the unittest to compare the result between npu ans cpu

* format the prompt of exception

* replace Min and Max op by ClipByValue op

* fix the precision problem for fp16

* Using HardtanhGrad to improve performace

fd92d949

BaiXuePrincess / Paddle 与 Fork 源项目一致

BaiXuePrincess / Paddle
与 Fork 源项目一致