Commits · 9cbba97b3d3fcd4c2f4ca1bf8b6088df93af2cf9 · Crayon鑫 / Paddle

Aug 18, 2021 · 13 commits

[NPU]add rmsprop op (#34864) · 9cbba97b
Committed by lzzyzlbb on Aug 18, 2021
```
* [npu]add rmsprop op
```
9cbba97b

Add NPU kernel for norm Op: float16 and float32 (#34609) · 755c8a19

Committed by xiongkun on Aug 18, 2021

* Add NPU kernel for norm Op: float16 and float32

* fix code for code review

* fix for code review

* add type for paddle_throw

* remove unnecessary header file. Add more test cases

* remove a broadcast

755c8a19

fix pad outliers err (#34979) · 248e27b7

Committed by littletomatodonkey on Aug 18, 2021

* fix pad outliers err

* fix pad api input type and doc

* fix example of pad

* add unittest for pad3d

* fix unittest

* fix error format

* fix pad doc

248e27b7

code refactoring for new executor (#34970) · 40d4d834

Committed by wanghuancoder on Aug 18, 2021

* code refactoring, test=develop

* refine, test=develop

* refine, test=develop

* refine, test=develop

40d4d834

add paddle detection model in pr-ci-inference (#34986) · 1b747de7
Committed by Peihan on Aug 18, 2021

1b747de7
[NPU] Add square grad (#34889) · 1b71a718
Committed by Jackwaterveg on Aug 18, 2021
```
* test=develop

* test=develop
```
1b71a718
[NPU] Add leaky Relu (#34894) · 40f62737
Committed by Jackwaterveg on Aug 18, 2021
```
* test=develop

* test=develop
```
40f62737
[Hybrid Performance] Move the cast op of AMP which cast fp32 param to fp16... · a9673b44
Committed by WangXi on Aug 18, 2021
```
[Hybrid Performance] Move the cast op of AMP which cast fp32 param to fp16 param to the optimizer (#34965)
```
a9673b44

[CustomOp] Fix ext_tensor.cast failed bug (#34884) · 4d88cdb8

Committed by Chen Weihang on Aug 18, 2021

* fix ext_tensor.cast failed bug

* remove useless deps

* fix windows cmake failed

* try to fix windows make failed

* fix make error on Windows

4d88cdb8

Add function to disable paddle signal handler (#34577) · dd533dd3

Committed by Zhanlue Yang on Aug 18, 2021

* Add function to disable paddle signal handler

Paddle uses google::InstallFailureSignalHandler to handle selected system signals,
mainly for debugging and bug reporting.

However, this can conflict with other Python packages that capture the same signals,
such as tvm, among others.

To resolve this, we add a function that disables Paddle's signal handler.

* Remove signal test from WIN32 platform

* Remove redundant return from disable_signal_handler() function

* Add detailed messages to en_doc

dd533dd3
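
The conflict described in this commit comes from the fact that a process keeps only one handler per signal, so whichever library registers last wins. The following is a minimal sketch of that behavior using only Python's standard `signal` module. It is not Paddle's implementation: `run_scenario`, `framework_handler`, and `other_package_handler` are hypothetical names, and `SIGTERM` is an arbitrary choice for illustration. The commit itself refers to a `disable_signal_handler()` function, whose internals are not shown here.

```python
import signal

SIG = signal.SIGTERM  # stand-in for any signal two libraries both want to handle


def run_scenario(framework_handler_disabled: bool) -> list:
    """Raise SIG once and report which (hypothetical) handler ran."""
    events = []

    def framework_handler(signum, frame):
        # Hypothetical stand-in for a framework-installed fault handler.
        events.append("framework")

    def other_package_handler(signum, frame):
        # Hypothetical stand-in for a handler owned by another package.
        events.append("other_package")

    previous = signal.getsignal(SIG)
    try:
        # The other package registers its handler first.
        signal.signal(SIG, other_package_handler)
        # A process keeps one handler per signal, so a later registration
        # silently replaces the earlier one unless it is switched off.
        if not framework_handler_disabled:
            signal.signal(SIG, framework_handler)
        signal.raise_signal(SIG)
    finally:
        # Restore whatever was installed before the demo.
        signal.signal(SIG, previous if previous is not None else signal.SIG_DFL)
    return events
```

Called from the main thread on Python 3.8 or later, `run_scenario(False)` reports that only the framework's handler ran, while `run_scenario(True)` reports that the other package's handler ran. That is the effect a switch for disabling the framework's handler is meant to achieve.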

add the safe check for some ops (#34978) · 12bf046b
Committed by wawltor on Aug 18, 2021

12bf046b
[NPU] add retry on HcclGetRootInfo to fix "bind fail" (#34977) · 52a7b0c4
Committed by Leo Chen on Aug 18, 2021
```
* add retry for HcclGetRootInfo

* refine code

* reduce retry interval
```
52a7b0c4
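
The retry-with-interval pattern named in this commit can be sketched generically as follows. This is only an illustration in Python, not the actual change (which wraps a call to `HcclGetRootInfo`): `call_with_retry`, `make_flaky`, the attempt count, and the interval are all hypothetical, and `OSError` is an assumed stand-in for the transient "bind fail" error.

```python
import time


def call_with_retry(fn, max_attempts=3, interval_s=0.01):
    """Call fn() until it succeeds or max_attempts is reached."""
    if max_attempts < 1:
        raise ValueError("max_attempts must be at least 1")
    for attempt in range(1, max_attempts + 1):
        try:
            return fn()
        except OSError:
            if attempt == max_attempts:
                raise  # out of attempts: surface the last error
            time.sleep(interval_s)  # short pause before the next attempt


def make_flaky(failures):
    """Hypothetical call that fails `failures` times, then succeeds."""
    state = {"calls": 0}

    def flaky():
        state["calls"] += 1
        if state["calls"] <= failures:
            raise OSError("bind fail")
        return state["calls"]

    return flaky
```

With these definitions, `call_with_retry(make_flaky(2))` succeeds on the third attempt, while a call that keeps failing re-raises the last error once the attempts are used up. A shorter interval trades a faster recovery for more frequent attempts.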
support class center sample of PartialFC (#34106) · 100db44f
Committed by Guoxia Wang on Aug 18, 2021
```
* support class center sample of PartialFC
```
100db44f

Aug 17, 2021 · 10 commits

[NPU]Adamw skip update for npu (#34897) · b4474fb4
Committed by Roc on Aug 17, 2021

b4474fb4
[NPU] add where_index op and tests (#34951) · 1ef21855
Committed by Aganlengzi on Aug 17, 2021

1ef21855

Copy boost optional to Paddle (#34780) · 9be41447

Committed by chentianyu03 on Aug 17, 2021

* copy boost optional.hpp to paddle

* copy boost optional.hpp to paddle

* move directions

* del fluid/utils

* modify .hpp to .h

* move directions

* modify to paddle::optional

* add modification description

* format code style for the files in paddle/utils

* format code style

9be41447

[oneDNN ] disabling more ops caching (#34830) · f1c1d9e0

Committed by Jacek Czaja on Aug 17, 2021

* - disabled caching of layer norm

- fix in compilation

- compilation fix

- transpose caching disabled

- compilation fix

- more compilation fixes

- sum caching disabled

- compilation fix

* - LRN with disabled cache

* lint fixes

f1c1d9e0

[bug fix] fix unfold negative_size_param (#34943) · 8ef1bf87
Committed by shangliang Xu on Aug 17, 2021
```
* [bug fix] fix unfold negative_size_param
```
8ef1bf87
add mkl multi-thread test cases in PR-CI-INFERENCE (#34946) · 9d4f00bc
Committed by Peihan on Aug 17, 2021
```
* add mkl multi-thread test cases

* fix codestyle

* fix codestyle & enable ernie mkl test
```
9d4f00bc

Align CTC grad scale same with ESPNet (#34729) · 10f9644c

Committed by Hui Zhang on Aug 16, 2021

* dygraph support more ctc grad scale

* scale for 1.x

* fix unittest

* fix unittest

* format code

* fix unittest

* fix log info

* unittest cov

* fix format;notest,test=cpu,coverage

* skip ctc_loss egs;test=cpu

* warpctc grad cov;test=coverage

* add dygraph test;test=coverage

* format;test=cpu,coverage

* format;test=cpu

* add api compat;test=cpu

* add cpu test

* rename

* rename

* fix

* fix test

* format

* eigen cpu

* eigen gpu grad pass

* cuda gpu pass

* format

* fix ci

10f9644c

Add some passes which can be applied to Program (#34730) · 8046e33d

Committed by Zeng Jinle on Aug 17, 2021

* add inplace passes and tests

* update

* fix use_cuda undefined
fix compile error of op compat

* add more ut

* fix CPU CI error

* check adam unique

* fix mac/windows ci, improve coverage

* fix ci error

* follow weihang's comment

* fix BlockDesc::MoveFrom

* follow qiuliang's comment

* update

* follow huihuang's comments

8046e33d

add api fill_diagonal_inplace (#34460) · 5de576b0
Committed by zhiboniu on Aug 17, 2021

5de576b0
fix a bug in nlp: text_matching/sentence_transformers when last dim is 1 and... · 181f7cec
Committed by niuliling123 on Aug 17, 2021
```
fix a bug in nlp: text_matching/sentence_transformers when last dim is 1 and reduce mid dim (#34941)
```
181f7cec

Aug 16, 2021 · 17 commits

Fix typos in English docs for diag and diagflat. (#34869) · 35ef4180
Committed by Li Min on Aug 16, 2021
```
* Fix typos in english docs for diag and diagflat.
```
35ef4180

[NPU] Support npu op:(1)arg_min (2)arg_max (#34867) · b1cc4a46

Committed by veyron95 on Aug 16, 2021

* [NPU] Support npu op:(1)arg_min (2)arg_max

* Modify and add unit test cases

* Modify unit test cases

b1cc4a46

Jetson nano bilinear (#34751) · 2a4ed087

Committed by feng_shuai on Aug 16, 2021

* change bilinear thread for nano and tx2

* change bilinear thread for nano and tx2

2a4ed087

hccl init sync (#34918) · 6b4b9fea
Committed by Baibaifan on Aug 16, 2021

6b4b9fea

[NPU] Add size npu op (#34636) · 49818943

Committed by 0x45f on Aug 16, 2021

* add size npu op

* modify support data type

* no longer use NPU size OP

* remove useless comments, add test case

* fix copyright, remove useless include

49818943

Change the invoking method of setitem by Ellipsis and None index from numpy to... · 2e30134f

Committed by zyfncg on Aug 16, 2021

Change the invoking method of setitem by Ellipsis and None index from numpy to set_value op (#34911)

* Change the invoking method of setitem by Ellipsis and None index from numpy to set_value op

* add none_axes into attr of set_value_op in dygraph mode

2e30134f

[CPU-PSLIB] Add config for scale_sparse_grad in config_fleet.py,test=develop (#34893) · d028214d
Committed by Fan Zhang on Aug 16, 2021

d028214d

Fix elementwise_add quantization (#34820) · ae80df91

Committed by joanna.wozna.intel on Aug 16, 2021

* Remove force_fp32_output from elementwise_add quantization

* Fix cpu_quantize_placement test

* Review related changes

ae80df91

[oneDNN] Fix to 34554 (same as previous PR but should build with GPU) (#34859) · 9cb65653

Committed by Jacek Czaja on Aug 16, 2021

* - Added softmax without caching

* - Binary is no longer manually cached

* - Activation onednn caching removed

* - Removed manual caching of activation

* - modified UT

* - fix

* - fix

* - fixes to building

* - fix

* - fix

* - fix to UT

* - Faulty UT workaround

* - approval workaround

* - Fixes after review

* - compilation fixes

* - more lint fixes

* - more fixes after review

* - fixes after another round of review

* - hopefully compilation fix

- compilation fix

9cb65653

[NPU] add nearest_interp_v2 and nearest_interp_v2_grad, test=develop (#34769) · 3b9f040d
Committed by Qi Li on Aug 16, 2021

3b9f040d

[NPU] Support NPU kernel for nearest_interp and nearest_interp_grad op (#34881) · e4e8cc9b

Committed by From00 on Aug 16, 2021

* Add NPU kernel for nearest_interp op

* Add grad op

* Modify codes according to the review comments

* Modify codes according to the review comments

e4e8cc9b

add unique_consecutive_op (#34334) · 875cfd57

Committed by duanboqiang on Aug 16, 2021

* add unique_consecutive_op

* add unique_consecutive_op

* add unique_consecutive_op

* add unique_consecutive_op

* add unique_consecutive_op

* add unique_consecutive_op

* add unique_consecutive_op

* add unique_consecutive_op

* remove unity build

* add unique_consecutive op

* add unique_consecutive op

* add enable static

* add noqa

* add space line

* add default case.

* add comma

* add space line

* modify unique_consecutive unittest

* optimize ut coverage

* rebase develop

* improve coverage

* update en docs

* update en docs

* update en docs

* update en docs

* update en docs

* update en doc

875cfd57

support margin loss (arcface, cosface, sphereface) for single GPU and cross GPUs (#34247) · b0cb4148
Committed by Guoxia Wang on Aug 16, 2021
```
* support margin loss (arcface, cosface, sphereface)
```
b0cb4148
Enhance tensor shape check for dist op. (#34915) · dc439a12
Committed by Zhong Hui on Aug 16, 2021

dc439a12

Support npu op hard_swish and hard_swish_grad (#34608) · fd92d949

Committed by zyfncg on Aug 16, 2021

* Support NPU OP hard_swish and hard_swish_grad

* Support NPU OP hard_swish and hard_swish_grad

* add the unittest to compare the result between npu and cpu

* format the prompt of exception

* replace Min and Max op by ClipByValue op

* fix the precision problem for fp16

* Using HardtanhGrad to improve performance

fd92d949

Add bcast semantics checks at C++ level to BroadcastTensorsOp (#34874) · e84b2e9b
Committed by Zhanlue Yang on Aug 16, 2021

e84b2e9b
[NPU] remove npu int64 kernel for increment op (#34909) · 28279f6f
Committed by Leo Chen on Aug 16, 2021

28279f6f

Crayon鑫 / Paddle is in sync with the fork source project
