Commits · 549855ac20329cac96331b072ebede5eea2c2619 · 机器未来 / Paddle

19 Jan, 2021 · 7 commits
- add rmsprop_op_xpu test=kunlun (#30493) · 549855ac
  Committed by ykkk2333 on Jan 19, 2021
```
* add rmsprop_op_xpu test=kunlun

* modified rmsprop_op_xpu error code. test=kunlun
```
- fix bug of multicard grad ncclAllReduce (#30553) · fb20ec9a
  Committed by Zhou Wei on Jan 19, 2021
- Fix the compiling error of update_loss_scaling when using cuda9. (#30538) · f30d0055
  Committed by Zhen Wang on Jan 19, 2021
- unify calling cudaSetDevice (#30470) · 81217a94
  Committed by Leo Chen on Jan 19, 2021
```
* unify calling cudaSetDevice

* fix compile
```
- fix error message of Inplace strategy (#30520) · 00554b3f
  Committed by pangyoki on Jan 19, 2021
- support layer_norm fp16 in dygraph amp (#30430) · 7043b8cf
  Committed by Leo Chen on Jan 19, 2021
```
* support layer_norm fp16 in dygraph amp

* add ut

* refine code
```
- delete empty line of pybing.cc, test=develop (#30529) · 59ad6ff3
  Committed by wanghuancoder on Jan 19, 2021
18 Jan, 2021 · 7 commits
- Ascend Framework Part2: pybind files (#30410) · e207fe63
  Committed by hutuxian on Jan 18, 2021
- Ascend Framework Part1: OP & Wrapper (#30281) · 40ede126
  Committed by hutuxian on Jan 18, 2021
- [Kunlun]PR3: add xpu executor, multi xpu card train function optimization (#30317) · 843dc3cd
  Committed by liuyuhui on Jan 18, 2021
- optimize batch_norm & pool op for kunlun (#30490) · 8489d4f7
  Committed by QingshuChen on Jan 18, 2021
- if pybind.cc changed, generate total report, test=develop (#30514) · bd971922
  Committed by wanghuancoder on Jan 18, 2021
- fix range op crash in dygraph xpu place (#30469) · 5e5c2827
  Committed by taixiurong on Jan 18, 2021
- Recompute Offload: fixed bug in memcpy (#30484) · 16ba0abc
  Committed by JZ-LIANG on Jan 18, 2021
17 Jan, 2021 · 1 commit
- Modify the calculation logic of LambOptimizer (#29313) · 11e78eba
  Committed by guofei on Jan 17, 2021
```
* Modify the calculation logic of LambOptimizer
```
16 Jan, 2021 · 1 commit
- [oneDNN] Refactor fuse pass helper functions to one place. (#30460) · c5ffad12
  Committed by Adam Osewski on Jan 16, 2021
```
* Move pass tester helper functions to single common place.

* Use helper functions in two more fuse pass tests.
```
15 Jan, 2021 · 6 commits

add VecCastCUDAKernel (#30296) · c9a334e1
Committed by Zhang Ting on Jan 15, 2021

Add Inplace strategy (Output reuse Input Varbase) in dygraph (#30103) · 13d75736
Committed by pangyoki on Jan 15, 2021

* add view strategy on squeeze,unsqueeze,reshape,flatten

* add squeeze unittest

* add unittests

* use View strategy as name rather than Reuse Allacation

* fix view api doc

* fix format

* use core.ops when input of reshape2 is Tensor

* fix test_cross_entropy_loss error because of reshape2

* fix test_cross_entropy_loss error because of reshape2

* add inplace strategy

* add elementwise_add sub

* let backward op not use inplace

* grad op do not use inplace

* fix memory increase error and add leaf error message

* delete selected_rows

* change op_function

* little change

* solve HandleViewBetweenInputAndOutput

* add unittest and leaf error message

* merge view error

* optimize op_function_generator format and support sum inplace op

* fix format of basic_engine

* fix format for framework

* little change of variable wrapper

* add reshape, squeeze, unsqueeze, scatter api

* add relu elu tanh softmax inplace api

* fix test_squeeze_op unittest

* fix test_relu_op unittest

* fix comment problems

* delete sample code of inplace api

* add reference of grad_pending_nodes in basic_engine

* fix unittest name

* add inplace apis into wlist

* fix error message

* add PADDLE_ENFORCE for set grad op twice

* fix head file error

Fix float64 bug in layer norm (#30452) · 008b0a8b
Committed by Yang Zhang on Jan 15, 2021
```
built-in `rsqrt` is shadowed
```

export global google flags to users, test=develop (#30448) · 715d8628
Committed by 石晓伟 on Jan 15, 2021

fix cache key for inplaced elementwise ops (#30404) · 88fc7a7d
Committed by Wojciech Uss on Jan 15, 2021

fix the rnn mask memory bug for out of read (#30459) · 3d49882e
Committed by wawltor on Jan 15, 2021
```
* fix the rnn mask memory bug for out of read

* update the code for the rnn
```

14 Jan, 2021 · 5 commits
- support transformer v2.0 (#30381) · 6a3c8725
  Committed by taixiurong on Jan 14, 2021
- fix flatten api grad (#30426) · e85be1b1
  Committed by ShenLiang on Jan 14, 2021
- Heter ps new (#30198) · 6e0da01c
  Committed by yaoxuefeng on Jan 14, 2021
- test=develop, add distributed_infer (#30300) · 2a98e932
  Committed by 123malin on Jan 14, 2021
```
* test=develop, add distributed_infer
```
- fix bug that cann't find mkldnn(kunlun) (#30394) · cf786d22
  Committed by QingshuChen on Jan 14, 2021
13 Jan, 2021 · 8 commits

skip quantizing ops in cpu inference (#30342) · 8e3a2940
Committed by cc on Jan 13, 2021
```
* skip quantizing ops in cpu inference, test=develop
```

Added support for inference using quantization aware trained dygraph (#30288) · 7bbf3ac5
Committed by alncat on Jan 13, 2021

* added support for inference using qunatization aware trained dygraph

* added support for inference using qunatization aware trained dygraph
correct boost get usage

* Delete incorrect warning message (#30196)

* fix warning and no grad

* clean redundant API alias in 2.0 - part 2 (#30013)

* delete paddle.nn.functional.assign

* fix dynamic to static error

* just add the op error message for the matmul xpu (#30246)

 add the op error message for the matmul xpu

* Add Static Variable Clone (#30208)

Add clone method for static Variable so that this interface will be same as dygraph. It fixed some bugs in dy2stat

* use wget to replace curl to download the lcov file (#30229)

* use wget to replace curl to download the lcov file

* add cache for lcov

* fix test_pool3d_op timeout issue (#30248)

* Fix unittests bugs. (#30250)

* modify error message based on comments (#30189)

* modify error message based on comments

* edit code according to review.

* Correct spelling according to review.

* Fix bug for 'save mutiple method' (#30218)

* Fix bug for 'save mutiple method'

* To pass coverage.

* edit code to pass coverage.

* edit code to pass coverage.

* add unittest for coverage.

* change for coverage.

* edit for coverage.

* added support for inference using qunatization aware trained dygraph

* Alias from  paddle.fluid.layers.auc to paddle.static.auc (#30206)

* add alias from  fluid.layers.auc to static.auc

* Update __init__.py

* added support for inference using qunatization aware trained dygraph
correct boost get usage

* corrected boost get usage

* corrected naming issues and enforcing zero check

* correct paddle enforce message

* added more error checkings

* corrected error report message and optimized code

* corrected findvar usage

* corrected paddle_enforce in scope

* correct error messages

* correct error reporting format

Co-authored-by: LielinJiang <50691816+LielinJiang@users.noreply.github.com>
Co-authored-by: XiaoguangHu <46782768+XiaoguangHu01@users.noreply.github.com>
Co-authored-by: wawltor <fangzeyang0904@hotmail.com>
Co-authored-by: Huihuang Zheng <zhhsplendid@gmail.com>
Co-authored-by: YUNSHEN XIE <1084314248@qq.com>
Co-authored-by: Bai Yifan <me@ethanbai.com>
Co-authored-by: gongweibao <weibao.gong@gmail.com>
Co-authored-by: WeiXin <weixin10@baidu.com>
Co-authored-by: Jiaqi Liu <liujiaqi06@baidu.com>

Softmax backward optimize (#30249) · 180877e9
Committed by GaoWei8 on Jan 13, 2021
```
* softmax backward optimize
```

fix bug on compiling inference shared lib with crypto;test=develop (#30269) · 10a8f3e5
Committed by Zhang Jun on Jan 13, 2021

* fix bug on compiling inference shared lib with crypto;test=develop

* fix cmake bug when build inference lib using -DWITH_CRYPTO=OFF

* update cmake

* remove unnecessary enforce message

Fix Sleep Error in enforce.h (#30335) · 28e156c2
Committed by Huihuang Zheng on Jan 13, 2021

The `usleep` function in `<unistd.h>` only accepts arguments less than 1,000,000. The current call can exceed this limit, so it has to be fixed. This PR fixes a random CI error.
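
The constraint described above can be sketched as follows. This is a minimal illustration, not the code from this PR: `SplitSleepMicros` and `SleepMicros` are hypothetical names, and the sketch assumes a POSIX system where `usleep` may reject arguments of 1,000,000 or more. It splits a long delay into chunks that each stay below the limit.

```cpp
#include <unistd.h>

#include <cstdint>
#include <vector>

// Largest value that stays strictly below the 1,000,000 limit noted above.
constexpr std::uint64_t kMaxUsleepMicros = 999999;

// Hypothetical helper: split a total delay (in microseconds) into chunks
// that are each small enough to pass to usleep().
std::vector<useconds_t> SplitSleepMicros(std::uint64_t total_us) {
  std::vector<useconds_t> chunks;
  while (total_us > 0) {
    const std::uint64_t step =
        total_us < kMaxUsleepMicros ? total_us : kMaxUsleepMicros;
    chunks.push_back(static_cast<useconds_t>(step));
    total_us -= step;
  }
  return chunks;
}

// Hypothetical wrapper: sleep for total_us using only in-range usleep() calls.
void SleepMicros(std::uint64_t total_us) {
  for (useconds_t chunk : SplitSleepMicros(total_us)) {
    usleep(chunk);
  }
}
```

Calling `SleepMicros(2500000)` would issue three `usleep` calls instead of one out-of-range call. An alternative under the same assumption is `std::this_thread::sleep_for` from `<thread>`, which has no such per-call limit.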

Set expected place in child thread for dataloader to avoid costing cuda memory on other card (#30338) · 3d015f1c
Committed by Leo Chen on Jan 13, 2021

* set expected place in child thread for dataloader

* set device id when set tensor from numpy

* revert tensor_py change

* add compile guard

* fix ci

* fix bug

optimize memcpy perf for kunlun (#30291) · 2c1bba02
Committed by QingshuChen on Jan 13, 2021
```
* optimize memcpy perf for kunlun

* remove useless unitest for kunlun mean

* minor
```

Support unused parameters in dynamic graph distributed (#30224) · a60f17b8
Committed by ShenLiang on Jan 13, 2021

12 Jan, 2021 · 5 commits
- Recompute Offload (#30233) · 75936d83
  Committed by JZ-LIANG on Jan 12, 2021
- correct the allowed dimension size (#30326) · a60893f6
  Committed by lidanqing on Jan 12, 2021
- remove c++ stacktrace hint (#30325) · c8c8f205
  Committed by Chen Weihang on Jan 12, 2021
- add sparse embedding & load vars for 2.0 & gloo bug fix (#30306) · 5e839e4d
  Committed by tangwei12 on Jan 12, 2021
```
* add sparse embedding & load vars for 2.0

Change-Id: I36b59ed5f015189dc9d9d2e34a9357722d369f1b

* fix hdfs gloo

Change-Id: Ia84d579053720ad804183e54c9a04b4f031c79c6

* fix gloo hdfs

Change-Id: I5ab982fd483cddc10adcdef0b8aa83aca976cb9e

* move loadvar/sparse embedding from incubute to static

Change-Id: I57081d3545ad2efab78c72420d2162c0eacaf3a0
```
- Fix/distributed proto (#29981) · 25f80fd3
  Committed by tangwei12 on Jan 12, 2021
```
* rename sendrecv.proto to namespace paddle.distributed

* split ps with distributed
```