提交 · d6b54de46753827c23cabe5f3307f7493db194d0 · 机器未来 / Paddle

20 9月, 2020 1 次提交

【paddle.fleet】Fix/role maker api fix (#27326) · d6b54de4

由 tangwei12 提交于 9月 20, 2020

* fix fleet util and gloo

* fix worker endpoints

* fix

* fix UT

* fix gloo

* fix gloo

* update gloo

* update gloo

* update gloo

* update gloo

* update gloo

* fix gloo wrapper for hdfs

* add file gloo and UT

* fix UT

* fix UT

* fix UT

* hide public method of RoleMaker

* fix UT

* GPU fleetrun support gloo

* parameterserver fleetrun support gloo

* add UT

* add UT

* fix UT

* fix get server endpoint

* fix get server endpoint

* fix UT

* hide public method of rolemaker

* hide public method of rolemaker

* hide public method of rolemaker

* Update test_fleet_rolemaker_new.py

* hide public method of rolemaker

* hide public method of rolemaker

d6b54de4

18 9月, 2020 5 次提交
- T
  【paddle.fleet】gloo and util (#27213) · 99626502
  由 tangwei12 提交于 9月 18, 2020
```
* fix worker endpoints

* fix gloo wrapper for hdfs

* GPU fleetrun support gloo

* parameterserver fleetrun support gloo

* fix get server endpoint
```
  99626502
- L
  
  fix paddle.nn.Transformer api (#27391) · 4c5cfdea
  由 liu zhengxi 提交于 9月 18, 2020
  
  4c5cfdea
- P
  register pass compatibility (#27357) · fd7ab4e6
  由 Pei Yang 提交于 9月 18, 2020
```
* pass compatibility

* add compatibility registry

* add unittests for different padding

* add assert

* drop errmsg
```
  fd7ab4e6
- H
  
  Add 3 pass version check (#27283) · 7e6dfcf9
  由 haozech 提交于 9月 18, 2020
  
  7e6dfcf9
- C
  
  fix cross_entropy bug of the axis parameter in log_softmax (#27311) · fef94eac
  由 chajchaj 提交于 9月 18, 2020
  
  fef94eac
17 9月, 2020 10 次提交
- L
  [Dy2Stat-log] Add feature also_to_stdout and optimize log messages (#27285) · ac82baa8
  由 liym27 提交于 9月 17, 2020
```
* Add env value to  log to stdout; 2.Add logger name

* Optimize log messages in dygraph-to-static

* Replace logging.warn and warnings.warn with logging_utils.warn
```
  ac82baa8
- S
  
  add op version checker to ir passes (#27329) · 3c117179
  由 Shang Zhizhou 提交于 9月 17, 2020
  
  3c117179
- F
  add empty_like op (python, and unit test), use c++ implementation of empty op, (#27287) · 515efe42
  由 furnace 提交于 9月 17, 2020
```
and optimize the c++ implmentation of empty op as PR#26659 reviews,
and add bool for shape op.
```
  515efe42
- 1
  【Fleet2.0 Util】 add documents (#26698) · f36b9a7f
  由 123malin 提交于 9月 17, 2020
```
* test=develop, util documents
```
  f36b9a7f
- J
  enhance reduce op which can reduce tensor with arbitrary rank · 63203c4a
  由 Jack Zhou 提交于 9月 17, 2020
```
enhance reduce op which can reduce tensor with arbitrary rank 
```
  63203c4a
- Y
  
  cancel three disable ut (#27359) · f0a5eef5
  由 YUNSHEN XIE 提交于 9月 17, 2020
  
  f0a5eef5
- Y
  
  del exclusive ut which name with test_dist_ (#27316) · 25902b2c
  由 YUNSHEN XIE 提交于 9月 17, 2020
  
  25902b2c
- S
  Fix elementwise_floordiv op (#27352) · 9ee77b1f
  由 ShenLiang 提交于 9月 17, 2020
```
* fix floordiv
```
  9ee77b1f
- G
  Refine the unittest to support py38 (#27208) · 9bea834e
  由 guofei 提交于 9月 17, 2020
```
* Refine the unittest to support py38

    test=develop
```
  9bea834e
- W
  modify test_imperative_using_non_zero_gpu from use two gpus to one gpu (#27348) · bf8e030e
  由 wanghuancoder 提交于 9月 17, 2020
```
* add op_function_generator.exe retry in windows, test=develop

* modify test_imperative_using_non_zero_gpu from use two gpus to one gpu, test=develop
```
  bf8e030e
16 9月, 2020 12 次提交
- L
  Remove unnecessary requirements (#27341) · 189e10f1
  由 Leo Chen 提交于 9月 16, 2020
```
* remove objgraph

* remove graphviz

* fix ut
```
  189e10f1
- G
  
  Cleanup redundant code files (#27319) · 11bcf0e2
  由 gongweibao 提交于 9月 16, 2020
  
  11bcf0e2
- S
  add adaptivelsgd in meta_optimizer (#27289) · 54b81fa3
  由 ShenLiang 提交于 9月 16, 2020
```
* add adaptivelsgd

* Todo fix the code to avoid the conflict.
```
  54b81fa3
- C
  Support load state_dict from save_params/persistables (#27298) · c23f09fe
  由 Chen Weihang 提交于 9月 16, 2020
```
* support load state_dict from save_params/persistables

* remove failed unittest

* add load eof check & unittest

* remove eof check
```
  c23f09fe
- Y
  
  refine fleet dataset class api (#27133) · c67c3916
  由 yaoxuefeng 提交于 9月 16, 2020
  
  c67c3916
- S
  fix error message in broadcast/allreduce/gather (#27302) · c296618c
  由 ShenLiang 提交于 9月 16, 2020
```
* fix error message
```
  c296618c
- C
  Add input_spec & output_spec for TranslatedLayer (#27284) · 950301bf
  由 Chen Weihang 提交于 9月 16, 2020
```
* add input_spec & output_spec for translated_layer

* update error message
```
  950301bf
- L
  
  add regularizer api (#27292) · 18fc9275
  由 littletomatodonkey 提交于 9月 16, 2020
  
  18fc9275
- Y
  
  move three ut to execute only at night (#27314) · 8fe1c2d1
  由 YUNSHEN XIE 提交于 9月 16, 2020
  
  8fe1c2d1
- Z
  
  fix the test_fleet_lars_meta_optimizer ut. (#27291) · ef6dd6b8
  由 Zhen Wang 提交于 9月 16, 2020
  
  ef6dd6b8
- D
  fix ports conflict when use paddlecloud to launch analogue multi-nodes (#26191) · 389a9a7e
  由 danleifeng 提交于 9月 16, 2020
```
* fix ports conflict when launching multi-nodes in paddlecloud;test=develop

* add DISTRIBUTED_TRAINER_ENDPOINTS env for cloud;test=develop
```
  389a9a7e
- C
  
  Disable unit-test test_fleet_rolemaker_new · c8e54c5e
  由 chalsliu 提交于 9月 16, 2020
  
  c8e54c5e
15 9月, 2020 4 次提交

Optimize slice trt plugin (#26970) · 47fdc60e

由 Shang Zhizhou 提交于 9月 15, 2020

* optimize slice TRT plugin

This patch removes unnecessary barrier for data transfer of needed offset,
so data transfer can be overlap with GPU kernel execution.

This patch also fixes incorrect name of slice plugin. That is, replaces
"layernorm" with "slice"

test=develop

* add serialize/deserialize to slice plugin

* add static shape slice trt plugin

* fix slice trt op convertor dynamic shape bug

* fix format by clang-format

* fix pylint format error

* fix problems commented by peiyang
Co-authored-by: NRyan Jeng <rjeng@nvidia.com>

47fdc60e

W

[Pass Compatible] Bind python compatible. (#27262) · f827665a
由 Wilber 提交于 9月 15, 2020

f827665a

Optimize error report (#27254) · e6e2e537

由 Shang Zhizhou 提交于 9月 15, 2020

* optimize errror report

* add test case for pad op converter

* fix some spelling mistake commented by peiyang

e6e2e537

G
change sequence length attribute to input (#27193) · ee1ed42c
由 GaoWei8 提交于 9月 15, 2020
```
* replace sequence length attr to input
```
ee1ed42c

14 9月, 2020 7 次提交

Y

disable three unittests,test=document_fix (#27299) · 6947a58a
由 YUNSHEN XIE 提交于 9月 14, 2020

6947a58a
Z

paddle.nn.functional.logsigmoid -> log_sigmoid (#27277) · ac9afa02
由 zhupengyang 提交于 9月 14, 2020

ac9afa02
M
add check for sparse parameters with weight_decay (#27141) · 91663073
由 MRXLT 提交于 9月 14, 2020
```
* add check for sparse parameters with weight_decay

* move sparse check to adam.py
```
91663073
C
move DataLoader._worker_loop to top level (#27247) · 8d531727
由 Chen Weihang 提交于 9月 14, 2020
```
* move worker loop to top level

* move reader process loop to top level

* fix failed unittests
```
8d531727

Update amp_check_finite_and_scale_op and add an updating_loss_scaling op for... · d708b210

由 Zhen Wang 提交于 9月 14, 2020

Update amp_check_finite_and_scale_op and add an updating_loss_scaling op for static graph amp training. (#26240)

* update amp_check_finite_and_scale_op for static_amp.

* use amp_check_finite_and_scale in static graph amp.

* update grads to zero when grads own infinite values(as for amp_checkout_finite_and_scale op).

* add update_loss_scaling op in cpp.

* add update_loss_scaling_op unit test.

* update the doc of the check_finite_and_unscale op

* Update the process of gradients updating skipping if the gradients have infinite values.

* update the way to zero grads.

* update test_update_loss_scaling_op.py

* add log info when find infinite grads.

* add the unit test for UpdateLossScaling Layer.

d708b210

S
remove auto mode from localsgd optimizer (#27237) · 2b6a5793
由 ShenLiang 提交于 9月 14, 2020
```
* rm auto from localsgd
```
2b6a5793
A
Add int8 GRU kernel (#27220) · cc3f4b81
由 Adam 提交于 9月 14, 2020
```
* Add int8 GRU kernel with UTs

* Lint fixes

* More lint fixes
```
cc3f4b81

11 9月, 2020 1 次提交
- L
  Temporally disable zero_copy (#27248) · 19228bd1
  由 Leo Chen 提交于 9月 11, 2020
```
* temporally disable zero_copy

* add test

* follow comments
```
  19228bd1

机器未来 / Paddle 与 Fork 源项目一致

机器未来 / Paddle
与 Fork 源项目一致