- 16 Sep 2020, 8 commits
- Committed by yaoxuefeng
- Committed by ShenLiang
  * fix error message
- Committed by Chen Weihang
  * add input_spec & output_spec for translated_layer
  * update error message
- Committed by littletomatodonkey
- Committed by YUNSHEN XIE
- Committed by Zhen Wang
- Committed by danleifeng
  * fix ports conflict when launching multi-nodes in paddlecloud;test=develop
  * add DISTRIBUTED_TRAINER_ENDPOINTS env for cloud;test=develop
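Note on the commit above: on paddlecloud the scheduler can pre-assign trainer endpoints, so reading them from DISTRIBUTED_TRAINER_ENDPOINTS avoids racing for free ports on each node. A minimal sketch of that idea, assuming the variable holds a comma-separated list of ip:port pairs; the helper and format below are illustrative, not the launch utility's actual code:

```python
import os

def get_trainer_endpoints(trainer_num, start_port=6170):
    """Prefer endpoints pre-assigned by the cluster scheduler."""
    pre_assigned = os.environ.get("DISTRIBUTED_TRAINER_ENDPOINTS")
    if pre_assigned:
        # e.g. "10.0.0.1:6170,10.0.0.1:6171,10.0.0.2:6170,10.0.0.2:6171"
        return pre_assigned.split(",")
    # Fallback for a single local node: derive consecutive ports.
    return ["127.0.0.1:%d" % (start_port + i) for i in range(trainer_num)]

print(get_trainer_endpoints(4))
```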
- Committed by chalsliu
- 15 Sep 2020, 5 commits
- Committed by Shang Zhizhou
  * optimize slice TRT plugin: remove an unnecessary barrier for the data transfer of the needed offsets, so the transfer can overlap with GPU kernel execution; also fix the incorrect name of the slice plugin, replacing "layernorm" with "slice". test=develop
  * add serialize/deserialize to slice plugin
  * add static shape slice trt plugin
  * fix slice trt op convertor dynamic shape bug
  * fix format by clang-format
  * fix pylint format error
  * fix problems commented by peiyang
  Co-authored-by: Ryan Jeng <rjeng@nvidia.com>
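Note on the first bullet above: the win comes from not synchronizing before the offsets are actually consumed, so the host-to-device copy and independent GPU work can proceed concurrently. A minimal, framework-agnostic sketch of that pattern, using CuPy streams and pinned memory purely as an illustration (this is not the plugin's CUDA/TensorRT code):

```python
import numpy as np
import cupy as cp

n = 1 << 20
# Pinned host buffer: required for a truly asynchronous host-to-device copy.
pinned = cp.cuda.alloc_pinned_memory(n * np.float32().nbytes)
h_offsets = np.frombuffer(pinned, dtype=np.float32, count=n)
h_offsets[:] = np.arange(n, dtype=np.float32)

copy_stream = cp.cuda.Stream(non_blocking=True)
d_offsets = cp.empty(n, dtype=cp.float32)
d_x = cp.random.rand(n, dtype=cp.float32)

# Enqueue the copy on its own stream instead of synchronizing first ...
d_offsets.set(h_offsets, stream=copy_stream)
d_y = cp.tanh(d_x)            # ... so independent GPU work can overlap with it.

copy_stream.synchronize()     # wait only at the point the offsets are consumed
d_z = d_y + d_offsets
print(float(d_z[0]))
```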
- Committed by Wilber
- Committed by Shang Zhizhou
  * optimize error report
  * add test case for pad op converter
  * fix some spelling mistakes commented by peiyang
- Committed by GaoWei8
  * replace the sequence length attr with an input
- Committed by cc
  * Remove the cache in post_training_quantization, test=develop
- 14 Sep 2020, 7 commits
- Committed by YUNSHEN XIE
- Committed by zhupengyang
- Committed by MRXLT
  * add check for sparse parameters with weight_decay
  * move sparse check to adam.py
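Note on the commit above: decaying a sparsely updated parameter (e.g. an embedding table where only the looked-up rows receive gradients) would densify the update, so the optimizer should refuse the combination up front. A minimal sketch of such a guard; the class and attribute names are illustrative, not Paddle's actual adam.py code:

```python
class _Param:                      # stand-in for a framework parameter
    def __init__(self, name, is_sparse=False):
        self.name, self.is_sparse = name, is_sparse

def check_sparse_weight_decay(parameters, weight_decay):
    """Reject weight decay on parameters that receive sparse gradient updates."""
    if not weight_decay:
        return
    for p in parameters:
        if p.is_sparse:
            raise ValueError(
                "weight_decay is not supported for sparse parameter %r; "
                "set weight_decay=None or use a dense parameter." % p.name)

try:
    check_sparse_weight_decay([_Param("fc.w"), _Param("emb.w", is_sparse=True)], 0.01)
except ValueError as e:
    print(e)
```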
- Committed by Chen Weihang
  * move worker loop to top level
  * move reader process loop to top level
  * fix failed unittests
- Committed by Zhen Wang
  Update amp_check_finite_and_scale_op and add an update_loss_scaling op for static graph amp training. (#26240)
  * update amp_check_finite_and_scale_op for static_amp.
  * use amp_check_finite_and_scale in static graph amp.
  * set grads to zero when they contain infinite values (as done by the amp_check_finite_and_scale op).
  * add update_loss_scaling op in cpp.
  * add update_loss_scaling_op unit test.
  * update the doc of the check_finite_and_unscale op.
  * update the process of skipping the gradient update when the gradients contain infinite values.
  * update the way to zero grads.
  * update test_update_loss_scaling_op.py
  * add log info when infinite grads are found.
  * add the unit test for the UpdateLossScaling layer.
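Note on the commit above: dynamic loss scaling keeps the scale as large as possible while skipping steps whose scaled gradients overflow. A minimal, framework-agnostic sketch of that update rule; the constants and function signature are simplified relative to the real op, which also tracks a separate decrement counter:

```python
import math

def update_loss_scaling(grads, state, incr_every_n=1000,
                        incr_ratio=2.0, decr_ratio=0.5):
    """Adjust the loss scale based on whether any gradient is non-finite."""
    found_inf = any(not math.isfinite(g) for g in grads)
    if found_inf:
        state["good_steps"] = 0
        state["loss_scale"] *= decr_ratio      # back off the scale
        grads[:] = [0.0] * len(grads)          # zero grads so the step is a no-op
    else:
        state["good_steps"] += 1
        if state["good_steps"] >= incr_every_n:
            state["loss_scale"] *= incr_ratio  # try a larger scale again
            state["good_steps"] = 0
    return found_inf

state = {"loss_scale": 2.0 ** 15, "good_steps": 0}
grads = [0.1, float("inf")]
update_loss_scaling(grads, state)              # scale halves and grads are zeroed
print(state["loss_scale"], grads)
```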
- Committed by ShenLiang
  * rm auto from localsgd
- Committed by Adam
  * Add int8 GRU kernel with UTs
  * Lint fixes
  * More lint fixes
- 11 Sep 2020, 8 commits
- Committed by Leo Chen
  * temporarily disable zero_copy
  * add test
  * follow comments
- Committed by Leo Chen
- Committed by Aurelius84
  * fix calcu_gradients
  * fix code place
  * fix embedding interface usage
- Committed by Chen Weihang
- Committed by liym27
- Committed by Chen Weihang
- Committed by Aurelius84
  * support to_static(model)
  * add warning and unittest
- Committed by furnace
- 10 Sep 2020, 11 commits
- Committed by Zhen Wang
  * Use a single GPU card to execute the test_fuse_bn_act_pass UT.
- Committed by Zhen Wang
- Committed by lilong12
  * add double grad for tile, test=develop
  * add double grad for expand_v2 op, test=develop
- Committed by lilong12
  * add double grad for expand, test=develop
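Note on the two lilong12 commits above: a double-grad (second-order) kernel is what lets autograd differentiate through the gradient of tile/expand. A minimal sketch of what that enables, assuming the Paddle 2.x dygraph API (paddle.grad, paddle.tile); exact availability depends on the installed version:

```python
import paddle

x = paddle.to_tensor([1.0, 2.0, 3.0], stop_gradient=False)
y = paddle.tile(x, repeat_times=[2])        # the same idea applies to paddle.expand
loss = (y * y).sum()

# First-order grad; keep the graph so it can be differentiated again.
(dx,) = paddle.grad(loss, x, create_graph=True)
# Second-order grad; this step needs the tile/expand double-grad kernel.
(ddx,) = paddle.grad(dx.sum(), x)
print(dx.numpy(), ddx.numpy())              # expected: [4. 8. 12.] and [4. 4. 4.]
```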
- Committed by Chen Weihang
  * add some unittest cases to verify jit.save, no_test
  * add more unittests
  * add test with example inputs
  * polish implementation details
  * remove useless blanks
  * fix fetch random error
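Note on the commit above (and the related input_spec/output_spec change on 16 Sep): the tests exercise saving a layer with explicit example inputs and loading it back. A hedged sketch of that workflow, assuming the Paddle 2.x jit API (paddle.jit.to_static, paddle.jit.save/load, paddle.static.InputSpec); details may differ in the version the commit targets:

```python
import paddle
from paddle.static import InputSpec

class Net(paddle.nn.Layer):
    def __init__(self):
        super().__init__()
        self.fc = paddle.nn.Linear(8, 2)

    # Declaring an input_spec fixes the traced signature, so no example
    # tensor has to be fed before saving.
    @paddle.jit.to_static(input_spec=[InputSpec(shape=[None, 8], dtype='float32')])
    def forward(self, x):
        return self.fc(x)

net = Net()
paddle.jit.save(net, "./saved/net")      # writes the translated program and params
loaded = paddle.jit.load("./saved/net")  # returns a TranslatedLayer
print(loaded(paddle.rand([4, 8])).shape)
```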
- Committed by ShenLiang
- Committed by 123malin
  * parameter_server_optimizer supports auto_strategy
- Committed by wawltor
  Fix the CudaPinMemory bug for the equal op and add a test case for the equal op.
- Committed by liym27
- Committed by Huihuang Zheng
  Decrease the number of running iterations to reduce CI time. The CI system shows this cut the unittest time from about 90 seconds to about 30 seconds.
- Committed by zhupengyang
- 09 Sep 2020, 1 commit
- Committed by JZ-LIANG
  Add LARS to the fleet meta optimizer.
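Note on the commit above: fleet meta optimizers are switched on through DistributedStrategy flags that wrap the user's base optimizer. A hedged sketch of enabling LARS this way, assuming the fleet 2.x collective API (strategy.lars / strategy.lars_configs); the exact config keys may differ by version:

```python
import paddle
import paddle.distributed.fleet as fleet

paddle.enable_static()
fleet.init(is_collective=True)

strategy = fleet.DistributedStrategy()
strategy.lars = True
strategy.lars_configs = {
    "lars_coeff": 0.001,
    "lars_weight_decay": 0.0005,
}

# The LARS meta optimizer wraps the base Momentum optimizer when enabled.
optimizer = paddle.optimizer.Momentum(learning_rate=0.1, momentum=0.9)
optimizer = fleet.distributed_optimizer(optimizer, strategy=strategy)
# optimizer.minimize(loss)  # called inside a static-graph program as usual
```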