Commits · 9dedafa0df88042b348c502da07e55841df4a7a9 · PaddlePaddle / Paddle

16 Sep, 2020 2 commits
- M
  fix strategy, test=develop (#27323) · 9dedafa0
  Committed by mapingshuo on Sep 16, 2020
```
* fix strategy, test=develop

* fix can_apply
```
  9dedafa0
- C
  
  Disable unit-test test_fleet_rolemaker_new · c8e54c5e
  Committed by chalsliu on Sep 16, 2020
  
  c8e54c5e
15 Sep, 2020 5 commits

Optimize slice trt plugin (#26970) · 47fdc60e

Committed by Shang Zhizhou on Sep 15, 2020

* optimize slice TRT plugin

This patch removes an unnecessary barrier on the data transfer of the needed offset,
so the data transfer can overlap with GPU kernel execution.

This patch also fixes the incorrect name of the slice plugin, replacing
"layernorm" with "slice".

test=develop

* add serialize/deserialize to slice plugin

* add static shape slice trt plugin

* fix slice trt op converter dynamic shape bug

* fix format by clang-format

* fix pylint format error

* fix problems pointed out by peiyang

Co-authored-by: NRyan Jeng <rjeng@nvidia.com>

47fdc60e

[Pass Compatible] Bind python compatible. (#27262) · f827665a
Committed by Wilber on Sep 15, 2020

f827665a

Optimize error report (#27254) · e6e2e537

Committed by Shang Zhizhou on Sep 15, 2020

* optimize error report

* add test case for pad op converter

* fix some spelling mistakes pointed out by peiyang

e6e2e537

change sequence length attribute to input (#27193) · ee1ed42c
Committed by GaoWei8 on Sep 15, 2020
```
* replace sequence length attr with input
```
ee1ed42c
Remove the cache in post_traning_quantization, test=develop (#26450) · 2d8281d5
Committed by cc on Sep 15, 2020
```
* Remove the cache in post_traning_quantization, test=develop
```
2d8281d5

14 Sep, 2020 10 commits
- Y
  
  disable three unittests,test=document_fix (#27299) · 6947a58a
  Committed by YUNSHEN XIE on Sep 14, 2020
  
  6947a58a
- Z
  
  paddle.nn.functional.logsigmoid -> log_sigmoid (#27277) · ac9afa02
  Committed by zhupengyang on Sep 14, 2020
  
  ac9afa02
- L
  check the validation of parameters for expand and tile apis (#26816) · bc3e9ba1
  Committed by lilong12 on Sep 14, 2020
```
* bug fix, test=develop
```
  bc3e9ba1
- L
  fix conv depthwise bug (#27278) · a6854359
  Committed by LielinJiang on Sep 14, 2020
```
Fix conv depthwise bug when in_channels=1.
```
  a6854359
- X
  
  fix for tuple,test=develop (#27190) · d4f03dfb
  Committed by xiaoting on Sep 14, 2020
  
  d4f03dfb
- M
  add check for sparse parameters with weight_decay (#27141) · 91663073
  Committed by MRXLT on Sep 14, 2020
```
* add check for sparse parameters with weight_decay

* move sparse check to adam.py
```
  91663073
- C
  move DataLoader._worker_loop to top level (#27247) · 8d531727
  Committed by Chen Weihang on Sep 14, 2020
```
* move worker loop to top level

* move reader process loop to top level

* fix failed unittests
```
  8d531727
- Z
  Update amp_check_finite_and_scale_op and add an updating_loss_scaling op for... · d708b210
  Committed by Zhen Wang on Sep 14, 2020
```
Update amp_check_finite_and_scale_op and add an updating_loss_scaling op for static graph amp training. (#26240)

* update amp_check_finite_and_scale_op for static_amp.

* use amp_check_finite_and_scale in static graph amp.

* set grads to zero when grads contain infinite values (as for the amp_check_finite_and_scale op).

* add update_loss_scaling op in cpp.

* add update_loss_scaling_op unit test.

* update the doc of the check_finite_and_unscale op

* Update the process of skipping gradient updates when the gradients have infinite values.

* update the way to zero grads.

* update test_update_loss_scaling_op.py

* add log info when infinite grads are found.

* add the unit test for UpdateLossScaling Layer.
```
  d708b210
- S
  remove auto mode from localsgd optimizer (#27237) · 2b6a5793
  Committed by ShenLiang on Sep 14, 2020
```
* rm auto from localsgd
```
  2b6a5793
- A
  Add int8 GRU kernel (#27220) · cc3f4b81
  Committed by Adam on Sep 14, 2020
```
* Add int8 GRU kernel with UTs

* Lint fixes

* More lint fixes
```
  cc3f4b81
11 Sep, 2020 8 commits
- L
  Temporarily disable zero_copy (#27248) · 19228bd1
  Committed by Leo Chen on Sep 11, 2020
```
* temporarily disable zero_copy

* add test

* follow comments
```
  19228bd1
- L
  
  fix bug when axis is a tensor with more than 1 element (#27263) · f402d8d8
  Committed by Leo Chen on Sep 11, 2020
  
  f402d8d8
- A
  fix unused var with zero gradient bug in fluid.gradient (#27246) · 20a84820
  Committed by Aurelius84 on Sep 11, 2020
```
* fix calcu_gradients

* fix code place

* fix embedding interface usage
```
  20a84820
- C
  
  fix loaded no params layer run error (#27241) · 33ff833a
  Committed by Chen Weihang on Sep 11, 2020
  
  33ff833a
- L
  
  [Dy2Stat - Error Handling] Fix bug and optimize dy2stat error. (#27225) · 3e20ddf7
  Committed by liym27 on Sep 11, 2020
  
  3e20ddf7
- C
  
  use structured name in loaded dict (#27242) · ac8afe18
  Committed by Chen Weihang on Sep 11, 2020
  
  ac8afe18
- A
  [Dy2stat] support usage: to_static(model) (#27040) · 5e0dde02
  Committed by Aurelius84 on Sep 11, 2020
```
* support to_static(model)

* add warning and unittest
```
  5e0dde02
- F
  
  add empty op (c++, python, unit test) (#26659) · 2e597696
  Committed by furnace on Sep 11, 2020
  
  2e597696
10 Sep, 2020 11 commits
- Z
  Reduce the training iterations in test_fetch_unmerged and test_fuse_bn_act_pass. (#27234) · b6715386
  Committed by Zhen Wang on Sep 10, 2020
```
* Use the single GPU card to execute the test_fuse_bn_act_pass UT.
```
  b6715386
- Z
  
  Update the _get_fake_quant_type definition in imperative QAT. (#27222) · ece74c4c
  Committed by Zhen Wang on Sep 10, 2020
  
  ece74c4c
- L
  add double grad for tile op and expand_v2 op (#27114) · c5f957ae
  Committed by lilong12 on Sep 10, 2020
```
* add double grad for tile, test=develop

* add double grad for expand_v2 op, test=develop
```
  c5f957ae
- L
  add double grad for expand (#27183) · 58a88ba9
  Committed by lilong12 on Sep 10, 2020
```
* add double grad for expand, test=develop
```
  58a88ba9
- C
  Refine jit.save implementation to adapt to InputSpec use cases (#26959) · 5406b014
  Committed by Chen Weihang on Sep 10, 2020
```
* add some unittest cases to verify jit.save, no_test

* add more unittests

* add test with example inputs

* polish implementation details

* remove useless blank

* fix fetch random error
```
  5406b014
- S
  
  revert divide (#27202) · 5bd84b22
  Committed by ShenLiang on Sep 10, 2020
  
  5bd84b22
- 1
  [paddle.fleet] parameter_server_optimizer support auto_strategy (#27181) · 60c3ef3a
  Committed by 123malin on Sep 10, 2020
```
* parameter_server_optimizer support auto_strategy
```
  60c3ef3a
- W
  fix the CudaPinMemory bug for the equal op (#27176) · fde5cfe8
  Committed by wawltor on Sep 10, 2020
```
 fix the CudaPinMemory bug for the equal op and add the test case for the equal op
```
  fde5cfe8
- L
  
  Move unittest test_optimizer_in_control_flow from CI multi_cards. (#27185) · d3874ab4
  Committed by liym27 on Sep 10, 2020
  
  d3874ab4
- H
  Decrease test_parallel_executor_crf CI time, test=develop (#27212) · 40dd563d
  Committed by Huihuang Zheng on Sep 10, 2020
```
Decrease the number of running iterations to reduce CI time.

The CI system shows this decreased the unittest time from about 90 seconds to about 30 seconds.
```
  40dd563d
- Z
  
  restructure logsumexp to speed up compiling (#27191) · cc3306f7
  Committed by zhupengyang on Sep 10, 2020
  
  cc3306f7
09 Sep, 2020 4 commits
- J
  modified the implementation of Lars optimizer (#26733) · 5d039f40
  Committed by JZ-LIANG on Sep 09, 2020
```
add lars to fleet meta optimizer
```
  5d039f40
- L
  
  Fix test_origin_info to be compatible with PY3.8, because ast module is different in PY3.8 (#27201) · a1b640bc
  Committed by liym27 on Sep 09, 2020
  
  a1b640bc
- D
  [paddle.fleet] refine launch and distributed repr string for print (#27093) · f7d08b7d
  Committed by Dong Daxiang on Sep 09, 2020
```
* refine launch and distributed repr string for print
```
  f7d08b7d
- Q
  Add double grad in reduce sum (#27115) · 43b0445b
  Committed by Qinghe JING on Sep 09, 2020
```
* set default value to strategy in distributed_optimizer test=develop
```
  43b0445b

PaddlePaddle / Paddle synced successfully about 1 year ago
