提交 · 1d3b27cae8a7d88db80358a2810279874835fc68 · PaddlePaddle / Paddle

21 9月, 2020 6 次提交

add double grad compute for batch norm (#27296) · 1d3b27ca

由 ceci3 提交于 9月 21, 2020

* add double grad compute for batch norm,test=develop

* fix unittest, test=develop

* remove unuse tensor,test=develop

* add format,test=develop

* update, test=develop

1d3b27ca

fix bug sequececonv_eltadd_relu_fuse_pass (#27404) · d9366194

由 Shang Zhizhou 提交于 9月 21, 2020

* fix bug sequececonv_eltadd_relu_fuse_pass, output error when sequence_conv's padding_start > 0

* fix seqconv_eltadd_relu_fuse_pass unitest error

d9366194

[Feature] Enhance inplace addto strategy for gradient accumulation in static graph (#27112) · aba759ba

由 Leo Chen 提交于 9月 21, 2020

* support use add instead of sum to do gradient accumulation

* add inplace addto pass

* add grad_add op and inplace addto pass

* remove debug code

* code refine

* fix bug when sereral sum ops inserts at same op_idx

* fix Flags type

* add addto attribute for conv3d

* fix ut

* code clean

* fix type

aba759ba

L
Fix bug: shapes of Topk outputs are wrong when the parameter k is Tensor · 669efb98
由 LutaoChu 提交于 9月 21, 2020
```
Fix bug: shapes of Topk outputs are wrong when the parameter k is Tensor 
```
669efb98
W

Add pass compatible and unit test. (#27377) · 39546aa2
由 Wilber 提交于 9月 21, 2020

39546aa2

Quant op dev (#25932) · 02606d45

由 huangxu96 提交于 9月 21, 2020

* Finished ChannelWiseQuantDequantAbsMaxOp and Passed unittests.

* Finished channel-wise quantize strategy in imperative quantization.

* Added Cuda code of ChannelWiseQuantDequantMaxAbsOP
Add Cuda code of ChannelWiseQuantDequantMaxAbsOp

* Add quant_axis for channel_wise quant.

* fixed a bug in unnitests, which will not trigger axis = 1 case and cannot meet the coverage rate requirement.

* Added some assert infomation and fixed some coding style mistakes.

02606d45

20 9月, 2020 1 次提交

【paddle.fleet】Fix/role maker api fix (#27326) · d6b54de4

由 tangwei12 提交于 9月 20, 2020

* fix fleet util and gloo

* fix worker endpoints

* fix

* fix UT

* fix gloo

* fix gloo

* update gloo

* update gloo

* update gloo

* update gloo

* update gloo

* fix gloo wrapper for hdfs

* add file gloo and UT

* fix UT

* fix UT

* fix UT

* hide public method of RoleMaker

* fix UT

* GPU fleetrun support gloo

* parameterserver fleetrun support gloo

* add UT

* add UT

* fix UT

* fix get server endpoint

* fix get server endpoint

* fix UT

* hide public method of rolemaker

* hide public method of rolemaker

* hide public method of rolemaker

* Update test_fleet_rolemaker_new.py

* hide public method of rolemaker

* hide public method of rolemaker

d6b54de4

18 9月, 2020 6 次提交
- T
  【paddle.fleet】gloo and util (#27213) · 99626502
  由 tangwei12 提交于 9月 18, 2020
```
* fix worker endpoints

* fix gloo wrapper for hdfs

* GPU fleetrun support gloo

* parameterserver fleetrun support gloo

* fix get server endpoint
```
  99626502
- L
  
  fix paddle.nn.Transformer api (#27391) · 4c5cfdea
  由 liu zhengxi 提交于 9月 18, 2020
  
  4c5cfdea
- P
  register pass compatibility (#27357) · fd7ab4e6
  由 Pei Yang 提交于 9月 18, 2020
```
* pass compatibility

* add compatibility registry

* add unittests for different padding

* add assert

* drop errmsg
```
  fd7ab4e6
- H
  
  Add 3 pass version check (#27283) · 7e6dfcf9
  由 haozech 提交于 9月 18, 2020
  
  7e6dfcf9
- C
  
  fix cross_entropy bug of the axis parameter in log_softmax (#27311) · fef94eac
  由 chajchaj 提交于 9月 18, 2020
  
  fef94eac
- Z
  
  Remove save_quantized_model in ImperativeQuantAware. (#27240) · d28162b9
  由 Zhen Wang 提交于 9月 18, 2020
  
  d28162b9
17 9月, 2020 12 次提交
- L
  [Dy2Stat-log] Add feature also_to_stdout and optimize log messages (#27285) · ac82baa8
  由 liym27 提交于 9月 17, 2020
```
* Add env value to  log to stdout; 2.Add logger name

* Optimize log messages in dygraph-to-static

* Replace logging.warn and warnings.warn with logging_utils.warn
```
  ac82baa8
- S
  
  add op version checker to ir passes (#27329) · 3c117179
  由 Shang Zhizhou 提交于 9月 17, 2020
  
  3c117179
- F
  add empty_like op (python, and unit test), use c++ implementation of empty op, (#27287) · 515efe42
  由 furnace 提交于 9月 17, 2020
```
and optimize the c++ implmentation of empty op as PR#26659 reviews,
and add bool for shape op.
```
  515efe42
- 1
  【Fleet2.0 Util】 add documents (#26698) · f36b9a7f
  由 123malin 提交于 9月 17, 2020
```
* test=develop, util documents
```
  f36b9a7f
- J
  enhance reduce op which can reduce tensor with arbitrary rank · 63203c4a
  由 Jack Zhou 提交于 9月 17, 2020
```
enhance reduce op which can reduce tensor with arbitrary rank 
```
  63203c4a
- Y
  
  cancel three disable ut (#27359) · f0a5eef5
  由 YUNSHEN XIE 提交于 9月 17, 2020
  
  f0a5eef5
- Y
  
  del exclusive ut which name with test_dist_ (#27316) · 25902b2c
  由 YUNSHEN XIE 提交于 9月 17, 2020
  
  25902b2c
- S
  Fix elementwise_floordiv op (#27352) · 9ee77b1f
  由 ShenLiang 提交于 9月 17, 2020
```
* fix floordiv
```
  9ee77b1f
- G
  Refine the unittest to support py38 (#27208) · 9bea834e
  由 guofei 提交于 9月 17, 2020
```
* Refine the unittest to support py38

    test=develop
```
  9bea834e
- Z
  
  fix dll load bug on windows from python3.8 (#27324) · a7fadce8
  由 Zhou Wei 提交于 9月 17, 2020
  
  a7fadce8
- H
  [Dy2stat] Change the Global Switch Name of ProgramTranslator for API 2.0 (#27203) · d4b4357b
  由 Huihuang Zheng 提交于 9月 17, 2020
```
Change ProgramTranslator.enable_declarative to ProgramTranslator.enable_to_static to meet API 2.0
```
  d4b4357b
- W
  modify test_imperative_using_non_zero_gpu from use two gpus to one gpu (#27348) · bf8e030e
  由 wanghuancoder 提交于 9月 17, 2020
```
* add op_function_generator.exe retry in windows, test=develop

* modify test_imperative_using_non_zero_gpu from use two gpus to one gpu, test=develop
```
  bf8e030e
16 9月, 2020 13 次提交
- L
  Remove unnecessary requirements (#27341) · 189e10f1
  由 Leo Chen 提交于 9月 16, 2020
```
* remove objgraph

* remove graphviz

* fix ut
```
  189e10f1
- G
  
  Cleanup redundant code files (#27319) · 11bcf0e2
  由 gongweibao 提交于 9月 16, 2020
  
  11bcf0e2
- S
  add adaptivelsgd in meta_optimizer (#27289) · 54b81fa3
  由 ShenLiang 提交于 9月 16, 2020
```
* add adaptivelsgd

* Todo fix the code to avoid the conflict.
```
  54b81fa3
- Y
  
  Fix bug in continuous apply, test=develop (#27337) · 34091533
  由 Yibing Liu 提交于 9月 16, 2020
  
  34091533
- C
  Support load state_dict from save_params/persistables (#27298) · c23f09fe
  由 Chen Weihang 提交于 9月 16, 2020
```
* support load state_dict from save_params/persistables

* remove failed unittest

* add load eof check & unittest

* remove eof check
```
  c23f09fe
- Y
  
  refine fleet dataset class api (#27133) · c67c3916
  由 yaoxuefeng 提交于 9月 16, 2020
  
  c67c3916
- S
  fix error message in broadcast/allreduce/gather (#27302) · c296618c
  由 ShenLiang 提交于 9月 16, 2020
```
* fix error message
```
  c296618c
- C
  Add input_spec & output_spec for TranslatedLayer (#27284) · 950301bf
  由 Chen Weihang 提交于 9月 16, 2020
```
* add input_spec & output_spec for translated_layer

* update error message
```
  950301bf
- L
  
  add regularizer api (#27292) · 18fc9275
  由 littletomatodonkey 提交于 9月 16, 2020
  
  18fc9275
- Y
  
  move three ut to execute only at night (#27314) · 8fe1c2d1
  由 YUNSHEN XIE 提交于 9月 16, 2020
  
  8fe1c2d1
- Z
  
  fix the test_fleet_lars_meta_optimizer ut. (#27291) · ef6dd6b8
  由 Zhen Wang 提交于 9月 16, 2020
  
  ef6dd6b8
- D
  fix ports conflict when use paddlecloud to launch analogue multi-nodes (#26191) · 389a9a7e
  由 danleifeng 提交于 9月 16, 2020
```
* fix ports conflict when launching multi-nodes in paddlecloud;test=develop

* add DISTRIBUTED_TRAINER_ENDPOINTS env for cloud;test=develop
```
  389a9a7e
- C
  
  Disable unit-test test_fleet_rolemaker_new · c8e54c5e
  由 chalsliu 提交于 9月 16, 2020
  
  c8e54c5e
15 9月, 2020 2 次提交

Optimize slice trt plugin (#26970) · 47fdc60e

由 Shang Zhizhou 提交于 9月 15, 2020

* optimize slice TRT plugin

This patch removes unnecessary barrier for data transfer of needed offset,
so data transfer can be overlap with GPU kernel execution.

This patch also fixes incorrect name of slice plugin. That is, replaces
"layernorm" with "slice"

test=develop

* add serialize/deserialize to slice plugin

* add static shape slice trt plugin

* fix slice trt op convertor dynamic shape bug

* fix format by clang-format

* fix pylint format error

* fix problems commented by peiyang
Co-authored-by: NRyan Jeng <rjeng@nvidia.com>

47fdc60e

W

[Pass Compatible] Bind python compatible. (#27262) · f827665a
由 Wilber 提交于 9月 15, 2020

f827665a

PaddlePaddle / Paddle 大约 1 年 前同步成功

PaddlePaddle / Paddle
大约 1 年前同步成功