提交 · 5c8fdb59265e7e22a4bd52629e0038180d494ff5 · PaddlePaddle / Paddle

24 9月, 2020 1 次提交
- Z
  Add GPU Kernels of Segment Ops, support, sum, max, min, mean · 4a9d21de
  由 Zhong Hui 提交于 9月 24, 2020
```
Add GPU Kernels of Segment Ops,  support, sum, max, min, mean
```
  4a9d21de
23 9月, 2020 12 次提交
- Y
  
  modified timeout value for 4 ut (#27462) · 66951ab2
  由 YUNSHEN XIE 提交于 9月 23, 2020
  
  66951ab2
- S
  [bug fix]:Memory increases after adapting the cudnn version to cudnn8 (#27436) · c17f9cf2
  由 Shang Zhizhou 提交于 9月 23, 2020
```
* [bug fix]:Memory increases after adapting the cudnn version to 8

* [bug fix]cudnnGetConvolutionForwardAlgorithm not defined
```
  c17f9cf2
- Z
  Make the Bind Method of Tensor more automatic (#27270) · 1e1ae5c5
  由 Zhou Wei 提交于 9月 23, 2020
```
* Makes the Bind Method more intelligent

* Makes the Bind Method more intelligent

* fix unittest

* fix unittest

* fix conflict
```
  1e1ae5c5
- L
  Fix bug: The calculation result of Diag_v2 Op under large size input is wrong (#27447) · 5508c787
  由 LutaoChu 提交于 9月 23, 2020
```
The calculation result of Diag_v2 Op under large size input is wrong 
```
  5508c787
- T
  large scale kv speedup (#26510) · bc5f0246
  由 tangwei12 提交于 9月 23, 2020
```
* rename communicator meet->BatchesCounter

* fix parame recv for sparse

* geo sparse init from pserver

* optimize init from pserver

* add large scale optimizer fuse(SGD/ADAM)

* rectification init_worker and exe.run startup program
```
  bc5f0246
- Q
  
  fix cmake dependencies of test_recognize_digits, test=develop (#27475) · d7b7dcd1
  由 Qi Li 提交于 9月 23, 2020
  
  d7b7dcd1
- Z
  
  fix bug MD of compile, And add MD/STATIC/OPENBLAS inference lib check on windows (#27051) · 292b24aa
  由 Zhou Wei 提交于 9月 23, 2020
  
  292b24aa
- C
  Polish no onwer ops error message (#27448) · 41b59555
  由 Chen Weihang 提交于 9月 23, 2020
```
* polish no onwer op error message

* fix unittest failed

* polish details based reviewer comment
```
  41b59555
- Z
  add fuse_bn_act op (#27230) · 906e7f92
  由 Zhang Ting 提交于 9月 23, 2020
```
* add fused_bn_add_relu op
```
  906e7f92
- W
  
  update for 2.0 inference api. (#27473) · 5034d181
  由 Wilber 提交于 9月 23, 2020
  
  5034d181
- C
  Polish some lost invalid error message (#27445) · 76506447
  由 Chen Weihang 提交于 9月 23, 2020
```
* polish some lost error msg

* add some math file to white list

* polish detail based reviewer commnet
```
  76506447
- W
  
  avoid data transform for linspace OP (#27444) · 76fb95fe
  由 wangchaochaohu 提交于 9月 22, 2020
  
  76fb95fe
22 9月, 2020 9 次提交
- 1
  Enhance Op's Error Message (#27455) · a0452475
  由 123malin 提交于 9月 22, 2020
```
* test=develop, update error message
```
  a0452475
- W
  
  refine the precious of linspace Op using half way (#27452) · 0a862fd3
  由 wangchaochaohu 提交于 9月 22, 2020
  
  0a862fd3
- P
  
  errmsg refine of trt plugin (#27309) · fda54c02
  由 Pei Yang 提交于 9月 22, 2020
  
  fda54c02
- T
  
  update python 2.7.15 (#27435) · 9f3a9be7
  由 tianshuo78520a 提交于 9月 22, 2020
  
  9f3a9be7
- 石
  
  enhance error messages, test=develop (#27423) · dd4c2d86
  由石晓伟提交于 9月 22, 2020
  
  dd4c2d86
- Z
  
  judge whether remove build dir to accelerate compile,test=develop (#27334) · b7371fa5
  由 Zhou Wei 提交于 9月 22, 2020
  
  b7371fa5
- Z
  Add the cpu version of segment sum mean max min op · f4c750d7
  由 Zhong Hui 提交于 9月 22, 2020
```
Add the cpu version of segment sum mean max min op
```
  f4c750d7
- W
  
  Rename fluid_inference to paddle_inference. (#27422) · afe94903
  由 Wilber 提交于 9月 22, 2020
  
  afe94903
- P
  
  clear pass logs (#27434) · 81823370
  由 Pei Yang 提交于 9月 22, 2020
  
  81823370
21 9月, 2020 10 次提交

F

add mv op(c++, python, unit test) (#27024) · 13a4c74e
由 furnace 提交于 9月 21, 2020

13a4c74e

Optimize argsort Op performance on GPU · f11a53ee

由 LutaoChu 提交于 9月 21, 2020

* argsort op acceleration on GPU when the input size is equal to the length of the ‘axis’ dimension

f11a53ee

add double grad compute for batch norm (#27296) · 1d3b27ca

由 ceci3 提交于 9月 21, 2020

* add double grad compute for batch norm,test=develop

* fix unittest, test=develop

* remove unuse tensor,test=develop

* add format,test=develop

* update, test=develop

1d3b27ca

fix bug sequececonv_eltadd_relu_fuse_pass (#27404) · d9366194

由 Shang Zhizhou 提交于 9月 21, 2020

* fix bug sequececonv_eltadd_relu_fuse_pass, output error when sequence_conv's padding_start > 0

* fix seqconv_eltadd_relu_fuse_pass unitest error

d9366194

[Feature] Enhance inplace addto strategy for gradient accumulation in static graph (#27112) · aba759ba

由 Leo Chen 提交于 9月 21, 2020

* support use add instead of sum to do gradient accumulation

* add inplace addto pass

* add grad_add op and inplace addto pass

* remove debug code

* code refine

* fix bug when sereral sum ops inserts at same op_idx

* fix Flags type

* add addto attribute for conv3d

* fix ut

* code clean

* fix type

aba759ba

L
Fix bug: shapes of Topk outputs are wrong when the parameter k is Tensor · 669efb98
由 LutaoChu 提交于 9月 21, 2020
```
Fix bug: shapes of Topk outputs are wrong when the parameter k is Tensor 
```
669efb98
W

Add pass compatible and unit test. (#27377) · 39546aa2
由 Wilber 提交于 9月 21, 2020

39546aa2

Quant op dev (#25932) · 02606d45

由 huangxu96 提交于 9月 21, 2020

* Finished ChannelWiseQuantDequantAbsMaxOp and Passed unittests.

* Finished channel-wise quantize strategy in imperative quantization.

* Added Cuda code of ChannelWiseQuantDequantMaxAbsOP
Add Cuda code of ChannelWiseQuantDequantMaxAbsOp

* Add quant_axis for channel_wise quant.

* fixed a bug in unnitests, which will not trigger axis = 1 case and cannot meet the coverage rate requirement.

* Added some assert infomation and fixed some coding style mistakes.

02606d45

Refine error msg in paddle/fluid/framework/details [part 1] (#25631) · bbc84e0f

由 Leo Chen 提交于 9月 21, 2020

* refine error msg in var_handle.h, test=develop

* refine all_reduce_op_handle

* fix some error msg

* refine variable_visitor

* refine threaded_ssa_graph_executor

* refine inplace related files

* refine executor related files

* refine fetch_op_handle.cc

* fix bug

* follow comments

bbc84e0f

M
fix adam (#27343) · f936adbd
由 MRXLT 提交于 9月 21, 2020
```
* fix adam

* rmsprop support double
```
f936adbd

18 9月, 2020 8 次提交
- T
  【paddle.fleet】gloo and util (#27213) · 99626502
  由 tangwei12 提交于 9月 18, 2020
```
* fix worker endpoints

* fix gloo wrapper for hdfs

* GPU fleetrun support gloo

* parameterserver fleetrun support gloo

* fix get server endpoint
```
  99626502
- P
  
  Optimize emb_eltwise_layernorm_plugin and support fp16 (#27128) · a5ef246c
  由 Pei Yang 提交于 9月 18, 2020
  
  a5ef246c
- Y
  
  enhance dataset err msg (#27363) · d726fd5e
  由 yaoxuefeng 提交于 9月 18, 2020
  
  d726fd5e
- G
  Support python3.8 (#26850) · 9fdcfe89
  由 guofei 提交于 9月 18, 2020
```
* Support python3.8

test=notest
```
  9fdcfe89
- P
  register pass compatibility (#27357) · fd7ab4e6
  由 Pei Yang 提交于 9月 18, 2020
```
* pass compatibility

* add compatibility registry

* add unittests for different padding

* add assert

* drop errmsg
```
  fd7ab4e6
- H
  
  Add 3 pass version check (#27283) · 7e6dfcf9
  由 haozech 提交于 9月 18, 2020
  
  7e6dfcf9
- G
  fix cudnn dyload (#27308) · 1a755971
  由 GaoWei8 提交于 9月 18, 2020
```
* fix cudnn dyload error
```
  1a755971
- W
  fix the error message for the math dir · b6a4349d
  由 wawltor 提交于 9月 18, 2020
```
https://github.com/PaddlePaddle/Paddle/pull/27332
```
  b6a4349d

PaddlePaddle / Paddle 1 年多 前同步成功

PaddlePaddle / Paddle
1 年多前同步成功