Commits · 5c8fdb59265e7e22a4bd52629e0038180d494ff5 · PaddlePaddle / Paddle

Sep 24, 2020 · 1 commit
- Add GPU Kernels of Segment Ops, support sum, max, min, mean · 4a9d21de
  Committed by Zhong Hui on Sep 24, 2020
Sep 23, 2020 · 6 commits
- [bug fix]: Memory increases after adapting the cudnn version to cudnn8 (#27436) · c17f9cf2
  Committed by Shang Zhizhou on Sep 23, 2020
```
* [bug fix]: Memory increases after adapting the cudnn version to 8

* [bug fix] cudnnGetConvolutionForwardAlgorithm not defined
```
- Fix bug: The calculation result of Diag_v2 Op under large size input is wrong (#27447) · 5508c787
  Committed by LutaoChu on Sep 23, 2020
- large scale kv speedup (#26510) · bc5f0246
  Committed by tangwei12 on Sep 23, 2020
```
* rename communicator meet->BatchesCounter

* fix param recv for sparse

* geo sparse init from pserver

* optimize init from pserver

* add large scale optimizer fuse(SGD/ADAM)

* rectify init_worker and exe.run startup program
```
- Polish no owner ops error message (#27448) · 41b59555
  Committed by Chen Weihang on Sep 23, 2020
```
* polish no owner op error message

* fix unittest failed

* polish details based on reviewer comments
```
- add fuse_bn_act op (#27230) · 906e7f92
  Committed by Zhang Ting on Sep 23, 2020
```
* add fused_bn_add_relu op
```
- avoid data transform for linspace OP (#27444) · 76fb95fe
  Committed by wangchaochaohu on Sep 22, 2020
Sep 22, 2020 · 4 commits
- Enhance Op's Error Message (#27455) · a0452475
  Committed by 123malin on Sep 22, 2020
```
* test=develop, update error message
```
- refine the precision of linspace Op using half way (#27452) · 0a862fd3
  Committed by wangchaochaohu on Sep 22, 2020
- enhance error messages, test=develop (#27423) · dd4c2d86
  Committed by 石晓伟 on Sep 22, 2020
- Add the cpu version of segment sum mean max min op · f4c750d7
  Committed by Zhong Hui on Sep 22, 2020
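
For readers unfamiliar with segment ops, the following is a minimal pure-Python sketch of what segment sum, mean, max, and min compute over a 1-D list. The function name `segment_reduce` and its signature are hypothetical and chosen for illustration only; this is not Paddle's API or kernel code, and it assumes every output entry corresponds to a segment id that actually appears in the input.

```python
def segment_reduce(data, segment_ids, op="sum"):
    """Reduce the values of `data` that share the same id in `segment_ids`.

    Hypothetical helper for illustration; not part of Paddle.
    Returns one reduced value per distinct id, in order of first appearance.
    """
    if len(data) != len(segment_ids):
        raise ValueError("data and segment_ids must have the same length")

    # Collect the values belonging to each segment id.
    groups = {}
    for value, seg in zip(data, segment_ids):
        groups.setdefault(seg, []).append(value)

    reducers = {
        "sum": sum,
        "mean": lambda xs: sum(xs) / len(xs),
        "max": max,
        "min": min,
    }
    if op not in reducers:
        raise ValueError("op must be one of: sum, mean, max, min")

    reduce_fn = reducers[op]
    return [reduce_fn(values) for values in groups.values()]
```

For example, `segment_reduce([1, 2, 3, 4], [0, 0, 1, 1], "sum")` reduces the first two and the last two values separately and returns `[3, 7]`.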
Sep 21, 2020 · 8 commits
- add mv op(c++, python, unit test) (#27024) · 13a4c74e
  Committed by furnace on Sep 21, 2020
- Optimize argsort Op performance on GPU · f11a53ee
  Committed by LutaoChu on Sep 21, 2020
  * argsort op acceleration on GPU when the input size is equal to the length of the ‘axis’ dimension
- add double grad compute for batch norm (#27296) · 1d3b27ca
  Committed by ceci3 on Sep 21, 2020
  * add double grad compute for batch norm,test=develop
  * fix unittest, test=develop
  * remove unused tensor,test=develop
  * add format,test=develop
  * update, test=develop
- fix bug seqconv_eltadd_relu_fuse_pass (#27404) · d9366194
  Committed by Shang Zhizhou on Sep 21, 2020
  * fix bug seqconv_eltadd_relu_fuse_pass, output error when sequence_conv's padding_start > 0
  * fix seqconv_eltadd_relu_fuse_pass unit test error
- [Feature] Enhance inplace addto strategy for gradient accumulation in static graph (#27112) · aba759ba
  Committed by Leo Chen on Sep 21, 2020
  * support use add instead of sum to do gradient accumulation
  * add inplace addto pass
  * add grad_add op and inplace addto pass
  * remove debug code
  * code refine
  * fix bug when several sum ops are inserted at the same op_idx
  * fix Flags type
  * add addto attribute for conv3d
  * fix ut
  * code clean
  * fix type
- Fix bug: shapes of Topk outputs are wrong when the parameter k is Tensor · 669efb98
  Committed by LutaoChu on Sep 21, 2020
- Quant op dev (#25932) · 02606d45
  Committed by huangxu96 on Sep 21, 2020
  * Finished ChannelWiseQuantDequantAbsMaxOp and Passed unittests.
  * Finished channel-wise quantize strategy in imperative quantization.
  * Added Cuda code of ChannelWiseQuantDequantMaxAbsOP
  * Add quant_axis for channel_wise quant.
  * fixed a bug in unit tests, which did not trigger the axis = 1 case and could not meet the coverage rate requirement.
  * Added some assert information and fixed some coding style mistakes.
- fix adam (#27343) · f936adbd
  Committed by MRXLT on Sep 21, 2020
```
* fix adam

* rmsprop support double
```
Sep 18, 2020 · 2 commits
- fix cudnn dyload (#27308) · 1a755971
  Committed by GaoWei8 on Sep 18, 2020
```
* fix cudnn dyload error
```
- fix the error message for the math dir · b6a4349d
  Committed by wawltor on Sep 18, 2020
```
https://github.com/PaddlePaddle/Paddle/pull/27332
```
Sep 17, 2020 · 5 commits
- Polish operators error message in average_accumulates OP (#27268) · 01659a69
  Committed by HappyAngel on Sep 17, 2020
```
* fix op print error info problem. test=develop

* fix build error

* fix format

* fix error msg info

* fix format
```
- add empty_like op (python, and unit test), use c++ implementation of empty op, (#27287) · 515efe42
  Committed by furnace on Sep 17, 2020
```
and optimize the c++ implementation of empty op as PR#26659 reviews,
and add bool for shape op.
```
- Improve OP error messages (#27301) · e9a0fbff
  Committed by Yi Liu on Sep 17, 2020
```
Improve OP error messages in paddle/fluid/operators/distributed_ops
```
- enhance reduce op which can reduce tensor with arbitrary rank · 63203c4a
  Committed by Jack Zhou on Sep 17, 2020
- Fix elementwise_floordiv op (#27352) · 9ee77b1f
  Committed by ShenLiang on Sep 17, 2020
```
* fix floordiv
```
Sep 16, 2020 · 5 commits
- Optimize error descriptions for the math dir · 6e29c2da
  Committed by Jack Zhou on Sep 16, 2020
- Fix to concat oneDNN overwriting data (#27273) · 4582f697
  Committed by Jacek Czaja on Sep 16, 2020
```
test=develop
```
- fix error message in broadcast/allreduce/gather (#27302) · c296618c
  Committed by ShenLiang on Sep 16, 2020
```
* fix error message
```
- update the error message check for some ops · 4e8582fe
  Committed by wawltor on Sep 16, 2020
- add the error message check for some operators · d003573f
  Committed by wawltor on Sep 16, 2020
Sep 15, 2020 · 1 commit
- change sequence length attribute to input (#27193) · ee1ed42c
  Committed by GaoWei8 on Sep 15, 2020
```
* replace sequence length attr with input
```
Sep 14, 2020 · 7 commits
- Add bfloat16 passes (#26999) · 1483ea23
  Committed by joanna.wozna.intel on Sep 14, 2020
- Improving error report message for sequence_expand op (#27245) · bf461fa5
  Committed by lilong12 on Sep 14, 2020
```
* improve err report, test=develop
```
- Enhance the error messages for files in operators/math · bbad3414
  Committed by Zhong Hui on Sep 14, 2020
- refine error message related to paddle-TRT (#27256) · aae41c6f
  Committed by Pei Yang on Sep 14, 2020
- Update amp_check_finite_and_scale_op and add an updating_loss_scaling op for... · d708b210
  Committed by Zhen Wang on Sep 14, 2020
  Update amp_check_finite_and_scale_op and add an updating_loss_scaling op for static graph amp training. (#26240)
  * update amp_check_finite_and_scale_op for static_amp.
  * use amp_check_finite_and_scale in static graph amp.
  * set grads to zero when grads contain infinite values (as for the amp_check_finite_and_scale op).
  * add update_loss_scaling op in cpp.
  * add update_loss_scaling_op unit test.
  * update the doc of the check_finite_and_unscale op
  * Update the process of skipping gradient updates if the gradients have infinite values.
  * update the way to zero grads.
  * update test_update_loss_scaling_op.py
  * add log info when infinite grads are found.
  * add the unit test for UpdateLossScaling Layer.
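
The commit above describes zeroing gradients and skipping the update when infinite values are found. As a rough illustration of how dynamic loss scaling of this kind commonly works, here is a minimal pure-Python sketch. The class name `LossScaler`, its parameters, and the exact scaling policy (shrink the scale after bad steps, grow it after a run of good steps) are assumptions for illustration; they are not taken from the Paddle implementation.

```python
import math


class LossScaler:
    """Hypothetical dynamic loss scaler; names and defaults are illustrative."""

    def __init__(self, init_scale=2.0 ** 15, incr_every_n_steps=1000,
                 decr_every_n_nan_or_inf=2, incr_ratio=2.0, decr_ratio=0.5):
        self.scale = init_scale
        self.incr_every_n_steps = incr_every_n_steps
        self.decr_every_n_nan_or_inf = decr_every_n_nan_or_inf
        self.incr_ratio = incr_ratio
        self.decr_ratio = decr_ratio
        self.good_steps = 0
        self.bad_steps = 0

    def step(self, scaled_grads):
        """Return (grads, skipped) for one training step.

        If any gradient is inf or NaN, the gradients are zeroed and the
        caller is expected to skip the parameter update.
        """
        found_inf = any(not math.isfinite(g) for g in scaled_grads)
        if found_inf:
            self.good_steps = 0
            self.bad_steps += 1
            if self.bad_steps >= self.decr_every_n_nan_or_inf:
                # Shrink the scale, but never below 1.0 (an assumed floor).
                self.scale = max(self.scale * self.decr_ratio, 1.0)
                self.bad_steps = 0
            return [0.0] * len(scaled_grads), True

        # Unscale with the current scale before adjusting it.
        grads = [g / self.scale for g in scaled_grads]
        self.bad_steps = 0
        self.good_steps += 1
        if self.good_steps >= self.incr_every_n_steps:
            self.scale *= self.incr_ratio
            self.good_steps = 0
        return grads, False
```

In this sketch, a caller would multiply the loss by `scaler.scale` before backpropagation, pass the resulting gradients to `step`, and apply the optimizer update only when `skipped` is `False`.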

- Add int8 GRU kernel (#27220) · cc3f4b81
  Committed by Adam on Sep 14, 2020
```
* Add int8 GRU kernel with UTs

* Lint fixes

* More lint fixes
```
- Error description optimize for math dir · 9437ce36
  Committed by Jack Zhou on Sep 14, 2020
Sep 13, 2020 · 1 commit
- use eval to improve performance, test=develop (#25459) · 5c1bafbb
  Committed by Zhang Ting on Sep 13, 2020

PaddlePaddle / Paddle · synced successfully more than 1 year ago
