Commits · f4c750d721a1226738bea382f6c0cf725cca8481 · 机器未来 / Paddle

22 Sep, 2020 3 commits
- Add the cpu version of segment sum mean max min op · f4c750d7
  Committed by Zhong Hui on Sep 22, 2020
- Rename fluid_inference to paddle_inference. (#27422) · afe94903
  Committed by Wilber on Sep 22, 2020
- clear pass logs (#27434) · 81823370
  Committed by Pei Yang on Sep 22, 2020
21 Sep, 2020 10 commits

add mv op(c++, python, unit test) (#27024) · 13a4c74e
Committed by furnace on Sep 21, 2020

Optimize argsort Op performance on GPU · f11a53ee

Committed by LutaoChu on Sep 21, 2020

* argsort op acceleration on GPU when the input size is equal to the length of the ‘axis’ dimension

add double grad compute for batch norm (#27296) · 1d3b27ca

Committed by ceci3 on Sep 21, 2020

* add double grad compute for batch norm,test=develop

* fix unittest, test=develop

* remove unused tensor,test=develop

* add format,test=develop

* update, test=develop

fix bug sequececonv_eltadd_relu_fuse_pass (#27404) · d9366194

Committed by Shang Zhizhou on Sep 21, 2020

* fix bug sequececonv_eltadd_relu_fuse_pass, output error when sequence_conv's padding_start > 0

* fix seqconv_eltadd_relu_fuse_pass unit test error

[Feature] Enhance inplace addto strategy for gradient accumulation in static graph (#27112) · aba759ba

Committed by Leo Chen on Sep 21, 2020

* support use add instead of sum to do gradient accumulation

* add inplace addto pass

* add grad_add op and inplace addto pass

* remove debug code

* code refine

* fix bug when several sum ops are inserted at the same op_idx

* fix Flags type

* add addto attribute for conv3d

* fix ut

* code clean

* fix type

Fix bug: shapes of Topk outputs are wrong when the parameter k is Tensor · 669efb98
Committed by LutaoChu on Sep 21, 2020

Add pass compatible and unit test. (#27377) · 39546aa2
Committed by Wilber on Sep 21, 2020

Quant op dev (#25932) · 02606d45

Committed by huangxu96 on Sep 21, 2020

* Finished ChannelWiseQuantDequantAbsMaxOp and Passed unittests.

* Finished channel-wise quantize strategy in imperative quantization.

* Added CUDA code of ChannelWiseQuantDequantMaxAbsOp

* Add quant_axis for channel_wise quant.

* fixed a bug in the unit tests, which did not trigger the axis = 1 case and so could not meet the coverage rate requirement.

* Added some assert information and fixed some coding style mistakes.

Refine error msg in paddle/fluid/framework/details [part 1] (#25631) · bbc84e0f

Committed by Leo Chen on Sep 21, 2020

* refine error msg in var_handle.h, test=develop

* refine all_reduce_op_handle

* fix some error msg

* refine variable_visitor

* refine threaded_ssa_graph_executor

* refine inplace related files

* refine executor related files

* refine fetch_op_handle.cc

* fix bug

* follow comments

fix adam (#27343) · f936adbd
Committed by MRXLT on Sep 21, 2020
```
* fix adam

* rmsprop support double
```

18 Sep, 2020 7 commits
- [paddle.fleet] gloo and util (#27213) · 99626502
  Committed by tangwei12 on Sep 18, 2020
```
* fix worker endpoints

* fix gloo wrapper for hdfs

* GPU fleetrun support gloo

* parameterserver fleetrun support gloo

* fix get server endpoint
```
- Optimize emb_eltwise_layernorm_plugin and support fp16 (#27128) · a5ef246c
  Committed by Pei Yang on Sep 18, 2020
- enhance dataset err msg (#27363) · d726fd5e
  Committed by yaoxuefeng on Sep 18, 2020
- register pass compatibility (#27357) · fd7ab4e6
  Committed by Pei Yang on Sep 18, 2020
```
* pass compatibility

* add compatibility registry

* add unittests for different padding

* add assert

* drop errmsg
```
- Add 3 pass version check (#27283) · 7e6dfcf9
  Committed by haozech on Sep 18, 2020
- fix cudnn dyload (#27308) · 1a755971
  Committed by GaoWei8 on Sep 18, 2020
```
* fix cudnn dyload error
```
- fix the error message for the math dir · b6a4349d
  Committed by wawltor on Sep 18, 2020
```
https://github.com/PaddlePaddle/Paddle/pull/27332
```
17 Sep, 2020 8 commits
- Polish operators error message in average_accumlate OP (#27268) · 01659a69
  Committed by HappyAngel on Sep 17, 2020
```
* fix op print error info problem. test=develop

* fix build error

* fix format

* fix error msg info

* fix format
```
- add op version checker to ir passes (#27329) · 3c117179
  Committed by Shang Zhizhou on Sep 17, 2020
- add empty_like op (python, and unit test), use c++ implementation of empty op, (#27287) · 515efe42
  Committed by furnace on Sep 17, 2020
```
and optimize the c++ implementation of empty op as PR#26659 reviews,
and add bool for shape op.
```
- Optimize OP error messages (#27301) · e9a0fbff
  Committed by Yi Liu on Sep 17, 2020
```
Optimize OP error messages in paddle/fluid/operators/distributed_ops
```
- enhance reduce op which can reduce tensor with arbitrary rank · 63203c4a
  Committed by Jack Zhou on Sep 17, 2020
- fix the bug of non-exit, test=develop (#27350) · 9f9d15e2
  Committed by lilong12 on Sep 17, 2020
- Fix elementwise_floordiv op (#27352) · 9ee77b1f
  Committed by ShenLiang on Sep 17, 2020
```
* fix floordiv
```
- fix cache file judge (#27369) · ebc6d544
  Committed by Zhou Wei on Sep 17, 2020
16 Sep, 2020 10 commits
- add adaptivelsgd in meta_optimizer (#27289) · 54b81fa3
  Committed by ShenLiang on Sep 16, 2020
```
* add adaptivelsgd

* Todo fix the code to avoid the conflict.
```
- Error description optimize for the math dir · 6e29c2da
  Committed by Jack Zhou on Sep 16, 2020
- fix judge cache file of inference api more accurate (#27175) · f992f8d7
  Committed by Zhou Wei on Sep 16, 2020
- Fix to concat oneDNN overwriting data (#27273) · 4582f697
  Committed by Jacek Czaja on Sep 16, 2020
```
test=develop
```
- fix error message in broadcast/allreduce/gather (#27302) · c296618c
  Committed by ShenLiang on Sep 16, 2020
```
* fix error message
```
- Polish framework error message part 7 (#27266) · 4f9d6529
  Committed by Chen Weihang on Sep 16, 2020
```
* polish framework error message part 7

* fix typo

* polish by reviewers' comments
```
- update the error message check for some ops · 4e8582fe
  Committed by wawltor on Sep 16, 2020
- add the error message check for some operators · d003573f
  Committed by wawltor on Sep 16, 2020
- Enhance infer error info message (#26731) · dae62556
  Committed by Wilber on Sep 16, 2020
- use shared dev_ctx (#27313) · 4c8ea492
  Committed by Leo Chen on Sep 16, 2020
15 Sep, 2020 2 commits

Optimize slice trt plugin (#26970) · 47fdc60e

Committed by Shang Zhizhou on Sep 15, 2020

* optimize slice TRT plugin

This patch removes an unnecessary barrier for the data transfer of the needed offset,
so the data transfer can overlap with GPU kernel execution.

This patch also fixes the incorrect name of the slice plugin, replacing
"layernorm" with "slice".

test=develop

* add serialize/deserialize to slice plugin

* add static shape slice trt plugin

* fix slice trt op converter dynamic shape bug

* fix format by clang-format

* fix pylint format error

* fix problems commented by peiyang

Co-authored-by: NRyan Jeng <rjeng@nvidia.com>

[Pass Compatible] Bind python compatible. (#27262) · f827665a
Committed by Wilber on Sep 15, 2020

机器未来 / Paddle: up to date with the source project it was forked from
