提交 · c89f269c4d32447eccbd1e53d8d324936c1cd5ba · PaddlePaddle / Paddle

16 9月, 2020 1 次提交
- L
  
  use shared dev_ctx (#27313) · 4c8ea492
  由 Leo Chen 提交于 9月 16, 2020
  
  4c8ea492
15 9月, 2020 7 次提交

Optimize slice trt plugin (#26970) · 47fdc60e

由 Shang Zhizhou 提交于 9月 15, 2020

* optimize slice TRT plugin

This patch removes unnecessary barrier for data transfer of needed offset,
so data transfer can be overlap with GPU kernel execution.

This patch also fixes incorrect name of slice plugin. That is, replaces
"layernorm" with "slice"

test=develop

* add serialize/deserialize to slice plugin

* add static shape slice trt plugin

* fix slice trt op convertor dynamic shape bug

* fix format by clang-format

* fix pylint format error

* fix problems commented by peiyang
Co-authored-by: NRyan Jeng <rjeng@nvidia.com>

47fdc60e

W

[Pass Compatible] Bind python compatible. (#27262) · f827665a
由 Wilber 提交于 9月 15, 2020

f827665a
石

error messages of inference/tests, test=develop (#27259) · bd77a425
由石晓伟提交于 9月 15, 2020

bd77a425

Polish framework error message part 6 (#27257) · dafb0e3b

由 Chen Weihang 提交于 9月 15, 2020

* polish framework error msg part 6

* polish lossed item

* fix failed unittest

* polish by reviewer comments

dafb0e3b

Optimize error report (#27254) · e6e2e537

由 Shang Zhizhou 提交于 9月 15, 2020

* optimize errror report

* add test case for pad op converter

* fix some spelling mistake commented by peiyang

e6e2e537

G
change sequence length attribute to input (#27193) · ee1ed42c
由 GaoWei8 提交于 9月 15, 2020
```
* replace sequence length attr to input
```
ee1ed42c
P
fix trt_dynamic_shape_ernie_deserialize_test (#27290) · 3ae3b864
由 Pei Yang 提交于 9月 15, 2020
```
* fix trt_dynamic_shape_ernie_deserialize_test

* support when opt cache dir does not exist
```
3ae3b864

14 9月, 2020 10 次提交
- J
  
  Add bfloat16 passes (#26999) · 1483ea23
  由 joanna.wozna.intel 提交于 9月 14, 2020
  
  1483ea23
- L
  Improving error report message for sequence_expand op (#27245) · bf461fa5
  由 lilong12 提交于 9月 14, 2020
```
* improve err report, test=develop
```
  bf461fa5
- Z
  Enhance the error messages for files in operators/math · bbad3414
  由 Zhong Hui 提交于 9月 14, 2020
```
Enhance the error messages for  files in operators/math
```
  bbad3414
- C
  
  polish framework error message part 8 (#27269) · 79149c8e
  由 Chen Weihang 提交于 9月 14, 2020
  
  79149c8e
- P
  
  refine error message related to paddle-TRT (#27256) · aae41c6f
  由 Pei Yang 提交于 9月 14, 2020
  
  aae41c6f
- Z
  Update amp_check_finite_and_scale_op and add an updating_loss_scaling op for... · d708b210
  由 Zhen Wang 提交于 9月 14, 2020
```
Update amp_check_finite_and_scale_op and add an updating_loss_scaling op for static graph amp training. (#26240)

* update amp_check_finite_and_scale_op for static_amp.

* use amp_check_finite_and_scale in static graph amp.

* update grads to zero when grads own infinite values(as for amp_checkout_finite_and_scale op).

* add update_loss_scaling op in cpp.

* add update_loss_scaling_op unit test.

* update the doc of the check_finite_and_unscale op

* Update the process of gradients updating skipping if the gradients have infinite values.

* update the way to zero grads.

* update test_update_loss_scaling_op.py

* add log info when find infinite grads.

* add the unit test for UpdateLossScaling Layer.
```
  d708b210
- S
  remove auto mode from localsgd optimizer (#27237) · 2b6a5793
  由 ShenLiang 提交于 9月 14, 2020
```
* rm auto from localsgd
```
  2b6a5793
- A
  Add int8 GRU kernel (#27220) · cc3f4b81
  由 Adam 提交于 9月 14, 2020
```
* Add int8 GRU kernel with UTs

* Lint fixes

* More lint fixes
```
  cc3f4b81
- 石
  
  error messages of inference/capi, test=develop (#27258) · 255e0cf9
  由石晓伟提交于 9月 14, 2020
  
  255e0cf9
- J
  Error description optimize for math dir · 9437ce36
  由 Jack Zhou 提交于 9月 14, 2020
```
Error description optimize for math dir
```
  9437ce36
13 9月, 2020 1 次提交
- Z
  
  use eval to improve performance, test=develop (#25459) · 5c1bafbb
  由 Zhang Ting 提交于 9月 13, 2020
  
  5c1bafbb
12 9月, 2020 1 次提交

Fix GRU mkldnn kernel fail on look_table_v2 (#27198) · 5c4eed66

由 lidanqing 提交于 9月 12, 2020

* Fix the lookup_table_v2 failed on GRU mkldnn kernel issue
test=develop

* fix according to reviews, removed x_num_col_dims
test=develop

* update gru model. change according to reviews
test=develop

* change according to reviews
test=develop

5c4eed66

11 9月, 2020 4 次提交
- C
  
  fix loaded no params layer run error (#27241) · 33ff833a
  由 Chen Weihang 提交于 9月 11, 2020
  
  33ff833a
- W
  
  enhance inference error info. (#27251) · f1ab2882
  由 Wilber 提交于 9月 11, 2020
  
  f1ab2882
- W
  
  Lite subgraph refine predictor (#27167) · 1b84c0bf
  由 Wilber 提交于 9月 11, 2020
  
  1b84c0bf
- F
  
  add empty op (c++, python, unit test) (#26659) · 2e597696
  由 furnace 提交于 9月 11, 2020
  
  2e597696
10 9月, 2020 9 次提交
- L
  add double grad for tile op and expand_v2 op (#27114) · c5f957ae
  由 lilong12 提交于 9月 10, 2020
```
* add double grad for tile, test=develop

* add double grad for expand_v2 op, test=develop
```
  c5f957ae
- L
  add double grad for expand (#27183) · 58a88ba9
  由 lilong12 提交于 9月 10, 2020
```
* add double grad for expand, test=develop
```
  58a88ba9
- Q
  
  fix error msg of fused_embedding_fc_lstm_op, test=develop (#27231) · 7c7fbd32
  由 Qi Li 提交于 9月 10, 2020
  
  7c7fbd32
- Q
  [UT] fix run type of ut test cases of test_train_recognize_digits and... · 78446ecd
  由 Qi Li 提交于 9月 10, 2020
```
[UT] fix run type of ut test cases of test_train_recognize_digits and test_api_impl, test=develop (#27218)
```
  78446ecd
- J
  [oneDNN]Introducing oneDNN 1.6 (#27137) · e0058615
  由 Jacek Czaja 提交于 9月 10, 2020
```
* - introducing oneDNN 1.6

test=develop

* - Removed redundant code

test=develop
```
  e0058615
- S
  
  revert divide (#27202) · 5bd84b22
  由 ShenLiang 提交于 9月 10, 2020
  
  5bd84b22
- W
  fix the CudaPinMemory bug for the equal op (#27176) · fde5cfe8
  由 wawltor 提交于 9月 10, 2020
```
 fix the CudaPinMemory bug for the equal op and add the test case for the equal op
```
  fde5cfe8
- Z
  
  restruct logsumexp to speed up compiling (#27191) · cc3306f7
  由 zhupengyang 提交于 9月 10, 2020
  
  cc3306f7
- S
  update error info for selected_rows_functor · 50e60e87
  由 Steffy-zxf 提交于 9月 10, 2020
```
update error info for selected_rows_functor
```
  50e60e87
09 9月, 2020 5 次提交
- W
  
  Add 2.0 inference api doc. (#27125) · edd962b1
  由 Wilber 提交于 9月 09, 2020
  
  edd962b1
- J
  modified the implement of Lars optimizer (#26733) · 5d039f40
  由 JZ-LIANG 提交于 9月 09, 2020
```
add lars to fleet meta optimizer
```
  5d039f40
- W
  
  [cuda11 support] change the CMakeLists to support the cuda11 (#27124) · c71d79b1
  由 wangchaochaohu 提交于 9月 09, 2020
  
  c71d79b1
- Q
  Add double grad in reduce sum (#27115) · 43b0445b
  由 Qinghe JING 提交于 9月 09, 2020
```
* set default value to strategy in distributed_optimizer test=develop
```
  43b0445b
- K
  optimize the error message for math dir · ed292695
  由 kinghuin 提交于 9月 09, 2020
```
optimize the error message for math dir
```
  ed292695
08 9月, 2020 2 次提交
- fix Norm op error (#26771) · 4558d395
  由 myq406450149 提交于 9月 08, 2020
```
* fix frobenius_norm error, rm p=0 2-axis support. test=develop
```
  4558d395
- L
  Fix kl and summary bug (#27132) · 4d7d6612
  由 LielinJiang 提交于 9月 08, 2020
```
* fix summary rnn

* fix kl_div bug when input shape is [1] and reduction is batchmean
```
  4d7d6612

PaddlePaddle / Paddle 大约 1 年 前同步成功

PaddlePaddle / Paddle
大约 1 年前同步成功