提交 · 931cba2e649b5cc832e08fb7807b9cdbd6ff9409 · Crayon鑫 / Paddle

20 4月, 2020 3 次提交

OP(fusion_gru) error message enhancement. test=develop (#23591) · a28a63a9

由 zhaoyuchen2018 提交于 4月 20, 2020

* OP(fusion_gru) error message enhancement. test=develop

* refine code, test=develop

* Refine inout log, test=develop

* Refine description, test=develop

a28a63a9

Optimize the error messages of paddle CUDA API (#23816) · 78170037

由 Zhou Wei 提交于 4月 20, 2020

* Optimize the error messages of paddle CUDA API, test=develop

* fix the error messages of paddle CUDA API, test=develop

* Refactoring PADDLE_ENFORCE_CUDA_SUCCESS, and apply to curand/cudnn/cublas/NCCL,test=develop

* remove build_ex_string,test=develop

* merge conflict,test=develop

78170037

Y

Op(conv2d_fusion) error message enhancement. (#23596) · 8d0b0cb4
由 Yiqun Liu 提交于 4月 20, 2020

8d0b0cb4

15 4月, 2020 2 次提交
- Y
  
  fusion_seqconv_eltadd_relu error message enhancement. (#23554) · f5f76e61
  由 yiicy 提交于 4月 15, 2020
  
  f5f76e61
- Z
  OP(fused_embedding_fc_lstm) error message enhancement. test=develop (#23527) · f0b08123
  由 zhaoyuchen2018 提交于 4月 15, 2020
```
* API(fused_embedding_fc_lstm) error message enhancement. test=develop

C++ API enhancement.

* Refine code, test=develop

* Refine code. test=develop
```
  f0b08123
14 4月, 2020 2 次提交
- Y
  
  fusion_seqexpand_concat_fc error message enhancement, test=develop (#23558) · de3e299d
  由 yiicy 提交于 4月 14, 2020
  
  de3e299d
- H
  
  [error message enhancement] fused_elemwise_activation_op and fusion_conv_inception_op (#23686) · 5fe3b638
  由 huzhiqiang 提交于 4月 14, 2020
  
  5fe3b638
12 4月, 2020 1 次提交
- Z
  
  fix bug for exhaustive_search in conv_fusion_op, test=develop (#23727) · b4b6763a
  由 zhongpu 提交于 4月 12, 2020
  
  b4b6763a
10 4月, 2020 3 次提交
- Z
  OP(fusion_gru) error message enhancement. test=develop (#23599) · 7b5e23c0
  由 zhaoyuchen2018 提交于 4月 10, 2020
```
C++ OP enhancement.
```
  7b5e23c0
- W
  error message enhancement for fusion_seqpool_concat_op. test=develop (#23563) · 1ac9db43
  由 Wilber 提交于 4月 10, 2020
```
error message enhancement for fusion_seqpool_concat_op
```
  1ac9db43
- W
  error message enhancement for py_func op. (#23565) · 286c2e0e
  由 Wilber 提交于 4月 10, 2020
```
error message enhancement for py_func op. 
```
  286c2e0e
09 4月, 2020 2 次提交
- Z
  Refine transpose flatten concat error message (#23625) · f3456071
  由 Zhaolong Xing 提交于 4月 09, 2020
```
* refine fusion_transpose_flatten_concat_op log
test=develop

* fix ci error
test=develop
```
  f3456071
- W
  error message enhancement for repeated fc. test=develop (#23562) · 5f22478a
  由 Wilber 提交于 4月 09, 2020
```
error message enhancement for repeated fc
```
  5f22478a
07 4月, 2020 1 次提交
- Z
  
  Op (FusionSquaredMatSub) error message enhancement. (#23498) · 638d924d
  由 zhangchunle 提交于 4月 07, 2020
  
  638d924d
04 4月, 2020 2 次提交

Z

Op (FusedEmbeddingSeqPool) error message enhancement. (#23454) · fd9b7bdb
由 zhangchunle 提交于 4月 04, 2020

fd9b7bdb

Delete Ref & VectorRef and add GetDataSafely (#22997) · 16315d3d

由 Chen Weihang 提交于 4月 04, 2020

* delete invalid check inferface Ref & VectorRef, test=develop

* fix vector ref delete error, test=develop

* try the new check inferface, test=develop

* change all related code with new check macro, test=develop

* remove static assert, test=develop

* polish detail, test=develop

* skip coverage problem, test=develop

* add new check macro, test=develop

16315d3d

03 4月, 2020 1 次提交

support Exhaustive search in dygraph (#23415) · dbfbd7ea

由 zhongpu 提交于 4月 03, 2020

* use global conv cache; test=develop

* use singleton cache; test=develop

* fix format error; test=develop

* add cudnn helper header; test=develop

* fix header error; test=develop

* fix mac unitest; test=develop

* fix mac unitest; test=develop

* fix file format; test=develop

* fix include file error, test=develop

* remove kernel_configs_ in class ExecutionContext and kernel_configs_map_ in class OperatorWithKernel, test=develop

* fix test_elementwise_mul_op_dim, test=develop

* fix compile error, test=develop
Co-authored-by: Nphlrain <phliuhongyu@126.com>

dbfbd7ea

02 4月, 2020 2 次提交

Z
Revert "Exhaustive search (#22821)", test=develop (#23401) · bfb07aaf
由 zhongpu 提交于 4月 02, 2020
```
This reverts commit 48144e40.
```
bfb07aaf

Exhaustive search (#22821) · 48144e40

由 zhongpu 提交于 4月 02, 2020

* use global conv cache; test=develop

* use singleton cache; test=develop

* fix format error; test=develop

* add cudnn helper header; test=develop

* fix header error; test=develop

* fix mac unitest; test=develop

* fix mac unitest; test=develop

* fix file format; test=develop

* fix include file error, test=develop

* remove kernel_configs_ in class ExecutionContext and kernel_configs_map_ in class OperatorWithKernel, test=develop

* fix test_elementwise_mul_op_dim, test=develop
Co-authored-by: Nphlrain <phliuhongyu@126.com>

48144e40

26 3月, 2020 1 次提交

[Paddle-TRT]: Ernie Dynamic shape support. (#23138) · 430b0099

由 Zhaolong Xing 提交于 3月 26, 2020

* add dynamic plugin support.
test=develop

* change emb eltwise layernorm to math function
test=develop

* add emb eltwise layernorm
test=develop

* can run dynamic shape ernie
test=develop

* fix ci
test=develop

* add ut for trt ernie dynamic

test=develop

* refine dynamic shape c++ interface.
test=develop

* fix comments
test=develop

* fix comments
test=develop

430b0099

20 3月, 2020 2 次提交

W
update embedding_eltwise_layernorm fuse and kernel. test=develop (#23114) · 95b356a0
由 Wilber 提交于 3月 20, 2020
```
update embedding_eltwise_layernorm fuse pass and fused kernel, to support multi input
```
95b356a0

Add dygraph double grad implementation (#22939) · a31d7328

由 Zeng Jinle 提交于 3月 20, 2020

* add double grad implementation for dygraph, test=develop

* polish code, add uts, test=develop

* fix place bug, test=develop

* polish codes, add more uts for coverages, test=develop

* add no_grad_set, test=develop

* add star gan ut, test=develop

* follow comments, test=develop

a31d7328

19 3月, 2020 1 次提交
- Z
  fix align error (#23090) · 8c6fde9e
  由 Zhaolong Xing 提交于 3月 19, 2020
```
test=develop
```
  8c6fde9e
13 3月, 2020 1 次提交
- W
  Add Unittest for backward of fusion group (#22932) · 3757e068
  由 wangchaochaohu 提交于 3月 13, 2020
```
* add fusion group test for backward and refine code
```
  3757e068
12 3月, 2020 1 次提交
- W
  Cast fusion for fusion group (#22876) · f0d193a2
  由 wangchaochaohu 提交于 3月 12, 2020
```
* add support for expression type convert and add cast Op support in fusion group
```
  f0d193a2
11 3月, 2020 1 次提交

[Ernie GPU Optimize]: Embedding_eltwise_layernorm Fuse (#22494) · 8d6dc102

由 Zhaolong Xing 提交于 3月 11, 2020

* 1. add embedding eltwise layernorm fuse
2. add embedding eltwise layernorm op
3. refine inplace_add_relu
4. refine fc_eltwise_layernorm
test=develop

* 1. refine fc
test=develop

* fix comments
test=develop

* fix comments

test=develop

8d6dc102

09 3月, 2020 1 次提交

Imperative tracer refactoring (#22457) · d33c4343

由 Zeng Jinle 提交于 3月 09, 2020

* refine grad maker, test=develop

* refactor tracer stage 1, test=develop

* merge develop to solve conflict third times, test=develop

d33c4343

05 3月, 2020 1 次提交
- Z
  [BUG]: Multihead matmul op's ouput size should be BxSx(N*H) (#22848) · 1a533ed2
  由 Zhaolong Xing 提交于 3月 05, 2020
```
test=develop
```
  1a533ed2
28 2月, 2020 1 次提交
- T
  
  fix typo word (#22784) · 433cef03
  由 tianshuo78520a 提交于 2月 28, 2020
  
  433cef03
23 2月, 2020 1 次提交
- T
  
  fix typo words (#22653) · d2ba91aa
  由 tianshuo78520a 提交于 2月 23, 2020
  
  d2ba91aa
21 2月, 2020 1 次提交
- Y
  
  Add the support of fp16 in fusion_group (#22239) · 22bbd547
  由 Yiqun Liu 提交于 2月 21, 2020
  
  22bbd547
13 2月, 2020 1 次提交

[Ernie GPU Optim]: Fuse three fc to multihtead matmul (#22486) · 8acd745c

由 Zhaolong Xing 提交于 2月 13, 2020

* 1. optim multihead matmul: fuse three fc to multihtead matmul

test=develop

* fix conflict
test=develop

* fix comments
test=develop

8acd745c

10 2月, 2020 1 次提交
- W
  
  fix test_fusion_seqpool_concat lod level between compile and runtime (#22488) · 870f4658
  由 Wilber 提交于 2月 10, 2020
  
  870f4658
16 1月, 2020 1 次提交
- L
  
  change std::cout to log(INFO), vlog (#22316) · 895f8da7
  由 lidanqing 提交于 1月 16, 2020
  
  895f8da7
10 1月, 2020 1 次提交

Add bn and relu fuse pass (#22048) · 46189b16

由 Zhen Wang 提交于 1月 10, 2020

* add bn and relu fuse pass

* add op attr assert and dtype assert

* fix some inputs&&outputs bugs for the fused op and pattern.

* add the unittest for fuse_bn_act_pass. test=develop

* use normative enforce statements. test=develop

* add the cpu test. test=develop

* add the support of batch_size=1 for the bn with relu op. test=develop

* add the error type for paddle throws. test=develop

* add fused_batch_norm_act and fused_batch_norm_act_grad to op_has_unsed_vars_white_list. test=develop

46189b16

07 1月, 2020 2 次提交
- Z
  Fix windows build not kernel issue, test=develop (#22105) · 3dbd4087
  由 zhaoyuchen2018 提交于 1月 07, 2020
```
windows conv_fusion failed as no kernel， explicit declare lambda
Signed-off-by: Nzhaoyuchen <zhaoyuchen01@baidu.com>
```
  3dbd4087
- C
  
  replace CUDNN_ENFORCE with PADDLE_ENFORCE_CUDA_SUCCESS, test=develop (#22109) · ba8414d3
  由 Chen Weihang 提交于 1月 07, 2020
  
  ba8414d3
03 1月, 2020 1 次提交

Add the first implememtation of fusion_group op (#19621) · d4832077

由 Yiqun Liu 提交于 1月 03, 2020

* Add the dynamic load of nvrtc, and support runtime compiling of CUDA kernel using nvrtc.
test=develop

* Call CUDA driver api to launch the kernel compiled by nvrtc.
test=develop

* Disable for mac and windows.
test=develop

* Refine the codes to support manually specified num_threads and workload_per_thread.
test=develop

* Refine the CUDA kernel to support large dims.
test=develop

* Add DeviceCodePool to manage all device codes.

* Add the first implementation fusion_group op.

* Add unit-test for fusion_group op.

* Add the check of result.

* Add the check of nvrtc in unit-test.
test=develop

* Add comment to explain the inputs, outputs and features of fusion_group op.
test=develop

* Disable fusion_group op for mac and windows.
test=develop

* Make the compiling of device code return status instead of hanging up.
test=develop

* Add the check of whether there is CUDA driver library, and do not core dump when failing to call the CUDA driver API.

* Unify fusion_group_op's input and output names.
test=develop

* Add the check of CUDA driver library in unittest.
test=develop

* Refine the calling of PADDLE_ENFORCE.
test=develop

d4832077

27 12月, 2019 1 次提交

Refine multihead kernel, align block to 32 (#21961) · 8859ddd6

由 zhaoyuchen2018 提交于 12月 27, 2019

* Refine multihead kernel, align block to 32

test=develop
Signed-off-by: Nzhaoyuchen <zhaoyuchen01@baidu.com>

* Refine log comments

test=develop
Signed-off-by: Nzhaoyuchen <zhaoyuchen01@baidu.com>

8859ddd6

16 12月, 2019 1 次提交
- Z
  Fix softmax cuda bug (#21720) · a5a8d144
  由 zhaoyuchen2018 提交于 12月 16, 2019
```
* Fix softmax cuda bug

* Refine multihead log and softmax logic
```
  a5a8d144

Crayon鑫 / Paddle 与 Fork 源项目一致

Crayon鑫 / Paddle
与 Fork 源项目一致