提交 · f79293387b330f615503bdd2cc009fc9fd88bb86 · Crayon鑫 / Paddle

24 4月, 2020 1 次提交
- Z
  
  fix compilation failure (#24090) · f7929338
  由 Zeng Jinle 提交于 4月 24, 2020
  
  f7929338
21 4月, 2020 1 次提交

[cherry-pick2.0]Optimize the error messages of paddle CUDA API (#23849) · 3f4678c9

由 Zhou Wei 提交于 4月 21, 2020

* cherry-pick,Optimize the error messages of paddle CUDA API

* fix the error messages of paddle CUDA API

* Refactoring PADDLE_ENFORCE_CUDA_SUCCESS, and apply to curand/cudnn/cublas/NCCL

* remove build_ex_string

3f4678c9

10 4月, 2020 1 次提交
- L
  test=develop, add addmm op (#23384) · 1c08a213
  由 littletomatodonkey 提交于 4月 10, 2020
```
add addmm op
```
  1c08a213
26 3月, 2020 1 次提交

[Paddle-TRT]: Ernie Dynamic shape support. (#23138) · 430b0099

由 Zhaolong Xing 提交于 3月 26, 2020

* add dynamic plugin support.
test=develop

* change emb eltwise layernorm to math function
test=develop

* add emb eltwise layernorm
test=develop

* can run dynamic shape ernie
test=develop

* fix ci
test=develop

* add ut for trt ernie dynamic

test=develop

* refine dynamic shape c++ interface.
test=develop

* fix comments
test=develop

* fix comments
test=develop

430b0099

11 3月, 2020 1 次提交

[Ernie GPU Optimize]: Embedding_eltwise_layernorm Fuse (#22494) · 8d6dc102

由 Zhaolong Xing 提交于 3月 11, 2020

* 1. add embedding eltwise layernorm fuse
2. add embedding eltwise layernorm op
3. refine inplace_add_relu
4. refine fc_eltwise_layernorm
test=develop

* 1. refine fc
test=develop

* fix comments
test=develop

* fix comments

test=develop

8d6dc102

28 2月, 2020 1 次提交
- T
  
  fix typo word (#22784) · 433cef03
  由 tianshuo78520a 提交于 2月 28, 2020
  
  433cef03
23 2月, 2020 1 次提交
- T
  
  fix typo words (#22653) · d2ba91aa
  由 tianshuo78520a 提交于 2月 23, 2020
  
  d2ba91aa
10 2月, 2020 1 次提交
- Y
  Fix dismatch of std::max's arguments type on windows. (#22507) · 4b2227e9
  由 Yiqun Liu 提交于 2月 10, 2020
```
test=develop
```
  4b2227e9
07 2月, 2020 1 次提交

Fix the integer overflow problem of sequence2batch (#22479) · a61d0952

由 Zhong Hui 提交于 2月 07, 2020

Fix the  integer overflow problem in the op of sequence2batch, change the int32_t to size_t，
In the /paddle/fluid/operators/math/sequence2batch.h#L122.

a61d0952

06 2月, 2020 1 次提交

Correct the use of DeviceContext in unittest sequence_pooling_test and... · 44b45b9f

由 Yiqun Liu 提交于 2月 06, 2020

Correct the use of DeviceContext in unittest sequence_pooling_test and sequence_padding_test (#22456)

* Add log in memory::Copy for debug purpose.

* Change to use context in DeviceContextPool directly in sequence_pooling_test, instead to new one.

* Change to use context in DeviceContextPool directly in sequence_padding_test, instead to new one.
test=develop

* Change the type of second_dim from size_t to int64_t.
test=develop

44b45b9f

19 1月, 2020 1 次提交
- W
  
  Optimize the depthwise op test=develop (#22265) · 0d8b222b
  由 wangchaochaohu 提交于 1月 19, 2020
  
  0d8b222b
07 1月, 2020 1 次提交
- C
  
  replace CUDNN_ENFORCE with PADDLE_ENFORCE_CUDA_SUCCESS, test=develop (#22109) · ba8414d3
  由 Chen Weihang 提交于 1月 07, 2020
  
  ba8414d3
04 1月, 2020 1 次提交
- K
  
  polish cross_entropy ENFORCE (#22056) · 34c57120
  由 Kaipeng Deng 提交于 1月 04, 2020
  
  34c57120
23 12月, 2019 1 次提交
- G
  optimize fc jit (#21878) · d4dda862
  由 GaoWei8 提交于 12月 23, 2019
```
test=develop
```
  d4dda862
11 12月, 2019 1 次提交
- G
  Modify padding strategy: remove weight copy in fc padding (#21650) · 5af0c7ba
  由 GaoWei8 提交于 12月 11, 2019
```
test=develop
```
  5af0c7ba
02 12月, 2019 1 次提交

fix -Wno-error=sign-compare warning in gcc8 (#21434) · 01fa4ead

由 Tao Luo 提交于 12月 02, 2019

* fix -Wno-error=sign-compare warning in gcc8

test=develop

* fix warning in distributed codes

test=develop

01fa4ead

28 11月, 2019 1 次提交

remove -Wno-error=sign-compare, make warning as error (#21358) · c0656dcb

由 Tao Luo 提交于 11月 28, 2019

* remove -Wno-error=sign-compare, make warning as error

test=develop test=document_fix

* fix exist compile warning

test=develop

c0656dcb

27 11月, 2019 1 次提交
- G
  Polish the codes of fc when needs padding (#21378) · 8493f20e
  由 GaoWei8 提交于 11月 27, 2019
```
test=develop
```
  8493f20e
26 11月, 2019 1 次提交

Add fc padding to improve mkl GEMM's performance when N and K are multiple of 128. (#20972) · 234060f8

由 GaoWei8 提交于 11月 26, 2019

* Add fc padding to solve mkl performance
test=develop

* fix gpu pass and error information
test=develop

* fix fc_fuse_pass_test
test=develop

* fix error information
test=develop

* fix error information
test=develop

* fix name and add fc op padding test
test=develop

* fix attributes
test=develop

* optimize fc padding
test=develop

* fix test
test=develop

234060f8

22 11月, 2019 1 次提交

add dequantize_abs_max op and modify lookup_table op (#20899) · f0b15184

由 Liufang Sang 提交于 11月 22, 2019

* add int8 kernel to lookup_table op and add dequantize op test=develop

* change paddle_enforce to paddle_enforce_eq test=develop

* change copyright and change some not suitable code test=develop

* remove debug log test=develop

* replace GetInputType with IndicateVarDataType test=develop

* fix EmptyGradMaker test=develop

* fix diff between cpu and gpu test=develop

* use memcopy when int8_t test=develop

f0b15184

14 11月, 2019 1 次提交
- W
  
  Fix warpctc in padding mode. (#21033) · cfdd1fc2
  由 whs 提交于 11月 14, 2019
  
  cfdd1fc2
12 11月, 2019 1 次提交

fix the computation for dx (grad for x) for prelu operation. (#20949) · e249d9a3

由 lilong12 提交于 11月 12, 2019

* set the default value of alpha for prelu to 0.25, test=develop

* add the call to __syncthreads(), test=develop

* fix the implementation of cpu prelu, test=develop

* repair the implementation of element mode prelu, test=develop

* modify test_prelu_op.py, test=develop

e249d9a3

08 11月, 2019 1 次提交

Add dependency for error_codes.proto (#21084) · 2f27b103

由 Chen Weihang 提交于 11月 08, 2019

* fix activation_functions deps, test=develop, test=document_fix

* add error_codes_proto deps, test=develop, test=document_fix

* try delete enforce.h, test=develop, test=document_fix

2f27b103

05 11月, 2019 2 次提交
- Z
  Fix ce ocr_recognition test fails (#20987) · 0059404e
  由 zhaoyuchen2018 提交于 11月 05, 2019
```
ocr_recognition fails, so add a path to handle small frame_size.

test=develop
```
  0059404e
- T
  refine murmurhash3_x64_128 for bloom_filter (#20996) · 25ffa844
  由 Tao Luo 提交于 11月 05, 2019
```
test=develop
```
  25ffa844
01 11月, 2019 1 次提交

Fix gru as small frame_size has error. (#20922) · 7f3a445e

由 zhaoyuchen2018 提交于 10月 31, 2019

seems shuffle_sync cannot handle small size

test=develop
Signed-off-by: Nzhaoyuchen <zhaoyuchen01@baidu.com>

7f3a445e

31 10月, 2019 2 次提交
- Z
  maxout supports channel_last input (#20846) · 8d1e9f0f
  由 Zhang Ting 提交于 10月 31, 2019
```
* maxout support channel_last input, test=develop

* modified details of Input(X) and Attr(groups, axis) in doc, test=develop
```
  8d1e9f0f
- Z
  
  fix the bug of conv_transpose:compatible with Anylayout setting, test=develop (#20897) · c18f1bd7
  由 Zhang Ting 提交于 10月 31, 2019
  
  c18f1bd7
30 10月, 2019 1 次提交
- Z
  
  fix select_rows mergeadd bug, test=develop (#20876) · d4289125
  由 zhang wenhui 提交于 10月 30, 2019
  
  d4289125
28 10月, 2019 1 次提交
- A
  
  add pyramid_hash_op (#20698) · aacd16db
  由 Aurelius84 提交于 10月 28, 2019
  
  aacd16db
23 10月, 2019 1 次提交

Bug Fix: Paddle-TRT cannot handle adaptive pooling in pool2d op converter and... · e89c16b9

由 Pei Yang 提交于 10月 23, 2019

Bug Fix: Paddle-TRT cannot handle adaptive pooling in pool2d op converter and "num" attribute in split op converter (#20733)

* fix pool2d trt converter, test=develop

* add fix for split op converter, test=develop

e89c16b9

16 10月, 2019 1 次提交
- Q
  Support fp16 in GPU impl of fused_elemwise_activation_op. (#20636) · 01eddc1a
  由 qingqing01 提交于 10月 16, 2019
```
* Support fp16 in fused_elemwise_activation_op.
* Fix unit testing in ONLY-CPU mode.
```
  01eddc1a
13 10月, 2019 1 次提交
- Z
  
  fix conv_transpose's bug: compatible with Anylayout setting, test=develop (#20589) · 78910480
  由 Zhang Ting 提交于 10月 13, 2019
  
  78910480
09 10月, 2019 1 次提交

mv two function in conv op for good code style (#20116) · ad60b3b8

由 liym27 提交于 10月 09, 2019

* Delete PadFuntion, include padding.h instead. test=develop

* move function(IsSymmetricPadding) from conv_cudnn_op.cu/conv_transpose_cudnn_op.cu to padding.h, test=develop

ad60b3b8

07 10月, 2019 1 次提交
- Z
  
  conv_transpose supports channel_last input, test=develop, test=document_preview (#20072) · cf6919bf
  由 Zhang Ting 提交于 10月 07, 2019
  
  cf6919bf
30 9月, 2019 1 次提交
- D
  Improve elementwise operators performance in same dimensions. (#19763) · 425279a5
  由 danleifeng 提交于 9月 30, 2019
```
Improve elementwise operators performance in same dimensions
```
  425279a5
29 9月, 2019 1 次提交

fix conv2d and conv3d: (#20042) · 3aa331d9

由 liym27 提交于 9月 29, 2019

1.support asymmetric padding;
    2.support padding algorithm:"SAME" and "VALID";
    3.support channel_last: data_format NHWC and NDHWC;
    4.change doc of python API and c++;

    test=develop, test=document_preview

3aa331d9

28 9月, 2019 1 次提交

fix pool2d pool3d,support asymmetric padding and channel_last (#19739) · 24010472

由 liym27 提交于 9月 28, 2019

* fix pool2d pool3d:
1. support asymmetric padding;
2. support padding algorithm:"SAME" and "VALID";
3. support channel_last: data_format NHWC and NDHWC;
4. support inferring shape when input with negative dims in compile time;
5. change doc of python API and c++;
6. fix bug in cuda kernel when Attr(adaptive) is true.

test=develop,test=document_preview

* fix 'tensors' to 'Tensors'. test=develop,test=document_preview

* add test for converage ValueError.test=develop,test=document_preview

* resolve conflict in test_pool2d. test=develop

24010472

27 9月, 2019 1 次提交
- C
  Add fp16 support for pad and split (#19881) · fb2a9cdf
  由 chengduo 提交于 9月 27, 2019
```
* make pad and split support fp16
test=develop
```
  fb2a9cdf
25 9月, 2019 1 次提交

add support of matmul with multiple head even different width and height (#19708) · c670058a

由 Bob Zhu 提交于 9月 25, 2019

* add support of matmul with multiple head even different width and height

Original matmul with multiple head supports only the mat_a.width == mat_b.height,
in that case, mat_b will be horizontally split. In this patch, we extend the
support when mat_a.width != mat_b.height but mat_a.width/head_number == mat_b.height,
in this case, mab_b will be vertically split.

One example is A is [3, 8], B is [2, 16], head_number is 4. In this
case, A will be split as [3, 2], B will be (vertically) split as
[2, 4]. The final result will be 4 matrix of 4 matrix of [3,4], i.e. [3, 16]

test=develop

* add support of matmul with multiple head even different width and height

Original matmul with multiple head supports only the mat_a.width == mat_b.height,
in that case, mat_b will be horizontally split. In this patch, we extend the
support when mat_a.width != mat_b.height but mat_a.width/head_number == mat_b.height,
in this case, mab_b will be vertically split.

One example is A is [3, 8], B is [2, 16], head_number is 4. In this
case, A will be split as [3, 2], B will be (vertically) split as
[2, 4]. The final result will be 4 matrix of 4 matrix of [3,4], i.e. [3, 16]

test=develop

* refactor the code of matmul with multiple head even different width and height

test=develop

c670058a

Crayon鑫 / Paddle 与 Fork 源项目一致

Crayon鑫 / Paddle
与 Fork 源项目一致