提交 · 1c6d0646276dc9e694c8dab0a3d5284670e358e7 · BaiXuePrincess / Paddle

13 5月, 2019 2 次提交

Optimize the elementwise op using eigen (#15494) · dcda2023

由 Yiqun Liu 提交于 5月 13, 2019

* Optimize the elementwise op with CUDA kernels.
test=develop

* Support setting of attr in op config file.
test=develop

* Add the support the setting dtype and initializer in config.
test=develop

* Save workspace.

* Add initializer "zeros".
test=develop

* Fix compiling error.

* Support the use of existed file to initailize tensor in op_tester.

* Use eigen to optimize the elementwise_add/mul for the case that x and y have the same dims.
test=develop

dcda2023

add double grad for elementwise_mul op (#17255) · 8bae8590

由 Kaipeng Deng 提交于 5月 13, 2019

* add double grad for elementwise_mul. test=develop

* remove comment. test=develop

* fix grad sum. test=develop

* fix for axis expand. test=develop

* add test for axis expand. test=develop

8bae8590

09 5月, 2019 1 次提交

Mod floordiv (#17251) · 4292bd86

由 zhoukunsheng 提交于 5月 09, 2019

* test=develop
add elementwise_mod and elementwise_floordiv, fix equation problem in elementwise_mod

4292bd86

08 5月, 2019 2 次提交

Refine elementwise kernel. (#16952) · 792443ef

由 zhaoyuchen2018 提交于 5月 08, 2019

* Refine elementwise kernel.

Add a simple cuda kernel if grad x and y both exist
Use 2D block cuda kernel to do broadcast.

test=develop
Signed-off-by: Nzhaoyuchen <zhaoyuchen01@baidu.com>

* refine code.

test=develop
Signed-off-by: Nzhaoyuchen <zhaoyuchen01@baidu.com>

* refine code.

test=develop
Signed-off-by: Nzhaoyuchen <zhaoyuchen01@baidu.com>

792443ef

G

Fix code in document. (#17237) · 91784f8e
由 gongweibao 提交于 5月 08, 2019

91784f8e

06 5月, 2019 1 次提交

Add use_cuda to inplace pass (#17205) · ee2028a1

由 Zeng Jinle 提交于 5月 05, 2019

* add use_cuda to inplace pass,test=develop

* add test softmax_with_xe_inplace test,test=develop

ee2028a1

16 4月, 2019 1 次提交
- L
  
  disable test_elementwise_mul_mkldnn_op case · 61cc842a
  由 Leo Zhao 提交于 4月 16, 2019
  
  61cc842a
12 4月, 2019 1 次提交
- L
  convert output to nchw format to align with native version in avx512 mode · a9694bd3
  由 Leo Zhao 提交于 4月 12, 2019
```
test = develop
resolve #16764
```
  a9694bd3
03 4月, 2019 1 次提交
- Z
  Fix some grad op desc makers (#16633) · 1c526e1d
  由 Zeng Jinle 提交于 4月 02, 2019
```
* fix some grad op desc maker
test=develop

* fix grad op desc makers
test=develop
```
  1c526e1d
28 3月, 2019 1 次提交

[MKL-DNN] Tensor modifications revert (#16462) · 26323274

由 Jacek Czaja 提交于 3月 28, 2019

* Revert "[MKL-DNN] Fix to crash of Transformer when mkldnn is to be used (#16233)"

This reverts commit 13816dd4.
Apart from enabling transformer for MKL-DNN

* Revert "- MKL-DNN pooling updated to set_prim_desc"

This reverts commit c63f6b20.

Conflicts:
	paddle/fluid/operators/mkldnn/concat_mkldnn_op.cc

* Revert "[MKL-DNN] MKL-DNN specific Tensor modification (#15429)"

test=develop

This reverts commit dec9cf53.

* - concat compilation fix

- lint

test=develop

- Lint fixes

test=develop

- Lint fixes

test=develop

- Fix Transpose MKLDNN op

test=develop

26323274

27 3月, 2019 1 次提交

Memory optimize (#16410) · 8d22bc17

由 liuwei1031 提交于 3月 27, 2019

* fix cdn issue, test=develop

* fix memory optimize bugs, test=develop

* fix memory optimize bugs, test=develop

* remove add/sub_2 op, test=develop

* disable memory_optimize by default, test=develop

* disable inplace activation in python, test=develop

* fix unittests, test=develop

* fix unittests, test=develop

* bug-fix, test=develop

8d22bc17

24 3月, 2019 1 次提交
- S
  add op registry type · a93a9eef
  由 sneaxiy 提交于 3月 22, 2019
```
refine gc code
test=develop
```
  a93a9eef
21 3月, 2019 3 次提交
- P
  
  fix time; test=develop · 5dc9b519
  由 phlrain 提交于 3月 21, 2019
  
  5dc9b519
- P
  
  add floordiv and mod op; test=develop · 18d107c2
  由 phlrain 提交于 3月 21, 2019
  
  18d107c2
- P
  
  add elementwise floordiv, mod; test=develop · 56c2d384
  由 phlrain 提交于 3月 21, 2019
  
  56c2d384
08 3月, 2019 1 次提交
- T
  simplify the jitkernel templates and tests · 14a764c9
  由 tensor-tang 提交于 3月 08, 2019
```
test=develop
```
  14a764c9
07 3月, 2019 1 次提交
- T
  unify the kernelfuncs cache and add unit test · 802f362a
  由 tensor-tang 提交于 3月 07, 2019
```
test=develop
```
  802f362a
26 2月, 2019 1 次提交

- MKL-DNN pooling updated to set_prim_desc · c63f6b20

由 Jacek Czaja 提交于 2月 04, 2019

- MKLDNN ops revisited

- disabled softmax modifications

- disabled elementwise_add

- reverted LRN modifications

- reverted SUM primitive

- Partial reviing of softmax

- Enable softmax

- Softmax changes

- LRN is back

- LRN partially disabled

- LRN is back

- LRN fix

- compilation fixes

- Sum fixed(hopefully)

- Enabling (partially) elementwise_add

- Fixes to elemenwise_add

- Lint fixes

quantize fix

- compilation fix

test=develop

Disabling pooling

- Disabled quantize op

test=develop

c63f6b20

09 2月, 2019 1 次提交
- D
  
  add details. test=develop · 104d3b4e
  由 dzhwinter 提交于 2月 09, 2019
  
  104d3b4e
06 2月, 2019 1 次提交
- D
  
  add details. test=develop · 94dd50c3
  由 dzhwinter 提交于 2月 06, 2019
  
  94dd50c3
29 1月, 2019 2 次提交
- K
  Small fix · 69b7c595
  由 Krzysztof Binias 提交于 1月 29, 2019
```
test=develop
```
  69b7c595
- K
  Make separate folders for mkldnn codes · b1bdcd4d
  由 Krzysztof Binias 提交于 1月 28, 2019
```
test=develop
```
  b1bdcd4d
24 1月, 2019 1 次提交
- C
  Clean elementwise_op_function (#15502) · bf91d11e
  由 chengduo 提交于 1月 24, 2019
```
test=develop
```
  bf91d11e
21 1月, 2019 1 次提交
- D
  
  squash commits. test=develop · 8f3b2523
  由 dzhwinter 提交于 1月 21, 2019
  
  8f3b2523
10 1月, 2019 1 次提交

[Feature] support mix precision training for resnet (#14899) · fd854183

由 Wu Yi 提交于 1月 10, 2019

* clip softmax for fp16

* updates

* fuse xent support fp16 test=develop

* wip

* wip

* add simple row reduce

* wip fp16 accurate softmax

* add accurate softmax kernel for fp16 test=develop

* update test=develop

* fix cpu build test=develop

* update api.spec test=develop

* follow comments test=develop

* fix build test=develop

* fix trt build test=develop

* fix inference build test=develop

* fix merge test=develop

* update test=develop

* try fix build test=develop

* fix build test=develop

* rename real_exp test=develop

* fortest

* remove hacky kernels test=develop

* clean up test=develop

fd854183

26 12月, 2018 1 次提交

Fp16 training (#14992) · 856f0da0

由 Wu Yi 提交于 12月 26, 2018

* wip

* wip

* wip

* wip for test

* add fp16 tests test=develop

* fix cpu build test=develop

* fix test=develop

* fix py3 tests test=develop

* fix lr_scheduler dtype test=develop

* fix test=dvelop

* test fix ci compile test=develop

* fix build and merge test=develop

* fallback momentumop change to general test=develop

* make fp16 lr schedule simple test=develop

* fix ut test=develop

* fix tests test=develop

* remove fp16 learning rate cast test=develop

856f0da0

24 12月, 2018 1 次提交
- Y
  Fix the exception when tensor format is x · d4606bcb
  由 Yihua Xu 提交于 12月 24, 2018
```
test=develop
```
  d4606bcb
21 12月, 2018 2 次提交
- P
  fix build issue · 2e35290f
  由 peizhilin 提交于 12月 21, 2018
```
test=develop
```
  2e35290f
- P
  fix code style · 201283f9
  由 peizhilin 提交于 12月 21, 2018
```
test=develop
```
  201283f9
20 12月, 2018 3 次提交

T
Revert "[Feature] Fp16 training for resnet50 (#14850)" · da87f7a6
由 typhoonzero 提交于 12月 20, 2018
```
This reverts commit 3d750f9c.
```
da87f7a6
T
fix enum style · 1aaec571
由 tensor-tang 提交于 12月 20, 2018
```
test=develop
```
1aaec571

[Feature] Fp16 training for resnet50 (#14850) · 3d750f9c

由 Wu Yi 提交于 12月 20, 2018

* wip

* wip

* wip

* wip for test

* add fp16 tests test=develop

* fix cpu build test=develop

* fix test=develop

* fix py3 tests test=develop

* fix lr_scheduler dtype test=develop

* fix test=dvelop

* test fix ci compile test=develop

* fix build and merge test=develop

* fallback momentumop change to general test=develop

3d750f9c

19 12月, 2018 5 次提交
- P
  use the platform api to decide the specific instruction support or not · 9f55f1ff
  由 peizhilin 提交于 12月 19, 2018
```
test=develop
```
  9f55f1ff
- S
  rewrite variable type · ae6f46a1
  由 sneaxiy 提交于 12月 19, 2018
```
test=develop
```
  ae6f46a1
- P
  fix the build issue · 0b4f742e
  由 peizhilin 提交于 12月 19, 2018
```
test=develop
```
  0b4f742e
- P
  fix build issue when xbyak is disabled on windows · da42cf20
  由 peizhilin 提交于 12月 19, 2018
```
test=develop
```
  da42cf20
- P
  disable xbyak on windows · 1cc9d598
  由 peizhilin 提交于 12月 19, 2018
```
test=develop
```
  1cc9d598
18 12月, 2018 3 次提交
- T
  
  fix build · 6648995f
  由 tensor-tang 提交于 12月 17, 2018
  
  6648995f
- S
  rewrite ddim · a500dfa5
  由 sneaxiy 提交于 12月 18, 2018
```
test=develop
```
  a500dfa5
- P
  Fix the mkl build script on windows · fa135bbf
  由 peizhilin 提交于 12月 18, 2018
```
test=develop
```
  fa135bbf

BaiXuePrincess / Paddle 与 Fork 源项目一致

BaiXuePrincess / Paddle
与 Fork 源项目一致