提交 · dcda20233cedcc700a7556ec3fb7dbf689da6c15 · BaiXuePrincess / Paddle

13 5月, 2019 1 次提交

Optimize the elementwise op using eigen (#15494) · dcda2023

由 Yiqun Liu 提交于 5月 13, 2019

* Optimize the elementwise op with CUDA kernels.
test=develop

* Support setting of attr in op config file.
test=develop

* Add the support the setting dtype and initializer in config.
test=develop

* Save workspace.

* Add initializer "zeros".
test=develop

* Fix compiling error.

* Support the use of existed file to initailize tensor in op_tester.

* Use eigen to optimize the elementwise_add/mul for the case that x and y have the same dims.
test=develop

dcda2023

11 12月, 2018 1 次提交
- Y
  Fix Eigen macro when using GPU · 7604b1ad
  由 Yu Yang 提交于 12月 11, 2018
```
The macro should be defined by compiler rather than by source.

test=develop
```
  7604b1ad
16 11月, 2018 1 次提交

Refine operator cmake (#14413) · a2d9b344

由 Wu Yi 提交于 11月 16, 2018

* wip simplify operator framework

* wip

* wip

* done test=develop

* clean test=develop

* fix test=develop

* fix deps test=develop

* fix cpu build test=develop

* fix tensorrt build test=develop

* fix tests test=develop

* fix test=develop

* fix cpu build test=develop

a2d9b344

07 11月, 2018 1 次提交

Add fp16 backward support (#14202) · a9b5d42d

由 chengduo 提交于 11月 07, 2018

* add fp16 backward support
test=develop

* add sum_op fp16 test

* disable test_dist_save_load
test=develop

* add check_grad for sum

* add unit test for softmax_grad fp16
test=develop

* add scale_op unit test

* add mul_grad_op unit test for fp16

* add cross_entropy_grad and eman_grad unit test for fp16
test=develop

* fix cross_entropy unit test

* add pool2d fp16 unit test

* refine conv2d fp16 unit test
test=develop

* refine activation unit test
test=develop

* fix ci
test=develop

* follow zhihong's comment, copy from https://github.com/PaddlePaddle/Paddle/pull/12796
test=develop

a9b5d42d

17 8月, 2018 1 次提交
- D
  Revert ""cherry picked operators changes" (#12184)" (#12747) · 4069262f
  由 dzhwinter 提交于 8月 17, 2018
```
This reverts commit bf3c3496.
```
  4069262f
16 8月, 2018 1 次提交

"cherry picked operators changes" (#12184) · bf3c3496

由 dzhwinter 提交于 8月 16, 2018

* "cherry picked operators changes"

* "remove duplicated code"

* "add constant setter"

* "add get expected kernel"

* "fix ci"

* "add fill constant"

bf3c3496

14 8月, 2018 1 次提交
- T
  
  Revert "Refine elementwise_add op" · 6a2a9a83
  由 tensor-tang 提交于 8月 14, 2018
  
  6a2a9a83
06 8月, 2018 1 次提交
- S
  
  refine elementwise_add op · b2d0ee51
  由 sneaxiy 提交于 8月 06, 2018
  
  b2d0ee51
20 3月, 2018 2 次提交
- K
  
  rearrange test · 3da094fd
  由 Kexin Zhao 提交于 3月 19, 2018
  
  3da094fd
- K
  
  add fp16 kernel for elementwise add · 4bf168b2
  由 Kexin Zhao 提交于 3月 19, 2018
  
  4bf168b2
12 2月, 2018 1 次提交
- Q
  
  Fix the grammar in copyright. (#8403) · 24509f4a
  由 qingqing01 提交于 2月 12, 2018
  
  24509f4a
10 2月, 2018 2 次提交
- Y
  
  Correct #include path · fc374821
  由 Yi Wang 提交于 2月 09, 2018
  
  fc374821
- Y
  
  Move file to fluid/; Edit CMakeLists.txt · 90648f33
  由 Yi Wang 提交于 2月 09, 2018
  
  90648f33
26 12月, 2017 1 次提交
- L
  
  unify the indentation of license · 761b3297
  由 Luo Tao 提交于 12月 26, 2017
  
  761b3297
12 12月, 2017 1 次提交

Refine device context (#6433) · 61ec0b95

由 QI JUN 提交于 12月 12, 2017

There are mainly following fixes:

- take `DeviceContext` as the template parameter of math functors and OpKernel instead of `Place`
- remove `eigen_device` interface in base class  `DeviceContext`
- remove `GetEigenDevice` interface in `ExecutionContext` and base class `DeviceContext`
- remove unused `platform::EigenDeviceConverter`
- rename `REGISTER_OP_GPU_KERNEL` to `REGISTER_OP_CUDA_KERNEL`
- rename `USE_GPU_ONLY_OP` to `USE_CUDA_ONLY_OP`

61ec0b95

15 11月, 2017 1 次提交
- D
  
  "fix gpu related op registered" (#5647) · 7c3ec220
  由 dzhwinter 提交于 11月 14, 2017
  
  7c3ec220
22 9月, 2017 1 次提交
- G
  Elementwise operator. (#4139) · f99841dd
  由 gongweibao 提交于 9月 22, 2017
```
Elementwise operator add/sub/mul/div
```
  f99841dd
13 9月, 2017 1 次提交
- G
  Add element-wise multiplication operator. (#3787) · 8778957c
  由 gongweibao 提交于 9月 13, 2017
```
Add element-wise multiplication operator
```
  8778957c
24 8月, 2017 1 次提交
- Q
  
  register rowwise add gpu kernel · 12864f14
  由 qiaolongfei 提交于 8月 23, 2017
  
  12864f14
07 8月, 2017 1 次提交
- D
  
  "remove a lot alias" · 610801b5
  由 dongzhihong 提交于 8月 07, 2017
  
  610801b5
04 8月, 2017 2 次提交
- L
  
  Add cpplint for *.h and cuda *.cu · b58725bd
  由 liaogang 提交于 8月 04, 2017
  
  b58725bd
- D
  
  fix op name · 8ff3590e
  由 dongzhihong 提交于 8月 04, 2017
  
  8ff3590e
31 7月, 2017 1 次提交
- Q
  
  add EIGEN_USE_GPU macro to op.cu file · 61f94f00
  由 qijun 提交于 7月 31, 2017
  
  61f94f00
25 7月, 2017 1 次提交
- Y
  Add type_alias to import framework into ops · efc119b4
  由 Yu Yang 提交于 7月 25, 2017
```
Make implement an operator less noisy.
```
  efc119b4
19 7月, 2017 1 次提交
- Q
  
  replace Tensor::tensor to EigenTensor::From · 736d078c
  由 qijun 提交于 7月 19, 2017
  
  736d078c
18 7月, 2017 1 次提交
- Q
  
  implement some basic OpKernel · b6c07552
  由 qijun 提交于 7月 18, 2017
  
  b6c07552
17 7月, 2017 2 次提交
- Y
  
  Fix unittest · 122e83e3
  由 Yu Yang 提交于 7月 17, 2017
  
  122e83e3
- Y
  Add skeletons of `mul`, `rowwise_add`, `sigmoid`, `softmax` ops · 1ed237c1
  由 Yu Yang 提交于 7月 17, 2017
```
* Implement InferShape and register them, give a stub Kernel method
  by LOG(INFO)
```
  1ed237c1

BaiXuePrincess / Paddle 与 Fork 源项目一致

BaiXuePrincess / Paddle
与 Fork 源项目一致