提交 · d497bd9079184fac0ec781475efa83ae1967e760 · BaiXuePrincess / Paddle

26 2月, 2019 1 次提交

Optimize the CUDA implementation of sequence_expand op by reduce the times of... · f4634d76

由 Yiqun Liu 提交于 2月 26, 2019

Optimize the CUDA implementation of sequence_expand op by reduce the times of copying lod data from CPU to GPU. (#15493)

* Optimize the CUDA implementation of sequence_expand op by reduce the times of copying lod data from CPU to GPU.
test=develop

* Refine the op benchmark to support setting lod in config.
test=develop

f4634d76

16 11月, 2018 1 次提交

Refine operator cmake (#14413) · a2d9b344

由 Wu Yi 提交于 11月 16, 2018

* wip simplify operator framework

* wip

* wip

* done test=develop

* clean test=develop

* fix test=develop

* fix deps test=develop

* fix cpu build test=develop

* fix tensorrt build test=develop

* fix tests test=develop

* fix test=develop

* fix cpu build test=develop

a2d9b344

30 4月, 2018 1 次提交
- D
  Feature/cuda9 cudnn7 (#10140) · eb6f9dd5
  由 dzhwinter 提交于 4月 30, 2018
```
* "re-commit "

* "picked up"

* "fix ci"

* "fix pdb hang up issue in cuda 9"
```
  eb6f9dd5
11 4月, 2018 2 次提交
- D
  
  "done" · 62d1f9a7
  由 dzhwinter 提交于 4月 11, 2018
  
  62d1f9a7
- D
  
  "fix the style" · 80bd1ca0
  由 dzhwinter 提交于 4月 11, 2018
  
  80bd1ca0
30 3月, 2018 1 次提交
- D
  
  "fix based on comment" · fbdb5b7b
  由 dzhwinter 提交于 3月 29, 2018
  
  fbdb5b7b
28 3月, 2018 2 次提交
- D
  
  "fix ci" · 0412f5e0
  由 dzhwinter 提交于 3月 28, 2018
  
  0412f5e0
- D
  
  "fix ci" · 0be1e09f
  由 dzhwinter 提交于 3月 28, 2018
  
  0be1e09f
21 3月, 2018 1 次提交
- D
  
  "debug the process" · 53c8c36a
  由 dzhwinter 提交于 3月 21, 2018
  
  53c8c36a
20 3月, 2018 3 次提交
- D
  
  "add details" · e4c35d83
  由 dzhwinter 提交于 3月 20, 2018
  
  e4c35d83
- D
  
  "add sequence kernel" · 26822bd7
  由 dzhwinter 提交于 3月 20, 2018
  
  26822bd7
- D
  
  "add sequence expand kernel" · 4ee1c9e6
  由 dzhwinter 提交于 3月 19, 2018
  
  4ee1c9e6
15 3月, 2018 1 次提交
- Y
  
  Finish adapting forward. · 352fa41a
  由 yangyaming 提交于 3月 15, 2018
  
  352fa41a
12 2月, 2018 1 次提交
- Q
  
  Fix the grammar in copyright. (#8403) · 24509f4a
  由 qingqing01 提交于 2月 12, 2018
  
  24509f4a
10 2月, 2018 2 次提交
- Y
  
  Correct #include path · fc374821
  由 Yi Wang 提交于 2月 09, 2018
  
  fc374821
- Y
  
  Move file to fluid/; Edit CMakeLists.txt · 90648f33
  由 Yi Wang 提交于 2月 09, 2018
  
  90648f33
26 12月, 2017 1 次提交
- L
  
  unify the indentation of license · 761b3297
  由 Luo Tao 提交于 12月 26, 2017
  
  761b3297
18 12月, 2017 1 次提交
- W
  
  rename seq to sequence · c30bc561
  由 wanghaoshuang 提交于 12月 18, 2017
  
  c30bc561
12 12月, 2017 1 次提交

Refine device context (#6433) · 61ec0b95

由 QI JUN 提交于 12月 12, 2017

There are mainly following fixes:

- take `DeviceContext` as the template parameter of math functors and OpKernel instead of `Place`
- remove `eigen_device` interface in base class  `DeviceContext`
- remove `GetEigenDevice` interface in `ExecutionContext` and base class `DeviceContext`
- remove unused `platform::EigenDeviceConverter`
- rename `REGISTER_OP_GPU_KERNEL` to `REGISTER_OP_CUDA_KERNEL`
- rename `USE_GPU_ONLY_OP` to `USE_CUDA_ONLY_OP`

61ec0b95

13 9月, 2017 1 次提交
- Y
  
  Implemented by boost preprocessor. · ad5e7cc0
  由 yangyaming 提交于 9月 13, 2017
  
  ad5e7cc0
01 9月, 2017 1 次提交
- X
  
  Add cos_sim op. · ed72af48
  由 Xinghai Sun 提交于 9月 01, 2017
  
  ed72af48
08 8月, 2017 1 次提交
- D
  
  "fix clang format" · 22f03c39
  由 dongzhihong 提交于 8月 08, 2017
  
  22f03c39
07 8月, 2017 1 次提交
- D
  
  "remove a lot alias" · 610801b5
  由 dongzhihong 提交于 8月 07, 2017
  
  610801b5
04 8月, 2017 1 次提交
- L
  
  Add cpplint for *.h and cuda *.cu · b58725bd
  由 liaogang 提交于 8月 04, 2017
  
  b58725bd
02 8月, 2017 1 次提交
- D
  
  Add sigmoid backward implenmention. · 0560733c
  由 dangqingqing 提交于 8月 02, 2017
  
  0560733c
31 7月, 2017 1 次提交
- Q
  
  add EIGEN_USE_GPU macro to op.cu file · 61f94f00
  由 qijun 提交于 7月 31, 2017
  
  61f94f00
25 7月, 2017 1 次提交
- Y
  Add type_alias to import framework into ops · efc119b4
  由 Yu Yang 提交于 7月 25, 2017
```
Make implement an operator less noisy.
```
  efc119b4
19 7月, 2017 1 次提交
- Q
  
  replace Tensor::tensor to EigenTensor::From · 736d078c
  由 qijun 提交于 7月 19, 2017
  
  736d078c
18 7月, 2017 1 次提交
- Q
  
  implement some basic OpKernel · b6c07552
  由 qijun 提交于 7月 18, 2017
  
  b6c07552
17 7月, 2017 1 次提交
- Y
  Add skeletons of `mul`, `rowwise_add`, `sigmoid`, `softmax` ops · 1ed237c1
  由 Yu Yang 提交于 7月 17, 2017
```
* Implement InferShape and register them, give a stub Kernel method
  by LOG(INFO)
```
  1ed237c1

BaiXuePrincess / Paddle 与 Fork 源项目一致

BaiXuePrincess / Paddle
与 Fork 源项目一致