提交 · d7bd0361cb36587c07f1edf973672fd24e67e720 · BaiXuePrincess / Paddle

29 9月, 2018 1 次提交

Optimization of Kernels that related to DeepLabv3+ (#13534) · 161c3e31

由 Dun 提交于 9月 29, 2018

* refine reduce by cub
* optimize KernelDepthwiseConvFilterGrad
* optimize depthwise conv and reduce mean and reduce sum
* fix bug: dilation
* cuda arch and cuda 8 compatible

161c3e31

08 5月, 2018 2 次提交

Clean OpProtoAndCheckerMaker · 0e78cb69

由 Yu Yang 提交于 5月 08, 2018

Do not use ctor

* Reduce line of codes.
* We can use virtual function for Maker now.
* The implementation does not care what maker holds, it is easier to
refactor later.

0e78cb69

C

fix MatMul parameter · 187e23a7
由 chengduoZH 提交于 5月 08, 2018

187e23a7

04 5月, 2018 1 次提交
- Y
  
  Clean and extract blas · ef6ea790
  由 Yu Yang 提交于 5月 04, 2018
  
  ef6ea790
03 5月, 2018 1 次提交
- Y
  
  Clean MatMul · 815d8884
  由 Yu Yang 提交于 5月 03, 2018
  
  815d8884
18 4月, 2018 1 次提交
- A
  Fix cpplint issues in Detection_map_op (#9969) · 2d1a6f8d
  由 Abhinav Arora 提交于 4月 17, 2018
```
* Fix conv_op.h

* Fix conv_mkldnn_op

* Fix cpplint issues in detection_map_op
```
  2d1a6f8d
28 2月, 2018 1 次提交
- C
  
  follow comments · a779b424
  由 chengduoZH 提交于 2月 27, 2018
  
  a779b424
16 2月, 2018 2 次提交
- Y
  
  change outputsize func name · cb06337f
  由 Yang Yang 提交于 2月 16, 2018
  
  cb06337f
- Y
  
  pass test_recognize_digits · 1d9fd1c0
  由 Yang Yang 提交于 2月 16, 2018
  
  1d9fd1c0
12 2月, 2018 1 次提交
- Q
  
  Fix the grammar in copyright. (#8403) · 24509f4a
  由 qingqing01 提交于 2月 12, 2018
  
  24509f4a
10 2月, 2018 2 次提交
- Y
  
  Correct #include path · fc374821
  由 Yi Wang 提交于 2月 09, 2018
  
  fc374821
- Y
  
  Move file to fluid/; Edit CMakeLists.txt · 90648f33
  由 Yi Wang 提交于 2月 09, 2018
  
  90648f33
01 2月, 2018 1 次提交
- X
  
  fix comments · 84ded49d
  由 xzl 提交于 2月 01, 2018
  
  84ded49d
23 1月, 2018 1 次提交
- X
  
  ../../../../../paddle/api · 06db7038
  由 xzl 提交于 1月 23, 2018
  
  06db7038
22 1月, 2018 1 次提交
- Z
  
  add depthwise conv forward · 3772d27d
  由 zlx 提交于 1月 22, 2018
  
  3772d27d
15 1月, 2018 2 次提交
- C
  
  set use_cudnn as default · 251c6032
  由 chengduoZH 提交于 1月 15, 2018
  
  251c6032
- C
  
  fix conv, pool, conv_trans to decide use cudnn or not · 79aa5122
  由 chengduoZH 提交于 1月 15, 2018
  
  79aa5122
14 1月, 2018 1 次提交

"cudnn operators change to cudnn kernel" (#6660) · 5ad1aef0

由 dzhwinter 提交于 1月 14, 2018

* "unified operators"

* "add CUDNN register"

* "add use cudnn attribute"

* "add attribute"

* "test conv tranpose op"

* "remove duplicated attr"

* "fix op test"

* "add attribute to set cudnn"

* "add more log"

* "need layout op register support"

* "add more log"

* "change GetExpectedKernelType "

* "fix Get attr in conv_op"

* "fix CI"

* "fix tests"

* "removed kernel priority fallback"

* "fix CI"

* "fix stack pointer bug"

* "refine buggy interface"

* "add const cast to save life"

* "fix get_output_with_grad"

* "fix op test with dataformat"

* ""fix pooling

* "fix pooling test"

* "fix CI"

* "fix with_gpu error"

* "add transform needed functional check"

* "fix unpack list error"

* "comment out parallel.do temporary"

* "fix CI"

* "fix compile doc error"

* "make threshold larger"

5ad1aef0

08 1月, 2018 1 次提交

cpu gpu transform function (#7191) · 0f353ab4

由 Qiao Longfei 提交于 1月 08, 2018

* add rename guard

* add device_data_transform

* add device_data_transform_test

* modify GetExpectedKernelType

* update operator.run

* support test test_label_semantic_roles

* optimize code

* optimize code

* rename GetActualKernelType to GetExpectedKernelType

* fix chunk_eval_op and device_data_transform_test

* add is_same_place to place

* optimize code, refine rename_guard

* refine rename guard, add GetKernelTypeForVar

* optimize code

* add some log

* rename guard

* use sub scope to create var

* fix compile

* add IsInitialized for Tensor

* add VarIsTensor

* fix op_registry_test

* test

* tmp disable priority

* restore switch_kernel.md

* code clean

0f353ab4

05 1月, 2018 1 次提交

Feature/use cudnn (#7141) · 5593858d

由 dzhwinter 提交于 1月 05, 2018

* "add c++ side kernel selection"

* "add multiple kernel op test"

* "kernel selection only support cudnn"

* "better formatter"

* "small fix with UseCPU"

* "depends on change interface Get(Place, Library)"

* "fix CI"

* "fix python cudnn test"

* "leave the register cudnn op to another PR"

* "fix CI"

* "use all kernel by default"

* "fix CI"

5593858d

20 12月, 2017 1 次提交
- Y
  Move framework.proto to proto namespace (#6718) · e445b3ff
  由 Yu Yang 提交于 12月 20, 2017
```
* Move framework.proto to proto namespace

* Fix compile

* Fix compile

* Fix Compile
```
  e445b3ff
12 12月, 2017 1 次提交

Refine device context (#6433) · 61ec0b95

由 QI JUN 提交于 12月 12, 2017

There are mainly following fixes:

- take `DeviceContext` as the template parameter of math functors and OpKernel instead of `Place`
- remove `eigen_device` interface in base class  `DeviceContext`
- remove `GetEigenDevice` interface in `ExecutionContext` and base class `DeviceContext`
- remove unused `platform::EigenDeviceConverter`
- rename `REGISTER_OP_GPU_KERNEL` to `REGISTER_OP_CUDA_KERNEL`
- rename `USE_GPU_ONLY_OP` to `USE_CUDA_ONLY_OP`

61ec0b95

11 12月, 2017 1 次提交
- C
  
  refine conv · a6ef8758
  由 chengduoZH 提交于 12月 11, 2017
  
  a6ef8758
22 11月, 2017 1 次提交
- C
  
  refine code · a93227a1
  由 chengduoZH 提交于 11月 22, 2017
  
  a93227a1
21 11月, 2017 1 次提交
- C
  
  remove vector::eraze · e5bf9c56
  由 chengduoZH 提交于 11月 21, 2017
  
  e5bf9c56
15 11月, 2017 1 次提交
- C
  
  follow comments · 356d6954
  由 chengduoZH 提交于 11月 14, 2017
  
  356d6954
10 11月, 2017 1 次提交
- C
  
  Add dilation for vol2col · 271fc9c1
  由 chengduoZH 提交于 11月 10, 2017
  
  271fc9c1
09 11月, 2017 1 次提交
- C
  
  refine conv2d for filter size:(1,1) · 21ce7042
  由 chengduoZH 提交于 11月 09, 2017
  
  21ce7042
08 11月, 2017 1 次提交
- C
  
  add dilation for im2col · 97e9dd72
  由 chengduoZH 提交于 11月 08, 2017
  
  97e9dd72
06 11月, 2017 1 次提交
- C
  
  write conv2d and conv3d together · f302c6a3
  由 chengduoZH 提交于 11月 06, 2017
  
  f302c6a3
30 10月, 2017 1 次提交
- C
  
  fix code format and doc · 17248153
  由 chengduoZH 提交于 10月 30, 2017
  
  17248153
26 10月, 2017 1 次提交
- C
  
  write conv2d and conv3d together · eafbbc11
  由 chengduoZH 提交于 10月 26, 2017
  
  eafbbc11
21 10月, 2017 1 次提交
- C
  
  add padding up, down, left, right · dc7d0735
  由 chengduoZH 提交于 10月 21, 2017
  
  dc7d0735
20 10月, 2017 1 次提交

Remove template parameter for Tensor methods (#4937) · c532b967

由 Yu Yang 提交于 10月 19, 2017

* Remove template parameter for Tensor methods

* Also check the type is correct when data()
* Simplize holder_

* Fix accuracy_op

* Register Code

c532b967

17 10月, 2017 1 次提交
- Y
  Correct OpWithKernel's infershape (#4847) · 73a8b78a
  由 Yu Yang 提交于 10月 16, 2017
```
They are public now
```
  73a8b78a
12 10月, 2017 1 次提交

武

Cudnn conv op (#4195) · a3ccbdb3

由武毅提交于 10月 12, 2017

* add cudnn_conv_op

* WIP

* update

* update

* fix grad check

* use platform::memory

* add support group for cudnn

* update

* follow comments

* fix onlycpu build

* update cuda define

* follow comments

* follow comments

* merge with updates

* fix compile error

* follow comments

* follow comments

a3ccbdb3

28 9月, 2017 1 次提交
- Y
  
  Add Skeleton of Double support · 3a5693e0
  由 Yu Yang 提交于 9月 27, 2017
  
  3a5693e0
21 9月, 2017 3 次提交
- H
  
  Refine code. · c42e2049
  由 hedaoyuan 提交于 9月 21, 2017
  
  c42e2049
- H
  
  Bug fix. · bb546cf1
  由 hedaoyuan 提交于 9月 21, 2017
  
  bb546cf1
- H
  
  Bug fix for get device_context. · 659f2f71
  由 hedaoyuan 提交于 9月 21, 2017
  
  659f2f71

BaiXuePrincess / Paddle 与 Fork 源项目一致

BaiXuePrincess / Paddle
与 Fork 源项目一致