提交 · 02c6edc0d5edb5189660001040b27eb483e1dcdb · PaddlePaddle / Paddle

28 9月, 2019 1 次提交
- L
  
  fix conv_grad_grad (#20054) · c92348c3
  由 lvmengsi 提交于 9月 28, 2019
  
  c92348c3
17 9月, 2019 1 次提交
- L
  cpu Conv double grad (#19672) · b76343c3
  由 lvmengsi 提交于 9月 17, 2019
```
* cpu conv_grad_grad
```
  b76343c3
10 5月, 2019 1 次提交

Double backward of conv2d. (#17211) · e32c9888

由 qingqing01 提交于 5月 10, 2019

* Add conv2d_grad_grad_op
* Extracte the cuDNN conv algo searching code in conv_cudnn_helper.h.
    - Now use it in conv2d_grad_grad.
    - Will simply the searching code in conv2d and conv2d_grad in next PR.
* Enhance and fix bug in unit testing of gradient_checker.
* Support to fetch empty variables，return None in Python.

e32c9888

21 1月, 2019 1 次提交

Memory optimization of depthwise conv op and group norm op (#15313) · 9f8f0fc2

由 Dun 提交于 1月 21, 2019

* mem opt

* test=develop

* test=develop

* test=develop

* test=develop

* test=develop

* test=develop

* test=develop

* refine code  test=develop

* refine code  test=develop

* refine code  test=develop

* refine code  test=develop

* refine with cub test=develop

* fix mkldnn test && remove comments && test=develop

* polish code && test=develop

* add only_forward test && test=develop

9f8f0fc2

04 1月, 2019 1 次提交

Enable basic MKL-DNN INT8 Conv OP (#15124) · bbc93368

由 xiaolil1 提交于 1月 04, 2019

* Enable basic MKL-DNN INT8 Conv OP
test=develop

* Modify test case
test=develop

* Clean unittest code
test=develop

* Fix test
test=develop

* Modify test
test=develop

* Modify basic INT8 Conv
test=develop

bbc93368

02 1月, 2019 1 次提交
- X
  hide GetTensor · 9186451f
  由 Xin Pan 提交于 1月 02, 2019
```
test=develop
```
  9186451f
25 12月, 2018 1 次提交

Move GetTensor to tensor_util (#15011) · b9fb03cf

由 chengduo 提交于 12月 25, 2018

* refine tensor
test=develop

* refine tensor
test=develop

* fix device_context log
test=develop

b9fb03cf

21 12月, 2018 1 次提交

[Feature] Add Temporary Allocator (#14875) · 79bd6dfa

由 chengduo 提交于 12月 21, 2018

* Add Temporal Allocator

* add Temporay Allocator to DeviceContext
test=develop

* code refine
test=develop

* fix mean_iou
test=develop

* Add DeviceTemporaryAllocator
test=develop

* fix conv_op bug
test=develop

* small fix
test=develop

* code refine
test=develop

* log refine
test=develop

* fix unit test
test=develop

* move double check

* refine concat_and_split
test=develop

* add limit_of_temporary_allocation
test=develop

* fix name
test=develop

79bd6dfa

05 12月, 2018 1 次提交
- X
  allow customize kernel selection · 41c28d54
  由 Xin Pan 提交于 12月 05, 2018
```
test=develop
```
  41c28d54
19 11月, 2018 1 次提交
- Q
  Convolution fusion operator. (#14449) · fd7e6431
  由 qingqing01 提交于 11月 19, 2018
```
* Convolution fusion operator.
* Clean code
test=develop
```
  fd7e6431
29 9月, 2018 1 次提交

Optimization of Kernels that related to DeepLabv3+ (#13534) · 161c3e31

由 Dun 提交于 9月 29, 2018

* refine reduce by cub
* optimize KernelDepthwiseConvFilterGrad
* optimize depthwise conv and reduce mean and reduce sum
* fix bug: dilation
* cuda arch and cuda 8 compatible

161c3e31

08 5月, 2018 2 次提交

Clean OpProtoAndCheckerMaker · 0e78cb69

由 Yu Yang 提交于 5月 08, 2018

Do not use ctor

* Reduce line of codes.
* We can use virtual function for Maker now.
* The implementation does not care what maker holds, it is easier to
refactor later.

0e78cb69

C

fix MatMul parameter · 187e23a7
由 chengduoZH 提交于 5月 08, 2018

187e23a7

04 5月, 2018 1 次提交
- Y
  
  Clean and extract blas · ef6ea790
  由 Yu Yang 提交于 5月 04, 2018
  
  ef6ea790
03 5月, 2018 1 次提交
- Y
  
  Clean MatMul · 815d8884
  由 Yu Yang 提交于 5月 03, 2018
  
  815d8884
18 4月, 2018 1 次提交
- A
  Fix cpplint issues in Detection_map_op (#9969) · 2d1a6f8d
  由 Abhinav Arora 提交于 4月 17, 2018
```
* Fix conv_op.h

* Fix conv_mkldnn_op

* Fix cpplint issues in detection_map_op
```
  2d1a6f8d
28 2月, 2018 1 次提交
- C
  
  follow comments · a779b424
  由 chengduoZH 提交于 2月 27, 2018
  
  a779b424
16 2月, 2018 2 次提交
- Y
  
  change outputsize func name · cb06337f
  由 Yang Yang 提交于 2月 16, 2018
  
  cb06337f
- Y
  
  pass test_recognize_digits · 1d9fd1c0
  由 Yang Yang 提交于 2月 16, 2018
  
  1d9fd1c0
12 2月, 2018 1 次提交
- Q
  
  Fix the grammar in copyright. (#8403) · 24509f4a
  由 qingqing01 提交于 2月 12, 2018
  
  24509f4a
10 2月, 2018 2 次提交
- Y
  
  Correct #include path · fc374821
  由 Yi Wang 提交于 2月 09, 2018
  
  fc374821
- Y
  
  Move file to fluid/; Edit CMakeLists.txt · 90648f33
  由 Yi Wang 提交于 2月 09, 2018
  
  90648f33
01 2月, 2018 1 次提交
- X
  
  fix comments · 84ded49d
  由 xzl 提交于 2月 01, 2018
  
  84ded49d
23 1月, 2018 1 次提交
- X
  
  ../../../../../paddle/api · 06db7038
  由 xzl 提交于 1月 23, 2018
  
  06db7038
22 1月, 2018 1 次提交
- Z
  
  add depthwise conv forward · 3772d27d
  由 zlx 提交于 1月 22, 2018
  
  3772d27d
15 1月, 2018 2 次提交
- C
  
  set use_cudnn as default · 251c6032
  由 chengduoZH 提交于 1月 15, 2018
  
  251c6032
- C
  
  fix conv, pool, conv_trans to decide use cudnn or not · 79aa5122
  由 chengduoZH 提交于 1月 15, 2018
  
  79aa5122
14 1月, 2018 1 次提交

"cudnn operators change to cudnn kernel" (#6660) · 5ad1aef0

由 dzhwinter 提交于 1月 14, 2018

* "unified operators"

* "add CUDNN register"

* "add use cudnn attribute"

* "add attribute"

* "test conv tranpose op"

* "remove duplicated attr"

* "fix op test"

* "add attribute to set cudnn"

* "add more log"

* "need layout op register support"

* "add more log"

* "change GetExpectedKernelType "

* "fix Get attr in conv_op"

* "fix CI"

* "fix tests"

* "removed kernel priority fallback"

* "fix CI"

* "fix stack pointer bug"

* "refine buggy interface"

* "add const cast to save life"

* "fix get_output_with_grad"

* "fix op test with dataformat"

* ""fix pooling

* "fix pooling test"

* "fix CI"

* "fix with_gpu error"

* "add transform needed functional check"

* "fix unpack list error"

* "comment out parallel.do temporary"

* "fix CI"

* "fix compile doc error"

* "make threshold larger"

5ad1aef0

08 1月, 2018 1 次提交

cpu gpu transform function (#7191) · 0f353ab4

由 Qiao Longfei 提交于 1月 08, 2018

* add rename guard

* add device_data_transform

* add device_data_transform_test

* modify GetExpectedKernelType

* update operator.run

* support test test_label_semantic_roles

* optimize code

* optimize code

* rename GetActualKernelType to GetExpectedKernelType

* fix chunk_eval_op and device_data_transform_test

* add is_same_place to place

* optimize code, refine rename_guard

* refine rename guard, add GetKernelTypeForVar

* optimize code

* add some log

* rename guard

* use sub scope to create var

* fix compile

* add IsInitialized for Tensor

* add VarIsTensor

* fix op_registry_test

* test

* tmp disable priority

* restore switch_kernel.md

* code clean

0f353ab4

05 1月, 2018 1 次提交

Feature/use cudnn (#7141) · 5593858d

由 dzhwinter 提交于 1月 05, 2018

* "add c++ side kernel selection"

* "add multiple kernel op test"

* "kernel selection only support cudnn"

* "better formatter"

* "small fix with UseCPU"

* "depends on change interface Get(Place, Library)"

* "fix CI"

* "fix python cudnn test"

* "leave the register cudnn op to another PR"

* "fix CI"

* "use all kernel by default"

* "fix CI"

5593858d

20 12月, 2017 1 次提交
- Y
  Move framework.proto to proto namespace (#6718) · e445b3ff
  由 Yu Yang 提交于 12月 20, 2017
```
* Move framework.proto to proto namespace

* Fix compile

* Fix compile

* Fix Compile
```
  e445b3ff
12 12月, 2017 1 次提交

Refine device context (#6433) · 61ec0b95

由 QI JUN 提交于 12月 12, 2017

There are mainly following fixes:

- take `DeviceContext` as the template parameter of math functors and OpKernel instead of `Place`
- remove `eigen_device` interface in base class  `DeviceContext`
- remove `GetEigenDevice` interface in `ExecutionContext` and base class `DeviceContext`
- remove unused `platform::EigenDeviceConverter`
- rename `REGISTER_OP_GPU_KERNEL` to `REGISTER_OP_CUDA_KERNEL`
- rename `USE_GPU_ONLY_OP` to `USE_CUDA_ONLY_OP`

61ec0b95

11 12月, 2017 1 次提交
- C
  
  refine conv · a6ef8758
  由 chengduoZH 提交于 12月 11, 2017
  
  a6ef8758
22 11月, 2017 1 次提交
- C
  
  refine code · a93227a1
  由 chengduoZH 提交于 11月 22, 2017
  
  a93227a1
21 11月, 2017 1 次提交
- C
  
  remove vector::eraze · e5bf9c56
  由 chengduoZH 提交于 11月 21, 2017
  
  e5bf9c56
15 11月, 2017 1 次提交
- C
  
  follow comments · 356d6954
  由 chengduoZH 提交于 11月 14, 2017
  
  356d6954
10 11月, 2017 1 次提交
- C
  
  Add dilation for vol2col · 271fc9c1
  由 chengduoZH 提交于 11月 10, 2017
  
  271fc9c1
09 11月, 2017 1 次提交
- C
  
  refine conv2d for filter size:(1,1) · 21ce7042
  由 chengduoZH 提交于 11月 09, 2017
  
  21ce7042
08 11月, 2017 1 次提交
- C
  
  add dilation for im2col · 97e9dd72
  由 chengduoZH 提交于 11月 08, 2017
  
  97e9dd72
06 11月, 2017 1 次提交
- C
  
  write conv2d and conv3d together · f302c6a3
  由 chengduoZH 提交于 11月 06, 2017
  
  f302c6a3

PaddlePaddle / Paddle 大约 1 年 前同步成功

PaddlePaddle / Paddle
大约 1 年前同步成功