提交 · 0a44cd91df5db634c1467bc7fabf39855cc35c64 · 兽拳 / Paddle

14 1月, 2018 1 次提交

"cudnn operators change to cudnn kernel" (#6660) · 5ad1aef0

由 dzhwinter 提交于 1月 14, 2018

* "unified operators"

* "add CUDNN register"

* "add use cudnn attribute"

* "add attribute"

* "test conv tranpose op"

* "remove duplicated attr"

* "fix op test"

* "add attribute to set cudnn"

* "add more log"

* "need layout op register support"

* "add more log"

* "change GetExpectedKernelType "

* "fix Get attr in conv_op"

* "fix CI"

* "fix tests"

* "removed kernel priority fallback"

* "fix CI"

* "fix stack pointer bug"

* "refine buggy interface"

* "add const cast to save life"

* "fix get_output_with_grad"

* "fix op test with dataformat"

* ""fix pooling

* "fix pooling test"

* "fix CI"

* "fix with_gpu error"

* "add transform needed functional check"

* "fix unpack list error"

* "comment out parallel.do temporary"

* "fix CI"

* "fix compile doc error"

* "make threshold larger"

5ad1aef0

08 1月, 2018 2 次提交

cpu gpu transform function (#7191) · 0f353ab4

由 Qiao Longfei 提交于 1月 08, 2018

* add rename guard

* add device_data_transform

* add device_data_transform_test

* modify GetExpectedKernelType

* update operator.run

* support test test_label_semantic_roles

* optimize code

* optimize code

* rename GetActualKernelType to GetExpectedKernelType

* fix chunk_eval_op and device_data_transform_test

* add is_same_place to place

* optimize code, refine rename_guard

* refine rename guard, add GetKernelTypeForVar

* optimize code

* add some log

* rename guard

* use sub scope to create var

* fix compile

* add IsInitialized for Tensor

* add VarIsTensor

* fix op_registry_test

* test

* tmp disable priority

* restore switch_kernel.md

* code clean

0f353ab4

E
Show argument dimensions with operator::DebugStringEx (#7268) · 8814bec0
由 emailweixu 提交于 1月 07, 2018
```
This can make it easier to locate error.
```
8814bec0

05 1月, 2018 1 次提交

Feature/use cudnn (#7141) · 5593858d

由 dzhwinter 提交于 1月 05, 2018

* "add c++ side kernel selection"

* "add multiple kernel op test"

* "kernel selection only support cudnn"

* "better formatter"

* "small fix with UseCPU"

* "depends on change interface Get(Place, Library)"

* "fix CI"

* "fix python cudnn test"

* "leave the register cudnn op to another PR"

* "fix CI"

* "use all kernel by default"

* "fix CI"

5593858d

25 12月, 2017 3 次提交
- T
  
  fix send recv unit test · 4dde9a00
  由 typhoonzero 提交于 12月 25, 2017
  
  4dde9a00
- Q
  Impl kernel hint (#6883) · af0c4c45
  由 Qiao Longfei 提交于 12月 25, 2017
```
* init kernel hint

* fix typo

* rm unused code

* add include in op_kernel.h

* restore op_kernel since it will be moved to op_kernel_type

* change force_cpu to use_cpu

* fix compilation
```
  af0c4c45
- Q
  
  add op_kernel_type_test · 313afc9c
  由 qiaolongfei 提交于 12月 25, 2017
  
  313afc9c
24 12月, 2017 2 次提交

Q
refine OpKernelType (#6879) · 37e96264
由 QI JUN 提交于 12月 24, 2017
```
* refine OpKernelKey

* refine codes

* fix code style

* follow comments
```
37e96264

Feature/operator run place (#6783) · 735eba29

由 dzhwinter 提交于 12月 24, 2017

* "change operator interface"

* "move devicepool to device_context"

* "fix operator test"

* "fix op_registry Run interface"

* "net op passed. Need to fix nccl multi-Context"

* "add nccl group function"

* "add nccl group function"

* "fix gpu count exceed 32 error"

* "fix recurrent op, nccl op"

* "change the other operators interface with Place"

* "fix typo"

* "fix pybind"

* "fix device in python side"

* "fix pybind failed"

* "add init for test"

* "fix CI"

735eba29

20 12月, 2017 1 次提交
- Y
  Move framework.proto to proto namespace (#6718) · e445b3ff
  由 Yu Yang 提交于 12月 20, 2017
```
* Move framework.proto to proto namespace

* Fix compile

* Fix compile

* Fix Compile
```
  e445b3ff
12 12月, 2017 1 次提交

Refine device context (#6433) · 61ec0b95

由 QI JUN 提交于 12月 12, 2017

There are mainly following fixes:

- take `DeviceContext` as the template parameter of math functors and OpKernel instead of `Place`
- remove `eigen_device` interface in base class  `DeviceContext`
- remove `GetEigenDevice` interface in `ExecutionContext` and base class `DeviceContext`
- remove unused `platform::EigenDeviceConverter`
- rename `REGISTER_OP_GPU_KERNEL` to `REGISTER_OP_CUDA_KERNEL`
- rename `USE_GPU_ONLY_OP` to `USE_CUDA_ONLY_OP`

61ec0b95

08 11月, 2017 1 次提交

Polish OpWithKernel · bbdac7f7

由 Yu Yang 提交于 11月 07, 2017

* Chage `IndicateDataType` to `GetKernelType`. Make it easier to
  understand.
* Change `OpKernelKey` to `OpKernelType`
* Make operator developers can customize which kernel the operator will
  use in runtime.

bbdac7f7

06 11月, 2017 1 次提交
- T
  
  refine get cuda context · 272f3e6d
  由 typhoonzero 提交于 11月 06, 2017
  
  272f3e6d
04 11月, 2017 1 次提交

Add acc test to image classification (#5336) · 906e2565

由 Qiao Longfei 提交于 11月 04, 2017

* add acc layer
* memory log level change from 3 to 10
* use gaussian random to init conv parameters
* use initializer
* fix import
* batch_norm use helper to create persistable var
* refine code
* train only 2 batches for test
* use g_program and g_init_program
* use XavierInitializer to init fc parameter

906e2565

30 10月, 2017 2 次提交

03 image classification (#5192) · 0049ce04

由 Qiao Longfei 提交于 10月 30, 2017

* add batch_norm_layer

* add img_conv_group layer and test

* add check to Tensor.type()

* forward can run

* with backward

* change label data time from int32 to int64

* refine code

* follow comment

0049ce04

D

"polish code based on comment" · 71305e5f
由 dzhwinter 提交于 10月 29, 2017

71305e5f

29 10月, 2017 1 次提交
- Y
  Extract InferShape to many cc files (#5174) · 8f6c0a0f
  由 Yu Yang 提交于 10月 28, 2017
```
* Shrink Operator.h

* Fix CI compile
```
  8f6c0a0f
27 10月, 2017 4 次提交

Y
Make InferShape as a field in OpInfo (#5139) · b44f4ccb
由 Yu Yang 提交于 10月 26, 2017
```
* Op developer can add `InferShape` to any operator
```
b44f4ccb

add sparse support for sum op (#5093) · 7f8574c0

由 QI JUN 提交于 10月 26, 2017

* add sparse support for sum op

* typo fix

* fix gpu build error

* fix unittest error

* typo fix

* infer var type and shape in op_test

* follow comments

* fix build error

* bypass some unittests depend on NetOp

7f8574c0

Gradient check use graph (#5027) · be00b0c4

由 Yu Yang 提交于 10月 26, 2017

* Simplize Gradient Check

* Stash

* Extract apply_backward_pass to backward.py

Rename apply_backward_pass to append_backward_ops

* Use graph API to check gradient

* Fix ci

* Fix CI

* Fix backward for double precision

* Stash

* Fix CI

* Fix ci

* Ignore GRU test

* Ignore xe op

* Fix CI

* Fix softmax with xe gradient

The correct equation should be IG = OG * (d_softmax_with_xe())

* Fix typo

* Fix merge error

* Disable LRN

be00b0c4

D

"fixed based on comment" · 6cce5268
由 Dong Zhihong 提交于 10月 26, 2017

6cce5268

26 10月, 2017 1 次提交
- C
  
  Add pool2d cudnn · 1bb0e294
  由 chengduoZH 提交于 10月 11, 2017
  
  1bb0e294
25 10月, 2017 1 次提交
- D
  
  "redefine the initop from kernel to OpBase" · 63fb41b3
  由 Dong Zhihong 提交于 10月 24, 2017
  
  63fb41b3
24 10月, 2017 2 次提交
- C
  
  fix the computation kernels. · 427644b2
  由 caoying03 提交于 10月 23, 2017
  
  427644b2
- D
  
  "add reduce hash function" · ec47565c
  由 Dong Zhihong 提交于 10月 23, 2017
  
  ec47565c
23 10月, 2017 1 次提交
- Q
  CompileTime InferShape should find var recursively in stack of blocks (#4998) · c91de280
  由 Qiao Longfei 提交于 10月 22, 2017
```
* recursive find var in BlockDesc

* add HasVarRecursive and FindVarRecursive to BlockDesc

* fix FindVarRecursive
```
  c91de280
22 10月, 2017 1 次提交
- Q
  
  fix InferShapeContext Has interface (#4994) · e7f62703
  由 Qiao Longfei 提交于 10月 21, 2017
  
  e7f62703
21 10月, 2017 1 次提交
- Y
  
  Global function, op_support_gpu (#4980) · 86437a8d
  由 Yu Yang 提交于 10月 20, 2017
  
  86437a8d
19 10月, 2017 1 次提交

Add glog as dependencies of ops (#4908) · e9249d16

由 Yu Yang 提交于 10月 18, 2017

* Add glog as dependencies of ops

* Use VLOG to logging some information is helpful when we debug Paddle

* Fix Unittests

e9249d16

15 10月, 2017 1 次提交
- D
  
  " add interface to scopeDesc bind" · 2434b8f5
  由 Dong Zhihong 提交于 10月 14, 2017
  
  2434b8f5
12 10月, 2017 1 次提交

武

Cudnn conv op (#4195) · a3ccbdb3

由武毅提交于 10月 12, 2017

* add cudnn_conv_op

* WIP

* update

* update

* fix grad check

* use platform::memory

* add support group for cudnn

* update

* follow comments

* fix onlycpu build

* update cuda define

* follow comments

* follow comments

* merge with updates

* fix compile error

* follow comments

* follow comments

a3ccbdb3

11 10月, 2017 1 次提交
- Y
  
  dynamic recurrent op forward c++ implentation (#4597) · 843ed8e3
  由 Yan Chunwei 提交于 10月 10, 2017
  
  843ed8e3
07 10月, 2017 3 次提交
- Q
  
  rename InferShapeContextBase to InferShapeContext · c0a34e1c
  由 qiaolongfei 提交于 10月 07, 2017
  
  c0a34e1c
- Q
  
  merge InferShapeContext and ExecutionContext · a0767228
  由 qiaolongfei 提交于 10月 07, 2017
  
  a0767228
- Q
  
  update comment for input/output length check · 4acd5aba
  由 qiaolongfei 提交于 10月 06, 2017
  
  4acd5aba
05 10月, 2017 1 次提交
- Q
  
  tmp work · 5917e09c
  由 qiaolongfei 提交于 10月 04, 2017
  
  5917e09c
04 10月, 2017 1 次提交
- Q
  
  optimize infershape context · 81fc7774
  由 qiaolongfei 提交于 10月 03, 2017
  
  81fc7774
03 10月, 2017 3 次提交
- Q
  
  fix compile problem · 455436e5
  由 qiaolongfei 提交于 10月 02, 2017
  
  455436e5
- Q
  
  add CompileTimeInferShapeContext · d550380e
  由 qiaolongfei 提交于 10月 02, 2017
  
  d550380e
- Q
  
  tmp · 31bdb3f3
  由 qiaolongfei 提交于 10月 02, 2017
  
  31bdb3f3

兽拳 / Paddle 与 Fork 源项目一致

兽拳 / Paddle
与 Fork 源项目一致