Commits · 90648f336d0a73630d0a862259a4f73ab3c9fe8c · Crayon鑫 / Paddle

10 Feb, 2018 · 1 commit
- Move file to fluid/; Edit CMakeLists.txt · 90648f33
  Committed by Yi Wang on Feb 09, 2018
  
  90648f33
14 Jan, 2018 · 1 commit

"cudnn operators change to cudnn kernel" (#6660) · 5ad1aef0

Committed by dzhwinter on Jan 14, 2018

* "unified operators"

* "add CUDNN register"

* "add use cudnn attribute"

* "add attribute"

* "test conv transpose op"

* "remove duplicated attr"

* "fix op test"

* "add attribute to set cudnn"

* "add more log"

* "need layout op register support"

* "add more log"

* "change GetExpectedKernelType"

* "fix Get attr in conv_op"

* "fix CI"

* "fix tests"

* "removed kernel priority fallback"

* "fix CI"

* "fix stack pointer bug"

* "refine buggy interface"

* "add const cast to save life"

* "fix get_output_with_grad"

* "fix op test with dataformat"

* "fix pooling"

* "fix pooling test"

* "fix CI"

* "fix with_gpu error"

* "add transform needed functional check"

* "fix unpack list error"

* "comment out parallel.do temporarily"

* "fix CI"

* "fix compile doc error"

* "make threshold larger"

5ad1aef0

08 Jan, 2018 · 2 commits

add back priority · ca90356b
Committed by qiaolongfei on Jan 08, 2018

ca90356b

cpu gpu transform function (#7191) · 0f353ab4

Committed by Qiao Longfei on Jan 08, 2018

* add rename guard

* add device_data_transform

* add device_data_transform_test

* modify GetExpectedKernelType

* update operator.run

* support test test_label_semantic_roles

* optimize code

* optimize code

* rename GetActualKernelType to GetExpectedKernelType

* fix chunk_eval_op and device_data_transform_test

* add is_same_place to place

* optimize code, refine rename_guard

* refine rename guard, add GetKernelTypeForVar

* optimize code

* add some log

* rename guard

* use sub scope to create var

* fix compile

* add IsInitialized for Tensor

* add VarIsTensor

* fix op_registry_test

* test

* tmp disable priority

* restore switch_kernel.md

* code clean

0f353ab4

05 Jan, 2018 · 1 commit

Feature/use cudnn (#7141) · 5593858d

Committed by dzhwinter on Jan 05, 2018

* "add c++ side kernel selection"

* "add multiple kernel op test"

* "kernel selection only support cudnn"

* "better formatter"

* "small fix with UseCPU"

* "depends on change interface Get(Place, Library)"

* "fix CI"

* "fix python cudnn test"

* "leave the register cudnn op to another PR"

* "fix CI"

* "use all kernel by default"

* "fix CI"

5593858d

27 Dec, 2017 · 1 commit

"refine kernel registrar" (#6998) · 35c1683e

Committed by dzhwinter on Dec 27, 2017

* "refine kernel registrar"

* "refine registrar with multikey"

* "fix register"

* "refine multikernel register"

* "fix CI"

* "fix CI"

* "fix registry"

* "switch GPU to CUDA"

* "add register macro test case"

* "fix CI"

35c1683e

24 Dec, 2017 · 1 commit

Feature/operator run place (#6783) · 735eba29

Committed by dzhwinter on Dec 24, 2017

* "change operator interface"

* "move devicepool to device_context"

* "fix operator test"

* "fix op_registry Run interface"

* "net op passed. Need to fix nccl multi-Context"

* "add nccl group function"

* "add nccl group function"

* "fix gpu count exceed 32 error"

* "fix recurrent op, nccl op"

* "change the other operators interface with Place"

* "fix typo"

* "fix pybind"

* "fix device in python side"

* "fix pybind failed"

* "add init for test"

* "fix CI"

735eba29

20 Dec, 2017 · 1 commit
- Move framework.proto to proto namespace (#6718) · e445b3ff
  Committed by Yu Yang on Dec 20, 2017
```
* Move framework.proto to proto namespace

* Fix compile

* Fix compile

* Fix Compile
```
  e445b3ff
01 Nov, 2017 · 1 commit

Feature/executor use program bind (#5196) · 1363ddb6

Committed by Yu Yang on Oct 31, 2017

* Init commit

* Make executor use ProgramDescBind

* Change Attribute from BlockDesc to BlockDescBind

* Since we will get the program desc in RNN, just BlockDesc is not
  enough.

1363ddb6

19 Oct, 2017 · 1 commit

Change ProgramDesc not a global variable (#4879) · e747623e

Committed by Yu Yang on Oct 18, 2017

* Change ProgramDesc not a global variable

* Polish code style

* Correct implement BlockDesc destructor

* Unify program as parameter name

e747623e

05 Oct, 2017 · 1 commit
- Use PADDLE_WITH_CUDA instead of PADDLE_WITH_GPU · 4558807c
  Committed by Yi Wang on Oct 04, 2017
  
  4558807c
01 Oct, 2017 · 1 commit
- Add GradOpDescMaker to OpInfo and complete OperatorRegistrar method · d9e3c4ff
  Committed by Yu Yang on Sep 30, 2017
  
  d9e3c4ff
28 Sep, 2017 · 1 commit

Remove OperatorBase::InferShape · 61962094

Committed by Yu Yang on Sep 27, 2017

InferShape in Operator should be performed in OperatorBase::Run.

* cond_op, recurrent_op and mnist might be changed in following PR

61962094

27 Sep, 2017 · 1 commit

Refactoring InferShape (#3946) · 9a9d50a6

Committed by Qiao Longfei on Sep 26, 2017

* init Infershape

* add static InferShape interface

* refactor add-op infershape

* add AttrReader

* add all maker's infershape

* add all InferShape

* add python infer api

* add VarDesc interface

* add python VarDesc and OpDesc interface

* update python code

* use infershape function to do shape inference

* clean code

* do not use pointer

* refine code of op_proto_maker

* add get_dims to VarDesc

* refine the code

* remove the dependency from operator to op registry

* remove OpProtoAndCheckerMaker from operator

* restore complete_add_op

* add shape_infer_impl.h

* code optimization

* remove const return value

* add fake BlockDesc class

* optimize code

* remove infer function in op_info

* move InferShapeContextImpl to operator.h

* optimize the interface of InferShapeContextBase

* add temporary interface of new infershape

* change add_op, clip_op, conv2d_op and activation_op

* change all operators InferShape

* fix SetDim

* update cos_sim_op

* update crop_op

* update lookup_table_op

* allocate tensor when call GetDim in InferShapeContext

* update modified_huber_loss_op

* update rowwise_add_op

* update mean_op

* update sequence_avg_pool_op

* typo

* remove old InferShape interface

* can compile

* fix or unit test

* clean code

* clean code

* remove const before InferShapeContext

* change InferenceContextBase to pointer

* rename RunTime to Runtime, code clean

9a9d50a6

07 Sep, 2017 · 1 commit
- Rename `LargerThan` to `GreaterThan` · 1f0341e1
  Committed by fengjiayi on Sep 06, 2017
  
  1f0341e1
06 Sep, 2017 · 2 commits
- Change `Op::GetAttr` to `Op::Attr` · 9de6a4b3
  Committed by Yu Yang on Sep 05, 2017
```
Fix #3902
```
  9de6a4b3
- Move two tests from `op_registry_test` to `operator_test` · bc0f9495
  Committed by fengjiayi on Sep 05, 2017
```
1. TEST(ProtoMaker, DuplicatedAttr)
2. TEST(ProtoMaker, DuplicatedInOut)
```
  bc0f9495
16 Aug, 2017 · 1 commit
- Complete remove std::shared_ptr · 8c653ba7
  Committed by Yu Yang on Aug 16, 2017
  
  8c653ba7
14 Aug, 2017 · 3 commits
- Follow comments from WangYi · f09cb657
  Committed by Yu Yang on Aug 14, 2017
  
  f09cb657
- Simplify unit test code · ef29b522
  Committed by Yu Yang on Aug 14, 2017
  
  ef29b522
- Polish Our code by YuYang's review · 4a604c26
  Committed by Yu Yang on Aug 14, 2017
  
  4a604c26
12 Aug, 2017 · 5 commits
- Remove empty constructor for operator · 11c35605
  Committed by Yu Yang on Aug 12, 2017
  
  11c35605
- Get `DEFINE_OPERATOR_CTOR` Back to code · 0b1052fc
  Committed by Yu Yang on Aug 12, 2017
  
  0b1052fc
- Update · 65bd7c77
  Committed by Yi Wang on Aug 11, 2017
  
  65bd7c77
- Refine macro · f784741d
  Committed by fengjiayi on Aug 11, 2017
  
  f784741d
- Add constructors to OperatorBase and all sub-classes · f83876a0
  Committed by Yi Wang on Aug 11, 2017
  
  f83876a0
09 Aug, 2017 · 2 commits
- Update grad_op_builder after refactoring framework proto. · 665e1a33
  Committed by qingqing01 on Aug 09, 2017
  
  665e1a33
- Rename op_proto_name/var_names -> parameter/arguments · b368c6ca
  Committed by Yu Yang on Aug 09, 2017
  
  b368c6ca
08 Aug, 2017 · 1 commit
- Make Compile Pass · dba618c0
  Committed by Yu Yang on Aug 08, 2017
```
* backward_test/rnn_test do not pass yet, so they are commented out for now.
```
  dba618c0
01 Aug, 2017 · 1 commit
- Refine remove std::shared_ptr in Scope · 5d134a03
  Committed by Yu Yang on Aug 01, 2017
```
* Make interface of Operator to `const Scope&`
```
  5d134a03
26 Jul, 2017 · 1 commit
- Refine OpRegistry::AddInput/AddOutput · 00615ebc
  Committed by Yu Yang on Jul 26, 2017
```
Remove bool argument, use a class to handle that.
```
  00615ebc
25 Jul, 2017 · 1 commit
- Fix unittest · bc09551e
  Committed by Yu Yang on Jul 25, 2017
  
  bc09551e
24 Jul, 2017 · 1 commit

Remove ScopePtr and OperatorPtr · c2543f5b

Committed by Yu Yang on Jul 24, 2017

* ScopePtr means a pointer to a scope, but the name does not say whether it is shared or unique.
Change it to std::shared_ptr<Scope> to make the code easier to read.

c2543f5b
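
The readability argument in this commit message can be illustrated with a minimal sketch. `Scope` here is a hypothetical stand-in with a single map member, and `CountVars` is an illustrative helper; neither is taken from the Paddle sources.

```cpp
#include <cassert>
#include <memory>
#include <string>
#include <unordered_map>

// Hypothetical stand-in for Paddle's Scope; the real class is more involved.
struct Scope {
  std::unordered_map<std::string, int> vars;
};

// Before: the alias hides whether ownership is shared or unique.
using ScopePtr = std::shared_ptr<Scope>;

// After: the ownership model is spelled out in the signature itself.
// CountVars is an illustrative helper, not a Paddle function.
int CountVars(const std::shared_ptr<Scope>& scope) {
  return static_cast<int>(scope->vars.size());
}
```

A caller would build the scope with `std::make_shared<Scope>()` and pass it directly; the alias and the spelled-out type are interchangeable to the compiler, so the change is purely about what the reader sees.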

17 Jul, 2017 · 1 commit
- check duplicate of ProtoAndCheckerMaker (#2903) · 80a26a63
  Committed by Qiao Longfei on Jul 17, 2017
  
  80a26a63
15 Jul, 2017 · 1 commit
- ENH: unify PADDLE_ENFORCE · f812de2c
  Committed by liaogang on Jul 15, 2017
  
  f812de2c
14 Jul, 2017 · 2 commits

Optimize ptr (#2851) · 58f3de95

Committed by Qiao Longfei on Jul 14, 2017

* use OperatorPtr = std::shared_ptr<OperatorBase>;
* use ScopePtr = std::shared_ptr<Scope>;

58f3de95

Let OpProto support multiple and temporary (#2860) · 2462d0c5

Committed by Yu Yang on Jul 14, 2017

* Let OpProto support multiple and temporary

* Each input/output of a Paddle Op can be a list. Add a "multiple" mark to
  OpProto. Also add an `input_format`/`output_format` attribute if the
  Op has multiple inputs or outputs. For the format of that attribute,
  see the comments in `op_proto.proto`.
* Add a "temporary" mark, because some outputs of an Op are not used by the user
  but are used by other ops for faster computation. Explicitly marking which
  outputs are temporary enables future memory/computation optimization.
* Add a "generated" field to AttrProto.

* Add `AddInputs`/`AddOutputs` function

* It is more readable to invoke `AddInputs` than
  `AddInput(multiple=true)`.

2462d0c5
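
The call-site difference described in this commit can be sketched roughly as follows. `ProtoMakerSketch` and `VarSketch` are hypothetical names invented for illustration; they do not reproduce Paddle's actual proto maker, and only the `AddInput`/`AddInputs` method names come from the commit message.

```cpp
#include <cassert>
#include <string>
#include <vector>

// Hypothetical, simplified record of one declared operator input.
struct VarSketch {
  std::string name;
  bool multiple;  // true when the input is a list of variables
};

// Hypothetical stand-in for the proto maker; not Paddle's actual class.
class ProtoMakerSketch {
 public:
  // Declares a single-valued input.
  void AddInput(const std::string& name) { inputs_.push_back({name, false}); }

  // Declares a list-valued input. The intent is visible in the method name
  // instead of being carried by a boolean argument.
  void AddInputs(const std::string& name) { inputs_.push_back({name, true}); }

  const std::vector<VarSketch>& inputs() const { return inputs_; }

 private:
  std::vector<VarSketch> inputs_;
};
```

With this shape, `maker.AddInputs("Xs")` replaces a call such as `maker.AddInput("Xs", true)`, where the meaning of the bare `true` would have to be looked up.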

13 Jul, 2017 · 2 commits

Follow comments · 79b70c2d

Committed by Yu Yang on Jul 13, 2017

* Convert `op` --> `operators`
* Remove AddType in OpProtoMaker, because type is part of registry.
* Rename CPU_OR_GPU --> DEVICE_TYPE in registry macro.

79b70c2d

Add a sample op, `add_op` · a0aaafe9

Committed by Yu Yang on Jul 13, 2017

* Refine register methods so that Ops can get rid of whole-archive
* Declare `USE_OP` before an op is used.
* Add unittest for add_op.

a0aaafe9

12 Jul, 2017 · 1 commit
- test OpKernel (#2820) · be441f7d
  Committed by Qiao Longfei on Jul 12, 2017
```
Add unit test for OpKernel
```
  be441f7d
