提交 · 30aad88449f1f671cbc80f006c13eced9b2bca33 · 机器未来 / Paddle

11 12月, 2018 1 次提交
- X
  fix clang · 1735022a
  由 Xin Pan 提交于 12月 11, 2018
```
test=develop
```
  1735022a
05 12月, 2018 1 次提交
- X
  allow customize kernel selection · 41c28d54
  由 Xin Pan 提交于 12月 05, 2018
```
test=develop
```
  41c28d54
22 11月, 2018 1 次提交

由 wopeizl 提交于 11月 22, 2018

* add recordio support

* disable the openblas multi-thread on windows since no support
adjust the python script

* code style

* code style
test=develop

* add create_recordio_file_reader back

* fix code style
test=develop

* fix the gtest.cmake on windows

* fix cc_test on windows

* fix the win build
test=develop

* remove fused compile support on windows
test=develop

* add the jit support
test=develop

* add the jit support, test=develop

* add the jit support, test=develop

* add the jit back
fix compile error on windows

* rollback test=develop

* test case fix

* disable DSO by default on windows

* exclude warpctc_op on windows

* exclude the dynload_warpctc out on windows
test=develop

* fix the scripts error
test=develop

* disable avx on windows by default
test=develop

* re-organize the cmake file

* disable mkl on windows by default

* add warp_ctc back

* fix the dependency

* fix the dependency

* fix the build issue on windows

* remove unsupported flag on windows

* code style

* code style
test=develop

* fix issue

* add profiler, parallel_executor back

* clean up the pre-definitions on windows

* fix build issue

* test=develop

d9a1f3e5

21 11月, 2018 1 次提交
- P
  
  clean up the pre-definitions on windows · 6e66fadb
  由 peizhilin 提交于 11月 21, 2018
  
  6e66fadb
24 9月, 2018 1 次提交
- D
  
  "fix link error" (#13545) · 97636a9f
  由 dzhwinter 提交于 9月 24, 2018
  
  97636a9f
12 9月, 2018 1 次提交
- D
  
  add demo · c3e1fb5a
  由 dzhwinter 提交于 9月 12, 2018
  
  c3e1fb5a
02 9月, 2018 1 次提交
- D
  
  switch to 9.2 · 75681c0a
  由 dzhwinter 提交于 9月 02, 2018
  
  75681c0a
25 8月, 2018 1 次提交
- D
  
  more platform is done · d7f98f37
  由 dzhwinter 提交于 8月 25, 2018
  
  d7f98f37
03 7月, 2018 1 次提交
- Y
  Remove Op::Clone method · 4e4438a8
  由 yuyang18 提交于 7月 03, 2018
```
It is used by NetOp before.
```
  4e4438a8
02 7月, 2018 3 次提交
- Y
  Add register kernel functor and shrink reshape op · 82866d4a
  由 yuyang18 提交于 7月 02, 2018
```
* Shrink reshape_op library size
* User can register a standard C++ functor as a op kernel
```
  82866d4a
- Y
  
  Make Kernel registed as a function · 3b00ed81
  由 yuyang18 提交于 7月 02, 2018
  
  3b00ed81
- Y
  
  Polish reshape op · 1ce478f1
  由 yuyang18 提交于 7月 02, 2018
  
  1ce478f1
07 6月, 2018 2 次提交

split reduce op into multiple libraries, accelerate the compiling (#11029) · d48172f2

由 dzhwinter 提交于 6月 07, 2018

* "split into multiple .ccl"

* "refine file structure"

* "refine files"

* "remove the cmakelist"

* "fix typo"

* "fix typo"

* fix ci

d48172f2

Mkldnn layout (#11040) · 3ff9ba0e

由 mozga-intel 提交于 6月 07, 2018

* Add MKLDNN layout support in Paddle

Add MKLDNN layout in Paddle so that MKLDNN friendly memory layout
can be used in MKLDNN enabled OP kernel. Before this commit, NCHW
is hardcode to be used in all MKLDNN op kernels. As a result,
non-optimized execution path is selected in MKLDNN primitive which
bring worse performance.
Besides framework change, three MKLDNN OP kernels were updated
for using new MKLDNN layout. They are conv/pool2d/batch_norm.
Other MKLDNN OP kernels need be also updated in similar way to
achieve best performance.

* Add MKLDNN layout support in activation OP

* Don't populate layout from input to output when kMKLDNN in

* Refine pool mkldnn op kernel

* MKLDNN layout

* Remove the inferitance from tensor file

* MKLDNN layout: refactoring

* Remove additional #define to register new operator

* Prepare mkldnn tests to work with layout

3ff9ba0e

18 4月, 2018 1 次提交
- Y
  
  remove REGISTER_OP and REGISTER_OP_EX · 68d96385
  由 Yang Yang 提交于 4月 17, 2018
  
  68d96385
17 4月, 2018 1 次提交
- Y
  
  first commit · dafe06af
  由 Yang Yang 提交于 4月 13, 2018
  
  dafe06af
12 2月, 2018 1 次提交
- Q
  
  Fix the grammar in copyright. (#8403) · 24509f4a
  由 qingqing01 提交于 2月 12, 2018
  
  24509f4a
10 2月, 2018 2 次提交
- Y
  
  Correct #include path · fc374821
  由 Yi Wang 提交于 2月 09, 2018
  
  fc374821
- Y
  
  Move file to fluid/; Edit CMakeLists.txt · 90648f33
  由 Yi Wang 提交于 2月 09, 2018
  
  90648f33
09 2月, 2018 1 次提交
- E
  
  cumsum operator (#8288) · 725e6448
  由 emailweixu 提交于 2月 09, 2018
  
  725e6448
17 1月, 2018 1 次提交
- Q
  
  change DEVICE_TYPE in op_registry to LIBRARY_TYPE (#7588) · 6f71f89d
  由 Qiao Longfei 提交于 1月 17, 2018
  
  6f71f89d
03 1月, 2018 1 次提交
- L
  
  add more comments in CMakelists.txt of operator · 2d2b6332
  由 Luo Tao 提交于 1月 03, 2018
  
  2d2b6332
27 12月, 2017 1 次提交

"refine kernel registrar" (#6998) · 35c1683e

由 dzhwinter 提交于 12月 27, 2017

* "refine kernel registrar"

* "refine registrar with multikey"

* "fix register"

* "refine multikernel register"

* "fix CI"

* "fix CI"

* "fix registry"

* "swtich GPU to CUDA"

* "add register macro test case"

* "fix CI"

35c1683e

25 12月, 2017 1 次提交
- D
  
  GPUPlace to CUDAPlace (#6960) · 0d2235aa
  由 dzhwinter 提交于 12月 25, 2017
  
  0d2235aa
24 12月, 2017 1 次提交
- Q
  
  rm unsed RegisterOp method in OpRegistry · 6b99402d
  由 qiaolongfei 提交于 12月 24, 2017
  
  6b99402d
22 12月, 2017 1 次提交

Enforce drop_empty_grad=false When the input of an op is duplicable. · 0bfa1f7c

由 xuwei06 提交于 12月 01, 2017

For input argument with a list of variables, drop_empty_grad is not allowed because it makes the correspondence bewteen a variable and its gradient ambiguous. Use REGISTER_OP_EX to register the op or call InputGrad(?,false) in GradOpDescMaker.

0bfa1f7c

21 12月, 2017 1 次提交
- Y
  Rename XXDescBind --> XXDesc (#6797) · 09189732
  由 Yu Yang 提交于 12月 21, 2017
```
* Rename XXDescBind --> XXDesc

* Fix Compile
```
  09189732
20 12月, 2017 1 次提交
- Y
  Move framework.proto to proto namespace (#6718) · e445b3ff
  由 Yu Yang 提交于 12月 20, 2017
```
* Move framework.proto to proto namespace

* Fix compile

* Fix compile

* Fix Compile
```
  e445b3ff
12 12月, 2017 1 次提交

Refine device context (#6433) · 61ec0b95

由 QI JUN 提交于 12月 12, 2017

There are mainly following fixes:

- take `DeviceContext` as the template parameter of math functors and OpKernel instead of `Place`
- remove `eigen_device` interface in base class  `DeviceContext`
- remove `GetEigenDevice` interface in `ExecutionContext` and base class `DeviceContext`
- remove unused `platform::EigenDeviceConverter`
- rename `REGISTER_OP_GPU_KERNEL` to `REGISTER_OP_CUDA_KERNEL`
- rename `USE_GPU_ONLY_OP` to `USE_CUDA_ONLY_OP`

61ec0b95

08 11月, 2017 1 次提交

Polish OpWithKernel · bbdac7f7

由 Yu Yang 提交于 11月 07, 2017

* Chage `IndicateDataType` to `GetKernelType`. Make it easier to
  understand.
* Change `OpKernelKey` to `OpKernelType`
* Make operator developers can customize which kernel the operator will
  use in runtime.

bbdac7f7

01 11月, 2017 1 次提交

Feature/executor use program bind (#5196) · 1363ddb6

由 Yu Yang 提交于 10月 31, 2017

* Init commit

* Make executor use ProgramDescBind

* Change Attribute from BlockDesc to BlockDescBind

* Since we will get the program desc in RNN, just BlockDesc is not
  enough.

1363ddb6

29 10月, 2017 2 次提交
- Y
  Cast Operator (#5149) · b84e8226
  由 Yu Yang 提交于 10月 28, 2017
```
* Cast Operator

Cast input variable to other data type

* Fix compile error

* Add cast op

* Follow comments
```
  b84e8226
- Y
  Extract InferShape to many cc files (#5174) · 8f6c0a0f
  由 Yu Yang 提交于 10月 28, 2017
```
* Shrink Operator.h

* Fix CI compile
```
  8f6c0a0f
24 10月, 2017 1 次提交
- D
  
  "add register gpu macro" · 423d7438
  由 Dong Zhihong 提交于 10月 23, 2017
  
  423d7438
19 10月, 2017 2 次提交

Add glog as dependencies of ops (#4908) · e9249d16

由 Yu Yang 提交于 10月 18, 2017

* Add glog as dependencies of ops

* Use VLOG to logging some information is helpful when we debug Paddle

* Fix Unittests

e9249d16

Change ProgramDesc not a global variable (#4879) · e747623e

由 Yu Yang 提交于 10月 18, 2017

* Change ProgramDesc not a global variable

* Polish code style

* Correct implement BlockDesc destructor

* Unify program as parameter name

e747623e

18 10月, 2017 1 次提交
- Y
  
  Remove private data members in OpRegister (#4871) · 5d67677c
  由 Yu Yang 提交于 10月 17, 2017
  
  5d67677c
17 10月, 2017 1 次提交
- Q
  
  remove unused C++ class OpRegistrar · eb27c735
  由 qijun 提交于 10月 16, 2017
  
  eb27c735
13 10月, 2017 1 次提交

Add no_grad_vars for grad_op_maker (#4770) · a36d2416

由 Yu Yang 提交于 10月 12, 2017

* Add no_grad_vars for grad_op_maker

* Add unittest

* Fix unittest

* Fix unittest

* Follow comment

a36d2416

10 10月, 2017 1 次提交
- Y
  
  Fix bug of foward default attribute not passed to backward · c464ec21
  由 Yu Yang 提交于 10月 09, 2017
  
  c464ec21

机器未来 / Paddle 与 Fork 源项目一致

机器未来 / Paddle
与 Fork 源项目一致