提交 · bfa3fd6f152227c5a527b3fa89bbdd34e37ca94a · Crayon鑫 / Paddle

11 6月, 2018 1 次提交

add inplace attribute to op_proto_maker (#10665) · bfa3fd6f

由 dzhwinter 提交于 6月 11, 2018

* "add inplace attribute"

* "register inplace attribute"

* "change se-next model for memory-reuse"

* "fix typo"

* repick

* fix merge conflict

* "fix stupid error"

bfa3fd6f

07 6月, 2018 1 次提交

Mkldnn layout (#11040) · 3ff9ba0e

由 mozga-intel 提交于 6月 07, 2018

* Add MKLDNN layout support in Paddle

Add MKLDNN layout in Paddle so that MKLDNN friendly memory layout
can be used in MKLDNN enabled OP kernel. Before this commit, NCHW
is hardcode to be used in all MKLDNN op kernels. As a result,
non-optimized execution path is selected in MKLDNN primitive which
bring worse performance.
Besides framework change, three MKLDNN OP kernels were updated
for using new MKLDNN layout. They are conv/pool2d/batch_norm.
Other MKLDNN OP kernels need be also updated in similar way to
achieve best performance.

* Add MKLDNN layout support in activation OP

* Don't populate layout from input to output when kMKLDNN in

* Refine pool mkldnn op kernel

* MKLDNN layout

* Remove the inferitance from tensor file

* MKLDNN layout: refactoring

* Remove additional #define to register new operator

* Prepare mkldnn tests to work with layout

3ff9ba0e

08 5月, 2018 1 次提交

Clean OpProtoAndCheckerMaker · 0e78cb69

由 Yu Yang 提交于 5月 08, 2018

Do not use ctor

* Reduce line of codes.
* We can use virtual function for Maker now.
* The implementation does not care what maker holds, it is easier to
refactor later.

0e78cb69

03 5月, 2018 1 次提交

Fix/fp64 (#10346) · f63ff90b

由 dzhwinter 提交于 5月 03, 2018

* "fix double type error"

* "fix ci"

* "softmax fp64"

* "fix momentum"

* "fix ci"

f63ff90b

19 4月, 2018 1 次提交
- Y
  add semicolon to op registry (#10034) · e04c43d5
  由 Yang Yang(Tony) 提交于 4月 18, 2018
```
* script to add semicolon

* fix typo
```
  e04c43d5
17 4月, 2018 2 次提交
- J
  - Added EPS for softmax MKLDNN op · acdf7cbd
  由 Jacek Czaja 提交于 4月 16, 2018
```
- EPS added to softmax mkldnn primitive outcome is limited to training
phase

Fixes after review

clang format fixes

clang format fixes
```
  acdf7cbd
- Y
  
  script to fix all · ce7c2e86
  由 Yang Yang 提交于 4月 16, 2018
  
  ce7c2e86
07 4月, 2018 1 次提交
- K
  Add float16 support to non-cudnn softmax op on GPU (#9686) · b2a1c9e8
  由 Kexin Zhao 提交于 4月 06, 2018
```
* initial commit

* fix error

* fix typo and order
```
  b2a1c9e8
21 3月, 2018 3 次提交

- Softmax MKLDNN primitive integration · 3b95b55f

由 Jacek Czaja 提交于 3月 01, 2018

removed diagnostic

- Added Unit tests for Softmax MKLDNN Forward

Added fix for div by 0 to happen in cross_entropy backward

Conflicts:
	paddle/fluid/operators/CMakeLists.txt

- Cosmetic fixes to SoftMax MKLDNN fluid operator

Added misssing softmax fluid operator file

Disabled MKLDNN softmax operator by default

Fix to softmax op unittest merge

clang_formater fixes

clang_formatter fixes

- Name changing of softmax mkldnn operator to maintin consistency
  across codebase

- updated comment

fix to comment

3b95b55f

K

small fix · b7801b9f
由 Kexin Zhao 提交于 3月 20, 2018

b7801b9f
K

initial commit · 70e71227
由 Kexin Zhao 提交于 3月 20, 2018

70e71227

15 3月, 2018 1 次提交

[Speed]implement cudnn sequence softmax cudnn (#8978) · 128adf53

由 dzhwinter 提交于 3月 15, 2018

* "add softmax cudnn functor support"

* "add testing"

* "refine cmakelist"

* "sequence softmax forward speed up"

* "add softmax grad"

* "fix sequence softmax test"

* "add double precision'

* "fix softmax test"

* "add softmax cudnn support"

* "fix softmax cudnn test"

* "add softmax to nn.py"

* "fix compile bug"

* "refine cmakelist"

* "fix ci"

* "fix based on comment"

* "fix based on comments"

* "fix ci"

128adf53

12 2月, 2018 1 次提交
- Q
  
  Fix the grammar in copyright. (#8403) · 24509f4a
  由 qingqing01 提交于 2月 12, 2018
  
  24509f4a
10 2月, 2018 2 次提交
- Y
  
  Correct #include path · fc374821
  由 Yi Wang 提交于 2月 09, 2018
  
  fc374821
- Y
  
  Move file to fluid/; Edit CMakeLists.txt · 90648f33
  由 Yi Wang 提交于 2月 09, 2018
  
  90648f33
10 1月, 2018 1 次提交
- Q
  Topk share lod (#7373) · 91f80f79
  由 Qiao Longfei 提交于 1月 10, 2018
```
* add lod tensor ToAbsOffset test

* add share lod to topk op and softmax op
```
  91f80f79
26 12月, 2017 1 次提交
- F
  
  Change softmax · 874cac0c
  由 fengjiayi 提交于 12月 26, 2017
  
  874cac0c
20 12月, 2017 1 次提交
- Y
  Move framework.proto to proto namespace (#6718) · e445b3ff
  由 Yu Yang 提交于 12月 20, 2017
```
* Move framework.proto to proto namespace

* Fix compile

* Fix compile

* Fix Compile
```
  e445b3ff
12 12月, 2017 1 次提交

Refine device context (#6433) · 61ec0b95

由 QI JUN 提交于 12月 12, 2017

There are mainly following fixes:

- take `DeviceContext` as the template parameter of math functors and OpKernel instead of `Place`
- remove `eigen_device` interface in base class  `DeviceContext`
- remove `GetEigenDevice` interface in `ExecutionContext` and base class `DeviceContext`
- remove unused `platform::EigenDeviceConverter`
- rename `REGISTER_OP_GPU_KERNEL` to `REGISTER_OP_CUDA_KERNEL`
- rename `USE_GPU_ONLY_OP` to `USE_CUDA_ONLY_OP`

61ec0b95

23 11月, 2017 1 次提交
- C
  
  fix LaTeX syntax in liear_chain_crf op. · 8ba62a5f
  由 caoying03 提交于 11月 23, 2017
  
  8ba62a5f
05 11月, 2017 1 次提交

Fixing documentations for few more operators (#5374) · e65ab795

由 kavyasrinet 提交于 11月 04, 2017

* Doc fix for smooth L1 loss

* Adding doc for softmax_op

* Added doc for softmax_with_cross_entropy

* Adding documentation for transpose_op

* small change to restart TeamCity CI

e65ab795

17 10月, 2017 1 次提交
- Y
  Correct OpWithKernel's infershape (#4847) · 73a8b78a
  由 Yu Yang 提交于 10月 16, 2017
```
They are public now
```
  73a8b78a
07 10月, 2017 1 次提交
- Q
  
  rename InferShapeContextBase to InferShapeContext · c0a34e1c
  由 qiaolongfei 提交于 10月 07, 2017
  
  c0a34e1c
27 9月, 2017 1 次提交

Refactoring InferShape (#3946) · 9a9d50a6

由 Qiao Longfei 提交于 9月 26, 2017

* init Infershape

* add static InferShape interface

* refactor add-op infershape

* add AttrReader

* add all maker's infershape

* add all InferShape

* add python infer api

* add VarDesc interface

* add python VarDesc and OpDesc interface

* update python code

* use infershape function to do shape inference

* clean code

* do not use pointer

* refine code of op_proto_maker

* add get_dims to VarDesc

* refine the code

* remove the dependency from operator to op registry

* remove OpProtoAndCheckerMaker from operator

* restore complete_add_op

* add shape_infer_impl.h

* code optimization

* remove const return value

* add fake BlockDesc class

* optimize code

* remove infer function in op_info

* move InferShapeContextImpl to operator.h

* optimize the interface of InferShapeContextBase

* add temperary interface of new infershape

* change add_op, clip_op, conv2d_op and activation_op

* change all operators InferShape

* fix SetDim

* update cos_sim_op

* update crop_op

* update lookup_table_op

* allocate tensor when call GetDim in InferShapeContext

* update modified_huber_loss_op

* update rowwise_add_op

* update mean_op

* update sequence_avg_pool_op

* typo

* remove old InferShape interface

* can compile

* fix or unit test

* clean code

* clean code

* remove const before InferShapeContext

* change InferenceContextBase to pointer

* rename RunTime to Runtime, code clean

9a9d50a6

21 9月, 2017 1 次提交
- D
  
  Remove LoDTensor in some operators' InferShape and refine ShareLoD function. · 36aeb30d
  由 dangqingqing 提交于 9月 21, 2017
  
  36aeb30d
15 9月, 2017 1 次提交
- L
  
  Add the check of inputs and outputs in all operators. · eef1ccbf
  由 Liu Yiqun 提交于 9月 15, 2017
  
  eef1ccbf
13 9月, 2017 1 次提交
- D
  
  Using LoDTensor instead of Tensor in every operator. · f2992063
  由 dangqingqing 提交于 9月 13, 2017
  
  f2992063
07 9月, 2017 1 次提交
- C
  
  rename input and output of softmax_op. · 5b4526fa
  由 caoying03 提交于 9月 07, 2017
  
  5b4526fa
06 9月, 2017 1 次提交
- C
  
  refine softmax operator. · 7d16fe87
  由 caoying03 提交于 9月 06, 2017
  
  7d16fe87
05 9月, 2017 2 次提交
- C
  
  update doc of softmax_op. · dc520da7
  由 caoying03 提交于 9月 05, 2017
  
  dc520da7
- F
  
  Revert "Remove `grad_op_type` in `REGISTER_OP`" · 9a3c69c2
  由 fengjiayi 提交于 9月 04, 2017
  
  9a3c69c2
03 9月, 2017 1 次提交
- F
  
  Remove `grad_op_type` in REGISTER_OP · 79b1f33a
  由 fengjiayi 提交于 9月 02, 2017
  
  79b1f33a
12 8月, 2017 5 次提交
- Y
  
  Remove empty constructor for operator · 11c35605
  由 Yu Yang 提交于 8月 12, 2017
  
  11c35605
- Y
  
  Get `DEFINE_OPERATOR_CTOR` Back to code · 0b1052fc
  由 Yu Yang 提交于 8月 12, 2017
  
  0b1052fc
- F
  
  Merge REGISTER_OP and REGISTER_GRADIENT_OP · 2ea2fbea
  由 fengjiayi 提交于 8月 11, 2017
  
  2ea2fbea
- Y
  
  Update · 65bd7c77
  由 Yi Wang 提交于 8月 11, 2017
  
  65bd7c77
- F
  
  Refine macro · f784741d
  由 fengjiayi 提交于 8月 11, 2017
  
  f784741d
08 8月, 2017 3 次提交
- D
  
  "fix clang format" · 22f03c39
  由 dongzhihong 提交于 8月 08, 2017
  
  22f03c39
- Y
  
  Try make pass · 7e830116
  由 Yu Yang 提交于 8月 08, 2017
  
  7e830116
- Y
  fix some enforce (#3301) · 2af35002
  由 Yan Chunwei 提交于 8月 08, 2017
```
* fix some enforce

* remove compatible_type to avoid compile error

* remove shared_ptr

* fix tensor error msg
```
  2af35002

Crayon鑫 / Paddle 与 Fork 源项目一致

Crayon鑫 / Paddle
与 Fork 源项目一致