- 11 6月, 2018 1 次提交
-
-
由 dzhwinter 提交于
* "add inplace attribute" * "register inplace attribute" * "change se-next model for memory-reuse" * "fix typo" * repick * fix merge conflict * "fix stupid error"
-
- 07 6月, 2018 1 次提交
-
-
由 mozga-intel 提交于
* Add MKLDNN layout support in Paddle Add MKLDNN layout in Paddle so that MKLDNN friendly memory layout can be used in MKLDNN enabled OP kernel. Before this commit, NCHW is hardcode to be used in all MKLDNN op kernels. As a result, non-optimized execution path is selected in MKLDNN primitive which bring worse performance. Besides framework change, three MKLDNN OP kernels were updated for using new MKLDNN layout. They are conv/pool2d/batch_norm. Other MKLDNN OP kernels need be also updated in similar way to achieve best performance. * Add MKLDNN layout support in activation OP * Don't populate layout from input to output when kMKLDNN in * Refine pool mkldnn op kernel * MKLDNN layout * Remove the inferitance from tensor file * MKLDNN layout: refactoring * Remove additional #define to register new operator * Prepare mkldnn tests to work with layout
-
- 08 5月, 2018 1 次提交
-
-
由 Yu Yang 提交于
Do not use ctor * Reduce line of codes. * We can use virtual function for Maker now. * The implementation does not care what maker holds, it is easier to refactor later.
-
- 03 5月, 2018 1 次提交
-
-
由 dzhwinter 提交于
* "fix double type error" * "fix ci" * "softmax fp64" * "fix momentum" * "fix ci"
-
- 19 4月, 2018 1 次提交
-
-
由 Yang Yang(Tony) 提交于
* script to add semicolon * fix typo
-
- 17 4月, 2018 2 次提交
-
-
由 Jacek Czaja 提交于
- EPS added to softmax mkldnn primitive outcome is limited to training phase Fixes after review clang format fixes clang format fixes
-
由 Yang Yang 提交于
-
- 07 4月, 2018 1 次提交
-
-
由 Kexin Zhao 提交于
* initial commit * fix error * fix typo and order
-
- 21 3月, 2018 3 次提交
-
-
由 Jacek Czaja 提交于
removed diagnostic - Added Unit tests for Softmax MKLDNN Forward Added fix for div by 0 to happen in cross_entropy backward Conflicts: paddle/fluid/operators/CMakeLists.txt - Cosmetic fixes to SoftMax MKLDNN fluid operator Added misssing softmax fluid operator file Disabled MKLDNN softmax operator by default Fix to softmax op unittest merge clang_formater fixes clang_formatter fixes - Name changing of softmax mkldnn operator to maintin consistency across codebase - updated comment fix to comment
-
由 Kexin Zhao 提交于
-
由 Kexin Zhao 提交于
-
- 15 3月, 2018 1 次提交
-
-
由 dzhwinter 提交于
* "add softmax cudnn functor support" * "add testing" * "refine cmakelist" * "sequence softmax forward speed up" * "add softmax grad" * "fix sequence softmax test" * "add double precision' * "fix softmax test" * "add softmax cudnn support" * "fix softmax cudnn test" * "add softmax to nn.py" * "fix compile bug" * "refine cmakelist" * "fix ci" * "fix based on comment" * "fix based on comments" * "fix ci"
-
- 12 2月, 2018 1 次提交
-
-
由 qingqing01 提交于
-
- 10 2月, 2018 2 次提交
- 10 1月, 2018 1 次提交
-
-
由 Qiao Longfei 提交于
* add lod tensor ToAbsOffset test * add share lod to topk op and softmax op
-
- 26 12月, 2017 1 次提交
-
-
由 fengjiayi 提交于
-
- 20 12月, 2017 1 次提交
-
-
由 Yu Yang 提交于
* Move framework.proto to proto namespace * Fix compile * Fix compile * Fix Compile
-
- 12 12月, 2017 1 次提交
-
-
由 QI JUN 提交于
There are mainly following fixes: - take `DeviceContext` as the template parameter of math functors and OpKernel instead of `Place` - remove `eigen_device` interface in base class `DeviceContext` - remove `GetEigenDevice` interface in `ExecutionContext` and base class `DeviceContext` - remove unused `platform::EigenDeviceConverter` - rename `REGISTER_OP_GPU_KERNEL` to `REGISTER_OP_CUDA_KERNEL` - rename `USE_GPU_ONLY_OP` to `USE_CUDA_ONLY_OP`
-
- 23 11月, 2017 1 次提交
-
-
由 caoying03 提交于
-
- 05 11月, 2017 1 次提交
-
-
由 kavyasrinet 提交于
* Doc fix for smooth L1 loss * Adding doc for softmax_op * Added doc for softmax_with_cross_entropy * Adding documentation for transpose_op * small change to restart TeamCity CI
-
- 17 10月, 2017 1 次提交
-
-
由 Yu Yang 提交于
They are public now
-
- 07 10月, 2017 1 次提交
-
-
由 qiaolongfei 提交于
-
- 27 9月, 2017 1 次提交
-
-
由 Qiao Longfei 提交于
* init Infershape * add static InferShape interface * refactor add-op infershape * add AttrReader * add all maker's infershape * add all InferShape * add python infer api * add VarDesc interface * add python VarDesc and OpDesc interface * update python code * use infershape function to do shape inference * clean code * do not use pointer * refine code of op_proto_maker * add get_dims to VarDesc * refine the code * remove the dependency from operator to op registry * remove OpProtoAndCheckerMaker from operator * restore complete_add_op * add shape_infer_impl.h * code optimization * remove const return value * add fake BlockDesc class * optimize code * remove infer function in op_info * move InferShapeContextImpl to operator.h * optimize the interface of InferShapeContextBase * add temperary interface of new infershape * change add_op, clip_op, conv2d_op and activation_op * change all operators InferShape * fix SetDim * update cos_sim_op * update crop_op * update lookup_table_op * allocate tensor when call GetDim in InferShapeContext * update modified_huber_loss_op * update rowwise_add_op * update mean_op * update sequence_avg_pool_op * typo * remove old InferShape interface * can compile * fix or unit test * clean code * clean code * remove const before InferShapeContext * change InferenceContextBase to pointer * rename RunTime to Runtime, code clean
-
- 21 9月, 2017 1 次提交
-
-
由 dangqingqing 提交于
-
- 15 9月, 2017 1 次提交
-
-
由 Liu Yiqun 提交于
-
- 13 9月, 2017 1 次提交
-
-
由 dangqingqing 提交于
-
- 07 9月, 2017 1 次提交
-
-
由 caoying03 提交于
-
- 06 9月, 2017 1 次提交
-
-
由 caoying03 提交于
-
- 05 9月, 2017 2 次提交
- 03 9月, 2017 1 次提交
-
-
由 fengjiayi 提交于
-
- 12 8月, 2017 5 次提交
- 08 8月, 2017 3 次提交
-
-
由 dongzhihong 提交于
-
由 Yu Yang 提交于
-
由 Yan Chunwei 提交于
* fix some enforce * remove compatible_type to avoid compile error * remove shared_ptr * fix tensor error msg
-