- 08 Jun, 2018 3 commits
- 07 Jun, 2018 8 commits
-
-
Committed by dzhwinter
* "split into multiple .ccl"
* "refine file structure"
* "refine files"
* "remove the cmakelist"
* "fix typo"
* "fix typo"
* fix ci
-
Committed by dzhwinter
* "init benchmark ops"
* "untrack outputs"
* "delete some usused code"
* "benchmark"
* "fix ci"
* "fix op test"
* "fix uint16 missing"
* "fix ci"
* "follow comments"
* "fix ci"
* "follow comments"
* "conficts. merge develop branch"
* repick
* "merge develop branch"
-
Committed by fengjiayi
-
Committed by mozga-intel
* Add MKLDNN layout support in Paddle
  Add MKLDNN layout in Paddle so that MKLDNN friendly memory layout can be used in MKLDNN enabled OP kernel. Before this commit, NCHW is hardcode to be used in all MKLDNN op kernels. As a result, non-optimized execution path is selected in MKLDNN primitive which bring worse performance. Besides framework change, three MKLDNN OP kernels were updated for using new MKLDNN layout. They are conv/pool2d/batch_norm. Other MKLDNN OP kernels need be also updated in similar way to achieve best performance.
* Add MKLDNN layout support in activation OP
* Don't populate layout from input to output when kMKLDNN in
* Refine pool mkldnn op kernel
* MKLDNN layout
* Remove the inferitance from tensor file
* MKLDNN layout: refactoring
* Remove additional #define to register new operator
* Prepare mkldnn tests to work with layout
-
Committed by fengjiayi
-
Committed by Luo Tao
-
Committed by gongweibao
-
Committed by Yan Chunwei
-
- 06 Jun, 2018 18 commits
-
-
Committed by qingqing01
* Enable assertions in CUDA.
* Fix PADDLE_ASSERT.
-
Committed by yuyang18
* Use Builder Chain to decorate new builders. It is easy to extend builders.
* Make graphviz path as a build strategy, not a FLAGS.
-
Committed by chengduoZH
-
Committed by Xin Pan
-
Committed by tensor-tang
-
Committed by tensor-tang
-
Committed by tensor-tang
-
Committed by yuyang18
-
Committed by dzhwinter
-
Committed by dzhwinter
-
Committed by fengjiayi
-
Committed by fengjiayi
-
Committed by dzhwinter
-
Committed by fengjiayi
-
Committed by Luo Tao
-
Committed by fengjiayi
-
Committed by Yan Chunwei
-
Committed by dzhwinter
* "fix deterministic"
* "fix ci"
* "fix init"
-
- 05 Jun, 2018 6 commits
-
-
Committed by Yuan Gao
* add normalize switch to box_coder_op
* add default prior box var
* update according to the review
-
Committed by Wu Yi
-
Committed by whs
-
Committed by Qiao Longfei
fix protobuf memory leak
-
Committed by sneaxiy
-
Committed by Siddharth Goyal
-
- 04 Jun, 2018 5 commits
-
-
Committed by qiaolongfei
-
Committed by fengjiayi
-
Committed by tensor-tang
-
Committed by fengjiayi
-
Committed by tensor-tang
-