- 07 6月, 2018 23 次提交
-
-
由 dzhwinter 提交于
* "init benchmark ops" * "untrack outputs" * "delete some usused code" * "benchmark" * "fix ci" * "fix op test" * "fix uint16 missing" * "fix ci" * "follow comments" * "fix ci" * "follow comments" * "conficts. merge develop branch" * repick * "merge develop branch"
-
由 mozga-intel 提交于
* Add MKLDNN layout support in Paddle Add MKLDNN layout in Paddle so that MKLDNN friendly memory layout can be used in MKLDNN enabled OP kernel. Before this commit, NCHW is hardcode to be used in all MKLDNN op kernels. As a result, non-optimized execution path is selected in MKLDNN primitive which bring worse performance. Besides framework change, three MKLDNN OP kernels were updated for using new MKLDNN layout. They are conv/pool2d/batch_norm. Other MKLDNN OP kernels need be also updated in similar way to achieve best performance. * Add MKLDNN layout support in activation OP * Don't populate layout from input to output when kMKLDNN in * Refine pool mkldnn op kernel * MKLDNN layout * Remove the inferitance from tensor file * MKLDNN layout: refactoring * Remove additional #define to register new operator * Prepare mkldnn tests to work with layout
-
由 fengjiayi 提交于
fix a compile error on Mac
-
由 Yu Yang 提交于
[Feature] Add fuse vars op handle
-
由 Tao Luo 提交于
rewrite unittest of trt_activation_op
-
由 fengjiayi 提交于
-
-
由 Luo Tao 提交于
-
由 fengjiayi 提交于
Add unittests to check channelwise add
-
由 fengjiayi 提交于
Add reverse op
-
由 Luo Tao 提交于
-
由 Tao Luo 提交于
install libgtk2.0-dev in latest images
-
由 Yu Yang 提交于
SSA Graph Builder Factory
-
由 Yu Yang 提交于
Extract method from tensor_impl.h to tensor.cc
-
由 Luo Tao 提交于
-
由 weixing 提交于
Fix formula display error
-
由 gongweibao 提交于
-
由 dzhwinter 提交于
-
由 Xin Pan 提交于
-
由 gongweibao 提交于
-
由 Xin Pan 提交于
fix cudnn version issue
-
由 tensor-tang 提交于
Fix abort issue in cpu multi-threads
-
由 Yan Chunwei 提交于
-
- 06 6月, 2018 17 次提交
-
-
由 Tao Luo 提交于
add python-opencv in paddlepaddle/paddle:lastest images
-
由 qingqing01 提交于
* Enable assertions in CUDA. * Fix PADDLE_ASSERT.
-
由 tensor-tang 提交于
Merge remote-tracking branch 'ups/develop' into multithreads
-
由 yuyang18 提交于
* Use Builder Chain to decorate new builders. It is easy to extend builders. * Make graphviz path as a build strategy, not a FLAGS.
-
-
由 chengduoZH 提交于
-
由 Xin Pan 提交于
small clean up and document pointer ownership.
-
由 tensor-tang 提交于
-
由 Luo Tao 提交于
-
由 Luo Tao 提交于
-
由 dzhwinter 提交于
-
-
-
由 Xin Pan 提交于
-
由 Tao Luo 提交于
disable failed tests
-
由 tensor-tang 提交于
-
由 Yu Yang 提交于
Try to speed up parallel executor
-