- 12 Jun, 2018 5 commits
-
-
Committed by yuyang18
* less_than * cumsum * multiplex * open_recordio_file
-
Committed by Wu Yi
* wip * use executor.complete to end trainer * fix build * fix build with distribute off * fix typo * fix cmake typo * fix build
-
Committed by Luo Tao
-
Committed by qiaolongfei
-
Committed by qingqing01
* Add normalization operator. 1. Refine the raw norm_op and make it more general so that it can normalize a Tensor along any axis. 2. There is a bug in the l2_normalize API: it lacks a sqrt after `reduce_sum`. 3. Use norm_op to refine the l2_normalize API. 4. Fix a bug in test_normalization_wrapper.py.
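The bug noted in item 2 of the commit above can be illustrated with a minimal sketch of L2 normalization along an axis. This is not the Paddle implementation: the function name, the pure-Python list representation, and the `epsilon` default are assumptions made for illustration only. It shows why a square root must follow the sum of squares.

```python
import math


def l2_normalize(rows, axis=1, epsilon=1e-12):
    """Scale a 2-D list of floats to unit L2 norm along `axis` (0 or 1).

    Hypothetical helper for illustration; not the Paddle API.
    """
    if axis not in (0, 1):
        raise ValueError("axis must be 0 or 1")
    # For axis 0, transpose so that each inner list is one column.
    data = [list(c) for c in zip(*rows)] if axis == 0 else [list(r) for r in rows]
    out = []
    for vec in data:
        sum_sq = sum(v * v for v in vec)    # the reduce_sum step
        norm = math.sqrt(sum_sq + epsilon)  # the sqrt that must follow it
        out.append([v / norm for v in vec])
    # Transpose back so the result has the same shape as the input.
    return [list(c) for c in zip(*out)] if axis == 0 else out


normalized = l2_normalize([[3.0, 4.0], [0.0, 5.0]], axis=1)
```

Dividing by `sum_sq` directly, without the square root, would scale the row `[3.0, 4.0]` by 25 rather than 5, so the result would not have unit length.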
-
- 11 Jun, 2018 18 commits
-
-
Committed by whs
* Add slice op. * Remove using from header file and fix doc. * Fix doc * Small fix.
-
Committed by Luo Tao
-
Committed by qiaolongfei
-
Committed by gongweibao
-
Committed by Xin Pan
The status is updated in the Process() thread and can be checked in another HandleRequest() thread.
-
Committed by qiaolongfei
-
Committed by Luo Tao
-
Committed by qiaolongfei
-
Committed by dzhwinter
* "add inplace attribute" * "register inplace attribute" * "change se-next model for memory-reuse" * "fix typo" * repick * fix merge conflict * "fix stupid error"
-
Committed by qiaolongfei
-
Committed by qiaolongfei
-
Committed by qiaolongfei
-
Committed by gongweibao
-
Committed by yuyang18
* row_conv * uniform_random * layer_norm * create_parameter * hard_shrink * ssd_loss
-
Committed by qiaolongfei
-
Committed by yuyang18
* Remove unused arg_max/min_op.h * Remove reference parameter. Use pointer instead. * undef macro * Always set OutT as int64_t.
-
Committed by mozga-intel
-
Committed by mozga-intel
-
- 08 Jun, 2018 9 commits
-
-
Committed by sneaxiy
-
Committed by Yancey1989
-
Committed by guochaorong
-
Committed by yuyang18
-
Committed by sneaxiy
-
Committed by yuyang18
-
Committed by yuyang18
-
Committed by fengjiayi
-
Committed by Yancey1989
-
- 07 Jun, 2018 8 commits
-
-
Committed by Xin Pan
-
Committed by dzhwinter
* "split into multiple .ccl" * "refine file structure" * "refine files" * "remove the cmakelist" * "fix typo" * "fix typo" * fix ci
-
Committed by fengjiayi
-
Committed by fengjiayi
-
Committed by mozga-intel
* Add MKLDNN layout support in Paddle. Add an MKLDNN layout to Paddle so that an MKLDNN-friendly memory layout can be used in MKLDNN-enabled OP kernels. Before this commit, NCHW was hardcoded in all MKLDNN op kernels. As a result, a non-optimized execution path was selected in the MKLDNN primitive, which gave worse performance. Besides the framework change, three MKLDNN OP kernels were updated to use the new MKLDNN layout: conv, pool2d, and batch_norm. Other MKLDNN OP kernels also need to be updated in a similar way to achieve the best performance. * Add MKLDNN layout support in activation OP * Don't populate layout from input to output when kMKLDNN in * Refine pool mkldnn op kernel * MKLDNN layout * Remove the inheritance from tensor file * MKLDNN layout: refactoring * Remove additional #define to register new operator * Prepare mkldnn tests to work with layout
-
Committed by fengjiayi
-
Committed by gongweibao
-
Committed by Yan Chunwei
-