- 28 Sep, 2018 2 commits
-
-
Committed by Yu Yang
Polish styles in generate_proposals_op.
1. Inline lambda functions rather than using std::function to save variables.
2. Add `static inline` to template functions in .cc files.
   * Make them static to prevent generating symbols.
   * Make them inline to give the compiler a hint to inline them where possible.
   * Note that if the functions are not static, they cannot be inlined since the symbols should be exported.
3. Add `static` to global functions in .cc files.
   * Make them static to prevent generating symbols.
4. Use Vector<uint64> instead of manually managing storage between devices.
5. Prefer platform::ForRange, so that `ForRange` can be optimized by changing only `for_range.h` if needed.
6. Do not change the shape of inputs.
test=develop
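The first three points of the commit message above can be illustrated with a minimal C++ sketch. It is not taken from generate_proposals_op: `Clamp`, `ForEachIndex`, and `ClampAll` are hypothetical names, and `ForEachIndex` is only a stand-in for a for-range helper, not `platform::ForRange` itself. The sketch assumes the usual meaning of `static` at namespace scope (internal linkage, so the function is not visible to other translation units) and shows a lambda passed as a template parameter instead of being stored in a `std::function`.

```cpp
#include <cstddef>
#include <vector>

// Hypothetical file-local template helper (not from the commit).
// `static` gives internal linkage, so it is not exported from this .cc file;
// `inline` is only a hint to the compiler.
template <typename T>
static inline T Clamp(T v, T lo, T hi) {
  return v < lo ? lo : (v > hi ? hi : v);
}

// Hypothetical stand-in for a for-range helper. The callable is a template
// parameter, so a lambda passed here keeps its concrete type and can be
// inlined, unlike a call through a type-erased std::function.
template <typename Fn>
static inline void ForEachIndex(std::size_t n, Fn fn) {
  for (std::size_t i = 0; i < n; ++i) {
    fn(i);
  }
}

// Hypothetical non-template function in a .cc file: `static` keeps it local
// to this translation unit.
static std::vector<int> ClampAll(const std::vector<int>& in, int lo, int hi) {
  std::vector<int> out(in.size());
  // The lambda is written inline at the call site rather than stored in a
  // std::function variable first.
  ForEachIndex(in.size(),
               [&](std::size_t i) { out[i] = Clamp(in[i], lo, hi); });
  return out;
}
```

Routing every loop through one helper, as point 5 suggests, means a later change to that helper (for example a parallel implementation) would apply to all call sites without touching them.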
-
Committed by Yu Yang
Use OO style to rewrite memory allocation.
-
- 27 Sep, 2018 5 commits
-
-
Committed by chengduo
test=develop
-
Committed by qingqing01
* Add CUDA implementation for generate_proposals_op. * Clean code. * Update code.
-
Committed by tangwei12
* add dist ut for text_classification * add dist ut for text_classification * add simnet bow unittest * add dist ut for simnet bow * add training data url for simnet bow * add training data url for simnet bow * modify simnet test_reader to train reader * add test_dist_ctr * test_dist_ctr can run now * dense update is good * add unit test for selected rows * debug unit test * fix dist sparse update problem * Constant args at init * optimize code * simnet optimize * fix DebugStringEx * optimize sum_op.h * add ScaleOpVarTypeInference * clean code * fix test_dist_transpiler.py * code optimize * modify delta * fix sparse update bug * dist test use one cpu * update some data * remove unused code * add use cuda config * unit test fix * unit test fix * unit test fix * unit test fix * dist_word2vec use CPU * unit test fix * unit test fix * code clean * code clean * merge develop * api spec update * Revert: api spec update * replace simnet data with fake * replace simnet data with fake * update dim * add batch auc * code clean * code clean * modify print to stderr * update simnet delta -> 1e-5 * update RUN_STEP * add use_reader_alloc * add use_reader_alloc * add use_reader_alloc * modify delta * add use_reader_alloc * fix stderr write * python3 compatibility test=develop * python3 compatibility, test=develop * Update dist_text_classification.py * test=develop
-
Committed by typhoonzero
This reverts commit 1d91a49d.
-
Committed by tangwei12
* add distributed auc * add attr "is distributed" and config it * add distributed auc * add batch auc and code format * code format * auc optimize * metric_op optimize * code clean * bug fix and code clean * bug fix and code clean * code optimize * code optimize * api spec update * Comments optimized * add mutex * Revert: add mutex * remove distribute metric * remove distribute metric * spec modified * add annotation, test=develop * keep API compatibility test=develop
-
- 26 Sep, 2018 3 commits
-
-
Committed by tensor-tang
test=develop
-
Committed by chengduo
* some trivial opt * remove the fix of lod_tensor and shrink_rnn_memory_op * refine ShrinkRNNMemoryOp test=develop
-
Committed by ktlichkid
-
- 25 Sep, 2018 3 commits
-
-
Committed by dzhwinter
-
Committed by Michal Gallus
-
Committed by Michal Gallus
Also fix MKL-DNN pooling integration for ceil mode
-
- 21 Sep, 2018 9 commits
-
-
Committed by nhzlx
-
Committed by chengduo
* add ShareLoDs * refine * add Is EmptyVarName * refine Sharedlod
-
Committed by sneaxiy
-
Committed by gongweibao
-
Committed by JiabinYang
-
Committed by gongweibao
-
Committed by Wu Yi
* wip * clean up * should fix running with memopt * add ut * mark lr schedule op role * hide lr_schedule_guard * use op_role_var instead of ufind * unify dist test name * wip for py3 support * fix var deref * fix python3 mem_opt order * remove comments
-
Committed by Yu Yang
-
Committed by whs
* Add roi perspective transform. * Add roi_perspective_transform_op. * Fix code style. * Add python api and fix doc. * Fix API.spec * Fix python api. * Fix API.spec * Move src to detection.
-
- 20 Sep, 2018 8 commits
-
-
Committed by sneaxiy
-
Committed by sneaxiy
-
Committed by Yu Yang
-
Committed by sneaxiy
-
Committed by Yihua Xu
* Optimize GRU with AVX instruction * Clean code * Add the unit test and fix the alignment issue * Remove the remnant of the unit test part * Code clean * Fix the parameters length issue for fusion_gru to pass CI * Change the default type as float32
-
Committed by chengduo
* Add Preface * Add demo code * Save file * Refine code * seems can work * use elementwise strategy * Use ElementwiseComputeEx * Add comments * extract functions from operator * Refine code * Follow comment * code refine * add op_fuse pass * add backward * code refine * use TopologySortOperations * follow comments * refine IsFusible * code enhance * fix op_fusion_pass * refine code * refine fuse_elemwise_act_op * adjust the input and output * refine logic * add intermediate_edge * disable inplace * follow comments * refine logic * follow comments * Remove the removable IntermediateOut * change strategy * code refine * enable fuse backward * code refine * code refine * rename unit test * follow comments
-
- 19 Sep, 2018 7 commits
-
-
Committed by Qingsheng Li
Sequence Scatter Op
-
Committed by nhzlx
-
Committed by nhzlx
-
Committed by chengduo
* add unit test for nested drnn * add nested dyn_rnn * refine while_op * fix bug
-
Committed by whs
* Add truncated gaussian initializer. * Fix unit test. * Update API.spec * Fix code style and fix bug. * Fix code style. * Small fix.
-
Committed by JiabinYang
-
Committed by Dun
* loosen the restriction of output_size in conv2d_transpose * test and docs * fix code style * fix ci test error * bug fix * fix python3 issue
-
- 18 Sep, 2018 3 commits