- 31 10月, 2018 1 次提交
-
-
由 Yu Yang 提交于
* feat(platform): lazy initialization of devicecontext in pool Use std::async(deferer, []{...}) to lazy initialize DeviceContext in Pool test=develop * Add future includes test=develop
-
- 29 10月, 2018 1 次提交
-
-
由 Wu Yi 提交于
* fix nccl2 lars dist support * put lars in momentum op * add tests lars * fix ci * fix cpu kernel * soft warning * remove lars in test_recognize_digits.py * move to another op * add file * update api.spec test=develop * update test=develop * fix api.spec test=develop * wip * wip, finish grad merge ops * wip, finish graph build * wip test running * work on 1 gpu * workable version * update * fix tests * fuse broadcast op * fix compile failed * refine * add batch merge test mnist * fix CI test=develop * fix build * use independent bn params for batch merge test=develop * update api.spec * follow comments and for test * wip * refine tests test=develop * follow comments test=develop * remove startup bn modify test=develop * follow comments test=develop * fix merge test=develop
-
- 28 10月, 2018 1 次提交
-
-
由 Wu Yi 提交于
* fix graph num hang test=develop * re-enable tests test=develop * re-enable graph num check test=develop * fix multi device pass role check test=develop
-
- 27 10月, 2018 1 次提交
-
-
由 Qiao Longfei 提交于
-
- 25 10月, 2018 1 次提交
-
-
由 Xin Pan 提交于
please fix offline
-
- 23 10月, 2018 1 次提交
-
-
由 chengduo 提交于
test=develop
-
- 21 10月, 2018 1 次提交
-
-
由 chengduozh 提交于
test=develop
-
- 15 10月, 2018 2 次提交
- 30 9月, 2018 1 次提交
-
-
由 sneaxiy 提交于
-
- 29 9月, 2018 1 次提交
-
-
由 chengduo 提交于
test=develop
-
- 27 9月, 2018 1 次提交
-
-
由 chengduo 提交于
* add GraphNum test=develop * add graph number check in parallelExecutor test=develop * fix transformer_model bug test=develop * fix graph num
-
- 25 9月, 2018 1 次提交
-
-
由 Xin Pan 提交于
-
- 20 9月, 2018 1 次提交
-
-
由 chengduo 提交于
* Add Preface * Add demo code * Save file * Refine code * seems can work * use elementwise strategy * Use ElementwiseComputeEx * Add comments * extract functions from operator * Refine code * Follow comment * code refine * add op_fuse pass * add backward * code refine * use TopologySortOperations * follow comments * refine IsFusible * code enhance * fix op_fusion_pass * refine code * refine fuse_elemwise_act_op * adjust the input and output * refine logic * add intermediate_edge * disable inplace * follow comments * refine logic * follow comments * Remove the removable IntermediateOut * change strategy * code refine * enable fuse backward * code refine * code refine * rename unit test * follow comments
-
- 17 9月, 2018 2 次提交
- 15 9月, 2018 1 次提交
-
-
由 sneaxiy 提交于
-
- 10 9月, 2018 2 次提交
- 14 8月, 2018 1 次提交
-
-
由 yuyang18 提交于
-
- 09 8月, 2018 1 次提交
-
-
由 Xin Pan 提交于
Reduce one level of inheritence.
-
- 27 7月, 2018 1 次提交
-
-
由 Xin Pan 提交于
-
- 26 7月, 2018 5 次提交
- 22 7月, 2018 1 次提交
-
-
由 Xin Pan 提交于
-
- 18 7月, 2018 5 次提交
- 15 7月, 2018 1 次提交
-
-
由 chengduo 提交于
* Add learning rate decay test * fix test name * doesn't share @LR_DECAY_COUNTER@
-
- 13 7月, 2018 1 次提交
-
-
由 chengduo 提交于
* refine multi-thread CPU Parallel exe * refine multi thread CPU Parallel exe * Refine CPU version for ParallelExecutor * add share_parameter_between_cards_ * Fix ParallelExecutor bug * Fix unit test * Fix parameter opt balance * Fix with opti (param->grad) * Add grad to op var * Remove shard_param_between_cards
-
- 12 7月, 2018 2 次提交
-
-
由 Yancey1989 提交于
-
由 Yancey1989 提交于
-
- 29 6月, 2018 1 次提交
-
-
由 chengduo 提交于
* Fix tensorcopy bug * follow comment * Refine TensorCopy
-
- 28 6月, 2018 1 次提交
-
-
由 chengduo 提交于
-
- 26 6月, 2018 1 次提交
-
-
由 yi.wu 提交于
-