- 11 4月, 2019 1 次提交
-
-
由 Yiqun Liu 提交于
* Add an option to enable the cache of expected kernel in train phase. test=develop * Change the default value of cache_expected_kernel to true.
-
- 10 4月, 2019 1 次提交
-
-
由 liuwei1031 提交于
-
- 08 4月, 2019 1 次提交
-
-
由 Yiqun Liu 提交于
* Try to enable the runtime_context_cache pass in train phase. * Put the append of runtime_context_cache pass ahead of multi_dev passes. test=develop
-
- 02 4月, 2019 1 次提交
-
-
由 chengduo 提交于
* expose fuse broadcast ops
-
- 28 3月, 2019 2 次提交
- 20 3月, 2019 1 次提交
-
-
由 chengduo 提交于
* fuse all_reduce test=develop * add fuse_parameter_groups_size test=develop * Polish code test=develop * Fix travis-ci test=develop * Add SetGroupAccordingToLayers and SetGroupAccordingToGroupSize test=develop * Add SetGroupAccordingToMemorySize test=develop * fix multi_devices_graph test=develop * reset params_grads test=develop * Polish code test=develop
-
- 15 3月, 2019 1 次提交
-
-
由 qingqing01 提交于
* Support Sync Batch Norm. * Note, do not enable it in one device. Usage: build_strategy = fluid.BuildStrategy() build_strategy.sync_batch_norm = True binary = fluid.compiler.CompiledProgram(tp).with_data_parallel( loss_name=loss_mean.name, build_strategy=build_strategy)
-
- 06 3月, 2019 1 次提交
-
-
由 liuwei1031 提交于
* add ir memory optimize test case for IfElse op, test=develop * fix some unitttest failure by force using the python memory_optimize, test=develop * tweak comments, test=develop * fix unittest, test=develop * fix unittest, test=develop
-
- 05 3月, 2019 2 次提交
-
-
由 Qiao Longfei 提交于
-
由 liuwei1031 提交于
* add ir memory optimize test case for IfElse op, test=develop * fix some unitttest failure by force using the python memory_optimize, test=develop * tweak comments, test=develop * fix unittest, test=develop * fix unittest, test=develop
-
- 21 2月, 2019 1 次提交
-
-
由 Xin Pan 提交于
test=develop
-
- 11 2月, 2019 1 次提交
-
-
由 dzhwinter 提交于
-
- 31 1月, 2019 1 次提交
-
-
由 dzhwinter 提交于
-
- 22 1月, 2019 1 次提交
-
-
由 sneaxiy 提交于
test=develop
-
- 21 1月, 2019 2 次提交
-
-
由 dzhwinter 提交于
-
由 Dun 提交于
* mem opt * test=develop * test=develop * test=develop * test=develop * test=develop * test=develop * test=develop * refine code test=develop * refine code test=develop * refine code test=develop * refine code test=develop * refine with cub test=develop * fix mkldnn test && remove comments && test=develop * polish code && test=develop * add only_forward test && test=develop
-
- 17 1月, 2019 1 次提交
-
-
由 Qiao Longfei 提交于
-
- 07 1月, 2019 1 次提交
-
-
由 chengduo 提交于
* Refactor ParallelExecutor test=develop * extract Reduce and AllReduce mode from MultiDevSSAGraphBuilder test=develop * Refactor MultiDevSSAGraphBuilder test=developt * Remove enable_data_balance test=develop * code refine test=develop * remove data balance test=develop * refine ScaleLossGradOp test=develop * remove uncessary file test=develop * code refine test=develop * modify function name test=develop * follow comments test=develop * add is_distribution field test=develop * set is_distribution test=develop * fix DistSSAGraphBuilder test=develop
-
- 28 12月, 2018 1 次提交
-
-
由 Yancey1989 提交于
-
- 26 12月, 2018 1 次提交
-
-
由 Yancey1989 提交于
-
- 20 12月, 2018 1 次提交
-
-
由 chengduo 提交于
* refine parallel_executor * remove uncessary code test=develop
-
- 18 12月, 2018 1 次提交
-
-
由 dzhwinter 提交于
* follow comments. test=develop * Fix typo * fix compile error. test=develop * merge develop branch. test=develop * Remove set_equal * Polish code * Delete unused functions test=develop * polish code. test=develop * follow comment * polish code. * fix windows compile error. test=develop * fix op handle. * rerun ci. test=develop * rerun ci. test=develop * rerun macci. test=develop * polish code. test=develop * rewrite sort code. test=develop * remove unused code. test=develop * fix tests. test=develop * fix conflict. test=develop * follow comment. test=develop * merge develop branch. test=develop * fix tests. test=develop * remove ToTypeIndex. test=develop * rerun ci. test=develop
-
- 17 12月, 2018 1 次提交
-
-
由 Yancey1989 提交于
-
- 07 12月, 2018 1 次提交
-
-
由 gongweibao 提交于
-
- 27 11月, 2018 1 次提交
-
-
由 gongweibao 提交于
-
- 22 11月, 2018 1 次提交
-
-
由 peizhilin 提交于
-
- 13 11月, 2018 2 次提交
- 06 11月, 2018 1 次提交
-
-
由 sneaxiy 提交于
test=develop
-
- 02 11月, 2018 1 次提交
-
-
由 sneaxiy 提交于
-
- 29 10月, 2018 2 次提交
-
-
由 Wu Yi 提交于
* fix nccl2 lars dist support * put lars in momentum op * add tests lars * fix ci * fix cpu kernel * soft warning * remove lars in test_recognize_digits.py * move to another op * add file * update api.spec test=develop * update test=develop * fix api.spec test=develop * wip * wip, finish grad merge ops * wip, finish graph build * wip test running * work on 1 gpu * workable version * update * fix tests * fuse broadcast op * fix compile failed * refine * add batch merge test mnist * fix CI test=develop * fix build * use independent bn params for batch merge test=develop * update api.spec * follow comments and for test * wip * refine tests test=develop * follow comments test=develop * remove startup bn modify test=develop * follow comments test=develop * fix merge test=develop
-
由 sneaxiy 提交于
-
- 12 10月, 2018 1 次提交
-
-
由 sneaxiy 提交于
-
- 25 9月, 2018 4 次提交
- 20 9月, 2018 1 次提交
-
-
由 chengduo 提交于
* Add Preface * Add demo code * Save file * Refine code * seems can work * use elementwise strategy * Use ElementwiseComputeEx * Add comments * extract functions from operator * Refine code * Follow comment * code refine * add op_fuse pass * add backward * code refine * use TopologySortOperations * follow comments * refine IsFusible * code enhance * fix op_fusion_pass * refine code * refine fuse_elemwise_act_op * adjust the input and output * refine logic * add intermediate_edge * disable inplace * follow comments * refine logic * follow comments * Remove the removable IntermediateOut * change strategy * code refine * enable fuse backward * code refine * code refine * rename unit test * follow comments
-
- 07 8月, 2018 1 次提交
-
-
由 chengduo 提交于
* fix the order of sum * add doc * check whether need to copy * follow comments
-