- 21 3月, 2022 1 次提交
-
-
由 Allen Guo 提交于
* add more ops * add authors Co-authored-by: NXiaobing Wang <xiaobingw@graphcore.ai> Co-authored-by: NAllen Guo <alleng@graphcore.ai> Co-authored-by: NZhixin Yao <zhixiny@graphcore.ai> Co-authored-by: NZhaorui Chen <zhaoruic@graphcore.ai> Co-authored-by: NHan Zhao <hanzhao@graphcore.ai> * rm ipu_strategy.check() * fix UT fail * fix typo Co-authored-by: NXiaobing Wang <xiaobingw@graphcore.ai> Co-authored-by: NZhixin Yao <zhixiny@graphcore.ai> Co-authored-by: NZhaorui Chen <zhaoruic@graphcore.ai> Co-authored-by: NHan Zhao <hanzhao@graphcore.ai>
-
- 24 2月, 2022 1 次提交
-
-
由 Allen Guo 提交于
* Update IpuStrategy Python Part * add docs * add add_custom_op for ipu_strategy * fix build warning * rm unneeded part * clean api * fix typo * update option names * update IpuStrategy doc
-
- 19 1月, 2022 1 次提交
-
-
由 jianghaicheng 提交于
* ipu_commit_tests p1 * resolve comments * resolve comments * resolve comments * resolve comments * resolve comments * resolve comments * resolve comments * update lint and ipustrategy introduction * update ipu_config * update __init__ of static * update doc * update doc 2 * update doc 3 * update doc 4 * update doc 5 * update doc 5 * update doc 6 * update lint * update lint 2 * update ipustrategy * add IpuStrategy to all * update ipustrategy * update ipu_shard_guard * update ipu_shard_guard 2 Co-authored-by: Nyaozhixin <522190855@qq.com>
-
- 17 1月, 2022 1 次提交
-
-
由 sneaxiy 提交于
* add no reduce mode for pe * add NoReduce ut
-
- 05 3月, 2021 1 次提交
-
-
由 liuyuhui 提交于
[Kunlun]Multi xpu dygraph performance optimization , add distributed.spawn support for multi xpu and some bug-fixes (#31130)
-
- 07 1月, 2021 1 次提交
-
-
由 wangchaochaohu 提交于
-
- 26 12月, 2020 1 次提交
-
-
由 liuyuhui 提交于
-
- 16 12月, 2020 1 次提交
-
-
由 liuyuhui 提交于
-
- 14 10月, 2020 1 次提交
-
-
由 Yiqun Liu 提交于
-
- 29 9月, 2020 1 次提交
-
-
由 Chen Weihang 提交于
* polish Program api doc & example * polish CompiledProgram api doc & example * polish ParallelEnv api doc & examples * polish details, test=document_fix * polish program doc details, test=document_fix * polish details, test=document_fix * fix note format error, test=document_fix * add lost example, test=document_fix * fix lost example, test=document_fix
-
- 14 5月, 2020 1 次提交
-
-
由 Cindy Cai 提交于
* test=develop, test=document_fix * test=develop, test=document_fix Co-authored-by: Nswtkiwi <1208425345@qq.com>
-
- 13 4月, 2020 1 次提交
-
-
由 Chen Weihang 提交于
* api compild program error polish, test=develop * fix coverage problem, test=develop * fix details & add unittests, test=develop * add test for coverage, test=develop
-
- 10 4月, 2020 1 次提交
-
-
由 Chen Weihang 提交于
* fix compiled program sample code error, test=develop, test=document_fix * remove cn code in doc, test=develop, test=document_fix
-
- 10 3月, 2020 1 次提交
-
-
由 WangXi 提交于
-
- 11 1月, 2020 1 次提交
-
-
由 liym27 提交于
* add NotImplementedError for multi optimizers used on multi-places . test=develop * assert error only if num_devices>1. test=develop * set test_optimizer_in_control_flow in CMakeLists for using multi-GPU.test=develop
-
- 10 10月, 2019 2 次提交
- 19 9月, 2019 1 次提交
-
-
由 flame 提交于
-
- 22 8月, 2019 1 次提交
-
-
由 chengduo 提交于
* use PE for compiler test=develop
-
- 09 8月, 2019 2 次提交
- 26 7月, 2019 1 次提交
-
-
由 Zeng Jinle 提交于
* first version memory optimize pass, test=develop * remove move_tensor_sharing_pass, test=develop * refine code comments, add unittests, test=develop * turn off memory_optimize by default, test=develop * follow huihuang's comments, test=develop * follow chengduoZH's comments, test=develop * fix grammar error, add const qualifier, fix pass_test exception message, test=develop * follow chengduoZH's comments 2nd, test=develop
-
- 11 7月, 2019 2 次提交
-
-
由 gongweibao 提交于
-
由 Zeng Jinle 提交于
* feature/buffer_shared_inplace, test=develop * refine code, test=develop * fix elementwise_add op cpu inplace and sum inplace bug, test=develop * add unittest and debug log, test=develop * fix parallel_executor scope bug, polish code, test=develop * fix sum op, activation op, single_in_place_inference bug, test=develop * remove kLocalExecScopeName, test=develop * fix unittest,test=develop * fix out_var first version bug, test=develop * follow comments,test=develop
-
- 27 6月, 2019 1 次提交
-
-
由 chengduo 提交于
* update pe reduce config test=develop * drop the local_exe_scopes of the previous parallel_executor test=develop
-
- 23 6月, 2019 1 次提交
-
-
由 liuwei1031 提交于
-
- 07 6月, 2019 1 次提交
-
-
由 whs 提交于
* Add auto pruning strategy. 1. Fix compressor. 2. Enhence graph executor. 3. Add SAController 4. Add auto pruning strategy. 5. Add unitest for auto pruning strategy. test=develop * Init light-nas * Add light nas. * Some fix. test=develop * Fix sa controller. test=develop * Fix unitest of light nas. test=develop * Fix setup.py.in and API.spec. test=develop * Fix unitest. 1. Fix unitest on windows. 2. Fix package importing in tests directory. * 1. Remove unused comments. 2. Expose eval_epoch option. 3. Remove unused function in search_agent. 4. Expose max_client_num to yaml file. 5. Move flops constraint to on_epoch_begin function test=develop * Fix light nas strategy. test=develop * Make controller server stable. test=develop * 1. Add try exception to compressor. 2. Remove unitest of light-nas for windows. test=develop * Add comments Enhence controller test=develop * Fix comments. test=develop
-
- 27 5月, 2019 1 次提交
-
-
由 gongweibao 提交于
-
- 08 5月, 2019 1 次提交
-
-
由 chengduo 提交于
* polish doc test=develop * updata parallel executor doc test=develop * update API.spec test=develop * polish code test=develop
-
- 21 4月, 2019 1 次提交
-
-
由 Zeng Jinle 提交于
* speedup gc and inplace softmax_with_cross_entropy_grad test=develop * refine models gpu mem Merge skip vars and warning messages of mem opt remove relu mem opt test=develop * follow comments test=develop
-
- 20 3月, 2019 1 次提交
-
-
由 sneaxiy 提交于
test=develop
-
- 19 3月, 2019 1 次提交
-
-
由 sneaxiy 提交于
test=develop
-
- 15 3月, 2019 2 次提交
-
-
由 sneaxiy 提交于
test=develop
-
由 qingqing01 提交于
* Support Sync Batch Norm. * Note, do not enable it in one device. Usage: build_strategy = fluid.BuildStrategy() build_strategy.sync_batch_norm = True binary = fluid.compiler.CompiledProgram(tp).with_data_parallel( loss_name=loss_mean.name, build_strategy=build_strategy)
-
- 13 3月, 2019 1 次提交
-
-
由 Yan Xu 提交于
* fix broadcast with mp mode * polish code test=develop * fix bcast strategy test=develop * fic cpplint test=develop * fix py3 failed test=develop * fix comment test=develop * update comment test=develop
-
- 12 3月, 2019 1 次提交
-
-
由 Zhen Wang 提交于
-
- 09 3月, 2019 1 次提交
-
-
由 chengduo 提交于
test=develop
-
- 06 3月, 2019 1 次提交
-
-
由 liuwei1031 提交于
* add ir memory optimize test case for IfElse op, test=develop * fix some unitttest failure by force using the python memory_optimize, test=develop * tweak comments, test=develop * fix unittest, test=develop * fix unittest, test=develop
-
- 05 3月, 2019 1 次提交
-
-
由 liuwei1031 提交于
* add ir memory optimize test case for IfElse op, test=develop * fix some unitttest failure by force using the python memory_optimize, test=develop * tweak comments, test=develop * fix unittest, test=develop * fix unittest, test=develop
-
- 04 3月, 2019 1 次提交
-
-
由 chengduo 提交于
* Unified ParallelExecutor and Compiler
-