- 22 Mar, 2019 2 commits
-
-
Committed by xiaolil1
* Enable INT8 Concat Kernel to improve the performance of MobileNet-SSD. test=develop
* Optimize UT format. test=develop
* Fix UT file address issue. test=develop
* Refine the license year. test=develop
* Optimize code for new API. test=develop
* Restructure INT8 Concat kernel. test=develop
-
Committed by chengduo
* refine fused_all_reduce_op
* add unit test in test_parallel_executor_seresnext test=develop
-
- 21 Mar, 2019 7 commits
-
-
Committed by phlrain
-
Committed by lujun
-
Committed by qingqing01
* Rewrite the gradient ProtoMaker for affine_channel_op so that Output is no longer used as an input.
* Add act to the Python API so that the activation can run in-place via layer_help.py
-
Committed by phlrain
-
Committed by phlrain
-
Committed by Zhen Wang
-
Committed by Wu Yi
* add resnet nccl2 test=develop
* test dist train test=develop
* update test=develop
* increase timeout test=develop
* test on CI env test=develop
-
- 20 Mar, 2019 5 commits
-
-
Committed by flame
* Fix a bug in saving the inference model; return the target vars' name list
-
Committed by baojun
* Add softmax_with_cross_entropy_op test=develop
* simplify implementation test=develop
-
Committed by chengduo
* fuse all_reduce test=develop
* add fuse_parameter_groups_size test=develop
* Polish code test=develop
* Fix travis-ci test=develop
* Add SetGroupAccordingToLayers and SetGroupAccordingToGroupSize test=develop
* Add SetGroupAccordingToMemorySize test=develop
* fix multi_devices_graph test=develop
* reset params_grads test=develop
* Polish code test=develop
-
Committed by Wu Yi
* wip allreduce in op
* wip
* wip
* wip
* wip adding test
* wip for conflict with mp mode
* fix tests test=develop
* fix cpu build test=develop
* fix travis clang format test=develop
* fix cpu build test=develop
* update api.spec test=develop
* delete comment test=develop
* fix cpplint test=develop
* fix test=develop
* follow comment test=develop
* add file test=develop
* fix build test=develop
* update test=develop
* to be compatible with sync_bn, and fix mp mode in develop test=develop
-
Committed by lujun
-
- 19 Mar, 2019 5 commits
-
-
Committed by Zhen Wang
-
Committed by whs
* Make step_input support custom lod level. test=develop
* Fix API.spec test=develop
* Fix API.spec. test=develop
* Fix API.spec test=develop
* Add default value in document of step_input. test=develop
* Fix document. test=develop
* Fix API.spec test=develop
-
Committed by ceci3
-
Committed by Zeng Jinle
test=develop
-
Committed by Yibing Liu
test=develop
-
- 18 Mar, 2019 3 commits
-
-
Committed by xiaolil1
* Enable INT8 transpose kernel for MobileNet-SSD improvement. test=develop
* Refine the license year. test=develop
* Delete redundant code. test=develop
* Add axis check. test=develop
-
Committed by tangwei12
* update load persistables for increment, test=develop
* update load persistables for increment, test=develop
* update API Spec, test=develop
* update API Spec, test=develop
* add doc, test=develop
* add doc, test=develop
* Update lookup_table_utils.py
* Update API.spec
* Update lookup_table_utils.py test=develop
* Update API.spec test=develop
* fix api spec
* Update lookup_table_utils.py test=develop
-
Committed by Xin Pan
test=develop
-
- 15 Mar, 2019 8 commits
-
-
Committed by Xin Pan
test=develop
-
Committed by Xin Pan
test=develop
-
Committed by 视言
* Add moving average absmax op in quantization-aware training.
-
Committed by Aurelius84
-
Committed by ceci3
-
Committed by qingqing01
* Support Sync Batch Norm.
* Note: do not enable it on a single device.
Usage:
    build_strategy = fluid.BuildStrategy()
    build_strategy.sync_batch_norm = True
    binary = fluid.compiler.CompiledProgram(tp).with_data_parallel(
        loss_name=loss_mean.name, build_strategy=build_strategy)
-
Committed by Yibing Liu
* Impl fp16 compute kernel for slice_op test=develop
* Use data() to replace mutable_data()
-
Committed by Aurelius84
-
- 14 Mar, 2019 6 commits
- 13 Mar, 2019 3 commits
-
-
Committed by minqiyang
-
Committed by Yan Xu
* fix broadcast with mp mode
* polish code test=develop
* fix bcast strategy test=develop
* fix cpplint test=develop
* fix py3 failed test=develop
* fix comment test=develop
* update comment test=develop
-
Committed by baojun
* remove concast_cast and refactor code test=develop
* reduce flag use test=develop
-
- 12 Mar, 2019 1 commit
-
-
Committed by dengkaipeng
-