- 20 3月, 2019 6 次提交
-
-
由 nhzlx 提交于
-
由 nhzlx 提交于
support change input size
-
由 nhzlx 提交于
-
由 chengduo 提交于
* fuse all_reduce test=develop * add fuse_parameter_groups_size test=develop * Polish code test=develop * Fix travis-ci test=develop * Add SetGroupAccordingToLayers and SetGroupAccordingToGroupSize test=develop * Add SetGroupAccordingToMemorySize test=develop * fix multi_devices_graph test=develop * reset params_grads test=develop * Polish code test=develop
-
由 Wu Yi 提交于
* wip allreduce in op * wip * wip * wip * wip adding test * wip for conflict with mp mode * fix tests test=develop * fix cpu build test=develop * fix travis clang format test=develop * fix cpu build test=develop * update api.spec test=develop * delete comment test=develop * fix cpplint test=develop * fix test=develop * follow comment test=develop * add file test=develop * fix build test=develop * update test=develop * to be compatible with sync_bn, and fix mp mode in develop test=develop
-
由 sneaxiy 提交于
test=develop
-
- 19 3月, 2019 4 次提交
-
-
由 luotao1 提交于
test=develop
-
由 Tao Luo 提交于
-
由 Jacek Czaja 提交于
* - Fix to crash of Transformer when mkldnn is to be used Desc: TensorCopy was not setting MKLDNN primitive descriptor when layout was to be kMKLDNN test=develop * - Enable transformer for mkl-dnn test=develo * - Compilation fix test=develop * - Removed manual selection of MKL-DNN ops to be used in Transformer test test=develop
-
由 Wojciech Uss 提交于
* Add cpu_quantize_placement_pass for C-API quantization test=develop * added a comment on required pass attributes test=develop
-
- 18 3月, 2019 4 次提交
-
-
由 minqiyang 提交于
test=develop
-
由 minqiyang 提交于
test=develop
-
由 luotao1 提交于
test=develop
-
由 Wojciech Uss 提交于
* Add cpu_quantize_pass for C-API quantization test=develop * add cpu_quantize_pass test * fix lint: add include memory unorderd_map and unordered_set test=develop * fuse_relu 1 test=develop * tuned 2 without squash * fixes test=develop * remove unused vars test=develop * refactored test=develop * fix lint c-style cast -> C++ style cast test=develop * remove QuantMax and c style casts test=develop * last usage of QuantMax removed test=develop * Fix Analysis Predictor UT Check if memory_optimize_pass has already been added to the analysis config before adding a new one, so that it is not added multiple times. test=develop * change map to unordered_map fix the forgotten part of cpu_quantize_pass_tester.cc test=develop * removed quantized attribute * fixed cpu_quantize_pass_tester and op attr comments test=develop * removed redundant line test=debug * removed gmock test=develop * fix after merge
-
- 16 3月, 2019 1 次提交
-
-
由 qingqing01 提交于
test=develop
-
- 15 3月, 2019 7 次提交
-
-
由 minqiyang 提交于
test=develop
-
由 minqiyang 提交于
test=develop
-
由 minqiyang 提交于
test=develop
-
由 minqiyang 提交于
test=develop
-
由 luotao1 提交于
test=develop
-
由 qingqing01 提交于
* Support Sync Batch Norm. * Note, do not enable it in one device. Usage: build_strategy = fluid.BuildStrategy() build_strategy.sync_batch_norm = True binary = fluid.compiler.CompiledProgram(tp).with_data_parallel( loss_name=loss_mean.name, build_strategy=build_strategy)
-
由 minqiyang 提交于
-
- 14 3月, 2019 2 次提交
-
-
由 liuwei1031 提交于
2. remove lod check in reshape op test=develop
-
由 Wojciech Uss 提交于
* Add cpu_quantize_squash_pass for C-API quantization test=develop * add cpu_quantize_squash_pass teste * fix lint: add include memory unorderd_map and unordered_set test=develop * lint fix 2 * fixes test=develop * refactored test=develop * fix windows ci test=develop
-
- 13 3月, 2019 4 次提交
-
-
由 minqiyang 提交于
-
由 luotao1 提交于
test=develop
-
由 Yan Xu 提交于
* fix broadcast with mp mode * polish code test=develop * fix bcast strategy test=develop * fic cpplint test=develop * fix py3 failed test=develop * fix comment test=develop * update comment test=develop
-
由 baojun 提交于
* remove concast_cast and refactor code test=develop * reduce flag use test=develop
-
- 12 3月, 2019 6 次提交
- 11 3月, 2019 3 次提交
- 08 3月, 2019 3 次提交