- 19 9月, 2019 1 次提交
-
-
由 Jie Fang 提交于
Optimize amp for multi-gpu to enable FP16 gradients transfer across gpus
-
- 18 9月, 2019 1 次提交
-
-
由 Bai Yifan 提交于
* support_dispensable_student_loss, test=develop * add distillation test, test=develop * fix distillation test non convergence problem, test=develop * fix test_distillation fail problem, test=develop
-
- 11 9月, 2019 1 次提交
-
-
由 Bai Yifan 提交于
* split teacher checkpoints with student checkpoints, test=develop * add unittest for graph.merge(), test=develop
-
- 10 9月, 2019 1 次提交
-
-
由 gongweibao 提交于
Fix float16 optimizer
-
- 06 9月, 2019 2 次提交
-
-
由 Chen Weihang 提交于
* delete useless raw variables in Conv2D, test=develop * adjust the vars number in test_graph_wrapper to pass unittest, test=develop
-
由 Jie Fang 提交于
init new amp, optimize inserting cast op for batchnorm
-
- 05 9月, 2019 1 次提交
-
-
由 123malin 提交于
* test=develop, communicator merge add => merge average
-
- 03 9月, 2019 2 次提交
-
-
由 whs 提交于
test=develop
-
由 gongweibao 提交于
Change backward_guard to optimize_guard to maximize the allreduce overlap
-
- 31 8月, 2019 1 次提交
-
-
由 Zeng Jinle 提交于
-
- 29 8月, 2019 1 次提交
-
-
由 lidanqing 提交于
test=develop
-
- 26 8月, 2019 2 次提交
-
-
由 Leo Chen 提交于
* fix bug of getting bool Flags from os.environ, test=develop * add empty loss_name in CompiledProgram for inplace grad test, test=develop
-
由 liu zhengxi 提交于
* python inference api supports numpy and add unit test, fix unit test fail in test_slim_int8_googlenet and test_slim_int8_mobilenet
-
- 23 8月, 2019 2 次提交
-
-
由 Jiancheng Li 提交于
* update light_nas_strategy: add latency constraint test=develop * update light_nas_strategy: update get_model_latency test=develop * update light_nas_strategy: add more check test=develop * update light_nas test test=develop * update light_nas test test=develop * minor update light_nas test test=develop * minor update light_nas test test=develop * update light_nas test test=develop * update _constrain_func of light_nas_strategy test=develop * update _constrain_func of light_nas_strategy test=develop * remove unused code test=develop
-
由 Zhen Wang 提交于
* add pool2d quantization support, only for max-pooling. * add the partial quantization support.
-
- 16 8月, 2019 1 次提交
-
-
由 Tao Luo 提交于
* remove unused inference_transpiler unit-tests test=develop * remove InferenceTranspiler usage in quantize_transpiler.py test=develop
-
- 11 8月, 2019 1 次提交
-
-
由 yaoxuefeng 提交于
add save cache model api in fleet& add slots shuffle in dataset module & add metric op to calculate ctr related metrics (#18871) * add ctr related metric layer test=develop * add save cache and slots shuffle test=develop * add save cache and slots shuffle test=develop * fix error * fix error * fix style for ci * fix for comments * change SlotsShuffle input to std::strinf for generality * fix style * fix style * fix style * fix style * fix style * fix style * fix stylr * fix style * fix style * fix style * fix style * fix style * fix style * fix style * fix style * fix style * fix style * fix style * fix style * fix style * change non-const reference to pointer * fix style * fix style * fix style test=develop * fix style test=develop * add return ins num in ctr metric op * change dtype to float in metric_op.py * fix error test=develop * fix style test=develop * fix API spec * fix API spec * fix API spec test=develop * add UT test=develop
-
- 05 8月, 2019 1 次提交
-
-
由 chengduo 提交于
* update exe.run
-
- 01 8月, 2019 1 次提交
-
-
由 whs 提交于
test=develop
-
- 22 7月, 2019 1 次提交
-
-
由 tangwei12 提交于
do some odd jobs, test=develop
-
- 18 7月, 2019 1 次提交
-
-
由 guru4elephant 提交于
* remove ctr reader, all functions are satisfied in dataset
-
- 09 7月, 2019 1 次提交
-
-
由 bingyanghuang 提交于
-
- 28 6月, 2019 2 次提交
-
-
由 Jie Fang 提交于
test=develop
-
由 Wojciech Uss 提交于
test=develop
-
- 27 6月, 2019 1 次提交
-
-
由 翟飞跃 提交于
-
- 26 6月, 2019 1 次提交
-
-
由 whs 提交于
Socket can't be pickled. test=develop
-
- 25 6月, 2019 2 次提交
-
-
由 Hongyu Liu 提交于
* sequnce mask support max length tensor input; test=develop * add rnn_impl.py; test=develop * add basic gru lstm unittest; test=develop * fix api spec; test=develop * fix sequence_mask op bug; test=develop test=document_preview * change +-*x to elmentwise_op; test=develop * add mkl flag; test=develop * fix rnn impl bug; test=develop * update api spec; test=develop * fix doc bug; test=develop * fix lstm bugs; test=develop
-
由 Jie Fang 提交于
test=develop
-
- 21 6月, 2019 1 次提交
-
-
由 songhao 提交于
1. fix the bug that out_put_var in SaveSelectedRows would be empty string 2. use merge_sparse_lookup_table to replace sum op for load_persistables_for_inference 3. fix the bug in _clone_var_in_block_ when the var is SELECTED_ROWS.
-
- 20 6月, 2019 2 次提交
-
-
由 Wojciech Uss 提交于
* Enable MKL-DNN for slim FP32 vs. INT8 tests test=develop * added test for MobileNetV1 with MKL-DNN test=develop
-
由 bingyanghuang 提交于
* QAT doc first version * follow the comments, test=develop * update the benchmark, test=develop * Change to channel-wise, test=develop
-
- 19 6月, 2019 3 次提交
- 18 6月, 2019 2 次提交
-
-
由 翟飞跃 提交于
-
由 bingyanghuang 提交于
-
- 16 6月, 2019 1 次提交
-
-
由 Wojciech Uss 提交于
test=develop
-
- 15 6月, 2019 1 次提交
-
-
由 Sylwester Fraczek 提交于
-
- 13 6月, 2019 2 次提交
-
-
由 chengduo 提交于
* update CPU_NUM config test=develop
-
由 Wojciech Uss 提交于
* added unit test for QAT FP32 & INT8 comparison test=develop * enabled other models and updated filenames test=develop * added accuracy check and multiple batch handling test=develop * removed quantization_mkldnn_pass.py test=develop * cleanup test=develop * updated model paths test=develop * renamed tests without MKL-DNN test=develop * fix reusing mkldnn pool2d primitive test=develop * add performance measuring test=develop * fix accuracy statistics test=develop * removed non-mkldnn tests test=develop * added conv2d_depthwise->conv2d mkldnn transformation test=develop * format update test=develop * fixed creating key for pool2d grad test=develop * added pass * Fix the accuracy issue while using float precision to get the scale. test=develop * Fix the format issue when 'X' is not nchw. test=develop * removed output comparing and changed number of images test=develop * cmake and comment fix test=develop * updated acc threshold for QAT comparison tests test=develop * added OMP_NUM_THREADS setting test=develop * enable all QAT INT8 tests test=develop * restored upstream version of a file test=develop * modified directory names test=develop
-