- 08 7月, 2019 1 次提交
-
-
由 Zhaolong Xing 提交于
fix mask rcnn add interface for setting optim_cache_dir(eg: when in trt int8 mode, and load model from memory, there should be a interface for setting the trt calibration table data dir) test=release/1.5
-
- 28 6月, 2019 1 次提交
-
-
由 石晓伟 提交于
* Update the Anakin interfaces for content-dnn and MLU (#17890) * update anakin-engine interfaces for content-dnn test=develop * support only-gpu mode of Anakin modify eltwise parse test=develop * modification for thread-safe test=develop * Integrated template instance test=develop * increase template parameters test=develop * support MLU predictor test=develop * update anakin cmake files test=develop * update TargetWrapper::set_device * update the initialization of anakin subgraph test=develop * use the default constructor of base class test=develop * modify the access level of anakin engine (#18015) test=develop * fix ci test cmake test=develop
-
- 18 6月, 2019 1 次提交
-
-
由 Wojciech Uss 提交于
Cherry pick #18077 and #18111 unify FP32 vs. INT8 comparison tests output, reuse C-API INT8 unit test application (#18145) * unify FP32 vs. INT8 comparison tests output (#18111) test=release/1.5 * reuse C-API INT8 unit test application (#18077) test=release/1.5
-
- 12 6月, 2019 1 次提交
-
-
由 Zhaolong Xing 提交于
* fix: when use the load model from memory mode, the RAM occupy is high test=release/1.5
-
- 06 6月, 2019 1 次提交
-
-
由 Zhaolong Xing 提交于
test=develop
-
- 29 5月, 2019 1 次提交
-
-
由 mozga-intel 提交于
-
- 25 5月, 2019 1 次提交
-
-
由 Zhaolong Xing 提交于
* fluid int8 train and trt int8 predict align. trt int8 predict init op converter * 2. align fluid int8 train and trt int8 inference. enhance quant dequant fuse pass enhance op converter, trt engine, trt engine op, trt subgraph pass. * 3. add delete_quant_dequant_pass for trt test=develop * 4. add the missing file test=develop * 5. i modify the c++ interface, but forget to modify the pybind code fix the IS_TRT_VERSION_GE bug, and fix elementwise op converter test=develop
-
- 16 5月, 2019 1 次提交
-
-
由 Zhen Wang 提交于
* add set_not_owned function for graph * add scope set. test=develop * add scope_ptr enforce not null before setting.test=develop
-
- 09 5月, 2019 1 次提交
-
-
由 Zhaolong Xing 提交于
1. infernce multi card occupy 2. facebox model inference occupy too much test=develop
-
- 07 5月, 2019 1 次提交
-
-
由 石晓伟 提交于
* cherry-pick commit from 88770542 * cherry-pick commit from 3f0b97df * cherry-pick from 16691:Anakin subgraph support yolo_v3 and faster-rcnn (cherry picked from commit 8643dbc2) * Cherry-Pick from 16662 : Anakin subgraph cpu support (cherry picked from commit 7ad182e1) * Cherry-pick from 1662, 16797.. : add anakin int8 support (cherry picked from commit e14ab180) * Cherry-pick from 16813 : change singleton to graph RegistBlock test=release/1.4 (cherry picked from commit 4b9fa423) * Cherry Pick : 16837 Support ShuffleNet and MobileNet-v2 Support ShuffleNet and MobileNet-v2, test=release/1.4 (cherry picked from commit a6fb066f) * Cherry-pick : anakin subgraph add opt config layout argument #16846 test=release/1.4 (cherry picked from commit 8121b3ec) * 1. add shuffle_channel_detect (cherry picked from commit 6efdea89) * update shuffle_channel op convert, test=release/1.4 (cherry picked from commit e4726a06) * Modify symbol export rules test=develop
-
- 29 3月, 2019 2 次提交
-
-
由 Shixiaowei02 提交于
-
由 Shixiaowei02 提交于
-
- 28 3月, 2019 2 次提交
-
-
由 nhzlx 提交于
refine trt first run add quant dequant fuse pass omit simplify_anakin_priorbox_detection template omit transpose_flatten_concat_fuse template test=develop
-
由 chengduo 提交于
* modify the interface of Pass::Allay test=develop * Polish code test=develop * Fix Travis CI test=develop * fix Pass::Apply interface test=develop * Fix Travis CI test=develop
-
- 25 3月, 2019 1 次提交
-
-
由 Wojciech Uss 提交于
test=develop
-
- 22 3月, 2019 1 次提交
-
-
由 nhzlx 提交于
2. refine trt code test=develop
-
- 21 3月, 2019 1 次提交
-
-
由 Wojciech Uss 提交于
* Add enabling quantization test=develop * remove unused (here) function
-
- 20 3月, 2019 6 次提交
- 19 3月, 2019 1 次提交
-
-
由 zhhsplendid 提交于
test=develop
-
- 18 3月, 2019 1 次提交
-
-
由 Wojciech Uss 提交于
* Add cpu_quantize_pass for C-API quantization test=develop * add cpu_quantize_pass test * fix lint: add include memory unorderd_map and unordered_set test=develop * fuse_relu 1 test=develop * tuned 2 without squash * fixes test=develop * remove unused vars test=develop * refactored test=develop * fix lint c-style cast -> C++ style cast test=develop * remove QuantMax and c style casts test=develop * last usage of QuantMax removed test=develop * Fix Analysis Predictor UT Check if memory_optimize_pass has already been added to the analysis config before adding a new one, so that it is not added multiple times. test=develop * change map to unordered_map fix the forgotten part of cpu_quantize_pass_tester.cc test=develop * removed quantized attribute * fixed cpu_quantize_pass_tester and op attr comments test=develop * removed redundant line test=debug * removed gmock test=develop * fix after merge
-
- 08 3月, 2019 7 次提交
-
-
由 nhzlx 提交于
add if use static engine for trt test=develop
-
由 nhzlx 提交于
test=develop
-
由 nhzlx 提交于
test=develop
-
由 nhzlx 提交于
1). add static trt load model 2). fix bug: when device_id is not 0, the trt will have a bug test=develop
-
由 nhzlx 提交于
add simple static mode loading test=develop
-
由 nhzlx 提交于
test=develop
-
由 nhzlx 提交于
1. bind trt input and output to fluid tensors
-
- 07 3月, 2019 1 次提交
-
-
由 nhzlx 提交于
add if use static engine for trt test=develop
-
- 27 2月, 2019 1 次提交
-
-
由 nhzlx 提交于
test=develop
-
- 26 2月, 2019 1 次提交
-
-
由 nhzlx 提交于
test=develop
-
- 22 2月, 2019 1 次提交
-
-
由 nhzlx 提交于
1). add static trt load model 2). fix bug: when device_id is not 0, the trt will have a bug test=develop
-
- 18 2月, 2019 1 次提交
-
-
由 nhzlx 提交于
add simple static mode loading test=develop
-
- 15 2月, 2019 1 次提交
-
-
由 nhzlx 提交于
test=develop
-
- 14 2月, 2019 2 次提交
-
-
由 Wojciech Uss 提交于
This also fixes linking to libpaddle_fluid.so built in debug mode test=develop
-
由 nhzlx 提交于
1. bind trt input and output to fluid tensors
-