- 03 9月, 2019 1 次提交
-
-
由 Yiqun Liu 提交于
* Add a interface to enable cudnn for inference. * Add cudnn_placement_pass. test=develop * Set the default value of cudnn_enabled_op_types to null. test=develop * Write the common basic class, placement_pass_base, to refine the codes. test=develop * Call EnableCUDNN in unittest. test=develop * Refine cudnn_placement_pass tester. * Enable the testing of cudnn_placement_pass in inference's unittest. test=develop * Add the check of op kernels. test=develop
-
- 19 8月, 2019 1 次提交
-
-
由 Zhaolong Xing 提交于
* fix mask rcnn bug: 1. affine channel fuse (diff) 2. condition block op (memory leak) 3. merge lod tensor op (diff) 4. memroy optim (diff) test=develop * fix ci aboud PADDLE_ENFOCE fix merge lod infer op ut test=develop
-
- 31 7月, 2019 1 次提交
-
-
由 Zhaolong Xing 提交于
* Fix Mask rcnn predictor 1. refine memory optim algorithm to support the model with the block op. 2. output diff : modify the affine channel fuse 3. add condition_block_infer op add interface for setting trt calib table dir test=develop * add the missing files. test=develop * 1 add trt fp16 support test=develop
-
- 11 7月, 2019 1 次提交
-
-
由 Tao Luo 提交于
* add config.SetMkldnnCacheCapacity api for mkldnn cache clear strategy test=develop * enhance MkldnnPostReset test=develop * add comments for mkldnn_cache_capacity field test=develop
-
- 08 7月, 2019 1 次提交
-
-
由 Zhaolong Xing 提交于
* Fix Mask rcnn predictor 1. refine memory optim algorithm to support the model with the block op. 2. output diff : modify the affine channel fuse 3. add condition_block_infer op add interface for setting trt calib table dir test=develop * add the missing files. test=develop
-
- 16 6月, 2019 1 次提交
-
-
由 Wojciech Uss 提交于
* reuse C-API INT8 unit test application test=develop * updates after review test=develop
-
- 11 6月, 2019 2 次提交
-
-
由 石晓伟 提交于
* update anakin-engine interfaces for content-dnn test=develop * support only-gpu mode of Anakin modify eltwise parse test=develop * modification for thread-safe test=develop * Integrated template instance test=develop * increase template parameters test=develop * support MLU predictor test=develop * update anakin cmake files test=develop * update TargetWrapper::set_device * update the initialization of anakin subgraph test=develop * use the default constructor of base class test=develop
-
由 Zhaolong Xing 提交于
* fix: when use the load model from memory mode, the RAM occupy is high test=develop * ligth mem reuse test=develop * fix cpplint test=develop
-
- 06 6月, 2019 1 次提交
-
-
由 Zhaolong Xing 提交于
test=develop
-
- 29 5月, 2019 1 次提交
-
-
由 mozga-intel 提交于
-
- 25 5月, 2019 1 次提交
-
-
由 Zhaolong Xing 提交于
* fluid int8 train and trt int8 predict align. trt int8 predict init op converter * 2. align fluid int8 train and trt int8 inference. enhance quant dequant fuse pass enhance op converter, trt engine, trt engine op, trt subgraph pass. * 3. add delete_quant_dequant_pass for trt test=develop * 4. add the missing file test=develop * 5. i modify the c++ interface, but forget to modify the pybind code fix the IS_TRT_VERSION_GE bug, and fix elementwise op converter test=develop
-
- 16 5月, 2019 1 次提交
-
-
由 Zhen Wang 提交于
* add set_not_owned function for graph * add scope set. test=develop * add scope_ptr enforce not null before setting.test=develop
-
- 09 5月, 2019 1 次提交
-
-
由 Zhaolong Xing 提交于
1. infernce multi card occupy 2. facebox model inference occupy too much test=develop
-
- 07 5月, 2019 1 次提交
-
-
由 石晓伟 提交于
* cherry-pick commit from 88770542 * cherry-pick commit from 3f0b97df * cherry-pick from 16691:Anakin subgraph support yolo_v3 and faster-rcnn (cherry picked from commit 8643dbc2) * Cherry-Pick from 16662 : Anakin subgraph cpu support (cherry picked from commit 7ad182e1) * Cherry-pick from 1662, 16797.. : add anakin int8 support (cherry picked from commit e14ab180) * Cherry-pick from 16813 : change singleton to graph RegistBlock test=release/1.4 (cherry picked from commit 4b9fa423) * Cherry Pick : 16837 Support ShuffleNet and MobileNet-v2 Support ShuffleNet and MobileNet-v2, test=release/1.4 (cherry picked from commit a6fb066f) * Cherry-pick : anakin subgraph add opt config layout argument #16846 test=release/1.4 (cherry picked from commit 8121b3ec) * 1. add shuffle_channel_detect (cherry picked from commit 6efdea89) * update shuffle_channel op convert, test=release/1.4 (cherry picked from commit e4726a06) * Modify symbol export rules test=develop
-
- 29 3月, 2019 2 次提交
-
-
由 Shixiaowei02 提交于
-
由 Shixiaowei02 提交于
-
- 28 3月, 2019 2 次提交
-
-
由 nhzlx 提交于
refine trt first run add quant dequant fuse pass omit simplify_anakin_priorbox_detection template omit transpose_flatten_concat_fuse template test=develop
-
由 chengduo 提交于
* modify the interface of Pass::Allay test=develop * Polish code test=develop * Fix Travis CI test=develop * fix Pass::Apply interface test=develop * Fix Travis CI test=develop
-
- 25 3月, 2019 1 次提交
-
-
由 Wojciech Uss 提交于
test=develop
-
- 22 3月, 2019 1 次提交
-
-
由 nhzlx 提交于
2. refine trt code test=develop
-
- 21 3月, 2019 1 次提交
-
-
由 Wojciech Uss 提交于
* Add enabling quantization test=develop * remove unused (here) function
-
- 20 3月, 2019 6 次提交
- 19 3月, 2019 1 次提交
-
-
由 zhhsplendid 提交于
test=develop
-
- 18 3月, 2019 1 次提交
-
-
由 Wojciech Uss 提交于
* Add cpu_quantize_pass for C-API quantization test=develop * add cpu_quantize_pass test * fix lint: add include memory unorderd_map and unordered_set test=develop * fuse_relu 1 test=develop * tuned 2 without squash * fixes test=develop * remove unused vars test=develop * refactored test=develop * fix lint c-style cast -> C++ style cast test=develop * remove QuantMax and c style casts test=develop * last usage of QuantMax removed test=develop * Fix Analysis Predictor UT Check if memory_optimize_pass has already been added to the analysis config before adding a new one, so that it is not added multiple times. test=develop * change map to unordered_map fix the forgotten part of cpu_quantize_pass_tester.cc test=develop * removed quantized attribute * fixed cpu_quantize_pass_tester and op attr comments test=develop * removed redundant line test=debug * removed gmock test=develop * fix after merge
-
- 08 3月, 2019 7 次提交
-
-
由 nhzlx 提交于
add if use static engine for trt test=develop
-
由 nhzlx 提交于
test=develop
-
由 nhzlx 提交于
test=develop
-
由 nhzlx 提交于
1). add static trt load model 2). fix bug: when device_id is not 0, the trt will have a bug test=develop
-
由 nhzlx 提交于
add simple static mode loading test=develop
-
由 nhzlx 提交于
test=develop
-
由 nhzlx 提交于
1. bind trt input and output to fluid tensors
-
- 07 3月, 2019 1 次提交
-
-
由 nhzlx 提交于
add if use static engine for trt test=develop
-
- 27 2月, 2019 1 次提交
-
-
由 nhzlx 提交于
test=develop
-
- 26 2月, 2019 1 次提交
-
-
由 nhzlx 提交于
test=develop
-
- 22 2月, 2019 1 次提交
-
-
由 nhzlx 提交于
1). add static trt load model 2). fix bug: when device_id is not 0, the trt will have a bug test=develop
-