- 06 Sep, 2021 2 commits
-
-
Committed by joanna.wozna.intel
* Add fusion_lstm INT8 PTQ * Correct mkldnn_cache_capacity and enable fc_lstm_fuse_pass only for this test * Change mkldnn_cache_capacity
-
Committed by Wilber
-
- 04 Sep, 2021 1 commit
-
-
Committed by Wilber
-
- 31 Aug, 2021 1 commit
-
-
Committed by Shang Zhizhou
* Revert "Revert "Add copy from tensor (#34406)" (#35173)" This reverts commit 32c1ec42. * add template instantiation
-
- 27 Aug, 2021 2 commits
-
-
Committed by joanna.wozna.intel
* Add calculation for gru op * Correct the types * Remove mkldnn only * Correct mkldnn ifdef * Remove mkldnn ifdef * Separate mkldnn quantizer test * Correct Windows test * Check different cmake fix * Revert cmake change * Cmake change 2 * Cmake change 3
-
Committed by zhangchunle
This reverts commit ac33c0ca.
-
- 26 Aug, 2021 1 commit
-
-
Committed by Shang Zhizhou
* add api * temp save * revert * copytocpu async ok * fix style * copy sync ok * fix compile error * fix compile error * api done * update python async api * fix compile * remove async python api; add c++ async unittest * remove python async api * update unittest * update unittest * add C++ unittest for copytensor * add unittest * update namespace utils to class TensorUtils * add unittest * update unittest * update unittest * update code style * update code style * update unittest
-
- 18 Aug, 2021 1 commit
-
-
Committed by Peihan
-
- 12 Aug, 2021 1 commit
-
-
Committed by Wilber
-
- 06 Aug, 2021 1 commit
-
-
Committed by houj04
-
- 05 Aug, 2021 1 commit
-
-
Committed by houj04
* add NPU support for zero_copy_tensor. * revert unnecessary codes. * revert unnecessary codes.
-
- 29 Jul, 2021 1 commit
-
-
Committed by wenbin
* tile op * more uts * disable tile if trt6.0 * typo * fix timeout issue * opteller * opteller remove duplicate code * comments. test=document_fix * modify PADDLE_ENFORCE. * fix reduce_mean issue
-
- 21 Jul, 2021 1 commit
-
-
Committed by wenbin
* reduce_mean supported. test=allcase * ut. test=allcase * test=develop * ut.test=allcase * correct name. test=allcase * correct UT. test=allcase * correct UT.test=develop * remove op * UT * add convert * fix timeout issue * more uts * more ut * correct ut
-
- 19 Jul, 2021 1 commit
-
-
Committed by Wilber
-
- 14 Jul, 2021 1 commit
-
-
Committed by Wilber
-
- 23 Jun, 2021 1 commit
-
-
Committed by Wilber
-
- 21 Jun, 2021 1 commit
-
-
Committed by Pei Yang
-
- 18 Jun, 2021 1 commit
-
-
Committed by Zhou Wei
-
- 17 Jun, 2021 1 commit
-
-
Committed by Wilber
-
- 11 Jun, 2021 2 commits
-
-
Committed by Jacek Czaja
* - Second fix - fix * - fix
-
Committed by wenbin
-
- 05 Jun, 2021 1 commit
-
-
Committed by Wilber
-
- 04 Jun, 2021 1 commit
-
-
Committed by wenbin
* fix inference prepare data bug * rename functions * typo * typo * typo * UT correct * correct condition * correct condition * ci coverage * morelines * fix ci coverage
-
- 01 Jun, 2021 1 commit
-
-
Committed by Wangzheee
* add trt convert op: reshape * reshape * fix * input: Shape,ShapeTensor * Add some comments
-
- 25 May, 2021 1 commit
-
-
Committed by jakpiase
-
- 19 May, 2021 2 commits
-
-
Committed by wuhuanzhou
* CI skip inference test if only python files modified, test=develop * fix compilation error on ROCM, test=develop * fix cmake error on PR-CI-ROCM-Compile, test=develop
-
Committed by 石晓伟
-
- 29 Apr, 2021 1 commit
-
-
Committed by Wilber
-
- 28 Apr, 2021 2 commits
-
-
Committed by denglin-github
* Add dlnne engine runtime * Fix log * Remove <const_cast> and remove unrelated modify with dlnne, +clang-format * Fix CMakeList format error * Add copyright message * Fix dlnne CMakeList.txt * Add some paddlepaddle_pass to support more networks * Fix some format bug * Add delete dropout_op pass * Fix some format bug * Fix format bug
-
Committed by Jacek Czaja
* - Added clearing oneDNN per executor * - Executor does not always have FLAGS_use_mkldnn set to true
-
- 26 Apr, 2021 1 commit
-
-
Committed by 石晓伟
-
- 25 Apr, 2021 2 commits
-
-
Committed by Wilber
-
Committed by denglin-github
* Add dlnne engine runtime * Fix log * Remove <const_cast> and remove unrelated modify with dlnne, +clang-format * Fix CMakeList format error * Add copyright message * Fix dlnne CMakeList.txt * Add some paddlepaddle_pass to support more networks * Fix some format bug
-
- 02 Apr, 2021 1 commit
-
-
Committed by seemingwang
* graph engine demo * upload unsaved changes * fix dependency error * fix shard_num problem * py client * remove lock and graph-type * add load direct graph * add load direct graph * add load direct graph * batch random_sample * batch_sample_k * fix num_nodes size * batch brpc * batch brpc * add test * add test * add load_nodes; change add_node function * change sample return type to pair * resolve conflict * resolved conflict * resolved conflict * separate server and client * merge pair type * fix * resolved conflict * fixed segment fault; high-level VLOG for load edges and load nodes * random_sample return 0 * rm useless loop * test:load edge * fix ret -1 * test: rm sample * rm sample * random_sample return future * random_sample return int * test fake node * fixed here * memory leak * remove test code * fix return problem * add common_graph_table * random sample node &test & change data-structure from linkedList to vector * add common_graph_table * sample with srand * add node_types * optimize nodes sample * recover test * random sample * destruct weighted sampler * GraphEdgeBlob * WeightedGraphEdgeBlob to GraphEdgeBlob * WeightedGraphEdgeBlob to GraphEdgeBlob * pybind sample nodes api * pull nodes with step * fixed pull_graph_list bug; add test for pull_graph_list by step * add graph table;name * add graph table;name * add pybind * add pybind * add FeatureNode * add FeatureNode * add FeatureNode Serialize * add FeatureNode Serialize * get_feat_node * avoid local rpc * fix get_node_feat * fix get_node_feat * remove log * get_node_feat return py:bytes * merge develop with graph_engine * fix threadpool.h head * fix * fix typo * resolve conflict * fix conflict * recover lost content * fix pybind of FeatureNode * recover cmake * recover tools * resolve conflict * resolve linking problem * code style * change test_server port * fix code problems * remove shard_num config * remove redundant threads * optimize start server * remove logs * fix code problems by reviewers' suggestions
Co-authored-by: Huang Zhengjie <270018958@qq.com> Co-authored-by: Weiyue Su <weiyue.su@gmail.com> Co-authored-by: suweiyue <suweiyue@baidu.com> Co-authored-by: luobin06 <luobin06@baidu.com> Co-authored-by: liweibin02 <liweibin02@baidu.com>
-
- 01 Apr, 2021 1 commit
-
-
Committed by zlsh80826
* add anchor generator op plugin * add anchor generator unit_test * remove dbg info * remove redundant line * replace assertion with paddle enforce * dynamic plugin replaces assertion with paddle enforce * anchor generator support dynamic shape on spatial axis * anchor generator test with fp16, dynamic shape * add anchor generator test all * add back main * reduce test input size to not exceed the timelimit of ci * change super to InferencePassTest for python2 compatibility * reuse paddle operator anchor generator * move creator construct to header with default * add cuda ifdef * reduce line * change super to InferencePassTest for python2 compatibility * fix anchor generator fp16 serialize setting * split unittest from test_all * restrict anchor generator input format before version 7234 * anchor generator only support greater than trt7.1 * change min_graph_size to 2 * min_graph size to 3 if dynamic shape * reduce dynamic shape size to avoid trt search tactic too long to exceed time limit * remove anchor from fetch list * anchor generator support all trt version * fix memory not allocated but if serialized
-
- 31 Mar, 2021 1 commit
-
-
Committed by Zhou Wei
* [Parallel UT]improve Parallel UT level on Windows/Linux * [Parallel UT]improve Parallel UT level on Windows/Linux * [Parallel UT]Improve Parallel UT level on Windows/Linux * [Parallel UT]Improve Parallel UT level on Windows/Linux * fix CI
-
- 30 Mar, 2021 2 commits
- 29 Mar, 2021 2 commits
-
-
Committed by zlsh80826
* add roi_align_plugin * add roi align unit_test * add roi align serialization * remove roi align static plugin because of batch dim issue * refine roi align unittest and add fp16/serialization * add trt roi align condition to op_teller * refine error message * remove unnecessary reshape layer
-
Committed by zlsh80826
* trt affine channel converter * add trt affine channel base test * add trt affine channel NHWC * remove asterisk for python2 compatibility * trt affine channel converter * add trt affine channel base test * add trt affine channel NHWC * remove asterisk for python2 compatibility * fix rebase * move LodTensor to Tensor * add dbg info * affine channel converter only support NCHW * scale,bias are parameters, use create_parameters api * reduce test input size to not exceed the timelimit of ci * refine affine channel unittest and add serialization/dynamic test * change super to InferencePassTest for python2 compatibility * change super to InferencePassTest for python2 compatibility * fix affine channel fp16 serialize setting
-