1. 28 May, 2021 1 commit
    • Strengthen non-TRT conv checks (#33150) · e90f3006
      wenbin committed
      * add more conditions
      
      * dynamic shape
      
      * ut
      
      * correct conditions

      * comment

      * remove redundant op type

      * remove redundant if
  2. 08 May, 2021 1 commit
    • [Paddle-TRT]fix trt-converter-fc_op (#32671) · 62d848de
      Wangzheee committed
      * [Paddle-TRT]fix fc_op
      
      * [Paddle-TRT]fix fc_op
      
      * [Paddle-TRT]fix fc_op
      
      * test_trt_subgraph_pass.py
      
      * fix elementwise_op
      
      * fix elementwise_op
      
      * fix elementwise_op
      
      * fix elementwise_op.cc
      
      * op_teller.cc
  3. 30 Apr, 2021 1 commit
  4. 27 Apr, 2021 1 commit
  5. 25 Apr, 2021 1 commit
  6. 14 Apr, 2021 1 commit
  7. 13 Apr, 2021 1 commit
  8. 02 Apr, 2021 1 commit
    • graph engine (#31226) · 94736d60
      seemingwang committed
      * graph engine demo
      
      * upload unsaved changes
      
      * fix dependency error
      
      * fix shard_num problem
      
      * py client
      
      * remove lock and graph-type
      
      * add load direct graph
      
      * add load direct graph
      
      * add load direct graph
      
      * batch random_sample
      
      * batch_sample_k
      
      * fix num_nodes size
      
      * batch brpc
      
      * batch brpc
      
      * add test
      
      * add test
      
      * add load_nodes; change add_node function
      
      * change sample return type to pair
      
      * resolve conflict
      
      * resolved conflict
      
      * resolved conflict
      
      * separate server and client
      
      * merge pair type
      
      * fix
      
      * resolved conflict
      
      * fixed segment fault; high-level VLOG for load edges and load nodes
      
      * random_sample return 0
      
      * rm useless loop
      
      * test:load edge
      
      * fix ret -1
      
      * test: rm sample
      
      * rm sample
      
      * random_sample return future
      
      * random_sample return int
      
      * test fake node
      
      * fixed here
      
      * memory leak
      
      * remove test code
      
      * fix return problem
      
      * add common_graph_table
      
      * random sample node &test & change data-structure from linkedList to vector
      
      * add common_graph_table
      
      * sample with srand
      
      * add node_types
      
      * optimize nodes sample
      
      * recover test
      
      * random sample
      
      * destruct weighted sampler
      
      * GraphEdgeBlob
      
      * WeightedGraphEdgeBlob to GraphEdgeBlob
      
      * WeightedGraphEdgeBlob to GraphEdgeBlob
      
      * pybind sample nodes api
      
      * pull nodes with step
      
      * fixed pull_graph_list bug; add test for pull_graph_list by step
      
      * add graph table;name
      
      * add graph table;name
      
      * add pybind
      
      * add pybind
      
      * add FeatureNode
      
      * add FeatureNode
      
      * add FeatureNode Serialize
      
      * add FeatureNode Serialize
      
      * get_feat_node
      
      * avoid local rpc
      
      * fix get_node_feat
      
      * fix get_node_feat
      
      * remove log
      
      * get_node_feat return  py:bytes
      
      * merge develop with graph_engine
      
      * fix threadpool.h head
      
      * fix
      
      * fix typo
      
      * resolve conflict
      
      * fix conflict
      
      * recover lost content
      
      * fix pybind of FeatureNode
      
      * recover cmake
      
      * recover tools
      
      * resolve conflict
      
      * resolve linking problem
      
      * code style
      
      * change test_server port
      
      * fix code problems
      
      * remove shard_num config
      
      * remove redundant threads
      
      * optimize start server
      
      * remove logs
      
      * fix code problems by reviewers' suggestions
      Co-authored-by: Huang Zhengjie <270018958@qq.com>
      Co-authored-by: Weiyue Su <weiyue.su@gmail.com>
      Co-authored-by: suweiyue <suweiyue@baidu.com>
      Co-authored-by: luobin06 <luobin06@baidu.com>
      Co-authored-by: liweibin02 <liweibin02@baidu.com>
  9. 01 Apr, 2021 1 commit
    • [Paddle-TRT] add anchor generator op plugin (#31730) · b807e408
      zlsh80826 committed
      * add anchor generator op plugin
      
      * add anchor generator unit_test
      
      * remove dbg info
      
      * remove redundant line
      
      * replace assertion with paddle enforce
      
      * dynamic plugin replaces assertion with paddle enforce
      
      * anchor generator support dynamic shape on spatial axis
      
      * anchor generator test with fp16, dynamic shape
      
      * add anchor generator test all
      
      * add back main
      
      * reduce test input size to not exceed the timelimit of ci
      
      * change super to InferencePassTest for python2 compatibility
      
      * reuse paddle operator anchor generator
      
      * move creator construct to header with default
      
      * add cuda ifdef
      
      * reduce line
      
      * change super to InferencePassTest for python2 compatibility
      
      * fix anchor generator fp16 serialize setting
      
      * split unittest from test_all
      
      * restrict anchor generator input format before version 7234
      
      * anchor generator only support greater than trt7.1
      
      * change min_graph_size to 2
      
      * min_graph size to 3 if dynamic shape
      
      * reduce dynamic shape size to avoid trt search tactic too long to exceed time limit
      
      * remove anchor from fetch list
      
      * anchor generator support all trt version
      
      * fix memory not allocated but if serialized
  10. 30 Mar, 2021 2 commits
  11. 29 Mar, 2021 2 commits
    • [Paddle-TRT] roi_align_plugin (#31732) · e3a38d79
      zlsh80826 committed
      * add roi_align_plugin
      
      * add roi align unit_test
      
      * add roi align serialization
      
      * remove roi align static plugin because of batch dim issue
      
      * refine roi align unittest and add fp16/serialization
      
      * add trt roi align condition to op_teller
      
      * refine error message
      
      * remove unnecessary reshape layer
    • [Paddle-TRT] trt affine channel converter (#31628) · bfb5cf55
      zlsh80826 committed
      * trt affine channel converter
      
      * add trt affine channel base test
      
      * add trt affine channel NHWC
      
      * remove asterisk for python2 compatibility
      
      * trt affine channel converter
      
      * add trt affine channel base test
      
      * add trt affine channel NHWC
      
      * remove asterisk for python2 compatibility
      
      * fix rebase
      
      * move LodTensor to Tensor
      
      * add dbg info
      
      * affine channel converter only support NCHW
      
      * scale,bias are parameters, use create_parameters api
      
      * reduce test input size to not exceed the timelimit of ci
      
      * refine affine channel unittest and add serialization/dynamic test
      
      * change super to InferencePassTest for python2 compatibility
      
      * change super to InferencePassTest for python2 compatibility
      
      * fix affine channel fp16 serialize setting
  12. 26 Mar, 2021 1 commit
    • [Paddle-TRT] multiclass nms (#31742) · 01aa2526
      zlsh80826 committed
      * add multiclass_nms
      
      * add multiclass_nms unittest
      
      * add default enable_tensorrt_oss option
      
      * refine multiclass nms unittest and add serialization/dynamic test
      
      * change super to InferencePassTest for python2 compatibility
      
      * refine multiclass nms unittest
      
      * move out dynamic shape test due to ci timelimit
  13. 22 Mar, 2021 1 commit
    • [Paddle-TRT] nearest_interp op (#31626) · bfced39e
      zlsh80826 committed
      * nearest_interp op converter w/ dynamic/static
      
      * fix data_layout include
      
      * add trt nearest unit_test
      
      * add nearest_interp NHWC test
      
      * update trt nearest interp nhwc testcase
      
      * remove asterisk for python2 compatibility
      
      * add empty line to prevent conflict
      
      * nearest_interp op converter w/ dynamic/static
      
      * fix data_layout include
      
      * add trt nearest unit_test
      
      * add nearest_interp NHWC test
      
      * update trt nearest interp nhwc testcase
      
      * remove asterisk for python2 compatibility
      
      * add empty line to prevent conflict
      
      * change the priority of out_h, out_w
  14. 18 Mar, 2021 1 commit
  15. 12 Mar, 2021 1 commit
  16. 03 Mar, 2021 1 commit
  17. 02 Mar, 2021 1 commit
  18. 25 Feb, 2021 1 commit
  19. 24 Feb, 2021 1 commit
    • [Paddle-TRT] support group_norm (#31040) · 00b09e86
      Pei Yang committed
      * add group norm plugin
      
      * fix compile problems
      
      * move concat axis check to trt op teller
      
      * add nbDims for scale and bias nv dims
      
      * add group norm unit test
      
      * fix unittest
      
      * add trt version restriction for group norm op teller
      
      * fix unittest
  20. 18 Feb, 2021 1 commit
  21. 03 Feb, 2021 1 commit
  22. 02 Feb, 2021 1 commit
  23. 01 Feb, 2021 1 commit
  24. 24 Dec, 2020 1 commit
  25. 08 Dec, 2020 1 commit
  26. 07 Dec, 2020 2 commits
  27. 27 Nov, 2020 1 commit
    • detect tensorRT plugin fp16 in runtime (#27933) · b9e76a01
      Shang Zhizhou committed
      * remove -DSUPPORTS_CUDA_FP16 in cuda.cmake
      
      * compile with cuda9
      
      * add some unittest
      
      * notest;test=coverage
      
      * add unittest for trt plugin swish && split
      
      * update ernie unittest
      
      * fix some error message
      
      * remove repeated judgement of CUDA version in mbEltwiseLayerNormOpConverter
      
      * fix compile error when CUDA_ARCH_NAME < Pascal

      * fix compile error
      
      * update unittest timeout
      
      * compile with cuda9
      
      * update error msg
      
      * fix code style
      
      * add some comments
      
      * add define IF_CUDA_ARCH_SUPPORT_FP16
      
      * rename IF_CUDA_ARCH_SUPPORT_FP16 to CUDA_ARCH_FP16_SUPPORTED
  28. 20 Nov, 2020 1 commit
  29. 16 Nov, 2020 2 commits
  30. 12 Nov, 2020 1 commit
  31. 08 Nov, 2020 1 commit
    • exec ut no more than 15s 1 (#28439) · ba075632
      YUNSHEN XIE committed
      * disable ut test_parallel_executor_fetch_isolated_var,test=document_fix
      
      * test for limiting ut exec time as 15S
      
      * fix an error caused by cannot find ut
      
      * fix some error
      
      * can not find test_transformer
      
      * fix error caused by ut not run in windows
      
      * fix error caused by Compiler Options
      
      * fix error caused by setting timeout value as 15 in python/paddle/tests/CMakeLists.txt
      
      * setting timeout value to 120s for old ut
      
      * add the timeout value setting
      
      * fix error caused by ut only run in coverage_ci
      
      * add analyzer_transformer_profile_tester
      
      * fix some error
      
      * fix some error
      
      * fix error with inference option
      
      * fix error with inference option setting as ON_INFER
      
      * add some ut to set timeout
      
      * modified some option
      
      * fix error
      
      * fix some timeout error
      
      * fix error
      
      * fix error
      
      * fix timeout for test_analyzer_bfloat16_resnet50
      
      * fix error
      
      * setting timeout property for some ut
      
      * first pr for new ut timeout as 15S
  32. 04 Nov, 2020 1 commit
  33. 03 Nov, 2020 1 commit
    • Optimize ERNIE model inference performance in TensorRT and support variable-length input (#28367) · ea851796
      Shang Zhizhou committed
      * fp16 result ok
      
      * change -DWITH_NVINFER_PLUGIN to config.EnableTensorRtOSS
      
      * auto detect special slice op converter for ernie with trt oss
      
      * ernie oss only support fp16
      
      * fix special_slice_plugin serialize bug
      
      * matmul in tensorrt ok
      
      * ernie unittest ok
      
      * add matmul tensorrt unittest
      
      * remove demo code
  34. 26 Oct, 2020 1 commit
  35. 14 Oct, 2020 1 commit
    • Remove and reorganize the alias of APIs (#27717) · d05058d2
      chentianyu03 committed
      * modify cond while_loop to paddle.static.nn.cond
      
      * modify crop_tensor to paddle.crop
      
      * modify Variable to paddle.static.Variable
      
      * remove nn.beam_search, nn.beam_search_decode, nn.gather_tree
      
      * remove bpr_loss, center_loss, rank_loss, smooth_l1, teacher_student_sigmoid_loss, edit_distance, sampled_softmax_with_cross_entropy in nn.functional
      
      * remove apis in nn.functional.learn_rate.py
      
      * remove pool2d, pool3d, adaptive_pool2d, adaptive_pool3d in nn.functional
      
      * remove apis in nn.functional.vision
      
      * remove erf, soft_relu in nn.functional.activation
      
      * remove apis in nn.functional.extension
      
      * remove nn.functional.rnn
      
      * remove hash from nn.functional.lod
      
      * remove row_conv from nn.functional.extension
      
      * remove one_hot, pad2d, pad_constant_like from nn.functional.common
      
      * remove nn.gather_tree, nn.BilinearTensorProduct, nn.Pool2D, nn.Pad2D
      
      * remove apis from optimizer.__init
      
      * remove tensor.creation.fill_constant
      
      * remove elementwise_mul in nn.functional.common and  modify to paddle.multiply
      
      * remove  tensor.stat.reduce_mean
      
      * remove reduce_all, reduce_any in tensor.logic
      
      * remove apis in tensor.math
      
      * remove apis in tensor.__init__
      
      * remove has_inf, has_nan in tensor.search
      
      * remove apis in framework.__init__
      
      * remove apis in paddle.__init__
      
      * remove apis in nn.functional.__init__
      
      * modify removed alias apis to raw api in doc and unittests
      
      * fix remove grid_sample bug
      
      * modify removed alias apis to raw api in doc and unittests
      
      * modify removed alias apis to raw api in doc and unittests
      
      * modify removed alias apis to raw api in doc and unittests
      
      * modify removed alias apis to raw api in doc and unittests
      
      * modify removed alias apis to raw api in doc and unittests
      
      * modify removed alias apis to raw api in doc and unittests
      
      * delete alias api relations in doc
      
      * reserve paddle.compat, paddle.sysconfig
      
      * remove unittest for paddle.reduce_all, paddle.reduce_any
      
      * modify removed alias apis to raw api in doc and unittests
      
      * recover paddle.save and paddle.load
      
      * resolve conflicts
      
      * fix sample code missing paddle.enable_static() bug
      
      * fix sample code missing paddle.enable_static() bug
      
      * fix to_string sample code error
  36. 29 Sep, 2020 1 commit