- 15 2月, 2022 1 次提交
-
-
由 Aurelius84 提交于
* #1 migrate dist-related type()-> dtype() * move datatype function from pten -> fluid/framework * change type() in imperative into convert(dtype()) * modify xx_tensor->type into xx_tensor->dtype * change the set_type interface and the caller * modify xx_tensor.type into xx_tensor.dtype * fix mutable_data(place, dtype()) * change caller of mutable_data in pten and distributed * change the caller of mutable_data in fluid/framework * change the caller of mutable_data in imperative directory * mutable_data: inference * update the call of mutable_data * transfer MakePenScalarArray MakePtenScalar ResetHolderWithType * pass the compile. the next step is remove VarType in Pten * fix all and remove VarType from pten. success in linux. Next task is other platform * fix conflict with develop * fix compiled error * Fix reset conversion * fix conflict * fix compiled problem * fix typo * Fix << in tensor_utils.cc * fix type->dtype * fix unittest * fix tensor init constructor * fix DataTypeSize for BFloat16 * fix code style * fix npu compiled error * fix npu * compile npu sucessfully * fix conflict * fix conflict Co-authored-by: Nxiongkun <xiongkun03@baidu.com>
-
- 11 2月, 2022 1 次提交
-
-
由 Leo Chen 提交于
-
- 26 1月, 2022 1 次提交
-
-
由 baoachun 提交于
* support npu weight unified H2D copy * remove redundant variable
-
- 17 1月, 2022 1 次提交
-
-
由 Wilber 提交于
* add pten::Place data structure. * update ci problem * fix ci problem * update * using platform::Place=pten::Place * remove BOOST_GET_CONST for CPUPlace and GPUPlace * compile pass 25%. * compile pass 45% * compile pass 60% * remove boost_get for xpu npu mlu and ipu * compile pass on cpu and gpu. * fix compile problem * fix compile error. * update * fix ci problem * update * ci approve * fix ci problem * fix ci eager test problem * remove BOOST_GET_CONST * fix npu compile
-
- 14 1月, 2022 1 次提交
-
-
由 heliqi 提交于
* add trt_convert_flatten_contiguous_rang op * trt version >7,support trt_convert_flatten_contiguous_rang * trt version >7,support trt_convert_flatten_contiguous_rang * trt version >7,support trt_convert_flatten_contiguous_rang * test cast add trt version >=7 skip
-
- 13 1月, 2022 1 次提交
-
-
由 Wangzheee 提交于
* add Paddle Trt config: with_interleaved
-
- 31 12月, 2021 1 次提交
-
-
由 jakpiase 提交于
* glog fix * changed approach
-
- 30 12月, 2021 2 次提交
-
-
由 wenbin 提交于
* dynamic shape clone supported
-
由 JingZhuangzhuang 提交于
-
- 23 12月, 2021 1 次提交
-
-
由 Wilber 提交于
* support external stream. * update * update * update
-
- 15 12月, 2021 1 次提交
-
-
由 jianghaicheng 提交于
* add ipu_inference * resovle commments * resolve comments * add EnableIpu introduction * rm line * restore npu update * add ernie and resnet50 test * fix copyright time Co-authored-by: Nyaozhixin <522190855@qq.com>
-
- 03 12月, 2021 1 次提交
-
-
由 ronnywang 提交于
* refine structure for cuda and rocm * update * update * update * update
-
- 15 11月, 2021 1 次提交
-
-
由 Chen Weihang 提交于
* move extension into pten [no-verify] * append tensor methods by ext_tensor [no-verify] * append other tensor methods [no-verify] * ext related files tidy [no-verify] * include relation tidy [no-verify] * add pten tensor test [no-verify] * replace tensor in custom op & compile success * refine tensor constructor for unittest * custom relu jit run success * fix all custom op unittests * add inference cmake adapt [no-verify] * fix failed unittests * fix windows failed unittests * try to fix kunlun and inference failed * fix test_elementwise_api error * try to fix win compile failed * fix kunlun fp16 type error * remove useless haddle error macro * add custom linear op test * fix compile failed & add win symbols * fix non pten kernel cast failed * add dll decl for api * polish several deetails * polish details by review comment * add dll_decl for register
-
- 27 10月, 2021 1 次提交
-
-
由 wangxinxin08 提交于
* add dcnv2 plugin
-
- 26 10月, 2021 1 次提交
-
-
由 feng_shuai 提交于
-
- 22 10月, 2021 1 次提交
-
-
由 Wilber 提交于
-
- 19 10月, 2021 2 次提交
-
-
由 Wilber 提交于
* update * fix ut error * update ut
-
由 wangxinxin08 提交于
* add nearest_interp_v2 trt plugin
-
- 11 10月, 2021 1 次提交
-
-
由 wangxinxin08 提交于
* add mish trt plugin, compile & install success, run error. test=develop * modify code according to review * add TRT_NOEXCEPT for mish trt plugin * add unittest for mish trt plugin * remove unnecessary check of mish in op_teller.cc * fix some problem of trt8 * add check and modify unittest while converting mish to trt plugin Co-authored-by: Ndengkaipeng <dengkaipeng@baidu.com>
-
- 23 9月, 2021 1 次提交
-
-
由 Wilber 提交于
-
- 22 9月, 2021 1 次提交
-
-
由 JingZhuangzhuang 提交于
-
- 18 9月, 2021 1 次提交
-
-
由 Wilber 提交于
-
- 14 9月, 2021 1 次提交
-
-
由 Wilber 提交于
-
- 10 9月, 2021 1 次提交
-
-
由 wenbin 提交于
* conv3d * remove const_cast * modify ut * disable dynamic shape for trt6.0 * remove trt5
-
- 29 7月, 2021 1 次提交
-
-
由 wenbin 提交于
* tile op * more uts * disable tile if trt6.0 * typo * fix timeout issue * opteller * opteller remove duplicate code * comments. test=document_fix * modify PADDLE_ENFORCE. * fix reduce_mean issue
-
- 21 7月, 2021 1 次提交
-
-
由 wenbin 提交于
* reduce_mean supported. test=allcase * ut. test=allcase * test=develop * ut.test=allcase * correct name. test=allcase * correct UT. test=allcase * correct UT.test=develop * remove op * UT * add convert * fix timeout issue * more uts * more ut * correct ut
-
- 14 7月, 2021 1 次提交
-
-
由 Wilber 提交于
-
- 21 6月, 2021 1 次提交
-
-
由 Pei Yang 提交于
-
- 11 6月, 2021 2 次提交
-
-
由 Jacek Czaja 提交于
* - Second fix - fix * - fix
-
由 wenbin 提交于
-
- 05 6月, 2021 1 次提交
-
-
由 Wilber 提交于
-
- 04 6月, 2021 1 次提交
-
-
由 wenbin 提交于
* fix inference prepare data bug * rename functions * typo * typo * typo * UT correct * correct condition * correct condition * ci coverage * morelines * fix ci coverage
-
- 01 6月, 2021 1 次提交
-
-
由 Wangzheee 提交于
* add trt convert op: reshape * reshape * fix * input: Shape,ShapeTensor * Add some comments
-
- 19 5月, 2021 1 次提交
-
-
由 石晓伟 提交于
-
- 29 4月, 2021 1 次提交
-
-
由 Wilber 提交于
-
- 26 4月, 2021 1 次提交
-
-
由 石晓伟 提交于
-
- 25 4月, 2021 2 次提交
-
-
由 Wilber 提交于
-
由 denglin-github 提交于
* Add dlnne engine runtime * Fix log * Remove <const_cast> and remove unrelated modify with dlnne, +clang-format * Fix CMakeList format error * Add copyright message * Fix dlnne CMakeList.txt * Add some paddlepaddle_pass to support more networks * Fix some format bug
-
- 01 4月, 2021 1 次提交
-
-
由 zlsh80826 提交于
* add anchor generator op plugin * add anchor generator unit_test * remove dbg info * remove redundant line * replace assertion with paddle enforce * dynamic plugin replaces assertion with paddle enforce * anchor generator support dynamic shape on spatial axis * anchor generator test with fp16, dynamic shape * add anchor generator test all * add back main * reduce test input size to not exceed the timelimit of ci * change super to InferencePassTest for python2 compatibility * reuse paddle operator anchor generator * move creator construct to header with default * add cuda ifdef * reduce line * change super to InferencePassTest for python2 compatibility * fix anchor generator fp16 serialize setting * split unittest from test_all * restrict anchor generator input format before version 7234 * anchor generator only support greater than trt7.1 * change min_graph_size to 2 * min_graph size to 3 if dynamic shape * reduce dynamic shape size to avoid trt search tactic too long to exceed time limit * remove anchor from fetch list * anchor generator support all trt version * fix memory not allocated but if serialized
-
- 30 3月, 2021 1 次提交
-
-
由 zlsh80826 提交于
* yolobox converter and plugin * yolobox unittest * add dynamic shape restriction * fix git merge log
-