- 06 12月, 2019 1 次提交
-
-
由 石晓伟 提交于
-
- 05 12月, 2019 1 次提交
-
-
由 Pei Yang 提交于
-
- 04 12月, 2019 2 次提交
-
-
由 Pei Yang 提交于
make config option DisableGlogInfo() able to mute all inference logs
-
由 Zhaolong Xing 提交于
* ADD NV JETSON SUPPORT test=release/1.6 * CHERRY_PICK: specify the auto growth allocator for inference. test=release/1.6
-
- 03 12月, 2019 2 次提交
- 02 12月, 2019 1 次提交
-
-
由 Zhaolong Xing 提交于
-
- 29 11月, 2019 2 次提交
-
-
由 Pei Yang 提交于
added splitter "__" between weight name and suffix number to avoid conflicts.
-
由 Wojciech Uss 提交于
-
- 25 11月, 2019 1 次提交
-
-
由 liu zhengxi 提交于
* fix the CAPI ZeroCopy shape error and reconstruct the output obtain * use an anonymous namespace to cover the functor * fix unit tests because of the output of typeid(T).name() is different from linux and windows, test=develop
-
- 02 11月, 2019 1 次提交
-
-
由 石晓伟 提交于
* fix infer crashes caused by conv/pool upgrades, test=release/1.6 * fix bug, test=release/1.6
-
- 01 11月, 2019 1 次提交
-
-
由 Zhaolong Xing 提交于
Paddle-trt inference: filter conv, depthwise_conv, pooling when padding size > 4 fix C++ multicard inference bug. test=develop
-
- 31 10月, 2019 1 次提交
-
-
由 Pei Yang 提交于
Bug Fix: Paddle-TRT cannot handle adaptive pooling in pool2d op converter and "num" attribute in split op converter (#20733) (#20902) * fix pool2d trt converter, test=develop * add fix for split op converter, test=develop
-
- 20 10月, 2019 1 次提交
-
-
由 bingyanghuang 提交于
-
- 18 10月, 2019 2 次提交
-
-
由 石晓伟 提交于
-
由 liu zhengxi 提交于
* improve the performance of capi in PD_PredictorRun (#20665) * alter the capi of PD_PredictorRun to provide proper function, test=release/1.6
-
- 16 10月, 2019 1 次提交
-
-
由 Michał Gallus 提交于
test=release/1.6 test=document_fix
-
- 15 10月, 2019 1 次提交
-
-
由 liu zhengxi 提交于
-
- 14 10月, 2019 2 次提交
-
-
由 Pei Yang 提交于
-
由 zhaoyuchen2018 提交于
* Add Multihead matmul fuse pass (#20167) * Add multihead fuse pass for ernie opt * Refine softmax test=develop * Refine cuda kernel * Refine cuda version * Refine cmake test=develop * refine header file * refine test case and pass * refine comments * Delete useless code. test=develop Signed-off-by: Nzhaoyuchen <zhaoyuchen01@baidu.com>
-
- 12 10月, 2019 1 次提交
-
-
由 liu zhengxi 提交于
-
- 10 10月, 2019 1 次提交
-
-
由 liu zhengxi 提交于
* Add capi for fluid inference api (#20092) * add capi for fluid inference api, including AnalysisConfig, AnalysisPredictor, PaddleBuf, PaddleTensor, ZeroCopyTensor * add dll to inference capi (#20180) * add dll to inference capi, test=develop * add if win32 in cmakelists, test=develop
-
- 01 10月, 2019 1 次提交
-
-
由 石晓伟 提交于
-
- 27 9月, 2019 1 次提交
-
-
由 石晓伟 提交于
* update operator compatible info, test=develop * revert cmake/version.cmake, test=develop * add unit_tests and fix bugs, test=develop * update ../paddle/fluid/framework/framework.proto, test=develop * fix bug of paddle/fluid/inference/api/analysis_predictor.cc, test=develop * update paddle/fluid/framework/version_test.cc, test=develop * add comments and rename interfaces, test=develop
-
- 25 9月, 2019 2 次提交
-
-
由 Zhaolong Xing 提交于
FIx C++ inference BUG: When open memory optim and enable trt subgraph at the same time, there is a bug (#19969) * fix memory optimization type test=develop * 1. fix BUG: open trt and memory optim will trigger bug. 2. Clean memory optim bug. test=develop
-
由 Aurelius84 提交于
* Removing last dims constraints of seq_pad and seq_unpad test=develop * fix test_layer api code test=develop * fix sequence_pad_op.cc conflict test=develop * remove test_analyzer_mm_dnn test=develop * fix vectorize bug test=develop * fix vectorize<int> test=develop
-
- 21 9月, 2019 3 次提交
-
-
由 pawelpiotrowicz 提交于
test=develop
-
由 Pei Yang 提交于
* add TRT shape check, test=develop * model_input_shape == runtime_input_shape, refine message, test=develop
-
由 Pei Yang 提交于
* fix trt bugs when sharing params, test=develop * add unittest for cascade_rcnn
-
- 20 9月, 2019 1 次提交
-
-
由 石晓伟 提交于
-
- 19 9月, 2019 1 次提交
-
-
由 Yiqun Liu 提交于
* Add fc_elementwise_layernorm_fuse pass and unittest. * Add fused_fc_elementwise_layernorm op and its GPU kernel. test=develop * Apply fc_elementwise_layernorm_fuse_pass to GPU inference. * Add the setting of attrs in the definition of binary_op. test=develop * Add comment. * Implement the unittest. test=develop * Change the unittest name of layer_norm. test=develop
-
- 18 9月, 2019 1 次提交
-
-
由 石晓伟 提交于
-
- 17 9月, 2019 2 次提交
-
-
由 Pei Yang 提交于
zerocopytensor support uint8, analysis config support profile, analysis predictor support GetInputTensorShape, test=develop (#19822)
-
由 Zhaolong Xing 提交于
test=develop
-
- 16 9月, 2019 1 次提交
-
-
由 Yiqun Liu 提交于
* Refine the codes related to fc op. * Add GPU implementation for fc functor. * Apply fc_fuse_pass in GPU inference. test=develop * Change the cmake for fc op. * Change PADDLE_ENFORCE to PADDLE_ENFORCE_EQ. * Add an attribute to set the activation type in fc_op. * Enhance the unittest of fc_op. test=develop * Remove the declaration of FCOpGrad back to the header file. test=develop * Set default value for newly added arguments in test_fc_op. test=develop * Enhance fc_fuse_pass to enable fusing relu. * Allow print the shapes of var_desc in graph. test=develop * Enhance fc_fuse_pass_tester. * Remove the use of PADDLE_ENFORCE. test=develop * Correct the number of ops after fusing. test=develop * Fix a typo. test=develop * Set activation_type to null when there is no relu in fc. test=develop * Refine fc_fuse_pass's codes. * Enable the set of shape for tensor. * Refine repeated_fc_relu_pass and add unittest. test=develop
-
- 11 9月, 2019 1 次提交
-
-
由 Yiqun Liu 提交于
* Refine the codes related to fc op. * Add GPU implementation for fc functor. * Apply fc_fuse_pass in GPU inference. test=develop * Change the cmake for fc op. * Change PADDLE_ENFORCE to PADDLE_ENFORCE_EQ. * Add an attribute to set the activation type in fc_op. * Enhance the unittest of fc_op. test=develop * Remove the declaration of FCOpGrad back to the header file. test=develop * Set default value for newly added arguments in test_fc_op. test=develop
-
- 09 9月, 2019 1 次提交
-
-
由 Tao Luo 提交于
* paddle::framework::vectorize() templatization test=develop * update pybind/imperative.cc test=develop * revert update on unsqueeze_op.cc and warpctc_cudnn_op.cu.cc test=develop
-
- 05 9月, 2019 1 次提交
-
-
由 Tao Luo 提交于
* remove assert.h * change PADDLE_ASSERT_MSG to PADDLE_ENFORCE test=develop * fix tensorrt paddle_enforce test=develop
-
- 04 9月, 2019 1 次提交
-
-
由 baojun 提交于
* enable ngraph throught build_strategy test=develop * add unittest test=develop * put use_ngraph unconditional test=develop * remove paddle_enforce test=develop * remove paddle_enforce test=develop * fix copyright test=develop * limit for ngraph only test=develop
-
- 03 9月, 2019 1 次提交
-
-
由 Yiqun Liu 提交于
* Add a interface to enable cudnn for inference. * Add cudnn_placement_pass. test=develop * Set the default value of cudnn_enabled_op_types to null. test=develop * Write the common basic class, placement_pass_base, to refine the codes. test=develop * Call EnableCUDNN in unittest. test=develop * Refine cudnn_placement_pass tester. * Enable the testing of cudnn_placement_pass in inference's unittest. test=develop * Add the check of op kernels. test=develop
-