- 28 6月, 2022 1 次提交
-
- 27 6月, 2022 1 次提交
-
-
由 Sing_chan 提交于
* make inference_api_test use import_lib + dll to smaller its size;test=document_fix;test=windows_ci_inference * make place.h not export symbols; change the order of inference test dependencies * cant delete analysis pass ..in inference_analysis_test or build test_analyzer.exe will failed; temporarily comment test need paddle_inference_c * add onnx path; cp onnxruntime.dll to tests/api * make c_ut depend on paddle_inference_c_shared rather than paddle_inference_c * make c_ut depend .dll in windows and .lib in linux;path add capi_exp * init context pool by analysis predictor * make windows and linux both use shared; compile shared even on_infer off * move case:profile after case:fuse_statis to avoid context pool null problem * make c_ut depend .dll in windows and .lib in linux;path add capi_exp
-
- 14 6月, 2022 1 次提交
-
-
由 Wilber 提交于
* cmake-lint * update
-
- 10 6月, 2022 1 次提交
-
-
由 Leo Chen 提交于
* make all phi kernels to 2(host/device) static libraries directly * fix calling kernel_declare * fix compile * fix cpu compile * fix rocm compile * fix xpu compile * fix xpu kp compile * fix inference compile
-
- 04 6月, 2022 1 次提交
-
-
由 Sing_chan 提交于
-
- 10 5月, 2022 1 次提交
-
-
由 ronnywang 提交于
-
- 25 4月, 2022 2 次提交
-
-
由 heliqi 提交于
* fix windows compile test case error * test windows ci * cmake add onnxruntime * cmake add onnxruntime * test windows ci * auto_code_generator add ort lib copy * fallback modify windows ci bat * ci notest;test=document_fix;test=windows_ci_inference;test=windows_ci;test=windows_op
-
由 zhouweiwei2014 提交于
* merge all phi lib to several big static lib * merge all phi lib to several big static lib
-
- 06 4月, 2022 1 次提交
-
-
由 Allen Guo 提交于
* remove paddle_ipu shared library * fix unique_name
-
- 31 3月, 2022 1 次提交
-
-
由 TeslaZhao 提交于
* Pass compat of conv_transpose_bias_mkldnn_fuse_pass * Fix a bug of strided_slice op, about the axes parameter access memory out of bounds * Fix a bug of strided_slice op, about the axes parameter access memory out of bounds * Fix a bug of transpose op, about accessing memory out of bounds of the perm param * op:transpose_op supports bool type * op:transpose_op supports bool type * Keep strided_slice op behavior consistent with slice op when starts input is less than -rank * Using DistConfig in inference
-
- 10 3月, 2022 1 次提交
-
-
由 heliqi 提交于
* add onnxruntime predictor * Add code comments * support link paddle2onnx onnxruntime * support onnxruntime with python * support onnxruntime with python * support onnxruntime with windows * paddle2onnx compile with windows * supoort windows compile * supoort windows compile with onnxruntime * supoort windows compile with paddle2onnx * supoort mac compile * compile with mac * compile with mac * add code comments * fix remind word * code optimization * add test case * add test case * add inference demo_ci test case * fix compile paddle2onnx with no python * add inference demo_ci test case * add inference demo_ci test case * add inference infer_ut test case * support c go api and test cases * add converage test case * add converage test case * add capi test case * add capi test case
-
- 24 2月, 2022 1 次提交
-
-
由 Chen Weihang 提交于
* rename pten to phi * fix infrt compile failed * resolve conflict
-
- 23 2月, 2022 1 次提交
-
-
由 Allen Guo 提交于
* update inference part * restore white space
-
- 25 1月, 2022 1 次提交
-
-
由 xiongkun 提交于
* transfer: string tinyformat errors and part of enforce into pten * remove comment * fix by code review * assert is not compile in -DNDEBUG * add string as dependences of paddle_inference
-
- 15 12月, 2021 1 次提交
-
-
由 jianghaicheng 提交于
* add ipu_inference * resovle commments * resolve comments * add EnableIpu introduction * rm line * restore npu update * add ernie and resnet50 test * fix copyright time Co-authored-by: Nyaozhixin <522190855@qq.com>
-
- 15 11月, 2021 1 次提交
-
-
由 Chen Weihang 提交于
* move extension into pten [no-verify] * append tensor methods by ext_tensor [no-verify] * append other tensor methods [no-verify] * ext related files tidy [no-verify] * include relation tidy [no-verify] * add pten tensor test [no-verify] * replace tensor in custom op & compile success * refine tensor constructor for unittest * custom relu jit run success * fix all custom op unittests * add inference cmake adapt [no-verify] * fix failed unittests * fix windows failed unittests * try to fix kunlun and inference failed * fix test_elementwise_api error * try to fix win compile failed * fix kunlun fp16 type error * remove useless haddle error macro * add custom linear op test * fix compile failed & add win symbols * fix non pten kernel cast failed * add dll decl for api * polish several deetails * polish details by review comment * add dll_decl for register
-
- 01 11月, 2021 1 次提交
-
-
由 Chen Weihang 提交于
* initial tensor design & sign kernel demo * add move constructor for meta & add lodtensor * add dirs & sign xpu kernel * add mean cpu&cuda kernel impl * move sign & mean xpu & npu kernel * add selected_rows basic impl * refactor design, BaseTensor to DenseTensor, etc. * add scale mkldnn kernel * polish xpu & npu impl details * fix mkldnn reuse compile failed * change tensor operation lib name * rename util filename * add more comments * change TensorImplInterface to TensorInterface * add kernel key and factory * remove MKLDNNTensorMeta, add MKLDNNDenseTensor * change XXDeviceContext to XXContext * add base kernel registrar utils & test on sign * replace boost::any by paddle::any * fix several ci failed * fix npu compile error * add ordered map util * fix multiple ordered_map compile errors * move dev into include dir * support sign op in static op run * fix static op run error * fix new executor compile failed * add dygraph branch & remove sign_op.h * fix test_infer_no_need_buffer_slots * fix rocm compile link error * fix unitybuild error & clear glog * fix npu compile failed * skip quant trans test * fix part windows compile problem * fix xpu enforce error * fix inference test failed * remove ordered_map to solve quant failed * fix part of rcom compile faild * add more register kernels * revert scale kernel temporarily * fix code format error * add new kernel registrar marco * rename top to tcmpt * revert xpu, npu, mkldnn impl & remove op def * add kernel args parse functor to auto parse args * revert some change & add scale kernels * add op proto in dygraph kernelcontext building * polish kernel dispatch logic & nameing rule * fix scale kernel match error * fix scale test failed * add mean API and unittest * test mean api success * add branch to solve compiled error * skip clang format error * add mean skip rule in op_library * add dot kernel, api and unittest (#6) * remove old kernel and add symbol link * fix dot compiled failed * add merco for module declare * fix npu and xpu compile error * revert sign, mean, scale, dot kernel removing * add comment for keeping old kernel impl * fix mutable_data error * fix bfloat16 conflit * fix inference undef error * adapt to msvc compile rules * polish comment for template inst * add cmake template instantiation for win * fix backend to place device id bug * fix ifdef error * Op2functor (#7) * add kernel args maker class * make args maker non-const * remove debug log * modify codes by review options * split constructPrKernelContext function * fix output name bug * fix test_mean_op test_sign_op failed * fill_any_like kernel refactor (#10) * fill_any_like kernel refactor * remove useless code of full_like c++ api * skip dtype for fill_any_like * add attrs for kernel key constrcut * add use_pt_kernel Flags to control whether to use pt kernel (#13) * add use_pt_kernel Flags to control whether to use pt kernel * change the default value to true for cheking pt kernels * fix mutable_data cuda place error * move high level apis into hapi * remove selectedrows adapting temporarily * Support Scalar in Tensor Compute Library (#14) * fill_any_like kernel refactor * remove useless code of full_like c++ api * Support Scalar in Tensor Compute Library * add scalar in dygraph and static graph mode * keep the basic type for attr, instead of using scalar for all * merge the code * remove mkldnn tensor & polish details * use flat_hash_map and small_vector in kernel factory * Refactor flatten kernel (#12) * refactor flatten kernel * update infershape function * fix compile bugs * fix bugs when merge * fix compiler bugs * fix bugs when run test_flatten_api * fix bugs when run test * Revert "use flat_hash_map and small_vector in kernel factory" This reverts commit 23091495cfdd3df8cc1be592d30f09ea66a7c72b. * Move cpu, cuda and other device code into kernels (#15) * fill_any_like kernel refactor * remove useless code of full_like c++ api * Support Scalar in Tensor Compute Library * add scalar in dygraph and static graph mode * keep the basic type for attr, instead of using scalar for all * merge the code * start refactor matmul * move cpu, cuda and other device modules into kernels * merge code * polish code in operator.cc * Perfect unitests (#16) * perfect unittest * update license * replace with flat_hash_map, small_vector (#19) * fix small_vector build error on windows platform * replace with flat_hash_map, small_vector * remove todo * Perfect unitests (#20) * perfect unittest * update license * fix bug when run tcmpt_utils_test * refactor execution adapting impl * fix insert conflit * Fix CI bug of test_yolov3 (#21) * fill_any_like kernel refactor * remove useless code of full_like c++ api * Support Scalar in Tensor Compute Library * add scalar in dygraph and static graph mode * keep the basic type for attr, instead of using scalar for all * merge the code * start refactor matmul * move cpu, cuda and other device modules into kernels * merge code * polish code in operator.cc * Fix CI bug of test_yolov3 * add the tensor base class, test=develop (#17) * update the tensor base class, test=develop * remove two funcs, test=develop * update the error msg, test=develop Co-authored-by: NChen Weihang <chenweihang@baidu.com> * [no-verify] commit backend and tensor signature changes * Rename tcmpt to pten (#23) * rename tcmpt to pten * update omitted files for rename to pten * update omitted file for rename to pten * remove k of all enum var * remove kernel_instantiate (#26) * remove symbols and spatial_tensor * change common to functions * readd share tensor impl methods * add a candidate dense tensor class, test=develop (#28) * change all Pt to Pten * resolve conflit with xiaowei * Op2functor opt1 (#27) * replace to small vector and change to const & * add std::move Co-authored-by: NChen Weihang <chenweihang@baidu.com> * polish kernel factory and kernel registry * fix operator test error msg mismatch * remove tensor signature and backend set member * move scalar and polish enforce * revert dtype layout change to fix error * fix enum operator override error * add several base unittests * add pten utils tests * polish some details * Dev/op2func refactor 3 (#30) * add a candidate dense tensor class, test=develop * remove TensorBase::backend(), test=develop * remove some ops, test=develop * cherry-pick the pr of tensor meta, test=develop * moves the dense tensor and some ops, test=develop * update the linalg operator, test=develop * update other operators, test=develop * fix errors, test=develop * fix bugs, test=develop * try to resolve the problem of windows ci, test=develop * updates codes, test=develop * fix the tensor_utils.cc, test=develop * modify the dense tensor, test=develop * fix the data type, test=develop Co-authored-by: Nshixiaowei02 <39303645+Shixiaowei02@users.noreply.github.com> * polish some details * polish kernel signature details * fix a bug about offsets of the tensor, test=develop (#31) Co-authored-by: Nshixiaowei02 <39303645+Shixiaowei02@users.noreply.github.com> * polish some details Co-authored-by: Nchentianyu03 <ctychentianyu@gmail.com> Co-authored-by: Nzyfncg <1370305206@qq.com> Co-authored-by: NYuanRisheng <yuanrisheng@baidu.com> Co-authored-by: N石晓伟 <39303645+Shixiaowei02@users.noreply.github.com>
-
- 14 9月, 2021 1 次提交
-
-
由 石晓伟 提交于
-
- 27 8月, 2021 1 次提交
-
-
由 joanna.wozna.intel 提交于
* Add calculation for gru op * Correct the types * Remove mkldnn only * Correct mkldnn ifdef * Remove mkldnn ifdef * Separate mkldnn quantizer test * Correct Windows test * Check different cmake fix * Revert cmake change * Cmake change 2 * Cmake change 3
-
- 10 8月, 2021 1 次提交
-
-
由 chentianyu03 提交于
* add any.hpp to utils and replace boost::any with self defined paddle::any * add copy any.hpp to custom op depends * modify any.hpp include path * remove boost from setup.py.in * add copy any.hpp to custom op depends * move any.hpp to paddle/utils/ dirs * move any.h to extension/include direction * copy utils to right directions
-
- 24 4月, 2021 1 次提交
-
-
由 winter-wang 提交于
-
- 23 4月, 2021 1 次提交
-
-
由 wuhuanzhou 提交于
-
- 22 4月, 2021 1 次提交
-
-
由 wuhuanzhou 提交于
-
- 18 3月, 2021 1 次提交
-
-
由 Chen Weihang 提交于
* support custom complex op * fix detail error * add inference support * fix setup windows failed
-
- 27 2月, 2021 1 次提交
-
-
由 石晓伟 提交于
-
- 20 2月, 2021 1 次提交
-
-
由 Wilber 提交于
* update paddle_fluid.so to paddle_inference.so
-
- 13 1月, 2021 1 次提交
-
-
由 Zhang Jun 提交于
* fix bug on compiling inference shared lib with crypto;test=develop * fix cmake bug when build inference lib using -DWITH_CRYPTO=OFF * update cmake * remove unnecessary enforce message
-
- 12 1月, 2021 1 次提交
-
-
由 tangwei12 提交于
* rename sendrecv.proto to namespace paddle.distributed * split ps with distributed
-
- 24 12月, 2020 1 次提交
-
-
由 tangwei12 提交于
* oneps (3/4) Co-authored-by: NMrChengmo <cmchengmo@163.com> Co-authored-by: Nmalin10 <malin10@baidu.com> Co-authored-by: Nchengmo <chengmo@baidu.com>
-
- 19 11月, 2020 1 次提交
-
-
由 Wilber 提交于
-
- 12 10月, 2020 1 次提交
-
-
由 chen.zhiyu 提交于
-
- 24 9月, 2020 1 次提交
-
-
由 Wilber 提交于
-
- 23 9月, 2020 1 次提交
-
-
由 Zhou Wei 提交于
-
- 21 7月, 2020 1 次提交
-
-
由 Zhaolong Xing 提交于
* fix windows ci test=develop * fix ci error
-
- 09 7月, 2020 1 次提交
-
-
由 Chen Weihang 提交于
remove useless property
-
- 05 6月, 2020 1 次提交
-
-
由 silingtong123 提交于
* test=develop, fix a bug * test=develop, remove the macro of PADDLE_DLL_INFERENCE
-
- 01 6月, 2020 2 次提交
-
-
由 Wilber 提交于
-
由 silingtong123 提交于
* add SetCommandLineOption * add the print_FLAGS function * remove the test demo * modify the location of macro * add the 'WITH_STATIC_LIB' option on windows * modify the macro of PD_INFER_DECL * modify the the fuction name * modify the unittest * modify the code style
-
- 11 5月, 2020 1 次提交
-
-
由 Tao Luo 提交于
-
- 04 3月, 2020 1 次提交
-
-
由 石晓伟 提交于
* encapsulate the PaddleTensorToLoDTensor, test=develop * serialize the pd_tensor, test=develop * serialize tensors to file, test=develop
-