- 03 4月, 2023 1 次提交
-
-
由 engineer1109 提交于
-
- 30 3月, 2023 1 次提交
-
-
由 pangengzheng 提交于
* support run haokanctr model in heterps-models * polish setup.py * polish JVM_LIB in evn_dict * align infer auc with DistPsArch pre-stable * async and multi thread data feed * rewrite dense tensor intialization * async infer shape and reuse memory
-
- 28 3月, 2023 1 次提交
-
-
由 Feiyu Chan 提交于
Add basic functionalities to support Scalar & Scalars in operator attribute. 1. extend allowed types in operator's attribute type, add `paddle::experimental::Scalar`, add corresponding protobuf Message types; 2. Scalar enhancement, add formatting, equality; 3. add code to handle Scalar & Scalars in opmaker, conversion from paddle operator to phi kernel, opdesc construction and manipulation, tensorrt converter, tracer, operator construction, etc; 4. bind `paddle::experimental::Scalar` to python, as `libpaddle.Scalar`; 5. add functionality to canonicalize attribute map according to OpProto(if the op the attribute map used for has an OpProto); 6. add code to manipulate Scalar proto message via protobuffer python API; Add unittests. 1. add test cases for formatting, equality for Scalars, and WrapAsScalars; 2. add test cases for 'casting' between different morphs of attributes; 3. add test cases for extracting scalar & scalars from attribute; 4. add test cases for CanonicalizeScalarAttrs(and fix a bug in type index offset); 5. fix gmock's library filename on windows platform. 6. clean code: use canonicalize_attrs instead of inlining the function; 7. add test cases for libpaddle.Scalar in python code. 8. add test cases for `make_scalar_proto`, which manipulate proto message `Scalar` via protobuffer python API.
-
- 21 3月, 2023 1 次提交
-
-
由 iSerendipity 提交于
* move DataType from paddle::experimental to phi * convert namespace * convert namespace * convert namespace * clarify namespace * convert more datatype * Revert "convert more datatype" This reverts commit 083b462959e6a22d4d8767707b628b95b396642e. * convert more in auto_code_generator * fix conflicts for XPU * fix namespace conflicts * fix errors * Revert "fix errors" This reverts commit f9d9958b54ee32141112274c8a5c3c381ab0f876. * fix errors * fix formatting
-
- 10 3月, 2023 1 次提交
-
-
由 HappyHeavyRain 提交于
* Add function node in phi_kernel for MKLDNN * fix the bug in 'BuildInferVarKernelContext' * add infer_varkernel_utils.cc * fix the bug:the first two parametes of 'BuildInferVarKernelContext' can't be template variable * change the code according to first review * change the code according to first review * change the mode of paddle_build.sh * change 'infer_var_kernel_fn_' to 'get_kerneltype_forvar_fn_' * add the error information * fix NotFound infomation warning * fix NotFound infomation warning * fix NotFound infomation warning
-
- 09 3月, 2023 1 次提交
-
-
由 zyfncg 提交于
* register custom kernel for all type of custom device * fix bug * fix GetKernelInputArgDef * fix amp bug * fix TransToPhiPlace * adapt interpreter_util
-
- 06 3月, 2023 1 次提交
-
-
由 Ruibiao Chen 提交于
* Remove InterpretercoreInferShapeContext * Fix lod errors
-
- 27 2月, 2023 1 次提交
-
-
由 csy0225 提交于
-
- 24 2月, 2023 1 次提交
-
-
由 niuliling123 提交于
-
- 22 2月, 2023 1 次提交
-
-
由 Shuangchi He 提交于
* Fix some typos. Signed-off-by: Yulv-git <yulvchi@qq.com> * pre-commit Signed-off-by: Yulv-git <yulvchi@qq.com> --------- Signed-off-by: Yulv-git <yulvchi@qq.com>
-
- 21 2月, 2023 2 次提交
-
-
由 duanyanhui 提交于
* add static custom back_list * rm comments * fix log * fix comment
-
由 csy0225 提交于
* Optimize the ernie inference performance on xpu * fix enable runtime cache logic * when op's input shape has changed, should create a new runtime context * fix * set flag when input shape has changed
-
- 16 2月, 2023 2 次提交
-
-
由 zhupengyang 提交于
-
由 Huang Jiyi 提交于
* move variable_utils from phi_api_utils to fluid * fix coment * update include * fix bugs * fix bugs * fix bugs * fix bugs * fix bugs * update * update * fix CI-Windows-OpenBLAS * fix bugs * fix bugs * fix bugs * update include * move variable_utils to phi_utils * fix namespace
-
- 08 2月, 2023 1 次提交
-
-
由 YuanRisheng 提交于
* unify_kernel * fix compile bugs * modify macro name * perfect code according comment * fix compile bugs * fix compile bugs * fix ci bugs * fix ci bug * fix ci bugs * fix ci bugs * modify code according comment * rm conv_fusion_op
-
- 17 1月, 2023 1 次提交
-
-
由 YuanRisheng 提交于
* change feed_op to phi kernel * fix ci bugs * fix build bugs * fix ci bugs * fix compile bugs * fix ci bugs * perfect code * perfect comment code * fix install bugs * modify code according comment * remove visitor in feed_op * modify according comment * perfect code according comment * add infershape * fix py3 bugs * fix getexpected kernel type * fix getexpected kernel type * fix ci bugs * add registry for custom device * fix py3 bugs * fix floating point error * fix py3 test bugs
-
- 04 1月, 2023 1 次提交
-
-
由 HongyuJia 提交于
* execute use kernel_key first * change OpKernelType->KernelKey * fix py3 compile error, remove redundant header files * fix build_strategy_test * fix DataType::RAW * fix custom_type test: operator_test.cc * fix transform place * fix backends_are_same_class * try fix place TransDataDevice * support all KernelKey * fix TransformData * fix place_are_same_class * fix merge * fix test_params_no_grad * fix specific place of GetExpectedKernelType * fix specific place of GetExpectedKernelType * fix GetKernelTypeForVar * fix dtype error * fix fetch_v2 * change GetKernelTypeForVar * fix interpreter * fix typo error * polish codes * polish codes * polish codes * fix conflict
-
- 03 1月, 2023 1 次提交
-
-
由 Aurelius84 提交于
* [OpAttr]Fix Ignore AttriteTensor in IndicateDataType bug in grad_op * add GetExpectedKernelType
-
- 30 12月, 2022 3 次提交
-
-
由 HongyuJia 提交于
-
由 Sanbu 提交于
* 1219 * temporarily change the num_diff_files limit, test=document_fix * Revert "temporarily change the num_diff_files limit, test=document_fix" This reverts commit 8e70f00ef468d2dad0e38b3da06295ed62990d20. * for codestyle * remove duplicate license * `static mode` -> `static graph mode` * Update hybrid_parallel_inference.py * Update layer_function_generator.py * Update manipulation.py * reset Co-authored-by: NLigoml <39876205+Ligoml@users.noreply.github.com> Co-authored-by: NSigureMo <sigure.qaq@gmail.com>
-
由 WangZhen 提交于
* Fix default GetExpectedKernelType for ops supported tensor attrs
-
- 28 12月, 2022 1 次提交
-
-
由 Leo Chen 提交于
* add skip run * alloc minimum memory * skip check_size in Alloc * skip check_size in Alloc * skip check_size in Alloc * fix cases when tensor is initialized or empty * alloc empty output for place info * add test * increase timeout * format code * skip cpu * add cudnn_deterministic * fit for hostAlloc * follow comments * change check_size to fake_alloc
-
- 19 12月, 2022 1 次提交
-
-
由 HongyuJia 提交于
-
- 12 12月, 2022 1 次提交
-
-
由 YuanRisheng 提交于
* add new tensor * fix windows compile bugs * fix ci bugs * fix ci bugs * fix ci bugs * perfect according comment * fix ci compile bugs * add raw tensor * fix ci bugs * modify code by comment * delete String
-
- 09 12月, 2022 1 次提交
-
-
由 Leo Chen 提交于
* move share_buffer kernel to phi * fix ut * add source file * fix window links
-
- 08 12月, 2022 1 次提交
-
-
由 Wilber 提交于
-
- 06 12月, 2022 1 次提交
-
-
由 QingshuChen 提交于
*test=kunlun
-
- 05 12月, 2022 1 次提交
-
-
由 YuanRisheng 提交于
-
- 01 12月, 2022 1 次提交
-
-
由 HongyuJia 提交于
* fix typo error * pass CI-coverage
-
- 29 11月, 2022 1 次提交
-
-
由 Sławomir Siwek 提交于
-
- 28 11月, 2022 1 次提交
-
-
由 YuanRisheng 提交于
* Fix onednn kernel bugs * fix gpu bugs
-
- 26 11月, 2022 1 次提交
-
-
由 Leo Chen 提交于
* hot fix * fix compile * merge develop * follow comments
-
- 25 11月, 2022 1 次提交
-
-
由 Chitsing KUI 提交于
* attr ready * op ip ready * start dynamic * end2end ok * input shape to map, stat by op * layer wip * first version ready * fix proto depds * fix profiler deps * fix flops typo, rm tuple shape
-
- 17 11月, 2022 1 次提交
-
-
由 zyfncg 提交于
* clip extra and intermediate output of op * fix bug * fix bug * polich code * polich log
-
- 15 11月, 2022 1 次提交
-
-
由 Sławomir Siwek 提交于
* cleanup unused code * unify is_int8 is_bfloat16 * Simplify matmul_v2 FWD kernel * remove RunKernel methods * remove import namespace * remove headers * clean fluid/phi cross imports * remove fluid axpy_handler * delete fluid methods * activations * OneDNNMemDesc * MKLDNNFormatForSize * MatchShapeToLayout * MKLDNNMemoryFormat * MKLDNNFormat * ReorderMKLDNNHandler * to_void_cast * review suggestions * interpolate * remove fluid depedency
-
- 11 11月, 2022 1 次提交
-
-
由 zhangbo9674 提交于
* refine shape op in new_exe * Revert "refine shape op in new_exe" This reverts commit 0e0336ddc5eede3da019b348a0bcc0ef0f3be64e. * refine shape op in new_exe * refine shape expected_kernel_type * add SelectedRows check for shape op * refine code
-
- 07 11月, 2022 1 次提交
-
-
由 HongyuJia 提交于
* move cudnn hardcode outside GetExpectedKernelType * add header file * debug * update interpreter_util with hardcode * update interpreter_util headerfile * solve activation hardcode * debug with CI * add mkldnn_op_list header file * temporarily uncomment mkldnn * temporarily uncomment mkldnn * delete sequence_softmax cudnn hardcode * add hardcode to data_transfer.cc * update data_transfer headerfile * try fix segment fault * update cudnn&miopen_helper * reset HasAttr of DygraphExctnCtx * debug, this commit should pass all CI * debug should pass CI, temporarily disable activation * debug should pass CI * fix default_attr=nullptr bug * clean debug code * Call SetDnnFallback function in the base class * activation fallback to plain kernel * fix default GetExpectedKernelType find wrong kernel * search cudnn kernel instead of fallback * fix cudnn_handle bug * remove tanh use_cudnn * restore tanh use_cudnn * debug tanh * fix tanh bug * delete activation cudnn kernel * polish code
-
- 03 11月, 2022 1 次提交
-
-
由 HongyuJia 提交于
* opt CanMKLDNNBeUsed performance * fix nullptr bug * fix OpBase default_attrs=nullptr bug * fix OpBase default_attrs=nullptr bug * fix OpBase default_attrs=nullptr bug
-
- 02 11月, 2022 1 次提交
-
- 01 11月, 2022 1 次提交
-
-
由 HongyuJia 提交于
* move cudnn hardcode outside GetExpectedKernelType * add header file * debug * update interpreter_util with hardcode * update interpreter_util headerfile * solve activation hardcode * debug with CI * add mkldnn_op_list header file * temporarily uncomment mkldnn * temporarily uncomment mkldnn * delete sequence_softmax cudnn hardcode * add hardcode to data_transfer.cc * update data_transfer headerfile * try fix segment fault * update cudnn&miopen_helper * reset HasAttr of DygraphExctnCtx * debug, this commit should pass all CI * debug should pass CI, temporarily disable activation * debug should pass CI * fix default_attr=nullptr bug * clean debug code
-