1. 04 4月, 2023 2 次提交
    • R
      Improve new executor static build (#51149) · 5bac67d4
      Ruibiao Chen 提交于
      * Improve new executor static build
      
      * Skip GC for static build
      
      * Skip infershape for static build
      
      * Handle read_op
      
      * Add fused_attention to OpsWithFluidKernelNeedMoveToPhi
      
      * Fix argsort typos
      
      * Add sequence_pool to OpsWithFluidKernelNeedMoveToPhi
      
      * Fix skip share lod errors
      
      * Fix errors for adam
      
      * Fix errors for eigvals, memcpy and fake_quantize
      
      * Add static_build.cc
      
      * Add black list
      
      * Fix CI errors
      
      * Fix CI errors
      
      * Fix CI errors
      
      * Fix TensorArray
      
      * Fix TensorArray
      
      * Add update_loss_scaling to OpsNeedSetOutputDtypeWhenRegisterPhiKernel
      
      * Fix copy
      
      * Fix errors
      
      * Fix momentum
      
      * Skip mkldnn
      
      * Fix CI errors
      
      * Fix c_sync_calc_stream_op
      
      * Fix CINN
      
      * Fix while op
      
      * All CI pass, disable FLAGS to merge code, enable it after more tests in future
      
      * Add UTs
      
      * Fix typos
      
      * Fix typos
      
      * Add mkldnn UT
      
      * Remove mkldnn test
      
      * Fix typos
      
      * Fix dist test
      
      * Fix typos
      
      * Fix CI errors
      
      * Fix CI errors
      
      * Add UTs
      
      * Fix typos
      
      * Fix typos
      
      * Add sparse tests
      
      * ToComplexType -> ToComplex
      
      * Add test_matmul_op_static_build to disable_win_inference_test
      5bac67d4
    • C
      Fix inplace op dims not changed (#52416) · 8e7aa296
      csy0225 提交于
      8e7aa296
  2. 03 4月, 2023 1 次提交
  3. 30 3月, 2023 1 次提交
    • P
      Speedup worker (#51760) · 8ca86d72
      pangengzheng 提交于
      * support run haokanctr model in heterps-models
      
      * polish setup.py
      
      * polish JVM_LIB in evn_dict
      
      * align infer auc with DistPsArch pre-stable
      
      * async and multi thread data feed
      
      * rewrite dense tensor intialization
      
      * async infer shape and reuse memory
      8ca86d72
  4. 28 3月, 2023 1 次提交
    • F
      Add basic functionalities to support Scalar & Scalars in op attr (#51984) · 2e9fd5e4
      Feiyu Chan 提交于
      Add basic functionalities to support Scalar & Scalars in operator attribute.
      
      1. extend allowed types in operator's attribute type, add `paddle::experimental::Scalar`, add corresponding protobuf Message types;
      2. Scalar enhancement, add formatting, equality;
      3. add code to handle Scalar & Scalars in opmaker, conversion from  paddle operator to phi kernel, opdesc construction and manipulation,  tensorrt converter, tracer, operator construction, etc;
      4. bind `paddle::experimental::Scalar` to python, as `libpaddle.Scalar`;
      5. add functionality to canonicalize attribute map according to OpProto(if the op the attribute map used for has an OpProto);
      6. add code to manipulate Scalar proto message via protobuffer python API;
      
      Add unittests.
      
      1. add test cases for formatting, equality for Scalars, and WrapAsScalars;
      2. add test cases for 'casting' between different morphs of attributes;
      3. add test cases for extracting scalar & scalars from attribute;
      4. add test cases for CanonicalizeScalarAttrs(and fix a bug in type index offset);
      5. fix gmock's library filename on windows platform.
      6. clean code: use canonicalize_attrs instead of inlining the function;
      7. add test cases for libpaddle.Scalar in python code.
      8. add test cases for `make_scalar_proto`, which manipulate proto message `Scalar` via protobuffer python API.
      2e9fd5e4
  5. 21 3月, 2023 1 次提交
    • iSerendipity's avatar
      [PHI decoupling] Move DataType* from paddle:experimental to phi namespace (#51716) · 4638a62e
      iSerendipity 提交于
      * move DataType from paddle::experimental to phi
      
      * convert namespace
      
      * convert namespace
      
      * convert namespace
      
      * clarify namespace
      
      * convert more datatype
      
      * Revert "convert more datatype"
      
      This reverts commit 083b462959e6a22d4d8767707b628b95b396642e.
      
      * convert more in auto_code_generator
      
      * fix conflicts for XPU
      
      * fix namespace conflicts
      
      * fix errors
      
      * Revert "fix errors"
      
      This reverts commit f9d9958b54ee32141112274c8a5c3c381ab0f876.
      
      * fix errors
      
      * fix formatting
      4638a62e
  6. 10 3月, 2023 1 次提交
    • HappyHeavyRain's avatar
      [New features]Add function node in phi_kernel for MKLDNN (#51073) · a0a6dc6a
      HappyHeavyRain 提交于
      * Add function node in phi_kernel for MKLDNN
      
      * fix the bug in 'BuildInferVarKernelContext'
      
      * add infer_varkernel_utils.cc
      
      * fix the bug:the first two parametes of 'BuildInferVarKernelContext' can't be template variable
      
      * change the code according to first review
      
      * change the code according to first review
      
      * change the mode of paddle_build.sh
      
      * change 'infer_var_kernel_fn_' to 'get_kerneltype_forvar_fn_'
      
      * add the error information
      
      * fix NotFound infomation warning
      
      * fix NotFound infomation warning
      
      * fix NotFound infomation warning
      a0a6dc6a
  7. 09 3月, 2023 1 次提交
  8. 06 3月, 2023 1 次提交
  9. 27 2月, 2023 1 次提交
  10. 24 2月, 2023 1 次提交
  11. 22 2月, 2023 1 次提交
  12. 21 2月, 2023 2 次提交
  13. 16 2月, 2023 2 次提交
  14. 08 2月, 2023 1 次提交
    • Y
      [PHI]Unify Fluid and PHI kernel (#49328) · e92e3aab
      YuanRisheng 提交于
      * unify_kernel
      
      * fix compile bugs
      
      * modify macro name
      
      * perfect code according comment
      
      * fix compile bugs
      
      * fix compile bugs
      
      * fix ci bugs
      
      * fix ci bug
      
      * fix ci bugs
      
      * fix ci bugs
      
      * modify code according comment
      
      * rm conv_fusion_op
      e92e3aab
  15. 17 1月, 2023 1 次提交
    • Y
      [PHI]Change feed_op to phi kernel (#49116) · f7f1dc03
      YuanRisheng 提交于
      * change feed_op to phi kernel
      
      * fix ci bugs
      
      * fix build bugs
      
      * fix ci bugs
      
      * fix compile bugs
      
      * fix ci bugs
      
      * perfect code
      
      * perfect comment code
      
      * fix install bugs
      
      * modify code according comment
      
      * remove visitor in feed_op
      
      * modify according comment
      
      * perfect code according comment
      
      * add infershape
      
      * fix py3 bugs
      
      * fix getexpected kernel type
      
      * fix getexpected kernel type
      
      * fix ci bugs
      
      * add registry for custom device
      
      * fix py3 bugs
      
      * fix floating point error
      
      * fix py3 test bugs
      f7f1dc03
  16. 04 1月, 2023 1 次提交
    • H
      [Unify KernelKey] change OpKernelType->KernelKey (#49138) · 4383494f
      HongyuJia 提交于
      * execute use kernel_key first
      
      * change OpKernelType->KernelKey
      
      * fix py3 compile error, remove redundant header files
      
      * fix build_strategy_test
      
      * fix DataType::RAW
      
      * fix custom_type test: operator_test.cc
      
      * fix transform place
      
      * fix backends_are_same_class
      
      * try fix place TransDataDevice
      
      * support all KernelKey
      
      * fix TransformData
      
      * fix place_are_same_class
      
      * fix merge
      
      * fix test_params_no_grad
      
      * fix specific place of GetExpectedKernelType
      
      * fix specific place of GetExpectedKernelType
      
      * fix GetKernelTypeForVar
      
      * fix dtype error
      
      * fix fetch_v2
      
      * change GetKernelTypeForVar
      
      * fix interpreter
      
      * fix typo error
      
      * polish codes
      
      * polish codes
      
      * polish codes
      
      * fix conflict
      4383494f
  17. 03 1月, 2023 1 次提交
  18. 30 12月, 2022 3 次提交
  19. 28 12月, 2022 1 次提交
    • L
      [new-exec] Ahead-Of-Time choosing kernel (#48789) · 63d2d722
      Leo Chen 提交于
      * add skip run
      
      * alloc minimum memory
      
      * skip check_size in Alloc
      
      * skip check_size in Alloc
      
      * skip check_size in Alloc
      
      * fix cases when tensor is initialized or empty
      
      * alloc empty output for place info
      
      * add test
      
      * increase timeout
      
      * format code
      
      * skip cpu
      
      * add cudnn_deterministic
      
      * fit for hostAlloc
      
      * follow comments
      
      * change check_size to fake_alloc
      63d2d722
  20. 19 12月, 2022 1 次提交
  21. 12 12月, 2022 1 次提交
  22. 09 12月, 2022 1 次提交
  23. 08 12月, 2022 1 次提交
  24. 06 12月, 2022 1 次提交
  25. 05 12月, 2022 1 次提交
  26. 01 12月, 2022 1 次提交
  27. 29 11月, 2022 1 次提交
  28. 28 11月, 2022 1 次提交
  29. 26 11月, 2022 1 次提交
  30. 25 11月, 2022 1 次提交
    • C
      [PROFILER] add flops for Profiler (#47766) · 3d1981ad
      Chitsing KUI 提交于
      * attr ready
      
      * op ip ready
      
      * start dynamic
      
      * end2end ok
      
      * input shape to map, stat by op
      
      * layer wip
      
      * first version ready
      
      * fix proto depds
      
      * fix profiler deps
      
      * fix flops typo, rm tuple shape
      3d1981ad
  31. 17 11月, 2022 1 次提交
  32. 15 11月, 2022 1 次提交
    • S
      mkldnn directory cleanup (#47779) · 8a339d24
      Sławomir Siwek 提交于
      * cleanup unused code
      
      * unify is_int8 is_bfloat16
      
      * Simplify matmul_v2 FWD kernel
      
      * remove RunKernel methods
      
      * remove import namespace
      
      * remove headers
      
      * clean fluid/phi cross imports
      
      * remove fluid axpy_handler
      
      * delete fluid methods
      
      * activations
      
      * OneDNNMemDesc
      
      * MKLDNNFormatForSize
      
      * MatchShapeToLayout
      
      * MKLDNNMemoryFormat
      
      * MKLDNNFormat
      
      * ReorderMKLDNNHandler
      
      * to_void_cast
      
      * review suggestions
      
      * interpolate
      
      * remove fluid depedency
      8a339d24
  33. 11 11月, 2022 1 次提交
    • Z
      Refine shape op lanch method for standalone executor (#47843) · 981d1a10
      zhangbo9674 提交于
      * refine shape op in new_exe
      
      * Revert "refine shape op in new_exe"
      
      This reverts commit 0e0336ddc5eede3da019b348a0bcc0ef0f3be64e.
      
      * refine shape op in new_exe
      
      * refine shape expected_kernel_type
      
      * add SelectedRows check for shape op
      
      * refine code
      981d1a10
  34. 07 11月, 2022 1 次提交
    • H
      [Restore PR] Remove hard code of PADDLE_WITH_CUDA (#47630) · 908a381d
      HongyuJia 提交于
      * move cudnn hardcode outside GetExpectedKernelType
      
      * add header file
      
      * debug
      
      * update interpreter_util with hardcode
      
      * update interpreter_util headerfile
      
      * solve activation hardcode
      
      * debug with CI
      
      * add mkldnn_op_list header file
      
      * temporarily uncomment mkldnn
      
      * temporarily uncomment mkldnn
      
      * delete sequence_softmax cudnn hardcode
      
      * add hardcode to data_transfer.cc
      
      * update data_transfer headerfile
      
      * try fix segment fault
      
      * update cudnn&miopen_helper
      
      * reset HasAttr of DygraphExctnCtx
      
      * debug, this commit should pass all CI
      
      * debug should pass CI, temporarily disable activation
      
      * debug should pass CI
      
      * fix default_attr=nullptr bug
      
      * clean debug code
      
      * Call SetDnnFallback function in the base class
      
      * activation fallback to plain kernel
      
      * fix default GetExpectedKernelType find wrong kernel
      
      * search cudnn kernel instead of fallback
      
      * fix cudnn_handle bug
      
      * remove tanh use_cudnn
      
      * restore tanh use_cudnn
      
      * debug tanh
      
      * fix tanh bug
      
      * delete activation cudnn kernel
      
      * polish code
      908a381d
  35. 03 11月, 2022 1 次提交