1. 21 Jul, 2023 1 commit
  2. 12 Jul, 2023 1 commit
  3. 05 Jul, 2023 1 commit
    • [NewIR] Fix tensor attribute translator bug (#55129) · bf92ccc7
      Committed by hong
      * support optional input in new_ir
      
      * polish code
      
      * add coverage test
      
      * update
      
      * update
      
      * add unit test

      * remove duplicate code

      * update
      
      * fix assign error
      
      * revert test arg min max
      
      * update
      
      * fix bug
      
      * polish code
  4. 05 Jun, 2023 1 commit
  5. 31 Mar, 2023 1 commit
  6. 30 Mar, 2023 1 commit
  7. 28 Mar, 2023 1 commit
    • Add basic functionalities to support Scalar & Scalars in op attr (#51984) · 2e9fd5e4
      Committed by Feiyu Chan
      Add basic functionalities to support Scalar & Scalars in operator attribute.
      
      1. extend the allowed operator attribute types with `paddle::experimental::Scalar` and add the corresponding protobuf message types;
      2. enhance Scalar with formatting and equality;
      3. add code to handle Scalar & Scalars in opmaker, conversion from paddle operator to phi kernel, opdesc construction and manipulation, tensorrt converter, tracer, operator construction, etc;
      4. bind `paddle::experimental::Scalar` to python as `libpaddle.Scalar`;
      5. add functionality to canonicalize an attribute map according to its OpProto (if the op the attribute map is used for has an OpProto);
      6. add code to manipulate the Scalar proto message via the protobuf python API;
      
      Add unittests.
      
      1. add test cases for formatting and equality of Scalars, and for WrapAsScalars;
      2. add test cases for 'casting' between different morphs of attributes;
      3. add test cases for extracting scalar & scalars from attribute;
      4. add test cases for CanonicalizeScalarAttrs (and fix a bug in type index offset);
      5. fix gmock's library filename on the Windows platform;
      6. clean code: use canonicalize_attrs instead of inlining the function;
      7. add test cases for libpaddle.Scalar in python code;
      8. add test cases for `make_scalar_proto`, which manipulates the proto message `Scalar` via the protobuf python API.
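The idea of canonicalizing an attribute map against an op's declared attribute types can be illustrated with a small sketch. This is not Paddle's implementation: the `Scalar` class and the `canonicalize_attrs` signature below are hypothetical stand-ins, and the set of scalar-typed attribute names is assumed to come from something like an OpProto.

```python
from dataclasses import dataclass
from typing import Any, Dict, Set, Union

Number = Union[bool, int, float, complex]


@dataclass(frozen=True)
class Scalar:
    """Hypothetical stand-in for a typed scalar attribute value."""

    value: Number

    def __str__(self) -> str:
        # Formatting shows the Python type of the wrapped value.
        return f"Scalar({type(self.value).__name__}: {self.value})"


def canonicalize_attrs(attrs: Dict[str, Any], scalar_attr_names: Set[str]) -> Dict[str, Any]:
    """Wrap plain numbers in Scalar for attributes declared as scalar-typed.

    `scalar_attr_names` plays the role of the op's declared attribute types
    (an assumption made for this sketch). Other attributes pass through.
    """
    result: Dict[str, Any] = {}
    for name, value in attrs.items():
        if name in scalar_attr_names and not isinstance(value, Scalar):
            if isinstance(value, (list, tuple)):
                # A list attribute becomes a list of Scalars.
                value = [v if isinstance(v, Scalar) else Scalar(v) for v in value]
            else:
                value = Scalar(value)
        result[name] = value
    return result


attrs = {"value": 1.5, "shape": [2, 3], "axes": [0, 1]}
canon = canonicalize_attrs(attrs, {"value", "axes"})
print(canon["value"])
```

Only the attributes named as scalar-typed are wrapped, so applying the function to its own output leaves the map unchanged.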
  8. 20 Mar, 2023 1 commit
    • [Hackathon NO.71] Add pad3d operator to Paddle-TRT (#50986) · c36e3fd2
      Committed by Sonder
      * update codes about pad3d
      
      * add codes about Tensor type Padding
      
      * update
      
      * update unit test file
      
      * format code style
      
      * update `and` to `&&`
      
      * rewrite codes about pad3d
      
      * add codes about converting paddle pad format to tensorrt pad format
      
      * fix some errors
      
      * specify the TRT version range

      * fix dims initialization
      
      * fix code style
      
      * update test pad values
      
      * specify the TRT version for pad3d

      * update the unit test file range

      * update unit test file
      
      * update pad3d paddings convert codes
      
      * update pad3d
      
      * add static mode support
      
      * update test file
      
      * fix bugs about dynamic mode test codes
      
      * fix bug and add limit in op_teller

      * use a new padding conversion method (ITensor* padding, using Slice to split the pre_pad and the post_pad)

      * fix PADDLE_THROW grammar error

      * update test codes

      * add size check for Tensor padding
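The padding conversion mentioned in this commit can be sketched in a few lines. The sketch assumes that pad3d paddings are ordered `[left, right, top, bottom, front, back]` (width first, then height, then depth) and that the target wants separate pre and post paddings in depth, height, width order. The function name `split_pad3d_paddings` is hypothetical and this is not the converter's actual code.

```python
from typing import List, Sequence, Tuple


def split_pad3d_paddings(paddings: Sequence[int]) -> Tuple[List[int], List[int]]:
    """Split interleaved (before, after) paddings into pre and post lists.

    Assumed input order:  [left, right, top, bottom, front, back]
    Assumed output order: (D, H, W) for both pre_pad and post_pad
    """
    if len(paddings) != 6:
        raise ValueError(f"pad3d expects 6 padding values, got {len(paddings)}")
    # Even positions are the 'before' pads, odd positions the 'after' pads.
    # Reversing moves from (W, H, D) order to (D, H, W) order.
    pre_pad = list(paddings[0::2])[::-1]
    post_pad = list(paddings[1::2])[::-1]
    return pre_pad, post_pad


pre, post = split_pad3d_paddings([1, 2, 3, 4, 5, 6])
print(pre, post)
```

The strided slices play the role that the Slice-based split plays for a tensor-valued padding input: the same indexing works whether the six values are known at build time or only at run time.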
  9. 06 Feb, 2023 1 commit
  10. 21 Dec, 2022 1 commit
    • Refactor Pass for fused_conv (#48848) · 7f0eb2e3
      Committed by zyfncg
      * refactor conv_activation_mkldnn_fuse_pass
      
      * refactor conv_affine_channel_mkldnn_fuse_pass
      
      * fix conv_activation_mkldnn_fuse_pass
      
      * fix mkldnn unittest
      
      * refactor int8_scale_calculation_mkldnn_pass and params_quantization_mkldnn_pass
      
      * refactor conv_elementwise_add_mkldnn_fuse_pass
      
      * fix quant
      
      * refactor conv_bn_fuse_pass
      
      * fix conv_bn_fuse_pass
      
      * refactor depthwise_conv_bn_fuse_pass
      
      * fix unittest
      
      * fix conv_bn_fuse_pass
      
      * remove redundant conv2d in params_quantization_mkldnn_pass
      
      * fix params_quantization_mkldnn_pass_tester
  11. 06 Dec, 2022 1 commit
    • Clear extra input (Bias, ResidualData) in OpMaker of conv2d (#47579) · 0a2dfa38
      Committed by zyfncg
      * delete Bias and ResidualData in OpMaker of conv2d
      
      * delete extra input of conv3d
      
      * refactor pass of conv_bias_fusion
      
      * fix mkldnn dependency
      
      * fix mkldnn compile
      
      * fix test_conv_bias_mkldnn_fuse_pass
      
      * polish some code
      
      * remove useless log
      
      * fix analyzer_vit_ocr_tester
      
      * fix conv_activation_mkldnn_fuse_pass
      
      * fix test_analyzer_ocr
      
      * add fused_conv_sig
      
      * fix performance regression
      
      * fix performance regression
  12. 01 Nov, 2022 1 commit
  13. 31 Oct, 2022 1 commit
  14. 26 Oct, 2022 1 commit
  15. 12 Oct, 2022 1 commit
  16. 10 Oct, 2022 1 commit
    • [PHI] Add RNN yaml (#46812) · ab60fd8b
      Committed by YuanRisheng
      * add yaml entry for rnn and rnn_grad, move infershape function for rnn_grad to phi infer meta

      * WIP: move rnn kernel to phi

      * Change the code generation to avoid converting from an initializer list to a tuple of heterogeneous types.
      This is only triggered when an api has intermediate outputs and the outputs are of heterogeneous types.

      * fix the bug that when none of the tensors in a vector requires gradient, the conversion from InferShapeContext to InferMetaContext (a.k.a. BuildInferMetaContext) produces erroneous results.
      
      * fix ci bugs
      
      * fix ci bugs
      
      * fix ci bugs
      
      * modify code according to comment
      Co-authored-by: chenfeiyu <chenfeiyu@baidu.com>
  17. 19 Sep, 2022 1 commit
  18. 15 Sep, 2022 1 commit
  19. 08 Sep, 2022 1 commit
  20. 06 Sep, 2022 1 commit
  21. 05 Sep, 2022 1 commit
  22. 30 Aug, 2022 1 commit
    • Remove extra attribute in OpMaker (#44310) · fe321f9a
      Committed by zyfncg
      * add runtime config in phi
      
      * add runtime attr for op desc and op
      
      * fix no proto error
      
      * adjust opdesc set_attr impl
      
      * try to remove conv_op extra attrs
      
      * add init runtime attr map
      
      * change extra header path
      
      * fix runtime_attr
      
      * fix trace_op
      
      * fix bug of pass
      
      * fix merge conflict
      
      * fix dygraph attrs
      
      * fix bug of pass
      
      * fix dygraph bug
      
      * fix unittest module
      
      * delete extra attr default
      
      * fix dropout kernel
      
      * polish code
      
      * fix extra output of instance_norm
      
      * fix merge conflict
      
      * fix op_desc bug
      
      * add extra attr in yaml for conv3d_transpose
      
      * don't remove extra input and output
      
      * fix save_inference_model
      
      * fix bug of batch_norm
      
      * revert some change
      
      * polish log
      
      * polish code
      
      * add code comment
      Co-authored-by: Chen Weihang <chenweihang@baidu.com>
  23. 25 Aug, 2022 1 commit
  24. 15 Aug, 2022 1 commit
    • [Auto Parallel] Move the distributed info from python to c++ (#44510) · a52357fe
      Committed by Yulong Ao
      * [Auto Parallel] Move the distributed info from python to c++
      
      * [Auto Parallel] Add dist_attrs for VarDesc and OpDesc
      
      * [Auto Parallel] Add the lost file
      
      * [Auto Parallel] Make the dist attr be unique_ptr
      
      * [Auto Parallel] Add the proto conversion
      
      * [Auto Parallel] Improve the proto support
      
      * [Auto Parallel] Fix the bugs for adding a device or a link
      
      * [Auto Parallel] Add the C++ ProcessMesh and DistributedMapper
      
      * [Auto Parallel] Improve the impl of these dist attrs
      
      * [Auto Parallel] Pybind11 ProcessMesh and DeviceMesh
      
      * [Auto Parallel] Fix the unittest problem
      
      * [Auto Parallel] Explicitly add the src file for auto_parallel target
      
      * [Auto Parallel] Add the proto dependency explicitly
      
      * [Auto Parallel] Fix the cmake bug on windows and mac
      
      * [Auto Parallel] Remove the pybind11 header file in process_mesh.h
      
      * [Auto Parallel] Remove unused codes
      
      * [Auto Parallel] Check whether the dist attr is null
      
      * [Auto Parallel] Implement the assign operator for OpDesc explicitly
  25. 13 Aug, 2022 1 commit
    • Refine program cache (#45005) · e96dae8b
      Committed by Leo Chen
      * add cached_serialize_str_
      
      * support program hash
      
      * add sha
      
      * add ut
      
      * use hash_str only for new_exe
      
      * fix attr order
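The combination of a cached serialized string, a program hash, and a fixed attribute order can be illustrated with a toy model. Everything here is an assumption made for illustration: the `Program` class is hypothetical, JSON stands in for the real serialization format, and SHA-256 is chosen only because the commit message does not say which SHA variant is used.

```python
import hashlib
import json
from typing import Any, Dict, List, Optional, Tuple


class Program:
    """Hypothetical, simplified program: an ordered list of (op_type, attrs)."""

    def __init__(self) -> None:
        self._ops: List[Tuple[str, Dict[str, Any]]] = []
        self._cached_serialize_str: Optional[str] = None

    def append_op(self, op_type: str, attrs: Dict[str, Any]) -> None:
        self._ops.append((op_type, dict(attrs)))
        # Any mutation invalidates the cached serialization.
        self._cached_serialize_str = None

    def serialize(self) -> str:
        if self._cached_serialize_str is None:
            # sort_keys fixes the attribute order, so two programs that differ
            # only in attribute insertion order serialize identically.
            self._cached_serialize_str = json.dumps(
                [[op_type, attrs] for op_type, attrs in self._ops],
                sort_keys=True,
            )
        return self._cached_serialize_str

    def hash_str(self) -> str:
        return hashlib.sha256(self.serialize().encode("utf-8")).hexdigest()


p1, p2 = Program(), Program()
p1.append_op("conv2d", {"stride": 1, "pad": 0})
p2.append_op("conv2d", {"pad": 0, "stride": 1})
print(p1.hash_str() == p2.hash_str())
```

Without a deterministic attribute order, equivalent programs could hash differently and miss the cache, which is one plausible reading of the "fix attr order" step.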
  26. 10 Aug, 2022 2 commits
  27. 19 Jul, 2022 1 commit
  28. 04 Jul, 2022 2 commits
  29. 28 Jun, 2022 1 commit
    • Remove boost::variant (#43100) · b3cf28f8
      Committed by Ruibiao Chen
      * boost::variant -> paddle::variant
      
      * boost::variant.apply_visit -> paddle::visit
      
      * Update pybind_boost_headers.h
      
      * Fix CINN compilation errors
      
      * Revert FetchResultType
  30. 26 Jun, 2022 1 commit
  31. 05 Jun, 2022 1 commit
  32. 27 Apr, 2022 1 commit
  33. 26 Apr, 2022 1 commit
  34. 25 Apr, 2022 1 commit
    • Optimize dygraph InferShape perf (#42155) · 6721376b
      Committed by Chen Weihang
      * init commit
      
      * remove two hash impl
      
      * fix bug
      
      * polish details
      
      * fix compile failed
      
      * fix compile failed
      
      * fix compile failed
      
      * add default kernel sig cache
      
      * fix get kernel arg defs error
      
      * remove kernel arg defs cache
      
      * fix origin op execute
  35. 17 Apr, 2022 1 commit
    • [Perf] Optimize dygraph scheduling performance (#41696) · 7ee31a96
      Committed by Chen Weihang
      * split phi and fluid infermeta context
      
      * resolve conflict
      
      * fix type error
      
      * optimize scheduling perf
      
      * spec small vector size
      
      * replace all grad var name
      
      * fix test failed
      
      * move init default signature
      
      * polish details
      
      * polish details
      
      * fix no init bug
      
      * init sig for tests
      
      * add init sig for infer
      
      * fix infrt error
      
      * fix infrt failed
      
      * fix kunlun error
      
      * fix infrt failed
  36. 13 Apr, 2022 1 commit
  37. 04 Apr, 2022 1 commit
    • Add dropout yaml (#41355) · 1c7001e7
      Committed by hong
      * add dropout slice yaml
      
      * remove useless code
      
      * fix infer shape error
      
      * skip infrt compile for dropout
  38. 20 Feb, 2022 1 commit