1. 01 11月, 2022 1 次提交
    • Z
      [cherry-pick][code-gen] Support code-gen for opmaker of sparse op (#46993) (#47417) · 601626ac
      zyfncg 提交于
      * support generating code of opmaker for backward op invoke forward op (#46912)
      
      * [code-gen] Support code-gen for opmaker of sparse op (#46993)
      
      * support generating code of opmaker for backward op invoke forward op
      
      * gsupport code-gen of opmaker for sparse op
      
      * refind logic of choose phi kernrel
      
      * fix complie budg
      
      * fix code_gen bug
      
      * fix bug
      
      * fix kernel signature code-gen
      
      * fix complie bug of VarType
      
      * fix complie bug of VarType
      
      * fix test_sparse_conv_op
      
      * fix test_sparse_norm_op
      
      * [Phi] Refactor logic of judging whether having a phi kernrel (#46920)
      
      * refind logic of choose phi kernrel
      
      * fix complie budg
      
      * update cmake
      601626ac
  2. 17 10月, 2022 1 次提交
  3. 27 9月, 2022 1 次提交
    • Z
      [cherry-pick] clear extra attrs of some ops in OpMaker (#45845, #45984, 46060) (#46218) · 0cc2251f
      zyfncg 提交于
      * Clear extra attrs of elementwise op in OpMaker (#45845)
      
      * clear extra attrs of elementwise op in opmaker
      
      * fix op_debug_string_test
      
      * fix bug of grad_add
      
      * fix sort of runtime attrs
      
      * Clear extra attrs of scale in OpMaker (#45984)
      
      * clear extra attr of scale in opmaker
      
      * fix sum bug
      
      * fix merge conflict
      
      * fix minus
      
      * Clear extra attributes of some Op in OpMaker (Part4) (#46060)
      
      * clear extra attr of some ops in opmaker
      
      * revert clear use_cudnn for pool
      
      * fix test_operator_desc
      
      * fix Attr interface of OperatorBase
      
      * fix code stype
      0cc2251f
  4. 20 9月, 2022 1 次提交
  5. 08 9月, 2022 1 次提交
  6. 01 9月, 2022 1 次提交
  7. 30 8月, 2022 1 次提交
    • Z
      Remove extra attribute in OpMaker (#44310) · fe321f9a
      zyfncg 提交于
      * add runtime config in phi
      
      * add runtime attr for op desc and op
      
      * fix no proto error
      
      * adjust opdesc set_attr impl
      
      * try to remove conv_op extra attrs
      
      * add init runtime attr map
      
      * change extra header path
      
      * fix runtime_attr
      
      * fix trace_op
      
      * fix bug of pass
      
      * fix merge conflict
      
      * fix dygraph attrs
      
      * fix bug of pass
      
      * fix dygraph bug
      
      * fix unittest module
      
      * delete extra attr default
      
      * fix dropout kernel
      
      * polish code
      
      * fix extra output of instance_norm
      
      * fix merge confilct
      
      * fix op_desc bug
      
      * add extra attr in yaml for conv3d_transpose
      
      * don't remove extra input and output
      
      * fix save_inference_model
      
      * fix bug of batch_norm
      
      * revert some change
      
      * polish log
      
      * polish code
      
      * add code comment
      Co-authored-by: NChen Weihang <chenweihang@baidu.com>
      fe321f9a
  8. 14 8月, 2022 1 次提交
  9. 10 8月, 2022 1 次提交
  10. 02 8月, 2022 1 次提交
  11. 01 8月, 2022 1 次提交
    • L
      unify gpu context (#44740) · 86763023
      Leo Chen 提交于
      * remove cudaDeviceContext
      
      * remove more template
      
      * fix rocm compile
      
      * remove alias name CUDADeviceContext
      
      * fix compile
      
      * fix tests
      
      * revert changes
      86763023
  12. 20 7月, 2022 1 次提交
  13. 19 7月, 2022 1 次提交
  14. 06 7月, 2022 1 次提交
  15. 05 7月, 2022 1 次提交
  16. 26 6月, 2022 1 次提交
  17. 24 6月, 2022 1 次提交
  18. 05 6月, 2022 1 次提交
  19. 30 5月, 2022 1 次提交
  20. 27 4月, 2022 1 次提交
  21. 26 4月, 2022 1 次提交
  22. 25 4月, 2022 2 次提交
  23. 24 4月, 2022 1 次提交
  24. 17 4月, 2022 1 次提交
    • C
      [Perf] Optimize dygraph scheduling performance (#41696) · 7ee31a96
      Chen Weihang 提交于
      * split phi and fluid infermeta context
      
      * resolve conflict
      
      * fix type error
      
      * optimize scheduling perf
      
      * spec small vector size
      
      * replace all grad var name
      
      * fix test failed
      
      * move init defalut signature
      
      * polish details
      
      * polish details
      
      * fix no init bug
      
      * init sig for tests
      
      * add init sig for infer
      
      * fix infrt error
      
      * fix infrt failed
      
      * fix kunlun error
      
      * fix infrt failed
      7ee31a96
  25. 01 4月, 2022 1 次提交
  26. 31 3月, 2022 1 次提交
  27. 27 3月, 2022 1 次提交
  28. 25 3月, 2022 1 次提交
  29. 17 3月, 2022 1 次提交
    • C
      [Phi] Move assign kernel into phi (#40022) · 1904572a
      Chen Weihang 提交于
      * move assign kernel init commit
      
      * change vec<tensor> to vec<tensor*>
      
      * support tensor array
      
      * support api declare
      
      * fix test_list failed
      
      * fix npu and xpu failed
      
      * fix infrt failed
      
      * remove assign array size in operator
      
      * move assign sr header into sr dir
      
      * add infermeta for assign
      
      * test op success
      
      * fix test_list failed
      
      * fix kunlun failed
      
      * add set host allocator in tests
      
      * support tensor array in arg ctx
      
      * open set layout in share_meta
      
      * fix meta tensor layout error
      
      * fix test failed
      1904572a
  30. 09 3月, 2022 1 次提交
  31. 01 3月, 2022 1 次提交
    • C
      [Phi]rm reduce infershape (#39820) · 09039636
      chentianyu03 提交于
      * modify infershape utils and rm reduce infershape
      
      * merge develop
      
      * fix infermete bug
      
      * add IsForInferShape func in ArgumentMappingContext
      
      * add reduce_mean infermeta
      
      * modify annotation
      
      * add default dims
      09039636
  32. 28 2月, 2022 1 次提交
  33. 20 2月, 2022 1 次提交
  34. 19 2月, 2022 1 次提交
    • A
      [Pten]Unify paddle/pten::framework::ddim into pten::ddim (#39614) · 2fe04264
      Aurelius84 提交于
      * Unify paddle/pten::framework::ddim into pten::ddim
      
      * fix paddle namespace
      
      * compile sucessfully
      
      * fix npu src file
      
      * fix conflict
      
      * fix conflict
      
      * fix tensorrt compiler error
      
      * fix conflict
      
      * fix conflict
      
      * fix tesst file conflict
      
      * fix conflict
      
      * fix mlu file conflict
      
      * fix mlu file conflict
      
      * fix cinn header file conflict
      
      * fix conflict
      
      * fix conflict
      
      * fix conflict
      
      * fix conflict
      2fe04264
  35. 15 2月, 2022 1 次提交
    • A
      [PTen]Migrate proto::VarType outside of Pten (#39411) · 7e7e9404
      Aurelius84 提交于
      * #1 migrate dist-related type()-> dtype()
      
      * move datatype function from pten -> fluid/framework
      
      * change type() in imperative into convert(dtype())
      
      * modify xx_tensor->type into xx_tensor->dtype
      
      * change the set_type interface and the caller
      
      * modify xx_tensor.type into xx_tensor.dtype
      
      * fix mutable_data(place, dtype())
      
      * change caller of mutable_data in pten and distributed
      
      * change the caller of mutable_data in fluid/framework
      
      * change the caller of mutable_data in imperative directory
      
      * mutable_data: inference
      
      * update the call of mutable_data
      
      * transfer MakePenScalarArray MakePtenScalar ResetHolderWithType
      
      * pass the compile. the next step is remove VarType in Pten
      
      * fix all and remove VarType from pten. success in linux. Next task is other platform
      
      * fix conflict with develop
      
      * fix compiled error
      
      * Fix reset conversion
      
      * fix conflict
      
      * fix compiled problem
      
      * fix typo
      
      * Fix << in tensor_utils.cc
      
      * fix type->dtype
      
      * fix unittest
      
      * fix tensor init constructor
      
      * fix DataTypeSize for BFloat16
      
      * fix code style
      
      * fix npu compiled error
      
      * fix npu
      
      * compile npu sucessfully
      
      * fix conflict
      
      * fix conflict
      Co-authored-by: Nxiongkun <xiongkun03@baidu.com>
      7e7e9404
  36. 14 2月, 2022 1 次提交
  37. 11 2月, 2022 1 次提交
  38. 30 1月, 2022 1 次提交
  39. 29 1月, 2022 1 次提交
    • L
      Add xpu2 compiler (#37254) · 92da5055
      Liu-xiandong 提交于
      * Add XPU compiler for paddle, test=develop
      
      * clean code
      
      * clean useless code
      
      * clean useless code
      
      * clean useless code
      
      * test
      
      * add include path
      
      * use clang compiler
      
      * xpu2.cmake
      
      * XPU2 compiler passed
      
      * update
      
      * update after pten
      
      * combination the WITH_XPU and WITH_XPU2
      
      * update the fuse operation in WITH_XPU and WITH_XPU2
      
      * update
      
      * update
      
      * update
      
      * fix the merge error
      
      * update
      
      * update the code
      
      * update the code
      
      * add run_kp_kernel flag
      
      * update
      
      * update
      
      * fix prepared type_ bug
      
      * clean and update the code
      
      * reset the kernel_primitives
      
      * update
      
      * clean the code
      
      * delete useless comment
      
      * fix the bug in WITH_XPU
      
      * update
      
      * update
      
      * modify the abi
      
      * delete some useless code
      
      * Parameter automation in xpu compilation
      
      * Parameter automation in xpu compilation
      
      * delete kps in cmake
      
      * delete useless comment
      
      * clean the code
      
      * clean the code
      92da5055