1. 29 Dec, 2022 1 commit
  2. 25 Nov, 2022 1 commit
  3. 01 Nov, 2022 1 commit
    • Z
      [cherry-pick][code-gen] Support code-gen for opmaker of sparse op (#46993) (#47417) · 601626ac
      zyfncg committed
      * support generating code of opmaker for backward op invoke forward op (#46912)
      
      * [code-gen] Support code-gen for opmaker of sparse op (#46993)
      
      * support generating code of opmaker for backward op invoke forward op
      
      * support code-gen of opmaker for sparse op
      
      * refine logic of choosing phi kernel
      
      * fix compile bug
      
      * fix code_gen bug
      
      * fix bug
      
      * fix kernel signature code-gen
      
      * fix compile bug of VarType
      
      * fix compile bug of VarType
      
      * fix test_sparse_conv_op
      
      * fix test_sparse_norm_op
      
      * [Phi] Refactor logic of judging whether having a phi kernel (#46920)
      
      * refine logic of choosing phi kernel
      
      * fix compile bug
      
      * update cmake
  4. 18 Oct, 2022 1 commit
  5. 17 Oct, 2022 1 commit
  6. 20 Sep, 2022 1 commit
  7. 19 Sep, 2022 1 commit
  8. 17 Sep, 2022 1 commit
  9. 08 Sep, 2022 1 commit
  10. 07 Sep, 2022 2 commits
  11. 06 Sep, 2022 2 commits
  12. 05 Sep, 2022 1 commit
    • H
      [phi] Migrate memcpy kernel to PHI, hold NPU op (#45622) · 2f19a364
      HongyuJia committed
      * migrate memcpy to phi
      
      * fix typo error
      
      * fix typo error
      
      * fix bug and testcase
      
      * fix typo, uniform_random_kernel.cc header
      
      * fix Alloc pinned bug
      
      * change GPUContext::GetPinnedPlace
      
      * add GetPinnedPlace function
      
      * add GetPinnedPlace function
      
      * restore default throw error
      
      * fix Unimplemented error
      
      * skip StandaloneExecutor testcase
      
      * delete memcpy_sig
  13. 04 Sep, 2022 1 commit
  14. 02 Sep, 2022 1 commit
  15. 01 Sep, 2022 1 commit
  16. 30 Aug, 2022 2 commits
    • Z
      Remove extra attribute in OpMaker (#44310) · fe321f9a
      zyfncg committed
      * add runtime config in phi
      
      * add runtime attr for op desc and op
      
      * fix no proto error
      
      * adjust opdesc set_attr impl
      
      * try to remove conv_op extra attrs
      
      * add init runtime attr map
      
      * change extra header path
      
      * fix runtime_attr
      
      * fix trace_op
      
      * fix bug of pass
      
      * fix merge conflict
      
      * fix dygraph attrs
      
      * fix bug of pass
      
      * fix dygraph bug
      
      * fix unittest module
      
      * delete extra attr default
      
      * fix dropout kernel
      
      * polish code
      
      * fix extra output of instance_norm
      
      * fix merge conflict
      
      * fix op_desc bug
      
      * add extra attr in yaml for conv3d_transpose
      
      * don't remove extra input and output
      
      * fix save_inference_model
      
      * fix bug of batch_norm
      
      * revert some change
      
      * polish log
      
      * polish code
      
      * add code comment
      Co-authored-by: Chen Weihang <chenweihang@baidu.com>
    • K
      fix memcpy_h2d bug related to cuda stream setting when allocate memory (#45450) · 10abdb8f
      kangguangli committed
      * fix memcpy_h2d bug related to cuda stream setting when allocate memory
      
      * add header file
      
      * fix compile error for cpu only
  17. 25 Aug, 2022 2 commits
    • F
      add support for double attributes (#45390) · efab2eb4
      Feiyu Chan committed
    • K
      Transfer memcpy d2h from fluid to phi (#45150) · 0d14e74a
      kangguangli committed
      * transfer memcpy_d2h from fluid to phi
      
      * refine arg check and add comment
      
      * fix cannot fallback to phi kernel
      
      * fix gpu_context host alloc when tensor size = 0
      
      * add kernel for std::vector<DenseTensor> args
      
      * fix bugs in MemcpyD2HMultiIOKernel
      
      * remove useless header file
      
      * polish format
      
      * fix typo
      
      * add testcase for cudapinned place
      
      * refine check condition in test
      
      * polish error message
      
      * polish error message
      
      * remove header in fluid directory
      
      * merge memcpy_h2d and memcpy_d2h into one file, change register method to simplify implementation
      
      * fix code style check
  18. 24 Aug, 2022 1 commit
  19. 23 Aug, 2022 2 commits
  20. 22 Aug, 2022 1 commit
  21. 19 Aug, 2022 1 commit
  22. 18 Aug, 2022 2 commits
  23. 17 Aug, 2022 1 commit
  24. 16 Aug, 2022 2 commits
  25. 15 Aug, 2022 1 commit
  26. 12 Aug, 2022 1 commit
    • S
      Offload calculations from matmul op to fuse pass (#44941) · acb78ea2
      Sławomir Siwek committed
      * remove v2_transpose_reshape
      
      * matmul_transpose_reshape
      
      * reshape_transpose_matmul
      
      * Add int8 support for matmulV2
      
      * restore ut
      
      * adjust old ut
      
      * restore parallel UT rules
      
      * remove mkldnn code from base ops
      
      * move enforces to pass
      
      * remove duplicated functions
      
      * delete duplicated enforces
      
      * feedback from review
      
      * add comments to variables
      
      * enable eltwise support
      
      * dynamic attribute
      
      * remove fusepass tests from op test
      
      * remove fuse pass cases from op test
      
      * revert introduction of dynamic attributes
      
      * style
      Co-authored-by: wozna <joanna.wozna@intel.com>
  27. 10 Aug, 2022 1 commit
  28. 09 Aug, 2022 1 commit
  29. 08 Aug, 2022 2 commits
  30. 05 Aug, 2022 2 commits
  31. 03 Aug, 2022 1 commit