1. 04 10月, 2022 1 次提交
  2. 30 9月, 2022 1 次提交
  3. 28 9月, 2022 2 次提交
  4. 27 9月, 2022 1 次提交
  5. 22 9月, 2022 1 次提交
  6. 20 9月, 2022 1 次提交
  7. 19 9月, 2022 2 次提交
  8. 17 9月, 2022 1 次提交
  9. 15 9月, 2022 1 次提交
  10. 08 9月, 2022 1 次提交
  11. 07 9月, 2022 2 次提交
  12. 06 9月, 2022 2 次提交
  13. 05 9月, 2022 1 次提交
    • H
      [phi] Migrate memcpy kernel to PHI, hold NPU op (#45622) · 2f19a364
      HongyuJia 提交于
      * migrate memcpy to phi
      
      * fix typo error
      
      * fix typo error
      
      * fix  bug and testcase
      
      * fix typo, uniform_random_kernel.cc header
      
      * fix Alloc pinned bug
      
      * change GPUContext::GetPinnedPlace
      
      * add GetPinnedPlace function
      
      * add GetPinnedPlace function
      
      * restore default throw error
      
      * fix Unimplemented error
      
      * skip StandaloneExecutor testcase
      
      * delete memcpy_sig
      2f19a364
  14. 04 9月, 2022 1 次提交
  15. 02 9月, 2022 1 次提交
  16. 01 9月, 2022 1 次提交
  17. 30 8月, 2022 2 次提交
    • Z
      Remove extra attribute in OpMaker (#44310) · fe321f9a
      zyfncg 提交于
      * add runtime config in phi
      
      * add runtime attr for op desc and op
      
      * fix no proto error
      
      * adjust opdesc set_attr impl
      
      * try to remove conv_op extra attrs
      
      * add init runtime attr map
      
      * change extra header path
      
      * fix runtime_attr
      
      * fix trace_op
      
      * fix bug of pass
      
      * fix merge conflict
      
      * fix dygraph attrs
      
      * fix bug of pass
      
      * fix dygraph bug
      
      * fix unittest module
      
      * delete extra attr default
      
      * fix dropout kernel
      
      * polish code
      
      * fix extra output of instance_norm
      
      * fix merge confilct
      
      * fix op_desc bug
      
      * add extra attr in yaml for conv3d_transpose
      
      * don't remove extra input and output
      
      * fix save_inference_model
      
      * fix bug of batch_norm
      
      * revert some change
      
      * polish log
      
      * polish code
      
      * add code comment
      Co-authored-by: NChen Weihang <chenweihang@baidu.com>
      fe321f9a
    • K
      fix memcpy_h2d bug related to cuda stream setting when allocate memory (#45450) · 10abdb8f
      kangguangli 提交于
      * fix memcpy_h2d bug related to cuda stream setting when allocate memory
      
      * add header file
      
      * fix compile error for cpu only
      10abdb8f
  18. 25 8月, 2022 2 次提交
    • F
      add support for double attributes (#45390) · efab2eb4
      Feiyu Chan 提交于
      efab2eb4
    • K
      Transfer memcpy d2h from fluid to phi (#45150) · 0d14e74a
      kangguangli 提交于
      * transfer memcpy_d2h from fluid to phi
      
      * refine arg check and add comment
      
      * fix cannot fallback to phi kernel
      
      * fix gpu_context host alloc when tensor size = 0
      
      * add kernel for std::vector<DenseTensor> args
      
      * fix bugs in MemcpyD2HMultiIOKernel
      
      * remove useless header file
      
      * polish format
      
      * fix typo
      
      * add testcase for cudapinned place
      
      * refine check condition in test
      
      * polish error message
      
      * polish error message
      
      * remove header in fluid  directory
      
      * merge memcpy_h2d and memcpy_d2h into one file, change register method to simplify implementation
      
      * fix code style check
      0d14e74a
  19. 24 8月, 2022 1 次提交
  20. 23 8月, 2022 2 次提交
  21. 22 8月, 2022 1 次提交
  22. 19 8月, 2022 1 次提交
  23. 18 8月, 2022 2 次提交
  24. 17 8月, 2022 1 次提交
  25. 16 8月, 2022 2 次提交
  26. 15 8月, 2022 1 次提交
  27. 12 8月, 2022 1 次提交
    • S
      Offload calculations from matmul op to fuse pass (#44941) · acb78ea2
      Sławomir Siwek 提交于
      * remove v2_transpose_reshape
      
      * matmul_transpose_reshape
      
      * reshape_transpose_matmul
      
      * Add int8 support for matmulV2
      
      * restore ut
      
      * adjust old ut
      
      * restore parallel UT ruels
      
      * remove mkldnn code from base ops
      
      * move enforces to pass
      
      * remove duplicated functions
      
      * delete duplicated enforces
      
      * feedback from review
      
      * add comments to variables
      
      * enable eltwise support
      
      * dynamic attribute
      
      * remove fusepass tests from op test
      
      * remove fuse pass cases from op test
      
      * revert introduction of dynamic attributes
      
      * style
      Co-authored-by: Nwozna <joanna.wozna@intel.com>
      acb78ea2
  28. 10 8月, 2022 1 次提交
  29. 09 8月, 2022 1 次提交
  30. 08 8月, 2022 2 次提交