1. 30 8月, 2022 9 次提交
  2. 29 8月, 2022 13 次提交
  3. 26 8月, 2022 8 次提交
  4. 25 8月, 2022 7 次提交
    • A
      [OpAttr]axis of Reverse Support Tensor type (#45391) · 91110661
      Aurelius84 提交于
      * [OpAttr]axis of Reverse Support Tensor type
      
      * fix coverage
      
      * fix unittest
      91110661
    • A
      [OpAttr]min/max of uniform_random support Tensor type (#45417) · c8955d0d
      Aurelius84 提交于
      * [OpAttr]min/max of Uniform_rand support Tensor type
      
      * fix typo
      c8955d0d
    • K
      Transfer memcpy d2h from fluid to phi (#45150) · 0d14e74a
      kangguangli 提交于
      * transfer memcpy_d2h from fluid to phi
      
      * refine arg check and add comment
      
      * fix cannot fallback to phi kernel
      
      * fix gpu_context host alloc when tensor size = 0
      
      * add kernel for std::vector<DenseTensor> args
      
      * fix bugs in MemcpyD2HMultiIOKernel
      
      * remove useless header file
      
      * polish format
      
      * fix typo
      
      * add testcase for cudapinned place
      
      * refine check condition in test
      
      * polish error message
      
      * polish error message
      
      * remove header in fluid  directory
      
      * merge memcpy_h2d and memcpy_d2h into one file, change register method to simplify implementation
      
      * fix code style check
      0d14e74a
    • R
      [NPU] add run_program_op_npu (#45349) · 64afa638
      ronnywang 提交于
      * [NPU] add run_program_op_npu
      
      * add run_program_op_npu ut
      64afa638
    • H
      optimize conv algo cache (#41891) · 1cd7e68b
      hong 提交于
      * optimizer conv alog speed
      
      * code polish
      
      * remove useless code
      
      * fix compile error
      
      * fix cpu compile error
      
      * not use cudnn alog t
      
      * add search cache max number
      
      * polish code
      
      * fix cache test bug
      
      * add groups data format to conv args
      
      * fix cache test bug
      
      * fix cudnn_deterministic bug
      
      * fix test switch auto tune bug
      
      * fix test swith autotune bug;
      
      * fix conv cache bug
      
      * fix cache test error
      
      * fix cache test bug
      
      * fix windows mac compile error
      
      * fix workspace search error
      
      * update cudnn cache
      
      * fix cache test bug; test=develop
      
      * fix autotune swith test error
      
      * polish code
      
      * oplish code
      1cd7e68b
    • R
      [triu_indices] add triu_indices_op (#45168) · a410c397
      Rayman 提交于
      a410c397
    • U
      fix roi_align_op_npu to pass the unittest (#45310) · 256bf6ff
      USTCKAY 提交于
      256bf6ff
  5. 24 8月, 2022 3 次提交
    • L
      make tensor_util contains no cuda code (#45256) · 78916a7a
      Leo Chen 提交于
      * make tensor_util contains no cuda code
      
      * refine isfinite
      
      * revert ut
      
      * move isfinite function to its op
      
      * fix test
      
      * fix compile
      
      * std::isnan is not defined for int type on windows
      
      * fix windows compile
      
      * fix fp16
      
      * fix rocm compile
      
      * revert gradient node
      78916a7a
    • W
      Adapt tensor axis for cumsum (#45372) · 7f49b9ba
      WangZhen 提交于
      7f49b9ba
    • M
      Support fp16 of adam operator in xpu environment (#45292) · a012d426
      mengqingchun02 提交于
      * support beam_search operator on xpu. test=kunlun
      
      * support beam_search operator on xpu. test=kunlun
      
      * support beam_search operator on xpu. test=kunlun
      
      * support beam_search operator on xpu. test=kunlun
      
      * support beam_search operator on xpu. test=kunlun
      
      * support fp16 of adam operator in xpu environment. test=kunlun
      
      * support fp16 of adam operator in xpu environment. test=kunlun
      
      * support fp16 of adam operator in xpu environment. test=kunlun
      a012d426