1. 03 Nov, 2022 1 commit
  2. 02 Nov, 2022 1 commit
  3. 01 Nov, 2022 1 commit
    • [cherry-pick][code-gen] Support code-gen for opmaker of sparse op (#46993) (#47417) · 601626ac
      zyfncg committed
      * support generating code of opmaker for backward op invoke forward op (#46912)
      
      * [code-gen] Support code-gen for opmaker of sparse op (#46993)
      
      * support generating code of opmaker for backward op invoke forward op
      
      * support code-gen of opmaker for sparse op
      
      * refine logic of choosing phi kernel
      
      * fix compile bug
      
      * fix code_gen bug
      
      * fix bug
      
      * fix kernel signature code-gen
      
      * fix compile bug of VarType
      
      * fix compile bug of VarType
      
      * fix test_sparse_conv_op
      
      * fix test_sparse_norm_op
      
      * [Phi] Refactor logic of judging whether having a phi kernel (#46920)
      
      * refine logic of choosing phi kernel
      
      * fix compile bug
      
      * update cmake
  4. 28 Oct, 2022 1 commit
  5. 27 Oct, 2022 1 commit
  6. 26 Oct, 2022 2 commits
  7. 25 Oct, 2022 1 commit
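    • [Sparse] Fix indices (#47190) (#47226) · 942ab42f
      zhangkaihuo committed
      The sparse_dim of a SparseTensor cannot currently be obtained from the Tensor, so the shape of indices cannot be inferred accurately. For now, 3D point-cloud models are the main target: the input SparseTensor is 5D and each non-zero element is a 1-D vector, so indices has shape [4, -1].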
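The shape reasoning in the [Sparse] Fix indices commit can be sketched in plain Python. This is an illustration only, not Paddle code: the helper names `coo_indices_shape` and `coo_values_shape` and the example sizes are hypothetical, and -1 stands for the unknown number of non-zero elements.

```python
def coo_indices_shape(dense_shape, sparse_dim, nnz=-1):
    """Shape of the COO indices: one row per sparse dimension, one column per non-zero."""
    if not 0 < sparse_dim <= len(dense_shape):
        raise ValueError("sparse_dim must be in [1, len(dense_shape)]")
    return [sparse_dim, nnz]


def coo_values_shape(dense_shape, sparse_dim, nnz=-1):
    """Shape of the COO values: one entry per non-zero, followed by the dense dimensions."""
    if not 0 < sparse_dim <= len(dense_shape):
        raise ValueError("sparse_dim must be in [1, len(dense_shape)]")
    return [nnz] + list(dense_shape[sparse_dim:])


# Hypothetical 5D point-cloud tensor: [batch, depth, height, width, channels].
shape_5d = [1, 32, 32, 32, 16]
# Each non-zero element is a 1-D vector (the channel axis), so 4 dimensions are sparse.
sparse_dim = len(shape_5d) - 1

print(coo_indices_shape(shape_5d, sparse_dim))  # [4, -1]
print(coo_values_shape(shape_5d, sparse_dim))   # [-1, 16]
```

Because sparse_dim is not available at inference time, the commit fixes the first value to 4, which matches this 5D case but not sparse tensors of other ranks.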
  8. 24 Oct, 2022 1 commit
  9. 21 Oct, 2022 1 commit
  10. 20 Oct, 2022 4 commits
  11. 19 Oct, 2022 2 commits
  12. 18 Oct, 2022 2 commits
  13. 17 Oct, 2022 3 commits
    • [cherry-pick]Sparse static graph (#46838) · 10225d22
      zhangkaihuo committed
      cherry-pick: #46322, #46245
      Sparse API supports static graph
    • Optimize performance of depthwise_conv (#46896) · 976af0da
      Zhang Zheng committed
      Optimize performance of depthwise_conv
      
      Config: input[2048, 1024, 4, 4], filter[1024, 1, 4, 4], stride=1, pad=0, dilation=1
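To make the benchmark configuration in the depthwise_conv commit concrete, the sketch below computes the output shape and group count it implies. It assumes NCHW input and a filter laid out as [C_out, C_in / groups, kH, kW]; the commit does not state the layout, and the function name `conv2d_output_shape` is hypothetical.

```python
def conv2d_output_shape(input_shape, filter_shape, stride=1, pad=0, dilation=1):
    """Return (output_shape, groups) for a 2-D convolution.

    Assumes NCHW input and a filter of shape [C_out, C_in / groups, kH, kW].
    """
    n, c_in, h, w = input_shape
    c_out, c_per_group, kh, kw = filter_shape
    groups = c_in // c_per_group

    def out_size(size, kernel):
        # Standard convolution arithmetic with dilation.
        effective_kernel = dilation * (kernel - 1) + 1
        return (size + 2 * pad - effective_kernel) // stride + 1

    return [n, c_out, out_size(h, kh), out_size(w, kw)], groups


shape, groups = conv2d_output_shape([2048, 1024, 4, 4], [1024, 1, 4, 4])
print(shape)   # [2048, 1024, 1, 1]
print(groups)  # 1024
```

With groups equal to the channel count, each of the 1024 channels is convolved with its own 4x4 filter, and the 4x4 input collapses to a single output value per channel.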
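    • [Cherry-Pick]Move valid check from python to kernel (#46980) · 8bfd45ad
      Zhang Zheng committed
      To improve performance, the bounds check on label is moved from the Python side into the kernel, which avoids calls to extra ops such as min, max, and synchronous copies.
      The current template parameter IgnoreIndex only takes effect when ignore_index lies in [0, dim). However, when a label value is out of bounds and ignore_index equals that label, the computation should still proceed normally. Although the current logic does not produce wrong results, it is still logically flawed, and the template parameter IgnoreIndex is unnecessary.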
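The behavior described in the valid-check commit can be illustrated with a small pure-Python loss. This is a sketch under assumptions, not the Paddle kernel: the commit does not name the op, so the function `hard_label_loss` and its default `ignore_index` are hypothetical. The point is the ordering: the ignore test runs before the bounds test, so an ignored label may lie outside [0, dim).

```python
import math


def hard_label_loss(logits, labels, ignore_index=-100):
    """Per-sample softmax cross entropy with the label check done inside the loop."""
    losses = []
    for row, label in zip(logits, labels):
        # Ignored labels are skipped first, even when they fall outside [0, dim).
        if label == ignore_index:
            losses.append(0.0)
            continue
        # Bounds check performed in the "kernel" rather than by separate min/max ops.
        if not 0 <= label < len(row):
            raise ValueError(f"label {label} is outside [0, {len(row)})")
        peak = max(row)  # subtract the maximum for numerical stability
        log_sum_exp = peak + math.log(sum(math.exp(x - peak) for x in row))
        losses.append(log_sum_exp - row[label])
    return losses


# The second label (5) is out of range for 2 classes but equals ignore_index.
losses = hard_label_loss([[0.0, 0.0], [1.0, 2.0]], [0, 5], ignore_index=5)
print(losses)
```

An out-of-range label that does not match ignore_index raises an error, which stands in for the kernel-side check that replaced the Python-side validation.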
  14. 13 Oct, 2022 2 commits
  15. 12 Oct, 2022 1 commit
  16. 11 Oct, 2022 6 commits
  17. 10 Oct, 2022 5 commits
  18. 29 Sep, 2022 3 commits
  19. 28 Sep, 2022 1 commit
    • [cherry-pick] Clear extra attrs of some ops in OpMaker (#46150, #46321,... · b2e4211d
      zyfncg committed
      [cherry-pick] Clear extra attrs of some ops in OpMaker (#46150, #46321, #46418, #46451, #46457) (#46553)
      
      * Clear extra attributes of some Op in OpMaker (Part4) (#46060)
      
      * clear extra attr of some ops in opmaker
      
      * revert clear use_cudnn for pool
      
      * fix test_operator_desc
      
      * fix Attr interface of OperatorBase
      
      * clear extra attrs of condition op in opmaker (#46150)
      
      * Clear extra attrs of lookup_table_v2 in OpMaker (#46321)
      
      * clear extra attrs of look_up_table_v2 in opmaker
      
      * fix bug
      
      * clear extra attrs of quantize op in opmaker (#46418)
      
      * delete repeated item
      
      * clear extra attrs of distribute op in opmaker (#46451)
      
      * clear extra attrs of sequence_softmax in opmaker (#46457)
  20. 27 Sep, 2022 1 commit