1. 29 11月, 2022 1 次提交
    • Y
      [cherry-pick] updating mul and matmul with set_mem_desc and fix... · 9e2ba9b9
      yeliang2258 提交于
      [cherry-pick] updating mul and matmul with set_mem_desc and fix squeeze_transpose for MKLDNN (#47951)
      
      * Fix slice bugs in MKLDNN when input dims are zeros (#46671)
      
      * fix slice bugs
      
      * fix
      
      * update code
      
      * fix
      
      * update code
      
      * updating mul and matmul with set_mem_desc (#45624)
      
      * - mul & matmul changes
      
      - fix
      
      - bs16 correction of strides
      
      * - cosmetic fixes
      
      * - lint
      
      * - fix
      
      * - fix
      
      * - format -> mem_desc
      
      * - fix
      
      * - fix
      
      * - fix
      
      * - fix
      
      * - fix
      
      * fix squueze_transpose (#47911)
      Co-authored-by: NJacek Czaja <jacek.czaja@intel.com>
      9e2ba9b9
  2. 25 11月, 2022 1 次提交
  3. 10 11月, 2022 1 次提交
  4. 07 11月, 2022 2 次提交
    • P
      【Cherry-pick PR47666】add cudnn error if compiled cudnn version is incompatible... · 764cea0c
      pangyoki 提交于
      【Cherry-pick PR47666】add cudnn error if compiled cudnn version is incompatible with installed cudnn version (#47673)
      
      * Cherry-pick PR47666, add cudnn error (#47666)
      
      * [CherryPick] Cherry pick #45916 #46031 #47299  (#47610)
      
      * [ Dy2Static ] Fix bugs when select inputs meeting different shape or undefined-var (#45916)
      
      * fix select_input with different shape errors:
      1. select_input_with_buildin_type directly return non-undefinedvar branch when meeting undefined var
      2. the output shape of select_input is inferred from inputs.
      
      * reverse the logic in select_input
      
      * [warning] added warning message in cond block when one branch returns variable and another returns None (#46031)
      
      * [cherry-pick] Allow manaully set py_reader name in standalone executor (#45898) (#45931)
      
      * Allow manaully set py_reader name in standalone executor
      
      * [BugFix] while cond receives dict as input (#47299)
      
      * fix bugs while cond receives dict as input
      
      * add unittest
      
      * change flatten -> _is_sequence_except_dict
      
      * code format
      Co-authored-by: Nfeifei-111 <wuzhanfei@baidu.com>
      Co-authored-by: Nxiongkun <xiongkun03@baidu.com>
      Co-authored-by: Nfeifei-111 <wuzhanfei@baidu.com>
      764cea0c
    • Z
      Revert "SparseConv support duplicate coordinates (#44976)" (#45202) (#47699) · 7145db6e
      zhangkaihuo 提交于
      Revert SparseConv support duplicate coordinates
      7145db6e
  5. 03 11月, 2022 1 次提交
  6. 02 11月, 2022 1 次提交
  7. 01 11月, 2022 1 次提交
    • Z
      [cherry-pick][code-gen] Support code-gen for opmaker of sparse op (#46993) (#47417) · 601626ac
      zyfncg 提交于
      * support generating code of opmaker for backward op invoke forward op (#46912)
      
      * [code-gen] Support code-gen for opmaker of sparse op (#46993)
      
      * support generating code of opmaker for backward op invoke forward op
      
      * gsupport code-gen of opmaker for sparse op
      
      * refind logic of choose phi kernrel
      
      * fix complie budg
      
      * fix code_gen bug
      
      * fix bug
      
      * fix kernel signature code-gen
      
      * fix complie bug of VarType
      
      * fix complie bug of VarType
      
      * fix test_sparse_conv_op
      
      * fix test_sparse_norm_op
      
      * [Phi] Refactor logic of judging whether having a phi kernrel (#46920)
      
      * refind logic of choose phi kernrel
      
      * fix complie budg
      
      * update cmake
      601626ac
  8. 28 10月, 2022 1 次提交
  9. 27 10月, 2022 1 次提交
  10. 26 10月, 2022 2 次提交
  11. 25 10月, 2022 1 次提交
    • Z
      [Sparse] Fix indices (#47190) (#47226) · 942ab42f
      zhangkaihuo 提交于
      当前无法从Tensor中获取到SparseTensor的sparse_dim,无法准确推断出indices的shape,所以目前先以3D点云模型为主,输入的SparseTensor的维度是5D的,其中非零元素是一维向量,所以indices是[4, -1]。
      942ab42f
  12. 24 10月, 2022 1 次提交
  13. 21 10月, 2022 1 次提交
  14. 20 10月, 2022 4 次提交
  15. 19 10月, 2022 2 次提交
  16. 18 10月, 2022 2 次提交
  17. 17 10月, 2022 3 次提交
    • Z
      [cherry-pick]Sparse static graph (#46838) · 10225d22
      zhangkaihuo 提交于
      cherry-pick : #46322, #46245
      Sparse API 支持静态图
      10225d22
    • Z
      Optimize performance of depthwise_conv (#46896) · 976af0da
      Zhang Zheng 提交于
      Optimize performance of depthwise_conv
      
      Config: input[2048, 1024, 4, 4], filter[1024, 1, 4, 4], stride=1, pad=0, dilation=1
      976af0da
    • Z
      [Cherry-Pick]Move valid check from python to kernel (#46980) · 8bfd45ad
      Zhang Zheng 提交于
      为了提升性能,将label的边界检查从python端转移到kernel内,减少额外op的调用,如min、max和同步拷贝等
          当前的模板参数IgnoreIndex仅在ignore_index取值范围在[0, dim)时才生效,但是当某个label值超出了边界,ignore_index等于该label,这种情况下是应该仍然能正常计算。虽然当前的计算逻辑在结果上不会出错,但逻辑上仍是有问题的,且模板参数IgnoreIndex是没有必要的
      8bfd45ad
  18. 13 10月, 2022 2 次提交
  19. 12 10月, 2022 1 次提交
  20. 11 10月, 2022 6 次提交
  21. 10 10月, 2022 5 次提交