1. 26 11月, 2021 1 次提交
    • Z
      fix bug of slice_grad using use_mkldnn attr (#37584) · 14fd53d5
      zyfncg 提交于
      slice_grad op在选择kernel过程中出现错误,问题原因是在获取use_mkldnn属性时,map中未找到该键值,所以抛出out_of_range异常
      本PR在map获取use_mkldnn属性数据前增加了是否存在该键值的判断逻辑,从而避免出现上述异常
      14fd53d5
  2. 26 10月, 2021 1 次提交
    • S
      [Cherry-pick] Add FasterTokenizer Operator (#36716) · edff5b79
      Steffy-zxf 提交于
      * Add FasterTokenizer Operator (#34491)
      
      Add Tokenizer related functionalities for Transformer model in order that the process of training and predicting is consistent.
      
      * support the text string as an input Tensor
      * support the "VOCAB"unordered_map<wstring, int> as an input Tensor to lookup tokens
      * Tokenizer used for BERT. This tokenizer applies an end-to-end, text string to wordpiece tokenization.
      * It first applies basic tokenization, followed by wordpiece tokenization.
      
      * optimize fast tokenizer
      
      * remove const_cast
      Co-authored-by: Nzhoushunjie <zhoushunjie@baidu.com>
      Co-authored-by: Nwawltor <fangzeyang0904@hotmail.com>
      edff5b79
  3. 17 9月, 2021 1 次提交
  4. 12 9月, 2021 1 次提交
  5. 08 9月, 2021 1 次提交
  6. 01 9月, 2021 1 次提交
  7. 06 8月, 2021 1 次提交
  8. 03 8月, 2021 1 次提交
  9. 04 6月, 2021 1 次提交
    • W
      fix inference prepare data bug (#33305) · dd181238
      wenbin 提交于
      * fix inference prepare data bug
      
      * rename functions
      
      * typo
      
      * typo
      
      * typo
      
      * UT correct
      
      * correct condition
      
      * correct condition
      
      * ci coverage
      
      * morelines
      
      * fix ci coverage
      dd181238
  10. 17 5月, 2021 1 次提交
  11. 13 5月, 2021 1 次提交
  12. 20 4月, 2021 1 次提交
  13. 09 4月, 2021 1 次提交
    • L
      [NPU] cherry-pick basic NPU components/allocator/operator/executor supports from ascendrc (#32144) · ccf5709d
      Leo Chen 提交于
      * [feature] support npu allocator (#30840)
      
      [feature] support npu allocator
      
      * [feature] support npu operator (#30951)
      
      [feature] support npu operator
      
      * [feature] support npu allocator, part 2 (#30972)
      
      * support npu allocator
      
      * add npu device context
      
      * fix some compile problem
      
      * fix some compile problem
      
      * add npu info
      
      * compile ok
      
      * fix include dir
      
      * support naive_best_fit_allocator
      
      * run ut ok, bug failed to exit
      
      * call aclrtResetDevice before exit
      
      * fix aclFinilize
      
      * add system allocatot test
      
      * add selected_gpus in gtest
      
      * add tensor_test for npu
      
      * support npu op, initial commit
      
      * add npu stream
      
      * add elementwise_add_op
      
      * compile ok
      
      * fix typo
      
      * fix elementwise_add_op_npu_test
      
      * support op run
      
      * test can run but failed
      
      * change aclopExecuteV2 to aclopCompileAndExecute
      
      * support parsing ascend rank table file (#31000)
      
      support parsing ascend rank table file
      
      * Fix reshape on GE graph. (#31084)
      
      Fix reshape on GE graph
      
      * add npu kernel for elementwise_sub and elementwise_sub_grad (#30973)
      
      * add npu sub op
      
      * fix typo
      
      * rename test
      
      * fix bug
      
      * fix bug
      
      * add fp16 kernel
      
      * fix typo
      
      * support sub grad op
      
      * support elementwise_sub_grad op
      Co-authored-by: Nfrankwhzhang <frankwhzhang@126.com>
      
      * Fix compilation problem (#31100)
      
      Fix compilation problem (#31100)
      
      * fix compile
      
      * fix code stype
      
      * remove const_cast
      
      * support adding correct npu op in pybind.h (#31143)
      
      * support adding correct npu op in pybind.h
      
      * refine code
      
      * [NPU] Support executor with NPU (#31057)
      
      * [NPU] Support executor with NPU
      
      * Fix code according to reviews
      
      * Fix code
      
      * Add unittest for sub op npu
      
      * refactor npu device manager (#31154)
      
      refactor npu device manager (#31154)
      
      * fix selected npus
      
      * fix compile
      
      * fix reading flags from env
      
      * format
      Co-authored-by: Nxiayanming <41795079@qq.com>
      Co-authored-by: Ngongweibao <weibao.gong@gmail.com>
      Co-authored-by: Nfrankwhzhang <frankwhzhang@126.com>
      Co-authored-by: Nliym27 <33742067+liym27@users.noreply.github.com>
      ccf5709d
  14. 23 3月, 2021 1 次提交
  15. 26 2月, 2021 1 次提交
  16. 04 2月, 2021 1 次提交
  17. 25 1月, 2021 1 次提交
  18. 11 1月, 2021 1 次提交
  19. 30 12月, 2020 1 次提交
  20. 25 12月, 2020 2 次提交
  21. 04 12月, 2020 1 次提交
  22. 27 11月, 2020 1 次提交
  23. 15 9月, 2020 1 次提交
  24. 21 8月, 2020 1 次提交
    • Q
      support Baidu Kunlun AI Accelerator (#25959) · 138ecf24
      QingshuChen 提交于
      * support Baidu AI Accelerator
        * test=kunlun
      
      * minor
       * test=kunlun
      
      * support xpu op in separate file
       * test=kunlun
      
      * update XPU error message and remove duplicated code
      
       * test=kunlun
      
      * minor
       * test=kunlun
      
      * minor
       * test=kunlun
      138ecf24
  25. 13 8月, 2020 1 次提交
    • L
      [OpDevOptimize] Add common infershape functions (#26096) · ffe52b44
      Leo Chen 提交于
      * add unchaged infershape function
      
      * add broadcast infershape function
      
      * fix bug
      
      * rename infershape functions
      
      * add UnaryOpUnchangedInferShapeCheckAxis
      
      * add error message
      
      * add test for common infer shape functions
      
      * dont update existed ops
      
      * dont update op_desc.h
      
      * add more test
      
      * add error check, refine error message
      ffe52b44
  26. 04 6月, 2020 1 次提交
  27. 14 5月, 2020 1 次提交
  28. 11 5月, 2020 1 次提交
    • C
      Add macro BOOST_GET to enrich the error information of boost :: get (#24175) · aa0f254f
      Chen Weihang 提交于
      * add new macro BOOST_GET_SAFELY & unittests, test=develop
      
      * add different macro type, test=develop
      
      * fix get macro type in executor, test=develop
      
      * four macro part change backup
      
      * using one macro for all case, test=develop
      
      * revert attribute change, test=develop
      
      * change to three func to solve gcc4.8 bug, test=develop
      
      * polish some details, test=develop
      aa0f254f
  29. 06 5月, 2020 1 次提交
    • L
      [Dy2Stat]Support list pop (#24250) · ac9a7eee
      liym27 提交于
      * Replace dygraph_to_static_func with @declarative or program_translator.get_func in test_list.py
      
      * Add comments in ConditionalBlock.
      
      * Support list pop last item. 
      
      * Support pop the i-th item. 
      
      * Support an empty tensor array as Input in assign op and set the kernel type is float.
      ac9a7eee
  30. 09 4月, 2020 1 次提交
  31. 03 4月, 2020 1 次提交
    • Z
      support Exhaustive search in dygraph (#23415) · dbfbd7ea
      zhongpu 提交于
      * use global conv cache; test=develop
      
      * use singleton cache; test=develop
      
      * fix format error; test=develop
      
      * add cudnn helper header; test=develop
      
      * fix header error; test=develop
      
      * fix mac unitest; test=develop
      
      * fix mac unitest; test=develop
      
      * fix file format; test=develop
      
      * fix include file error, test=develop
      
      * remove kernel_configs_ in class ExecutionContext and kernel_configs_map_ in class OperatorWithKernel, test=develop
      
      * fix test_elementwise_mul_op_dim, test=develop
      
      * fix compile error, test=develop
      Co-authored-by: Nphlrain <phliuhongyu@126.com>
      dbfbd7ea
  32. 02 4月, 2020 2 次提交
    • Z
      Revert "Exhaustive search (#22821)", test=develop (#23401) · bfb07aaf
      zhongpu 提交于
      This reverts commit 48144e40.
      bfb07aaf
    • Z
      Exhaustive search (#22821) · 48144e40
      zhongpu 提交于
      * use global conv cache; test=develop
      
      * use singleton cache; test=develop
      
      * fix format error; test=develop
      
      * add cudnn helper header; test=develop
      
      * fix header error; test=develop
      
      * fix mac unitest; test=develop
      
      * fix mac unitest; test=develop
      
      * fix file format; test=develop
      
      * fix include file error, test=develop
      
      * remove kernel_configs_ in class ExecutionContext and kernel_configs_map_ in class OperatorWithKernel, test=develop
      
      * fix test_elementwise_mul_op_dim, test=develop
      Co-authored-by: Nphlrain <phliuhongyu@126.com>
      48144e40
  33. 18 3月, 2020 1 次提交
  34. 09 3月, 2020 1 次提交
  35. 03 3月, 2020 2 次提交
  36. 28 2月, 2020 1 次提交
  37. 23 2月, 2020 1 次提交