1. 17 Jan, 2022 1 commit
    • W
      [Pten] Replace platform::Place to pten::Place. (#38899) · c48a9ad5
      Wilber committed
      * add pten::Place data structure.
      
      * update ci problem
      
      * fix ci problem
      
      * update
      
      * using platform::Place=pten::Place
      
      * remove BOOST_GET_CONST for CPUPlace and GPUPlace
      
      * compile pass 25%.
      
      * compile pass 45%
      
      * compile pass 60%
      
      * remove boost_get for xpu npu mlu and ipu
      
      * compile pass on cpu and gpu.
      
      * fix compile problem
      
      * fix compile error.
      
      * update
      
      * fix ci problem
      
      * update
      
      * ci approve
      
      * fix ci problem
      
      * fix ci eager test problem
      
      * remove BOOST_GET_CONST
      
      * fix npu compile
  2. 15 Jan, 2022 1 commit
  3. 13 Jan, 2022 1 commit
  4. 15 Dec, 2021 1 commit
  5. 27 Nov, 2021 1 commit
    • A
      [NPU] reorganization for device API abstraction (#37110) · 72241a6a
      Aganlengzi committed
      * [NPU] reorganization for device API abstraction
      
      * [NPU] delete old files
      
      * [NPU] fix npu_collective_helper
      
      * [NPU] fix collective_helper
      
      * [NPU] fix ut
      
      * [NPU] mod memory allocation and hccl_helper
      
      * [NPU] fix place_type
      
      * [NPU] split enforce.h
      
      * move acl* call into npu_info
      
      * merge conflict
      
      * fix merge
      
      * merge conflict
      
      * merge conflict
  6. 20 Oct, 2021 1 commit
    • S
      Add FasterTokenizer Operator (#34491) · 3f2d6a3f
      Steffy-zxf committed
      Add tokenizer-related functionality for Transformer models so that the training and prediction processes are consistent.
      
      * support a text string as an input Tensor
      * support the "VOCAB" unordered_map<wstring, int> as an input Tensor for token lookup
      * Tokenizer used for BERT. It performs end-to-end tokenization from a text string to wordpiece tokens.
      * It first applies basic tokenization, followed by wordpiece tokenization.
  7. 31 Aug, 2021 1 commit
  8. 27 Aug, 2021 1 commit
  9. 26 Aug, 2021 1 commit
    • S
      Add copy from tensor (#34406) · ac33c0ca
      Shang Zhizhou committed
      * add api
      
      * temp save
      
      * revert
      
      * copytocpu async ok
      
      * fix style
      
      * copy sync ok
      
      * fix compile error
      
      * fix compile error
      
      * api done
      
      * update python async api
      
      * fix compile
      
      * remove async python api; add c++ async unittest
      
      * remove python async api
      
      * update unittest
      
      * update unittest
      
      * add C++ unittest for copytensor
      
      * add unittest
      
      * update namespace utils to class TensorUtils
      
      * add unittest
      
      * update unittest
      
      * update unittest
      
      * update code style
      
      * update code style
      
      * update unittest
  10. 12 Aug, 2021 1 commit
  11. 06 Aug, 2021 1 commit
  12. 05 Aug, 2021 1 commit
  13. 14 Jul, 2021 1 commit
  14. 17 Jun, 2021 1 commit
  15. 25 May, 2021 1 commit
  16. 08 Mar, 2021 1 commit
  17. 05 Mar, 2021 1 commit
  18. 20 Feb, 2021 1 commit
  19. 07 Feb, 2021 1 commit
  20. 04 Feb, 2021 1 commit
  21. 04 Jan, 2021 1 commit
  22. 28 Dec, 2020 1 commit
  23. 24 Sep, 2020 1 commit
    • W
      use iwyu clean include (#27267) · df43905f
      wanghuancoder committed
      * use iwyu clean include, test=develop, test=win
      
      * compilation error, test=develop
      
      * fix compilation error2, test=develop
      
      * fix compilation error3, test=develop
      
      * fix compilation error4, test=develop
      
      * fix compilation error5, test=develop
      
      * fix compilation error6, test=develop
      
      * fix compilation error7, test=develop
      
      * fix compilation error8, test=develop
      
      * fix compilation error8, test=develop
      
      * fix compilation error10, test=develop
      
      * fix compilation error11, test=develop
  24. 11 Sep, 2020 1 commit
  25. 01 Jun, 2020 1 commit
    • S
      support C++ inference shared library on windows (#24672) · 126d3d69
      silingtong123 committed
      * add SetCommandLineOption
      
      * add the print_FLAGS function
      
      * remove the test demo
      
      * modify the location of macro
      
      * add the 'WITH_STATIC_LIB' option on windows
      
      * modify the macro of PD_INFER_DECL
      
      * modify the function name
      
      * modify the unittest
      
      * modify the code style
  26. 11 May, 2020 1 commit
    • C
      Add macro BOOST_GET to enrich the error information of boost :: get (#24175) · aa0f254f
      Chen Weihang committed
      * add new macro BOOST_GET_SAFELY & unittests, test=develop
      
      * add different macro type, test=develop
      
      * fix get macro type in executor, test=develop
      
      * four macro part change backup
      
      * using one macro for all case, test=develop
      
      * revert attribute change, test=develop
      
      * change to three func to solve gcc4.8 bug, test=develop
      
      * polish some details, test=develop
  27. 05 Feb, 2020 1 commit
  28. 10 Dec, 2019 1 commit
  29. 17 Sep, 2019 1 commit
  30. 09 Sep, 2019 1 commit
  31. 26 Mar, 2019 1 commit
  32. 22 Mar, 2019 1 commit
  33. 20 Mar, 2019 1 commit
    • N
      cherry-pick from feature/anakin-engine: add data type for zero copy #16313 · 4f4daa4b
      nhzlx committed
      1. refine anakin engine
      2. add data type for zero copy
      
      Align the dev branch with the PaddlePaddle:feature/anakin-engine branch.
      The cudnn workspace modification is not included for now, because the
      feature/anakin-engine branch hard-codes it. There should be a better way to implement it,
      and follow-up submissions will be made.
      
      test=develop
  34. 11 Mar, 2019 1 commit
  35. 08 Mar, 2019 1 commit
  36. 01 Mar, 2019 1 commit
  37. 17 Jan, 2019 1 commit
  38. 16 Jan, 2019 1 commit
    • Y
      Optimize while_op for test (#14764) · 568cc2ff
      Yiqun Liu committed
      * Simplify the compare op for CPU.
      
      * Use asynchronous tensor copy in reshape_op's kernel.
      
      * Optimize while_op for test, avoiding creating variables every time.
      test=develop
      
      * Enable the cache of kernel type and kernel function.
      test=develop
      
      * Enable profiling with gperftools.
      
      * Remove flags for testing, and fix the linking error.
      test=develop
      
      * Delete the code of ChooseKernel.
      test=develop
      
      * Fix bug when preparing ExecutorPrepareContext for while_op.
      
      * Fix the missing dependency on grpc libraries.
      
      * Remove the redundant print.
      test=develop
      
      * Follow comments.
      
      * Remove the code related to preparing the ExecutorPrepareContext for while_op.
      test=develop
  39. 19 Dec, 2018 1 commit
  40. 29 Nov, 2018 1 commit