1. 06 12月, 2021 1 次提交
  2. 15 11月, 2021 1 次提交
    • C
      [Pten] Refactor the implementation of custom operator (#37122) · 1e598f1a
      Chen Weihang 提交于
      * move extension into pten [no-verify]
      
      * append tensor methods by ext_tensor [no-verify]
      
      * append other tensor methods [no-verify]
      
      * ext related files tidy [no-verify]
      
      * include relation tidy [no-verify]
      
      * add pten tensor test [no-verify]
      
      * replace tensor in custom op & compile success
      
      * refine tensor constructor for unittest
      
      * custom relu jit run success
      
      * fix all custom op unittests
      
      * add inference cmake adapt [no-verify]
      
      * fix failed unittests
      
      * fix windows failed unittests
      
      * try to fix kunlun and inference failed
      
      * fix test_elementwise_api error
      
      * try to fix win compile failed
      
      * fix kunlun fp16 type error
      
      * remove useless haddle error macro
      
      * add custom linear op test
      
      * fix compile failed & add win symbols
      
      * fix non pten kernel cast failed
      
      * add dll decl for api
      
      * polish several deetails
      
      * polish details by review comment
      
      * add dll_decl for register
      1e598f1a
  3. 27 2月, 2021 1 次提交
  4. 22 2月, 2021 1 次提交
  5. 23 9月, 2020 1 次提交
  6. 05 6月, 2020 1 次提交
  7. 01 6月, 2020 1 次提交
    • S
      support C++ inference shared library on windows (#24672) · 126d3d69
      silingtong123 提交于
      * add SetCommandLineOption
      
      * add the print_FLAGS function
      
      * remove the test demo
      
      * modify the location of macro
      
      * add the 'WITH_STATIC_LIB' option on windows
      
      * modify the macro of PD_INFER_DECL
      
      * modify the the fuction name
      
      * modify the unittest
      
      * modify the code style
      126d3d69
  8. 21 6月, 2019 1 次提交
    • J
      dataset (#17973) · 3f8031e2
      jiaqi 提交于
      (1) use channel instead of vector/BlockingQueue in Dataset,to keep same with existing implementation, and make code more readable and flexible (dataset single output channel or multi output channel). one previous memory out of limit problem is cause by not release memory after training.
      (2) add Record because MultiSlotType costs too much memory (80B),fix memory out of limit problem.
      (3) add Channel, Archive in paddle/fluid/framework
      (4) change dataset from shared_ptr to unique_ptr in pybind
      (5) move create/destroy readers from trainer to dataset
      (6) move shuffle from datafeed to dataset. dataset holds memory, datafeed is only for load data and feed data to network.
      (7) fix thread num bug of Dataset when filelist size < thread num
      (8) support set_queue_num in InMemoryDataset
      3f8031e2
  9. 20 2月, 2019 1 次提交
  10. 21 1月, 2019 1 次提交
  11. 10 1月, 2019 1 次提交
  12. 13 12月, 2018 1 次提交
    • S
      fix cmake · deb0d41c
      sneaxiy 提交于
      fix cmake again
      test=develop
      deb0d41c
  13. 10 12月, 2018 1 次提交
  14. 10 9月, 2018 1 次提交
  15. 18 6月, 2018 1 次提交
  16. 24 5月, 2018 1 次提交
  17. 23 5月, 2018 1 次提交
  18. 22 3月, 2018 1 次提交
  19. 07 3月, 2018 2 次提交
  20. 06 3月, 2018 2 次提交
  21. 15 2月, 2018 1 次提交
    • Y
      Update tensor_util.h (#8422) · cfffb1a3
      Yi Wang 提交于
      * Update tensor_util.h
      
      * Update with moved TensorDesc
      
      * Fix tensur_utils.cu
      
      * Update
      
      * Update
      
      * Update
      
      * Update
      
      * Make tensor_util.cu a symbolic link
      cfffb1a3
  22. 10 2月, 2018 2 次提交
  23. 07 2月, 2018 1 次提交
  24. 06 2月, 2018 2 次提交
  25. 01 2月, 2018 1 次提交
  26. 31 1月, 2018 1 次提交
  27. 30 1月, 2018 1 次提交