1. 16 Sep, 2019 1 commit
  2. 11 Sep, 2019 2 commits
    • Replace TemporaryAllocator by CUDADeviceContextAllocator (#18989) · 12542320
      Huihuang Zheng committed
      TemporaryAllocator is a singleton used to allocate memory for cuDNN. Because it is a singleton, removing it can improve memory performance.
      
      We replace TemporaryAllocator with CUDADeviceContextAllocator and CUDADeviceContextAllocation, which use a stream callback to free the memory allocated for the stream, so no singleton is needed.
      
      Also adds data_feed_proto to operator to fix CI in CPU compilation.
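The stream-callback idea described in this commit can be illustrated with a toy model. The sketch below is plain Python, not Paddle or CUDA code: `ToyPool`, `ToyStream`, and `run_with_temp_buffer` are hypothetical names, and the "stream" is only an ordered queue of tasks. It shows the ordering guarantee alone: the release callback is queued behind the work that uses the buffer, so no global singleton has to track the allocation.

```python
from collections import deque


class ToyPool:
    """Hypothetical memory pool that only tracks live buffer ids."""

    def __init__(self):
        self.live = set()
        self._next_id = 0

    def allocate(self):
        buf = self._next_id
        self._next_id += 1
        self.live.add(buf)
        return buf

    def free(self, buf):
        self.live.discard(buf)


class ToyStream:
    """Hypothetical stand-in for a device stream: tasks run in FIFO order."""

    def __init__(self):
        self._tasks = deque()

    def enqueue(self, task):
        self._tasks.append(task)

    def synchronize(self):
        # Run every queued task in submission order.
        while self._tasks:
            self._tasks.popleft()()


def run_with_temp_buffer(pool, stream, kernel, log):
    """Allocate a buffer, queue the kernel, then queue a callback that frees it."""
    buf = pool.allocate()
    stream.enqueue(lambda: log.append(("kernel", kernel(buf))))
    # Queued after the kernel, so the buffer cannot be freed before it is used.
    stream.enqueue(lambda: (pool.free(buf), log.append(("freed", buf))))
    return buf


pool, stream, log = ToyPool(), ToyStream(), []
buf = run_with_temp_buffer(pool, stream, lambda b: b + 100, log)
still_live_before_sync = buf in pool.live
stream.synchronize()
```

In this model the buffer stays live until the stream has drained the work queued before the callback, which is the property the commit message attributes to the stream callback.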
    • Implement the GPU kernel of fc operator (#19687) · a65c728e
      Yiqun Liu committed
      * Refine the code related to fc op.
      
      * Add GPU implementation for fc functor.
      
      * Apply fc_fuse_pass in GPU inference.
      test=develop
      
      * Change the cmake for fc op.
      
      * Change PADDLE_ENFORCE to PADDLE_ENFORCE_EQ.
      
      * Add an attribute to set the activation type in fc_op.
      
      * Enhance the unittest of fc_op.
      test=develop
      
      * Move the declaration of FCOpGrad back to the header file.
      test=develop
      
      * Set default value for newly added arguments in test_fc_op.
      test=develop
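As a reference for what a fully connected op with a fused activation computes, here is a minimal pure-Python sketch. It is not the Paddle kernel: the function name `fc`, the parameter name `activation_type`, and the accepted values `""` and `"relu"` are assumptions made for illustration; the commit message only states that an attribute sets the activation type.

```python
def fc(x, w, bias=None, activation_type=""):
    """Compute activation(x @ w + bias) for row-major nested lists."""
    if activation_type not in ("", "relu"):
        raise ValueError("unsupported activation_type: %r" % activation_type)
    rows, inner, cols = len(x), len(w), len(w[0])
    out = []
    for i in range(rows):
        row = []
        for j in range(cols):
            # Dot product of row i of x with column j of w.
            acc = sum(x[i][k] * w[k][j] for k in range(inner))
            if bias is not None:
                acc += bias[j]
            if activation_type == "relu":
                acc = max(acc, 0.0)
            row.append(acc)
        out.append(row)
    return out


x = [[1.0, -2.0]]
w = [[1.0, 0.5], [1.0, -1.0]]
b = [0.5, 0.0]
linear = fc(x, w, b)                               # [[-0.5, 2.5]]
activated = fc(x, w, b, activation_type="relu")    # [[0.0, 2.5]]
```

Applying the activation in the same pass as the bias add avoids a separate elementwise step over the output, which is the usual motivation for fusing it into the op.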
  3. 10 Sep, 2019 1 commit
  4. 07 Sep, 2019 1 commit
  5. 04 Sep, 2019 3 commits
  6. 31 Aug, 2019 1 commit
    • Paddlebox Framework (#18982) · c756b5d2
      hutuxian committed
      * Support looking up embeddings from BoxPS.
      * Add a _pull_box_sparse op; for now this op is not exposed to users.
      * Add a BoxHelper class that provides 'BeginPass', 'EndPass', 'FeedPass', and similar functions.
      * Add 'BoxPSDataset' in the Python code.
      * Add a compile option WITH_BOX_PS and a macro PADDLE_WITH_BOX_PS.
      * Add unit tests.
      * For more details, please refer to: https://github.com/PaddlePaddle/Paddle/pull/18982
  7. 30 Aug, 2019 1 commit
  8. 20 Aug, 2019 1 commit
  9. 19 Aug, 2019 3 commits
  10. 14 Aug, 2019 2 commits
  11. 12 Aug, 2019 1 commit
  12. 01 Aug, 2019 1 commit
  13. 31 Jul, 2019 1 commit
  14. 29 Jul, 2019 1 commit
  15. 24 Jul, 2019 1 commit
  16. 23 Jul, 2019 1 commit
  17. 22 Jul, 2019 1 commit
  18. 19 Jul, 2019 2 commits
  19. 18 Jul, 2019 1 commit
    • Downgrade gcc to 4.8 (#18614) · 898237c1
      Jiabin Yang committed
      * test=develop, fix docker with paddle nccl problem
      
      * test=develop, downgrade gcc to 4.8 for latest-dev
      
      * test=develop, downgrade gcc to 4.8 for latest-dev
      
      * test=develop, modify cmake to renew all third_party
      
      * test=develop, invoke ci
      
      * test=develop, invoke ci
      
      * test=develop, compile python with wide-unicode
      
      * test=deveop, refine env settings
      
      * test=deveop, refine env settings
  20. 17 Jul, 2019 1 commit
  21. 12 Jul, 2019 1 commit
  22. 09 Jul, 2019 1 commit
  23. 08 Jul, 2019 1 commit
    • Support Bitmain Anakin (#18542) · 15291548
      石晓伟 committed
      * update anakin-engine interfaces for content-dnn
      
      test=develop
      
      * support only-gpu mode of Anakin
      
      modify eltwise parse
      
      test=develop
      
      * modification for thread-safe
      
      test=develop
      
      * Integrated template instance
      
      test=develop
      
      * increase template parameters
      
      test=develop
      
      * support MLU predictor
      
      test=develop
      
      * update anakin cmake files
      
      test=develop
      
      * update TargetWrapper::set_device
      
      * update the initialization of anakin subgraph
      
      test=develop
      
      * use the default constructor of base class
      
      test=develop
      
      * load model from buffer with length
      
      test=develop
      
      * modify the access level of class
      
      test=develop
      
      * support anakin for bitmain arch
      
      test=develop
      
      * remove files
      
      * checkout cmakelists
      
      test=develop
  24. 03 Jul, 2019 1 commit
  25. 29 Jun, 2019 1 commit
  26. 27 Jun, 2019 1 commit
    • add WITH_COVERAGE option, default OFF (#17872) · 27fb9cad
      kh2se2013 committed
      * add WITH_COVERAGE option, default OFF
      
      test=develop
      
      * add coverage for python sdk
      
      test=develop
      
      * fix code style
      
      * fix COVERAGE_FILE path
      
      test=develop
      
      * remove coverage package
      
      test=develop
      
      * test = develop, run coverage as module
  27. 26 Jun, 2019 1 commit
  28. 21 Jun, 2019 1 commit
  29. 16 Jun, 2019 1 commit
  30. 13 Jun, 2019 1 commit
  31. 12 Jun, 2019 1 commit
    • combine noavx and avx package (#17889) · 5c06bff2
      tensor-tang committed
      * support avx and noavx core
      
      * add catch and give some log
      
      test=develop
      
      * fix build
      
      test=develop
      
      * add missing package
      
      test=develop
      
      * fix pybind name
      
      test=develop
      
      * fix import error
      
      test=develop
      
      * combine noavx core
      
      test=develop
      
      * add requirements
      
      test=develop
      
      * fix unknown message
      
      test=develop
      
      * fix api spec
      
      test=develop
      
      * refine and clean
      
      test=develop
      
      * update
      
      * pass dist ut
      
      * follow comments
      
      test=develop
      
      * refine scripts
      
      test=develop
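Shipping AVX and non-AVX builds in one package implies choosing a core module at import time and logging when a candidate cannot be loaded ("add catch and give some log"). The sketch below shows one generic way to do that with the standard library. The helper `load_first_available` and the placeholder module name are hypothetical; this is not the logic Paddle uses.

```python
import importlib
import logging


def load_first_available(module_names):
    """Return the first module in module_names that imports successfully."""
    errors = []
    for name in module_names:
        try:
            return importlib.import_module(name)
        except ImportError as exc:
            # Log the failure and fall through to the next candidate.
            logging.warning("could not import %s: %s", name, exc)
            errors.append("%s: %s" % (name, exc))
    raise ImportError(
        "no candidate could be imported (" + "; ".join(errors) + ")"
    )


# Demonstration: a missing (hypothetical) name followed by a stdlib module.
core = load_first_available(["hypothetical_core_avx_missing", "json"])
```

In a real package the candidate list would name the AVX build first and the non-AVX build second, so the faster variant is preferred whenever it loads.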
  32. 11 Jun, 2019 1 commit
    • Update the Anakin interfaces for content-dnn and MLU (#17890) · bce259e5
      石晓伟 committed
      * update anakin-engine interfaces for content-dnn
      
      test=develop
      
      * support only-gpu mode of Anakin
      
      modify eltwise parse
      
      test=develop
      
      * modification for thread-safe
      
      test=develop
      
      * Integrated template instance
      
      test=develop
      
      * increase template parameters
      
      test=develop
      
      * support MLU predictor
      
      test=develop
      
      * update anakin cmake files
      
      test=develop
      
      * update TargetWrapper::set_device
      
      * update the initialization of anakin subgraph
      
      test=develop
      
      * use the default constructor of base class
      
      test=develop
  33. 03 Jun, 2019 1 commit