1. 18 Oct, 2019 2 commits
  2. 15 Oct, 2019 1 commit
  3. 14 Oct, 2019 1 commit
    • Dlpack support (#20039) · 12e4be03
      633WHU committed
      * support dlpack to tensor and implement python interface test=develop
      
      * add unittest for _to_dlpack and from_dlpack test=develop
  4. 07 Oct, 2019 1 commit
  5. 02 Oct, 2019 1 commit
  6. 29 Sep, 2019 1 commit
    • fix conv2d and conv3d: (#20042) · 3aa331d9
      liym27 committed
      1. support asymmetric padding;
      2. support padding algorithm: "SAME" and "VALID";
      3. support channel_last: data_format NHWC and NDHWC;
      4. change doc of python API and c++;

      test=develop, test=document_preview
  7. 27 Sep, 2019 1 commit
    • update operator compatible info, test=develop (#19978) · 01b9d079
      石晓伟 committed
      * update operator compatible info, test=develop
      
      * revert cmake/version.cmake, test=develop
      
      * add unit_tests and fix bugs, test=develop
      
      * update ../paddle/fluid/framework/framework.proto, test=develop
      
      * fix bug of paddle/fluid/inference/api/analysis_predictor.cc, test=develop
      
      * update paddle/fluid/framework/version_test.cc, test=develop
      
      * add comments and rename interfaces, test=develop
  8. 20 Sep, 2019 1 commit
  9. 19 Sep, 2019 1 commit
    • Add a pass to fuse fc+elementwise_add+layernorm (#19776) · 3cd985a6
      Yiqun Liu committed
      * Add fc_elementwise_layernorm_fuse pass and unittest.
      
      * Add fused_fc_elementwise_layernorm op and its GPU kernel.
      test=develop
      
      * Apply fc_elementwise_layernorm_fuse_pass to GPU inference.
      
      * Add the setting of attrs in the definition of binary_op.
      test=develop
      
      * Add comment.
      
      * Implement the unittest.
      test=develop
      
      * Change the unittest name of layer_norm.
      test=develop
  10. 17 Sep, 2019 1 commit
  11. 16 Sep, 2019 1 commit
  12. 11 Sep, 2019 2 commits
    • Replace TemporaryAllocator by CUDADeviceContextAllocator (#18989) · 12542320
      Huihuang Zheng committed
      TemporaryAllocator is a singleton used to allocate memory for cuDNN. Removing the singleton allows better memory performance.

      We replace TemporaryAllocator with CUDADeviceContextAllocator and CUDADeviceContextAllocation, which use a stream callback to free the memory allocated for the stream, so no singleton is needed.

      Also added data_feed_proto to operator to fix CI for CPU compilation.
    • Implement the GPU kernel of fc operator (#19687) · a65c728e
      Yiqun Liu committed
      * Refine the codes related to fc op.
      
      * Add GPU implementation for fc functor.
      
      * Apply fc_fuse_pass in GPU inference.
      test=develop
      
      * Change the cmake for fc op.
      
      * Change PADDLE_ENFORCE to PADDLE_ENFORCE_EQ.
      
      * Add an attribute to set the activation type in fc_op.
      
      * Enhance the unittest of fc_op.
      test=develop
      
      * Move the declaration of FCOpGrad back to the header file.
      test=develop
      
      * Set default value for newly added arguments in test_fc_op.
      test=develop
  13. 10 Sep, 2019 1 commit
  14. 07 Sep, 2019 1 commit
  15. 04 Sep, 2019 3 commits
  16. 31 Aug, 2019 1 commit
    • Paddlebox Framework (#18982) · c756b5d2
      hutuxian committed
      * Support looking up embeddings from BoxPS.
      * Add a _pull_box_sparse op; for now this op is not exposed to users.
      * Add a BoxHelper class, providing 'BeginPass', 'EndPass', 'FeedPass' functions and so on.
      * Add 'BoxPSDataset' in python code.
      * Add a compile option WITH_BOX_PS and a macro PADDLE_WITH_BOX_PS.
      * Add UT.
      * For more details, please refer to: https://github.com/PaddlePaddle/Paddle/pull/18982
  17. 30 Aug, 2019 1 commit
  18. 20 Aug, 2019 1 commit
  19. 19 Aug, 2019 3 commits
  20. 14 Aug, 2019 2 commits
  21. 12 Aug, 2019 1 commit
  22. 01 Aug, 2019 1 commit
  23. 31 Jul, 2019 1 commit
  24. 29 Jul, 2019 1 commit
  25. 24 Jul, 2019 1 commit
  26. 23 Jul, 2019 1 commit
  27. 22 Jul, 2019 1 commit
  28. 19 Jul, 2019 2 commits
  29. 18 Jul, 2019 1 commit
    • Downgrade gcc to 4.8 (#18614) · 898237c1
      Jiabin Yang committed
      * test=develop, fix docker with paddle nccl problem
      
      * test=develop, downgrade gcc to 4.8 for latest-dev
      
      * test=develop, downgrade gcc to 4.8 for latest-dev
      
      * test=develop, modify cmake to renew all third_party
      
      * test=develop, invoke ci
      
      * test=develop, invoke ci
      
      * test=develop, compile python with wide-unicode
      
      * test=deveop, refine env settings
      
      * test=deveop, refine env settings
  30. 17 Jul, 2019 1 commit
  31. 12 Jul, 2019 1 commit
  32. 09 Jul, 2019 1 commit