1. 07 Feb, 2020 6 commits
    • sequential reader stage 1, test=develop · 354abe36
      sneaxiy committed
      354abe36
    • Fix the integer overflow problem of sequence2batch (#22479) · a61d0952
      Zhong Hui committed
      Fix the integer overflow problem in the sequence2batch op by changing int32_t to size_t
      in /paddle/fluid/operators/math/sequence2batch.h#L122.
      a61d0952
    • Add weight quantization in post_training_quantization (#22445) · 197913eb
      cc committed
      * support weight quantization in post_training_quantization, test=develop
      * add test for weight quantization, test=develop
      197913eb
    • Enable the detection of subgraph composed of grad ops (#21223) · dcfb6038
      Yiqun Liu committed
      * Add the first implementation of fusion_group op #19621 (#3)
      
      * Add the dynamic load of nvrtc, and support runtime compiling of CUDA kernel using nvrtc.
      test=develop
      
      * Call CUDA driver api to launch the kernel compiled by nvrtc.
      test=develop
      
      * Disable for mac and windows.
      test=develop
      
      * Refine the codes to support manually specified num_threads and workload_per_thread.
      test=develop
      
      * Refine the CUDA kernel to support large dims.
      test=develop
      
      * Add DeviceCodePool to manage all device codes.
      
      * Add the first implementation of fusion_group op.
      
      * Add unit-test for fusion_group op.
      
      * Add the check of result.
      
      * Add the check of nvrtc in unit-test.
      test=develop
      
      * Add comment to explain the inputs, outputs and features of fusion_group op.
      test=develop
      
      * Disable fusion_group op for mac and windows.
      test=develop
      
      * Make the compiling of device code return status instead of hanging up.
      test=develop
      
      * Add the check of whether there is CUDA driver library, and do not core dump when failing to call the CUDA driver API.
      
      * Unify fusion_group_op's input and output names.
      test=develop
      
      * Add the check of CUDA driver library in unittest.
      test=develop
      
      * Enable generating code for a given subgraph. #21126 (#4)
      
      * Enable generating code for a given subgraph.
      
      * Support sorting the subgraph.
      
      * Remove the rearrangement of expressions because we use the sorted subgraph directly.
      
      * Enable generating code for a subgraph which is composed of grad ops.
      
      * Use expression information to check the accuracy in unittest.
      
      * Separate load and store from computation expressions.
      test=develop
      
      * Improve the loading statements in generated codes.
      test=develop
      
      * Remove unused arguments from formal list.
      test=develop
      
      * Enable the detection of subgraph of grad ops.
      
      * Generate code for detected subgraph in fusion_group_pass.
      
      * Add an option in BuildStrategy to enable fusion_group_pass and add unittest.
      test=develop
      
      * Fix a bug when checking whether the shapes of all inputs are the same.
      
      * Add debug information.
      
      * Move subgraph_detector from inference/analysis to the common framework/ir directory. (#5)
      
      test=develop
      
      * Call subgraph_detector in fusion_group pass.
      test=develop
      
      * Disable fusion_group when WITH_GPU is OFF.
      test=develop
      
      * Refine all PADDLE_ENFORCE messages.
      test=develop
      
      * Fix the case that some inputs are not defined in grad ops, and set op_role for fused op.
      test=develop
      
      * Follow review comments.
      test=develop
      dcfb6038
    • refine reshape_op shape error message (#22480) · 7c9ce097
      Tao Luo committed
      test=develop
      7c9ce097
    • optimize performance of interpolate op (#22436) · 2b1386b2
      LielinJiang committed
      * optimize interpolate op, test=develop
      2b1386b2
  2. 06 Feb, 2020 4 commits
  3. 05 Feb, 2020 3 commits
  4. 04 Feb, 2020 3 commits
  5. 02 Feb, 2020 2 commits
  6. 31 Jan, 2020 2 commits
  7. 25 Jan, 2020 2 commits
  8. 23 Jan, 2020 1 commit
  9. 22 Jan, 2020 1 commit
  10. 21 Jan, 2020 2 commits
  11. 19 Jan, 2020 4 commits
  12. 17 Jan, 2020 3 commits
  13. 16 Jan, 2020 6 commits
    • remove unused code test=develop (#22327) · 1e932ecc
      wangchaochaohu committed
      1e932ecc
    • Remove unused inputs for some operators (#22284) · 3e5744aa
      Leo Chen committed
      * remove unused inputs, test=develop
      
      * remove unused inputs, test=develop
      
      * update dtype, test=develop
      
      * remove unused inputs, test=develop
      
      * update op_use_default_grad_op_maker, test=develop
      
      * resolve conflicts, test=develop
      
      * follow comments, test=develop
      
      * update center_loss_grad, test=develop
      3e5744aa
    • fix typo in error message (#22312) · 805328e1
      zhangchunle committed
      805328e1
    • change std::cout to LOG(INFO), VLOG (#22316) · 895f8da7
      lidanqing committed
      895f8da7
    • revert paddle_fluid.map, test=develop (#22236) · 8cb04664
      石晓伟 committed
      8cb04664
    • Speeding up dygraph DataLoader with multiprocessing (#21762) · 35efbe6d
      Chen Weihang committed
      * add multiprocess for dygraph data loader, test=develop
      
      * polish code & add safe guard, test=develop
      
      * refactor dygraph dataloader & add signal handler, test=develop
      
      * fix member initializer compile error on ci, test=develop
      
      * fix member initializer compile error once more, test=develop
      
      * remove useless config, test=develop
      
      * skip windows incompatible problem, test=develop
      
      * add unittest for coverage, test=coverage
      
      * add more exception unittest case, test=develop
      
      * deal with signal handler coverage, test=develop
      
      * polish code & add signal handler tests, test=develop
      
      * deal with coverage ci problem, test=develop
      
      * split data loader test & coverage ci fix, test=develop
      
      * remove test_imperative_data_loader_with_exception, test=develop
      
      * remove singal process except test case, test=develop
      
      * add exception tests again & remove sample list test, test=develop
      
      * split normal and exception unittests into different classes, test=develop
      
      * polish doc for use_multiprocess effect in static mode, test=develop
      35efbe6d
  14. 15 Jan, 2020 1 commit