1. 06 Apr, 2020 2 commits
    • S
      Add a new DataFeed named PaddleBoxDataFeed (#23321) · 5223e2bb
      ShenLiang committed
      * add paddleboxdatafeed
      * add ifdef linux and boxps
      * add unittest for datafeed
      * fix unittest of test_paddlebox_datafeed
      * fix unittest
      * rename function
    • C
      Implement StaticModelRunner to support dygraph fine-tune static graph pre-training model (#23171) · 75bd3507
      Chen Weihang committed
      * static model runner basic implementation, test=develop
      
      * add run program op to execute loaded program, test=develop
      
      * refactor static model runner & run program op, test=develop
      
      * reset engine.cc to resolve conflict
      
      * adapt the change of dygraph double grad, test=develop
      
      * refactor impl to solve control flow error, test=develop
      
      * clear debug code, test=develop
      
      * fix ci str compatible error & checkout dygraph grad maker & add example, test=develop
      
      * hide api & add op test, test=develop
      
      * fix run program op test places error, test=develop
      
      * fix program by review comment, test=develop
      
      * delete change var desc name, test=develop
      
      * fix other program by review comment, test=develop
      
      * remove _static_graph_guard, test=develop
      
      * add selectedrows test, test=develop
      
      * remove desc parser, test=develop
      
      * fix detail program, test=develop
      
      * change scope creation & add test, test=develop
  2. 03 Apr, 2020 3 commits
    • G
    • Z
      fix Tracer::NoGrad, test=develop (#23443) · 0c23e3ff
      Zeng Jinle committed
    • L
      [feature] prune program by feed and fetch_list automatically (#22474) · a62599a8
      Leo Chen committed
      * prune train program by fetch_list, test=develop
      
      * add unittest for prune, test=develop
      
      * fix pruned feed, test=develop
      
      * support ParallelExecutor and feed prune, test=develop
      
      * add comments, test=develop
      
      * update unittest, test=develop
      
      * update unittests, test=develop
      
      * remove debug code, test=develop
      
      * support cond in clone, test=develop
      
      * support cond in prune, test=develop
      
      * support multiple minimize, test=develop
      
      * support cache, test=develop
      
      * fix _copy_param_info_from, test=develop
      
      * support python2 str, test=develop
      
      * remove debug code, test=develop
      
      * fix bug of caching CompiledProgram, test=develop
      
      * fix multi_device issue, test=develop
      
      * tmp
      
      * support tuple in fetch_list and overriding use_prune, test=develop
      
      * don't use nonlocal in python2, test=develop
      
      * remove nonlocal, test=develop
      
      * code clean, test=develop
      
      * code clean, test=develop
      
      * feed list, test=develop
      
      * test adam, test=develop
      
      * follow comments, test=develop
      
      * reduce duplicate code, test=develop
      
      * update comments, test=develop
  3. 01 Apr, 2020 1 commit
  4. 31 Mar, 2020 1 commit
    • L
      Feature/expand params in auto-generated pybind functions for dygraph operators (#23181) · 488b2387
      Leo Chen committed
      * expand parameters, test=develop
      
      * support resnet, test=develop
      
      * fix resnet, test=develop
      
      * support duplicable out, test=develop
      
      * support ptb
      
      * fix bugs, test=develop
      
      * support null input, test=develop
      
      * fix bugs, test=develop
      
      * fix batchNorm is_test, test=develop
      
      * refine code, test=develop
      
      * follow comments, test=develop
      
      * follow comments, test=develop
      
      * follow comments, test=develop
      
      * follow comments, test=develop
  5. 29 Mar, 2020 1 commit
  6. 27 Mar, 2020 1 commit
    • Z
      Expose dygraph.grad api (#23124) · 8bfd62ff
      Zeng Jinle committed
      * expose dygraph.grad api, test=develop, test=document_fix
      
      * add more parameter in dygraph.grad API, test=develop
      
      * add only_inputs=True parameter, test=develop
      
      * follow comments, test=develop, test=document_fix
      
      * fix typo, test=develop, test=document_fix
  7. 26 Mar, 2020 2 commits
    • Z
      [Paddle-TRT]: Ernie Dynamic shape support. (#23138) · 430b0099
      Zhaolong Xing committed
      * add dynamic plugin support.
      test=develop
      
      * change emb eltwise layernorm to math function
      test=develop
      
      * add emb eltwise layernorm
      test=develop
      
      * can run dynamic shape ernie
      test=develop
      
      * fix ci
      test=develop
      
      * add ut for trt ernie dynamic
      
      test=develop
      
      * refine dynamic shape c++ interface.
      test=develop
      
      * fix comments
      test=develop
      
      * fix comments
      test=develop
    • X
      add clear one table (#23089) · 68ea1ad5
      xujiaqi01 committed
      * add clear_one_table
      * test=develop
  8. 20 Mar, 2020 2 commits
    • Z
      Reader sequential and inference partial feed (#22699) · acfc9b8a
      Zeng Jinle committed
      * sequential reader stage 1, test=develop
      
      * fix ut, test=develop
      
      * fix iterable=False reset bug, add some logs and polish code, test=develop
      
      * inference feed partial data, test=develop
      
      * Turn on keep_order=True for test, test=develop
      
      * enhance ut to test more cases, test=develop
      
      * test commit for reverting
      
      * Revert "test commit for reverting", test=develop
      
      This reverts commit 80aef42e.
      
      * add ut of merged and unmerged results, test=develop
      
      * add more uts for coverages and add en doc of api, test=develop
      
      * follow comments, test=develop
      
      * change note style, test=develop
    • Z
      Add dygraph double grad implementation (#22939) · a31d7328
      Zeng Jinle committed
      * add double grad implementation for dygraph, test=develop
      
      * polish code, add uts, test=develop
      
      * fix place bug, test=develop
      
      * polish codes, add more uts for coverages, test=develop
      
      * add no_grad_set, test=develop
      
      * add star gan ut, test=develop
      
      * follow comments, test=develop
  9. 19 Mar, 2020 1 commit
    • S
      high-performance dygraph slice (#22879) · 2e2da712
      songyouwei committed
      * move __getitem__ to cpp
      
      * bug fix
      
      * add type check and gil release
      
      * support negative step with omitted ends
      test=develop
      
      * code refine
      test=develop
      
      * bug fix
      test=develop
      
      * slice always returns a different pyobj
      test=develop
  10. 09 Mar, 2020 2 commits
  11. 04 Mar, 2020 2 commits
  12. 03 Mar, 2020 1 commit
  13. 02 Mar, 2020 3 commits
    • Z
      Unmerged fetch list (#22635) · 89cfa491
      Zhen Wang committed
      * update ScopeBufferedSSAGraphExecutor&AsyncSSAGraphExecutor&ThreadedSSAGraphExecutor&FastThreadedSSAGraphExecutor&ParallelSSAGraphExecutor&ParallelExecutor for fetching unmerged results.
      
      * add the unit test for fetch_unmerged.
      
      * update ut for multi-card and multi-cpu.
      
      * add the error message and the user suggestion in FetchOpHandle. test=develop
    • C
      Speed up dygraph DataLoader based on shared memory and LoDTensor serialization (#22541) · 7d8d5734
      Chen Weihang committed
      * add lodtensor share memory & serialization, test=develop
      
      * fix windows compile error, test=develop
      
      * deal vartype pickle & fix unittest matching error message, test=develop
      
      * update timeout variable name, test=develop
      
      * refactor memory map implementation, test=develop
      
      * clear mmap file descriptors on unexpected exit, test=develop
      
      * remove the child process fd in advance, test=develop
      
      * remove mmap fds after Queue.put in child process, test=develop
      
      * add hard unittests for register exit func, test=develop
      
      * fix python2 compatibility problem in unittest, test=develop
      
      * fix exception unittest error, test=develop
      
      * polish code based on review comments, test=develop
    • H
      support customized download command in dataset (#22782) · 53a2b68f
      hutuxian committed
      * user can call dataset.set_download_cmd to set its customized download cmd
      * add UT to cover this scenario
  14. 27 Feb, 2020 1 commit
    • Z
      Refine adam op to improve performance, test=develop (#22346) · 72dde4ab
      zhaoyuchen2018 committed
      * Refine adam op, test=develop
      
      * Fuse kernels together to reduce cpu time.
      
      * Refine paddle enforce, test=develop
      
      * Remove some comments, test=develop
      
      * Refine code, test=develop
      
      * Refine cuda kernel, test=develop
      
      * Refine code according to comments, test=develop
  15. 26 Feb, 2020 1 commit
  16. 25 Feb, 2020 1 commit
    • H
      PaddleBox Framework Part2 (#22466) · 175954d8
      hutuxian committed
      * Add two types of Metric Calculator: MultiTaskCalculator & CmatchRankCalculator.
      * Add a config for the DynamicAdjustChannelNum function to denote whether to discard the remaining instances when they cannot be distributed evenly.
      * Remove CPU code in Pull/PushSparse; we will add it back once it is fully tested.
      * Fix some known issues, such as copying persistable vars after running one epoch.
  17. 23 Feb, 2020 1 commit
  18. 22 Feb, 2020 1 commit
  19. 18 Feb, 2020 1 commit
  20. 12 Feb, 2020 1 commit
  21. 11 Feb, 2020 1 commit
    • Y
      multi-loss optimization by adding a DownpourOpt worker (#22025) · 2235ee1a
      yaoxuefeng committed
      * update
      
      * update test=develop
      
      * update compile set test=develop
      
      * update compile set test=develop
      
      * update test=develop
      
      * update test=develop
      
      * update test=develop
      
      * update compile setting test=develop
      
      * update compile setting test=develop
      
      * update run demo test=develop
      
      * update test=develop
      
      * update test=develop
      
      * fix test=develop
      
      * update test=develop
      
      * update test=develop
      
      * update test=develop
      
      * update test=develop
      
      * update test=develop
      
      * update test=develop
      
      * update test=develop
      
      * update test=develop
      
      * update test=develop
      
      * update format test=develop
      
      * update format test=develop
      
      * update style test=develop
      
      * update style test=develop
      
      * change style test=develop
      
      * change style test=develop
      
      * change style test=develop
      
      * add dataset unittest test=develop
      
      * update test=develop
      
      * update for record test=develop
      
      * update style for record test=develop
      
      * update for record test=develop
      
      * update for record test=develop
      
      * update for record test=develop
      
      * fix format test=develop
      
      * update test=develop
      
      * update test=develop
      
      * update test=develop
      
      * update test=develop
      
      * update test=develop
  22. 10 Feb, 2020 1 commit
  23. 07 Feb, 2020 1 commit
    • Y
      Enable the detection of subgraph composed of grad ops (#21223) · dcfb6038
      Yiqun Liu committed
      * Add the first implementation of fusion_group op #19621 (#3)
      
      * Add the dynamic load of nvrtc, and support runtime compiling of CUDA kernel using nvrtc.
      test=develop
      
      * Call CUDA driver api to launch the kernel compiled by nvrtc.
      test=develop
      
      * Disable for mac and windows.
      test=develop
      
      * Refine the codes to support manually specified num_threads and workload_per_thread.
      test=develop
      
      * Refine the CUDA kernel to support large dims.
      test=develop
      
      * Add DeviceCodePool to manage all device codes.
      
      * Add the first implementation of fusion_group op.
      
      * Add unit-test for fusion_group op.
      
      * Add the check of result.
      
      * Add the check of nvrtc in unit-test.
      test=develop
      
      * Add comment to explain the inputs, outputs and features of fusion_group op.
      test=develop
      
      * Disable fusion_group op for mac and windows.
      test=develop
      
      * Make the compiling of device code return status instead of hanging up.
      test=develop
      
      * Add the check of whether there is CUDA driver library, and do not core dump when failing to call the CUDA driver API.
      
      * Unify fusion_group_op's input and output names.
      test=develop
      
      * Add the check of CUDA driver library in unittest.
      test=develop
      
      * Enable generating code for a given subgraph. #21126 (#4)
      
      * Enable generating code for a given subgraph.
      
      * Support sorting the subgraph.
      
      * Remove the rearrangement of expressions because we use the sorted subgraph directly.
      
      * Enable generating code for a subgraph which is composed of grad ops.
      
      * Use expression information to check the accuracy in unittest.
      
      * Separate load and store from computation expressions.
      test=develop
      
      * Improve the loading statements in generated codes.
      test=develop
      
      * Remove unused arguments from formal list.
      test=develop
      
      * Enable the detection of subgraph of grad ops.
      
      * Generate code for detected subgraph in fusion_group_pass.
      
      * Add an option in BuildStrategy to enable fusion_group_pass and add unittest.
      test=develop
      
      * Fix a bug when checking whether the shapes of all inputs are the same.
      
      * Add debug information.
      
      * Move subgraph_detector from inference/analysis to the common framework/ir directory. (#5)
      
      test=develop
      
      * Call subgraph_detector in fusion_group pass.
      test=develop
      
      * Disable fusion_group when WITH_GPU is OFF.
      test=develop
      
      * Refine all PADDLE_ENFORCE message.
      test=develop
      
      * Fix the case that some inputs are not defined in grad ops, and set op_role for fused op.
      test=develop
      
      * Follow review comments.
      test=develop
  24. 05 Feb, 2020 1 commit
  25. 04 Feb, 2020 2 commits
  26. 02 Feb, 2020 1 commit
  27. 21 Jan, 2020 1 commit
  28. 19 Jan, 2020 1 commit
  29. 17 Jan, 2020 2 commits
    • Y
      Implement a common python unittest to test the ir passes. (#22209) · b7cac50b
      Yiqun Liu committed
      * Implement a common python unittest to test the ir passes.
      test=develop
      
      * Save the results in np.array and support to startup on CPU.
      test=develop
      
      * Fix the unittest.
      test=develop
      
      * Add check_program to check whether the optimized program is different from the original one.
      test=develop
      
      * Remove the interface all_ops.
      test=develop
      
      * Add exception test in pass_test.
      test=develop
    • T
      integrated HALF_ASYNC to communicator (#21869) · 82bc814a
      tangwei12 committed
      * add half_async in the communicator
      * fix DistributedStrategy