1. 21 Feb, 2020 3 commits
  2. 20 Feb, 2020 3 commits
    • A
      Support if/else in dygraph_to_static (#22540) · 08b09f64
      Aurelius84 committed
      * support nested if/else
      
      * support deriving the list of returned parameters automatically
      
      * polish transform function of slice
      
      * fix modify x.numpy()[i] slice function
      
      * support transforming ast.node into a callable function
      
      * fix get_name_ids bug and add more unittest test=develop
      
      * fix requirements.txt test=develop
      
      * remove useless import statement test=develop
      
      * Fixed version compatibility issues in param of function test=develop
      
      * use decorator to test ast_to_func test=develop
      
      * add textwrap.dedent for source_code test=develop
      
      * polish code comment
      
      * fix compatibility with python2 and python3 test=develop
      
      * fix gast version error test=develop
      
      * fix gast repo test=develop
      
      * polish transfer_from_node_type code test=develop
      
      * add nested_if_else unittest test=develop
      
      * split IfElseTransformer test=develop
      
      * specify gast version test=develop
      
      * fix ast_to_func root type test=develop
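The "transform ast.node into callable function" and "add textwrap.dedent for source_code" items above can be illustrated with a minimal sketch. It assumes the standard-library `ast` module rather than `gast`, and the `ast_to_func` signature, the `SOURCE` string and the `sign` function are hypothetical names chosen for illustration; this is not the code from the PR.

```python
import ast
import textwrap

# Source text indented as if it had been extracted from inside a class or
# function body; ast.parse would reject it without dedenting first.
SOURCE = """
    def sign(x):
        if x > 0:
            return 1
        else:
            if x < 0:
                return -1
            else:
                return 0
"""


def ast_to_func(tree, func_name):
    """Compile a module-level AST and return the function named func_name.

    Hypothetical helper: the name matches the commit message, but the
    signature and body are assumptions made for this sketch.
    """
    # Nodes created or moved by a transformer may lack line/column info.
    ast.fix_missing_locations(tree)
    code = compile(tree, filename="<ast>", mode="exec")
    namespace = {}
    exec(code, namespace)  # defines the function inside `namespace`
    return namespace[func_name]


# Remove the common leading indentation, parse, then turn the tree back
# into a callable. A NodeTransformer could rewrite `tree` in between.
tree = ast.parse(textwrap.dedent(SOURCE))
sign = ast_to_func(tree, "sign")
print(sign(5), sign(-3), sign(0))
```

An if/else transformer would sit between `ast.parse` and `ast_to_func`, rewriting the nested `ast.If` nodes before the tree is compiled again.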
    • W
      add fuse_group python unittest (#22532) · 7a4c29e0
      wangchaochaohu committed
      * add fuse_group python unittest
    • H
      Add Basic Node Var Type Analysis (#22603) · 14672a63
      Huihuang Zheng committed
      1. Move AstNodeWrapper, StaticAnalysisVisitor to a new python file: static_analysis.py
      2. Add basic node var type analysis
  3. 19 Feb, 2020 2 commits
  4. 18 Feb, 2020 4 commits
  5. 17 Feb, 2020 6 commits
  6. 16 Feb, 2020 3 commits
  7. 15 Feb, 2020 1 commit
    • S
      update ops' unittest data type from float32 to float64 and shape over 100 (#22544) · 90ee3666
      Steffy-zxf committed
      * update the unittests of the elementwise_pow, elementwise_max, elementwise_min, scale and sqrt ops
      1. update the unittests of elementwise_pow, elementwise_max and scale to use float64 input data (float32 -> float64)
      2. fix the bug where elementwise_pow doesn't meet the threshold requirements when handling float64 data
      3. remove sqrt from op_accuracy_white_list.py
      4. update the unittests of the elementwise_pow, elementwise_max and elementwise_min ops so that their input data shape is over 100
      5. test=develop
      
      * modify the writing style according to suggestions
      test=develop
  8. 14 Feb, 2020 1 commit
  9. 13 Feb, 2020 5 commits
  10. 12 Feb, 2020 3 commits
    • G
      Add support for dynamic_decode(while) training. (#22231) · 31b54646
      Guo Sheng committed
      * Add support for dynamic_decode(while) training. test=develop
      
      * Fix assign_op and tensor_array_read_write_op after solving conflict. test=develop
      
      * Fix test_rnn_decode_api.py. test=develop
      
      * Refine docs for apis in rnn.py. test=develop
      
      * Adjust outputs of dynamic_decode. test=develop
      
      * Remove the force_cpu update in assign_op. test=develop
      
      * Remove the force_cpu update in assign_op. test=develop
      
      * Make RNNCell.get_initial_states support batch_dim_idx argument. test=develop
      
      * Rename _create_array_outof_while as _create_array_out_of_while in rnn.py.
      test=develop
    • T
      fix bug with compiledProgram (#22495) · b0675c81
      tangwei12 committed
      * add thread barrier for the compiled program
    • D
      support slice double grad, test=develop (#22166) · 58d99247
      Double_V committed
      * support slice double grad, test=develop
      * merge two doublegradopmaker into one doublegradopmaker, test=develop
      * change the shape of slice_OP's unittest, test=develop
  11. 11 Feb, 2020 4 commits
    • Y
      multi-loss optimization by adding a DownpourOpt worker (#22025) · 2235ee1a
      yaoxuefeng committed
      * update
      
      * update test=develop
      
      * update compile set test=develop
      
      * update compile set test=develop
      
      * update test=develop
      
      * update test=develop
      
      * update test=develop
      
      * update compile setting test=develop
      
      * update compile setting test=develop
      
      * update run demo test=develop
      
      * update test=develop
      
      * update test=develop
      
      * fix test=develop
      
      * update test=develop
      
      * update test=develop
      
      * update test=develop
      
      * update test=develop
      
      * update test=develop
      
      * update test=develop
      
      * update test=develop
      
      * update test=develop
      
      * update test=develop
      
      * update format test=develop
      
      * update format test=develop
      
      * update style test=develop
      
      * update style test=develop
      
      * change style test=develop
      
      * change style test=develop
      
      * change style test=develop
      
      * add dataset unittest test=develop
      
      * update test=develop
      
      * update for record test=develop
      
      * update style for record test=develop
      
      * update for record test=develop
      
      * update for record test=develop
      
      * update for record test=develop
      
      * fix format test=develop
      
      * update test=develop
      
      * update test=develop
      
      * update test=develop
      
      * update test=develop
      
      * update test=develop
    • Z
      Improve transpose performance with tile sm copy, test=develop (#22311) · 54970444
      zhaoyuchen2018 committed
      
      * Refine code, fix select tile error,test=develop
      
      * Refine element type and some comments, test=develop
      
      * Refine comments and gpu utils, test=develop
      
      * Remove some useless condition
      
      * Refine floor and ceil, test=develop
      
      * refine for loop. test=develop
      Signed-off-by: zhaoyuchen <zhaoyuchen01@baidu.com>
    • G
      Make assign op support LoDTensorArray and modify while_loop API (#22309) · 3a59a7a1
      guofei committed
      This PR makes the assign op support LoDTensorArray and enables the loop_vars in
      while_loop to be a tuple or list.
  12. 10 Feb, 2020 4 commits
  13. 07 Feb, 2020 1 commit
    • Y
      Enable the detection of subgraph composed of grad ops (#21223) · dcfb6038
      Yiqun Liu committed
      * Add the first implementation of fusion_group op #19621 (#3)
      
      * Add the dynamic load of nvrtc, and support runtime compiling of CUDA kernel using nvrtc.
      test=develop
      
      * Call CUDA driver api to launch the kernel compiled by nvrtc.
      test=develop
      
      * Disable for mac and windows.
      test=develop
      
      * Refine the codes to support manually specified num_threads and workload_per_thread.
      test=develop
      
      * Refine the CUDA kernel to support large dims.
      test=develop
      
      * Add DeviceCodePool to manage all device codes.
      
      * Add the first implementation of fusion_group op.
      
      * Add unit-test for fusion_group op.
      
      * Add the check of result.
      
      * Add the check of nvrtc in unit-test.
      test=develop
      
      * Add comment to explain the inputs, outputs and features of fusion_group op.
      test=develop
      
      * Disable fusion_group op for mac and windows.
      test=develop
      
      * Make the compiling of device code return status instead of hanging up.
      test=develop
      
      * Add the check of whether there is CUDA driver library, and do not core dump when failing to call the CUDA driver API.
      
      * Unify fusion_group_op's input and output names.
      test=develop
      
      * Add the check of CUDA driver library in unittest.
      test=develop
      
      * Enable generating code for a given subgraph. #21126 (#4)
      
      * Enable generating code for a given subgraph.
      
      * Support sorting the subgraph.
      
      * Remove the rearrangement of expressions because we use the sorted subgraph directly.
      
      * Enable generating code for a subgraph which is composed of grad ops.
      
      * Use expression information to check the accuracy in unittest.
      
      * Separate load and store from computation expressions.
      test=develop
      
      * Improve the loading statements in generated codes.
      test=develop
      
      * Remove unused arguments from formal list.
      test=develop
      
      * Enable the detection of subgraph of grad ops.
      
      * Generate code for detected subgraph in fusion_group_pass.
      
      * Add an option in BuildStrategy to enable fusion_group_pass and add unittest.
      test=develop
      
      * Fix a bug when checking whether the shapes of all inputs are the same.
      
      * Add debug information.
      
      * Move subgraph_detector from inference/analysis to the common framework/ir directory. (#5)
      
      test=develop
      
      * Call subgraph_detector in fusion_group pass.
      test=develop
      
      * Disable fusion_group when WITH_GPU is OFF.
      test=develop
      
      * Refine all PADDLE_ENFORCE message.
      test=develop
      
      * Fix the case that some inputs are not defined in grad ops, and set op_role for fused op.
      test=develop
      
      * Follow review comments.
      test=develop