1. 20 11月, 2019 4 次提交
    • Y
      Enable generating code for a given subgraph. (#21126) · 6b1e1f0d
      Yiqun Liu 提交于
      * Enable generating code for a given subgraph.
      
      * Support sorting the subgraph.
      
      * Remove the rearange of expressions because we use the sorted subgraph directly.
      
      * Enable generating code for a subgraph which is composed of grad ops.
      
      * Use expression information to check the accuracy in unittest.
      
      * Separate load and store from computation expressions.
      test=develop
      
      * Improve the loading statements in generated codes.
      test=develop
      
      * Remove unused arguments from formal list.
      test=develop
      6b1e1f0d
    • Z
      Fix topk compile failed on windows (#21243) · 3ff5cc2d
      zhaoyuchen2018 提交于
      * Fix topk compile failed on windows
      * Use explicit cast for assign data
      3ff5cc2d
    • P
      fix trt weight bug (#21231) · 2e2f92a5
      Pei Yang 提交于
      added splitter "__" between weight name and suffix number to avoid conflicts.
      2e2f92a5
    • Z
      optimize assign op to avoid copy data from GPU to GPU (#21181) · 01a96463
      Zhang Ting 提交于
      * optimize assign op to avoid copy data from GPU to GPU, test=develop
      
      * modified GetkernelTypeForVar and just avoid device transform, test=develop
      01a96463
  2. 19 11月, 2019 6 次提交
  3. 18 11月, 2019 6 次提交
  4. 15 11月, 2019 5 次提交
  5. 14 11月, 2019 8 次提交
  6. 13 11月, 2019 3 次提交
  7. 12 11月, 2019 6 次提交
  8. 11 11月, 2019 2 次提交
    • Z
      e0285eae
    • Y
      Add the check of lod_level between compile-time and runtime. (#20961) · 35f17ae2
      Yiqun Liu 提交于
      * Add the check of lod_level between compile-time and runtime.
      test=develop
      
      * Fix bug in check_compile_vs_runtime.
      test=develop
      
      * Fix the check of output when it is dispensiable or intermediate.
      test=develop
      
      * Share lod of x to out in match_matrix_tensor op in compile-time.
      
      * Implement GetLoDLevel in InferShapeContext.
      
      * Set the default value of check_compile_vs_runtime to False and enable it in test_sequence_pad_op.
      test=develop
      
      * Enable check_compile_vs_runtime in test_match_matrix_tensor.
      
      * Add the implementation of SetLoDLevel in InferShapeContext.
      
      * Remove the implementation of IncreaseLoDLevel and call Get/SetLoDLevel instead.
      
      * Remove the implementation of DecreaseLoDLevel and call Set/GetLoDLevel instead.
      
      * Refine some ops and unittests.
      test=develop
      
      * Fix a typo.
      test=develop
      
      * Remove the check of var type, and change int to int32_t.
      test=develop
      
      * Add unittest for Get/SetLoDLevel.
      test=develop
      35f17ae2