1. 07 Sep, 2022 1 commit
    • [Phi] Migrate save kernel (#45665) · fc66fdb7
      Committed by Chen Weihang
      * add save kernel
      
      * add save_sr_kernel
      
      * remove original save_op
      
      * add save gpu kernel
      
      * remove combine kernel
      
      * add port.h include
      
      * add save selected rows test
      
      * remove useless kernel.h
  2. 02 Jul, 2022 1 commit
    • unify cpu context, part2 (#44012) · 755438a7
      Committed by Leo Chen
      * fix init()
      
      * delete test_device_context
      
      * replace CPUDeviceContext with CPUContext
      
      * fix test_scalar
      
      * remove dot_op.cc
      
      * fix compile
  3. 26 Jun, 2022 1 commit
  4. 05 Jun, 2022 1 commit
  5. 17 Jun, 2021 1 commit
  6. 08 Apr, 2021 1 commit
  7. 16 Oct, 2020 1 commit
    • Incorporate cudnn_lstm into LSTM api (#27217) · fa9d3fa5
      Committed by Guo Sheng
      * Incorporate cudnn_lstm into LSTM api.
      test=develop
      
      * Make coalesce_tensor support alignment optionally.
      test=develop
      
      * Reorganize RNN apis. test=develop
      
      * Fix cudnn rnn layout conversion.
      test=develop
      
      * Add sequence_length support for RNN cudnn implementation.
      Add optional init_h and init_c gradient for cudnn_lstm_op.
      test=develop
      
      * Use create_parameter for rnn cudnn impl.
      test=develop
      
      * Move `self._flat_weight = self.create_parameter()` in RNNBase to main_program.
      test=develop
      
      * Update RNN api unittest to use set_device.
      test=develop
      
      * Fix set_place for unit tests of RNN apis.
      test=develop
      
      * Fix use_align in coalesce_tensor_op.
      test=develop
      
      * Adjust RNN apis arguments according to comments.
      test=develop
      
      * Polish documents for SimpleRNN apis.
      test=develop
      
      * Refine random seed in cudnn_lstm_op.
      Expose rnn params from sublayers to RNN.
      test=develop
      
      * Fix RNN saving for jit.save.
      Refine cudnn_lstm dropout behavior.
      test=develop
      
      * Fix doc of GRU. test=develop
      
      * Use ShareDataWith to avoid copying for cudnn_lstm_op test.
      test=develop
      
      * Remove updates on cudnn_lstm temporarily.
      test=develop
      
      * Use ShareDataWith to avoid copying for cudnn_lstm_op test.
      test=develop
      
      * Refine random seed in cudnn_lstm_op.
      test=develop
      
      * Fix test_lstm by adjusting ConcreteProgram buffer getter.
      test=develop
      
      * Use create_parameter instead of create_var for rnn._flat_weight for static graph usage.
      test=develop
      
      * Remove W input for cudnn_lstm to pass unused_var_check.
      test=develop
      
      * Add test_predict for RNN unit tests coverage.
      test=develop
      
      * Fix code style of rnn.
      test=develop
      
      * Fix F.rnn usage in rnn.py.
      test=develop
  8. 26 Apr, 2020 1 commit
    • improve efficiency of runtime InferVarType (#22778) · 9a93f6aa
      Committed by liuwei1031
      * save InferVarType changes, test=develop
      
      * remove code comments, test=develop
      
      * tweak code, test=develop
      
      * fix compilation warning, update merge_ids_op split_ids_op to new interface, test=develop
      
      * modify fused_bn_activation_op, test=develop
      
      * fix error of fused_bn_activation_op, test=develop
      
      * fix PADDLE_ENFORCE and unittest coverage issue, test=develop
      
      * tweak PADDLE_ENFORCE messages, test=develop
      
      * improve unittest coverage, test=develop
      
      * add StaticGraphInferVarType class, test=develop
      
      * rebase develop branch, test=develop
      
      * fix unittest error, test=develop
      
      * remove comments, test=develop
      
      * improve unittest coverage, test=develop
      
      * improve error message and improve unittest coverage, test=develop
      
      * upgrade InferVarType API, test=develop
      
      * tweak pyfunc error message, test=develop
      
      * fix compilation conflict - save_combine_op, test=develop
  9. 07 Feb, 2020 1 commit
  10. 06 Dec, 2019 1 commit
  11. 16 Oct, 2019 1 commit
    • Add IndicateVarDataType interface to block tensor is not initialized problem in OP GetExpectedKernelType (#20044) · 003f369b
      Committed by Chen Weihang
      
      * add indicate_var_data_type interface, test=develop
      
      * add unittests & polish error message, test=develop
      
      * remove needless include, test=develop
      
      * extract public function & polish message, test=develop
      
      * delete empty var check, test=develop
      
      * change data_type to pointer parameter, test=develop
      
      * polish details, test=develop
  12. 22 Apr, 2019 1 commit
  13. 20 Mar, 2019 2 commits
  14. 19 Mar, 2019 1 commit
  15. 18 Mar, 2019 1 commit
  16. 15 Mar, 2019 1 commit
  17. 28 Dec, 2018 1 commit
  18. 12 Dec, 2018 1 commit
  19. 26 Nov, 2018 1 commit
  20. 08 Nov, 2018 2 commits
  21. 05 Nov, 2018 1 commit
  22. 03 Sep, 2018 1 commit
  23. 26 Aug, 2018 1 commit
  24. 14 Aug, 2018 1 commit
  25. 25 Jun, 2018 1 commit
  26. 22 Jun, 2018 2 commits
  27. 20 Jun, 2018 1 commit
  28. 19 Jun, 2018 4 commits
  29. 15 Jun, 2018 3 commits
  30. 13 Jun, 2018 1 commit
  31. 08 May, 2018 1 commit
    • Clean OpProtoAndCheckerMaker · 0e78cb69
      Committed by Yu Yang
      Do not use ctor
      
      * Reduce lines of code.
      * We can use virtual function for Maker now.
      * The implementation does not care what maker holds, it is easier to
      refactor later.
  32. 04 May, 2018 1 commit