1. 27 1月, 2021 1 次提交
    • J
      REUPLOAD Added vanilla LSTM and LSTM with peepholes oneDNN fp32 kernel (#30719) · f8da5536
      jakpiase 提交于
      * added external reorder to profiler
      
      * resolved conflict
      
      * added enable_static
      
      * initial version of lstm, not working yet
      
      * added lstm to operators.cmake
      
      * added vanilla lstm mkldnn op
      
      * added peephole weights integration
      
      * minor changes
      
      * added formatting
      
      * added fusion_lstm_mkldnn to static_whitelist
      
      * added formatting
      
      * removed comment
      
      * moved use_peepholes attribute inside is_cached block
      
      * reverted wrong changes
      
      * minor formatting change
      
      * minor changes
      
      * changed stream handling
      
      * minor change
      
      * added datatype to GetExpectedKernelType()
      
      * added reading stream from TLS
      f8da5536
  2. 26 1月, 2021 2 次提交
  3. 25 1月, 2021 2 次提交
  4. 11 1月, 2021 3 次提交
  5. 10 1月, 2021 1 次提交
  6. 06 1月, 2021 1 次提交
  7. 28 12月, 2020 1 次提交
  8. 14 12月, 2020 1 次提交
  9. 07 12月, 2020 1 次提交
    • L
      Compiling operator libraries with Unity build (#29130) · 671555ed
      LoveAn 提交于
      * Compiling operator libraries with Unity Build on Windows CPU.
      
      * Compiling operator libraries with Unity Build on Windows GPU, no_test, test=windows_ci
      
      * Add option in windows ci script, no_test, test=windows_ci
      
      * Optimize parallel compiling, test=develop
      
      * remove limit of parallel compile and skip some ops in UB, test=develop
      
      * remove changes of header file, test=develop
      
      * remove changes of header file, test=develop
      
      * fix test_eye_op unittest failed, test=develop
      
      * Compiling operator libraries with Unity Build on Linux, test=develop
      
      * set default WITH_UNITY_BUILD=OFF, test=develop
      
      * Move unity build rules into a single file and add comment, test=develop
      
      * optimize parallel compilation, test=develop
      
      * fix undefined reference error on coverage ci, test=develop
      671555ed
  10. 27 11月, 2020 1 次提交
  11. 20 11月, 2020 1 次提交
  12. 19 11月, 2020 1 次提交
  13. 12 11月, 2020 1 次提交
  14. 27 10月, 2020 1 次提交
  15. 14 10月, 2020 1 次提交
  16. 09 10月, 2020 2 次提交
  17. 04 10月, 2020 1 次提交
    • Y
      Polish the error message of fc, fused_fc_elementwise_layernorm and... · 65207b45
      Yiqun Liu 提交于
      Polish the error message of fc, fused_fc_elementwise_layernorm and fused_embedding_seq_pool. (#27692)
      
      * Polish the error message of fc_op.
      
      * Polish the error message of fused_fc_elementwise_layer_norm op.
      
      * Polish an error message in fused_embedding_seq_pool_op.
      65207b45
  18. 01 10月, 2020 1 次提交
  19. 24 9月, 2020 1 次提交
    • W
      use iwyu clean include (#27267) · df43905f
      wanghuancoder 提交于
      * use iwyu clean include, test=develop, test=win
      
      * compilation error, test=develop
      
      * fix compilation error2, test=develop
      
      * fix compilation error3, test=develop
      
      * fix compilation error4, test=develop
      
      * fix compilation error5, test=develop
      
      * fix compilation error6, test=develop
      
      * fix compilation error7, test=develop
      
      * fix compilation error8, test=develop
      
      * fix compilation error8, test=develop
      
      * fix compilation error10, test=develop
      
      * fix compilation error11, test=develop
      df43905f
  20. 23 9月, 2020 2 次提交
  21. 22 9月, 2020 1 次提交
  22. 21 9月, 2020 2 次提交
    • S
      fix bug sequececonv_eltadd_relu_fuse_pass (#27404) · d9366194
      Shang Zhizhou 提交于
      * fix bug sequececonv_eltadd_relu_fuse_pass, output error when sequence_conv's padding_start > 0
      
      * fix seqconv_eltadd_relu_fuse_pass unitest error
      d9366194
    • H
      Quant op dev (#25932) · 02606d45
      huangxu96 提交于
      * Finished ChannelWiseQuantDequantAbsMaxOp and Passed unittests.
      
      * Finished channel-wise quantize strategy in imperative quantization.
      
      * Added Cuda code of ChannelWiseQuantDequantMaxAbsOP
      Add Cuda code of ChannelWiseQuantDequantMaxAbsOp
      
      * Add quant_axis for channel_wise quant.
      
      * fixed a bug in unnitests, which will not trigger axis = 1 case and cannot meet the coverage rate requirement.
      
      * Added some assert infomation and fixed some coding style mistakes.
      02606d45
  23. 14 9月, 2020 1 次提交
  24. 12 9月, 2020 1 次提交
    • L
      Fix GRU mkldnn kernel fail on look_table_v2 (#27198) · 5c4eed66
      lidanqing 提交于
      * Fix the lookup_table_v2 failed on GRU mkldnn kernel issue
      test=develop
      
      * fix according to reviews, removed x_num_col_dims
      test=develop
      
      * update gru model. change according to reviews
      test=develop
      
      * change according to reviews
      test=develop
      5c4eed66
  25. 10 9月, 2020 1 次提交
  26. 12 8月, 2020 1 次提交
  27. 10 8月, 2020 1 次提交
  28. 06 8月, 2020 1 次提交
    • A
      Add oneDNN fusion_gru kernel (#25594) · 68c6160e
      Adam 提交于
      * Add oneDNN fusion_gru kernel and fix fc+gru pass
      test=develop
      
      * Formatting changes
      test=develop
      
      * Lint fixes
      test=develop
      
      * Add memory::format_tag::any to GRU weights
      test=develop
      
      * Fix build with CUDA
      
      * Fix build with CUDA v2
      68c6160e
  29. 05 8月, 2020 1 次提交
  30. 24 7月, 2020 1 次提交
  31. 11 5月, 2020 1 次提交
    • C
      Add macro BOOST_GET to enrich the error information of boost :: get (#24175) · aa0f254f
      Chen Weihang 提交于
      * add new macro BOOST_GET_SAFELY & unittests, test=develop
      
      * add different macro type, test=develop
      
      * fix get macro type in executor, test=develop
      
      * four macro part change backup
      
      * using one macro for all case, test=develop
      
      * revert attribute change, test=develop
      
      * change to three func to solve gcc4.8 bug, test=develop
      
      * polish some details, test=develop
      aa0f254f
  32. 26 4月, 2020 1 次提交
    • L
      improve efficiency of runtime InferVarType (#22778) · 9a93f6aa
      liuwei1031 提交于
      * save InferVarType changes, test=develop
      
      * remove code comments, test=develop
      
      * tweak code, test=develop
      
      * fix compilation warning, update merge_ids_op split_ids_op to new interface, test=develop
      
      * modify fused_bn_activation_op, test=develop
      
      * fix error of fused_bn_activation_op, test=develop
      
      * fix PADDLE_ENFORCE and unittest coverage issue, test=develop
      
      * tweak PADDLE_ENFORCE messages, test=develop
      
      * improve unittest coverage, test=develop
      
      * add StaticGraphInferVarType class, test=develop
      
      * rebase develop branch, test=develop
      
      * fix unittest error, test=develop
      
      * remove comments, test=develop
      
      * improve unittest coverage, test=develop
      
      * imporve error message and imporve unittest coverage, test=develop
      
      * upgrade InferVarType API, test=develop
      
      * tweak pyfunc error message, test=develop
      
      * fix compilation conflict - save_combine_op, test=develop
      9a93f6aa
  33. 23 4月, 2020 1 次提交