1. 02 8月, 2023 1 次提交
    • Y
      [Inference] Replace groupNorm when data types are bf16 and fp16, and data... · e61d892a
      yangjianfengo1 提交于
      [Inference] Replace groupNorm when data types are bf16 and fp16, and data format is NHWC implementation. (#55399)
      
      * finish
      
      * cpergroup odd
      
      * fix bf16
      
      * single channel
      
      * code style
      
      * jingdu duiqi
      
      * add head_file
      
      * add bf16 head file
      
      * bf16 2
      
      * bf16
      
      * bf16 head
      
      * bf16 compile
      
      * py test
      
      * bf16 compile
      
      * bf16 compile
      
      * unset py test
      
      * nhwc
      
      * test
      
      * mean var
      
      * bf16 success
      
      * su
      
      * ctest success
      
      * use is_same_as
      
      * is_same
      
      * use is_same
      
      * rtol
      
      * gpu_stream
      
      * del sigmod
      
      * fix bfloat16 type
      
      * use cuda_bf16_hpp
      
      * use_cuda_arch
      
      * bfloat162float2
      
      * del inplace_tol
      
      * del max_releative_tol
      
      * temp store
      
      * jingdu duiqi
      
      * temp store
      
      * plugin
      
      * jingdu duiqi
      
      * duiqi
      
      * include cuda.h
      
      * del half
      
      * half single
      
      * ci
      
      * add const
      
      * ci
      
      * cudamemset
      
      * del printf
      
      * fp16 test
      
      * add half compute
      
      * del br16 ci
      
      * del ci
      
      * ci approve
      
      * del fluid include
      e61d892a
  2. 25 11月, 2022 1 次提交
  3. 19 8月, 2022 1 次提交
  4. 08 6月, 2022 1 次提交
  5. 24 3月, 2022 1 次提交
  6. 21 3月, 2022 1 次提交
  7. 18 3月, 2022 1 次提交
  8. 23 2月, 2022 1 次提交
  9. 20 2月, 2022 1 次提交
  10. 09 2月, 2022 2 次提交
    • H
      Move trace op to pten (#39227) · d7dddf94
      hong 提交于
      * add trace op
      
      * bug fix
      
      * bug fix; test=develop
      
      * thrust bug fix; test=develop
      
      * remove useless register; test=develop
      
      * fix bug; test=develop
      
      * update trace kernel; test=develop
      
      * move kernel args to trace_sig; test=develop
      d7dddf94
    • H
      Move norm to pten (#39324) · ece200b3
      hong 提交于
      * add norm cpu
      
      * update code;
      
      * norm bug fix
      
      * move norm op to pten; test=develop
      
      * move norm op to pten; test=develop
      
      * add norm util; test=develop
      
      * fix norm npu bug; test=develop
      
      * fix norm kernel bug; test=develop
      
      * move kernel args to pten; test=develop
      
      * move kernel args to pten sig; test=develop
      ece200b3
  11. 28 1月, 2022 1 次提交
    • H
      Move digamma to pten (#39240) · 848ae7dc
      hong 提交于
      * move digamma to pten; test=develop
      
      * fix mutable_data bugs; test=develop
      
      * remove useless code; test=develop
      
      * remove kernel compute; test=develop
      
      * fix bug; test=develop
      848ae7dc
  12. 17 1月, 2022 2 次提交
  13. 10 12月, 2021 1 次提交
  14. 24 9月, 2020 1 次提交
    • W
      use iwyu clean include (#27267) · df43905f
      wanghuancoder 提交于
      * use iwyu clean include, test=develop, test=win
      
      * compilation error, test=develop
      
      * fix compilation error2, test=develop
      
      * fix compilation error3, test=develop
      
      * fix compilation error4, test=develop
      
      * fix compilation error5, test=develop
      
      * fix compilation error6, test=develop
      
      * fix compilation error7, test=develop
      
      * fix compilation error8, test=develop
      
      * fix compilation error8, test=develop
      
      * fix compilation error10, test=develop
      
      * fix compilation error11, test=develop
      df43905f
  15. 28 3月, 2019 1 次提交
    • C
      Fix the interface of Pass::Apply (#16484) · ed61d67c
      chengduo 提交于
      * modify the interface of Pass::Allay
      test=develop
      
      * Polish code
      test=develop
      
      * Fix Travis CI
      test=develop
      
      * fix Pass::Apply interface
      test=develop
      
      * Fix Travis CI
      test=develop
      ed61d67c
  16. 21 2月, 2019 1 次提交
  17. 31 1月, 2019 1 次提交
  18. 29 8月, 2018 1 次提交
  19. 21 8月, 2018 1 次提交
  20. 09 8月, 2018 1 次提交
  21. 26 7月, 2018 4 次提交
  22. 22 7月, 2018 1 次提交
  23. 19 7月, 2018 1 次提交
  24. 18 7月, 2018 4 次提交
  25. 26 6月, 2018 1 次提交
  26. 11 6月, 2018 1 次提交
  27. 08 6月, 2018 1 次提交
  28. 06 6月, 2018 1 次提交
    • Y
      SSA Graph Builder Factory · d9af1532
      yuyang18 提交于
      * Use Builder Chain to decorate new builders. It is easy to extend
        builders.
      * Make graphviz path as a build strategy, not a FLAGS.
      d9af1532
  29. 15 5月, 2018 1 次提交
  30. 14 5月, 2018 2 次提交
  31. 07 4月, 2018 1 次提交