1. 02 8月, 2023 4 次提交
    • Y
      [Inference] Replace groupNorm when data types are bf16 and fp16, and data... · e61d892a
      yangjianfengo1 提交于
      [Inference] Replace groupNorm when data types are bf16 and fp16, and data format is NHWC implementation. (#55399)
      
      * finish
      
      * cpergroup odd
      
      * fix bf16
      
      * single channel
      
      * code style
      
      * jingdu duiqi
      
      * add head_file
      
      * add bf16 head file
      
      * bf16 2
      
      * bf16
      
      * bf16 head
      
      * bf16 compile
      
      * py test
      
      * bf16 compile
      
      * bf16 compile
      
      * unset py test
      
      * nhwc
      
      * test
      
      * mean var
      
      * bf16 success
      
      * su
      
      * ctest success
      
      * use is_same_as
      
      * is_same
      
      * use is_same
      
      * rtol
      
      * gpu_stream
      
      * del sigmod
      
      * fix bfloat16 type
      
      * use cuda_bf16_hpp
      
      * use_cuda_arch
      
      * bfloat162float2
      
      * del inplace_tol
      
      * del max_releative_tol
      
      * temp store
      
      * jingdu duiqi
      
      * temp store
      
      * plugin
      
      * jingdu duiqi
      
      * duiqi
      
      * include cuda.h
      
      * del half
      
      * half single
      
      * ci
      
      * add const
      
      * ci
      
      * cudamemset
      
      * del printf
      
      * fp16 test
      
      * add half compute
      
      * del br16 ci
      
      * del ci
      
      * ci approve
      
      * del fluid include
      e61d892a
    • W
      fix security bug (#55782) · 19da5c0c
      wanghuancoder 提交于
      * fix security bug
      19da5c0c
    • J
      [XPU] Add gather_squeeze_pass (#55605) · d13a49d6
      jiangfan06 提交于
      d13a49d6
    • X
      【new ir】add ir pybind api (#55745) · ef29468e
      xiaoguoguo626807 提交于
      * add ir core
      
      * add test
      
      * modify name
      
      * merge
      
      * add test for __eq__
      
      * shield  test for __eq__
      
      * --amend
      
      * Update new_ir_compiler.cc
      ef29468e
  2. 01 8月, 2023 14 次提交
  3. 31 7月, 2023 11 次提交
  4. 30 7月, 2023 1 次提交
  5. 28 7月, 2023 1 次提交
    • H
      New ir support fluid op (#55693) · b76c2f94
      hong 提交于
      * new ir support save combine
      
      * update
      
      * polish code
      
      * update
      
      * new ir support fluid op
      
      * remove depulicate op
      
      * fix ir exe test compile error
      
      * fix compile bug
      
      * update
      
      * code format
      
      * update
      
      * update
      
      * polish code
      b76c2f94
  6. 27 7月, 2023 6 次提交
    • M
      [Paddle-TRT] add flip op (#55688) · d608170a
      ming1753 提交于
      * [Paddle-TRT] add flip op
      d608170a
    • M
      paddle-TRT support float64 (#55520) · 8b063030
      ming1753 提交于
      * Paddle-TRT support float64  in/out type, support fill_any_like_op in int64
      8b063030
    • H
      [NewIR]Fix new ir dygraph 2 static concat grad bug (#55634) · 51ebcf68
      hong 提交于
      * add kernel dialect
      
      * change DenseTensorTypeStorage to DenseTensorType
      
      * add test case`
      
      * add first pd_op to kernel dialect
      
      * lower pd op to kernel dialect
      
      * update
      
      * update
      
      * remove useless code
      
      * add attrite print test
      
      * fix bug
      
      * update
      
      * update
      
      * update
      
      * update
      
      * polish code
      
      * fix bug
      
      * polish  code  and add python test
      
      * add test
      
      * fix test error
      
      * relax constraint when inserting get_parameter
      
      * add env flag
      
      * fix bug
      
      * dygraph2static support new ir
      
      * fix bug
      
      * revert test env
      
      * change cc_test_old to cc_test
      
      * update
      
      * fix build_static bug
      
      * update test
      
      * fix type test error
      
      * udpate cmake
      
      * disable test in windows
      
      * fix inference compile
      
      * fix program translator error
      
      * only run on cpu, not support gpu yet
      
      * fix conflict
      
      * polish code
      
      * fix bug
      
      * add feed with place op
      
      * update
      
      * remove useless unitest
      
      * udpate mkldnn
      
      * update
      
      * update
      
      * align mkldnn version
      
      * new ir support builtin slice op
      
      * fix bug
      
      * fix phi kernel adaptor bug
      
      * add enable static
      
      * add enable_static
      
      * remove useless test case
      
      * change feed list to single variable
      
      * update
      
      * add feed with place and shaddow output op
      
      * fix bug
      
      * remove usless code
      
      * support gpu
      
      * fix bug
      
      * fix bug
      
      * remove template
      
      * add more data type
      
      * fix cimpile bug
      
      * udpate
      
      * remove useless code
      
      * revert dygraph2st test
      
      * remove usless code
      
      * revert op
      
      * fix bug
      
      * remove instance norm
      
      * fix concat grad bug
      
      * revert code
      
      ---------
      Co-authored-by: Nkangguangli <kangguangli@hotmail.com>
      51ebcf68
    • R
      [CustomPass] add support for outputting the intermediate variables (#55728) · da258964
      ronnywang 提交于
      * add support for outputting the intermediate variables
      
      * fix fuse_rresnet_unit
      da258964
    • Z
      [IR] Fix pd.feed bug in ir BuildScope (#55720) · 43fcd01b
      zhangbo9674 提交于
      * fix bug
      
      * fix bug
      43fcd01b
    • A
      cbbd940e
  7. 26 7月, 2023 3 次提交