1. 26 7月, 2023 3 次提交
  2. 25 7月, 2023 7 次提交
    • L
      8db3ff1f
    • J
      Bugfix, fast layer norm, OOB (#55639) · 017a6164
      Jeng Bai-Cheng 提交于
      * Fix LayerNormForward perf issue
      
      * Bugfix, fast_layer_norm OOB
      
      * apply pre-commit
      
      ---------
      Co-authored-by: NShijie Wang <jaywan@nvidia.com>
      017a6164
    • c737f0ae
    • L
      fix bugs in rnn op (#55656) · 0cd422b6
      Lucas 提交于
      0cd422b6
    • W
      fix div 0 bug (#55644) · 690ffe81
      wanghuancoder 提交于
      690ffe81
    • H
      [NewIR]new ir dygraph to static supoort gpu (#55620) · fb9bec5d
      hong 提交于
      * add kernel dialect
      
      * change DenseTensorTypeStorage to DenseTensorType
      
      * add test case`
      
      * add first pd_op to kernel dialect
      
      * lower pd op to kernel dialect
      
      * update
      
      * update
      
      * remove useless code
      
      * add attrite print test
      
      * fix bug
      
      * update
      
      * update
      
      * update
      
      * update
      
      * polish code
      
      * fix bug
      
      * polish  code  and add python test
      
      * add test
      
      * fix test error
      
      * relax constraint when inserting get_parameter
      
      * add env flag
      
      * fix bug
      
      * dygraph2static support new ir
      
      * fix bug
      
      * revert test env
      
      * change cc_test_old to cc_test
      
      * update
      
      * fix build_static bug
      
      * update test
      
      * fix type test error
      
      * udpate cmake
      
      * disable test in windows
      
      * fix inference compile
      
      * fix program translator error
      
      * only run on cpu, not support gpu yet
      
      * fix conflict
      
      * polish code
      
      * fix bug
      
      * add feed with place op
      
      * update
      
      * remove useless unitest
      
      * udpate mkldnn
      
      * update
      
      * update
      
      * align mkldnn version
      
      * new ir support builtin slice op
      
      * fix bug
      
      * fix phi kernel adaptor bug
      
      * add enable static
      
      * add enable_static
      
      * remove useless test case
      
      * change feed list to single variable
      
      * update
      
      * add feed with place and shaddow output op
      
      * fix bug
      
      * remove usless code
      
      * support gpu
      
      * fix bug
      
      * fix bug
      
      * remove template
      
      * add more data type
      
      * fix cimpile bug
      
      * udpate
      
      * remove useless code
      
      * revert dygraph2st test
      
      * remove usless code
      
      * revert op
      
      * fix bug
      
      * new ir dygraph2static support gpu
      
      * remove usless code
      
      * code polish
      
      * add const
      
      * revert code and remove useless code
      
      * revert code
      
      * revert legacy op yaml
      
      * remove useless code
      
      * delete std::move
      
      ---------
      Co-authored-by: Nkangguangli <kangguangli@hotmail.com>
      fb9bec5d
    • J
      [XPU] Add FP16 support for arg_min_max (#55642) · 14094aad
      jiangfan06 提交于
      14094aad
  3. 24 7月, 2023 1 次提交
  4. 20 7月, 2023 4 次提交
  5. 19 7月, 2023 3 次提交
    • H
      [NewIR]Add feed with place op (#55343) · 8e9e0659
      hong 提交于
      * add feed with place op
      
      * remove useless unitest
      
      * udpate mkldnn
      
      * update
      
      * add enable_static
      
      * remove useless test case
      
      * register int and doubel type
      
      * fix bug
      8e9e0659
    • Z
      delete relu6_raw (#55383) · 56d46ccc
      zhangyuqin1998 提交于
      * delete relu6_raw
      
      * fix codestyle
      
      * Update test_mkldnn_matmul_activation_fuse_pass.py
      
      * fix
      
      * Update backward.yaml
      
      * Update ops.yaml
      
      * Update backward.yaml
      56d46ccc
    • S
      Fix mea segmentation fault error (#55408) · cc262c55
      sneaxiy 提交于
      * fix mea seg fault develop
      
      * fix bias_grad seg fault
      cc262c55
  6. 18 7月, 2023 2 次提交
    • G
      batch add inpalce api (#55078) · 19302938
      GGBond8488 提交于
      * batch add inpalce api
      
      * fix inplace fn generate
      
      * add test for  new inpalce api
      
      * fix typro
      
      * fix typro
      
      * fix typro
      
      * fix test error
      
      * fix atan2
      
      * remove atan2
      
      * auto genereate inpalce api
      
      * fix inplace generate fn error
      
      * fix windows error
      
      * fix test error
      
      * fix test error
      
      * fix windows ci error
      
      * fix test error
      
      * fix test_error
      
      * fix test error
      
      * fix eigen aliasing error in inplace
      
      * remove elementwise_pow inplace
      
      * fix doc error
      
      * fix test error
      19302938
    • H
      [NewIR]Fix new ir concat split bug (#55419) · 5e6645d7
      hong 提交于
      * fix new ir concat op bug
      
      * fix bug
      
      * using add_n_with_kernel instead of add_n impl
      
      * fix pd_op yaml bug
      
      * fix bug
      5e6645d7
  7. 17 7月, 2023 2 次提交
  8. 14 7月, 2023 4 次提交
  9. 13 7月, 2023 7 次提交
  10. 12 7月, 2023 3 次提交
    • FormlessUnit's avatar
      Fix llm int8 build error (#55338) · 006bd959
      FormlessUnit 提交于
      * add macro to avoid llm.int8 build error
      
      * fix ci
      
      ---------
      Co-authored-by: Nwufeisheng <wfs1997@163.com>
      006bd959
    • Y
      [ONEDNN] Upgrade oneDNN version to v3.1 (#52463) · cfa513f7
      YangQun 提交于
      * squash pick the poc code
      * fix build after rebase
      * fix int8 conv and fc uts
      * Fix and clean-up Get_SRC_Scale_Memory
      * fix floating point fc uts
      * fix test_analyzer_int8_googlenet
      * test_analyzer_int8_mobilenetv1
      * fix int8 mobilenet v2 and v3
      * fix build error after rebase
      * [oneDNN] rename library version
      * fix conv bias datatype
      * try to fix import error
      * fix rebase error
      * [oneDNN] pack library into python wheel
      * add MKLDNN_SHARED_LIB_3 to env_dict
      * fix test_analyzer_bert
      * fix fill_constant op kernel
      * fix ernie and matmul op ut
      * fix softplus ut
      * fix conv+relu6 fusion ut
      * fix hardswish fusion
      * fix quant+transpose fusion ut
      * fixsgd ut
      * fix int8 matmul with flatten
      * fix fc+scale fusion
      * fix conv/matmul+gelu fusion uts
      * fix rebase error
      * Revert "fix conv/matmul+gelu fusion uts"
      This reverts commit 47eb5e49972bd8f7271a233def9bfb3e98ce78e1.
      * upgrade to onednn v3.1
      * remove older version onednn
      * use densetensor::data() for achieving mean and var in layernorm impl
      * comments for atol of integer tests
      * fix clang-format
      * Revert "remove older version onednn"
      This reverts commit 783e57ddfd4401254596eae7d47adb9b03590c09.
      * improve binary handle
      * fix expand kernel
      * Revert "use densetensor::data() for achieving mean and var in layernorm impl"
      * always use forward_inference for conv
      * remove activation scales
      * rollback changes to mkldnn.cmake
      * address comments
      * port changes to dequantize kernel
      * fix merge error
      * fix fused_elementwise_kernel
      * upgrade onednn version to v3.1.1
      * fix some approval error
      * fix error msg format
      * remove old onednn libs
      * try to fix symbolic link issue
      * fix cinn test case segfault
      * do not explicit link test with onednn
      * remove unnecessary changes
      * integrate CINN with onednn v3
      * link with mkldnn project
      * fix cinn build file
      
      ---------
      Co-authored-by: NTomasz Socha <tomasz.socha@intel.com>
      Co-authored-by: NChen, Xinyu1 <xinyu1.chen@intel.com>
      Co-authored-by: Ntianshuo78520a <707759223@qq.com>
      cfa513f7
    • W
      [clang-tidy] enable `readability-container-size-empty` check (#55279) · be3a6fa7
      Wang Xin 提交于
      * [clang-tidy] enable readability-container-size-empty check
      
      * fix test_custom_kernel Failed
      
      * add clang-tid-10 in dockerfile
      
      * add clang-tidy in dockerfile
      
      * fix bug
      be3a6fa7
  11. 11 7月, 2023 3 次提交
  12. 07 7月, 2023 1 次提交