1. 31 7月, 2023 7 次提交
  2. 28 7月, 2023 4 次提交
  3. 27 7月, 2023 4 次提交
    • Z
      add int32/int64 for outer/matmul Kernel. (#55584) · ff2142f2
      zxcd 提交于
      * add int32/int64 for outer/matmul Kernel.
      
      * fix by comment.
      
      * fix by comment
      ff2142f2
    • H
      [NewIR]Fix new ir dygraph 2 static concat grad bug (#55634) · 51ebcf68
      hong 提交于
      * add kernel dialect
      
      * change DenseTensorTypeStorage to DenseTensorType
      
      * add test case`
      
      * add first pd_op to kernel dialect
      
      * lower pd op to kernel dialect
      
      * update
      
      * update
      
      * remove useless code
      
      * add attrite print test
      
      * fix bug
      
      * update
      
      * update
      
      * update
      
      * update
      
      * polish code
      
      * fix bug
      
      * polish  code  and add python test
      
      * add test
      
      * fix test error
      
      * relax constraint when inserting get_parameter
      
      * add env flag
      
      * fix bug
      
      * dygraph2static support new ir
      
      * fix bug
      
      * revert test env
      
      * change cc_test_old to cc_test
      
      * update
      
      * fix build_static bug
      
      * update test
      
      * fix type test error
      
      * udpate cmake
      
      * disable test in windows
      
      * fix inference compile
      
      * fix program translator error
      
      * only run on cpu, not support gpu yet
      
      * fix conflict
      
      * polish code
      
      * fix bug
      
      * add feed with place op
      
      * update
      
      * remove useless unitest
      
      * udpate mkldnn
      
      * update
      
      * update
      
      * align mkldnn version
      
      * new ir support builtin slice op
      
      * fix bug
      
      * fix phi kernel adaptor bug
      
      * add enable static
      
      * add enable_static
      
      * remove useless test case
      
      * change feed list to single variable
      
      * update
      
      * add feed with place and shaddow output op
      
      * fix bug
      
      * remove usless code
      
      * support gpu
      
      * fix bug
      
      * fix bug
      
      * remove template
      
      * add more data type
      
      * fix cimpile bug
      
      * udpate
      
      * remove useless code
      
      * revert dygraph2st test
      
      * remove usless code
      
      * revert op
      
      * fix bug
      
      * remove instance norm
      
      * fix concat grad bug
      
      * revert code
      
      ---------
      Co-authored-by: Nkangguangli <kangguangli@hotmail.com>
      51ebcf68
    • G
      【inplace api】batch add inplace api paddle.log_, paddle.i0_,... · 58a03d41
      GGBond8488 提交于
      【inplace api】batch add inplace api paddle.log_, paddle.i0_, paddle.nn.functional.leaky_relu_... (#55576)
      
      * batch add inplace api
      
      * add inplace test
      
      * add activation inplace
      
      * fix test
      
      * remove atan2 ge, gt, le, lt, nq
      
      * remove atan2 ge, gt, le, lt, nq
      
      * fix windows ci error
      
      * rerun ci
      
      * fix typro
      
      * fix bugs
      
      ---------
      Co-authored-by: Nzhangrui34 <v_zhangrui34@baidu.com>
      58a03d41
    • A
      cbbd940e
  4. 26 7月, 2023 5 次提交
  5. 25 7月, 2023 7 次提交
    • L
      8db3ff1f
    • J
      Bugfix, fast layer norm, OOB (#55639) · 017a6164
      Jeng Bai-Cheng 提交于
      * Fix LayerNormForward perf issue
      
      * Bugfix, fast_layer_norm OOB
      
      * apply pre-commit
      
      ---------
      Co-authored-by: NShijie Wang <jaywan@nvidia.com>
      017a6164
    • c737f0ae
    • L
      fix bugs in rnn op (#55656) · 0cd422b6
      Lucas 提交于
      0cd422b6
    • W
      fix div 0 bug (#55644) · 690ffe81
      wanghuancoder 提交于
      690ffe81
    • H
      [NewIR]new ir dygraph to static supoort gpu (#55620) · fb9bec5d
      hong 提交于
      * add kernel dialect
      
      * change DenseTensorTypeStorage to DenseTensorType
      
      * add test case`
      
      * add first pd_op to kernel dialect
      
      * lower pd op to kernel dialect
      
      * update
      
      * update
      
      * remove useless code
      
      * add attrite print test
      
      * fix bug
      
      * update
      
      * update
      
      * update
      
      * update
      
      * polish code
      
      * fix bug
      
      * polish  code  and add python test
      
      * add test
      
      * fix test error
      
      * relax constraint when inserting get_parameter
      
      * add env flag
      
      * fix bug
      
      * dygraph2static support new ir
      
      * fix bug
      
      * revert test env
      
      * change cc_test_old to cc_test
      
      * update
      
      * fix build_static bug
      
      * update test
      
      * fix type test error
      
      * udpate cmake
      
      * disable test in windows
      
      * fix inference compile
      
      * fix program translator error
      
      * only run on cpu, not support gpu yet
      
      * fix conflict
      
      * polish code
      
      * fix bug
      
      * add feed with place op
      
      * update
      
      * remove useless unitest
      
      * udpate mkldnn
      
      * update
      
      * update
      
      * align mkldnn version
      
      * new ir support builtin slice op
      
      * fix bug
      
      * fix phi kernel adaptor bug
      
      * add enable static
      
      * add enable_static
      
      * remove useless test case
      
      * change feed list to single variable
      
      * update
      
      * add feed with place and shaddow output op
      
      * fix bug
      
      * remove usless code
      
      * support gpu
      
      * fix bug
      
      * fix bug
      
      * remove template
      
      * add more data type
      
      * fix cimpile bug
      
      * udpate
      
      * remove useless code
      
      * revert dygraph2st test
      
      * remove usless code
      
      * revert op
      
      * fix bug
      
      * new ir dygraph2static support gpu
      
      * remove usless code
      
      * code polish
      
      * add const
      
      * revert code and remove useless code
      
      * revert code
      
      * revert legacy op yaml
      
      * remove useless code
      
      * delete std::move
      
      ---------
      Co-authored-by: Nkangguangli <kangguangli@hotmail.com>
      fb9bec5d
    • J
      [XPU] Add FP16 support for arg_min_max (#55642) · 14094aad
      jiangfan06 提交于
      14094aad
  6. 24 7月, 2023 3 次提交
  7. 22 7月, 2023 1 次提交
  8. 21 7月, 2023 3 次提交
  9. 20 7月, 2023 5 次提交
  10. 19 7月, 2023 1 次提交