1. 27 Jul, 2023 6 commits
  2. 26 Jul, 2023 25 commits
  3. 25 Jul, 2023 9 commits
    • 8db3ff1f
    • Bugfix, fast layer norm, OOB (#55639) · 017a6164
      Jeng Bai-Cheng committed
      * Fix LayerNormForward perf issue
      
      * Bugfix, fast_layer_norm OOB
      
      * apply pre-commit
      
      ---------
      Co-authored-by: Shijie Wang <jaywan@nvidia.com>
    • f9e1b2d2
    • c737f0ae
    • remove fluid allreduce op (#55672) · 7da1ffbe
      TaoTao Li committed
    • fix bugs in rnn op (#55656) · 0cd422b6
      Lucas committed
    • fix div 0 bug (#55644) · 690ffe81
      wanghuancoder committed
    • Update ccache (#55136) · 6093a7ed
      tianshuo78520a committed
      * Update ccache
      
      * del 3.7.9
      
      * fix error
    • [NewIR]new ir dygraph to static support gpu (#55620) · fb9bec5d
      hong committed
      * add kernel dialect
      
      * change DenseTensorTypeStorage to DenseTensorType
      
      * add test case
      
      * add first pd_op to kernel dialect
      
      * lower pd op to kernel dialect
      
      * update
      
      * update
      
      * remove useless code
      
      * add attribute print test
      
      * fix bug
      
      * update
      
      * update
      
      * update
      
      * update
      
      * polish code
      
      * fix bug
      
      * polish code and add python test
      
      * add test
      
      * fix test error
      
      * relax constraint when inserting get_parameter
      
      * add env flag
      
      * fix bug
      
      * dygraph2static support new ir
      
      * fix bug
      
      * revert test env
      
      * change cc_test_old to cc_test
      
      * update
      
      * fix build_static bug
      
      * update test
      
      * fix type test error
      
      * update cmake
      
      * disable test in windows
      
      * fix inference compile
      
      * fix program translator error
      
      * only run on cpu, not support gpu yet
      
      * fix conflict
      
      * polish code
      
      * fix bug
      
      * add feed with place op
      
      * update
      
      * remove useless unittest
      
      * update mkldnn
      
      * update
      
      * update
      
      * align mkldnn version
      
      * new ir support builtin slice op
      
      * fix bug
      
      * fix phi kernel adaptor bug
      
      * add enable static
      
      * add enable_static
      
      * remove useless test case
      
      * change feed list to single variable
      
      * update
      
      * add feed with place and shadow output op
      
      * fix bug
      
      * remove useless code
      
      * support gpu
      
      * fix bug
      
      * fix bug
      
      * remove template
      
      * add more data type
      
      * fix compile bug
      
      * update
      
      * remove useless code
      
      * revert dygraph2st test
      
      * remove useless code
      
      * revert op
      
      * fix bug
      
      * new ir dygraph2static support gpu
      
      * remove useless code
      
      * code polish
      
      * add const
      
      * revert code and remove useless code
      
      * revert code
      
      * revert legacy op yaml
      
      * remove useless code
      
      * delete std::move
      
      ---------
      Co-authored-by: kangguangli <kangguangli@hotmail.com>