1. 27 Jul, 2023 2 commits
  2. 26 Jul, 2023 16 commits
  3. 25 Jul, 2023 7 commits
    • Bugfix, fast layer norm, OOB (#55639) · 017a6164
      Jeng Bai-Cheng committed
      * Fix LayerNormForward perf issue
      
      * Bugfix, fast_layer_norm OOB
      
      * apply pre-commit
      
      ---------
      Co-authored-by: Shijie Wang <jaywan@nvidia.com>
    • f9e1b2d2
    • c737f0ae
    • [NewIR]new ir dygraph to static support gpu (#55620) · fb9bec5d
      hong committed
      * add kernel dialect
      
      * change DenseTensorTypeStorage to DenseTensorType
      
      * add test case
      
      * add first pd_op to kernel dialect
      
      * lower pd op to kernel dialect
      
      * update
      
      * update
      
      * remove useless code
      
      * add attribute print test
      
      * fix bug
      
      * update
      
      * update
      
      * update
      
      * update
      
      * polish code
      
      * fix bug
      
      * polish code and add python test
      
      * add test
      
      * fix test error
      
      * relax constraint when inserting get_parameter
      
      * add env flag
      
      * fix bug
      
      * dygraph2static support new ir
      
      * fix bug
      
      * revert test env
      
      * change cc_test_old to cc_test
      
      * update
      
      * fix build_static bug
      
      * update test
      
      * fix type test error
      
      * update cmake
      
      * disable test in windows
      
      * fix inference compile
      
      * fix program translator error
      
      * only run on cpu, not support gpu yet
      
      * fix conflict
      
      * polish code
      
      * fix bug
      
      * add feed with place op
      
      * update
      
      * remove useless unittest
      
      * update mkldnn
      
      * update
      
      * update
      
      * align mkldnn version
      
      * new ir support builtin slice op
      
      * fix bug
      
      * fix phi kernel adaptor bug
      
      * add enable static
      
      * add enable_static
      
      * remove useless test case
      
      * change feed list to single variable
      
      * update
      
      * add feed with place and shadow output op
      
      * fix bug
      
      * remove useless code
      
      * support gpu
      
      * fix bug
      
      * fix bug
      
      * remove template
      
      * add more data type
      
      * fix compile bug
      
      * update
      
      * remove useless code
      
      * revert dygraph2st test
      
      * remove useless code
      
      * revert op
      
      * fix bug
      
      * new ir dygraph2static support gpu
      
      * remove useless code
      
      * code polish
      
      * add const
      
      * revert code and remove useless code
      
      * revert code
      
      * revert legacy op yaml
      
      * remove useless code
      
      * delete std::move
      
      ---------
      Co-authored-by: kangguangli <kangguangli@hotmail.com>
    • [BugFix] fix random fail of test_bilinear_interp_v2_op (#55643) · 98c7a3e0
      kangguangli committed
      * fix random fail of test_bilinear_interp_v2_op
      
      * reset if compiledProgram
    • Fix memory error caused by the grad_fn next_functions API - (#55627) · 03a2f187
      qiuwenbo committed
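      * [Experiment] Add an attribute to tensor whose value is the constant 1

      * Expose gradnode and add a new gradnode method (for testing), exposed so that it is accessible from the Python side

      * Implement the grad_fn and next_functions APIs and expose them to Python; apply some normalization

      * Add a unit test

      * Improve code style

      * Move the unit test file to the correct location

      * Improve code style

      * Remove unused comments

      * Fix __main__ has no attribute

      * Update the unit test file

      * Update the unit test script (temp)

      * Fix memory error caused by the grad_fn next_functions API

      * Update unit test content

      * Fix code-style issues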
    • 05a40691
  4. 24 Jul, 2023 8 commits
  5. 22 Jul, 2023 1 commit
  6. 21 Jul, 2023 3 commits
  7. 20 Jul, 2023 3 commits