1. 24 Mar, 2023 4 commits
    • Del old dygraph MLU NPU (#51958) · 611f7ccc
      wanghuancoder committed
      * delete old dygraph, mlu npu do not use dygraph
    • Memory Efficient Attention (#51867) · e5ad3859
      ZhangDY-6483 committed
      * first version, notest
      
      * return final rst, notest
      
      * use infinity() instead of max
      
      * ut structure
      
      * start up of ut
      
      * generate lse
      
      * update
      
      * add depense
      
      * reconstruct cmake
      
      * move file
      
      * add memory efficient attention and fix blasimpl
      
      * update
      
      * update cmake
      
      * add namespace
      
      * update cmake
      
      * use .cu
      
      * update for pad3d
      
      * bug fix
      
      * bug fix
      
      * update
      
      * bug fix
      
      * update enforce
      
      * add test case
      
      * merge the lse pad
      
      * fix kernel_fn of backward
      
      * fix PADDLE_ENFORCE_EQ and phi_api
      
      * fix PADDLE_ENFORCE
      
      * fix PADDLE_ENFORCE
      
      * rerun coverage
      
      * fix memory efficient attention test
      
      * rerun ci
      
      * add cuda version condition
      
      * add cuda version condition
      
      * delete WIP test
      
      * replace PADDLE_ENFORCE
      
      * edit the namespace of datatype in multiple.cc
      
      * rerun
      
      * rerun
      
      ---------
      Co-authored-by: Nliuyuang <liuyuang@baidu.com>
    • do not test dygraph in dygraph (#52027) · 298a1a0b
      wanghuancoder committed
      * xpu do not test dygraph in dygraph
    • Fix roll kernel gpu bug. (#52012) · b6d0dac9
      Yuang Liu committed
  2. 23 Mar, 2023 17 commits
  3. 22 Mar, 2023 19 commits