1. 16 Dec, 2021 3 commits
    • support eager switch system (#38170) · 8305c2be
      Jiabin Yang committed
      * support eager switch system
      
      * polish code
    • Add sparse_attention mask, test=develop (#37973) · fa463b90
      Liu-xiandong committed
      Add key_padding_mask and attn_mask to the sparse_attention API
      
      1. The key padding mask is a tensor with dimensions [batch_size, seq_len], and the attention mask is a tensor with dimensions [seq_len, seq_len]. Both masks use the same data type as Q, K, and V, which is float32 or float64. A value of 0 in a mask means that the position is masked.
      
      2. The changed files are mainly paddle/fluid/operators/sparse_attention_op.cu and python/paddle/fluid/tests/unittests/test_sparse_attention_op.py. sparse_attention has three parts: sddmm, softmax, and dsd. Adding the mask only requires modifying the softmax part; the other two parts are unaffected. Related tests have been added to cover the mask function.
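
The mask semantics described in this commit message can be illustrated with a minimal pure-Python sketch of a masked softmax. This is not the CUDA kernel from sparse_attention_op.cu: it works on one dense [seq_len, seq_len] score matrix for a single batch item and ignores the sparse layout entirely. The function name `masked_softmax` and the choice to return an all-zero row when every position is masked are assumptions made for illustration.

```python
import math


def masked_softmax(scores, key_padding_mask, attn_mask):
    """Row-wise softmax over `scores` with two masks (hypothetical helper).

    scores:           [seq_len][seq_len] attention scores for one batch item
    key_padding_mask: [seq_len] row of the [batch_size, seq_len] mask
    attn_mask:        [seq_len][seq_len] mask
    A mask value of 0 means the position is masked out.
    """
    result = []
    for i, row in enumerate(scores):
        # A key position j is visible only if neither mask is 0 there.
        keep = [
            key_padding_mask[j] != 0 and attn_mask[i][j] != 0
            for j in range(len(row))
        ]
        visible = [s for s, k in zip(row, keep) if k]
        if not visible:
            # Assumption: a fully masked row yields all zeros.
            result.append([0.0] * len(row))
            continue
        row_max = max(visible)  # subtract the max for numerical stability
        exps = [math.exp(s - row_max) if k else 0.0 for s, k in zip(row, keep)]
        total = sum(exps)
        result.append([e / total for e in exps])
    return result


scores = [[1.0, 2.0, 3.0], [1.0, 1.0, 1.0], [0.0, 0.0, 0.0]]
key_padding_mask = [1.0, 1.0, 0.0]           # last key is padding
attn_mask = [[1, 0, 0], [1, 1, 0], [1, 1, 1]]  # causal-style mask
probs = masked_softmax(scores, key_padding_mask, attn_mask)
```

Masked positions receive zero probability, and the remaining positions in each row are renormalized, which is why only the softmax step needs to know about the masks.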
    • Enabled Eager AutoCodeGen for All Existing Operators & Possible Future Operators (#37969) · 08482a86
      Zhanlue Yang committed
      * Rearranged Eager AutoCodeGen directory structure
      
      * Removed USE_OP in Eager AutoCodeGen
      
      * Enabled generation for Operators without Grad/Inputs/Outputs
      
      * Resolved operators without input
      
      * Fixed merge conflicts
      
      * Enabled Eager AutoCodeGen for 10+ more operators
      
      * Refactored Eager AutoCodeGen with more organized helper objects
      
      * Enabled Eager AutoCodeGen for operators with multiple OpBases
      
      * Adjusted Eager AutoCodeGen to Enable Passing Output Tensor as Input Argument
      
      * Handled Dispensable Inputs/Outputs in Eager AutoCodeGen
      
      * Enabled Eager AutoCodeGen for All Existing Operators & Possible Future Operators
      
      * Fixed CI issues
  2. 15 Dec, 2021 1 commit
    • Synchronized auto-generated Python-C API with Dygraph Forward Functions (#38017) · 77dfb2e8
      Zhanlue Yang committed
      * Rearranged Eager AutoCodeGen directory structure
      
      * Removed USE_OP in Eager AutoCodeGen
      
      * Enabled generation for Operators without Grad/Inputs/Outputs
      
      * Resolved operators without input
      
      * Fixed merge conflicts
      
      * Enabled Eager AutoCodeGen for 10+ more operators
      
      * Refactored Eager AutoCodeGen with more organized helper objects
      
      * Enabled Eager AutoCodeGen for operators with multiple OpBases
      
      * Adjusted Eager AutoCodeGen to Enable Passing Output Tensor as Input Argument
      
      * Handled Dispensable Inputs/Outputs in Eager AutoCodeGen
      
      * Adjusted function generation/call between Python-C API & Dygraph API
      
      * Synchronized auto-generated Python-C API with Dygraph Forward Functions
      
      * Added safe_initialized interface to EagerTensor for use in processing dispensable inputs
  3. 14 Dec, 2021 1 commit
  4. 13 Dec, 2021 3 commits
  5. 10 Dec, 2021 1 commit
  6. 09 Dec, 2021 4 commits
  7. 08 Dec, 2021 2 commits
    • [Eager] coreops to 495 (#37926) · aff7397b
      wanghuancoder committed
      * refine a test case, test=develop
      
      * publish python c api for eager, test=develop
      
      * revert modify about test_allclose_layer.py, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * delete numpy includes, use pybind11 numpy.h, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * support eager error msg, and add grad test case, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * generate eager core ops, only 4 ops, test=develop
      
      * Rearranged Eager AutoCodeGen directory structure
      
      * Removed USE_OP in Eager AutoCodeGen
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * open 500 list
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * fix auto code gen, test=develop
      
      * Enabled generation for Operators without Grad/Inputs/Outputs
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * add to pyobject, test=develop
      
      * Resolved operators without input
      
      * merge pr 37837
      
      * refine
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      Co-authored-by: jim19930609 <jim19930609@gmail.com>
    • [Eager] generate eager core ops, only 4 ops (#37813) · 52f63cd2
      wanghuancoder committed
      * refine a test case, test=develop
      
      * publish python c api for eager, test=develop
      
      * revert modify about test_allclose_layer.py, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * delete numpy includes, use pybind11 numpy.h, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * support eager error msg, and add grad test case, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * generate eager core ops, only 4 ops, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
  8. 07 Dec, 2021 5 commits
  9. 06 Dec, 2021 1 commit
  10. 03 Dec, 2021 3 commits
    • Fix _numel func logic and add test (#37810) · 075a02d2
      Weilong Wu committed
    • refine structure for cuda and rocm (#37202) · a6d2fddb
      ronnywang committed
      * refine structure for cuda and rocm
      
      * update
      
      * update
      
      * update
      
      * update
    • [Eager] publish python c api for eager (#37550) · 07b4fe93
      wanghuancoder committed
      * refine a test case, test=develop
      
      * publish python c api for eager, test=develop
      
      * revert modify about test_allclose_layer.py, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * delete numpy includes, use pybind11 numpy.h, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * support eager error msg, and add grad test case, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
  11. 02 Dec, 2021 1 commit
  12. 01 Dec, 2021 2 commits
  13. 30 Nov, 2021 2 commits
  14. 27 Nov, 2021 1 commit
    • [NPU] reorganization for device API abstraction (#37110) · 72241a6a
      Aganlengzi committed
      * [NPU] reorganization for device API abstraction
      
      * [NPU] delete old files
      
      * [NPU] fix npu_collective_helper
      
      * [NPU] fix collective_helper
      
      * [NPU] fix ut
      
      * [NPU] mod memory allocation and hccl_helper
      
      * [NPU] fix place_type
      
      * [NPU] split enforce.h
      
      * move acl* call into npu_info
      
      * merge conflict
      
      * fix merge
      
      * merge conflict
      
      * merge conflict
  15. 26 Nov, 2021 3 commits
    • upgrade async distributed training in pscore (#37515) · 74605fc2
      zhaocaibei123 committed
      * test
      
      * test
      
      * rm test
      
      * update
      
      * update
      
      * update
      
      * add unittest
      
      * update
      
      * update save
    • Added interface reset_grad_inplace_version (#37573) · dcb91fd7
      Zhanlue Yang committed
      reset_inplace_version removes all inplace-related records from VarBase/VariableWrapper. Its purpose is to let you use inplace operations as if they were their non-inplace versions, which will cause unexpected consequences if not used with care.
      
      This is essentially a hack interface to satisfy one specific request.
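
Why resetting inplace records is risky can be shown with a toy model. The sketch below is not Paddle's VarBase/VariableWrapper implementation; `ToyTensor`, `SavedForBackward`, and the counter-based check are hypothetical names and mechanics, assumed only to illustrate the general idea of a version counter that guards against stale saved values.

```python
class ToyTensor:
    """Hypothetical stand-in for a tensor that records in-place writes."""

    def __init__(self, data):
        self.data = list(data)
        self.inplace_version = 0  # incremented by every in-place op

    def add_(self, value):
        # In-place add: overwrites the data and bumps the version counter.
        self.data = [x + value for x in self.data]
        self.inplace_version += 1
        return self

    def reset_inplace_version(self):
        # Forget all in-place records, as if no in-place op had ever run.
        self.inplace_version = 0


class SavedForBackward:
    """Hypothetical snapshot of a tensor's version at save time."""

    def __init__(self, tensor):
        self.tensor = tensor
        self.saved_version = tensor.inplace_version

    def check(self):
        # Refuse to use the tensor if it changed in place after saving.
        if self.tensor.inplace_version != self.saved_version:
            raise RuntimeError("tensor was modified in place after being saved")
        return self.tensor.data


x = ToyTensor([1.0, 2.0])
saved = SavedForBackward(x)
x.add_(1.0)                # saved.check() would now raise RuntimeError
x.reset_inplace_version()  # the guard is silenced
values = saved.check()     # returns the modified data without complaint
```

After the reset, the check passes even though the data no longer matches what was saved, which is the kind of unexpected consequence the commit message warns about.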
    • TDM2 (#37044) · 4826167c
      wangzhen38 committed
      * add tdm sample
      
      * add tdm sample in c++
      
      * update tdm sample
      
      * modify sample count
      
      * fix conflict
      
      * add set_date
      
      * fix cmake error
      
      * fix bug of proto
      
      * update index_dataset proto
      
      * update cmake
      
      * fix error cmake
      
      * fix cmake mkldnn
      
      * fix cmake proto
      
      * update cmake proto
      
      * update cmake
      
      * update rec
      
      * update dataset
      
      * update dataset
      
      * update dataset
      
      * update dataset
      
      * update dataset
      
      * update coverage
      
      * update ci
      
      * goback4
      
      * fix npu ci
      
      * add xxhash dep
  16. 25 Nov, 2021 1 commit
  17. 23 Nov, 2021 4 commits
  18. 22 Nov, 2021 2 commits