1. 20 Dec, 2021 1 commit
    • Z
      Add multi_tensor for momentum optimizer and clear_grads (#37564) · 0cc5e22c
      Committed by zhangbo9674
      * add multi_tensor for momentum and clear_grads for optimizer
      
      * fix bug for dygraph
      
      * add unittest
      
      * refine comment
      
      * add param_group
      
      * refine regularization logic
      
      * del clear_grads
      
      * add clear_grads
      
      * add dispensable check of None
      
      * refine clear_grad
      
      * fix build bug
      
      * refine code by comment
      
      * refine code
      
      * add multi tensor check
      
      * refine param_group update
      
      * add multi tensor for static mode
      
      * refine comments
      
      * delete useless comma for momentum
      
      * refine comment for momentum
      
      * refine code by comment
  2. 17 Dec, 2021 2 commits
  3. 16 Dec, 2021 1 commit
    • L
      Add sparse_attention mask ,test=develop (#37973) · fa463b90
      Committed by Liu-xiandong
      Add key_padding_mask and attn_mask to the sparse_attention API
      
      1. The key padding mask is a tensor of shape [batch_size, seq_len], and the attention mask is a tensor of shape [seq_len, seq_len]. Both masks use the same data type as Q, K, and V, which is float32 or float64. A value of 0 in a mask means that the corresponding position is masked.
      
      2. The changes are mainly in paddle/fluid/operators/sparse_attention_op.cu and python/paddle/fluid/tests/unittests/test_sparse_attention_op.py. sparse_attention has three parts: sddmm, softmax, and dsd. Adding the mask only requires modifying the softmax part; the other two parts are unaffected. Tests covering the mask have also been added.
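
The mask semantics described above can be illustrated with a small, dense, pure-Python sketch. It is not the CUDA kernel from sparse_attention_op.cu: it ignores the sparse (CSR) layout and the head dimension, and the helper names `masked_softmax_row` and `apply_masks` are hypothetical. It also assumes the two masks are combined by multiplication, so a position is dropped when either mask holds 0 there.

```python
import math

NEG_INF = float("-inf")


def masked_softmax_row(scores, keep):
    """Softmax over one row of scores; positions with keep[j] == 0 get probability 0."""
    masked = [s if k != 0 else NEG_INF for s, k in zip(scores, keep)]
    row_max = max(masked)
    if row_max == NEG_INF:
        # Every position is masked: return zeros instead of dividing by zero.
        return [0.0] * len(scores)
    exps = [math.exp(s - row_max) for s in masked]  # exp(-inf) evaluates to 0.0
    total = sum(exps)
    return [e / total for e in exps]


def apply_masks(scores, key_padding_mask, attn_mask):
    """Masked softmax over dense attention scores (illustrative only).

    scores:           [batch_size][seq_len][seq_len]
    key_padding_mask: [batch_size][seq_len], 0 marks a padded key
    attn_mask:        [seq_len][seq_len], 0 marks a blocked (query, key) pair
    """
    out = []
    for b, batch_scores in enumerate(scores):
        rows = []
        for i, row in enumerate(batch_scores):
            # Assumption: a key is kept only if both masks are non-zero for it.
            keep = [key_padding_mask[b][j] * attn_mask[i][j] for j in range(len(row))]
            rows.append(masked_softmax_row(row, keep))
        out.append(rows)
    return out


# One batch, two positions; the second key is padding and is masked for every query.
scores = [[[0.0, 0.0], [0.0, 0.0]]]
key_padding_mask = [[1.0, 0.0]]
attn_mask = [[1.0, 1.0], [1.0, 1.0]]
probs = apply_masks(scores, key_padding_mask, attn_mask)
```

Because masking only changes which scores enter the softmax, the matrix products before and after it (sddmm and dsd in the commit's terms) need no changes in this sketch either.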
  4. 08 Dec, 2021 1 commit
    • W
      [Eager] generate eager core ops, only 4 ops (#37813) · 52f63cd2
      Committed by wanghuancoder
      * refine a test case, test=develop
      
      * publish python c api for eager, test=develop
      
      * revert modify about test_allclose_layer.py, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * delete numpy includes, use pybind11 numpy.h, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * support eager error msg, and add grad test case, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * generate eager core ops, only 4 ops, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
  5. 01 Dec, 2021 1 commit