1. 24 5月, 2021 1 次提交
  2. 22 5月, 2021 1 次提交
    • J
      Added oneDNN matmul grad BF16/FP32 kernel (#32968) · e2a3a6f7
      jakpiase 提交于
      * added support for most matmul cases
      
      * added more functionality
      
      * full functionality of matmul op, fp32 only
      
      * added bf16 tests and functionality
      
      * added formatting
      
      * changes after review
      
      * minor change
      
      * added reviewers suggestions
      e2a3a6f7
  3. 21 5月, 2021 3 次提交
  4. 20 5月, 2021 4 次提交
    • T
      fix gather op and add logsumexp op on kunlun (#32931) · a96e8bc9
      TTerror 提交于
      * fix gather op and add logsumexp op on kunlun
      
      * update xpu depence
      
      * update tests and fix elementwise_add
      a96e8bc9
    • B
      revert_matmulv2_npu (#33014) · be8e94aa
      Baibaifan 提交于
      be8e94aa
    • C
      Add complex template type (#32857) · 738bf20e
      chentianyu03 提交于
      * add complex template file
      
      * add numtraits for complex template
      
      * add complex template type register
      
      * modify specify template of complex
      
      * modify specify template of complex
      
      * modify specify template of complex
      
      * modify specify template of complex
      
      * make TensorCheckerVisitor support complex type
      
      * fix operator= error
      
      * add complex template
      
      * add complex template type
      
      * add complex template type to pyarray transform
      
      * add complex template type to pyarray transform
      
      * remove complex type for dlpack register
      
      * set dlpack supprot complex type
      
      * set dlpack supprot complex type
      
      * set dlpack supprot complex type
      
      * remove explict for complex constructor
      
      * add complex unit test file
      738bf20e
    • L
      14949521
  5. 19 5月, 2021 2 次提交
  6. 18 5月, 2021 4 次提交
  7. 14 5月, 2021 4 次提交
  8. 13 5月, 2021 4 次提交
  9. 12 5月, 2021 2 次提交
  10. 10 5月, 2021 3 次提交
  11. 08 5月, 2021 2 次提交
  12. 07 5月, 2021 3 次提交
  13. 06 5月, 2021 5 次提交
  14. 30 4月, 2021 2 次提交
    • W
      pylayer_op:release context after compute. (#32707) · 3cc11a3d
      WeiXin 提交于
      3cc11a3d
    • P
      Add 12 inplace APIs including auto generated (#32573) · 308073de
      pangyoki 提交于
      * add relu6_ hardsigmoid_ leaky_relu_ Inplace APIs
      
      * add softmax_with_cross_entropy_ Inplace API
      
      * add clip_ scale_ add_ subtract_ Inplace APIs
      
      * add wlist
      
      * fix parameter of scale api
      
      * add add_n_ Inplace API and remove log_ Inplace API
      
      * fix elementwise_add_ and elementwise_sub_ broadcast problem
      
      * elementwise inplace api give error message before run the op
      
      * use broadcast_shape in elementwise inplace op
      
      * add 8 inplace apis that is auto generated
      
      * add unittest for all inplace apis
      
      * add decorator for inplace apis in static mode
      
      * fix windows blas fail of exp inplace api, change array_equal to allclose
      
      * add flatten inplace api
      
      * add flatten unittest
      
      * fix flatten unittest
      
      * add decorator
      
      * fix grad.numpy in test_pylayer_op
      
      * unsupport softmax_with_cross_entropy_
      
      * add test_inplace_softmax_with_cross_entropy to static_mode_white_list
      
      * delete __all__ in inplace_utils
      
      * delete activation inplace function and add Tensor.inplace_func
      
      * change paddle.inplace_ to Tensor.inplace_
      
      * fix little problem
      
      * add paddle in inplace_utils
      308073de