1. 10 5月, 2021 2 次提交
  2. 08 5月, 2021 2 次提交
  3. 07 5月, 2021 3 次提交
  4. 06 5月, 2021 5 次提交
  5. 30 4月, 2021 5 次提交
    • W
      pylayer_op:release context after compute. (#32707) · 3cc11a3d
      WeiXin 提交于
      3cc11a3d
    • P
      Add 12 inplace APIs including auto generated (#32573) · 308073de
      pangyoki 提交于
      * add relu6_ hardsigmoid_ leaky_relu_ Inplace APIs
      
      * add softmax_with_cross_entropy_ Inplace API
      
      * add clip_ scale_ add_ subtract_ Inplace APIs
      
      * add wlist
      
      * fix parameter of scale api
      
      * add add_n_ Inplace API and remove log_ Inplace API
      
      * fix elementwise_add_ and elementwise_sub_ broadcast problem
      
      * elementwise inplace api give error message before run the op
      
      * use broadcast_shape in elementwise inplace op
      
      * add 8 inplace apis that is auto generated
      
      * add unittest for all inplace apis
      
      * add decorator for inplace apis in static mode
      
      * fix windows blas fail of exp inplace api, change array_equal to allclose
      
      * add flatten inplace api
      
      * add flatten unittest
      
      * fix flatten unittest
      
      * add decorator
      
      * fix grad.numpy in test_pylayer_op
      
      * unsupport softmax_with_cross_entropy_
      
      * add test_inplace_softmax_with_cross_entropy to static_mode_white_list
      
      * delete __all__ in inplace_utils
      
      * delete activation inplace function and add Tensor.inplace_func
      
      * change paddle.inplace_ to Tensor.inplace_
      
      * fix little problem
      
      * add paddle in inplace_utils
      308073de
    • C
      remove is_test=True in grad (#32678) · bd8d35a2
      ceci3 提交于
      bd8d35a2
    • B
      add_c_sync_npu_kernel (#32687) · 8fd724a5
      Baibaifan 提交于
      8fd724a5
    • J
      Reduce grad fix (#32592) · 43527a2b
      jakpiase 提交于
      43527a2b
  6. 29 4月, 2021 4 次提交
  7. 28 4月, 2021 5 次提交
    • L
      [NPU] add input EpsilonTensor for adam (#32605) · 119cda3d
      Leo Chen 提交于
      * add input EpsilonTensor for adam
      
      * update python api
      
      * add unit test
      
      * add npu test
      
      * add more ut
      119cda3d
    • A
      Added pure_bf16 mode (#32281) · bc379ca3
      arlesniak 提交于
      bc379ca3
    • K
      Fix some error message (#32614) · 9ee709fc
      Kqnonrime 提交于
      * fix two error message
      
      * fix two error message
      
      * fix error
      
      * fix error
      
      * fix error
      
      * fix error
      
      * fix some error message
      
      * fix some error
      
      * fix error
      
      * fix some error
      
      * fix some error
      
      * fix some error
      
      * fix one error
      
      * fix some error
      
      * fix seven error message
      
      * fix error
      
      * fix error
      
      * fix error
      
      * fix error
      
      * fix some error message
      
      * fix error
      
      * fix some error
      
      * fix some error
      9ee709fc
    • J
      [oneDNN] Added clearing oneDNN cache per executor (#32499) · ba610761
      Jacek Czaja 提交于
      * - Added clearing oneDNN per executor
      
      * - Executor is nt always having FLAGS_use_mkldnn set to true
      ba610761
    • J
      Optimize update_loss_scaling_op (#32554) · 0dc02dc7
      jiangcheng 提交于
      * optimize update_loss_scaling_op by fused for loop to one kernel, test=develop
      
      * remove useless while loop and optimize variable name, test=develop
      
      * optimize variable name from out_addrs_tensor to out_addrs_mem, test=develop
      
      * optimize variable name for readable by change prefix identifier from t_ to local_
      0dc02dc7
  8. 27 4月, 2021 5 次提交
  9. 26 4月, 2021 5 次提交
  10. 25 4月, 2021 4 次提交