1. 11 Oct, 2021 1 commit
  2. 09 Oct, 2021 5 commits
  3. 08 Oct, 2021 6 commits
  4. 07 Oct, 2021 1 commit
  5. 05 Oct, 2021 1 commit
    • J
      Added concat BF16/FP32 BWD OneDNN kernel (#35889) · dc4d5719
      jakpiase committed
      * tmp
      
      * added concat BF16/FP32 BWD oneDNN kernel
      
      * minor change
      
      * minor change
      
      * fix for CI
      
      * added formatting
      
      * Reverted deleting static keyword
      
      * added reviewers suggestions
      
      * reverted deleting concat bf16 test file
      
      * fixed concat tests
  6. 30 Sep, 2021 3 commits
  7. 29 Sep, 2021 14 commits
  8. 28 Sep, 2021 9 commits
    • L
      Add sparse_attention api, test=develop (#35676) · 6b587e93
      Liu-xiandong committed
      Add sparse_attention ops; the Python API will be added in the next PR
    • L
      add API paddle.linalg.eig (#35674) · bc7e2b92
      Lijunhui committed
      * Add paddle.linalg.eig op
      
      * remove comments
      
      * remove comments
      
      * extend batch_size to the origin
      
      * add real times complex functor & destroy the backward complex output bug
      
      * terminate output diff when input real tensors
      
      * correct tiny doc errors
      
      * move functions from eig_helper to svd_helper and remove eig_helper
      
      * remove tensor.Resize
      
      * remove no longer used code
      
      * use existing lapack functions
      
      * reply review comments 21/27
      
      * remove .cu as this op is only executed on CPU
      
      * remove const_cast & add const in argument list for read-only references
      
      * fix sample code error in CI
      
      * remove template typename Tbase and more
      
      * remove eig exposure in paddle.*
      
      * add 'name=None' in eig python implementation
      
      * handle the unittest
      
      * try to solve the unittest
      
      * solve CI coverage
      
      * remove no longer used code
      
      * polish API doc and more
      
      * reply review comments
      
      * polish unittest, commit plan B
      
      * polish unittest
    • R
      [ROCM] bugfix for arg_min_max (#36098) · 36791fdd
      ronnywang committed
    • T
      [HeterPs]ps gpu dump (#36157) · 97d30602
      Thunderbrook committed
      * ps gpu dump
      
      * remove log
    • X
      [hybrid] seed and dropout op support force-cpu (#35820) · 58c8f6b3
      xiayanming committed
      * [HIP] fix op not support AMD GPU bug, the flag PADDLE_WITH_ROCM is invalid
      
      * [HIP] fix op not support AMD GPU bug, the flag PADDLE_WITH_ROCM is invalid
      
      * [HIP] fix op not support AMD GPU bug
      
      * [hybrid] seed and dropout op support force-cpu
      
      * [hybrid] seed and dropout op support force-cpu
      
      * [hybrid] seed and dropout op support force-cpu
      
      * [hybrid] seed and dropout op support force-cpu
      
      * [hybrid] seed and dropout op support force-cpu
      
      * [hybrid] fix seed ci failed issue
      
      * add AsExtra for force_cpu of seed op
    • J
      【Bug fix】Fix dygraph double grad dtype error (#36125) · af4f018a
      Jiabin Yang committed
      * fix dygraph double grad dtype error in higher-order differentiation scenarios
      
      * reinvoke ci
      
      * add test for partial_engine.cc
    • L
      reduce calls to SizeOfType (#36110) · c719add7
      Leo Chen committed
    • Z
      rename scale loss grad (#36162) · ad128144
      Zeng Jinle committed