1. 29 9月, 2021 5 次提交
  2. 28 9月, 2021 12 次提交
    • L
      Add sparse_attention api, test=develop (#35676) · 6b587e93
      Liu-xiandong 提交于
      Add sparse_attention OPs, python api will be added in next pr
      6b587e93
    • L
      add API paddle.linalg.eig (#35674) · bc7e2b92
      Lijunhui 提交于
      * Add paddle.linalg.eig op
      
      * remove comments
      
      * remove comments
      
      * extend batch_size to the origin
      
      * add real times complex functor & destroy the backward complex output bug
      
      * terminate output diff when input real tensors
      
      * correct tiny doc errors
      
      * move functions from eig_helper to svd_helper and remove eig_helper
      
      * remove tensor.Resize
      
      * remove no longer used code
      
      * use existing lapack functions
      
      * reply review comments 21/27
      
      * remove .cu as this op is only executed on CPU
      
      * remove const_cast & add const in argument list for read-only references
      
      * fix sample code error in CI
      
      * remove template typename Tbase and more
      
      * remove eig exposure in paddle.*
      
      * add 'name=None' in eig python implementation
      
      * handle the unittest
      
      * try to solve the unittest
      
      * solve CI coverage
      
      * remove no longer used code
      
      * polish API doc and more
      
      * reply review comments
      
      * polish unittest, commit plan B
      
      * polish unittest
      bc7e2b92
    • R
      [ROCM] bugfix for arg_min_max (#36098) · 36791fdd
      ronnywang 提交于
      36791fdd
    • T
      [HeterPs]ps gpu dump (#36157) · 97d30602
      Thunderbrook 提交于
      * ps gpu dump
      
      * remove log
      97d30602
    • X
      [hybrid] seed and dropout op support force-cpu (#35820) · 58c8f6b3
      xiayanming 提交于
      * [HIP] fix op not support AMD GPU bug, the flag PADDLE_WITH_ROCM is invalid
      
      * [HIP] fix op not support AMD GPU bug, the flag PADDLE_WITH_ROCM is invalid
      
      * [HIP] fix op not support AMD GPU bug
      
      * [hybrid] seed and dropout op support force-cpu
      
      * [hybrid] seed and dropout op support force-cpu
      
      * [hybrid] seed and dropout op support force-cpu
      
      * [hybrid] seed and dropout op support force-cpu
      
      * [hybrid] seed and dropout op support force-cpu
      
      * [hybrid] fix seed ci failed issue
      
      * add AsExtra for force_cpu of seed op
      58c8f6b3
    • J
      【Bug fix】Fix dygraph double grad dtype error (#36125) · af4f018a
      Jiabin Yang 提交于
      * fix dygraph double grad dtype error when calling for high differential senario
      
      * reinvoke ci
      
      * add test for partial_engine.cc
      af4f018a
    • L
      reduce calls to SizeOfType (#36110) · c719add7
      Leo Chen 提交于
      c719add7
    • G
    • Z
      rename scale loss grad (#36162) · ad128144
      Zeng Jinle 提交于
      ad128144
    • Y
      Add paddle.device.cuda.get_device_properties (#35661) · 4cbed9e5
      Yanxing Shi 提交于
      * Initial Commit
      
      * add unittest and add error information
      
      * modify doc
      
      * fix some error
      
      * fix some word
      
      * fix bug cudaDeviceProp* and modify error explanation
      
      * fix cudaDeviceProp* error and unnitest samples
      
      * fix hip error and PADDLE_WITH_HIP
      
      * update style
      
      * fix error is_compiled_with_cuda
      
      * fix paddle.device.cuda.get_device_properties
      
      * fix error for multi thread safe
      
      * update style
      
      * merge conflict
      
      * modify after mentor review
      
      * update style
      
      * delete word
      
      * fix unittest error for windows
      
      * support string input and modify some code
      
      * modify doc to support string input
      
      * fix error for express information
      
      * fix error for express information
      
      * fix unnitest for windows
      
      * fix device.startswith('gpu:')
      
      * format error and doc
      
      * fix after review
      
      * format code
      
      * fix error for doc compile
      
      * fix error for doc compile
      
      * fix error for doc compile
      
      * fix error for doc compile
      
      * fix error for doc compile
      
      * fix py2 error
      
      * fix wrong words and doc
      
      * fix _gpuDeviceProperties
      4cbed9e5
    • H
      Add Basic CINN Runner Class (#35978) · 6f18b041
      Huihuang Zheng 提交于
      * Add Basic CINN Runner Class
      
      * Add CinnCacheKey
      
      * Add Cache logic and improve CinnCacheKey
      
      
      * Modify as reviewer commented
      
      * Implement hash_combine to fix MAC build.
      6f18b041
    • S
      dlpack fix (#35817) · 74ff59cf
      Siming Dai 提交于
      74ff59cf
  3. 27 9月, 2021 5 次提交
  4. 26 9月, 2021 10 次提交
  5. 25 9月, 2021 1 次提交
  6. 24 9月, 2021 7 次提交
    • J
      add gradient kernel of det op and slogdet op (#36013) · b91e8eec
      jiangcheng 提交于
      * add gradient kernel of det op and slogdet op
      
      * fix CI APPROVAL problem
      b91e8eec
    • P
      Added elementwise_sub_mkldnn operator (#35662) · 787273ed
      piotrekobiIntel 提交于
      * Add elementwise_sub_mkldnn_op without grad
      
      * Add test to static_mode_white_list
      
      * Refactor code, change license years
      
      * Remove invalid grad implementation
      
      * Fix element_wise_sub_op test
      
      * Fix CI Approval error
      
      * Remove unnecessary EltwiseSubMKLDNNGradKernel class
      
      * Fix CI Approval 2
      
      * Fix CI Approval 3
      
      * Fix CI Approval Attempt #4
      
      * Fix CI Approve Attempt #5
      
      * Fix CI Approval Attempt #6
      
      * Fix CI Approval Attemt #7
      
      * Change test names containing add to sub
      
      * Fix old tests testing add instead of sub
      
      * Copy grad implementation from elementwise_add_mkldnn
      
      * CI test fix attempt
      
      * Revert "CI test fix attempt"
      
      This reverts commit c647cacf41e6a87c715385a185de5cbf65fc8900.
      
      * Fix CI attempt 2
      
      * Fix elementwise_sub tests, temporary mkldnn broadcast test disable
      
      * Add working implementation of elementwise_sub grad
      
      * Fix build errors caused by pull
      
      * Fix format error
      
      * Fix format error 2
      
      * Disable elementwise_sub_mkldnn test on GPU
      
      * Apply fix for paddle.fluid import
      
      * Revert changes of test_elementwise_sub and Fix mkldnn test
      
      * Revert "Apply fix for paddle.fluid import"
      
      This reverts commit fc3b122fec8e12f2bcb32928a2685ba4d20fd742.
      
      * fix bug of module 'paddle' has no attribute 'fluid' for python3.6 (#35862)
      
      * Add changes suggested by reviewers
      
      * Change @unittest.skipIf... to @OpTestTool.skip_if_not_cpu_bf16() to satisfy Approval CI
      
      * Remove check_dygraph=False to satisify CI Approval
      Co-authored-by: Nzhangbo9674 <82555433+zhangbo9674@users.noreply.github.com>
      787273ed
    • J
      add pool2d convert test (#35923) · 82f255d0
      JingZhuangzhuang 提交于
      * add pool2d convert test
      
      * modify error
      
      * modify error
      
      * modify error
      
      * modify error
      
      * modify error
      
      * modify error
      82f255d0
    • J
      [oneDNN] candidate fix to #34554 (#35884) · 485b387d
      Jacek Czaja 提交于
      * - candidate fix
      
      * - More fixes to #34554
      
      * - another incosnstent fix to key
      
      * - Remvoed unneeded line
      
      * - matching the cache behaviour to other ops
      485b387d
    • W
      Add paddle.linalg.solve OP (#35715) · 8caf951c
      Weilong Wu 提交于
      * Add linalg.solve op, test=develop
      
      * Fix a bug caused by accidental deletion
      
      * updated description and fix a bug: missing a comma
      
      * Add linalg.solve op, test=develop
      
      * updated solve op backward logic
      
      * updated solve op backward logic again
      
      * Add linalg.solve Op, test=develop
      
      * Updated and modified to fit CI requirements
      
      * Fix a bug
      
      * 1)Add more test cases; 2)Fix a wrong usage in reduces operation; 3)Remove redundant code
      
      * Remove redundant comments
      
      * 1)Removed redundant code; 2)Updated to enhance code robustness
      
      * Removed redundant code
      
      * Updated API documents
      8caf951c
    • L
    • B
      add multihead_matmul trt converter test case (#36023) · fcaa64b3
      baoachun 提交于
      * add multihead_matmul trt converter test case
      
      * move attribute check to op_teller
      fcaa64b3