1. 30 9月, 2021 2 次提交
  2. 29 9月, 2021 10 次提交
  3. 28 9月, 2021 7 次提交
    • L
      Add sparse_attention api, test=develop (#35676) · 6b587e93
      Liu-xiandong 提交于
      Add sparse_attention OPs, python api will be added in next pr
      6b587e93
    • L
      add API paddle.linalg.eig (#35674) · bc7e2b92
      Lijunhui 提交于
      * Add paddle.linalg.eig op
      
      * remove comments
      
      * remove comments
      
      * extend batch_size to the origin
      
      * add real times complex functor & destroy the backward complex output bug
      
      * terminate output diff when input real tensors
      
      * correct tiny doc errors
      
      * move functions from eig_helper to svd_helper and remove eig_helper
      
      * remove tensor.Resize
      
      * remove no longer used code
      
      * use existing lapack functions
      
      * reply review comments 21/27
      
      * remove .cu as this op is only executed on CPU
      
      * remove const_cast & add const in argument list for read-only references
      
      * fix sample code error in CI
      
      * remove template typename Tbase and more
      
      * remove eig exposure in paddle.*
      
      * add 'name=None' in eig python implementation
      
      * handle the unittest
      
      * try to solve the unittest
      
      * solve CI coverage
      
      * remove no longer used code
      
      * polish API doc and more
      
      * reply review comments
      
      * polish unittest, commit plan B
      
      * polish unittest
      bc7e2b92
    • X
      [hybrid] seed and dropout op support force-cpu (#35820) · 58c8f6b3
      xiayanming 提交于
      * [HIP] fix op not support AMD GPU bug, the flag PADDLE_WITH_ROCM is invalid
      
      * [HIP] fix op not support AMD GPU bug, the flag PADDLE_WITH_ROCM is invalid
      
      * [HIP] fix op not support AMD GPU bug
      
      * [hybrid] seed and dropout op support force-cpu
      
      * [hybrid] seed and dropout op support force-cpu
      
      * [hybrid] seed and dropout op support force-cpu
      
      * [hybrid] seed and dropout op support force-cpu
      
      * [hybrid] seed and dropout op support force-cpu
      
      * [hybrid] fix seed ci failed issue
      
      * add AsExtra for force_cpu of seed op
      58c8f6b3
    • Z
      remove new linalg api in paddle.__init__ (#36151) · 3bb4715e
      zhiboniu 提交于
      remove recent linalg api in paddle.init;
      add args 'name' in some new linalg api interface
      same change in develop branch to #36112
      3bb4715e
    • J
      【Bug fix】Fix dygraph double grad dtype error (#36125) · af4f018a
      Jiabin Yang 提交于
      * fix dygraph double grad dtype error when calling for high differential senario
      
      * reinvoke ci
      
      * add test for partial_engine.cc
      af4f018a
    • W
      [hybrid] optimizer sharding support optimize cast (#35878) · eef0a943
      WangXi 提交于
      eef0a943
    • Y
      Add paddle.device.cuda.get_device_properties (#35661) · 4cbed9e5
      Yanxing Shi 提交于
      * Initial Commit
      
      * add unittest and add error information
      
      * modify doc
      
      * fix some error
      
      * fix some word
      
      * fix bug cudaDeviceProp* and modify error explanation
      
      * fix cudaDeviceProp* error and unnitest samples
      
      * fix hip error and PADDLE_WITH_HIP
      
      * update style
      
      * fix error is_compiled_with_cuda
      
      * fix paddle.device.cuda.get_device_properties
      
      * fix error for multi thread safe
      
      * update style
      
      * merge conflict
      
      * modify after mentor review
      
      * update style
      
      * delete word
      
      * fix unittest error for windows
      
      * support string input and modify some code
      
      * modify doc to support string input
      
      * fix error for express information
      
      * fix error for express information
      
      * fix unnitest for windows
      
      * fix device.startswith('gpu:')
      
      * format error and doc
      
      * fix after review
      
      * format code
      
      * fix error for doc compile
      
      * fix error for doc compile
      
      * fix error for doc compile
      
      * fix error for doc compile
      
      * fix error for doc compile
      
      * fix py2 error
      
      * fix wrong words and doc
      
      * fix _gpuDeviceProperties
      4cbed9e5
  4. 27 9月, 2021 2 次提交
    • J
      Added flatten and flatten2 BF16/FP32 FWD/BWD kernels (#35892) · e427a0f1
      jakpiase 提交于
      * refactored reshape multiop kernel and added flatten1/2 kernels
      
      * added formatting for flatten tests
      
      * CI fix
      
      * disabled reshape_kernel ops after succesful CI run
      
      * minor fix
      e427a0f1
    • L
      Add functional autograd API: jacobian (#35917) · ec2f68e8
      levi131 提交于
      * init functional jacobian api
      
      * finish test with dtype float32
      
      * add float64 test case
      
      * polish code
      
      * use atol=1e-5 with dtype float64
      
      * fix for ci
      
      * set timeout for test_jacobian
      
      * polish API docstring
      
      * modify docstring
      ec2f68e8
  5. 26 9月, 2021 3 次提交
  6. 24 9月, 2021 9 次提交
    • J
      add gradient kernel of det op and slogdet op (#36013) · b91e8eec
      jiangcheng 提交于
      * add gradient kernel of det op and slogdet op
      
      * fix CI APPROVAL problem
      b91e8eec
    • P
      Added elementwise_sub_mkldnn operator (#35662) · 787273ed
      piotrekobiIntel 提交于
      * Add elementwise_sub_mkldnn_op without grad
      
      * Add test to static_mode_white_list
      
      * Refactor code, change license years
      
      * Remove invalid grad implementation
      
      * Fix element_wise_sub_op test
      
      * Fix CI Approval error
      
      * Remove unnecessary EltwiseSubMKLDNNGradKernel class
      
      * Fix CI Approval 2
      
      * Fix CI Approval 3
      
      * Fix CI Approval Attempt #4
      
      * Fix CI Approve Attempt #5
      
      * Fix CI Approval Attempt #6
      
      * Fix CI Approval Attemt #7
      
      * Change test names containing add to sub
      
      * Fix old tests testing add instead of sub
      
      * Copy grad implementation from elementwise_add_mkldnn
      
      * CI test fix attempt
      
      * Revert "CI test fix attempt"
      
      This reverts commit c647cacf41e6a87c715385a185de5cbf65fc8900.
      
      * Fix CI attempt 2
      
      * Fix elementwise_sub tests, temporary mkldnn broadcast test disable
      
      * Add working implementation of elementwise_sub grad
      
      * Fix build errors caused by pull
      
      * Fix format error
      
      * Fix format error 2
      
      * Disable elementwise_sub_mkldnn test on GPU
      
      * Apply fix for paddle.fluid import
      
      * Revert changes of test_elementwise_sub and Fix mkldnn test
      
      * Revert "Apply fix for paddle.fluid import"
      
      This reverts commit fc3b122fec8e12f2bcb32928a2685ba4d20fd742.
      
      * fix bug of module 'paddle' has no attribute 'fluid' for python3.6 (#35862)
      
      * Add changes suggested by reviewers
      
      * Change @unittest.skipIf... to @OpTestTool.skip_if_not_cpu_bf16() to satisfy Approval CI
      
      * Remove check_dygraph=False to satisify CI Approval
      Co-authored-by: Nzhangbo9674 <82555433+zhangbo9674@users.noreply.github.com>
      787273ed
    • S
      add update (#36017) · 1691dc7a
      ShenLiang 提交于
      1691dc7a
    • J
      add pool2d convert test (#35923) · 82f255d0
      JingZhuangzhuang 提交于
      * add pool2d convert test
      
      * modify error
      
      * modify error
      
      * modify error
      
      * modify error
      
      * modify error
      
      * modify error
      82f255d0
    • K
      4f42e5d7
    • W
      Add paddle.linalg.solve OP (#35715) · 8caf951c
      Weilong Wu 提交于
      * Add linalg.solve op, test=develop
      
      * Fix a bug caused by accidental deletion
      
      * updated description and fix a bug: missing a comma
      
      * Add linalg.solve op, test=develop
      
      * updated solve op backward logic
      
      * updated solve op backward logic again
      
      * Add linalg.solve Op, test=develop
      
      * Updated and modified to fit CI requirements
      
      * Fix a bug
      
      * 1)Add more test cases; 2)Fix a wrong usage in reduces operation; 3)Remove redundant code
      
      * Remove redundant comments
      
      * 1)Removed redundant code; 2)Updated to enhance code robustness
      
      * Removed redundant code
      
      * Updated API documents
      8caf951c
    • B
      0bbaf9bd
    • B
      add multihead_matmul trt converter test case (#36023) · fcaa64b3
      baoachun 提交于
      * add multihead_matmul trt converter test case
      
      * move attribute check to op_teller
      fcaa64b3
    • W
      add the shape check for the matmul (#35791) · 8e19d1ba
      wawltor 提交于
      * add the shape check for the matmul
      
      * remove the test case for the linear
      8e19d1ba
  7. 23 9月, 2021 1 次提交
  8. 22 9月, 2021 6 次提交
    • Z
      [AMP]split minimize and add unscale_ for GradScaler (#35825) · bf6f0e54
      zhangbo9674 提交于
      * split minimize() to step() + update()
      
      * add unscale and step for grad_scaler
      
      * add unittest
      
      * refine code in minimize
      
      * delete step in loss_scaler
      
      * fix example bug
      
      * refine comment
      
      * refine unittest
      
      * add unittest
      bf6f0e54
    • R
      [NPU] add randperm_op_npu (#35763) · 4f0c3278
      ronnywang 提交于
      * add randperm_op_npu
      
      * fix test_set_value_op_npu
      4f0c3278
    • T
      op:transpose_op supports bool type (#35886) · 0c6ee945
      TeslaZhao 提交于
      * Pass compat of conv_transpose_bias_mkldnn_fuse_pass
      
      * Fix a bug of strided_slice op, about the axes parameter access memory out of bounds
      
      * Fix a bug of transpose op, about accessing memory out of bounds of the perm param
      
      * op:transpose_op supports bool type
      0c6ee945
    • H
      Det &Slogdet (#34992) · 9ce45ddd
      huangxu96 提交于
      Add new API : paddle.linalg.det & paddle.linalg.slogdet
      
      API Alias:paddle.det& paddle.slogdet
      9ce45ddd
    • J
      fix conv2d convert test (#35627) · 1238115e
      JingZhuangzhuang 提交于
      * support nnadapter and ascend310
      
      * modify code
      
      * add anchor_generator convert test
      
      * add gelu convert test
      
      * add conv2d convert test
      
      * modify anchor_operator convert test
      
      * modify conv2d test
      
      * modify con2d convert test
      
      * modify conv2d convert test
      
      * modify conv2d convert test
      
      * modify conv2d test
      
      * fix WITH_PYTHON compile error
      
      * modify test file
      
      * modify test file
      
      * modify test file
      
      * modify test file
      
      * modify test file
      
      * modify test file
      
      * modify test file
      
      * modify test file
      Co-authored-by: Nxiaoxiaohehe001 <hiteezsf@163.com>
      Co-authored-by: Njiweibo <jiweibo@baidu.com>
      1238115e
    • W
      fix feed for new executor (#35803) · 4c2a06df
      wanghuancoder 提交于
      * fix feed, test=develop
      
      * delete one test case, test=develop
      4c2a06df