1. 05 Jul, 2021 3 commits
  2. 04 Jul, 2021 1 commit
  3. 02 Jul, 2021 3 commits
  4. 01 Jul, 2021 5 commits
  5. 30 Jun, 2021 3 commits
    • J
      Added matmul_v2 BF16/FP32 FWD kernel (#33750) · 24783c84
      Committed by jakpiase
      * added matmul_v2 bf16/fp32 FWD kernel
      
      * added formatting
      
      * removed some tests due to timeout in CI
      
      * refactored tests
      
      * merged tests classes into one file
      
      * minor change
      
      * removed test guard for CUDA
      
      * remove skipIf
      
      * changes after review
      
      * formatted one file
      
      * minor change
      
      * added skipping UT in CUDA place
      24783c84
    • A
      [Dy2Stat] Refine PartialProgramLayer logic (#33796) · 97f86d84
      Committed by Aurelius84
      * refine temp_scope_vec logic
      
      * polish partial_program
      
      * fix fake var
      
      * add stop_gradient in spec
      
      * fix fake_var
      
      * fix unittest
      97f86d84
    • H
      [NPU] support set_device (#33815) · 8225a6a1
      Committed by houj04
      * support set_device for NPU.
      
      * minor update doc and add more unit test.
      8225a6a1
  6. 29 Jun, 2021 4 commits
  7. 28 Jun, 2021 3 commits
  8. 25 Jun, 2021 2 commits
  9. 24 Jun, 2021 8 commits
  10. 23 Jun, 2021 4 commits
  11. 22 Jun, 2021 4 commits
    • Z
      [API/OP]Add a new API paddle.diagonal (#33586) · ad106290
      Committed by zhangbo9674
      * new api diagonal, test=develop
      
      * add new api diagonal, test=develop
      
      * new api diagonal, test=develop
      
      * add new api paddle.diagonal, test=develop
      
      * use framework::stride replace ComputeDimStride
      
      * replace cudaMalloc/cudaMemcpy with TensorFromVector in cudaKernel and cudaGradKernel
      
      * improve function: when attr(offset) exceeds attr(axis1) or attr(axis2), set the diagonal dim to 0
      
      * fix RP-Mac-CI bug: replace framework::stride() with ComputeDimStride.
      
      * improve code-block
      
      * improve code of python API diagonal
      
      * api supports dtype of float16 and bool
      
      * api supports dtype of float16 and bool
      
      * modify unittest code
      
      * modify unittest code
      
      * improve dtype description
      
      * improve code-block
      ad106290
    • D
      adapt ps util for py3; test=develop (#33727) · e5a6bb1d
      Committed by danleifeng
      e5a6bb1d
    • Z
      8a5bbae6
    • R
      Gpu samplecode test On PR-CPU-Py2 (#33634) · dd4297cd
      Committed by Ren Wei (任卫)
      * using argparse to handle selections
      
      * 2 TODOs
      
      * do not change the pipeline configuration for now; force the GPU version here
      
      * sorted the all_names
      
      * exec gpu sample codes tests incrementally
      
      * get all apis from the pr.spec file
      
      * condition with WITH_GPU
      
      WITH_GPU == ON
      
      save
      
      * delete the useless codes
      
      * delete the useless codes.
      
      test=document_fix
      
      * echo the diff result
      
      test=document_fix
      
      * don't reuse the variables
      
      * renaming fun to _func does not work; put it into the skiplist
      
      https://github.com/PaddlePaddle/Paddle/commit/038ffc795025170e8cda74bcd473b46301b9a1c0
      test=document_fix
      
      * skip it in check api approvals
      
      test=document_fix
      
      save
      
      * skip the private _variables
      
      * print signatures wrong. now rename it to _func
      
      test=document_fix
      dd4297cd
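
The paddle.diagonal commit (#33586) above notes that when the offset exceeds the size of the chosen axes, the diagonal dimension is set to 0. The following is a minimal pure-Python sketch of that behaviour for a 2-D input only. It is not Paddle's kernel: the function name `diagonal_2d` and the list-of-lists input are assumptions made for illustration.

```python
def diagonal_2d(matrix, offset=0):
    """Return the diagonal of a 2-D list of lists, shifted by `offset`.

    Hypothetical helper for illustration; not part of Paddle.
    A positive offset selects a diagonal above the main one,
    a negative offset selects one below it.
    """
    rows = len(matrix)
    cols = len(matrix[0]) if rows else 0
    if offset >= 0:
        # Start at column `offset`; clamp the length at zero when the
        # offset runs past the last column.
        length = max(0, min(rows, cols - offset))
        return [matrix[i][i + offset] for i in range(length)]
    # Start at row `-offset`; clamp the length at zero when the
    # offset runs past the last row.
    length = max(0, min(rows + offset, cols))
    return [matrix[i - offset][i] for i in range(length)]


m = [[1, 2, 3],
     [4, 5, 6]]
main_diag = diagonal_2d(m)        # main diagonal
upper_diag = diagonal_2d(m, 1)    # one step above the main diagonal
lower_diag = diagonal_2d(m, -1)   # one step below the main diagonal
empty_diag = diagonal_2d(m, 3)    # offset past the last column
```

In this sketch an out-of-range offset yields an empty list, which mirrors the "diagonal dim is 0" case described in the commit message.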