1. 04 1月, 2022 4 次提交
  2. 31 12月, 2021 12 次提交
  3. 30 12月, 2021 11 次提交
    • Z
      add OP lu forward (#38559) · 4e21457d
      zhiboniu 提交于
      LGTM
      4e21457d
    • H
      add sigmoid_cross_entropy_with_logits to kl1 (#38586) · 790cadd1
      houj04 提交于
      * add sigmoid cross entropy with logits to kl1. test=kunlun
      
      * add sigmoid cross entropy with logits to kl1. test=kunlun
      790cadd1
    • Z
      Add exp, abs_grad, reciprocal, reciprocal_grad operator for XPU and update... · ceec1e21
      zhangyk0314 提交于
      Add exp, abs_grad, reciprocal, reciprocal_grad operator for XPU and update xpu2_op_list.h,test=kunlun (#38570)
      
      ceec1e21
    • J
      [New API] add new api paddle.mode and paddle.Tensor.mode (#38446) · 3777779b
      JYChen 提交于
      * add new OP mode
      
      * rename trans-variable name and fix UT
      3777779b
    • H
      Add cpu kernel of new api : lstsq (#38585) · ccf99b66
      Haohongxiang 提交于
      * add cpu kernel of lstsq
      
      * update
      
      * modify code style
      
      * modify unittest
      
      * remove support for complex
      ccf99b66
    • J
      Support test imperative basic with fixed retain grad interface (#38548) · 2421a25a
      Jiabin Yang 提交于
      * Rearranged Eager AutoCodeGen directory structure
      
      * Removed USE_OP in Eager AutoCodeGen
      
      * Enabled generation for Operators without Grad/Inputs/Outputs
      
      * Resolved operators without input
      
      * Fixed merge conflicts
      
      * Enabled Eager AutoCodeGen for 10+ more operators
      
      * Refactored Eager AutoCodeGen with more organized helper objects
      
      * Enabled Eager AutoCodeGen for operators with multiple OpBases
      
      * Adjusted Eager AutoCodeGen to Enable Passing Output Tensor as Input Argument
      
      * Handled Dispensable Inputs/Outputs in Eager AutoCodeGen
      
      * Adjusted function generation/call between Python-C API & Dygraph API
      
      * Synchronized auto-generated Python-C API with Dygraph Forward Functions
      
      * support more eager tensor api
      
      * fix merge compile error
      
      * fix compile error and fit develop code
      
      * support pure CPU
      
      * fix some logic error in eager_mode
      
      * support _varbase_creator in eager mode
      
      * Added safe_initialized interface to EagerTensor for use in processing dispensable inputs
      
      * for eager mode
      
      * refine
      
      * support multiple constructor for eager tensor
      
      * add place related code
      
      * polish code
      
      * specific randint with dtype of int64
      
      * Support pure cpu test
      
      * eager logic
      
      * refine test in pure cpu
      
      * eager logic
      
      * eager logic
      
      * eager logic, test=develop
      
      * skip core.eager when in inference, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * call RetainGrad after run forward kernel, test=develop
      
      * refine, test=develop
      
      * support dygraph util, meta, guard test
      
      * support inference test
      
      * refine test and fix initializer failed
      
      * support create varbase and fix retain grad error
      
      * fix windows error
      
      * support test_imperative_basic test in eager mode
      
      * remove additional log in variable.h
      
      * remove additional log in variable.h
      
      * remove additional code create in merge
      Co-authored-by: Njim19930609 <jim19930609@gmail.com>
      Co-authored-by: NWang Huan <wanghuan29@baidu.com>
      2421a25a
    • J
      Added Conv2D BF16 BWD oneDNN kernel (#38507) · ed8ba011
      jakpiase 提交于
      * working test for padding only
      
      * added full conv2d grad kernel
      
      * removed some trash
      
      * minor change
      
      * Ci fix
      
      * format fix
      ed8ba011
    • Z
      [PSCore]Fix test fleet base 2 (#38588) · 04496d89
      zmxdream 提交于
      04496d89
    • X
      add ExponentialFamily and Dirichlet probability distribution (#38445) · 00cddf07
      Xiaoxu Chen 提交于
      * extend Distribution baseclass for supporting multivariant distribution and prob method
      
      * add ExponentialFamily base class and entropy using Bregman divergence
      
      * add dirichlet probability distribution
      00cddf07
    • X
      add dirichlet random sample op in cpu and gpu kernel (#38244) · c5bf09bb
      Xiaoxu Chen 提交于
      * add dirichlet sample op and cpu backend kernel
      
      * add Dirichlet op cuda kernel  (#6)
      
      * add dirichlet op hip kernel
      Co-authored-by: NFeiyu Chan <chenfeiyu@baidu.com>
      c5bf09bb
    • L
      Fix the bug of batch_norm and batch_norm_grad op. (#38288) · cc83c95f
      Leo Guo 提交于
      * Fix the bug of batch_norm and batch_norm_grad op. Add the "roi_align" and "roi_align_grad" op in xpu2 op list.
      
      * Fix the bug of batch_norm and batch_norm_grad op. Add the "roi_align" and "roi_align_grad" op in xpu2 op list. test=kunlun
      Co-authored-by: NZibin <guozibin@baidu.com>
      cc83c95f
  4. 29 12月, 2021 7 次提交
  5. 28 12月, 2021 6 次提交
    • Z
      add new API: paddle.cov (#38392) · 85f5d264
      zhiboniu 提交于
      85f5d264
    • B
      update seq_concat_fc_fuse_pass ut (#38538) · 706d2c08
      baoachun 提交于
      706d2c08
    • F
      Utilize StreamSafeCUDAAllocator to support fast GC in new executor (#37642) · 0c7153a4
      From00 提交于
      * fix reshape move storage error
      
      * remove needless set type
      
      * alloc tensor by shared storage
      
      * Utilize StreamSafeCUDAAllocator to support fast GC in new executor
      
      * Fix compile error for Windows and ROCm
      
      * Fix compile error for Windows
      
      * Modify UT stream_safe_cuda_alloc_test
      
      * Modify UT stream_safe_cuda_alloc_test
      
      * Rewrite fast GC
      
      * Rewrite fast GC
      
      * Fix compile error for BOOST_GET_CONST
      
      * Fix compile error for BOOST_GET_CONST
      
      * Changes default stream for StreamSafeCUDAAllocator
      
      * Fix a small CI error
      
      * Remove some redundant code
      
      * Fix conflict
      
      * Fix compile error for ROCm
      
      * Fix Windoes CI error
      
      * Fix CI error
      
      * Remove some unnecessary code
      
      * Fix CI error
      
      * Add UT for fast GC
      
      * Fix CI error
      
      * add device-agnostic stream class
      
      * add stream.h
      
      * fix ut
      
      * fix cpu compile
      
      * Use RWLock in GetAllocator
      
      * Fix CI error
      Co-authored-by: NChen Weihang <chenweihang@baidu.com>
      Co-authored-by: Nzhiqiu <chenqiuliang@baidu.com>
      0c7153a4
    • H
      add matmul_to_mul matmul_v2_to_mul matmul_v2_to_matmul test case (#37645) · bed71992
      heliqi 提交于
      * add matmul_to_mul matmul_v2_to_mul matmul_v2_to_matmul test case
      
      * modify skip func to ignore_pass_case func
      
      * rebuild CI
      
      * rebuild CI
      
      * add test_map_xx_pass timeout
      
      * add test_map_xx_pass timeout
      
      * merge from develop
      
      * add timeout notest;test=coverage
      
      * Cmakelist add timeout
      
      * add timeout
      
      * add attr of matmul_v2
      
      * add trt skip
      
      * delete trt config
      
      * add skip,  mul diff on 3080
      bed71992
    • J
      Support test basic of Var and Layer (#38426) · 1fb80a6a
      Jiabin Yang 提交于
      * Rearranged Eager AutoCodeGen directory structure
      
      * Removed USE_OP in Eager AutoCodeGen
      
      * Enabled generation for Operators without Grad/Inputs/Outputs
      
      * Resolved operators without input
      
      * Fixed merge conflicts
      
      * Enabled Eager AutoCodeGen for 10+ more operators
      
      * Refactored Eager AutoCodeGen with more organized helper objects
      
      * Enabled Eager AutoCodeGen for operators with multiple OpBases
      
      * Adjusted Eager AutoCodeGen to Enable Passing Output Tensor as Input Argument
      
      * Handled Dispensable Inputs/Outputs in Eager AutoCodeGen
      
      * Adjusted function generation/call between Python-C API & Dygraph API
      
      * Synchronized auto-generated Python-C API with Dygraph Forward Functions
      
      * support more eager tensor api
      
      * fix merge compile error
      
      * fix compile error and fit develop code
      
      * support pure CPU
      
      * fix some logic error in eager_mode
      
      * support _varbase_creator in eager mode
      
      * Added safe_initialized interface to EagerTensor for use in processing dispensable inputs
      
      * for eager mode
      
      * refine
      
      * support multiple constructor for eager tensor
      
      * add place related code
      
      * polish code
      
      * specific randint with dtype of int64
      
      * Support pure cpu test
      
      * eager logic
      
      * refine test in pure cpu
      
      * eager logic
      
      * eager logic
      
      * eager logic, test=develop
      
      * skip core.eager when in inference, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * call RetainGrad after run forward kernel, test=develop
      
      * refine, test=develop
      
      * support dygraph util, meta, guard test
      
      * support inference test
      
      * refine test and fix initializer failed
      
      * support create varbase and fix retain grad error
      
      * fix windows error
      
      * support test code coverage
      
      * support test code coverage
      
      * support test code coverage
      Co-authored-by: Njim19930609 <jim19930609@gmail.com>
      Co-authored-by: NWang Huan <wanghuan29@baidu.com>
      1fb80a6a
    • W
      fix ci problem (#38474) · 2e4cb279
      Wilber 提交于
      2e4cb279