1. 24 Dec, 2021 5 commits
    • add new API/OP: paddle.poisson (#38117) · bcf86e5c
      zhouweiwei2014 committed
      * add new API/OP:paddle.poisson
      
      * fix comment
    • [Dy2stat]Fix error when calling sublayer's non-forward func in dy2stat (#37296) · 7339a124
      0x45f committed
      * fix error when calling sublayer's non-forward func in dy2stat
      
      * fix circular import using an inelegant way
      
      * deal with parameters
      
      * remove param_guard in __call__
      
      * remove comment
      
      * fix error when jit.load
      
      * rename block var
      
      * remove wrong code
      
      * add unit test
    • [Dy2Stat]Consider InputSpec.name to calculate Cachekey hash id (#38273) · 8e6d5d2b
      Aurelius84 committed
      * Consider InputSpec.name to calculate Cachekey hash id
      
      * fix function
    • add conv+hard_sigmoid and conv+hard_swish fuse pass ut (#37553) · a858326a
      baoachun committed
      * add conv+hard_sigmoid fuse pass ut
      
      * update conv_elementwise_add_mkldnn_fuse_pass ut
      
      * update conv_hard_sigmoid_mkldnn_fuse_pass ut
      
      * update conv+hard_sigmoid and conv+hard_swish fuse pass ut
      
      * update ut
      
      * update ut
    • Support test imperative basic in eager (#38313) · d48f7c89
      Jiabin Yang committed
      * Rearranged Eager AutoCodeGen directory structure
      
      * Removed USE_OP in Eager AutoCodeGen
      
      * Enabled generation for Operators without Grad/Inputs/Outputs
      
      * Resolved operators without input
      
      * Fixed merge conflicts
      
      * Enabled Eager AutoCodeGen for 10+ more operators
      
      * Refactored Eager AutoCodeGen with more organized helper objects
      
      * Enabled Eager AutoCodeGen for operators with multiple OpBases
      
      * Adjusted Eager AutoCodeGen to Enable Passing Output Tensor as Input Argument
      
      * Handled Dispensable Inputs/Outputs in Eager AutoCodeGen
      
      * Adjusted function generation/call between Python-C API & Dygraph API
      
      * Synchronized auto-generated Python-C API with Dygraph Forward Functions
      
      * support more eager tensor api
      
      * fix merge compile error
      
      * fix compile error and fit develop code
      
      * support pure CPU
      
      * fix some logic error in eager_mode
      
      * support _varbase_creator in eager mode
      
      * Added safe_initialized interface to EagerTensor for use in processing dispensable inputs
      
      * for eager mode
      
      * refine
      
      * support multiple constructor for eager tensor
      
      * add place related code
      
      * polish code
      
      * specific randint with dtype of int64
      
      * Support pure cpu test
      
      * eager logic
      
      * refine test in pure cpu
      
      * eager logic
      
      * eager logic
      
      * eager logic, test=develop
      
      * skip core.eager when in inference, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * call RetainGrad after run forward kernel, test=develop
      
      * refine, test=develop
      
      * support dygraph util, meta, guard test
      
      * support inference test
      
      * refine test and fix initializer failed
      Co-authored-by: jim19930609 <jim19930609@gmail.com>
      Co-authored-by: Wang Huan <wanghuan29@baidu.com>
  2. 23 Dec, 2021 10 commits
  3. 22 Dec, 2021 10 commits
  4. 21 Dec, 2021 11 commits
  5. 20 Dec, 2021 4 commits
    • add check pass conflict tools (#38276) · 0d12aa64
      sneaxiy committed
    • add mkldnn conv_transpose_bias fuse pass ut (#37508) · ac696941
      baoachun committed
      * add mkldnn conv_transpose_bias fuse pass ut
      
      * update conv_transpose_bias_mkldnn_fuse_pass ut
      
      * update conv_transpose_bias_mkldnn_fuse_pass ut
      
      * update conv_transpose_bias_mkldnn_fuse_pass ut
      
      * restrict conv2d data_format in conv_transpose_bias_mkldnn_fuse_pass
      
      * update ut timeout setting
      
      * update ut
    • [pten]add pten conj kernel (#38247) · a2793e5e
      chentianyu03 committed
      * add pten conj kernel
      
      * modify conj_kernel file path
      
      * add defined cuda macro to cuda/conj_kernel.h
    • Support FP16 for more ops (#38123) · 1f445bf3
      sneaxiy committed
      * support FP16 for more ops
      
      * add amp list tests
      
      * refine reduce_mean_grad
      
      * fix OP benchmark ci
      
      * fix fp16 reduce_mean
      
      * update ut, but still have some problems
      
      * remove mean/reduce_mean fp16 kernel