1. 11 1月, 2022 1 次提交
  2. 10 1月, 2022 3 次提交
    • C
      move get expected kernel args into pten (#38825) · 3a23c1a2
      Chen Weihang 提交于
      3a23c1a2
    • Z
      [Unify Tensors PR #5] framework::Tensor inherits from DenseTensor,test=allcases (#38632) · 5c73a6ea
      Zhanlue Yang 提交于
      * Added shared_ptr<Allocation> member & corresponding interfaces to Storage
      
      * Removed original pten::Allocation from Storage and adjusted the interfaces accordingly
      
      * Fixed issues with storage offset
      
      * Used place to malloc allocation for TensorStorage
      
      * [Unify Tensors PR #3]Ported framework::Tensor interfaces to pten::DenseTensor
      
      * Fixed issues with place
      
      * Added comments
      
      * Moved mutable_data with stream argument to DenseTensor
      
      * Added set_offset interface
      
      * Fixed CI issues,test=allcases
      
      * [Unify Tensors PR #4] Port LoDTensor interfaces to DenseTensor
      
      * Removed friend class EigenTensor/EigenMatrix/EigenVector from Tensor
      
      * Modified framework::Tensor to inherit from DenseTensor
      
      * Reverted changes too pten_layout() interface
      
      * Removed friend classes
      
      * Rearranged cfunction calls from tensor.data<void>() to tensor.data()
      
      * Fixed CI issues
      
      * Fixed lite issues
      
      * Fixed data() interface issues,test=allcases
      
      * Resolved IsInitialized() issues
      
      * Fixed ResetHolder() issues
      
      * Fixed MKLDNN & Storage issues
      
      * Resolved ShareBufferWith() issues
      
      * Fixed LoD issues
      5c73a6ea
    • C
      Support setting infershape function for custom grad op (#38776) · 046553c7
      Chen Weihang 提交于
      * unify infer_shape func calling
      
      * support set grad infer shape fn for custom op
      
      * unify infershape in new executor and eager
      
      * remove todo comment
      
      * revert infershape in operator
      046553c7
  3. 07 1月, 2022 2 次提交
  4. 06 1月, 2022 5 次提交
  5. 05 1月, 2022 2 次提交
  6. 04 1月, 2022 4 次提交
  7. 31 12月, 2021 3 次提交
  8. 30 12月, 2021 2 次提交
  9. 29 12月, 2021 3 次提交
  10. 28 12月, 2021 5 次提交
    • L
      Support multi-output feature for elementwise (#38410) · 48f061fb
      limingshu 提交于
      * first commit
      
      * pass ctest of  elementwise_div_grad
      48f061fb
    • J
      Support test basic of Var and Layer (#38426) · 1fb80a6a
      Jiabin Yang 提交于
      * Rearranged Eager AutoCodeGen directory structure
      
      * Removed USE_OP in Eager AutoCodeGen
      
      * Enabled generation for Operators without Grad/Inputs/Outputs
      
      * Resolved operators without input
      
      * Fixed merge conflicts
      
      * Enabled Eager AutoCodeGen for 10+ more operators
      
      * Refactored Eager AutoCodeGen with more organized helper objects
      
      * Enabled Eager AutoCodeGen for operators with multiple OpBases
      
      * Adjusted Eager AutoCodeGen to Enable Passing Output Tensor as Input Argument
      
      * Handled Dispensable Inputs/Outputs in Eager AutoCodeGen
      
      * Adjusted function generation/call between Python-C API & Dygraph API
      
      * Synchronized auto-generated Python-C API with Dygraph Forward Functions
      
      * support more eager tensor api
      
      * fix merge compile error
      
      * fix compile error and fit develop code
      
      * support pure CPU
      
      * fix some logic error in eager_mode
      
      * support _varbase_creator in eager mode
      
      * Added safe_initialized interface to EagerTensor for use in processing dispensable inputs
      
      * for eager mode
      
      * refine
      
      * support multiple constructor for eager tensor
      
      * add place related code
      
      * polish code
      
      * specific randint with dtype of int64
      
      * Support pure cpu test
      
      * eager logic
      
      * refine test in pure cpu
      
      * eager logic
      
      * eager logic
      
      * eager logic, test=develop
      
      * skip core.eager when in inference, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * call RetainGrad after run forward kernel, test=develop
      
      * refine, test=develop
      
      * support dygraph util, meta, guard test
      
      * support inference test
      
      * refine test and fix initializer failed
      
      * support create varbase and fix retain grad error
      
      * fix windows error
      
      * support test code coverage
      
      * support test code coverage
      
      * support test code coverage
      Co-authored-by: Njim19930609 <jim19930609@gmail.com>
      Co-authored-by: NWang Huan <wanghuan29@baidu.com>
      1fb80a6a
    • Z
      refactor matmul directory in pten (#38227) · 982bf444
      zyfncg 提交于
      * refactor matmul directory in pten
      
      * fix merge conflict
      982bf444
    • C
      [pten] remove in_type arg in cast kernel (#38486) · 0637b9a6
      chentianyu03 提交于
      * remove intype arg in cast kernel
      
      * modify conj config in api.yaml by dictionary order
      
      * rm unused code in cast_kernel.cu
      0637b9a6
    • Z
      Fixed issue with offset,test=allcases (#38506) · dc30ad1d
      Zhanlue Yang 提交于
      dc30ad1d
  11. 27 12月, 2021 4 次提交
  12. 26 12月, 2021 2 次提交
  13. 24 12月, 2021 4 次提交
    • C
      add is dense tensor method (#38424) · 6ff3596e
      Chen Weihang 提交于
      6ff3596e
    • C
      [pten] combine reduce_cuda codes (#38328) · 08941eda
      chentianyu03 提交于
      * combine reduce_cuda codes
      
      * support float16 in pten redcue_mean
      
      * replace ReduceCudaKernel impl with pten reduce impl
      
      * mv reduce funcs into reduce_cuda_impl
      
      * rm unsed codes and headers
      
      * mv GetReduceDim into reduce_cuda_impl
      
      * recover GetReduceDim in reduce_op.h
      
      * add new dispatch macro
      
      * fix pool op output not inited and cause transform to pten::denseTensor error
      
      * fix output tensor not initialized error
      
      * rename new dispatch macro and format code style
      
      * rm reduce_functor_op.h file
      08941eda
    • Z
      [Unify Tensors PR #1] Replaced pten::Allocation with... · 42cf2bee
      Zhanlue Yang 提交于
      [Unify Tensors PR #1] Replaced pten::Allocation with shared_ptr<memory::Allocation> for Storage (#38301)
      
      * Added shared_ptr<Allocation> member & corresponding interfaces to Storage
      
      * Removed original pten::Allocation from Storage and adjusted the interfaces accordingly
      
      * Fixed issues with storage offset
      
      * Used place to malloc allocation for TensorStorage
      42cf2bee
    • C
      add register general kernel marco (#38409) · fc0a50aa
      Chen Weihang 提交于
      fc0a50aa