1. 11 1月, 2022 13 次提交
    • Z
      【PTen】Add dot and matmul grad kernel in pten (#38713) · be817719
      zyfncg 提交于
      * refactor matmul directory in pten
      
      * fix merge conflict
      
      * add dot_grad kernel
      
      * add dot_grad kernel in pten
      
      * add matmul_grad kernel
      
      * update the code
      
      * delete useless code in fluid
      
      * fix some bug of running matmul grad kernel
      
      * fix merge conflict
      
      * refactor some code
      
      * refactor code
      be817719
    • S
      oepn third_party cache in wincheck_inference (#38877) · 5b940c44
      Sing_chan 提交于
      5b940c44
    • Z
      Fix bug in elementwise_mul/div_grad when inplace strategy (#38840) · 7915d180
      Zhang Zheng 提交于
      * fix bug when inplace strategy
      
      * fix
      
      * fix
      
      * fix
      
      * fix
      
      * fix
      7915d180
    • N
      3eaf8d2c
    • W
      [PTEN] Add pten::Place data structure. (#38844) · 2bed9b9c
      Wilber 提交于
      * add pten::Place data structure.
      
      * update ci problem
      
      * fix ci problem
      
      * update
      2bed9b9c
    • W
      29c211ee
    • C
      【Auto Parallel】New local tensor (#38747) · d3ba1895
      caozhou 提交于
      * update dist tensor
      
      * add unitest
      
      * update unitest
      
      * refactor dist tensor
      
      * update dist tensor and unitest
      d3ba1895
    • Z
      [AMP] Check call order of paddle.amp.decorate and paddle.DataParallel (#38785) · fbb40281
      zhangbo9674 提交于
      * check amp.decorate and DataParallel
      
      * refine coverage
      
      * fix layer dtype
      
      * refine code
      fbb40281
    • L
      Remove useless headers for some grad ops (#38823) · 9f34a070
      limingshu 提交于
      * fix the wrong filename
      
      * first commit
      
      * first commit
      
      * remove rest useless headers
      
      * for ci approval
      9f34a070
    • S
      support vs2019 compilation in windows (#38719) · 0ad363b1
      Sing_chan 提交于
      * support vs2019 compilation in windows
      
      * not modify pow_op's original compute logic
      0ad363b1
    • M
      Jit pre save hook (#38186) · e91f7c02
      Ming-Xu Huang 提交于
      * Pre-save hooks of jit.save
      
      1. Added pre_save_hooks features to jit.save.
      2. Added related unittests
      
      * Added jit pre_save_hooks functions's alias to paddle.jit and copyright.
      
      * Make jit.save_pre_hook style be consisent with Paddle's rule.
      
      * Fixed arguments passing bug in run_save_pre_hooks
      
      * Added API Documents
      
      * Move clear and run_pre_save_hooks as internal methonds only.
      
      * Made register_save_pre_hook as an internal function.
      e91f7c02
    • W
      [Eager] fix some eager logic (#38576) · d3686471
      wanghuancoder 提交于
      * Rearranged Eager AutoCodeGen directory structure
      
      * Removed USE_OP in Eager AutoCodeGen
      
      * Enabled generation for Operators without Grad/Inputs/Outputs
      
      * Resolved operators without input
      
      * Fixed merge conflicts
      
      * Enabled Eager AutoCodeGen for 10+ more operators
      
      * Refactored Eager AutoCodeGen with more organized helper objects
      
      * Enabled Eager AutoCodeGen for operators with multiple OpBases
      
      * Adjusted Eager AutoCodeGen to Enable Passing Output Tensor as Input Argument
      
      * Handled Dispensable Inputs/Outputs in Eager AutoCodeGen
      
      * Adjusted function generation/call between Python-C API & Dygraph API
      
      * Synchronized auto-generated Python-C API with Dygraph Forward Functions
      
      * support more eager tensor api
      
      * fix merge compile error
      
      * fix compile error and fit develop code
      
      * support pure CPU
      
      * fix some logic error in eager_mode
      
      * support _varbase_creator in eager mode
      
      * Added safe_initialized interface to EagerTensor for use in processing dispensable inputs
      
      * for eager mode
      
      * refine
      
      * support multiple constructor for eager tensor
      
      * add place related code
      
      * polish code
      
      * specific randint with dtype of int64
      
      * Support pure cpu test
      
      * eager logic
      
      * refine test in pure cpu
      
      * eager logic
      
      * eager logic
      
      * eager logic, test=develop
      
      * skip core.eager when in inference, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * call RetainGrad after run forward kernel, test=develop
      
      * refine, test=develop
      
      * support dygraph util, meta, guard test
      
      * eager test case
      
      * support inference test
      
      * refine test and fix initializer failed
      
      * modify eagertensor patch method
      
      * add eagertensor.clear_grandint, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * support create varbase and fix retain grad error
      
      * call monkey_patch_varbase in _test_eager_guard, test=develop
      
      * fix windows error
      
      * split clear_gradient to clear_gradient and zero_grads, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * support test_imperative_basic test in eager mode
      
      * remove additional log in variable.h
      
      * remove additional log in variable.h
      
      * remove additional code create in merge
      
      * eager
      
      * fix some eager logic, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      Co-authored-by: Njim19930609 <jim19930609@gmail.com>
      Co-authored-by: NJiabinYang <360788950@qq.com>
      d3686471
    • F
      roi_align fix (#38788) · ffbc2122
      fengkuangxiaxia 提交于
      ffbc2122
  2. 10 1月, 2022 27 次提交