1. 10 1月, 2022 2 次提交
  2. 07 1月, 2022 10 次提交
  3. 06 1月, 2022 18 次提交
  4. 05 1月, 2022 10 次提交
    • L
      optimize elementwise_mul_grad using new interfaces (#37728) · 36a102f8
      Lijunhui 提交于
      * init commit: new elem_mul_grad
      
      * add template speciallization for complex in multiply
      
      * reply review comments
      
      * correct dx and dy computation when T is complex
      
      * reply review comments
      
      * update to new ReduceRunctor
      
      * mul-output broadcast
      
      * call functions
      
      * call functions with comments
      
      * remove comments
      36a102f8
    • F
      Fix bug for UT GetAllocatorInterfaceTest (#38720) · 905c8022
      From00 提交于
      * Fix bug of GetAllocatorInterfaceTest
      
      * Replace some shared_ptr with unique_ptr
      
      * Change Alloc call
      905c8022
    • J
      Make post training quant API support dataloader (#38686) · 0af1a87b
      Jiaqi Liu 提交于
      * make post training quant API support dataloader
      0af1a87b
    • J
      60c51de5
    • Q
      [XPU] update XPU run check scripts, test=develop (#38698) · bbe83ed1
      Qi Li 提交于
      bbe83ed1
    • T
      update masked_select_op for kunlun (#38678) · 40078103
      TTerror 提交于
      40078103
    • W
      [Eager] Support test imperative basic in eager test_empty_grad (#38376) · 9108e777
      wanghuancoder 提交于
      * Rearranged Eager AutoCodeGen directory structure
      
      * Removed USE_OP in Eager AutoCodeGen
      
      * Enabled generation for Operators without Grad/Inputs/Outputs
      
      * Resolved operators without input
      
      * Fixed merge conflicts
      
      * Enabled Eager AutoCodeGen for 10+ more operators
      
      * Refactored Eager AutoCodeGen with more organized helper objects
      
      * Enabled Eager AutoCodeGen for operators with multiple OpBases
      
      * Adjusted Eager AutoCodeGen to Enable Passing Output Tensor as Input Argument
      
      * Handled Dispensable Inputs/Outputs in Eager AutoCodeGen
      
      * Adjusted function generation/call between Python-C API & Dygraph API
      
      * Synchronized auto-generated Python-C API with Dygraph Forward Functions
      
      * support more eager tensor api
      
      * fix merge compile error
      
      * fix compile error and fit develop code
      
      * support pure CPU
      
      * fix some logic error in eager_mode
      
      * support _varbase_creator in eager mode
      
      * Added safe_initialized interface to EagerTensor for use in processing dispensable inputs
      
      * for eager mode
      
      * refine
      
      * support multiple constructor for eager tensor
      
      * add place related code
      
      * polish code
      
      * specific randint with dtype of int64
      
      * Support pure cpu test
      
      * eager logic
      
      * refine test in pure cpu
      
      * eager logic
      
      * eager logic
      
      * eager logic, test=develop
      
      * skip core.eager when in inference, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * call RetainGrad after run forward kernel, test=develop
      
      * refine, test=develop
      
      * support dygraph util, meta, guard test
      
      * eager test case
      
      * support inference test
      
      * refine test and fix initializer failed
      
      * modify eagertensor patch method
      
      * add eagertensor.clear_grandint, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * call monkey_patch_varbase in _test_eager_guard, test=develop
      
      * split clear_gradient to clear_gradient and zero_grads, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      Co-authored-by: Njim19930609 <jim19930609@gmail.com>
      Co-authored-by: NJiabinYang <360788950@qq.com>
      9108e777
    • W
      add depthwise_conv2d op for mkldnn (#38484) · e1cc2236
      wangxinxin08 提交于
      e1cc2236
    • C
      [pten]Move reduce code new (#38648) · 7a4a512d
      chentianyu03 提交于
      * change 'math' to 'math_kernel'
      
      * fix compile bugs
      
      * merge develop
      
      * fix compile bugs
      
      * fix compile bugs
      
      * move reduce files by new rule
      
      * add set header
      
      * format code style
      
      * merge develop and fix conflict
      
      * merge develop and fix conflict
      Co-authored-by: NYuanRisheng <yuanrisheng@baidu.com>
      7a4a512d
    • W
      add the examples for the mm (#38669) · c90a652d
      wawltor 提交于
      * add the examples for the mm
      
      * fix the document of paddle.mm
      c90a652d