1. 25 Jan, 2022 16 commits
  2. 24 Jan, 2022 10 commits
    • fix test allreduce tests (#39166) · c00303ec
      sneaxiy committed
    • [pten] add Scale xpu kernel (#39092) · 7874d0a5
      chentianyu03 committed
      * add scale xpu kernel
      
      * add scale xpu kernel
      
      * add scale xpu kernel
      
      * replace with pten scale kernel
      
      * change dev_ctx
      
      * modify float16 head path
      
      * remove unused xpu header
    • [Pten] Refactor elementwise_add grad / double grad / triple grad Kernel and move them to pten (#39048) · 3bf3a6ee
      YuanRisheng committed
      * refactor elementwise add grad
      
      * fix compile bugs
      
      * fix unit test bugs
      
      * fix file conflicts
      
      * fix bugs when buildPtenContext
    • Removed redundant definitions of likely/unlikely (#38911) · 43919d0a
      Jacek Czaja committed
      * - more unlikely
      
      * - compilation fix
      
      * - removed redundant definition
      
      * - fix
      
      * - Fixes
      
      * - compilation fix for windows
    • [Pten] Migration of eigen numeric extensions and functors in paddle/fluid/operators/eigen (#39124) · a1e40dc6
      Feiyu Chan committed
      * migration of functors in paddle/fluid/operators/eigen and paddle/fluid/platform/eigen_ext.h
      * update path of data types like float16.h in includes in extensions.h
    • unify compare functor (#39024) · def81b4f
      Zhang Ting committed
    • [PTEN] Move dynload from fluid to pten. (#39120) · 3c1dc6f6
      Wilber committed
      * move dynload from fluid to pten.
      
      * fix ci compile
      
      * fix windows ci compile.
      
      * update
      
      * update
      
      * fix compile error
    • [Refactoring Tensor PR #5] replace storage with pten allocation (#39085) · a56e16a7
      石晓伟 committed
      * updates callers, test=develop
      
      * updates tensor, test=develop
      
      * fixes errors, test=develop
      
      * remove some dtypes, test=develop
      
      * fix errors in the base storage modification, test=develop
      
      * fixes a bug, test=develop
      
      * fixes the bugs in push the whole, test=develop
      
      * updates, test=develop
      
      * update
      
      * update, test=develop
      
      * fixes the mac-py3 CI, test=develop
      
      * remove the storage impl, test=develop
      
      * updates some codes, test=develop
      
      * update, test=develop
      
      * updates pten allocation, test=develop
    • support sparse of adam, *test=kunlun (#38483) · e106901e
      z8hanghuan committed
      * support sparse of adam, *test=kunlun
      
      * add pre-commit-config.yaml
      
      * support sparse of adam in KL2,*test=kunlun
      
      * support sparse of adam in KL2, *test=kunlun
      
      * modify xpu.cmake, *test=kunlun
      
      * support sparse of adam, rm some wait, *test=kunlun
      
      * support sparse of adam, rm some wait, *test=kunlun
      
      * support sparse of adam, *test=kunlun
      
      * support sparse of adam, *test=kunlun
      
      * support sparse of adam, *test=kunlun
      
      * support sparse of adam, *test=kunlun
      
      * support sparse of adam, *test=kunlun
    • Refactored python-level trace_op to call through _C_ops instead of Tracer::TraceOp, under eager_mode (#38338) · c3796061
      Zhanlue Yang committed
      * Replaced core.ops with _C_ops
      
      * Refactored python-level trace_op to call through _C_ops instead of Tracer::TraceOp, under eager_mode
      
      * Modified trace_op interface
      
      * Refactored trace_op logic for eager mode
      
      * Added Eager Dygraph support for OpTest
      
      * Fixed ci issues
      
      * Fixed CI failures
      
      * Fixed Coverage CI Issues
      
      * Fixed XPU CI Issues
  3. 23 Jan, 2022 1 commit
    • Support test_imperative apply and Add a setter for EagerTensor (#39016) · 8c5c1046
      Weilong Wu committed
      * Rearranged Eager AutoCodeGen directory structure
      
      * Removed USE_OP in Eager AutoCodeGen
      
      * Enabled generation for Operators without Grad/Inputs/Outputs
      
      * Resolved operators without input
      
      * Fixed merge conflicts
      
      * Enabled Eager AutoCodeGen for 10+ more operators
      
      * Refactored Eager AutoCodeGen with more organized helper objects
      
      * Enabled Eager AutoCodeGen for operators with multiple OpBases
      
      * Adjusted Eager AutoCodeGen to Enable Passing Output Tensor as Input Argument
      
      * Handled Dispensable Inputs/Outputs in Eager AutoCodeGen
      
      * Adjusted function generation/call between Python-C API & Dygraph API
      
      * Synchronized auto-generated Python-C API with Dygraph Forward Functions
      
      * support more eager tensor api
      
      * fix merge compile error
      
      * fix compile error and fit develop code
      
      * support pure CPU
      
      * fix some logic error in eager_mode
      
      * support _varbase_creator in eager mode
      
      * Added safe_initialized interface to EagerTensor for use in processing dispensable inputs
      
      * for eager mode
      
      * refine
      
      * support multiple constructor for eager tensor
      
      * add place related code
      
      * polish code
      
      * specific randint with dtype of int64
      
      * Support pure cpu test
      
      * eager logic
      
      * refine test in pure cpu
      
      * eager logic
      
      * eager logic
      
      * eager logic, test=develop
      
      * skip core.eager when in inference, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * call RetainGrad after run forward kernel, test=develop
      
      * refine, test=develop
      
      * support dygraph util, meta, guard test
      
      * eager test case
      
      * support inference test
      
      * refine test and fix initializer failed
      
      * modify eagertensor patch method
      
      * add eagertensor.clear_gradient, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * support create varbase and fix retain grad error
      
      * call monkey_patch_varbase in _test_eager_guard, test=develop
      
      * fix windows error
      
      * split clear_gradient to clear_gradient and zero_grads, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * support test_imperative_basic test in eager mode
      
      * remove additional log in variable.h
      
      * remove additional log in variable.h
      
      * remove additional code create in merge
      
      * eager
      
      * fix some eager logic, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * patch_tensor_method_func, test=develop
      
      * refine, test=develop
      
      * eager test case, test=develop
      
      * refine, test=develop
      
      * eager, test=develop
      
      * eager, test=develop
      
      * eager optimizer, test=develop
      
      * eager optimizer, test=develop
      
      * eager test_imperative_optimizer_v2, test=develop
      
      * eager, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * eager, test=develop
      
      * add resize in share buffer to, test=develop
      
      * eager, test=develop
      
      * fix _share_buffer_to, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * support eager for dataloader,test=develop
      
      * Exposed EagerTensor's set func to implement set_value func
      
      * Rename set to _set_value, Supplement the corresponding test case
      
      * fix test concat dev api build failed
      
      * fix conflict
      
      * fix conflict
      
      * Use extern to Polish code
      Co-authored-by: jim19930609 <jim19930609@gmail.com>
      Co-authored-by: JiabinYang <360788950@qq.com>
      Co-authored-by: Wang Huan <wanghuan29@baidu.com>
      Co-authored-by: wanghuancoder <wanghuancoder@163.com>
      Co-authored-by: chentianyu03 <chentianyu03@baidu.com>
  4. 22 Jan, 2022 3 commits
  5. 21 Jan, 2022 10 commits