1. 08 8月, 2023 7 次提交
    • H
      [NewIR]Polish new ir interpreter core cache (#56035) · 73c70654
      hong 提交于
      * update
      
      * update cache
      
      * fix compile error
      73c70654
    • H
      New ir support legacy kernel instruction (#55880) · f9c2f4ca
      hong 提交于
      * new ir remove fetch list
      
      * fix pattern rewrite bug
      
      * try to remove constant fold
      
      * revert code
      
      * add pattern rewrite test flag
      
      * fix multi fetch
      
      * remove usless code
      
      * new ir support legacy kernel instraction
      
      * new ir support legacy kernel instruction
      
      * add scope prefix
      
      * update
      
      * update
      
      * update
      
      * update
      
      * fix
      
      * revert channel shuffl test
      
      * polish code
      
      * try to fix windows compile error
      
      * polish code
      
      * update
      
      * update
      
      * revert op test
      f9c2f4ca
    • R
      [CustomDeivce] fix grad accumulation (#56052) · 393db4a9
      ronnywang 提交于
      393db4a9
    • J
      Provide opoperands(), opresults() methods for the Operation module (#55903) · a5ba0b65
      jjyaoao 提交于
      * Provide opoperands(), opresults() methods for the Operation module
      Signed-off-by: Njjyaoao <jjyaoao@126.com>
      
      * Update test_ir_pybind.py
      
      ---------
      Signed-off-by: Njjyaoao <jjyaoao@126.com>
      a5ba0b65
    • H
      [NewIR]fix new ir sgd op bug (#55982) · d05ec503
      hong 提交于
      * fix new ir sgd op bug
      
      * fix bug
      
      * fix bug
      
      * update
      
      * revert code
      d05ec503
    • C
      [PRIM][IR] Migrate vjp rules to new ir in non primitive mode (#55647) · 501a51fc
      Charles-hit 提交于
      * [prim][newir] add basic framework for primitive
      
      * support desctensor in new ir
      
      * add vjp interface
      
      * support vjp in new ir
      
      * support vjp in new ir
      
      * polish vjp interface
      
      * fix stop_gradients set
      
      * fix vjp dispatch
      
      * add comment
      
      * add vjp test for new ir
      
      * add test for tanh vjp
      
      * [prim][newir] add basic framework for primitive
      
      * support desctensor in new ir
      
      * support vjp in new ir
      
      * support vjp in new ir
      
      * polish vjp interface
      
      * fix stop_gradients set
      
      * fix vjp dispatch
      
      * add comment
      
      * add vjp test for new ir
      
      * add test for tanh vjp
      
      * add eager and static backend for warp lower level api
      
      * support call_vjp pybind
      
      * polish code and add test for vjp
      
      * remove useless code
      
      * polish code
      
      * remove useless code
      
      * support mean vjp
      
      * add test for mean vjp and support has_vjp function
      
      * fix call_vjp
      
      * polish code
      
      * add primitive ops set for backend
      
      * add vjp test for tanh_
      
      * fix inference CI
      
      * fix inference ci
      
      * modify fluid cmake
      
      * remove useless deps
      
      * add cmake
      
      ---------
      Co-authored-by: Ncxxly <chenxx_id@163.com>
      Co-authored-by: Nzhangbo9674 <zhangbo54@baidu.com>
      501a51fc
    • Z
      refine code (#56020) · 611e71d0
      zhangbo9674 提交于
      611e71d0
  2. 07 8月, 2023 8 次提交
  3. 05 8月, 2023 1 次提交
  4. 04 8月, 2023 7 次提交
    • K
      [NewIR] Rename feed with place to data (#55778) · 274e5e54
      kangguangli 提交于
      * fix bug: feed_with_place should consider variable existence
      
      * fix
      
      * fix build scope
      
      * change method to set feed var name
      
      * remove feed_with_place to placeholder
      
      * fix
      
      * rename to data
      
      * fix
      
      * fix
      274e5e54
    • J
      [Semi AutoParall] Support Partial Semantic I (#55508) · e3b6e02f
      JZ-LIANG 提交于
      e3b6e02f
    • H
      [NewIR]New ir aot placement refactor (#55810) · dd1379e4
      hong 提交于
      * refacot aot
      
      * update
      
      * fix bugs
      
      * remove some test
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * update
      dd1379e4
    • R
      [clang-tidy] enable modernize-use-emplace (#55799) · 469a0392
      Ruibin Cheung 提交于
      * [clang-tidy] enable modernize-use-emplace
      
      * Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into modernize_use_emplace
      469a0392
    • Z
      1e4f627d
    • J
      Support Combined indexing for __getitem__ and __setitem__ (#55211) · 697c712f
      JYChen 提交于
      * WIP: start writing combined indexing get
      
      * list/tuple/Variable
      
      * getitem 80%
      
      * add setitem
      
      * add some unittest for setitem
      
      * lazy import
      
      * fix some setitem error
      
      * fix advance indexing with decreasing axes; fix strided_slice input name
      
      * combine int-tensor getitem is ok (without boolean support & broadcast); add getitem unittest for static
      
      * add broadcast & parse bool tensor for __getitem
      
      * [change getitem] _getitem_impl_ to _getitem_static, not deleting the former one
      
      * refine new getitem; fix ut in variable/var_base
      
      * add __getitem__ ut in dygraph
      
      * re-dispatch getitem for Py/CPP; fix strided_slice decrease axes error in dygraph
      
      * fix ut; support tensor in slice
      
      * [change setitem] _setitem_impl_ to _setitem_static, not deleting the former one
      
      * remove some UT (for some, temporarily)
      
      * add IndexError to solve timeout problem in static-mode
      
      * 1.temply forbideen all-False bool-indexput; 2.setitem_static will return new variable
      
      * xpu uses old stratege
      
      * rename dy2st setitem ut to avoid same-name problem
      
      * dy2st for new combined index
      
      * ut case for combine-index with dy2st
      
      * open ut with all-false-bool setitem
      
      * remove useless doc and _getitem_impl_
      
      * change static res
      
      * fix static xpu
      697c712f
    • C
      [IR] Reshape2 and Flatten_contiguous_range Support Inplace (#55809) · dd0681e3
      chen 提交于
      * inplace pass support reshape2 and flatten_contiguous_range
      
      * recover the modification to inplace_op_var_pass.cc
      dd0681e3
  5. 03 8月, 2023 9 次提交
  6. 02 8月, 2023 8 次提交
    • X
      [EvalFrame] support python3.11 in eval frame. (#55887) · f45dd5ee
      xiongkun 提交于
      f45dd5ee
    • W
      Eager tensor doc (#55879) · 880e94fc
      wanghuancoder 提交于
      * add docstring of three eager method
      
      * test=docs_preview
      
      * update element size bind
      
      * update docs of numpy, clone, clear_gradient, element_size; test=docs_preview
      
      * refine clear_gradient docs; test=docs_preview
      
      * refine element_size docs; test=docs_preview
      
      * add detach doc; test=docs_preview
      
      * empty commit; test=docs_preview
      
      * update signature; test=docs_preview
      
      * refactor; test=docs_preview
      
      * empty commit; test=docs_preview
      
      * add docstring of Tensor
      
      * empty commit; test=docs_preview
      
      * refine TensorDoc; test=docs_preview
      
      * refine TensorDoc; test=docs_preview
      
      * remove extra indent in TensorDoc; test=docs_preview
      
      * remove a space; test=docs_preview
      
      * move docs ahead of implementation; test=docs_preview
      
      * refine
      
      ---------
      Co-authored-by: Nwj-Mcat <1435130236@qq.com>
      Co-authored-by: NSigureMo <sigure.qaq@gmail.com>
      880e94fc
    • W
      [XPU]Add conv1d fuse pass (#55719) · 22c7a6eb
      wz1qqx 提交于
      22c7a6eb
    • Z
      [IR] NewIr Interpreter Beta run regular (#55828) · 63b7fc80
      zhangbo9674 提交于
      * add interface
      
      * add code
      
      * add code
      
      * add code
      
      * add code
      
      * fix bug
      
      * fix bug
      
      * add var prefix
      
      * add code
      
      * add code
      
      * add code
      
      * fix compile bug
      
      * fix bug
      
      * refine code
      
      * refine code
      
      * refine code
      
      * refine code
      
      * fix bug
      
      * add code
      
      * add code
      
      * fix bug
      
      * add code
      
      * add code
      
      * refine code
      
      * refine code
      
      * fix bug
      
      * add code
      
      * fix bug in phi__kernel_utils
      
      * refine code
      
      * fix bug
      
      * open flag
      
      * refine code
      
      * fix bug
      
      * fix bug
      
      * refine code
      
      * fix bug
      63b7fc80
    • Y
      [Inference] Replace groupNorm when data types are bf16 and fp16, and data... · e61d892a
      yangjianfengo1 提交于
      [Inference] Replace groupNorm when data types are bf16 and fp16, and data format is NHWC implementation. (#55399)
      
      * finish
      
      * cpergroup odd
      
      * fix bf16
      
      * single channel
      
      * code style
      
      * jingdu duiqi
      
      * add head_file
      
      * add bf16 head file
      
      * bf16 2
      
      * bf16
      
      * bf16 head
      
      * bf16 compile
      
      * py test
      
      * bf16 compile
      
      * bf16 compile
      
      * unset py test
      
      * nhwc
      
      * test
      
      * mean var
      
      * bf16 success
      
      * su
      
      * ctest success
      
      * use is_same_as
      
      * is_same
      
      * use is_same
      
      * rtol
      
      * gpu_stream
      
      * del sigmod
      
      * fix bfloat16 type
      
      * use cuda_bf16_hpp
      
      * use_cuda_arch
      
      * bfloat162float2
      
      * del inplace_tol
      
      * del max_releative_tol
      
      * temp store
      
      * jingdu duiqi
      
      * temp store
      
      * plugin
      
      * jingdu duiqi
      
      * duiqi
      
      * include cuda.h
      
      * del half
      
      * half single
      
      * ci
      
      * add const
      
      * ci
      
      * cudamemset
      
      * del printf
      
      * fp16 test
      
      * add half compute
      
      * del br16 ci
      
      * del ci
      
      * ci approve
      
      * del fluid include
      e61d892a
    • W
      fix security bug (#55782) · 19da5c0c
      wanghuancoder 提交于
      * fix security bug
      19da5c0c
    • J
      [XPU] Add gather_squeeze_pass (#55605) · d13a49d6
      jiangfan06 提交于
      d13a49d6
    • X
      【new ir】add ir pybind api (#55745) · ef29468e
      xiaoguoguo626807 提交于
      * add ir core
      
      * add test
      
      * modify name
      
      * merge
      
      * add test for __eq__
      
      * shield  test for __eq__
      
      * --amend
      
      * Update new_ir_compiler.cc
      ef29468e