1. 08 8月, 2023 3 次提交
  2. 07 8月, 2023 23 次提交
  3. 06 8月, 2023 1 次提交
  4. 05 8月, 2023 1 次提交
  5. 04 8月, 2023 12 次提交
    • D
      repacle embedding in fluid with 2.0 version (#55757) · 2d91a9bd
      Difer 提交于
      * replace embedding
      
      * replace sparse_embedding
      
      * fix some bugs
      
      * del embedding
      
      * repalce layers.embedding
      
      * fix type error
      2d91a9bd
    • K
      [NewIR] Rename feed with place to data (#55778) · 274e5e54
      kangguangli 提交于
      * fix bug: feed_with_place should consider variable existence
      
      * fix
      
      * fix build scope
      
      * change method to set feed var name
      
      * remove feed_with_place to placeholder
      
      * fix
      
      * rename to data
      
      * fix
      
      * fix
      274e5e54
    • J
      [Semi AutoParall] Support Partial Semantic I (#55508) · e3b6e02f
      JZ-LIANG 提交于
      e3b6e02f
    • H
      [NewIR]New ir aot placement refactor (#55810) · dd1379e4
      hong 提交于
      * refacot aot
      
      * update
      
      * fix bugs
      
      * remove some test
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * update
      dd1379e4
    • F
      [CINN] Dump more compilation result and optimize parallel compiler flags (#55935) · 39b59603
      Fisher 提交于
      1. `Parallel Compiler`:
          - 合并`FLAGS_cinn_parallel_compile_size`和`FLAGS_cinn_parallel_compile_thread`,通过`FLAGS_cinn_parallel_compile_thread`即可指定编译时使用的线程数,所有的`fusion_groups`将会平均分配到可用的线程上
          - 增强编译完成后返回的信息,除`instruction`外,将`lowered_function`、`source_code`、`source_ptx`返回,供上层进一步使用
      2. Debug信息:
          - 新增`FLAGS_ cinn_dump_group_lowered_func`、`FLAGS_cinn_dump_group_source_code`、`FLAGS_ cinn_dump_group_ptx`、`FLAGS_ cinn_dump_group_instruction`,可分别按`fusion_groups`储存编译的每个阶段中的中间代码
          - 重新整理`graph_visualization`,所有的可视化图、单测代码均能正确分组储存
      3. Bug修复:
          - 修复`MakeDirectory`不能正确创建文件夹的问题
      4. 其他:
          - 清除了一些无用代码
      39b59603
    • R
      [clang-tidy] enable modernize-use-emplace (#55799) · 469a0392
      Ruibin Cheung 提交于
      * [clang-tidy] enable modernize-use-emplace
      
      * Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into modernize_use_emplace
      469a0392
    • Z
      1e4f627d
    • K
      [NewIR] add decorator for dy2st test with new ir (#55840) · b67715a4
      kangguangli 提交于
      * add decorator for new_ir_test
      
      * fix bug and only test in ci-coverage
      
      * fix bug and only test in ci-coverage
      
      * fix
      
      * fix bugs
      
      * fix
      
      * fix
      b67715a4
    • J
      Support Combined indexing for __getitem__ and __setitem__ (#55211) · 697c712f
      JYChen 提交于
      * WIP: start writing combined indexing get
      
      * list/tuple/Variable
      
      * getitem 80%
      
      * add setitem
      
      * add some unittest for setitem
      
      * lazy import
      
      * fix some setitem error
      
      * fix advance indexing with decreasing axes; fix strided_slice input name
      
      * combine int-tensor getitem is ok (without boolean support & broadcast); add getitem unittest for static
      
      * add broadcast & parse bool tensor for __getitem
      
      * [change getitem] _getitem_impl_ to _getitem_static, not deleting the former one
      
      * refine new getitem; fix ut in variable/var_base
      
      * add __getitem__ ut in dygraph
      
      * re-dispatch getitem for Py/CPP; fix strided_slice decrease axes error in dygraph
      
      * fix ut; support tensor in slice
      
      * [change setitem] _setitem_impl_ to _setitem_static, not deleting the former one
      
      * remove some UT (for some, temporarily)
      
      * add IndexError to solve timeout problem in static-mode
      
      * 1.temply forbideen all-False bool-indexput; 2.setitem_static will return new variable
      
      * xpu uses old stratege
      
      * rename dy2st setitem ut to avoid same-name problem
      
      * dy2st for new combined index
      
      * ut case for combine-index with dy2st
      
      * open ut with all-false-bool setitem
      
      * remove useless doc and _getitem_impl_
      
      * change static res
      
      * fix static xpu
      697c712f
    • N
      Fix a bug in VecAutomaticAddPerBlock (#55929) · 81511469
      niuliling123 提交于
      81511469
    • L
    • C
      [IR] Reshape2 and Flatten_contiguous_range Support Inplace (#55809) · dd0681e3
      chen 提交于
      * inplace pass support reshape2 and flatten_contiguous_range
      
      * recover the modification to inplace_op_var_pass.cc
      dd0681e3