1. 06 9月, 2023 3 次提交
  2. 04 9月, 2023 1 次提交
    • Z
      [IR] Support inplace pass (#56672) · 67c84c45
      zhangbo9674 提交于
      * add code
      
      * add code
      
      * refine code
      
      * add code
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * add code
      
      * add ut
      
      * polish code
      
      * fix bug
      
      * refine code
      
      * fix bug
      
      * refine code
      
      * fix bug
      
      * refine code
      
      * fix bug
      
      * refine code
      
      * fix bug
      
      * refine code
      
      * add code
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * fix bug
      
      * refine code
      67c84c45
  3. 31 8月, 2023 2 次提交
  4. 29 8月, 2023 1 次提交
  5. 28 8月, 2023 2 次提交
  6. 25 8月, 2023 1 次提交
  7. 24 8月, 2023 1 次提交
  8. 22 8月, 2023 2 次提交
  9. 21 8月, 2023 2 次提交
  10. 15 8月, 2023 1 次提交
  11. 14 8月, 2023 1 次提交
  12. 11 8月, 2023 1 次提交
    • D
      move some fluid apis (#55986) · eafc9889
      Difer 提交于
      * move fluid apis
      
      * fix type error
      
      * remove static exponential_decay
      
      * fix some import error
      
      * remove nn.py
      
      * fix some error
      
      * fix type error
      eafc9889
  13. 09 8月, 2023 3 次提交
  14. 08 8月, 2023 2 次提交
  15. 07 8月, 2023 2 次提交
    • Y
      Add attn_mask supported for FlashAttnKernel. (#55969) · 42e0c6b8
      yin wei 提交于
      * add mask
      
      * add backword
      
      * add enforce info
      
      * update scale
      
      * integrate code
      
      * update enforce
      
      * add enforce eq
      
      * add error type
      
      * update enforce
      
      * add test_flash_attention
      
      * Polish codes and fix compiling errors.
      
      * Set num_splits to 0 for flash-attn with tensor mask.
      
      * Fix the compiling error for non flash-attn case.
      
      ---------
      Co-authored-by: NLiu Yiqun <liuyiqun01@baidu.com>
      42e0c6b8
    • X
      [dy2static] PaddleSOT pr (#54202) · c1913a5f
      xiongkun 提交于
      * add paddle-symbolic-trace to paddle
      
      * add symoblic trace
      
      * delete swp
      
      * support Layer in symbolic trace
      
      * fix test-symbolic-trace, make symbolic trace return a StaticFunction
      
      * template the error message
      
      * fix some unittest
      
      * Modify the execution mode of test
      
      * Modify the module name
      
      * add dy2static unittest decorator
      
      * change some unittest files by @ast_only_test
      
      * fix unittest.
      
      * test-symbolic-trace
      
      * update test_write_python_container.py
      
      * update
      
      * fix test_param_parse.py
      
      * add submodule and ln -sf in cmakefile
      
      * update
      
      * update
      
      * fix some ast only errors
      
      * update
      
      * Polish ut
      
      * fix unittests
      
      * update
      
      * update
      
      * fix unittests
      
      * update
      
      * test warning ast only
      
      * update
      
      * Ast only some uts
      
      * Fix unitests
      
      * test_error ast only
      
      * update
      
      * update
      
      * Support build_strategy for sot
      
      * update
      
      * import sot as a third party module
      
      * update
      
      * update
      
      * Polish code
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * remove old fluid api and use paddle.nn.relu instead
      
      * fix
      
      * comment the print of ast code
      
      * add try-finally block
      
      * fix dy2static stop-gradient bugs
      
      * fix code
      
      * remove unused submodule and minor codestyle fix
      
      * fix
      
      * fix cast error
      
      * fix interpolate meets int64 in static model
      
      * add evalframe support for py311
      
      * fix
      
      * fix err
      
      * switch ENABLE_FALL_BACK=False
      
      * fix
      
      * Fix CI for some unittest
      
      * add ENABLE_SOT
      
      * remove setup.py dependences
      
      ---------
      Co-authored-by: NNotHaozi <zhangmenghao@baidu.com>
      Co-authored-by: Nfeifei-111 <2364819892@qq.com>
      Co-authored-by: N0x45f <wangzhen45@baidu.com>
      Co-authored-by: NSigureMo <sigure.qaq@gmail.com>
      c1913a5f
  16. 04 8月, 2023 1 次提交
  17. 03 8月, 2023 3 次提交
  18. 02 8月, 2023 3 次提交
    • Y
      [Inference] Replace groupNorm when data types are bf16 and fp16, and data... · e61d892a
      yangjianfengo1 提交于
      [Inference] Replace groupNorm when data types are bf16 and fp16, and data format is NHWC implementation. (#55399)
      
      * finish
      
      * cpergroup odd
      
      * fix bf16
      
      * single channel
      
      * code style
      
      * jingdu duiqi
      
      * add head_file
      
      * add bf16 head file
      
      * bf16 2
      
      * bf16
      
      * bf16 head
      
      * bf16 compile
      
      * py test
      
      * bf16 compile
      
      * bf16 compile
      
      * unset py test
      
      * nhwc
      
      * test
      
      * mean var
      
      * bf16 success
      
      * su
      
      * ctest success
      
      * use is_same_as
      
      * is_same
      
      * use is_same
      
      * rtol
      
      * gpu_stream
      
      * del sigmod
      
      * fix bfloat16 type
      
      * use cuda_bf16_hpp
      
      * use_cuda_arch
      
      * bfloat162float2
      
      * del inplace_tol
      
      * del max_releative_tol
      
      * temp store
      
      * jingdu duiqi
      
      * temp store
      
      * plugin
      
      * jingdu duiqi
      
      * duiqi
      
      * include cuda.h
      
      * del half
      
      * half single
      
      * ci
      
      * add const
      
      * ci
      
      * cudamemset
      
      * del printf
      
      * fp16 test
      
      * add half compute
      
      * del br16 ci
      
      * del ci
      
      * ci approve
      
      * del fluid include
      e61d892a
    • O
      [xdoctest] reformat example code with google style in No.21-30 (#55849) · 5d26d79f
      ooooo-create 提交于
      * [Doctest]fix No.21, test=docs_preview
      
      * Revert "[Doctest]fix No.21, test=docs_preview"
      
      This reverts commit 76bcdb280e254d682be6fc6f85588f1940bb1ade.
      
      * [Doctest]fix No.21, test=docs_preview
      
      * fix bugs,test=docs_preview
      
      * [Doctest]fix No.22-24,26,27, test=docs_preview
      
      * update fix
      
      * with pre-commit, test=docs_preview
      
      * fix seed, test=docs_preview
      
      * fix error, test=docs_preview
      
      * fix seed, test=docs_preview
      
      * fix seed, test=docs_preview
      5d26d79f
    • zhenhailiu's avatar
      Add scaled_dot_product_attention api (#55242) · b19dfb8c
      zhenhailiu 提交于
      b19dfb8c
  19. 31 7月, 2023 3 次提交
  20. 27 7月, 2023 1 次提交
  21. 26 7月, 2023 1 次提交
  22. 20 7月, 2023 1 次提交
    • X
      [Kunlun] Modify some legacy code on distributed training (#55515) · 806f8d2b
      XiaociZhang 提交于
      * [Kunlun] Mofify some legacy code on distributed training
      
      There were limitations on XPUs before, such as concat/split is not
      supported, and c_broadcast only support fp32. These limitations are
      lifted recently.
      
      Multi-device profiling on XPU will also be supported by this PR.
      Without this PR, a hanging broadcast will be issued by devices that
      enables profiling, eventually lead to kernel timeout error.
      
      * fix typo
      806f8d2b
  23. 18 7月, 2023 1 次提交
    • M
      [Add] Paddle 代码 CI 中引入 xdoctest 检查 (#55295) · 26fba07c
      megemini 提交于
      * [Add]Add Xdoctester
      
      * [Fix]fix beta docstring
      
      * [Doctest]change dirichlet docstring
      
      * [Doctest]change gumbel docstring
      
      * [Doctest]change bernoulli docstring
      
      * [Doctest]change categorical docstring
      
      * [Doctest]change ops.py docstring
      
      * [Doctest]change conv docstring
      
      * [Doctest]change distance docstring, test=docs_preview
      
      * [Change]add ref
      
      * [Change]patch xdoctest debug
      26fba07c
  24. 11 7月, 2023 1 次提交