1. 09 8月, 2023 2 次提交
  2. 08 8月, 2023 2 次提交
  3. 07 8月, 2023 2 次提交
    • Y
      Add attn_mask supported for FlashAttnKernel. (#55969) · 42e0c6b8
      yin wei 提交于
      * add mask
      
      * add backword
      
      * add enforce info
      
      * update scale
      
      * integrate code
      
      * update enforce
      
      * add enforce eq
      
      * add error type
      
      * update enforce
      
      * add test_flash_attention
      
      * Polish codes and fix compiling errors.
      
      * Set num_splits to 0 for flash-attn with tensor mask.
      
      * Fix the compiling error for non flash-attn case.
      
      ---------
      Co-authored-by: NLiu Yiqun <liuyiqun01@baidu.com>
      42e0c6b8
    • X
      [dy2static] PaddleSOT pr (#54202) · c1913a5f
      xiongkun 提交于
      * add paddle-symbolic-trace to paddle
      
      * add symoblic trace
      
      * delete swp
      
      * support Layer in symbolic trace
      
      * fix test-symbolic-trace, make symbolic trace return a StaticFunction
      
      * template the error message
      
      * fix some unittest
      
      * Modify the execution mode of test
      
      * Modify the module name
      
      * add dy2static unittest decorator
      
      * change some unittest files by @ast_only_test
      
      * fix unittest.
      
      * test-symbolic-trace
      
      * update test_write_python_container.py
      
      * update
      
      * fix test_param_parse.py
      
      * add submodule and ln -sf in cmakefile
      
      * update
      
      * update
      
      * fix some ast only errors
      
      * update
      
      * Polish ut
      
      * fix unittests
      
      * update
      
      * update
      
      * fix unittests
      
      * update
      
      * test warning ast only
      
      * update
      
      * Ast only some uts
      
      * Fix unitests
      
      * test_error ast only
      
      * update
      
      * update
      
      * Support build_strategy for sot
      
      * update
      
      * import sot as a third party module
      
      * update
      
      * update
      
      * Polish code
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * remove old fluid api and use paddle.nn.relu instead
      
      * fix
      
      * comment the print of ast code
      
      * add try-finally block
      
      * fix dy2static stop-gradient bugs
      
      * fix code
      
      * remove unused submodule and minor codestyle fix
      
      * fix
      
      * fix cast error
      
      * fix interpolate meets int64 in static model
      
      * add evalframe support for py311
      
      * fix
      
      * fix err
      
      * switch ENABLE_FALL_BACK=False
      
      * fix
      
      * Fix CI for some unittest
      
      * add ENABLE_SOT
      
      * remove setup.py dependences
      
      ---------
      Co-authored-by: NNotHaozi <zhangmenghao@baidu.com>
      Co-authored-by: Nfeifei-111 <2364819892@qq.com>
      Co-authored-by: N0x45f <wangzhen45@baidu.com>
      Co-authored-by: NSigureMo <sigure.qaq@gmail.com>
      c1913a5f
  4. 04 8月, 2023 1 次提交
  5. 03 8月, 2023 3 次提交
  6. 02 8月, 2023 3 次提交
    • Y
      [Inference] Replace groupNorm when data types are bf16 and fp16, and data... · e61d892a
      yangjianfengo1 提交于
      [Inference] Replace groupNorm when data types are bf16 and fp16, and data format is NHWC implementation. (#55399)
      
      * finish
      
      * cpergroup odd
      
      * fix bf16
      
      * single channel
      
      * code style
      
      * jingdu duiqi
      
      * add head_file
      
      * add bf16 head file
      
      * bf16 2
      
      * bf16
      
      * bf16 head
      
      * bf16 compile
      
      * py test
      
      * bf16 compile
      
      * bf16 compile
      
      * unset py test
      
      * nhwc
      
      * test
      
      * mean var
      
      * bf16 success
      
      * su
      
      * ctest success
      
      * use is_same_as
      
      * is_same
      
      * use is_same
      
      * rtol
      
      * gpu_stream
      
      * del sigmod
      
      * fix bfloat16 type
      
      * use cuda_bf16_hpp
      
      * use_cuda_arch
      
      * bfloat162float2
      
      * del inplace_tol
      
      * del max_releative_tol
      
      * temp store
      
      * jingdu duiqi
      
      * temp store
      
      * plugin
      
      * jingdu duiqi
      
      * duiqi
      
      * include cuda.h
      
      * del half
      
      * half single
      
      * ci
      
      * add const
      
      * ci
      
      * cudamemset
      
      * del printf
      
      * fp16 test
      
      * add half compute
      
      * del br16 ci
      
      * del ci
      
      * ci approve
      
      * del fluid include
      e61d892a
    • O
      [xdoctest] reformat example code with google style in No.21-30 (#55849) · 5d26d79f
      ooooo-create 提交于
      * [Doctest]fix No.21, test=docs_preview
      
      * Revert "[Doctest]fix No.21, test=docs_preview"
      
      This reverts commit 76bcdb280e254d682be6fc6f85588f1940bb1ade.
      
      * [Doctest]fix No.21, test=docs_preview
      
      * fix bugs,test=docs_preview
      
      * [Doctest]fix No.22-24,26,27, test=docs_preview
      
      * update fix
      
      * with pre-commit, test=docs_preview
      
      * fix seed, test=docs_preview
      
      * fix error, test=docs_preview
      
      * fix seed, test=docs_preview
      
      * fix seed, test=docs_preview
      5d26d79f
    • zhenhailiu's avatar
      Add scaled_dot_product_attention api (#55242) · b19dfb8c
      zhenhailiu 提交于
      b19dfb8c
  7. 31 7月, 2023 3 次提交
  8. 27 7月, 2023 1 次提交
  9. 26 7月, 2023 1 次提交
  10. 20 7月, 2023 1 次提交
    • X
      [Kunlun] Modify some legacy code on distributed training (#55515) · 806f8d2b
      XiaociZhang 提交于
      * [Kunlun] Mofify some legacy code on distributed training
      
      There were limitations on XPUs before, such as concat/split is not
      supported, and c_broadcast only support fp32. These limitations are
      lifted recently.
      
      Multi-device profiling on XPU will also be supported by this PR.
      Without this PR, a hanging broadcast will be issued by devices that
      enables profiling, eventually lead to kernel timeout error.
      
      * fix typo
      806f8d2b
  11. 18 7月, 2023 1 次提交
    • M
      [Add] Paddle 代码 CI 中引入 xdoctest 检查 (#55295) · 26fba07c
      megemini 提交于
      * [Add]Add Xdoctester
      
      * [Fix]fix beta docstring
      
      * [Doctest]change dirichlet docstring
      
      * [Doctest]change gumbel docstring
      
      * [Doctest]change bernoulli docstring
      
      * [Doctest]change categorical docstring
      
      * [Doctest]change ops.py docstring
      
      * [Doctest]change conv docstring
      
      * [Doctest]change distance docstring, test=docs_preview
      
      * [Change]add ref
      
      * [Change]patch xdoctest debug
      26fba07c
  12. 11 7月, 2023 1 次提交
  13. 06 7月, 2023 1 次提交
    • Z
      add clip_grad_value_ api (#54603) · 88402cdb
      zqw_1997 提交于
      * add clip_grad_value_ api
      
      * add test for ClipGradByValue
      
      * typo fix
      
      * refine and modify clip_grad_norm_
      
      * no_grad
      
      * clip_
      
      * remove g=p.grad
      
      * bug: AssertionError: When Variable is used as the condition of if/while , Variable can only contain one element.
      88402cdb
  14. 03 7月, 2023 1 次提交
  15. 30 6月, 2023 1 次提交
    • L
      修改 COPY-FROM No.5 nn (#54889) · 0f69d932
      Li-fAngyU 提交于
      * COPY FROM issue of paddle nn
      
      * 移除额外添加的示例,并修改Examplex->Examples
      
      * remove example name of transformer
      0f69d932
  16. 28 6月, 2023 1 次提交
  17. 16 6月, 2023 1 次提交
  18. 14 6月, 2023 3 次提交
  19. 12 6月, 2023 1 次提交
  20. 09 6月, 2023 1 次提交
  21. 31 5月, 2023 1 次提交
  22. 30 5月, 2023 1 次提交
  23. 23 5月, 2023 2 次提交
  24. 22 5月, 2023 1 次提交
    • M
      [dygraph]unify _non_static_mode() in_dygraph_mode() and in_dynamic_mode() (#53856) · 3794d171
      Meteor Liu 提交于
      * [dygraph]unify _non_static_mode() in_dygraph_mode() and in_dynamic_mode()
      
      * [dygraph]unify _non_static_mode() in_dygraph_mode() and in_dynamic_mode()
      
      * [dygraph]unify _non_static_mode() in_dygraph_mode() and in_dynamic_mode()
      
      * [dygraph]unify _non_static_mode() in_dygraph_mode() and in_dynamic_mode()
      
      * [dygraph]unify _non_static_mode() in_dygraph_mode() and in_dynamic_mode()
      
      * [dygraph]unify _non_static_mode() in_dygraph_mode() and in_dynamic_mode()
      
      * fixed cyclic reference that caused patial import
      
      * fixed bad change
      
      * fix bad import
      
      * fix bad import
      
      * fix bad import
      
      * fix ut failed caused by change in_dynamic_mode
      
      * fix ut failed caused by change in_dynamic_mode
      
      * fixed usage of in_dynamic_mode() or in_dygraph_mode()
      
      * revert python3 to python in .pre-commit-config.yaml
      
      * fix merge conflicts
      3794d171
  25. 18 5月, 2023 3 次提交
  26. 16 5月, 2023 1 次提交