1. 01 3月, 2023 1 次提交
    • C
      Integration flash attention (#49869) · 61611786
      Chitsing KUI 提交于
      * flash attn
      
      * seed
      
      * almost
      
      * softmax
      
      * fix workspace
      
      * add unitest; linux only
      
      * fix setup
      
      * fix datatype include
      
      * fix setup typo
      
      * fix def scope
      
      * new error api
      
      * use paddle fork
      
      * fix attr bug; complete ut
      
      * update flash hash
      
      * fix rng reset
      
      * fix offset
      
      * fix comments
      61611786
  2. 11 2月, 2023 1 次提交
    • H
      [Tensor Operator] Overload Tensor Operator (#50098) · 14e45f6b
      HongyuJia 提交于
      * init commit
      
      * fix tensor operator*
      
      * fix compile bug
      
      * bug reproduce
      
      * update commit
      
      * polish codes
      
      * fix compile bug
      
      * test begin
      
      * test begin
      
      * compile finish
      
      * restore origin composite_backward_api
      
      * pass local CI
      
      * fix merge error
      
      * fix merge error
      
      * change py_test from GPU->CPU, test custom op
      
      * polish codes, modify prim unittest
      
      * modify prim unittest
      
      * determine phi_tensor_operants location
      
      * polish codes
      
      * add header file
      
      * solve windows unresolved symbol
      
      * fix some CI error
      
      * add overload defination
      
      * fix CI inference and Windows
      
      * polish codes according to reviewers' opinion
      
      * polish codes according to reviewers' opinion
      14e45f6b
  3. 09 1月, 2023 1 次提交
    • J
      Prim paddle Basic (#49272) · 2f601282
      Jiabin Yang 提交于
      * proto type of composite grad in paddle
      
      * proto type of composite grad in paddle
      
      * refactor composite api with phi
      
      * fix compile error
      
      * support static graph code-gen for squeeze op
      
      * generate static graph code of unsqueeze
      
      * refine op name
      
      * fix compile error
      
      * add extra output in op_compat
      
      * remove debug log
      
      * fix clang compile error
      
      * support prim switch flag
      
      * support prim switch flag
      
      * fix dygraph error
      
      * merge develop
      
      * add code_gen
      
      * add necessary files without codegen
      
      * fix code_gen bug
      
      * add deps
      
      * modify igmnore
      
      * add ignore
      
      * delete std cout
      
      * add composite logic for backward.py
      
      * add tanh first order grad composite
      
      * support enable_prim flag for static graph
      
      * throw expection when both GrapOpMaker and GradCompOpMaker not been registered
      
      * reorganize the directory of prim api tests
      
      * fix windows error
      
      * add eager_utils
      
      * add eager_utils
      
      * modify code gen
      
      * add composite parse
      
      * add unittest for get_grad_op_desc
      
      * code optimize
      
      * fix static test on windows
      
      * support generate static graph code for imag and real op
      
      * fix windows compile error in test_static_prim
      
      * merge develop
      
      * disable test eager in inference
      
      * prim code gen
      
      * disable eager compile in inference
      
      * rm other file
      
      * rm gitignore file
      
      * code_style
      
      * add eager test
      
      * code_style
      
      * merge develop
      
      * remove useless files
      
      * modify static test
      
      * support bool flag from singlton
      
      * merge develop
      
      * recover git ignore
      
      * fix conflict
      
      * recover git ignore for generated op
      
      * fix test compile error
      
      * remove some tests
      
      * add python test
      
      * fix some name issue
      
      * add composite code gen
      
      * modify backward yaml
      
      * fix static composite grad maker code gen
      
      * remove addtional files
      
      * add some static funcs unit test
      
      * fix some bugs
      
      * fix composite grad maker register code gen
      
      * optimize some functions
      Co-authored-by: Nzyfncg <zhangyunfei07@baidu.com>
      Co-authored-by: Nwangruting <wangruting@baidu.com>
      Co-authored-by: Ncxxly <chenxx_id@163.com>
      Co-authored-by: Ncharles-hit <wanghao107@baidu.com>
      Co-authored-by: Nxiaoguoguo626807 <100397923+xiaoguoguo626807@users.noreply.github.com>
      2f601282
  4. 04 3月, 2022 1 次提交
    • H
      Move conv to pten (#39354) · d50fb43e
      hong 提交于
      * move conv to pten
      
      * move conv to pten; test=develop
      
      * fix bug;
      
      * add conv cudnn impl; test=develop
      
      * update
      
      * update operator; test=develop
      
      * fix bug; test=develop
      
      * move operator and prepared_operator to develop; test=develop
      
      * resolve conflict; test=develop
      
      * remove useless code;test=develop
      
      * add depency ; test=develop
      
      * fix bug;
      
      * add sig.cc ; test=develop
      
      * fix use_op error; test=develop
      
      * fix bug; test=develop
      
      * fix bug; test=develop
      
      * add conv3d register; test=develop
      
      * fix star gan and conv_nn_grad test failed; test=develop
      
      * add header; test=develop
      
      * manul to recover to develop;
      
      * resolve confilct; test=develop
      
      * remove useless code
      
      * fix bug;
      
      * remove conv2d_cudnn; test=develop
      
      * fix bugs; test=develop
      
      * fix cpu rocm compile bugs; test=develop
      
      * fix blas error; test=develop
      
      * fix compile bug; test=develop
      
      * fix windows compile error; test=develop
      
      * fix windows error; test=develop
      
      * resolve confilct; test=develop
      d50fb43e
  5. 25 2月, 2022 1 次提交
    • 0
      move eye、size、erfinv、pixel_shuffle OP to phi (#39712) · 639675de
      0x45f 提交于
      * move eye OP to pten
      
      * move size OP to pten
      
      * merge develop
      
      * fix merge
      
      * move files
      
      * move erfinv OP to phi
      
      * remove comment
      
      * move pixel_shuffle OP to phi
      
      * remove comment
      
      * fix PT_REGISTER
      
      * fix NPU
      
      * fix CR
      
      * remove size_sig.cc for PR-CI-Coverage
      639675de
  6. 24 2月, 2022 1 次提交
  7. 20 2月, 2022 1 次提交
  8. 28 1月, 2022 1 次提交
    • H
      Move digamma to pten (#39240) · 848ae7dc
      hong 提交于
      * move digamma to pten; test=develop
      
      * fix mutable_data bugs; test=develop
      
      * remove useless code; test=develop
      
      * remove kernel compute; test=develop
      
      * fix bug; test=develop
      848ae7dc
  9. 17 1月, 2022 1 次提交
  10. 12 1月, 2022 1 次提交
    • Z
      the_one_ps dirs reconstruct (#38804) · 50609214
      ziyoujiyi 提交于
      * delete gloo connect retry
      
      * the_one_ps dirs reconstruct
      
      * .
      
      * .
      
      * create the_one_ps dirs
      
      * create the_one_ps dirs
      
      * create the_one_ps dirs
      
      * create the_one_ps dirs
      
      * create the_one_ps dirs
      
      * create the_one_ps dirs
      
      * the one ps dirs modify
      
      * the one ps dirs modify
      
      * the one ps dirs modify
      
      * the one ps dirs modify
      50609214
  11. 03 11月, 2021 1 次提交
  12. 18 9月, 2021 1 次提交
    • H
      Basic PR on Cost Model (#35774) · 5ba9fe6e
      Huihuang Zheng 提交于
      Add basic Cost Model, it uses executor to run program and profile it to get op time.
      
      This is an early basic version, we will add more functions in the future.
      5ba9fe6e
  13. 15 9月, 2020 1 次提交
  14. 03 6月, 2020 1 次提交
    • Y
      Add crypto python (#24836) · aa47356b
      Yanghello 提交于
      * add crypto helper for paddle, test=develop
      
      * cryptopp.cmake bug fixed, test=develop
      
      * remove debug build type, test=develop
      
      * fixed CMakeLists for new target, test=develop
      
      * fix CI bug, test=develop
      
      * add cmake option flag DWITH_CRYPTO, test=develop
      
      * add crypto api for python, test=develop
      
      * Revert "add crypto api for python, test=develop"
      
      This reverts commit 3a1cfa9d.
      
      * Revert "Add crypto api (#24694)"
      
      This reverts commit 5a7a517c.
      
      * Revert "Revert "Add crypto api (#24694)""
      
      This reverts commit f952b19f.
      
      * fixed cryptopp cmake building error, test=develop
      
      * change WITH_CRYPTO building option to OFF, test=develop
      
      * â€fixed cipher test failed, test=develop
      
      * "add crypto api for python, test=develop"
      
      This reverts commit 83fb55c0.
      
      * travis CI bug fixed, test=develop
      
      * fixed test in python3
      
      * test=develop
      
      * fixed unittest, test=develop
      aa47356b
  15. 21 1月, 2019 1 次提交
  16. 10 1月, 2019 1 次提交
  17. 13 12月, 2018 1 次提交
    • S
      fix cmake · deb0d41c
      sneaxiy 提交于
      fix cmake again
      test=develop
      deb0d41c
  18. 10 12月, 2018 1 次提交
  19. 10 9月, 2018 1 次提交
  20. 18 6月, 2018 1 次提交
  21. 24 5月, 2018 1 次提交
  22. 23 5月, 2018 1 次提交
  23. 22 3月, 2018 1 次提交
  24. 07 3月, 2018 2 次提交
  25. 06 3月, 2018 2 次提交
  26. 15 2月, 2018 1 次提交
    • Y
      Update tensor_util.h (#8422) · cfffb1a3
      Yi Wang 提交于
      * Update tensor_util.h
      
      * Update with moved TensorDesc
      
      * Fix tensur_utils.cu
      
      * Update
      
      * Update
      
      * Update
      
      * Update
      
      * Make tensor_util.cu a symbolic link
      cfffb1a3
  27. 10 2月, 2018 2 次提交
  28. 07 2月, 2018 1 次提交
  29. 06 2月, 2018 2 次提交
  30. 01 2月, 2018 1 次提交
  31. 31 1月, 2018 1 次提交
  32. 30 1月, 2018 1 次提交