1. 26 10月, 2021 1 次提交
    • L
      [cherry-pick-2.2] Fused attention op forward (#35905) (#36708) · d2be870a
      Li Min 提交于
      功能:本PR的目标是提高attention模块的计算性能。
      为了减少框架层对op的调度开销,本PR通过在C++层手动实现attention模块,对外提供attention 大op;
      为了减少防存开销,本PR采取了两种优化方法:
      (1)在q,k,v计算时通过共享输入X,将该处的gemm,transpose和bias add从三次调用减少为一次;
      (2)使用kernel融合优化技术,在不同cuda kernel之间通过寄存器传输数据;
      d2be870a
  2. 26 7月, 2021 1 次提交
  3. 01 7月, 2021 1 次提交
  4. 29 4月, 2021 1 次提交
  5. 27 4月, 2021 2 次提交
  6. 23 2月, 2021 1 次提交
  7. 01 2月, 2021 1 次提交
  8. 29 12月, 2020 1 次提交
    • C
      fix code bugs (#29932) · af372858
      Chen Long 提交于
      * fix code bugs
      
      * fix code bugs test=document_fix
      
      * fix code bugs test=document_fix
      af372858
  9. 05 12月, 2020 1 次提交
  10. 24 11月, 2020 1 次提交
    • L
      Upgrade string literals to raw string (#28989) · 3815d7aa
      Leo Chen 提交于
      * upgrade comment string to raw string
      
      * fix string in
      
      * fix string with ' '
      
      * revert update on comments
      
      * upgrade only necessary
      
      * fix sample code checker
      
      * fix comments with '''
      3815d7aa
  11. 30 9月, 2020 1 次提交
  12. 29 9月, 2020 1 次提交
  13. 18 9月, 2020 1 次提交
  14. 02 9月, 2020 1 次提交
  15. 24 8月, 2020 1 次提交
    • G
      [API 2.0] Add transformer apis (#26418) · 317f7ce2
      Guo Sheng 提交于
      * Add MultiHeadAttention api.
      test=develop
      
      * Add MultiHeadAttention cache type and gen_cache.
      test=develop
      
      * Add TransformerEncoderLayer and TransformerEncoder.
      test=develop
      
      * Add Transformer decoder apis.
      test=develop
      
      * Add Transformer api.
      test=develop
      
      * add unittests for transformer api
      
      * add unittests for transformer api
      
      * Fix some bugs in Transformer apis.
      test=develop
      
      * add unittests for encoder, decoder and transformer
      
      * clean conflicts infor in code
      
      * clean Chinese comments
      
      * Add TransformerDecoderCell and TransformerBeamSearchDecoder.
      test=develop
      
      * Remove TransformerDecoderCell and TransformerBeamSearchDecoder temporarily.
      test=develop
      
      * Add import for Transformer apis.
      test=develop
      
      * Update usage of weight_attr and Tensor in Transformer api docs.
      test=develop
      
      * Update Transformer apis by renaming MultiheadAttention and cal_kv according to comments.
      test=develop
      
      * Fix MultiHeadAttention in test_transformer_api.py.
      test=develop
      Co-authored-by: NLiuChiaChi <709153940@qq.com>
      317f7ce2
  16. 23 3月, 2020 1 次提交
  17. 17 9月, 2018 1 次提交
  18. 03 9月, 2018 1 次提交
  19. 15 8月, 2018 1 次提交
  20. 26 7月, 2018 1 次提交
  21. 18 6月, 2018 1 次提交
  22. 09 6月, 2018 1 次提交
  23. 24 5月, 2018 1 次提交
  24. 18 5月, 2018 1 次提交
  25. 17 5月, 2018 2 次提交
  26. 16 5月, 2018 1 次提交
  27. 12 5月, 2018 1 次提交
    • Q
      Add inferencer infer (#10445) · 2a971f30
      Qiao Longfei 提交于
      * add Inference.infer
      
      * optimize code
      
      * update no_test_word2vec_new_api.py
      
      * update trainer
      
      * split check_and_get_place
      
      * use inference_program to save inference model in Trainer
      
      * update demo
      
      * update save_inference_model
      
      * clean code
      2a971f30
  28. 05 5月, 2018 1 次提交
    • J
      Trainer save load params (#10386) · bd66eed5
      Jeff Wang 提交于
      * Load/save the params from the params_path
      
      * Switch to use load_persistables and save_persistables
      
      * Instaed of setup the executor to run program and scope. Pass the program to the load_persistables
      bd66eed5
  29. 03 5月, 2018 1 次提交
    • H
      improve trainer API · a66052c6
      Helin Wang 提交于
      - The trainer and inferencer will load params from disk if param_path
        argument is not None in their constructor.
      
      - Remove params.py, we will expose core.Scope to the user if needed
        (e.g., for GAN). Currently we will not expose it, unless we clearly
        know doing so can support GAN.
      
      - Add `save_params` to Trainer (a TODO item).
      
      - rename "network" to "program"
      a66052c6
  30. 02 5月, 2018 4 次提交
  31. 12 2月, 2018 1 次提交
  32. 21 1月, 2018 1 次提交
    • D
      "fix decode bug" (#7711) · e983cc90
      dzhwinter 提交于
      * "fix decode bug"
      
      * "follow commnet"
      
      * "fix error"
      
      * "fix hook bug"
      
      * fix based comment
      
      * fix copyright
      
      * fix based on comment
      e983cc90
  33. 15 1月, 2018 1 次提交
    • D
      Feature/hooks (#7513) · b9b75377
      dzhwinter 提交于
      * add copyright hook
      
      * add copyright hook
      
      * refine copyright hook
      
      * "test copyright hook"
      
      * fix check style
      
      * fix ci
      b9b75377
  34. 12 11月, 2016 1 次提交
  35. 28 9月, 2016 1 次提交