1. 31 May, 2022 2 commits
    • C
      [Eager] Polish append op using for model perf (#43102) · e9589e35
      Chen Weihang committed
      * polish append op using
      
      * fix var error
      
      * fix group norm impl
      e9589e35
    • T
      [PaddlePaddle Hackathon 2] 16: Add new API RRelu (#41823) · 21e1d10f
      thunder95 committed
      * rrelu logic
      
      * unregistered op kernel (unresolved)
      
      * commit before merge
      
      * enrich test cases
      
      * fix bug in rrelu-sig
      
      * fix tests in the CPU environment
      
      * fix spelling errors
      
      * fix code format
      
      * try to fix the test case timeout problem
      
      * optimize test cases
      
      * remove seed, optimize random function
      
      * update en doc for rrelu
      
      * fix rrelu en docs, test=document_fix
      
      * add paper link for en docs, test=document_fix
      
      * update en doc
      
      * add r,test=document_fix
      21e1d10f
  2. 19 May, 2022 1 commit
  3. 13 May, 2022 1 commit
  4. 12 May, 2022 1 commit
  5. 28 April, 2022 1 commit
  6. 26 April, 2022 1 commit
    • B
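      [PaddlePaddle Hackathon 2] 29: Add the PixelUnshuffle network-building API to Paddle (#40728) · 5be9b824
      BrilliantYuKaimin committed
      * add shape inference for PixelUnshuffle
      
      * add operator registration for PixelUnshuffle
      
      * add kernels for PixelUnshuffle and its gradient
      
      * add description of the PixelUnshuffle operator
      
      * add signature of the PixelUnshuffle operator
      
      * add PixelUnshuffle at the Python level
      
      * add unit tests for PixelUnshuffle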
      
      * Update test_pixel_unshuffle.py
      
      * test=document_fix
      
      * Update test_pixel_unshuffle.py
      
      add tests for extra_repr
      
      * fix code format
      
      * Update test_pixel_unshuffle.py
      
      fix tests for extra_repr
      
      * change where the pixel_unshuffle kernel is implemented
      
      * fix code format
      
      * improve input checks
      
      * Update test_pixel_unshuffle.py
      
      * improve input checks for pixel_unshuffle
      
      * Update pixel_unshuffle_op.cc
      
      * Update unary.cc
      
      * add pixel_unshuffle
      
      * Update test_pixel_unshuffle.py
      
      * Update vision.py
      
      * adjust code format
      
      * Update vision.py
      
      * Delete extra spaces
      
      * Update pixel_unshuffle_sig.cc
      
      * Update vision.py
      
      * Update vision.py
      
      * add PixelUnshuffleGradInferMeta
      
      * remove PixelUnshuffleOpArgumentMapping
      
      * Update pixel_unshuffle_op.cc
      
      * move the implementation of the pixel_unshuffle kernel and its gradient kernel
      
      * Update pixel_unshuffle_op.cc
      5be9b824
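For orientation, the shape inference and input checks named in the commit above can be sketched in NumPy. This is a reference sketch only, not the kernel added by the PR. The function name `pixel_unshuffle_ref`, the NCHW layout, and the channel ordering are assumptions made for illustration.

```python
import numpy as np


def pixel_unshuffle_ref(x, downscale_factor):
    """Hypothetical NumPy reference: [N, C, H, W] -> [N, C*r*r, H/r, W/r]."""
    if x.ndim != 4:
        raise ValueError("expected a 4-D input in NCHW layout")
    r = downscale_factor
    n, c, h, w = x.shape
    if r <= 0 or h % r != 0 or w % r != 0:
        raise ValueError("height and width must be divisible by a positive factor")
    # Split each spatial axis into (blocks, offset inside the block).
    x = x.reshape(n, c, h // r, r, w // r, r)
    # Move the two in-block offsets next to the channel axis.
    x = x.transpose(0, 1, 3, 5, 2, 4)
    # Fold the offsets into the channel axis.
    return x.reshape(n, c * r * r, h // r, w // r)


x = np.arange(16).reshape(1, 1, 4, 4)
out = pixel_unshuffle_ref(x, 2)
print(out.shape)
```

The output has `r * r` times as many channels and spatial dimensions divided by `r`, which is the relationship a shape-inference function for this operator has to encode.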
  7. 25 April, 2022 1 commit
    • B
      [PaddlePaddle Hackathon 2] 24: Add the nn.ChannelShuffle network-building API to Paddle (#40743) · bbaaf217
      BrilliantYuKaimin committed
      * Add infermeta for ChannelShuffle
      
      * Create channel_shuffle_grad_kernel.h
      
      * Create channel_shuffle_kernel.h
      
      * Create channel_shuffle_sig.cc
      
      * Create channel_shuffle_op.cc
      
      Description of the ChannelShuffle operator
      
      * Create channel_shuffle_kernel_impl.h
      
      Implementation of the ChannelShuffle kernel
      
      * Create channel_shuffle_grad_kernel_impl.h
      
      Implementation of the ChannelShuffle backward kernel
      
      * Add kernel register of channel shuffle and grad
      
      Register the ChannelShuffle kernel and its backward kernel
      
      * add nn.functional.channel_shuffle
      
      * add nn.ChannelShuffle
      
      * Create test_channel_shuffle.py
      
      * Update example of ChannelShuffle in vision.py
      
      * Update test_channel_shuffle.py
      
      * change where the channel_shuffle kernel is implemented
      
      * fix code format
      
      * remove extra spaces
      
      * improve error checks for channel_shuffle
      
      * Update unary.cc
      
      * Update channel_shuffle_op.cc
      
      * Update test_channel_shuffle.py
      
      * Update unary.cc
      
      * add channel_shuffle
      
      * Update test_channel_shuffle.py
      
      * Update vision.py
      
      * adjust code format
      
      * Update channel_shuffle_sig.cc
      
      * update the ChannelShuffle documentation
      
      * update the channel_shuffle documentation
      
      * remove ChannelShuffleOpArgumentMapping
      
      * add ChannelShuffleGradInferMeta
      
      * Update channel_shuffle_op.cc
      
      * move the channel_shuffle kernel and its gradient kernel
      bbaaf217
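As a rough illustration of what a channel shuffle computes, here is a NumPy sketch. It is not the kernel from the PR above. The name `channel_shuffle_ref` and the NCHW layout are assumptions for this example.

```python
import numpy as np


def channel_shuffle_ref(x, groups):
    """Hypothetical NumPy reference: interleave channels across groups (NCHW)."""
    if x.ndim != 4:
        raise ValueError("expected a 4-D input in NCHW layout")
    n, c, h, w = x.shape
    if groups <= 0 or c % groups != 0:
        raise ValueError("channels must be divisible by a positive number of groups")
    # View the channel axis as (groups, channels_per_group).
    x = x.reshape(n, groups, c // groups, h, w)
    # Swap the two channel sub-axes, then flatten them back.
    x = x.transpose(0, 2, 1, 3, 4)
    return x.reshape(n, c, h, w)


x = np.arange(6).reshape(1, 6, 1, 1)
print(channel_shuffle_ref(x, 2).reshape(-1))
```

The shape is unchanged by the operation; only the channel order changes, so the main input check is that the channel count divides evenly by `groups`.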
  8. 21 April, 2022 1 commit
  9. 18 April, 2022 1 commit
  10. 02 April, 2022 1 commit
  11. 24 March, 2022 1 commit
  12. 08 March, 2022 1 commit
    • X
      Fix fold python examples (#38636) · d4a4eb9d
      xiaoting committed
      * fix fold python examples, test=develop
      
      * fix size type, test=develop
      
      * fix python example, test=develop
      
      * fix fold shape check
      
      * fix fold dygraph mode, test=develop
      d4a4eb9d
  13. 24 February, 2022 2 commits
  14. 23 February, 2022 1 commit
  15. 22 February, 2022 1 commit
  16. 20 January, 2022 1 commit
    • W
      [Eager] Support Eager mode for some testcase (#38783) · d21074cd
      wanghuancoder committed
      * Rearranged Eager AutoCodeGen directory structure
      
      * Removed USE_OP in Eager AutoCodeGen
      
      * Enabled generation for Operators without Grad/Inputs/Outputs
      
      * Resolved operators without input
      
      * Fixed merge conflicts
      
      * Enabled Eager AutoCodeGen for 10+ more operators
      
      * Refactored Eager AutoCodeGen with more organized helper objects
      
      * Enabled Eager AutoCodeGen for operators with multiple OpBases
      
      * Adjusted Eager AutoCodeGen to Enable Passing Output Tensor as Input Argument
      
      * Handled Dispensable Inputs/Outputs in Eager AutoCodeGen
      
      * Adjusted function generation/call between Python-C API & Dygraph API
      
      * Synchronized auto-generated Python-C API with Dygraph Forward Functions
      
      * support more eager tensor api
      
      * fix merge compile error
      
      * fix compile error and fit develop code
      
      * support pure CPU
      
      * fix some logic error in eager_mode
      
      * support _varbase_creator in eager mode
      
      * Added safe_initialized interface to EagerTensor for use in processing dispensable inputs
      
      * for eager mode
      
      * refine
      
      * support multiple constructor for eager tensor
      
      * add place related code
      
      * polish code
      
      * specific randint with dtype of int64
      
      * Support pure cpu test
      
      * eager logic
      
      * refine test in pure cpu
      
      * eager logic
      
      * eager logic
      
      * eager logic, test=develop
      
      * skip core.eager when in inference, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * call RetainGrad after run forward kernel, test=develop
      
      * refine, test=develop
      
      * support dygraph util, meta, guard test
      
      * eager test case
      
      * support inference test
      
      * refine test and fix initializer failed
      
      * modify eagertensor patch method
      
      * add eagertensor.clear_grandint, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * support create varbase and fix retain grad error
      
      * call monkey_patch_varbase in _test_eager_guard, test=develop
      
      * fix windows error
      
      * split clear_gradient to clear_gradient and zero_grads, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * support test_imperative_basic test in eager mode
      
      * remove additional log in variable.h
      
      * remove additional log in variable.h
      
      * remove additional code create in merge
      
      * eager
      
      * fix some eager logic, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * patch_tensor_method_func, test=develop
      
      * refine, test=develop
      
      * eager test case, test=develop
      
      * refine, test=develop
      
      * eager, test=develop
      
      * eager, test=develop
      
      * eager optimizer, test=develop
      
      * eager optimizer, test=develop
      
      * eager test_imperative_optimizer_v2, test=develop
      
      * eager, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * eager, test=develop
      
      * add resize in share buffer to, test=develop
      
      * eager, test=develop
      
      * fix _share_buffer_to, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * support eager for dataloader,test=develop
      Co-authored-by: jim19930609 <jim19930609@gmail.com>
      Co-authored-by: JiabinYang <360788950@qq.com>
      d21074cd
  17. 12 January, 2022 1 commit
  18. 10 January, 2022 2 commits
  19. 07 January, 2022 1 commit
    • W
      modify mish op and add mish api (#38734) · 8c92337c
      wangxinxin08 committed
      * add mish operator and api
      
      * remove redundant code and modify grad_atol of mish unittest
      
      * modify mish code to be consistent with other activation implementation
      8c92337c
  20. 31 December, 2021 1 commit
    • X
      Add fold operators (#38613) · 8898dce1
      xiaoting committed
      * add fold operators, test=develop
      
      * add fold operators, test=develop
      
      * add fold operators, test=develop
      
      * update fold op error test, test=develop
      
      * fix unittest, test=develop
      
      * fix unittest, test=develop
      8898dce1
  21. 29 December, 2021 1 commit
  22. 15 December, 2021 1 commit
    • S
      Add New API nn.HingeEmbeddingLoss (#37540) · 3b85864a
      Skr.B committed
      * add hinge_embedding_loss
      
      * fix test_API
      
      * test_API succeed
      
      * add English doc
      
      * fixed using of expired fluid api
      
      * fix doc
      
      * fix doc and rm python/paddle/fluid/layers/loss.py
      
      * get raw python/paddle/fluid/layers/loss.py back
      
      * fix Examples bug in English doc
      
      * unique -> flatten
      
      * fix api code
      
      * fix English doc
      
      * fix functional loss English doc
      
      * fix Example doc
      
      * .numpy() -> paddle.unique()
      
      * fix unique
      
      * fix label_item_set
      
      * modified judgment equation
      
      * Got a beautiful loss equation
      
      * use paddle.to_tensor
      
      * fix loss and add static check
      
      * fix loss and add static check
      
      * delta -> margin
      3b85864a
  23. 10 December, 2021 1 commit
  24. 07 December, 2021 1 commit
  25. 30 November, 2021 1 commit
  26. 25 November, 2021 1 commit
    • M
      [PaddlePaddle Hackathon] 6: Add ZeroPad2d to Paddle (#37151) · 81861f69
      Matsumoto GAO committed
      * add zeropad2d v0.1
      
      * add zeropad2d v0.2
      
      * add zeropad2d v0.3
      
      * add zeropad2d v0.3
      
      * add zeropad2d v0.3
      
      * add zeropad2d v0.4
      
      * add zeropad2d v0.5
      
      * add zeropad2d v0.5 codestyle
      
      * add zeropad2d v0.5 codestyle
      
      * add zeropad2d v0.6 functional
      
      * add zeropad2d v0.6 functional
      
      * add zeropad2d v0.6 functional
      81861f69
  27. 22 November, 2021 1 commit
  28. 28 October, 2021 1 commit
  29. 26 October, 2021 1 commit
    • L
      Add fused attention op backward and python layer. (#36498) · 5119428e
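      Li Min committed
      Purpose: this PR aims to improve the computational performance of the attention module.
      To reduce the framework's overhead in scheduling ops, this PR implements the attention module by hand in C++ and exposes it as a single large attention op.
      To reduce memory access overhead, this PR applies two optimizations:
      (1) When computing q, k, and v, the input X is shared, so the gemm, transpose, and bias add at that point are reduced from three calls to one.
      (2) Kernel fusion is used so that data is passed between different CUDA kernels through registers.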
      5119428e
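Optimization (1) can be illustrated in NumPy: because q, k, and v all read the same input X, their three projections can be merged into one matrix multiply and one bias add. This is a sketch of the idea only, not the C++/CUDA implementation in the PR. The function names, shapes, and weight layout are assumptions for illustration.

```python
import numpy as np


def qkv_separate(x, weights, biases):
    """Three independent projections: three matmuls and three bias adds."""
    return tuple(x @ w + b for w, b in zip(weights, biases))


def qkv_fused(x, weights, biases):
    """One projection: stack the weights so X is read and multiplied once."""
    w_qkv = np.concatenate(weights, axis=1)  # (embed_dim, 3 * embed_dim)
    b_qkv = np.concatenate(biases)           # (3 * embed_dim,)
    qkv = x @ w_qkv + b_qkv                  # single gemm plus single bias add
    return tuple(np.split(qkv, 3, axis=-1))


rng = np.random.default_rng(0)
embed_dim = 8
x = rng.standard_normal((2, 4, embed_dim))  # (batch, seq_len, embed_dim)
weights = [rng.standard_normal((embed_dim, embed_dim)) for _ in range(3)]
biases = [rng.standard_normal(embed_dim) for _ in range(3)]

separate = qkv_separate(x, weights, biases)
fused = qkv_fused(x, weights, biases)
print(all(np.allclose(a, b) for a, b in zip(separate, fused)))
```

Both paths produce the same q, k, and v. The fused path simply issues one larger call in place of three smaller ones, which is where the saving in op dispatch and memory traffic comes from.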
  30. 22 October, 2021 1 commit
    • L
      Fused attention op forward (#35905) · d4906214
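      Li Min committed
      Purpose: this PR aims to improve the computational performance of the attention module.
      To reduce the framework's overhead in scheduling ops, this PR implements the attention module by hand in C++ and exposes it as a single large attention op.
      To reduce memory access overhead, this PR applies two optimizations:
      (1) When computing q, k, and v, the input X is shared, so the gemm, transpose, and bias add at that point are reduced from three calls to one.
      (2) Kernel fusion is used so that data is passed between different CUDA kernels through registers.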
      d4906214
  31. 13 October, 2021 2 commits
  32. 17 September, 2021 1 commit
  33. 15 September, 2021 1 commit
  34. 06 September, 2021 1 commit
    • F
      replace pass with error exception (#35367) · 5675042d
      Feng Xing committed
      This PR adds error exceptions to the fused transformer Python interface.
      The function bodies are not implemented yet (they will be implemented later).
      Following zhiqiu's comment on the previous PR-35206 (already merged), it is better to raise an exception than to use "pass".
      5675042d
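The difference between the two kinds of placeholder can be seen in a small sketch. The class name `FusedFeedForwardStub` and its parameters are hypothetical and only mimic the shape of an interface-only layer; they are not the code from the PR.

```python
class FusedFeedForwardStub:
    """Hypothetical interface-only layer whose body is not written yet."""

    def __init__(self, d_model, dim_feedforward):
        self.d_model = d_model
        self.dim_feedforward = dim_feedforward

    def forward_with_pass(self, src):
        # With "pass" the call silently returns None, so a caller may only
        # notice the missing implementation much later.
        pass

    def forward(self, src):
        # Raising makes the missing implementation fail loudly at the call site.
        raise NotImplementedError("forward is not implemented yet")


layer = FusedFeedForwardStub(d_model=8, dim_feedforward=32)
print(layer.forward_with_pass([1.0, 2.0]))

try:
    layer.forward([1.0, 2.0])
except NotImplementedError as exc:
    print("raised:", exc)
```

With the exception in place, any code that reaches the unfinished interface stops immediately with a clear message, where a `pass` body would let a `None` result flow onward.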
  35. 31 August, 2021 1 commit
    • F
      transformer opt python files (#35206) · e2991555
      Feng Xing committed
      This PR adds the Python files for the fused transformer and defines its interface.
      
      The fused transformer implements an optimized version of the transformer layer (in python/paddle/nn/layer/transformer.py). In this PR, four layers (functions) are defined:
      (1) FusedMultiHeadAttention: multi-head attention layer
      (2) FusedFeedForward: feed forward layer
      (3) FusedTransformerEncoderLayer: transformer encoder layer
      (4) FusedTransformer: transformer layer
      e2991555
  36. 27 August, 2021 1 commit
    • X
      Add unpool2d op & Expose max_unpool2d API (#35056) · ceee71a0
      xiaoting committed
      * add maxunpool2d op, test=develop
      
      * fix typo, test=develop
      
      * fix unpool unittest, test=develop
      
      * fix unpool code-example, test=develop
      
      * fix for unpool_op_unittest,test=develop
      
      * fix example code, test=develop
      
      * add noqa:F401, test=develop
      
      * fix coverage, test=develop
      
      * fix unittest for unpool, test=develop
      
      * rename unpool2d to unpool, test=develop
      
      * rename unpool2d to unpool, test=develop
      ceee71a0