  1. 31 Mar, 2022 8 commits
  2. 30 Mar, 2022 8 commits
    • [Phi] Move Rnn Op from fluid to phi (#41007) · 66cf8b08
      zyfncg committed
      * move rnn kernel to phi
      
      * move infershape of rnn to phi
      
      * fix HIP bug
      
      * rename function
      
      * fix HIP bug
      
      * fix hip bug
    • [MoE] Moe apis (#41092) · aac7879a
      Roc committed
      * add random routing op
      
      add _random_routing api in utils
      
      add random routing ut
      
      * # This is a combination of 10 commits.
      # The first commit's message is:
      add expert count op
      
      add ut for expert_count
      
      # This is the 2nd commit message:
      
      update UT only for cuda
      
      # This is the 3rd commit message:
      
      fix for rocm
      
      # This is the 4th commit message:
      
      update ut
      
      # This is the 5th commit message:
      
      add moe module
      
      # This is the 6th commit message:
      
      add expert count op
      
      add ut for expert_count
      
      # This is the 7th commit message:
      
      update UT only for cuda
      
      # This is the 8th commit message:
      
      update ut
      
      # This is the 9th commit message:
      
      add moe module
      
      # This is the 10th commit message:
      
      make expert count private
      
      * add assign pos op
      
      * fix upper num name
      
      * add api _assign pos
      
      * add ut for assign pos op
      
      * update date
      
      * add op about moe gate
      
      update utils
      
      add limit by capacity op
      
      add ut for limit_by_capacity
      
      add ut for prune_gate_by_capacity
      
      add ut for limit_by_capacity
      
      add ut for prune_gate_by_capacity
      
      * fix for win
      
      * fix bugs in test_limit_by_capacity_op
      
      * update ut
      
      * update for test (timeout)
      
      * fix ut
      
      * update
      
      * update(fix) ut for win
      
      * moe apis in incubate
      
      * # This is a combination of 10 commits.
      # The first commit's message is:
      add expert count op
      
      add ut for expert_count
      
      # This is the 2nd commit message:
      
      update UT only for cuda
      
      # This is the 3rd commit message:
      
      fix for rocm
      
      # This is the 4th commit message:
      
      update ut
      
      # This is the 5th commit message:
      
      add moe module
      
      # This is the 6th commit message:
      
      add expert count op
      
      add ut for expert_count
      
      # This is the 7th commit message:
      
      update UT only for cuda
      
      # This is the 8th commit message:
      
      update ut
      
      # This is the 9th commit message:
      
      add moe module
      
      # This is the 10th commit message:
      
      make expert count private
      
      * add assign pos op
      
      * fix upper num name
      
      * add api _assign pos
      
      * add ut for assign pos op
      
      * update date
      
      * fix for win
      
      * update for test (timeout)
      
      * fix ut
      
      * update
      
      * fix ut for number count
      
      * add apis and utils
      
      * add gate apis
      
      * add moe and grad clip apis
      
      * update moe apis
      
      * add ops for moe gate
      
      * fix
      
      * update for base moe layer api
      
      * add random routing op
      
      add _random_routing api in utils
      
      add random routing ut
      
      * fix for dygraph
      
      * update with random routing
      
      * update
      
      * fix ut for limit by capacity
      
      * update
      
      * update limit by capacity to make switching to single thread mode easy
      
      * update api docs
      Co-authored-by: Nhlygit66666 <2570058140@qq.com>
    • Revert "Revert "[Phi] Move elementwise_floordiv and elementwise_pow to phi... · eef46770
      Chen Weihang committed
      Revert "Revert "[Phi] Move elementwise_floordiv and elementwise_pow to phi (#40993)" (#41065)" (#41110)
      
      This reverts commit 3a6f1135.
    • Revert "Revert "[Phi] trans logsumexp op (#40790)" (#41068)" (#41109) · ee8eeb45
      Chen Weihang committed
      This reverts commit 054fc997.
    • Revert "Revert "Move some activation to phi (#40727)" (#41056)" (#41095) · 91bb52cd
      hong committed
      This reverts commit 05f3d48e.
    • move elementwise_mul selected rows input (#41042) · 13f1641d
      YuanRisheng committed
    • Optimize the perf of top_k when k is too large (#40941) · 45078d9f
      Zhang Zheng committed
      * Optimize the perf of top_k when k is too large
      
      * fix rocm compile
      
      * fix
      
      * only compile in cuda
      
      * fix log info
    • swish and pow op for xpu test=kunlun (#40654) · d951f3af
      houj04 committed
      * swish and pow op for xpu. test=kunlun
      
      * fix code style. test=kunlun.
      
      * use pow_grad xdnn api. test=kunlun.
  3. 29 Mar, 2022 10 commits
    • [MoE] Moe apis (#40895) · aeade538
      Roc committed
      * add random routing op
      
      add _random_routing api in utils
      
      add random routing ut
      
      * # This is a combination of 10 commits.
      # The first commit's message is:
      add expert count op
      
      add ut for expert_count
      
      # This is the 2nd commit message:
      
      update UT only for cuda
      
      # This is the 3rd commit message:
      
      fix for rocm
      
      # This is the 4th commit message:
      
      update ut
      
      # This is the 5th commit message:
      
      add moe module
      
      # This is the 6th commit message:
      
      add expert count op
      
      add ut for expert_count
      
      # This is the 7th commit message:
      
      update UT only for cuda
      
      # This is the 8th commit message:
      
      update ut
      
      # This is the 9th commit message:
      
      add moe module
      
      # This is the 10th commit message:
      
      make expert count private
      
      * add assign pos op
      
      * fix upper num name
      
      * add api _assign pos
      
      * add ut for assign pos op
      
      * update date
      
      * add op about moe gate
      
      update utils
      
      add limit by capacity op
      
      add ut for limit_by_capacity
      
      add ut for prune_gate_by_capacity
      
      add ut for limit_by_capacity
      
      add ut for prune_gate_by_capacity
      
      * fix for win
      
      * fix bugs in test_limit_by_capacity_op
      
      * update ut
      
      * update for test (timeout)
      
      * fix ut
      
      * update
      
      * update(fix) ut for win
      
      * moe apis in incubate
      
      * # This is a combination of 10 commits.
      # The first commit's message is:
      add expert count op
      
      add ut for expert_count
      
      # This is the 2nd commit message:
      
      update UT only for cuda
      
      # This is the 3rd commit message:
      
      fix for rocm
      
      # This is the 4th commit message:
      
      update ut
      
      # This is the 5th commit message:
      
      add moe module
      
      # This is the 6th commit message:
      
      add expert count op
      
      add ut for expert_count
      
      # This is the 7th commit message:
      
      update UT only for cuda
      
      # This is the 8th commit message:
      
      update ut
      
      # This is the 9th commit message:
      
      add moe module
      
      # This is the 10th commit message:
      
      make expert count private
      
      * add assign pos op
      
      * fix upper num name
      
      * add api _assign pos
      
      * add ut for assign pos op
      
      * update date
      
      * fix for win
      
      * update for test (timeout)
      
      * fix ut
      
      * update
      
      * fix ut for number count
      
      * add apis and utils
      
      * add gate apis
      
      * add moe and grad clip apis
      
      * update moe apis
      
      * add ops for moe gate
      
      * fix
      
      * update for base moe layer api
      
      * add random routing op
      
      add _random_routing api in utils
      
      add random routing ut
      
      * fix for dygraph
      
      * update with random routing
      
      * update
      
      * fix ut for limit by capacity
      
      * update
      Co-authored-by: Nhlygit66666 <2570058140@qq.com>
    • Revert "[Phi] trans logsumexp op (#40790)" (#41068) · 054fc997
      tianshuo78520a committed
      This reverts commit 9c0eaada.
    • Revert "[Phi] Move elementwise_floordiv and elementwise_pow to phi (#40993)" (#41065) · 3a6f1135
    • Revert "Move some activation to phi (#40727)" (#41056) · 05f3d48e
      tianshuo78520a committed
      This reverts commit e77a947e.
    • [Phi] trans logsumexp op (#40790) · 9c0eaada
      committed
      * [Phi] trans logsumexp op
      
      * fix bugs
      
      * fix bugs
      
      * fix bugs
      
      * fix bugs
      
      * fix bugs
      
      * fix bugs
      
      * add sig
      
      * fix sig bugs
      
      * fix sig bugs
      
      * fix xpu bugs
      
      * fix review bugs
      
      * test=develop
    • [Phi] Move elementwise_floordiv and elementwise_pow to phi (#40993) · b532315d
      wuyefeilin committed
      * mv floordiv to phi
      
      * mv elementwise_pow to phi
      
      * fix as review
    • Z
    • [MLU]add reduce op mlu kernel (#41028) · d1c1d731
      zn committed
    • Z
    • fix assign typo (#41005) · 5728dffb
      Chen Weihang committed
  4. 28 Mar, 2022 8 commits
    • Move meshgrid to phi (#40994) · ca871957
      hong committed
      * move momentum, rmsprop to phi; test=develop
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update; test=develop
      
      * fix xpu npu bugs; test=develop
      
      * fix npu bug; test=develop
      
      * fix windows compile error; test=develop
      
      * fix windows compile error; test=develop
      
      * polish code; test=develop
      
      * fix conflict; test=develop
      
      * add meshgrid;
      
      * update
      
      * polish code
      
      * polish code;
      
      * fix bug
      
      * format; remove useless code
      
      * fix npu bug
      
      * fix bug
    • Move some activation to phi (#40727) · e77a947e
      hong committed
      * update
      
      * add forward case
      
      * update
      
      * update; test=develop
      
      * add some grad kernel; test=develop
      
      * move gpu kernel; test=develop
      
      * update
      
      * update;
      
      * update test;
      
      * fix selected rows bug;
      
      * add mixed vector include;
      
      * add mixed vector dependency; test=develop
      
      * add logit grad signature;
      
      * polish code
      
      * fix bug;
      
      * add namespace for abs
      
      * revert code
      
      * not move softsign
      
      * remove duplicate register;
      
      * fix softsign bug
      
      * polish code
      
      * format
      
      * format
      
      * fix bug
      
      * remove cmake dep
      
      * add square sqrt selected rows support
      
      * update
      
      * remove clip norm
      
      * add standalone executor sqrt dep
      
      * standalone exec dep sqrt
      
      * remove sqrt op in cmakelist
      
      * open some case
    • Add window computation in stft op. (#40987) · c049a6b4
      KP committed
    • Y
      b6661d3a
    • add fused_seqpool_cvm op (#37928) · ea5b2f26
      danleifeng committed
      * add fused_seqpool_cvm op;test=develop
    • [Phi] Move warpctc OP to phi (#40023) · cb183762
      0x45f committed
      * moving OP
      
      * move forward
      
      * move grad and infershape
      
      * code format
      
      * format code
      
      * fix code
      
      * fix code
      
      * fix CMakeLists.txt
      
      * fix comments
      
      * Refine CMakeLists for rocm ci
    • [Phi] Move assign value op kernel into phi (#40967) · 324b6b71
      Chen Weihang committed
      * move assign value op kernel
      
      * remove original op
      
      * fix failed unittests
      
      * remove infershape
    • [Phi] Move backward infershape of Reshape Op (#40914) · 27996fd1
      YuanRisheng committed
      * improve reshape kernel
      
      * fix bugs of sig
      
      * add unittest for reshape_sig
      
      * fix bugs when running coverage
  5. 27 Mar, 2022 4 commits
  6. 26 Mar, 2022 1 commit
  7. 25 Mar, 2022 1 commit