1. 05 Apr, 2022 2 commits
  2. 04 Apr, 2022 2 commits
  3. 03 Apr, 2022 3 commits
    • [Phi]Concat grad (#41112) · 3f57ef7a
      Authored by chentianyu03
      * add concat_grad kernel
      
      * fix error
      
      * remove comment code
      
      * fix outs nullptr error
      
      * change to phi header
      
      * add concat_grad declare for standalone_executor_test
    • Add infer meta (#41054) · 868a3203
      Authored by hong
      * add some infer meta
      
      * fix bug
      
      * fix bugs;
      
      * fix bug and add set data type
      
      * revert infer shape of lookup table
      
      * recover test
    • Add randperm and range yaml (#41265) · fd1ecfc5
      Authored by zyfncg
      * add randperm and range yaml
      
      * add eager test for randperm
  4. 02 Apr, 2022 5 commits
    • Add graph apis (#40809) · b0398c8e
      Authored by Siming Dai
      * Add graph_reindex API
      
      * add graph_sample_neighbors api
      
      * Add buffer
      
      * delete VLOG
      
      * delete thrust::copy for output
      
      * add ShareDataWith
      
      * delete graph_reindex hashtable output
      
      * add graph_reindex dispensable
      
      * add reindex unittest, move memset to cuda kernel, change api
      
      * fix conflict
      
      * add reindex buffer for gpu version note
      
      * fix conflicts for op_func_generator
      
      * Add fisher_yates sampling, add dispensable, change infermeta
      
      * add dtype for edge_id
      
      * fix rocm ci and static check ci
      
      * add unittest
      
      * fix unittest
      
      * fix unittest
      
      * fix bug
    • do not use scope in op kernel (#41316) · 0f6412c0
      Authored by Leo Chen
    • [Paddle inference] support new quant_model (#41049) · 1b58ce14
      Authored by Wangzheee
      * paddle inference support new quant_model
    • [phi] Move clip op to phi (#40602) · c0658045
      Authored by wuyefeilin
      * move clip op to phi
      
      * fix as review
      
      * update hierarchical_sigmoid_kernel.cc
      
      * update selected_rows
      
      * update clip_kernel.cu
      
      * fix as review
    • xpu add dropout&cast unitest (#41120) · acec26a1
      Authored by taixiurong
  5. 01 Apr, 2022 6 commits
    • Add nll_loss yaml (#41126) · 8e032db8
      Authored by zyfncg
      * add nll_loss yaml
      
      * fix nll loss
      
      * fix nll loss bug
      
      * fix bug
      
      * fix bug
      
      * fix infrt problem
      Co-authored-by: xiongkun <xiongkun03@baidu.com>
    • [Eager] Support pinned (#41035) · f3270fc8
      Authored by wanghuancoder
      * support pinned, test=develop
      
      * support async_write, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine,test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
      
      * refine, test=develop
    • [Phi] Move softmax with cross entropy kernel into phi (#40832) · e6ec98fe
      Authored by Chen Weihang
      * add cross_entropy_with_softmax phi kernel
      
      * remove softmax_with_cross_entropy kernel
      
      * add softmax_with_cross_entropy grad kernel
      
      * remove original op kernel
      
      * refine cross entropy impl
      
      * fix pointer error
      
      * revert kernel cu change
      
      * fix xpu failed
      
      * fix cinn failed
      
      * fix npu failed
      
      * add forward sig
      
      * add check_nan_inf for pt kernel
      
      * remove repeat cmake item
      
      * fix unittest error
    • [Phi]Interploatd kernels into phi (#40855) · d65a7a46
      Authored by chentianyu03
      * add interploate cpu kernel
      
      * fix nullptr bug
      
      * add interpolate gpu kernel
      
      * fix unit test error
      
      * remove raw kernels
      
      * add cuda kernel impl
      
      * add infermeta
      
      * recover accidentally deleted kernels in interpolate op
      
      * fix grad x_grad name error
      
      * remove interpolate_v2_op.h
      
      * rm unused codes
      
      * fix xpu build error
      
      * fix build error
      
      * fix namespace error
      
      * add register header for nup
      
      * fix infermeta error
      
      * modify by review
      
      * add the missing args in test_trt_convert_nearest_interp_v2
    • support multi_layer of bilstm,*test=kunlun (#41151) · 00d23897
      Authored by z8hanghuan
      * support multi_layer of bilstm,*test=kunlun
      
      * support multi_layer of bilstm, *test=kunlun
      
      * support multi_layer of bilstm, *test=kunlun
      
      * support multi_layer of bilstm, *test=kunlun
    • [Phi] Add shape and strided_slice yaml & Adapt eager mode (#41131) · 9b6a02d4
      Authored by Chen Weihang
      * add several yaml
      
      * polish strided slice kernel & add yaml
      
      * reorder yaml
      
      * add several yaml
      
      * revert yaml config change
      
      * resolve conflict
      
      * Update test_strided_slice_op.py
  6. 31 Mar, 2022 9 commits
  7. 30 Mar, 2022 8 commits
    • [Phi] Move Rnn Op from fluid to phi (#41007) · 66cf8b08
      Authored by zyfncg
      * move rnn kernel to phi
      
      * move infershape of rnn to phi
      
      * fix HIP bug
      
      * rename function
      
      * fix HIP bug
      
      * fix hip bug
    • [MoE] Moe apis (#41092) · aac7879a
      Authored by Roc
      * add random routing op
      
      add _random_routing api in utils
      
      add random routing ut
      
      * # This is a combination of 10 commits.
      # The first commit's message is:
      add expert count op
      
      add ut for expert_count
      
      # This is the 2nd commit message:
      
      update UT only for cuda
      
      # This is the 3rd commit message:
      
      fix for rocm
      
      # This is the 4th commit message:
      
      update ut
      
      # This is the 5th commit message:
      
      add moe module
      
      # This is the 6th commit message:
      
      add expert count op
      
      add ut for expert_count
      
      # This is the 7th commit message:
      
      update UT only for cuda
      
      # This is the 8th commit message:
      
      update ut
      
      # This is the 9th commit message:
      
      add moe module
      
      # This is the 10th commit message:
      
      make expert count private
      
      * add assign pos op
      
      * fix upper num name
      
      * add api _assign pos
      
      * add ut for assign pos op
      
      * update date
      
      * add op about moe gate
      
      update utils
      
      add limit by capacity op
      
      add ut for limit_by_capacity
      
      add ut for prune_gate_by_capacity
      
      add ut for limit_by_capacity
      
      add ut for prune_gate_by_capacity
      
      * fix for win
      
      * fix bugs in test_limit_by_capacity_op
      
      * update ut
      
      * update for test (timeout)
      
      * fix ut
      
      * update
      
      * update(fix) ut for win
      
      * moe apis in incubate
      
      * # This is a combination of 10 commits.
      # The first commit's message is:
      add expert count op
      
      add ut for expert_count
      
      # This is the 2nd commit message:
      
      update UT only for cuda
      
      # This is the 3rd commit message:
      
      fix for rocm
      
      # This is the 4th commit message:
      
      update ut
      
      # This is the 5th commit message:
      
      add moe module
      
      # This is the 6th commit message:
      
      add expert count op
      
      add ut for expert_count
      
      # This is the 7th commit message:
      
      update UT only for cuda
      
      # This is the 8th commit message:
      
      update ut
      
      # This is the 9th commit message:
      
      add moe module
      
      # This is the 10th commit message:
      
      make expert count private
      
      * add assign pos op
      
      * fix upper num name
      
      * add api _assign pos
      
      * add ut for assign pos op
      
      * update date
      
      * fix for win
      
      * update for test (timeout)
      
      * fix ut
      
      * update
      
      * fix ut for number count
      
      * add apis and utils
      
      * add gate apis
      
      * add moe and grad clip apis
      
      * update moe apis
      
      * add ops for moe gate
      
      * fix
      
      * update for base moe layer api
      
      * add random routing op
      
      add _random_routing api in utils
      
      add random routing ut
      
      * fix for dygraph
      
      * update with ranodm routing
      
      * update
      
      * fix ut for limit by capacity
      
      * update
      
      * update limit by capacity for easily to switch to single thread mode
      
      * update api docs
      Co-authored-by: hlygit66666 <2570058140@qq.com>
    • Revert "Revert "[Phi] Move elementwise_floordiv and elementwise_pow to phi... · eef46770
      Authored by Chen Weihang
      Revert "Revert "[Phi] Move elementwise_floordiv and elementwise_pow to phi (#40993)" (#41065)" (#41110)
      
      This reverts commit 3a6f1135.
    • Revert "Revert "[Phi] trans logsumexp op (#40790)" (#41068)" (#41109) · ee8eeb45
      Authored by Chen Weihang
      This reverts commit 054fc997.
    • Revert "Revert "Move some activation to phi (#40727)" (#41056)" (#41095) · 91bb52cd
      Authored by hong
      This reverts commit 05f3d48e.
    • move elementwise_mul selected rows input (#41042) · 13f1641d
      Authored by YuanRisheng
    • Optimize the perf of top_k when k is too large (#40941) · 45078d9f
      Authored by Zhang Zheng
      * Optimize the perf of top_k when k is too large
      
      * fix rcom compile
      
      * fix
      
      * only compile in cuda
      
      * fix log info
    • swish and pow op for xpu test=kunlun (#40654) · d951f3af
      Authored by houj04
      * swish and pow op for xpu. test=kunlun
      
      * fix code style. test=kunlun.
      
      * use pow_grad xdnn api. test=kunlun.
  8. 29 Mar, 2022 5 commits
    • [MoE] Moe apis (#40895) · aeade538
      Authored by Roc
      * add random routing op
      
      add _random_routing api in utils
      
      add random routing ut
      
      * # This is a combination of 10 commits.
      # The first commit's message is:
      add expert count op
      
      add ut for expert_count
      
      # This is the 2nd commit message:
      
      update UT only for cuda
      
      # This is the 3rd commit message:
      
      fix for rocm
      
      # This is the 4th commit message:
      
      update ut
      
      # This is the 5th commit message:
      
      add moe module
      
      # This is the 6th commit message:
      
      add expert count op
      
      add ut for expert_count
      
      # This is the 7th commit message:
      
      update UT only for cuda
      
      # This is the 8th commit message:
      
      update ut
      
      # This is the 9th commit message:
      
      add moe module
      
      # This is the 10th commit message:
      
      make expert count private
      
      * add assign pos op
      
      * fix upper num name
      
      * add api _assign pos
      
      * add ut for assign pos op
      
      * update date
      
      * add op about moe gate
      
      update utils
      
      add limit by capacity op
      
      add ut for limit_by_capacity
      
      add ut for prune_gate_by_capacity
      
      add ut for limit_by_capacity
      
      add ut for prune_gate_by_capacity
      
      * fix for win
      
      * fix bugs in test_limit_by_capacity_op
      
      * update ut
      
      * update for test (timeout)
      
      * fix ut
      
      * update
      
      * update(fix) ut for win
      
      * moe apis in incubate
      
      * # This is a combination of 10 commits.
      # The first commit's message is:
      add expert count op
      
      add ut for expert_count
      
      # This is the 2nd commit message:
      
      update UT only for cuda
      
      # This is the 3rd commit message:
      
      fix for rocm
      
      # This is the 4th commit message:
      
      update ut
      
      # This is the 5th commit message:
      
      add moe module
      
      # This is the 6th commit message:
      
      add expert count op
      
      add ut for expert_count
      
      # This is the 7th commit message:
      
      update UT only for cuda
      
      # This is the 8th commit message:
      
      update ut
      
      # This is the 9th commit message:
      
      add moe module
      
      # This is the 10th commit message:
      
      make expert count private
      
      * add assign pos op
      
      * fix upper num name
      
      * add api _assign pos
      
      * add ut for assign pos op
      
      * update date
      
      * fix for win
      
      * update for test (timeout)
      
      * fix ut
      
      * update
      
      * fix ut for number count
      
      * add apis and utils
      
      * add gate apis
      
      * add moe and grad clip apis
      
      * update moe apis
      
      * add ops for moe gate
      
      * fix
      
      * update for base moe layer api
      
      * add random routing op
      
      add _random_routing api in utils
      
      add random routing ut
      
      * fix for dygraph
      
      * update with ranodm routing
      
      * update
      
      * fix ut for limit by capacity
      
      * update
      Co-authored-by: hlygit66666 <2570058140@qq.com>
    • Revert "[Phi] trans logsumexp op (#40790)" (#41068) · 054fc997
      Authored by tianshuo78520a
      This reverts commit 9c0eaada.
    • Revert "[Phi] Move elementwise_floordiv and elementwise_pow to phi (#40993)" (#41065) · 3a6f1135
    • Revert "Move some activation to phi (#40727)" (#41056) · 05f3d48e
      Authored by tianshuo78520a
      This reverts commit e77a947e.
    • [Phi] trans logsumexp op (#40790) · 9c0eaada
      * [Phi] trans logsumexp op
      
      * fix bugs
      
      * fix bugs
      
      * fix bugs
      
      * fix bugs
      
      * fix bugs
      
      * fix bugs
      
      * add sig
      
      * fix sig bugs
      
      * fix sig bugs
      
      * fix xpu bugs
      
      * fix review bugs
      
      * test=develop