1. 15 8月, 2022 1 次提交
  2. 12 8月, 2022 4 次提交
    • L
      fix nccl comm in sync_bn (#45100) · 1e965756
      LiYuRio 提交于
      1e965756
    • D
      enhance grid_sampler to support 3d input (#45015) · 1773fbba
      duanyanhui 提交于
      * enhance grid_sampler to support 3d input
      1773fbba
    • Z
      fix extra output of kernels for inference (#45048) · 1cb883da
      zyfncg 提交于
      1cb883da
    • S
      [geometric]Add paddle.geometric.send_ue_recv API (#43174) · 615b15a3
      Siming Dai 提交于
      * add init file
      
      * add op definition and infermeta
      
      * add kernel definition funcs
      
      * add broadcast infer shape
      
      * add gpu forward kernel
      
      * delete SUB and DIV
      
      * add x_grad
      
      * add template
      
      * add e_grad for min and max
      
      * fix small bug
      
      * temp commit
      
      * temp commit
      
      * add e_grad for sum and mean
      
      * fix some compile bug
      
      * fix compile bugs
      
      * fix compile problem
      
      * add sum forward unittest
      
      * fix broadcast error, add kernel sig, register e_grad, change unit test
      
      * fix grad
      
      * add temp grad fix
      
      * temp commit
      
      * add min max unittest
      
      * add max, min unittest, fix mul bug
      
      * add cpu forward sum and mean
      
      * add forward min max, fix mean unittest
      
      * add cpu backward min max
      
      * fix code-style
      
      * add backward sum mean
      
      * fix rocm ci
      
      * set uniitest timeout
      
      * fix bug of x broadcast to e, gpu grad
      
      * fix bug of x broadcast to e, cpu grad
      
      * rename BOOST_GET_CONST macro
      
      * fix rocm ci
      
      * mv graph_send_e_recv to graph_send_ue_recv
      
      * move out_size to IntArray
      
      * add eager op test
      
      * fix max pool type bug, add unittest for api
      
      * revise api doc
      
      * add fp16 for atomic min and max, add unittest
      
      * add unittest
      
      * add fp16 support for graph_send_recv
      
      * fix unittest fp16 bug
      
      * change OutSizeTensor to Out_size
      
      * move E to Y
      
      * add copyright, fix comment
      
      * review code
      
      * fix thread block size
      
      * fix thread block size
      
      * change api attribute name: pool_type to reduce_op, compute_type to message_op
      
      * change api attribute name, move pool_type to reduce_op, move compute_type to message_op
      615b15a3
  3. 11 8月, 2022 1 次提交
  4. 10 8月, 2022 1 次提交
  5. 09 8月, 2022 5 次提交
  6. 08 8月, 2022 5 次提交
  7. 07 8月, 2022 1 次提交
  8. 05 8月, 2022 2 次提交
    • D
      migrate kernel (#44841) · 62a98130
      duanboqiang 提交于
      * migrate kernel
      
      * fix sig order
      
      * remove header files
      
      * remove header
      
      * remove header
      
      * modify logits grad
      62a98130
    • F
      move fft kernels to phi (#44714) · 153f1138
      Feiyu Chan 提交于
      * move fft kernels to phi, done with cufft, pocketfft, mkl_cdft, hipfft
      * make stft_op use fft from phi/kernels/funcs, clean code
      153f1138
  9. 04 8月, 2022 3 次提交
    • Z
      phi_fill_diagonal_tensor (#44649) · 2140e825
      zhiboniu 提交于
      * phi_fill_diagonal_tensor
      
      * delete extra lines
      
      * update
      
      * add legacy api test
      
      * rename sig
      2140e825
    • Z
      Phi generate_proposals_v2 (#44436) · 566c80ff
      zhiboniu 提交于
      * phi_generate_proposals_v2
      
      * remove old kernels
      
      * optest add eager_check
      
      * del lod
      
      * update
      
      * update
      
      * update test_detection with_lod
      
      * update nms_util
      
      * remove old nms_util.h
      566c80ff
    • X
      mv fold & unpool to phi (#44836) · e9994f2e
      xiaoting 提交于
      * fix conflicts
      
      * mv unused file
      
      * revert backward.h
      
      * revert lu_unpack kernel
      
      * rm .cu file
      
      * Update lu_unpack_kernel.cc
      
      * format phi yaml
      e9994f2e
  10. 03 8月, 2022 9 次提交
    • Z
      Phi fill diagonal (#44453) · 9f1616a0
      zhiboniu 提交于
      * phi_fill_diagonal
      
      * remove old kernels
      
      * update
      
      * update attr args
      
      * refix
      
      * update
      9f1616a0
    • T
      [operator migration] Migrate affine grid op (#44663) · d94b9686
      Thomas Young 提交于
      * save change
      
      * save change by YSL
      
      * save change by YSL
      
      * change by YSL
      
      * test pre commit
      
      * Revert "test pre commit"
      
      This reverts commit eee5e116331186cc544de871b4a5174a6431f17c.
      
      * fix code style
      
      * fix ctest
      
      * temp save
      
      * save change
      
      * change by YSL
      
      * final change by ysl
      
      * fix ci
      
      * fix code style
      
      * delete unuse code
      
      * change by ysl
      d94b9686
    • S
      move repeat interleave (#44753) · 3b32835f
      seemingwang 提交于
      * move repeat interleave
      
      * fix api name
      
      * recover op registration
      
      * fix arguments order
      
      * fix
      
      * fix infermeta
      
      * fix infermeta
      
      * fix header
      
      * fix infermeta
      
      * fix
      
      * fix
      
      * fix dtype
      
      * log&test
      
      * test
      
      * remove logs
      
      * fix
      
      * remove logs
      
      * combine files
      
      * combine
      
      * combine files
      
      * fix cuda place
      3b32835f
    • W
      [BUG FIX]fix auc calculation error under multi instag scene (#41289) · ef945c81
      wangzhen38 提交于
      * fix_auc in instag task
      
      * fix_auc in instag task
      
      * fix_auc in instag task
      
      * fix opmaker error
      
      * optmize default value of ins_tag
      
      * fix multiary
      
      * fix multiary
      
      * reback ctr_metric_bundle
      
      * reback ctr_metric_bundle
      
      * for ci auc_test
      
      * code style
      
      * code style fix
      
      * suit paddle
      
      * suit paddle
      
      * update code example
      
      * complete Auc unittest
      
      * set asdispensable
      
      * set optional
      
      * compatable and code style
      
      * fix cu
      
      * fix cu
      
      * for ci win
      
      * update unittests of auc
      
      * update code doc by reviews
      
      * change code doc by reviews
      ef945c81
    • W
      [Phi] mv decode_jpeg (#44645) · a2265028
      wuyefeilin 提交于
      * mv kernel
      
      * mv infershape
      
      * mv yaml
      
      * update some
      
      * maintain decoe_jpeg in old dygraph
      
      * fix as review
      
      * rm decode_jpeg_op.cu
      
      * update for rocm
      a2265028
    • W
      [PHI] Move uniform random inplace op to PHI. (#44700) · bc106fad
      wuyefeilin 提交于
      bc106fad
    • W
      [PHI] Move eigvalsh op to phi (#44559) · cdbfeff4
      wuyefeilin 提交于
      * mv eigvalsh op
      cdbfeff4
    • Z
      Phi edit distance (#44447) · 5ad3228c
      zhiboniu 提交于
      * phi_edit_distance
      
      * fix
      5ad3228c
    • Z
      opt bn1d backward (#44783) · 36f08826
      zhangkaihuo 提交于
      36f08826
  11. 02 8月, 2022 3 次提交
    • S
      move renorm op (#44676) · 669353c1
      seemingwang 提交于
      * move renorm op
      
      * change python api
      
      * change op class func
      
      * alloc data
      
      * remove comments
      
      * fix grad arguments
      
      * fix python argument
      
      * fix python argument
      
      * change unit-test
      
      * remove shape func registration
      
      * recover extra-arguments
      
      * recover shape functor
      669353c1
    • C
      【PFCC算子性能优化】 SeluKernel Optimization (#44490) · 859c4077
      carryyu 提交于
      * [PFCC] SeluKernel Optimization
      
      * selu kernel optimization
      
      * add private
      
      Co-authored-by: carryyu <>
      859c4077
    • Y
      [Phi] Move QR to Phi (#44742) · 2cf2e786
      Yulong Ao 提交于
      * [Phi] Move Qr to the Phi
      
      * [Phi] Regiter the cpu grad kernel for qr
      
      * [Phi] Share the cuda kernels to lstsq
      
      * [Phi] Remove some improper inlcude files
      
      * [Phi] Modify codes based on the reviews
      
      * [Phi] Remove unecessary files and add the cuda_only comment
      
      * [Phi] Remove the unecessary include file
      
      * [Phi] Remove qr_op.cu and lstsq_op.cu
      2cf2e786
  12. 01 8月, 2022 5 次提交
    • L
      unify gpu context (#44740) · 86763023
      Leo Chen 提交于
      * remove cudaDeviceContext
      
      * remove more template
      
      * fix rocm compile
      
      * remove alias name CUDADeviceContext
      
      * fix compile
      
      * fix tests
      
      * revert changes
      86763023
    • T
      [operator migration] Migrate unstack_op and nms_op (#44424) · 9d2e0ecb
      Thomas Young 提交于
      * update unstack_op
      
      * update unstack_op
      
      * update unstack_op
      
      * fix unstack test
      
      * update unstack
      
      * update with remote
      
      * fix unstack_test.py
      
      * temp_save_change_nms_op
      
      * add nms test
      
      * update nms fix
      
      * update unstack_op
      
      * temp save change
      
      * finish fix nms_op
      
      * pass nms test
      
      * fix CI
      
      * fix ops test
      
      * save change
      
      * fix code style
      
      * fix code style
      
      * fix ci and codestyle
      
      * fix ci
      Co-authored-by: NShiningZhang <zhang_liang1991@126.com>
      9d2e0ecb
    • L
      migrate overlap_add and overlap_add_grad op (#44739) · 2a8219c1
      levi131 提交于
      * update code format
      
      * add ymal and test
      
      * update for comments
      2a8219c1
    • X
      8482f1ae
    • L
      [PHI] Move lu_unpack to phi (#44674) · c905a9e9
      Lin Manhui 提交于
      * Add kernel declarations
      
      * Copy kernel implementation code
      
      * Transfer implementation code
      
      * Register new kernels
      
      * Remove old kernels
      
      * Fix code style
      
      * Fix bugs
      
      * mutable_data->HostAlloc
      
      * Transfer infermeta
      
      * Add yaml and update python api
      
      * Add PADDLE_WITH_HIP check
      
      * Update unittests
      
      * Add kernel declarations
      
      * Copy kernel implementation code
      
      * Transfer kernel implementation code
      
      * Register new kernels
      
      * Remove old kernels
      
      * Add lu_unpack_sig
      
      * Fix bugs
      
      * Fix bugs
      
      * Fix bugs
      
      * Optimize directory structure
      
      * Add output checks
      
      * Update include files
      
      * lu_impl.h->lu_kernel_impl.h
      
      * Transfer infermeta
      
      * Add yaml and update python api
      
      * Add check_eager
      Co-authored-by: NBobholamovic <linmanhui@baidu.com>
      c905a9e9