1. 03 Aug, 2022 13 commits
  2. 02 Aug, 2022 6 commits
    • move renorm op (#44676) · 669353c1
      seemingwang committed
      * move renorm op
      
      * change python api
      
      * change op class func
      
      * alloc data
      
      * remove comments
      
      * fix grad arguments
      
      * fix python argument
      
      * fix python argument
      
      * change unit-test
      
      * remove shape func registration
      
      * recover extra-arguments
      
      * recover shape functor
    • [PFCC Operator Performance Optimization] SeluKernel Optimization (#44490) · 859c4077
      carryyu committed
      * [PFCC] SeluKernel Optimization
      
      * selu kernel optimization
      
      * add private
      
      Co-authored-by: carryyu <>
    • [Eager] use eager final state instead of intermediate state (#44722) · f1873b90
      Weilong Wu committed
      * [Eager] call final_state_slice under eager mode
      
      * rm useless comments
      
      * use eager final state instead of intermediate state
      
      * update fill_constant yaml
      
      * update fill_constant yaml
      
      * modify wrapped_infermeta_gen logic to fix special case
      
      * fix slice in manipulation
      
      * use fill_constant_
      
      * modify slice infermeta
      
      * rm final_state_conv2d
      
      * use final_state_slice
      
      * use final_state_slice only
      
      * polish slice, use final state
      
      * add paddle_throw for SplitInferMeta
      
      * rm fill_constant_ temporarily
      
      * recover array_equal, not allclose
      
      * recover original code
    • [Phi] Move QR to Phi (#44742) · 2cf2e786
      Yulong Ao committed
      * [Phi] Move Qr to the Phi
      
      * [Phi] Register the cpu grad kernel for qr
      
      * [Phi] Share the cuda kernels to lstsq
      
      * [Phi] Remove some improper include files
      
      * [Phi] Modify codes based on the reviews
      
      * [Phi] Remove unnecessary files and add the cuda_only comment
      
      * [Phi] Remove the unnecessary include file
      
      * [Phi] Remove qr_op.cu and lstsq_op.cu
    • [Phi] polish and rename, pt* -> phi* (#44697) · 942ff89f
      Weilong Wu committed
      * polish and rename, pt* -> phi*
      
      * fix code format
    • [phi] add yolov3_loss yaml and unittest (#44476) · c7cf12fc
      ccrrong committed
      * add yaml and unittest
      
      * update yaml
      
      * update backward yaml and unittest
      
      * update yaml
      
      * add Yolov3LossGradInferMeta
      
      * update yolov3_loss_op.cc
      
      * fix bug
      
      * code format
  3. 01 Aug, 2022 9 commits
  4. 30 Jul, 2022 1 commit
  5. 29 Jul, 2022 9 commits
    • unify fluid::CUDADeviceContext and phi::GpuContext (#44723) · 88490567
      Leo Chen committed
      * remove cudaDeviceContext
      
      * remove more template
      
      * fix rocm compile
    • [API/OP] Migrate Lstsq op into phi (#44318) · ab2aaf8b
      Haohongxiang committed
      * migrate lstsq op
      
      * update
      
      * fix bugs for CIs
      
      * update
      
      * fix bugs
      
      * add uts
      
      * update
      
      * update
      
      * update
      
      * fix bugs of jip
      
      * fix bugs of hip
      
      * update
      
      * update according to review
      
      * update
      
      * update
      
      * update
      
      * update
    • add some fp16 op for kunlun resnet50 model (#44672) · fecbc958
      QingshuChen committed
      * add some fp16 op for kunlun resnet50 model
      *test=kunlun
      
      * tmp
      *test=kunlun
    • phi_multiclass_nms3 (#44613) · a9919903
      zhiboniu committed
    • add FLAGS_enable_api_kernel_fallback (#44706) · e439d735
      Aganlengzi committed
      * add FLAGS_enable_api_kernel_fallback
      
      * deal with more cases
      
      * add ut for coverage
    • Phi softplus migration (#44542) · 05515662
      Wang Bojun committed
      * add yaml and utests of phi softplus
      
      add yaml of softplus
      
      fix softplus bug in phi
      
      * update utests
      
      * bug fix
      
      * bug fix for test_layers
      
      * layer api match
      
      * match def and doc in ops.py
      
      * doc polish
      
      * fix unwanted modification of thresholded_relu
      
      * style improve
    • move CUDAStream to phi (#44529) · da3743fd
      Leo Chen committed
      * init
      
      * move CUDAStream to phi
      
      * fix compilation
      
      * merge develop
      
      * add stream_owned_ member
      
      * split cuda_stream.h
      
      * fix cpu compile
      
      * fix constructor
      
      * fix bug
      
      * fix windows compile
      
      * fix inference test_levit
      
      * fix windows tests
    • [PHI] Move lu to phi (#44605) · 3d88816e
      Lin Manhui committed
      * Add kernel declarations
      
      * Copy kernel implementation code
      
      * Transfer implementation code
      
      * Register new kernels
      
      * Remove old kernels
      
      * Fix code style
      
      * Fix bugs
      
      * mutable_data->HostAlloc
      
      * Transfer infermeta
      
      * Add yaml and update python api
      
      * Add PADDLE_WITH_HIP check
      
      * Update unittests
      
      * Fix bugs
      
      * Fix bugs
      
      * Optimize directory structure
      
      * Add output checks
      
      * lu_impl.h->lu_kernel_impl.h
      Co-authored-by: NBobholamovic <linmanhui@baidu.com>
    • [Phi] Add yaml for assign_value (#44596) · 88584396
      Yulong Ao committed
      * [Phi] Add yaml for assign_value
      
      * [Phi] Fix the bug of the assign api and modify the unittest
      
      * [Phi] Fix the bug when the tensor does not have the backend info
      
      * [Phi] Replace the functional-style cast init by the brace-init
      
      * [Phi] Cast the data explicitly
  6. 28 Jul, 2022 2 commits