1. 29 7月, 2022 6 次提交
    • Z
      phi_multiclass_nms3 (#44613) · a9919903
      zhiboniu 提交于
      a9919903
    • A
      add FLAGS_enable_api_kernel_fallback (#44706) · e439d735
      Aganlengzi 提交于
      * add FLAGS_enable_api_kernel_fallback
      
      * deal with more cases
      
      * add ut for coverage
      e439d735
    • W
      Phi softplus migration (#44542) · 05515662
      Wang Bojun 提交于
      * add yaml and utests of phi softplus
      
      add yaml of softplus
      
      fix softplus bug in phi
      
      * update utests
      
      * bug fix
      
      * bug fix for test_layers
      
      * layer api match
      
      * match def and doc in ops.py
      
      * doc polish
      
      * fix unwanted modified of thresholded_relu
      
      * style imporve
      05515662
    • L
      move CUDAStream to phi (#44529) · da3743fd
      Leo Chen 提交于
      * init
      
      * move CUDAStream to phi
      
      * fix compilation
      
      * merge develop
      
      * add stream_owned_ member
      
      * split cuda_stream.h
      
      * fix cpu compile
      
      * fix constructor
      
      * fix bug
      
      * fix windows compile
      
      * fix inference test_levit
      
      * fix windows tests
      da3743fd
    • L
      [PHI] Move lu to phi (#44605) · 3d88816e
      Lin Manhui 提交于
      * Add kernel declarations
      
      * Copy kernel implementation code
      
      * Transfer implementation code
      
      * Register new kernels
      
      * Remove old kernels
      
      * Fix code style
      
      * Fix bugs
      
      * mutable_data->HostAlloc
      
      * Transfer infermeta
      
      * Add yaml and update python api
      
      * Add PADDLE_WITH_HIP check
      
      * Update unittests
      
      * Fix bugs
      
      * Fix bugs
      
      * Optimize directory structure
      
      * Add output checks
      
      * lu_impl.h->lu_kernel_impl.h
      Co-authored-by: NBobholamovic <linmanhui@baidu.com>
      3d88816e
    • Y
      [Phi] Add yaml for assign_value (#44596) · 88584396
      Yulong Ao 提交于
      * [Phi] Add yaml for assign_value
      
      * [Phi] Fix the bug of the assign api and modify the unittest
      
      * [Phi] Fix the bug when the tensor does not have the backend info
      
      * [Phi] Replace the functional-style cast init by the brace-init
      
      * [Phi] Cast the data explicitly
      88584396
  2. 28 7月, 2022 11 次提交
  3. 27 7月, 2022 10 次提交
  4. 26 7月, 2022 12 次提交
  5. 25 7月, 2022 1 次提交
    • M
      Fc fp16 (#44578) · a54c6953
      ming1753 提交于
      * (modified) fc support fp16
      
      * __CUDA_ARCH__ version
      
      * delete half
      
      * delete half
      
      * add half support
      
      * add half support
      
      * add half support
      a54c6953