1. 28 1月, 2022 1 次提交
  2. 25 1月, 2022 1 次提交
    • L
      Optimize nearest_interp forward (#38528) · 232bbce2
      Lijunhui 提交于
      * init commit
      
      * remove comments
      
      * remove nchw branch
      
      * optimize code
      
      * apply fast div mod in 1D kernel, rm 3D kernel
      
      * move init of FastDivMode to CPU
      
      * 3D kernel for nchw, FastDiv for 1D kernel
      
      * debug done. process boundary
      
      * 2^n
      
      * optimize
      
      * optimize
      
      * change code & optimize code
      232bbce2
  3. 20 1月, 2022 1 次提交
  4. 18 1月, 2022 1 次提交
  5. 17 1月, 2022 1 次提交
    • W
      [Pten] Replace platform::Place to pten::Place. (#38899) · c48a9ad5
      Wilber 提交于
      * add pten::Place data structure.
      
      * update ci problem
      
      * fix ci problem
      
      * update
      
      * using platform::Place=pten::Place
      
      * remove BOOST_GET_CONST for CPUPlace and GPUPlace
      
      * compile pass 25%.
      
      * compile pass 45%
      
      * compile pass 60%
      
      * remove boost_get for xpu npu mlu and ipu
      
      * compile pass on cpu and gpu.
      
      * fix compile problem
      
      * fix compile error.
      
      * update
      
      * fix ci problem
      
      * update
      
      * ci approve
      
      * fix ci problem
      
      * fix ci eager test problem
      
      * remove BOOST_GET_CONST
      
      * fix npu compile
      c48a9ad5
  6. 12 1月, 2022 1 次提交
    • L
      Adjust warpper of gpu_lanuch_config (#38654) · f5166284
      limingshu 提交于
      * first commit
      
      * fix wrong filename
      
      * fix the wrong spell name
      
      * fix gpu config warper
      
      * modify according to pr advices
      
      * fix GpuLauchConfig1D api bugs
      
      * change the config for dropout grad
      
      * fix bugs
      
      * modification according to pr advices
      
      * modification according to pr advices
      f5166284
  7. 30 12月, 2021 1 次提交
    • Z
      Add cusparse and unittest (#38431) · 667dc9f0
      zhangkaihuo 提交于
      
      
          将cuSparse的handle与DeviceContext进行绑定,避免op中进行创建和销毁
          添加对cuSparse中dense和sparse转换的API进行封装
          添加对封装的API的单测
      667dc9f0
  8. 27 12月, 2021 1 次提交
  9. 17 12月, 2021 1 次提交
    • F
      Get base pointer from Allocation (#37978) · 431a2d6a
      From00 提交于
      * Get GPU BasePtr from CUDA allocation
      
      * Fix compile error for ROCm
      
      * Add BasePtr function for IPUPlace in naive_best_fit_allocator.cc
      
      * Add alignment for BuddyAllocator
      
      * Set address alignment of BuddyAllocator to 32 bytes
      
      * Fix CI error
      
      * Remove code for naive_best_fit strategy
      431a2d6a
  10. 10 12月, 2021 1 次提交
  11. 09 12月, 2021 1 次提交
  12. 03 12月, 2021 1 次提交