1. 30 Jan, 2022 3 commits
  2. 29 Jan, 2022 3 commits
    • Add xpu2 compiler (#37254) · 92da5055
      Liu-xiandong committed
      * Add XPU compiler for paddle, test=develop
      
      * clean code
      
      * clean useless code
      
      * clean useless code
      
      * clean useless code
      
      * test
      
      * add include path
      
      * use clang compiler
      
      * xpu2.cmake
      
      * XPU2 compiler passed
      
      * update
      
      * update after pten
      
      * combine WITH_XPU and WITH_XPU2
      
      * update the fuse operation in WITH_XPU and WITH_XPU2
      
      * update
      
      * update
      
      * update
      
      * fix the merge error
      
      * update
      
      * update the code
      
      * update the code
      
      * add run_kp_kernel flag
      
      * update
      
      * update
      
      * fix prepared type_ bug
      
      * clean and update the code
      
      * reset the kernel_primitives
      
      * update
      
      * clean the code
      
      * delete useless comment
      
      * fix the bug in WITH_XPU
      
      * update
      
      * update
      
      * modify the abi
      
      * delete some useless code
      
      * Parameter automation in xpu compilation
      
      * Parameter automation in xpu compilation
      
      * delete kps in cmake
      
      * delete useless comment
      
      * clean the code
      
      * clean the code
    • [PTen] Tidy pten core headers (#39188) · dd990981
      Chen Weihang committed
      * open header for custom kernel
      
      * add core utils
      
      * tidy core code
      
      * tidy header
      
      * tidy include
      
      * tidy namespace
      
      * resolve conflict
      
      * fix unittest and coverage
      
      * remove platform using
      
      * resolve conflict
      
      * resolve conflict
      
      * fix digamma namespace error
      
      * fix xpu full kernel error
      
      * fix xpu full kernel error
      
      * polish details
      
      * add place for lib storage
    • 7b4916c4
  3. 28 Jan, 2022 7 commits
  4. 27 Jan, 2022 4 commits
    • 【PTen】Remove ReMakePtenDenseTensor (#39094) · 98c1829b
      zyfncg committed
      * remove remake densetensor
      
      * fix eager test error
      
      * fix bug in eager
    • [PluggableDevice] Add custom kernel support based on pten kernel management (#38848) · a8879215
      Aganlengzi committed
      * [Demo] custom kernel based on pten kernel
      
      * merge and npu custom work well
      
      * del comments
      
      * delete other code
      
      * fix CUDAContext
      
      * fix not found small_vector.h
      
      * support NPU
      
      * fix NPUContext
      
      * fix DeviceContext support
      
      * add UT
      
      * fix call
      
      * add UT
      
      * fix
      
      * fix for comments and ut
      
      * add MACRO control
      
      * fix multi input output
      
      * support env CUSTOM_DEVICE_ROOT
      
      * deal with special cases
      
      * fix for Windows
      
      * try coverage with test_custom_kernel_dot.py
      
      * fix test_custom_kernel_dot
      
      * fix test_custom_kernel_dot
      
      * fix merge
      
      * fix merge
      
      * fix CI
      
      * update
      
      * merge and fix
      
      * remove WITH_CUSTOM_KERNEL
      
      * fix merge
      
      * merge and fix
      
      * fix ut
      
      * fix ut for mac
      
      * add more UT
      
      * add more UT
      
      * fix
    • fix shuffle_channel_detect_pass (#39242) · af9ddeb7
      wenbin committed
      * shuffle channel pass
      
      * add ut
      
      * timeout fix
      
      * makefile fix
    • compile for afs api (#39113) · 4748486e
      Thunderbrook committed
      * compile for afs api
      
      * with pslib
  5. 26 Jan, 2022 4 commits
  6. 25 Jan, 2022 4 commits
  7. 24 Jan, 2022 4 commits
  8. 22 Jan, 2022 2 commits
  9. 21 Jan, 2022 5 commits
  10. 20 Jan, 2022 2 commits
  11. 18 Jan, 2022 2 commits
    • Mish FP32/BF16 kernel, conv and fc fuse passes (#38623) · 1d18bc2c
      Sławomir Siwek committed
      * Mish
      
      * Change exp() library
      
      * mish fuse pass
      
      * mish attrs
      
      * fixes
      
      * mishop maker
      
      * remove attrs
      
      * mish kernel for bf16
      
      * fc+mish fuse
      
      * fix code format error
      
      * Resolve merge conflicts
      
      * Update mish operator version
      
      * update mish variable to new naming convention
    • [Unify Tensors PR #8] Merged Tensor into DenseTensor, test=allcases (#38914) · 2052f1e3
      Zhanlue Yang committed
      * Merged LoDTensor with Tensor,test=allcases
      
      * Patched python level LoDTensor
      
      * Patched python level LoDTensor
      
      * Merge Tensor into DenseTensor
      
      * Fixed namespace issues,test=allcases
      
      * Fixed merge issues
      
      * Fixed inference issues
      
      * Fixed NPU test issues
      
      * Fixed merge issues