1. 20 1月, 2023 1 次提交
  2. 19 1月, 2023 2 次提交
    • H
      Fix paddle.queeze_ bug (#49903) · 11e34ae0
      heliqi 提交于
      * fix queeze_ bug
      
      * fix slove use squeeze_kernel
      
      * fix slove use squeeze_kernel
      
      * fix slove use squeeze_kernel
      
      * add test case
      11e34ae0
    • J
      [KUNLUN] add op: maxpool_with_index (#49505) · f71f77e9
      jameszhang 提交于
      * [KUNLUN] add op: maxpool_with_index
      
      * use DeviceContext::Alloc() instead of DenseTensor::mutable_data()
      
      * fix file format
      
      * solve clip unittest failure
      
      * minor fix
      
      * Revert "solve clip unittest failure" since the issue is fixed
      in #49535
      
      This reverts commit 1127adc66e79afe35ac3c00bb34e6aaa7cd7d78b.
      
      * align with xdnn on the definition of mask in max_pool_with_index
      
      * minor
      f71f77e9
  3. 18 1月, 2023 6 次提交
  4. 16 1月, 2023 1 次提交
    • Z
      CUDA12.0 integration (#49539) · 1885d55a
      zlsh80826 提交于
      * Update warpctc for cuda-12
      
      * Deprecate cudaProfilerInitialize for CUDA > 11
      
      * Deprecate CUSPARSE_MV_ALG_DEFAULT for CUDA_VERSION >= 11040
      
      * Add the missing thrust header
      1885d55a
  5. 13 1月, 2023 8 次提交
  6. 12 1月, 2023 4 次提交
  7. 11 1月, 2023 1 次提交
    • Y
      Implement a common segmented array. (#49450) · b1faa562
      Yiqun Liu 提交于
      * Implement a common PointerArray.
      
      * Polish codes.
      
      * Add including of header file.
      
      * Add the branch of kFix8.
      
      * Fix compiling error.
      
      * Add alignas hint to fix the performance drop.
      
      * Optimize the H2D copy in stack_grad.
      
      * Rename the macro.
      
      * Fix align hint for different compilers.
      
      * Polish the define of PADDLE_ALIGN.
      
      * Fix compiling error.
      
      * Remove the align hint on windows.
      b1faa562
  8. 10 1月, 2023 3 次提交
  9. 09 1月, 2023 4 次提交
  10. 06 1月, 2023 3 次提交
    • R
      Dev (#49591) · 07db4a9f
      RuohengMa 提交于
      * add bitwise and, bitwise not, bitwise or and bitwise xor
      
      * correct typo
      07db4a9f
    • J
      [zero-dim] Support 0-d for kthvalue and mode (#49340) · 292738f3
      JYChen 提交于
      * add 0-d support for paddle.kthvalue
      
      * add 0-d support for paddle.mode
      
      * fix coverage test for device
      
      * fix check-bug in windows
      
      * change axis check from LT to LE
      
      * add shape & value check for grad when input is 0d tensor
      292738f3
    • T
      fix bug (#49546) · e0ee7403
      Thomas Young 提交于
      e0ee7403
  11. 05 1月, 2023 2 次提交
  12. 04 1月, 2023 3 次提交
    • W
      [Inference] Add conv_fusion nhwc impl. (#49047) · 4a8708bb
      Wilber 提交于
      4a8708bb
    • Y
      [Paddle Inference] fix mixed precision diff (#49475) · ac75a9a6
      Yuanle Liu 提交于
      ac75a9a6
    • H
      [Unify KernelKey] change OpKernelType->KernelKey (#49138) · 4383494f
      HongyuJia 提交于
      * execute use kernel_key first
      
      * change OpKernelType->KernelKey
      
      * fix py3 compile error, remove redundant header files
      
      * fix build_strategy_test
      
      * fix DataType::RAW
      
      * fix custom_type test: operator_test.cc
      
      * fix transform place
      
      * fix backends_are_same_class
      
      * try fix place TransDataDevice
      
      * support all KernelKey
      
      * fix TransformData
      
      * fix place_are_same_class
      
      * fix merge
      
      * fix test_params_no_grad
      
      * fix specific place of GetExpectedKernelType
      
      * fix specific place of GetExpectedKernelType
      
      * fix GetKernelTypeForVar
      
      * fix dtype error
      
      * fix fetch_v2
      
      * change GetKernelTypeForVar
      
      * fix interpreter
      
      * fix typo error
      
      * polish codes
      
      * polish codes
      
      * polish codes
      
      * fix conflict
      4383494f
  13. 03 1月, 2023 2 次提交