1. 31 8月, 2022 6 次提交
    • X
      [XPU] transfer concat kernel (#45463) · a0e3a175
      xiongkun 提交于
      * transfer concat kernel
      
      * test=kunlun
      
      * test=kunlun
      
      * test=kunlun
      
      * test=kunlun
      a0e3a175
    • J
      [XPU] move pool/pool_grad xpu kernel to phi (#45480) · 13a0ea4c
      james 提交于
      * move pool/pool_grad xpu kernel to phi, test=kunlun
      
      * replace mutable_data() with DeviceContext::Alloc()
      
      * replace PADDLE_ENFORCE_EQ with PADDLE_ENFORCE_XDNN, test=kunlun
      
      * adjust function param name & update include header
      
      * remove pool_op_xpu.cc
      
      * fire r200 test
      
      * minor, test=kunlun
      13a0ea4c
    • R
      [ROCM] fix bmm_kernel (#45530) · 4478389c
      ronnywang 提交于
      4478389c
    • C
      Fix split api bug (#45396) · 4a25b60d
      Charles-hit 提交于
      * fix split bug
      
      * solve function redefine
      
      * fix fluid.layers.split and add unit test
      
      * delete splitInferMeta register in unary.cc
      
      * modify test_split_op GPU unit test
      
      * modify test_split_op GPU unit test place param
      
      * refactor split op and fix infershape bugs
      
      * add () in && and ||
      
      * fix split C++ unit test
      
      * fix split infershape
      4a25b60d
    • W
      Move XPU mean and mean_grad to phi (#45512) · df7600ab
      WangZhen 提交于
      * Move XPU mean and mean_grad to phi, test=kunlun
      
      * Fix stream, test=kunlun
      
      * Replace ENFORCE, test=kunlun
      df7600ab
    • L
      Add index add API (#45176) · 45171911
      Li Min 提交于
      45171911
  2. 30 8月, 2022 12 次提交
  3. 29 8月, 2022 13 次提交
  4. 26 8月, 2022 4 次提交
  5. 25 8月, 2022 5 次提交
    • P
      Enable OMP multithreading in lookup_table_v2 (#45249) · 0c363de8
      piotrekobi 提交于
      * Add omp parallel for directives
      
      * Revert "Add omp parallel for directives"
      
      This reverts commit f4e4f8ddb12454018d9c1e49c074af2543659de6.
      
      * Add #pragma omp parallel for to correct file
      
      * Add check for _OPENMP definition
      
      * Disable omp on gpu
      
      * Trigger CI
      
      * Readd check for _OPENMP definition
      
      * Change macro disabling changes on GPU
      
      * Improve macro readability
      0c363de8
    • A
      [OpAttr]axis of Reverse Support Tensor type (#45391) · 91110661
      Aurelius84 提交于
      * [OpAttr]axis of Reverse Support Tensor type
      
      * fix coverage
      
      * fix unittest
      91110661
    • A
      [OpAttr]min/max of uniform_random support Tensor type (#45417) · c8955d0d
      Aurelius84 提交于
      * [OpAttr]min/max of Uniform_rand support Tensor type
      
      * fix typo
      c8955d0d
    • K
      Transfer memcpy d2h from fluid to phi (#45150) · 0d14e74a
      kangguangli 提交于
      * transfer memcpy_d2h from fluid to phi
      
      * refine arg check and add comment
      
      * fix cannot fallback to phi kernel
      
      * fix gpu_context host alloc when tensor size = 0
      
      * add kernel for std::vector<DenseTensor> args
      
      * fix bugs in MemcpyD2HMultiIOKernel
      
      * remove useless header file
      
      * polish format
      
      * fix typo
      
      * add testcase for cudapinned place
      
      * refine check condition in test
      
      * polish error message
      
      * polish error message
      
      * remove header in fluid  directory
      
      * merge memcpy_h2d and memcpy_d2h into one file, change register method to simplify implementation
      
      * fix code style check
      0d14e74a
    • S
      make full_like support double_max in dygraph (#45385) · edd66f2e
      Sing_chan 提交于
      * make full_like support double_max in dygraph
      
      * fix bug
      edd66f2e