  1. 06 May, 2023 · 1 commit
    • move UniformRawKernel to legacy (#53158) · 13e2e10c
      Committed by zhangyuqin1998
      * move UniformRawKernel to legacy
      
      * Update uniform_kernel.cc
      
      * Update uniform_kernel.cu
      
      * Update uniform_kernel.cc
      
      * Update uniform_kernel.cu
      
      * Update uniform_kernel.h
      
      * Update uniform_kernel.cc
      
      * Empty Commit to setup deployments
  2. 05 May, 2023 · 14 commits
  3. 04 May, 2023 · 5 commits
  4. 30 Apr, 2023 · 1 commit
  5. 29 Apr, 2023 · 1 commit
  6. 28 Apr, 2023 · 15 commits
  7. 27 Apr, 2023 · 3 commits
    • Support different dtypes of inputs for broadcast for dropout optimization (#52093) · 3474e09c
      Committed by Bo Zhang
      * change judgement for DropoutGradGPUKernelDriver
      
      * add UnrollerWithoutVecSize and after this Loaddata to be refined
      
      * pass unittest
      
      * use same unroller with XPU
      
      * BroadcastWithInt64Index
      
      * BroadcastDataLoader template partial specialization
      
      * fix compile errs in ROCms
      
      * PR comment
    • [phi] Move sequence_pool to phi - Step 3 :sequence_pool_grad_op (#52680) · fe053396
      Committed by gouzil
      * [phi] move sequence_pool kernel to phi
      
      * mv kernels impl
      
      * fix parameter error
      
      * clean include
      
      * fix compat filename
      
      * [phi] move fluid sequence_pool_grad to phi
      
      * [phi][compat] sig rm GradVarName
      
      * [phi] fix sequence_pool out type
      
      * [phi] rm impl, add const string
      
      * [phi] fix const str
      
      * fix sequence_pooling cmake
      
      * [phi] mv sequence_pooling_test
      
      * [phi] fix grad sig
      
      * [phi] fix sequence_pool is_test error
      
      * [phi] fix sequence_pooling gpu include
      
      * [phi] mv to impl
      
      * [phi] fix SequencePoolFunctor cu include
      
      * [phi] modify out max_index int32_t
      
      * [phi] add pooltype mapping determine
      
      * [phi] fix sequence_pool_sig
      
      * [phi] fix sequence_pool_sig sum
      
      * [phi] try ci
      
      * [phi] fix max_index optional
    • scale trt converter support int64 (#53388) · 182b6f83
      Committed by Yuanle Liu