- 10 5月, 2023 1 次提交
- 
- 
由 Bo Zhang 提交于* Support different dtypes of inputs for broadcast for dropout optimization (#52093) * change judgement for DropoutGradGPUKernelDriver * add UnrollerWithoutVecSize and after this Loaddata to be refined * pass unittest * use same unroller with XPU * BroadcastWithInt64Index * BroadcastDataLoader template partial specialization * fix compile errs in ROCms * PR comment * dropout_nd_optimization (#51479) * with printf * add DropOutNdForwardKernel * PR comment * Dropout optimize & clean broadcast inT and ElementwiseType (#52969) * change judgement for DropoutGradGPUKernelDriver * add UnrollerWithoutVecSize and after this Loaddata to be refined * pass unittest * use same unroller with XPU * BroadcastWithInt64Index * BroadcastDataLoader template partial specialization * fix compile errs in ROCms * clean ElementwiseT and InT for BroadcastKernel * default axis and clean inT * remove redundant fast divmod computation * optimize drop_nd & drop_nd_grad * optimize BroadcastDataLoader bf16 fp16 * rm InT etc. after merge develop * delete constexpr for windows ci * fix conflict * fix conflic with develop * fix conflic * new clean * clean * Fix xpu2 kp compile error (#53548) * fix conflict * conflict 
 
- 
- 23 3月, 2023 1 次提交
- 
- 
由 yeliang2258 提交于* add bf16 and fp16 tests * fix dtype check 
 
- 
- 13 3月, 2023 1 次提交
- 
- 
由 iSerendipity 提交于* Replace paddle::experimental::DataType as phi::DataType * restore custom_device.cc 
 
- 
- 02 3月, 2023 1 次提交
- 
- 
由 Ruibiao Chen 提交于* Check structed kernel for new executor static build * Update code * Ready for resnet50 * Move transfer_dtype to phi * Ready for transformer * Fix CI errors * Fix layer_norm InferMeta * Remove layer_norm infermeta fix 
 
- 
- 10 11月, 2022 1 次提交
- 
- 
由 YuanRisheng 提交于* standard api * fix sparse bugs * fix xpu bugs, test=kunlun * remove hard code for custom unittest * open ci, test=kunlun * deal with conflict 
 
- 
- 04 8月, 2022 1 次提交
- 
- 
由 limingshu 提交于* first commit * add fp16 ctest files for compare op * add cpu register of float16 for compare ops 
 
- 
- 05 6月, 2022 1 次提交
- 
- 
由 Sing_chan 提交于
 
- 
- 03 3月, 2022 1 次提交
- 
- 
由 From00 提交于* Move compare OPs to phi * Fix bug * Use BroadcastKernel and ElementwiseKernel in phi 
 
- 
