- 10 5月, 2023 1 次提交
- 
- 
由 Bo Zhang 提交于* Support different dtypes of inputs for broadcast for dropout optimization (#52093) * change judgement for DropoutGradGPUKernelDriver * add UnrollerWithoutVecSize and after this Loaddata to be refined * pass unittest * use same unroller with XPU * BroadcastWithInt64Index * BroadcastDataLoader template partial specialization * fix compile errs in ROCms * PR comment * dropout_nd_optimization (#51479) * with printf * add DropOutNdForwardKernel * PR comment * Dropout optimize & clean broadcast inT and ElementwiseType (#52969) * change judgement for DropoutGradGPUKernelDriver * add UnrollerWithoutVecSize and after this Loaddata to be refined * pass unittest * use same unroller with XPU * BroadcastWithInt64Index * BroadcastDataLoader template partial specialization * fix compile errs in ROCms * clean ElementwiseT and InT for BroadcastKernel * default axis and clean inT * remove redundant fast divmod computation * optimize drop_nd & drop_nd_grad * optimize BroadcastDataLoader bf16 fp16 * rm InT etc. after merge develop * delete constexpr for windows ci * fix conflict * fix conflic with develop * fix conflic * new clean * clean * Fix xpu2 kp compile error (#53548) * fix conflict * conflict 
 
- 
- 21 4月, 2023 1 次提交
- 
- 
由 JYChen 提交于* fix the set_value error in cpu * add a unitest for set_value OP * fix platform::is_gpu_place * add todo note for set_value * fix test 
 
- 
- 30 1月, 2023 1 次提交
- 
- 
由 engineer1109 提交于replace all TensorFromVector & TensorToVector AssignKernel async copy 
 
- 
- 24 6月, 2022 1 次提交
- 
- 
由 YuanRisheng 提交于* perfect copy * deal with conflict * deal with conflict * fix compile bugs * fix unittest bugs * change code format * deal with conflict * modify code by review * fix ce bugs * fix ce bugs * add lo * perfect code format * deal with conflicts 
 
- 
- 05 6月, 2022 1 次提交
- 
- 
由 Sing_chan 提交于
 
- 
- 31 3月, 2022 1 次提交
- 
- 
由 zyfncg 提交于* rename scalar_array to int_array * update cmake * fix conflict * remove useless log 
 
- 
- 27 3月, 2022 1 次提交
- 
- 
由 hong 提交于* move slice to pten * merge develop; test=develop * fix slice bug; * update * update * fix error * update * fix bug * polish code * polish code * polish code * try to fix windows bug * add gpu compile flag; * try to fix * remov template; * polish code; * fix npu bug; * fix npu bug * fix npu bug; test=develop * fix slice bug; * remove no need dep 
 
- 
- 14 3月, 2022 1 次提交
- 
- 
由 zyfncg 提交于* move set_value_grad kernel form fluid to phi * add unittest for passing coverage ci 
 
- 
- 09 3月, 2022 1 次提交
- 
- 
由 zyfncg 提交于* save code * fix bug of set_value * add coverage test 
 
- 
