- 28 4月, 2023 1 次提交
-
-
由 Bo Zhang 提交于
* change judgement for DropoutGradGPUKernelDriver * add UnrollerWithoutVecSize and after this Loaddata to be refined * pass unittest * use same unroller with XPU * BroadcastWithInt64Index * BroadcastDataLoader template partial specialization * fix compile errs in ROCms * clean ElementwiseT and InT for BroadcastKernel * default axis and clean inT * remove redundant fast divmod computation * optimize drop_nd & drop_nd_grad * optimize BroadcastDataLoader bf16 fp16 * rm InT etc. after merge develop * delete constexpr for windows ci * fix conflict * fix conflic with develop * fix conflic * new clean * clean
-
- 21 11月, 2022 1 次提交
-
-
由 wanghuancoder 提交于
* refine reduce_all
-
- 20 5月, 2022 1 次提交
-
-
由 niuliling123 提交于
-
- 14 4月, 2022 1 次提交
-
-
由 Chen Weihang 提交于
* chnage dispatch to visit * resolve conflict
-
- 28 3月, 2022 1 次提交
-
-
由 chentianyu03 提交于
* remove in_dtype, out_dtype in redcue grad * set the dtype and layout in noneedbufferInputs func
-
- 18 3月, 2022 1 次提交
-
-
由 chentianyu03 提交于
* move reduce_mean_grad kernel into phi * move reduce_max/min_grad into phi * remove raw max/min grad kernel * fix bug * fix max/min grad error * move all reduce_grad kernel into one file * add prod grad kernel * add infermeta for prod kernel
-
- 11 3月, 2022 1 次提交
-
-
由 chentianyu03 提交于
* add reduce_sum grad kernel * add reduce_grad * modify reduce grad * update reduce grad functions * fix build error * add argument mapping * move cast input after grad * add dims.size=1 cpu reduce_sum grad compute method * update reduce grad GPU * remove raw reduce_sum_grad kernel * modify header files * add namespace funcs for reduce_grad_funcstions
-
- 20 2月, 2022 2 次提交
-
-
由 Chen Weihang 提交于
* rename pten dir to phi * rename namespace to phi * rename infrt pten dir to phi * resolve conflict * rename pten to phi in cmake * revert all infrt change * change needed files * fix infrt failed * fix inference failed
-
由 Yiqun Liu 提交于
-
- 09 2月, 2022 1 次提交
-
-
由 niuliling123 提交于
-