- 28 Apr, 2023 1 commit
-
-
Committed by Bo Zhang
* change judgement for DropoutGradGPUKernelDriver
* add UnrollerWithoutVecSize; Loaddata to be refined after this
* pass unittest
* use same unroller with XPU
* BroadcastWithInt64Index
* BroadcastDataLoader template partial specialization
* fix compile errors in ROCm
* clean ElementwiseT and InT for BroadcastKernel
* default axis and clean InT
* remove redundant fast divmod computation
* optimize drop_nd & drop_nd_grad
* optimize BroadcastDataLoader bf16 fp16
* rm InT etc. after merge develop
* delete constexpr for windows ci
* fix conflict
* fix conflict with develop
* fix conflict
* new clean
* clean
-
- 27 Apr, 2023 1 commit
-
-
Committed by Bo Zhang
* change judgement for DropoutGradGPUKernelDriver
* add UnrollerWithoutVecSize; Loaddata to be refined after this
* pass unittest
* use same unroller with XPU
* BroadcastWithInt64Index
* BroadcastDataLoader template partial specialization
* fix compile errors in ROCm
* PR comment
-
- 22 Feb, 2023 1 commit
-
-
Committed by Shuangchi He
* Fix some typos.
  Signed-off-by: Yulv-git <yulvchi@qq.com>
* pre-commit
  Signed-off-by: Yulv-git <yulvchi@qq.com>
---------
Signed-off-by: Yulv-git <yulvchi@qq.com>
-
- 14 Feb, 2023 1 commit
-
-
Committed by limingshu
* first commit.
* a few small changes
* add some changes to get vec_size efficiently
* fix bugs
---------
Co-authored-by: Nzhangbopd <1299246947@qq.com>
-
- 03 Jan, 2023 1 commit
-
-
Committed by Yiqun Liu
* Use BroadcastKernel and ReduceKernel to optimize expand and expand_grad.
* Correct the axis when there is only 1 input in BroadcastKernel.
* Add the calculation of the output's shape.
-
- 14 Dec, 2022 1 commit
-
-
Committed by limingshu
* First Commit.
* add some code
* add elementwise loader
* fix code style
* merge with develop
* add some changes in both elementwise and transpose
* add init operation in broadcast kernel.
* change code according to PR suggestions about the transpose file
* fix error for op-benchmark CI
* fix according to CI
-
- 05 Dec, 2022 1 commit
-
-
Committed by Ruibiao Chen
* Replace mutable_data with DeviceContext.Alloc in phi kernels
* Fix CI errors
* Fix CI errors
* Fix CI errors, test=kunlun
* Fix CI errors, test=kunlun
* Handle rnn_functor
* Update approvals
-
- 28 Nov, 2022 1 commit
-
-
Committed by Yiqun Liu
* Optimize the log of broadcast and decrease the log level.
* Remove the redundant brackets.
* Change op benchmark CI to test the tests module.
* Stop observing the elementwise and reduce_ops sub-directories.
-
- 23 Nov, 2022 1 commit
-
-
Committed by Leo Chen
-
- 17 Nov, 2022 2 commits
- 31 Oct, 2022 1 commit
-
-
Committed by zhouweiwei2014
-
- 19 Sep, 2022 1 commit
-
-
Committed by limingshu
* first commit
* refine code with template argument
* refine code with template argument
* add ternary broadcast test file
* add ternary broadcast test file
* fix according to CI
* fix op-benchmark CI error
-
- 16 Sep, 2022 1 commit
-
-
Committed by sneaxiy
* support int64 non-broadcast
* support broadcast case for int64 index
* fix bug
* support more Arity
* remove some code
* upgrade patchelf to v0.15.0 to pass CI build
* fix bug
* fix patchelf installation
* add debug flags
* remove useless code
* fix viterbi_decode and set_value op UTs
* remove always enable int64
-
- 15 Sep, 2022 1 commit
-
-
Committed by limingshu
* first commit
* fix some bugs in code
* fix bugs
* optimize the merge one dimension feature
-
- 07 Sep, 2022 1 commit
-
-
Committed by limingshu
* first commit
* merged with develop
* merged with develop
* fix bugs in merging sequential one dims
-
- 23 Aug, 2022 1 commit
-
-
Committed by niuliling123
-
- 06 Jun, 2022 1 commit
-
-
Committed by niuliling123
-
- 05 Jun, 2022 1 commit
-
-
Committed by Sing_chan
-
- 20 May, 2022 1 commit
-
-
Committed by niuliling123
-
- 16 May, 2022 1 commit
-
-
Committed by niuliling123
-
- 12 May, 2022 1 commit
-
-
Committed by Shuangchi He
-
- 10 May, 2022 1 commit
-
-
Committed by shixingbo
-
- 27 Apr, 2022 1 commit
-
-
Committed by zyfncg
* optimize performance of dygraph
* optimize performance of dygraph and elementwise_add
* optimize the trace op
* fix bug
* fix bug
* fix unittest bug
* fix code format
-
- 25 Apr, 2022 1 commit
-
-
Committed by limingshu
* change sequential logic
* change some quotes
* add some notations
* fix wrong note style.
-
- 07 Mar, 2022 1 commit
-
-
Committed by YuanRisheng
* move elementwise_div grad
* change mutable_data to alloc
* fix compile bugs
-
- 04 Mar, 2022 1 commit
-
-
Committed by Leo Chen
* clean distribution_helper, index_impl, aligned_vector code in fluid
* fix conflicts
-
- 02 Mar, 2022 1 commit
-
-
Committed by YuanRisheng
* move elementwise function to funcs directory
* fix compile bugs
* modify according to comment
-
- 23 Feb, 2022 1 commit
-
-
Committed by Liu-xiandong
* [KP] Add elementwise add xpu, test=develop
* modify the file permissions
* modify the copyright year
* modify code style
* modify code style
-
- 20 Feb, 2022 2 commits
-
-
Committed by Chen Weihang
* rename pten dir to phi
* rename namespace to phi
* rename infrt pten dir to phi
* resolve conflict
* rename pten to phi in cmake
* revert all infrt change
* change needed files
* fix infrt failed
* fix inference failed
-
Committed by Yiqun Liu
-