- 01 Nov, 2022 1 commit
Committed by limingshu
* first commit
* transpose_kernel_optimization
* first accomplishment of transpose op
* second commit
* refine code logic of transpose_kernel
* refine transpose kernel
* first commit
* fix DtoD copy bugs for hip
* refine code according to the PR advice
* change dim to int64_t type.
* fix some type errors
-
- 28 Sep, 2022 1 commit
Committed by limingshu
-
- 22 Sep, 2022 1 commit
Committed by limingshu
* first commit
* clarify the quotes
* change code style format
* rerun for CI
-
- 01 Jul, 2022 1 commit
Committed by limingshu
* 2nd part of transpose update
* add switch_auto_tune option.
* add some changes according to CI
* refine the structure of auto_tune_base.
* merge develop changes
* reset the switch_set_range and change unittest of transpose auto-tune
* change the kernel auto-tune logic
-
- 07 Jun, 2022 1 commit
Committed by limingshu
Transpose optimization with assistance of Chengdu Supercomputing Center and auto_tune operation (#42704)
-
- 05 Jun, 2022 1 commit
Committed by Sing_chan
-
- 31 Mar, 2022 1 commit
Committed by limingshu
* for 1st time interface combine.
* modification with kernel factory
* first auto_tune version.
* first version.
* basic version
* add warm up step.
* a debug version.
* optimize the functionality of class auto_tuner.
* add some quotes for optimized auto_tuner class.
* add some quotes for optimized auto_tuner class.
* add namespace.
* modification according to the advice
* replace fluid header with phi header.
* replace fluid header with phi header.
-