- 26 Dec, 2021 1 commit
-
-
Committed by Chen Weihang
* add register general kernel macro
* move copy kernel impl
* revert needless change
* polish details
* fix xpu compile failed
* fix xpu compile failed
* polish format
-
- 24 Dec, 2021 1 commit
-
-
Committed by chentianyu03
* combine reduce_cuda codes
* support float16 in pten reduce_mean
* replace ReduceCudaKernel impl with pten reduce impl
* mv reduce funcs into reduce_cuda_impl
* rm unused codes and headers
* mv GetReduceDim into reduce_cuda_impl
* recover GetReduceDim in reduce_op.h
* add new dispatch macro
* fix pool op output not inited and cause transform to pten::denseTensor error
* fix output tensor not initialized error
* rename new dispatch macro and format code style
* rm reduce_functor_op.h file
-
- 21 Dec, 2021 1 commit
-
-
Committed by Chen Weihang
* rename cuda to gpu
* revert CMake change
* resolve conflict
* rename other cuda to gpu
* polish details
-
- 15 Dec, 2021 1 commit
-
-
Committed by Chen Weihang
-
- 10 Dec, 2021 2 commits
-
-
Committed by chentianyu03
-
Committed by chentianyu03
* fix pten::Copy use error in reduce_impl
* remove in_dtype args in reduce kernel
* fix copy error
* fix copy error
-
- 09 Dec, 2021 1 commit
-
-
Committed by Chen Weihang
-
- 03 Dec, 2021 1 commit
-
-
Committed by ronnywang
* refine structure for cuda and rocm
* update
* update
* update
* update
-
- 29 Nov, 2021 1 commit
-
-
Committed by chentianyu03
* add pten reduce kernel
* add reduce_sum kernel
* update attribute args and order
* make out dtype undefined
* fix empty input error
* merge develop branch
* rename sum as reduce function
* rename sum as reduce function
* fix reducekernelImpl args error
* add reduce cuda kernel
* modify dims type to const &
* remove unused log
* fix reduce_all out eigen function error
* remove unused codes
* add the missing sum api define and testcase
* merge develop branch
* fix sum test axis value error
* replace pten mean kernel with reduce_mean
* recover mean cuda to original implementation
-
- 17 Nov, 2021 1 commit
-
-
Committed by niuliling123
* Modify reduce_op.op.h for xpu2 with kernel primitive api
-
- 21 Oct, 2021 1 commit
-
-
Committed by niuliling123
* Update the implementation of reduceAnyKernel according to kernel primitive api
* Fix a bug in ReadData, ReadDataBc and ReadDataReduce when NX != 1
-
- 28 Sep, 2021 1 commit
-
-
Committed by Guoxia Wang
-
- 08 Sep, 2021 1 commit
-
-
Committed by niuliling123
-
- 17 Aug, 2021 1 commit
-
-
Committed by niuliling123
Fix a bug in nlp: text_matching/sentence_transformers when the last dim is 1 and a middle dim is reduced (#34941)
-
- 11 Aug, 2021 1 commit
-
-
Committed by niuliling123
-
- 05 Aug, 2021 1 commit
-
-
Committed by limingshu
-
- 02 Aug, 2021 1 commit
-
-
Committed by Zhang Zheng
-
- 12 Jul, 2021 1 commit
-
-
Committed by Zhang Zheng
-
- 05 Jul, 2021 1 commit
-
-
Committed by Zhang Zheng
-
- 02 Jul, 2021 1 commit
-
-
Committed by niuliling123
-
- 22 Jun, 2021 1 commit
-
-
Committed by niuliling123
-
- 25 May, 2021 1 commit
-
-
Committed by niuliling123
-