- 05 1月, 2022 1 次提交
-
-
由 chentianyu03 提交于
* change 'math' to 'math_kernel' * fix compile bugs * merge develop * fix compile bugs * fix compile bugs * move reduce files by new rule * add set header * format code style * merge develop and fix conflict * merge develop and fix conflict Co-authored-by: NYuanRisheng <yuanrisheng@baidu.com>
-
- 04 1月, 2022 2 次提交
-
-
由 YuanRisheng 提交于
* change 'math' to 'math_kernel' * fix compile bugs * merge develop * fix compile bugs * move cpu_impl of elementwise kernel to new directory
-
由 Chen Weihang 提交于
* move inner cast api to cast_kernel.h * resolve conflit
-
- 29 12月, 2021 2 次提交
-
-
由 Shang Zhizhou 提交于
-
由 limingshu 提交于
-
- 28 12月, 2021 3 次提交
-
-
由 limingshu 提交于
* first commit * pass ctest of elementwise_div_grad
-
由 zyfncg 提交于
* refactor matmul directory in pten * fix merge conflict
-
由 chentianyu03 提交于
* remove intype arg in cast kernel * modify conj config in api.yaml by dictionary order * rm unused code in cast_kernel.cu
-
- 27 12月, 2021 2 次提交
-
-
由 limingshu 提交于
* No harm to KP * Pass the compile stage * change the WriteData function * fix template bugs and pass ctest of current elementwise * for passing partial template specialization of tempalte function in CI-ROCm * To make 'WriteData' funtion flexible. * a less harmful way to support multi-output * a less harmful way to support multi-output
-
由 Chen Weihang 提交于
* rename to api to copy_to * revert needless change * polish format
-
- 26 12月, 2021 1 次提交
-
-
由 Chen Weihang 提交于
* add register general kernel marco * move copy kernel impl * revert needless change * polish details * fix xpu compil faild * fix xpu compile failed * polish format
-
- 24 12月, 2021 1 次提交
-
-
由 chentianyu03 提交于
* combine reduce_cuda codes * support float16 in pten redcue_mean * replace ReduceCudaKernel impl with pten reduce impl * mv reduce funcs into reduce_cuda_impl * rm unsed codes and headers * mv GetReduceDim into reduce_cuda_impl * recover GetReduceDim in reduce_op.h * add new dispatch macro * fix pool op output not inited and cause transform to pten::denseTensor error * fix output tensor not initialized error * rename new dispatch macro and format code style * rm reduce_functor_op.h file
-
- 23 12月, 2021 2 次提交
-
-
由 Chen Weihang 提交于
-
由 Chen Weihang 提交于
* move dot kernel impl * remove needless cmake items
-
- 21 12月, 2021 2 次提交
-
-
由 Chen Weihang 提交于
* rename cuda to gpu * revert CMake change * resolve conflit * rename other cuda to gpu * poish details
-
由 chentianyu03 提交于
* fix when out_dtype is same with x.dtype and still transform type error * fix spell error
-
- 20 12月, 2021 3 次提交
-
-
由 chentianyu03 提交于
* add pten conj kernel * modify conj_kernel file path * add defined cuda macro to cuda/conj_kernel.h
-
由 石晓伟 提交于
-
由 zyfncg 提交于
-
- 17 12月, 2021 1 次提交
-
-
由 limingshu 提交于
* fix_bugs_for_elementwise_branch_selection * fix merge_dims bugs * fix all influenced file
-
- 16 12月, 2021 3 次提交
-
-
由 Chen Weihang 提交于
* unify device context entrance * move all_context include to header * polish cmake relay for device_context * fix npu compile failed * fix npu compile failed
-
由 Chen Weihang 提交于
* add register_ctx_kernel and move scale kernel * polish details by reviewer comment * fix xpu compile failed * fix cmake error
-
由 YuanRisheng 提交于
* Reduce reshape kernel functions in pten * delete notes * fix bugs when compile * modify register name * fix compile bugs
-
- 15 12月, 2021 2 次提交
-
-
由 Yiqun Liu 提交于
test=document_fix
-
由 Chen Weihang 提交于
-
- 13 12月, 2021 1 次提交
-
-
由 Shang Zhizhou 提交于
* fix reduce_max bug * add unittest
-
- 10 12月, 2021 2 次提交
-
-
由 chentianyu03 提交于
-
由 chentianyu03 提交于
* fix pten::Copy use error in redcue_impl * remove in_dtype args in reduce kernel * fix copy error * fix copy error
-
- 09 12月, 2021 1 次提交
-
-
由 Chen Weihang 提交于
-