- 28 1月, 2022 1 次提交
-
-
由 zyfncg 提交于
-
- 26 1月, 2022 1 次提交
-
-
由 zyfncg 提交于
-
- 25 1月, 2022 2 次提交
-
-
由 YuanRisheng 提交于
-
由 sneaxiy 提交于
* assert _compile_dir include file existence * polish
-
- 24 1月, 2022 1 次提交
-
-
由 zyfncg 提交于
* add config of backward-api auto-gene * fix compile bug * remove wrong header * rename grad_api to backward_api * modify .gitignore
-
- 21 1月, 2022 2 次提交
-
-
由 YuanRisheng 提交于
* add kernel for c++ api * fix compile bugs * fix kunlun compile bugs * perfect cmake * fix compile bugs when run ci-inference * fix compile bugs * add non-raw kernel for fluid op * fix compile bugs * fix compile bugs * fix unit test bug
-
由 chentianyu03 提交于
-
- 13 1月, 2022 1 次提交
-
-
由 chentianyu03 提交于
* move dot_dev api into dot_kernel.h * add infermate header * modify to dotkerel in dot_op.h * mvoe conj dev api into complex_kernel.h * move sign dev api into sign_kernel.h * move scale dev api into kernel.h and remove infermete.h * rm paddle/pten/include/math.h * rm paddle/pten/include/math.h * rm include dir * rm paddle/pten/include/math.h * fix conflict with develop branch * rm devContext in conj_op.h * add the missing complex_kernel header
-
- 05 1月, 2022 1 次提交
-
-
由 Qi Li 提交于
-
- 04 1月, 2022 1 次提交
-
-
由 YuanRisheng 提交于
* change 'math' to 'math_kernel' * fix compile bugs * merge develop * fix compile bugs * move cpu_impl of elementwise kernel to new directory
-
- 28 12月, 2021 2 次提交
-
-
由 chentianyu03 提交于
* remove intype arg in cast kernel * modify conj config in api.yaml by dictionary order * rm unused code in cast_kernel.cu
-
由 sneaxiy 提交于
-
- 23 12月, 2021 1 次提交
-
-
由 zyfncg 提交于
* add empty and empty_like kernel in pten * add empty dev_api
-
- 22 12月, 2021 2 次提交
-
-
由 Chen Weihang 提交于
* add pten kernel cmake * add pten kernel cmake function * fix compile error * add enforce include for full kernel * fix compile failed * change cuda to gpu * fix cmake function error
-
由 zyfncg 提交于
* rename full infer_meta * fix merge problem
-
- 20 12月, 2021 1 次提交
-
-
由 chentianyu03 提交于
* add pten conj kernel * modify conj_kernel file path * add defined cuda macro to cuda/conj_kernel.h
-
- 17 12月, 2021 3 次提交
-
-
由 sneaxiy 提交于
* add compile_dir * follow comments
-
由 chentianyu03 提交于
* modify sum mean args * add GetExpectedPtenKernelArgs for redcue_op * modify kernel args number * modify kernel args number
-
由 Aurelius84 提交于
* Add RWLock to protect loading module under multi-thread * refine code * remove import statement
-
- 15 12月, 2021 1 次提交
-
-
由 Chen Weihang 提交于
-
- 14 12月, 2021 1 次提交
-
-
由 YuanRisheng 提交于
* Reduce reshape kernel functions in pten * delete notes * fix bugs when compile
-
- 13 12月, 2021 1 次提交
-
-
由 zyfncg 提交于
* add variadic_args kernel in pten * merge develop code * add variadic_args kernel and benchmark * change dynamic_cast to static_cast for DeviceContext * merge the code * modify code format * refactor variadic kernel function
-
- 10 12月, 2021 1 次提交
-
-
由 YuanRisheng 提交于
* add alias kernel name * modify code as suggestions * add alias name for matmul and remove redundant member in kernel factory
-
- 09 12月, 2021 1 次提交
-
-
由 Chen Weihang 提交于
* refine the kernel register impl * fix cmake and symbol error * remove overload marco * polish details
-
- 08 12月, 2021 1 次提交
-
-
由 YuanRisheng 提交于
* add alias kernel name * modify code as suggestions
-
- 07 12月, 2021 1 次提交
-
-
由 Aurelius84 提交于
-
- 06 12月, 2021 1 次提交
-
-
由 Chen Weihang 提交于
-
- 03 12月, 2021 1 次提交
-
-
由 zyfncg 提交于
* add code-gen file * add api-gen in cmake * adjust the code format * temp save the code * add genen-api module into pten * merge the develop code * fix merge conflict * fix code conflict with develop * support reduce_mean/sum * fix the CI requirement * fix requirement problem of CI * remove original api code * fix bug caused by removing original api
-
- 23 11月, 2021 1 次提交
-
-
由 CtfGo 提交于
`paddle.utils.download` :change to call `extractall` on tar/zip compressd file to speed up the uncompress process when they includes many files --- result of decompression speed comparison --- 1. dataset:https://paddlenlp.bj.bcebos.com/datasets/cnn_dailymail/cnn_stories.tgz, decompression time :5m50s vs 20s 2. dataset:https://paddlenlp.bj.bcebos.com/datasets/cnn_dailymail/dailymail_stories.tgz, decompression time:33m20s vs 47s
-
- 27 10月, 2021 1 次提交
-
-
由 Qi Li 提交于
* [ROCM] add custom op support, test=develop * remove debug codes, test=develop
-
- 11 10月, 2021 1 次提交
-
-
由 Qi Li 提交于
* [NPU] fix matmul_v2 and utils.run_check, test=develop * remove debug files, test=develop * fix install_check, test=develop * fix doc, test=develop * fix review comments, test=develop
-
- 28 9月, 2021 1 次提交
-
-
由 Siming Dai 提交于
-
- 17 9月, 2021 1 次提交
-
-
由 Leo Chen 提交于
* temporally disable the warnings * disable ut
-
- 13 9月, 2021 1 次提交
-
-
由 Siming Dai 提交于
-
- 11 9月, 2021 1 次提交
-
-
由 王明冬 提交于
-
- 07 9月, 2021 1 次提交
-
-
由 Aurelius84 提交于
* Add DPADDLE_WITH_CUDA for GCC * polish code
-
- 26 8月, 2021 1 次提交
-
-
由 Siming Dai 提交于
* add dlpack api and fix a from_dlpack
-
- 05 8月, 2021 1 次提交
-
-
由 0x45f 提交于
* integrated gast library * integrated gast library * fix unittest and remove ast2.py * remove 'gast' from __all__ in __init__.py * add copyright in other files * fix copyright
-
- 22 6月, 2021 1 次提交
-
-
由 jiangcheng 提交于
-
- 21 6月, 2021 1 次提交
-
-
由 tianshuo78520a 提交于
* del py2 code2 * fix test timeout
-