- 24 Jan, 2022 7 commits
-
-
Committed by chentianyu03
* add scale xpu kernel * add scale xpu kernel * add scale xpu kernel * replace with pten scale kernel * change dev_ctx * modify float16 head path * remove unused xpu header
-
Committed by YuanRisheng
[Pten]Refactor elementwise_add grad / double grad / triple grad Kernel and move them to pten (#39048) * refactor elementwise add grad * fix compile bugs * fix unit test bugs * fix file conflicts * fix bugs when buildPtenContext
-
Committed by Jacek Czaja
* - more unlikely * - compilation fix * - removed redundant definition * - fix * - Fixes * - compilation fix for windows
-
Committed by Feiyu Chan
* migration of functors in paddle/fluid/operators/eigen and paddle/fluid/platform/eigen_ext.h * update path of data types like float16.h in includes in extensions.h
-
Committed by Zhang Ting
-
Committed by Wilber
* move dynload from fluid to pten. * fix ci compile * fix windows ci compile. * update * update * fix compile error
-
Committed by z8hanghuan
* support sparse of adam, *test=kunlun * add pre-commit-config.yaml * support sparse of adam in KL2,*test=kunlun * support sparse of adam in KL2, *test=kunlun * modify xpu.cmake, *test=kunlun * support sparse of adam, rm some wait, *test=kunlun * support sparse of adam, rm some wait, *test=kunlun * support sparse of adam, *test=kunlun * support sparse of adam, *test=kunlun * support sparse of adam, *test=kunlun * support sparse of adam, *test=kunlun * support sparse of adam, *test=kunlun
-
- 21 Jan, 2022 12 commits
-
-
Committed by chentianyu03
* fix test concat dev api build failed * fix conflict * fix conflict
-
Committed by YuanRisheng
* add kernel for c++ api * fix compile bugs * fix kunlun compile bugs * perfect cmake * fix compile bugs when run ci-inference * fix compile bugs * add non-raw kernel for fluid op * fix compile bugs * fix compile bugs * fix unit test bug
-
Committed by chentianyu03
-
Committed by Weilong Wu
-
Committed by Zhang Ting
-
Committed by TeslaZhao
Keep strided_slice op behavior consistent with slice op when starts input is less than -rank (#39066)
-
Committed by fwenguang
* [MLU]add mlu ci dockerfile * fix comment * add cncl
-
Committed by Aurelius84
* Migrate Dim and DDim from paddle::framework into pten namespace * fix paddle::framework::Array * fix framework::Array
-
Committed by ronnywang
-
Committed by FlyingQianMM
* add block and grid loop for index_sample kernel to deal with a large-shape tensor * fix code format * limit grid dim
-
Committed by fwenguang
-
Committed by Wilber
* add cpu_context. * update * update * update * update * update * fix ci problem * fix npu ci problem * update * fix ci compile
-
- 20 Jan, 2022 7 commits
-
-
Committed by fwenguang
-
Committed by fwenguang
-
Committed by Aurelius84
* Migrate bfloat16/float16/complex from platform into pten::common * fix typo * fix code style
-
Committed by yaoxuefeng
-
Committed by zhangbo9674
* fix mp * support merged_momentum for mp
-
Committed by sneaxiy
-
Committed by sneaxiy
-
- 19 Jan, 2022 1 commit
-
-
Committed by zhangyikun02
-
- 18 Jan, 2022 7 commits
-
-
Committed by Sławomir Siwek
* Mish * Change exp() library * mish fuse pass * mish attrs * fixes * mishop maker * remove attrs * mish kernel for bf16 * fc+mish fuse * fix code format error * Resolve merge conflicts * Update mish operator version * update mish variable to new naming convention
-
Committed by zhouweiwei2014
* change CUDA implementation of uniform/gaussian OP * fix unittest
-
Committed by Zhanlue Yang
* Merged LoDTensor with Tensor,test=allcases * Patched python level LoDTensor * Patched python level LoDTensor * Merge Tensor into DenseTensor * Fixed namespace issues,test=allcases * Fixed merge issues * Fixed inference issues * Fixed NPU test issues * Fixed merge issues
-
Committed by taixiurong
-
Committed by Yiqun Liu
-
Committed by YuanRisheng
-
Committed by sneaxiy
* speedup gelu using fast math * add bwd part
-
- 17 Jan, 2022 6 commits
-
-
Committed by jakpiase
-
Committed by CtfGo
Collect, in build_cinn_pass, the input variables of each CINN-compiled subgraph for which only the shape is needed, and expose them to the framework's memory-optimization passes by declaring DECLARE_INPLACE_OP_INFERER in the cinn_launch op.
-
Committed by zyfncg
-
Committed by Wilber
* add pten::Place data structure. * update ci problem * fix ci problem * update * using platform::Place=pten::Place * remove BOOST_GET_CONST for CPUPlace and GPUPlace * compile pass 25%. * compile pass 45% * compile pass 60% * remove boost_get for xpu npu mlu and ipu * compile pass on cpu and gpu. * fix compile problem * fix compile error. * update * fix ci problem * update * ci approve * fix ci problem * fix ci eager test problem * remove BOOST_GET_CONST * fix npu compile
-
Committed by sneaxiy
-
Committed by Zhang Ting
-