- 24 Jan, 2022 3 commits
-
-
Committed by Zhang Ting
-
Committed by Wilber
* move dynload from fluid to pten. * fix ci compile * fix windows ci compile. * update * update * fix compile error
-
Committed by z8hanghuan
* support sparse of adam, *test=kunlun * add pre-commit-config.yaml * support sparse of adam in KL2,*test=kunlun * support sparse of adam in KL2, *test=kunlun * modify xpu.cmake, *test=kunlun * support sparse of adam, rm some wait, *test=kunlun * support sparse of adam, rm some wait, *test=kunlun * support sparse of adam, *test=kunlun * support sparse of adam, *test=kunlun * support sparse of adam, *test=kunlun * support sparse of adam, *test=kunlun * support sparse of adam, *test=kunlun
-
- 21 Jan, 2022 12 commits
-
-
Committed by chentianyu03
* fix test concat dev api build failed * fix conflict * fix conflict
-
Committed by YuanRisheng
* add kernel for c++ api * fix compile bugs * fix kunlun compile bugs * perfect cmake * fix compile bugs when run ci-inference * fix compile bugs * add non-raw kernel for fluid op * fix compile bugs * fix compile bugs * fix unit test bug
-
Committed by chentianyu03
-
Committed by Weilong Wu
-
Committed by Zhang Ting
-
Committed by TeslaZhao
Keep strided_slice op behavior consistent with slice op when starts input is less than -rank (#39066)
-
Committed by fwenguang
* [MLU]add mlu ci dockerfile * fix comment * add cncl
-
Committed by Aurelius84
* Migrate Dim and DDim from paddle::framework into pten namespace * fix paddle::framework::Array * fix framework::Array
-
Committed by ronnywang
-
Committed by FlyingQianMM
* add block and grid loop for index_sample kernel to deal with a large-shape tensor * fix code format * limit grid dim
-
Committed by fwenguang
-
Committed by Wilber
* add cpu_context. * update * update * update * update * update * fix ci problem * fix npu ci problem * update * fix ci compile
-
- 20 Jan, 2022 7 commits
-
-
Committed by fwenguang
-
Committed by fwenguang
-
Committed by Aurelius84
* Migrate bfloat16/float16/complex from platform into pten::common * fix typo * fix code style
-
Committed by yaoxuefeng
-
Committed by zhangbo9674
* fix mp * support merged_momentum for mp
-
Committed by sneaxiy
-
Committed by sneaxiy
-
- 19 Jan, 2022 1 commit
-
-
Committed by zhangyikun02
-
- 18 Jan, 2022 7 commits
-
-
Committed by Sławomir Siwek
* Mish * Change exp() library * mish fuse pass * mish attrs * fixes * mishop maker * remove attrs * mish kernel for bf16 * fc+mish fuse * fix code format error * Resolve merge conflicts * Update mish operator version * update mish variable to new naming convention
-
Committed by zhouweiwei2014
* change CUDA implementation of uniform/gaussian OP * fix unittest
-
Committed by Zhanlue Yang
* Merged LoDTensor with Tensor,test=allcases * Patched python level LoDTensor * Patched python level LoDTensor * Merge Tensor into DenseTensor * Fixed namespace issues,test=allcases * Fixed merge issues * Fixed inference issues * Fixed NPU test issues * Fixed merge issues
-
Committed by taixiurong
-
Committed by Yiqun Liu
-
Committed by YuanRisheng
-
Committed by sneaxiy
* speedup gelu using fast math * add bwd part
-
- 17 Jan, 2022 6 commits
-
-
Committed by jakpiase
-
Committed by CtfGo
Collect, in build_cinn_pass, the input variables of each CINN-compiled subgraph for which only the shape is needed, and expose them to the framework's memory-optimization passes by declaring DECLARE_INPLACE_OP_INFERER in the cinn_launch op.
-
Committed by zyfncg
-
Committed by Wilber
* add pten::Place data structure. * update ci problem * fix ci problem * update * using platform::Place=pten::Place * remove BOOST_GET_CONST for CPUPlace and GPUPlace * compile pass 25%. * compile pass 45% * compile pass 60% * remove boost_get for xpu npu mlu and ipu * compile pass on cpu and gpu. * fix compile problem * fix compile error. * update * fix ci problem * update * ci approve * fix ci problem * fix ci eager test problem * remove BOOST_GET_CONST * fix npu compile
-
Committed by sneaxiy
-
Committed by Zhang Ting
-
- 15 Jan, 2022 1 commit
-
-
Committed by Zhanlue Yang
* Merged LoDTensor with Tensor,test=allcases * Patched python level LoDTensor * Fixed example code failure * Polished function names, removed duplicated forward declarations
-
- 14 Jan, 2022 3 commits
-
-
Committed by Zhangjingyu06
* [XPU]add split op for kunlun2,*test=kunlun * [XPU]add split op for kunlun2,*test=kunlun * [XPU]add split op for kunlun,*test=kunlun * [XPU]add stack_grad op for kunlun2,*test=kunlun Co-authored-by: QingshuChen <chenqingshu@baidu.com>
-
Committed by YuanRisheng
-
Committed by qipengh
* [MLU]: add mean and reduce mean op * [MLU]add mlu pytest dir in CMakeLists.txt * [MLU]fix tensor data * [MLU]fix TensorToPyArray and license
-