- 15 Jan, 2022: 3 commits
-
-
Committed by Chen Weihang
-
Committed by Zhanlue Yang
* Merged LoDTensor with Tensor,test=allcases * Patched python level LoDTensor * Fixed example code failure * Polished function names, removed duplicated forward declarations
-
Committed by zyfncg
-
- 14 Jan, 2022: 8 commits
-
-
Committed by heliqi
* add trt_convert_flatten_contiguous_rang op * trt version >7,support trt_convert_flatten_contiguous_rang * trt version >7,support trt_convert_flatten_contiguous_rang * trt version >7,support trt_convert_flatten_contiguous_rang * test cast add trt version >=7 skip
-
Committed by Zhangjingyu06
* [XPU]add split op for kunlun2,*test=kunlun * [XPU]add split op for kunlun2,*test=kunlun * [XPU]add split op for kunlun,*test=kunlun * [XPU]add stack_grad op for kunlun2,*test=kunlun Co-authored-by: QingshuChen <chenqingshu@baidu.com>
-
Committed by 王明冬
-
Committed by Baibaifan
-
Committed by YuanRisheng
-
Committed by qipengh
* [MLU]: add mean and reduce mean op * [MLU]add mlu pytest dir in CMakeLists.txt * [MLU]fix tensor data * [MLU]fix TensorToPyArray and license
-
Committed by Sing_chan
-
Committed by 石晓伟
-
- 13 Jan, 2022: 14 commits
-
-
Committed by Chen Weihang
* rename register marco * fix error changing * fix format error
-
Committed by Wangzheee
* add Paddle Trt config: with_interleaved
-
Committed by shangliang Xu
-
Committed by furnace
[NPU] fix tril_triu
-
Committed by furnace
* [NPU] fix expand op * [NPU] optimize codes * [NPU] optimize codes
-
Committed by Sing_chan
* force close eager_generator.exe * modify according to zhouwei's comment
-
Committed by chentianyu03
* move dot_dev api into dot_kernel.h * add infermate header * modify to dotkerel in dot_op.h * mvoe conj dev api into complex_kernel.h * move sign dev api into sign_kernel.h * move scale dev api into kernel.h and remove infermete.h * rm paddle/pten/include/math.h * rm paddle/pten/include/math.h * rm include dir * rm paddle/pten/include/math.h * fix conflict with develop branch * rm devContext in conj_op.h * add the missing complex_kernel header
-
Committed by JZ-LIANG
-
Committed by LiYuRio
-
Committed by wenbin
roi_align aligned supported
-
Committed by jakpiase
* base changes for mul reimplementation * empty commit * tmp save * full implementation of mul bf16/fp32 fwd bwd * CI fix * CI rerun * changed unity build cmake to avoid gpu issues * removed mul mkldnn from unity build * added skipping tests if not cpu_bf16 * CI fix * CI fix * CI fix
-
Committed by Chen Weihang
* fix mkldnn invalid infershape * add unittest for mkldnn in new executor * add import os
-
Committed by Weilong Wu
* Support test_imperative using_non_zero_gpu and Add a TODO comment * Change GPU number to 0 * Modify the cuda device selection method
-
Committed by 石晓伟
-
- 12 Jan, 2022: 15 commits
-
-
Committed by Zhang Ting
* code clean * [part 3]change type of function args
-
Committed by zhaocaibei123
-
Committed by Allen Guo
* support more ops * Co-authored-by: Xiaobing Wang <xiaobingw@graphcore.ai> Co-authored-by: Allen Guo <alleng@graphcore.ai> Co-authored-by: Zhixin Yao <zhixiny@graphcore.ai> Co-authored-by: Haicheng Jiang <haichengj@graphcore.ai> Co-authored-by: Han Zhao <hanzhao@graphcore.ai> * add authors Co-authored-by: Xiaobing Wang <xiaobingw@graphcore.ai> Co-authored-by: Allen Guo <alleng@graphcore.ai> Co-authored-by: Zhixin Yao <zhixiny@graphcore.ai> Co-authored-by: Haicheng Jiang <haichengj@graphcore.ai> Co-authored-by: Han Zhao <hanzhao@graphcore.ai> * update date Co-authored-by: Xiaobing Wang <xiaobingw@graphcore.ai> Co-authored-by: Zhixin Yao <zhixiny@graphcore.ai> Co-authored-by: Haicheng Jiang <haichengj@graphcore.ai> Co-authored-by: Han Zhao <hanzhao@graphcore.ai>
-
Committed by ziyoujiyi
* delete gloo connect retry * the_one_ps dirs reconstruct * . * . * create the_one_ps dirs * create the_one_ps dirs * create the_one_ps dirs * create the_one_ps dirs * create the_one_ps dirs * create the_one_ps dirs * the one ps dirs modify * the one ps dirs modify * the one ps dirs modify * the one ps dirs modify
-
Committed by Sylwester Fraczek
* fix conv act int8 scale * add unit test for conv+hard_swish
-
Committed by xiaoting
* support 5d for nearest * update nearest3d unittest, test=develop * fix approve ci, test=develop * fix approve ci, test=develop
-
Committed by JZ-LIANG
* auto parallel sharding base * chmod * add unitest * set unitest cmake dist label * revise code according to rewiew * chmod * bugfix for grad_clip and param broadcast * chmod * update unitest * chmod * add clip * chmod * add amp pass * chmod * add unitest * remove grad update * fixed bug * fixed bug * fixed typose * fixed typoes
-
Committed by Lijunhui
* init elem_max_grad op * optimize code and reply review comments * ternary functors * apply new reduce func * move functor to .h * multi-outputs init * rearrange code * modifed functors * optimizer code * pass nullptr * revert the last change as seg fault occurs * optimize code * remove inplace * remove comments
-
Committed by Chen Weihang
* remove hybird dir * resolve conflit
-
Committed by Lijunhui
* ini commit * multi-outputs init commit * optimize code * remove inplace
-
Committed by Zhang Ting
-
Committed by chentianyu03
* move dot_dev api into dot_kernel.h * add infermate header * modify to dotkerel in dot_op.h * mvoe conj dev api into complex_kernel.h * move sign dev api into sign_kernel.h
-
Committed by Jiabin Yang
-
Committed by Zhang Zheng
* Add pten change file check for op benchmark * fix style format * test * revert
-
Committed by YuanRisheng
* refactor the impl of elementwise grad kernel * refactor impl of elementwise grad kernel(cuda) * fix compile bugs
-