- 13 1月, 2022 6 次提交
-
-
由 LiYuRio 提交于
-
由 wenbin 提交于
roi_align aligned supported
-
由 jakpiase 提交于
* base changes for mul reimplementation * empty commit * tmp save * full implementation of mul bf16/fp32 fwd bwd * CI fix * CI rerun * changed unity build cmake to avoid gpu issues * removed mul mkldnn from unity build * added skipping tests if not cpu_bf16 * CI fix * CI fix * CI fix
-
由 Chen Weihang 提交于
* fix mkldnn invalid infershape * add unittest for mkldnn in new executor * add import os
-
由 Weilong Wu 提交于
* Support test_imperative using_non_zero_gpu and Add a TODO comment * Change GPU number to 0 * Modify the cuda device selection method
-
由 石晓伟 提交于
-
- 12 1月, 2022 23 次提交
-
-
由 Zhang Ting 提交于
* code clean * [part 3]change type of function args
-
由 zhaocaibei123 提交于
-
由 Allen Guo 提交于
* support more ops * Co-authored-by: Xiaobing Wang <xiaobingw@graphcore.ai> Co-authored-by: NAllen Guo <alleng@graphcore.ai> Co-authored-by: NZhixin Yao <zhixiny@graphcore.ai> Co-authored-by: NHaicheng Jiang <haichengj@graphcore.ai> Co-authored-by: NHan Zhao <hanzhao@graphcore.ai> * add authors Co-authored-by: NXiaobing Wang <xiaobingw@graphcore.ai> Co-authored-by: NAllen Guo <alleng@graphcore.ai> Co-authored-by: NZhixin Yao <zhixiny@graphcore.ai> Co-authored-by: NHaicheng Jiang <haichengj@graphcore.ai> Co-authored-by: NHan Zhao <hanzhao@graphcore.ai> * update date Co-authored-by: NXiaobing Wang <xiaobingw@graphcore.ai> Co-authored-by: NZhixin Yao <zhixiny@graphcore.ai> Co-authored-by: NHaicheng Jiang <haichengj@graphcore.ai> Co-authored-by: NHan Zhao <hanzhao@graphcore.ai>
-
由 ziyoujiyi 提交于
* delete gloo connect retry * the_one_ps dirs reconstruct * . * . * create the_one_ps dirs * create the_one_ps dirs * create the_one_ps dirs * create the_one_ps dirs * create the_one_ps dirs * create the_one_ps dirs * the one ps dirs modify * the one ps dirs modify * the one ps dirs modify * the one ps dirs modify
-
由 Sylwester Fraczek 提交于
* fix conv act int8 scale * add unit test for conv+hard_swish
-
由 xiaoting 提交于
* support 5d for nearest * update nearest3d unittest, test=develop * fix approve ci, test=develop * fix approve ci, test=develop
-
由 JZ-LIANG 提交于
* auto parallel sharding base * chmod * add unitest * set unitest cmake dist label * revise code according to rewiew * chmod * bugfix for grad_clip and param broadcast * chmod * update unitest * chmod * add clip * chmod * add amp pass * chmod * add unitest * remove grad update * fixed bug * fixed bug * fixed typose * fixed typoes
-
由 Lijunhui 提交于
* init elem_max_grad op * optimize code and reply review comments * ternary functors * apply new reduce func * move functor to .h * multi-outputs init * rearrange code * modifed functors * optimizer code * pass nullptr * revert the last change as seg fault occurs * optimize code * remove inplace * remove comments
-
由 Chen Weihang 提交于
* remove hybird dir * resolve conflit
-
由 Lijunhui 提交于
* ini commit * multi-outputs init commit * optimize code * remove inplace
-
由 Zhang Ting 提交于
-
由 chentianyu03 提交于
* move dot_dev api into dot_kernel.h * add infermate header * modify to dotkerel in dot_op.h * mvoe conj dev api into complex_kernel.h * move sign dev api into sign_kernel.h
-
由 Jiabin Yang 提交于
-
由 Zhang Zheng 提交于
* Add pten change file check for op benchmark * fix style format * test * revert
-
由 YuanRisheng 提交于
* refactor the impl of elementwise grad kernel * refactor impl of elementwise grad kernel(cuda) * fix compile bugs
-
由 Chen Long 提交于
* update readme test=document_fix * update conll05 docs * update conll05 docs test=document_fix
-
由 Zhang Ting 提交于
-
由 Zhang Ting 提交于
-
由 Zhang Ting 提交于
-
由 limingshu 提交于
* first commit * fix wrong filename * fix the wrong spell name * fix gpu config warper * modify according to pr advices * fix GpuLauchConfig1D api bugs * change the config for dropout grad * fix bugs * modification according to pr advices * modification according to pr advices
-
由 liutiexing 提交于
* add align for WorkQueue * add spinlock * merge develop * merge * Add EventsWaiter * Revert "Add EventsWaiter" This reverts commit e206173aa9be7401b83a53581627bfaf557c8fb2. * os_info update * update * update * update * update * update * fix * update * update for windows * fix windows * update * update Co-authored-by: Nliutiexing <liutiexing@google.com>
-
由 Sing_chan 提交于
* add args check and comment for exp,polynomy decay * modify according to zhouwei's comment
-
由 Chen Weihang 提交于
-
- 11 1月, 2022 11 次提交
-
-
由 YuanRisheng 提交于
-
由 zyfncg 提交于
* refactor matmul directory in pten * fix merge conflict * add dot_grad kernel * add dot_grad kernel in pten * add matmul_grad kernel * update the code * delete useless code in fluid * fix some bug of running matmul grad kernel * fix merge conflict * refactor some code * refactor code
-
由 Sing_chan 提交于
-
由 Zhang Zheng 提交于
* fix bug when inplace strategy * fix * fix * fix * fix * fix
-
由 niuliling123 提交于
-
由 Wilber 提交于
* add pten::Place data structure. * update ci problem * fix ci problem * update
-
由 Weilong Wu 提交于
-
由 caozhou 提交于
* update dist tensor * add unitest * update unitest * refactor dist tensor * update dist tensor and unitest
-
由 zhangbo9674 提交于
* check amp.decorate and DataParallel * refine coverage * fix layer dtype * refine code
-
由 limingshu 提交于
* fix the wrong filename * first commit * first commit * remove rest useless headers * for ci approval
-
由 Sing_chan 提交于
* support vs2019 compilation in windows * not modify pow_op's original compute logic
-