- 18 Jan, 2022 4 commits
-
-
Committed by YuanRisheng
-
Committed by zhaoyingli
* [AutoParallel] Recompute Pass * update unittest * reshard for amp * add comment
-
Committed by zmxdream
* update * fix ps_gpu_wrapper. test=develop
-
Committed by sneaxiy
* speedup gelu using fast math * add bwd part
-
- 17 Jan, 2022 18 commits
-
-
Committed by wenbin
* develop test * throw * ne * wrong cnt
-
Committed by jakpiase
-
Committed by Allen Guo
Co-authored-by: Xiaobing Wang <xiaobingw@graphcore.ai> Co-authored-by: Allen Guo <alleng@graphcore.ai> Co-authored-by: Zhixin Yao <zhixiny@graphcore.ai> Co-authored-by: Haicheng Jiang <haichengj@graphcore.ai> Co-authored-by: Han Zhao <hanzhao@graphcore.ai>
-
Committed by Allen Guo
* update ipu_backend * sync with paddle internal * apply comments 01 * update error message * restore ipu_executor and ipu_optimizer * add clang-format on Co-authored-by: Xiaobing Wang <xiaobingw@graphcore.ai> Co-authored-by: Allen Guo <alleng@graphcore.ai> Co-authored-by: Zhixin Yao <zhixiny@graphcore.ai> Co-authored-by: Haicheng Jiang <haichengj@graphcore.ai> Co-authored-by: Han Zhao <hanzhao@graphcore.ai>
-
Committed by CtfGo
In build_cinn_pass, collect the input variables of each CINN-compiled subgraph for which only the shape is needed, and expose them to the framework's memory-optimization passes by declaring DECLARE_INPLACE_OP_INFERER in the cinn_launch op.
-
Committed by zyfncg
-
Committed by Wilber
* add pten::Place data structure. * update ci problem * fix ci problem * update * using platform::Place=pten::Place * remove BOOST_GET_CONST for CPUPlace and GPUPlace * compile pass 25%. * compile pass 45% * compile pass 60% * remove boost_get for xpu npu mlu and ipu * compile pass on cpu and gpu. * fix compile problem * fix compile error. * update * fix ci problem * update * ci approve * fix ci problem * fix ci eager test problem * remove BOOST_GET_CONST * fix npu compile
-
Committed by wangguanqun
-
Committed by Allen Guo
Co-authored-by: Xiaobing Wang <xiaobingw@graphcore.ai> Co-authored-by: Allen Guo <alleng@graphcore.ai> Co-authored-by: Zhixin Yao <zhixiny@graphcore.ai> Co-authored-by: Haicheng Jiang <haichengj@graphcore.ai> Co-authored-by: Han Zhao <hanzhao@graphcore.ai>
-
Committed by Allen Guo
* update ipu related passes * remove ipu_pass_base * update error msg * update error msg 02 * split pr 01 * restore ipu_pass_base Co-authored-by: Xiaobing Wang <xiaobingw@graphcore.ai> Co-authored-by: Allen Guo <alleng@graphcore.ai> Co-authored-by: Zhixin Yao <zhixiny@graphcore.ai> Co-authored-by: Haicheng Jiang <haichengj@graphcore.ai> Co-authored-by: Han Zhao <hanzhao@graphcore.ai>
-
Committed by sneaxiy
* add no reduce mode for pe * add NoReduce ut
-
Committed by sneaxiy
-
Committed by Zhang Ting
-
Committed by sneaxiy
-
Committed by ronnywang
* fix paddle.where torch diff * update
-
Committed by 0x45f
* close enable_inplace PASS for PE, and test dy2st pure fp16 training stability * add some comment * enlarge atol
-
Committed by Jiabin Yang
* support test_auto_prune_partial * support rest of autoprune strategy in eager mode
-
Committed by Zhanlue Yang
-
- 16 Jan, 2022 1 commit
-
-
Committed by Chen Weihang
* add select kernel map method * fix error
-
- 15 Jan, 2022 6 commits
-
-
Committed by 石晓伟
-
Committed by 石晓伟
-
Committed by Chen Weihang
* remove cached kernel context * revert dataloader format change
-
Committed by Chen Weihang
-
Committed by Zhanlue Yang
* Merged LoDTensor with Tensor, test=allcases * Patched python level LoDTensor * Fixed example code failure * Polished function names, removed duplicated forward declarations
-
Committed by zyfncg
-
- 14 Jan, 2022 8 commits
-
-
Committed by heliqi
* add trt_convert_flatten_contiguous_rang op * trt version >7, support trt_convert_flatten_contiguous_rang * test case add trt version >=7 skip
-
Committed by Zhangjingyu06
* [XPU]add split op for kunlun2, *test=kunlun * [XPU]add split op for kunlun, *test=kunlun * [XPU]add stack_grad op for kunlun2, *test=kunlun Co-authored-by: QingshuChen <chenqingshu@baidu.com>
-
Committed by 王明冬
-
Committed by Baibaifan
-
Committed by YuanRisheng
-
Committed by qipengh
* [MLU]: add mean and reduce mean op * [MLU]add mlu pytest dir in CMakeLists.txt * [MLU]fix tensor data * [MLU]fix TensorToPyArray and license
-
Committed by Sing_chan
-
Committed by 石晓伟
-
- 13 Jan, 2022 3 commits
-
-
Committed by Chen Weihang
* rename register macro * fix error changing * fix format error
-
Committed by Wangzheee
* add Paddle Trt config: with_interleaved
-
Committed by shangliang Xu
-