- 26 Jun, 2023 1 commit
-
-
Committed by XiaociZhang
* [XPU] support xpu runtime profiler: follow up * fix compile issue
-
- 20 Jun, 2023 1 commit
-
-
Committed by XiaociZhang
* [kunlun] avoid compile issue in non-xpu env also rename macro WITH_XPU_XPTI to WITH_XPTI * move get_xpti_dependency.sh to tools/xpu * move get_xpti_dependency.sh to tools/xpu * call get_xpti_dependency.sh only in need
-
- 16 Jun, 2023 1 commit
-
-
Committed by jameszhang
* [kunlun] support xpu runtime profiler * fix cmake error * add libxpti.so to paddle package * fix for style check * sync change in setup.py and python/setup.py.in * remove libxpti.so from paddle output dir in this PR
-
- 26 May, 2023 1 commit
-
-
Committed by YuanRisheng
* create phi so * fix ci bugs * fix py3 bugs * add file * fix py3 bugs * fix windows bugs * perfect so * fix py3 bugs * delete all static target in phi * fix windows bugs * fix py3 bugs * fix ci bugs * fix windows bugs * fix bugs: gflags can't be linked by dynamic and static lib * fix bugs that can not load 3rd party * fix ci bugs * fix compile bugs * fix py3 bugs * fix conflict * fix xpu bugs * fix mac compile bugs * fix psgpu bugs * fix inference failed * deal with conflict * fix LIBRARY_PATH bug * fix windows bugs * fix onednn error * fix windows compile bugs * fix windows compile bugs * fix test_cuda_graph_static_mode_error aborted * fix windows bugs * fix mac-python3 error * fix hip compile bugs * change mode to static * change to static mode * fix ci bugs * fix py3 bugs * fix windows bugs * fix bugs * add static flag * add PADDLE_API * change position of PADDLE_API * fix windows bugs * change mode to dynamic lib * fix windows static bugs * deal with conflict * fix windows unit bug * fix coverage * deal with conflict * fix windows-inference * fix py3 bugs * fix bugs when compile type_info * fix compile bugs * fix py3 bugs * fix windows bugs * fix windows openblas * fix xpu bugs * fix enforce_test in windows * update code according comment * fix windows cmake bug * fix windows bugs * fix windows bugs * delete cinn unittest * fix cinn bugs --------- Co-authored-by: lzydev <1528794076@qq.com>
-
- 14 Apr, 2023 1 commit
-
-
Committed by umiswing
-
- 08 Apr, 2023 1 commit
-
-
Committed by 张春乔
* mv WITH_ASCEND_CL * mv WITH_ASCEND * rollback
-
- 03 Apr, 2023 1 commit
-
-
Committed by engineer1109
-
- 06 Jan, 2023 1 commit
-
-
Committed by 张春乔
-
- 12 Dec, 2022 1 commit
-
-
Committed by 傅剑寒
* fix codestyle * add double complex<float> complex<double> dtype support for syevj_batched * fix use_syevj flag for precision loss when input dtype of syevj_batch is complex128 in some case * optimize eigh in different case * fix missing ; bug * fix use_syevj bug * fix use_cusolver_syevj_batched flag
-
- 03 Nov, 2022 1 commit
-
-
Committed by zhouweiwei2014
-
- 19 Oct, 2022 1 commit
-
-
Committed by Yuanle Liu
-
- 18 Sep, 2022 1 commit
-
-
Committed by RichardWooSJTU
-
- 14 Sep, 2022 1 commit
-
-
Committed by JingZhuangzhuang
* Delay TensorRT registry * Add unused define * Fix TensorRT test * fix function to reference * Update trt_plugin.h
-
- 01 Aug, 2022 1 commit
-
-
Committed by zhouweiwei2014
-
- 22 Jul, 2022 1 commit
-
-
Committed by yuguo
-
- 18 Jul, 2022 1 commit
-
-
Committed by zhouweiwei2014
-
- 28 Jun, 2022 1 commit
-
-
Committed by zhouweiwei2014
* [Sparse]add SparseTensor mv kernel(csr*dense_vec->dense_vec, coo*dense_vec->dense_vec) * fix CI
-
- 26 Jun, 2022 1 commit
-
-
Committed by Sing_chan
-
- 24 Jun, 2022 1 commit
-
-
Committed by zhouweiwei2014
-
- 18 Jun, 2022 1 commit
-
-
Committed by zhouweiwei2014
-
- 15 Jun, 2022 1 commit
-
-
Committed by zhouweiwei2014
* add some kernel(csr*dense->csr, dense*dense->csr) of SparseTensor matmul * fix CI * fix CI * fix comment * fix comment
-
- 13 Jun, 2022 1 commit
-
-
Committed by Ruibiao Chen
-
- 09 Jun, 2022 1 commit
-
-
Committed by minghaoBD
-
- 05 Jun, 2022 1 commit
-
-
Committed by Sing_chan
-
- 04 Jun, 2022 1 commit
-
-
Committed by Sing_chan
-
- 04 May, 2022 1 commit
-
-
Committed by XiaoguangHu
-
- 22 Apr, 2022 1 commit
-
-
Committed by Ming-Xu Huang
* Fix leading dimension setting error in fused_gemm_epilogue_grad_op. * Add dyload to cuBlasLt functions. * Added cublasLtMatmulAlgoGetHeuristic to improve performance. * Added FLAGS_cublaslt_exhaustive_search_times to cublasLt epilogue * Added UTs to FLAGS_cublaslt_exhaustive_search_times * Added warmup runs in algo searching of Gemm epilogue. * Update copyright and documents. * Fixed error handling.
-
- 11 Mar, 2022 1 commit
-
-
Committed by zhouweiwei2014
-
- 28 Feb, 2022 2 commits
- 25 Feb, 2022 1 commit
-
-
Committed by Feiyu Chan
-
- 24 Feb, 2022 1 commit
-
-
Committed by Chen Weihang
* rename pten to phi * fix infrt compile failed * resolve conflict
-
- 20 Feb, 2022 1 commit
-
-
Committed by Chen Weihang
* rename pten dir to phi * rename namespace to phi * rename infrt pten dir to phi * resolve conflict * rename pten to phi in cmake * revert all infrt change * change needed files * fix infrt failed * fix inference failed
-
- 24 Jan, 2022 1 commit
-
-
Committed by Wilber
* move dynload from fluid to pten. * fix ci compile * fix windows ci compile. * update * update * fix compile error
-
- 10 Jan, 2022 1 commit
-
-
Committed by Haohongxiang
* add lstsq gpu kernel * update * add docs_en * modify ut * fix bugs * modify example in docs_en * remove lstsq_op.cu from ROCM cmake * modify docs_en * modify docs_en * modify docs_en * remove unnecessary TensorCopy
-
- 04 Jan, 2022 1 commit
-
-
Committed by zhangkaihuo
-
- 30 Dec, 2021 3 commits
-
-
Committed by zhiboniu
LGTM
-
Committed by Haohongxiang
* add cpu kernel of lstsq * update * modify code style * modify unittest * remove support for complex
-
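Committed by zhangkaihuo
* Bind the cuSparse handle to DeviceContext to avoid creating and destroying it inside ops * Add wrappers for the cuSparse APIs that convert between dense and sparse * Add unit tests for the wrapped APIs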
-
- 29 Dec, 2021 1 commit
-
-
Committed by sneaxiy
-