- 30 May, 2023 17 commits
-
-
Committed by shaojie_wang
* softmax fwd: force vec size to 1 when dtype is float * use 1024 as threshold to use cudnn
-
Committed by Yiqun Liu
* Reimplement the check_nan_inf function as check_numerics kernel. * Remove the cpu implementation to phi. * Add ifdef for the including of omp.h. * Move the use of FLAGS_check_nan_inf_level out of header file. * Implement a common PrintAndThrowError function. * Fix the erroneous use of __NVCC__, which should instead be __CUDA_ARCH__. * Add dependency of phi. * Polish codes and unittest.
-
Committed by Leo Chen
* add timer to log deps * rename flag * add ut
-
Committed by Yuanle Liu
add pass/analysis_manager.h ir/type_name.h pass/pass_instrumentation.h pass/utils.h and adjust pass dir (#54170)
-
Committed by tianshuo78520a
* del sequence_enumerate_op * del analyzer_pyramid_dnn_tester * fix
-
Committed by zhouweiwei2014
-
Committed by xiaoguoguo626807
* modify gradOpMaker * modify concat bug * modify concat bug * delete unnecessary block * modify fill_any_like value from 1 to 0 * modify fill_any_like dtype from other to -1 * ci_bug
-
Committed by zhangbo9674
* refine auto gen * refine code * refine code * fix bug * fix bug * fix bug * fix bug * fix bug * fix bug
-
Committed by lijialin03
-
Committed by RedContritio
-
Committed by huangjiyi
* update * update * update * update * update * update * update * update
-
Committed by Yulong Ao
* [Auto Parallel] Reorganize the fold structure * [Auto Parallel] Fix some import errors
-
Committed by winter-wang
-
Committed by YuanRisheng
* fix compile error * delete code
-
Committed by RedContritio
* support auto generate for activation_op relu6 * add generated_static_op for activation_op in CMakeLists.txt
-
Committed by jameszhang
-
Committed by houj04
-
- 29 May, 2023 4 commits
-
-
Committed by Yuanle Liu
-
Committed by winter-wang
-
Committed by zhoutianzi666
-
Committed by wz1qqx
-
- 28 May, 2023 1 commit
-
-
Committed by kangguangli
* add op name normalizer * disable unittest
-
- 27 May, 2023 1 commit
-
-
Committed by 张春乔
* rm , from \* @param(.*?), * Apply suggestions from code review * Apply suggestions from code review
-
- 26 May, 2023 10 commits
-
-
Committed by Leo Chen
* Copy QAT files from PaddleSlim * Integrate QAT API into Paddle * Replace eval function * Reduce test_quant_aware run time * Apply new formatter on modified files * Remove code check for Paddle version check * Copy quant_post_quant_aware UT from PaddleSlim * Integrate test_quant_post_quant_aware UT into PaddlePaddle * Apply new formatter on modified files * Remove redundant code and add unittests * Add new unittests * Update the time limit of new unittests
-
Committed by YuanRisheng
* create phi so * fix ci bugs * fix py3 bugs * add file * fix py3 bugs * fix windows bugs * perfect so * fix py3 bugs * delete all static target in phi * fix windows bugs * fix py3 bugs * fix ci bugs * fix windows bugs * fix bugs: gflags can't be linked by dynamic and static lib * fix bugs that can not load 3rd party * fix ci bugs * fix compile bugs * fix py3 bugs * fix conflict * fix xpu bugs * fix mac compile bugs * fix psgpu bugs * fix inference failed * deal with conflict * fix LIBRARY_PATH bug * fix windows bugs * fix onednn error * fix windows compile bugs * fix windows compile bugs * fix test_cuda_graph_static_mode_error aborted * fix windows bugs * fix mac-python3 error * fix hip compile bugs * change mode to static * change to static mode * fix ci bugs * fix py3 bugs * fix windows bugs * fix bugs * add static flag * add PADDLE_API * change position of PADDLE_API * fix windows bugs * change mode to dynamic lib * fix windows static bugs * deal with conflict * fix windows unit bug * fix coverage * deal with conflict * fix windows-inference * fix py3 bugs * fix bugs when compile type_info * fix compile bugs * fix py3 bugs * fix windows bugs * fix windows openblas * fix xpu bugs * fix enforce_test in windows * update code according comment * fix windows cmake bug * fix windows bugs * fix windows bugs * delete cinn unittest * fix cinn bugs --------- Co-authored-by: lzydev <1528794076@qq.com>
-
Committed by zhangbo9674
* on * add coverage for ir
-
Committed by zhaoyingli
* global view process_group * fix import * fix attr * fix tuner init comm
-
Committed by zhupengyang
* Not delete slice op if out op has shared var node
-
Committed by Yuanle Liu
* fix fp16 io * disable precision test
-
Committed by jiangfan06
-
Committed by risemeup1
* fix test error * fix test_exectuor_feed_non_tensor * fix test error
-
Committed by HongyuJia
-
Committed by Yuanle Liu
* [IR&PASS] part 1: add pass base, pass manager, adaptor pass, ut * include cstdint
-
- 25 May, 2023 7 commits
-
-
Committed by HongyuJia
-
Committed by Yuanle Liu
-
Committed by zhangkaihuo
-
Committed by zhangkaihuo
-
Committed by thunder95
-
Committed by zhangbo9674
* refine code * delete some unused code * refine code of build * refine code of build * add block * refine builder * refine code * refine code by comment * fix compiler bug
-
Committed by Zhang Jun
-