- 02 6月, 2023 14 次提交
-
-
由 Hui Zhang 提交于
* floor div support float/double/bfloat16/float16 * add ut * fix bug * fix fft.ifftshift for floor_divide upgrade * fix comment * fix bugs * fix bug
-
由 Zhang Zheng 提交于
* Optimize perf of broadcast matmul * support more dtype
-
由 Ruibiao Chen 提交于
-
由 傅剑寒 提交于
-
由 winter-wang 提交于
-
由 Yuanle Liu 提交于
-
由 Zhang Ting 提交于
* support master_grad for adam and momentum Co-authored-by: zhangting_2017@163.com <zhangting2020>
-
由 xiongkun 提交于
* function * remove print * add some pystate
-
由 Charles-hit 提交于
-
由 Wang Xin 提交于
* static graph autogen code for shape op * fix onednn * fix onednn
-
由 xiaoguoguo626807 提交于
-
由 Aurelius84 提交于
-
由 Aurelius84 提交于
* [IR]Polish code in builtin type declaration * fix typo
-
由 winter-wang 提交于
-
- 01 6月, 2023 15 次提交
-
-
由 zhangbo9674 提交于
* add build * add build * refine code * refine code * refine code * refine code * refine interface * fix bug * fix bug * fix bug * refine yaml
-
由 umiswing 提交于
-
由 zhouweiwei2014 提交于
-
由 tianshuo78520a 提交于
* revert sequence_enumerate_op * restore sequence_enumerate_op
-
由 Charles-hit 提交于
* support layer_norm prim op bf16 dtype * polish code * resolve conflict
-
由 ronnywang 提交于
* [ROCM] fix multihead_matmul * skip bf16 uts * update
-
由 Wang Xin 提交于
* static graph autogen code for check_finite_and_unscale_ op * bug fixed
-
由 LiYuRio 提交于
-
由 YuanRisheng 提交于
-
由 RedContritio 提交于
-
由 sprouteer 提交于
-
由 huangjiyi 提交于
* update * update cmake * update * update * update * update * Revert "update cmake" This reverts commit 1e1dc1b2bc9967b725201272607f939260070fd4. * update * update * update * update
-
由 zhaoyingli 提交于
* [AutoParallel] update while control_flow with pipeline * update process group instantiate * fix micro_bsz for reshard * update api for micro batch size * add strategy for dp optimization
-
由 tianshuo78520a 提交于
* mv all unittests test * fix error * fix error * fix * fix * del unittests * fix paddle_build.sh * fix * fix test * fix add test * fix * fix * fix * merge develop * fix * fix * fix * fix * fix * merge develop * fix test_async_read_write * fix test_async_read_write * merge develop * fix * fix import legacy_test * fix * fix * fix * fix * fix * fix * fix * fix * fix bug * fix * fix coverage test bug * fix * fix * fix * fix * fix * fix code sstyle * fix code * fix code * fix * fix * fix * del test_sequence_enumerate_op.py * fix
-
由 hong 提交于
* add infer_shape interface * update
-
- 31 5月, 2023 5 次提交
-
-
由 RedContritio 提交于
* support auto generate for static op reduce_amax * set reduce_amax attr 'axis' type as IntArray
-
由 Ruibiao Chen 提交于
-
由 Yuanle Liu 提交于
-
由 YuanRisheng 提交于
* fix inference static lib bugs * add if for copy * fix py3 bugs
-
由 Charles-hit 提交于
* support activation prim op bf16 dtype * remove useless code
-
- 30 5月, 2023 6 次提交
-
-
由 risemeup1 提交于
* update_c++17 * update_c++17 * fix windows bug * solve cirle depend * solve cirle depend * solve cirle depend * solve cirle depend * solve cirle depend * fix windows bug * fix compiler error * fix compiler error * update eigen3 * update eigen3 * update eigen3 * fix mac-py3 compiler error * update C++17 * fix mac compiler error * fix compile error * fix coverage_compiler error * fix coverage_ci_problem * fix coverage_error * fix_kunlun200 compile error * fix kunlun200 compiler error * fix compile error * fix compiler error * fix py3 failed test * fix kunlun200 compiler error * test * fix test error * fix test error * fix test error * test * test * fix mac py3 error * fix mac py3 error * fix mac py3 error * fix test error * fix test error * fix compile error * fix compile error * fix compile error * test * test * fix compiler error * test * test * debug on ci * fix compiler error * fix compiler error * test * fix cinn compiler error * test * fix rocm cmpile error * fix cinn and kunlun compile error * update c++14 * Update flags.cmake
-
由 shaojie_wang 提交于
* softmax fwd: force vec size to 1 when dtype is float * use 1024 as threshold to use cudnn
-
由 Yiqun Liu 提交于
* Reimplement the check_nan_inf function as check_numerics kernel. * Remove the cpu implemention to phi. * Add ifdef for the including of omp.h. * Move the use of FLAGS_check_nan_inf_level out of header file. * Implement a common PrintAndThrowError function. * Fix the error using of __NVCC__, which should be instead with __CUDA_ARCH__. * Add dependency of phi. * Polish codes and unittest.
-
由 Leo Chen 提交于
* add timer to log deps * rename flag * add ut
-
由 Yuanle Liu 提交于
add pass/analysis_manager.h ir/type_name.h pass/pass_instrumentation.h pass/utils.h and adjust pass dir (#54170)
-
由 tianshuo78520a 提交于
* del sequence_enumerate_op * del analyzer_pyramid_dnn_tester * fix
-