- 02 6月, 2023 6 次提交
-
-
由 RedContritio 提交于
-
由 RedContritio 提交于
-
由 ronnywang 提交于
* [CustomDevice] add barrier op * update
-
由 Yuanle Liu 提交于
-
由 Wang Xin 提交于
* static graph autogen code for shape op * fix onednn * fix onednn
-
由 xiaoguoguo626807 提交于
-
- 01 6月, 2023 7 次提交
-
-
由 zhouweiwei2014 提交于
-
由 tianshuo78520a 提交于
* revert sequence_enumerate_op * restore sequence_enumerate_op
-
由 ronnywang 提交于
* [ROCM] fix multihead_matmul * skip bf16 uts * update
-
由 Wang Xin 提交于
* static graph autogen code for check_finite_and_unscale_ op * bug fixed
-
由 RedContritio 提交于
-
由 huangjiyi 提交于
* update * update cmake * update * update * update * update * Revert "update cmake" This reverts commit 1e1dc1b2bc9967b725201272607f939260070fd4. * update * update * update * update
-
由 tianshuo78520a 提交于
* mv all unittests test * fix error * fix error * fix * fix * del unittests * fix paddle_build.sh * fix * fix test * fix add test * fix * fix * fix * merge develop * fix * fix * fix * fix * fix * merge develop * fix test_async_read_write * fix test_async_read_write * merge develop * fix * fix import legacy_test * fix * fix * fix * fix * fix * fix * fix * fix * fix bug * fix * fix coverage test bug * fix * fix * fix * fix * fix * fix code sstyle * fix code * fix code * fix * fix * fix * del test_sequence_enumerate_op.py * fix
-
- 31 5月, 2023 1 次提交
-
-
由 RedContritio 提交于
* support auto generate for static op reduce_amax * set reduce_amax attr 'axis' type as IntArray
-
- 30 5月, 2023 5 次提交
-
-
由 risemeup1 提交于
* update_c++17 * update_c++17 * fix windows bug * solve cirle depend * solve cirle depend * solve cirle depend * solve cirle depend * solve cirle depend * fix windows bug * fix compiler error * fix compiler error * update eigen3 * update eigen3 * update eigen3 * fix mac-py3 compiler error * update C++17 * fix mac compiler error * fix compile error * fix coverage_compiler error * fix coverage_ci_problem * fix coverage_error * fix_kunlun200 compile error * fix kunlun200 compiler error * fix compile error * fix compiler error * fix py3 failed test * fix kunlun200 compiler error * test * fix test error * fix test error * fix test error * test * test * fix mac py3 error * fix mac py3 error * fix mac py3 error * fix test error * fix test error * fix compile error * fix compile error * fix compile error * test * test * fix compiler error * test * test * debug on ci * fix compiler error * fix compiler error * test * fix cinn compiler error * test * fix rocm cmpile error * fix cinn and kunlun compile error * update c++14 * Update flags.cmake
-
由 tianshuo78520a 提交于
* del sequence_enumerate_op * del analyzer_pyramid_dnn_tester * fix
-
由 RedContritio 提交于
-
由 huangjiyi 提交于
* update * update * update * update * update * update * update * update
-
由 RedContritio 提交于
* support auto generate for activation_op relu6 * add generated_static_op for activation_op in CMakeLists.txt
-
- 26 5月, 2023 1 次提交
-
-
由 YuanRisheng 提交于
* create phi so * fix ci bugs * fix py3 bugs * add file * fix py3 bugs * fix windows bugs * perfect so * fix py3 bugs * delete all static target in phi * fix windows bugs * fix py3 bugs * fix ci bugs * fix windows bugs * fix bugs: gflags can't be linked by dynamic and static lib * fix bugs that can not load 3rd party * fix ci bugs * fix compile bugs * fix py3 bugs * fix conflict * fix xpu bugs * fix mac compile bugs * fix psgpu bugs * fix inference failed * deal with conflict * fix LIBRARY_PATH bug * fix windows bugs * fix onednn error * fix windows compile bugs * fix windows compile bugs * fix test_cuda_graph_static_mode_error aborted * fix windows bugs * fix mac-python3 error * fix hip compile bugs * change mode to static * change to static mode * fix ci bugs * fix py3 bugs * fix windows bugs * fix bugs * add static flag * add PADDLE_API * change position of PADDLE_API * fix windows bugs * change mode to dynamic lib * fix windows static bugs * deal with conflict * fix windows unit bug * fix coverage * deal with conflict * fix windows-inference * fix py3 bugs * fix bugs when compile type_info * fix compile bugs * fix py3 bugs * fix windows bugs * fix windows openblas * fix xpu bugs * fix enforce_test in windows * update code according comment * fix windows cmake bug * fix windows bugs * fix windows bugs * delete cinn unittest * fix cinn bugs --------- Co-authored-by: lzydev <1528794076@qq.com>
-
- 25 5月, 2023 2 次提交
-
-
由 zhouweiwei2014 提交于
-
由 ronnywang 提交于
-
- 24 5月, 2023 1 次提交
-
-
由 Yiqun Liu 提交于
* Try to increase the repeat of autotune and fix the setting of allow_tf32_cublas. * Change the repeat of cublaslt to 10. * Use FLAGS_cublaslt_exhaustive_search_times as repeats. * Fix compiling error on CI. * Polish the key and simplify codes.
-
- 23 5月, 2023 11 次提交
-
-
由 Fisher 提交于
* Enable check_cinn on some tests Tests: bitwise, compare, shape, assign_value, sum, expand_v2, lookup_table, lookup_table_v2 * Enable more CINN tests Tests with CINN: expand_v2, matmul, matmul_v2, mul, norm, one_hot_v2 Add target select in cinn_launch_op * Revert test_mul_op * Improve op unit tests
-
由 LiYuRio 提交于
-
由 gouzil 提交于
* [phi] autogen code tril_triu * [phi][api]fix tril_triu_grad args * [fluid] clean cmake; [phi] fix infer_meta
-
由 co63oc 提交于
-
由 cyberslack_lee 提交于
-
由 huangjiyi 提交于
* update * update * update * update * update * update * update * update * update * update * update * update * update * update * update * update HostAlloc * update param name * update cpu kernel * remove kernel header * update * update
-
由 huangjiyi 提交于
* update * update * update * set out dtype
-
由 Wang Xin 提交于
* static graph autogen code support for pad3d op * bug fixed * add ut for pad3d mkldnn op * fix coverage * fix bug * fix bug * Delete test_pad3d_mkldnn_op.py
-
由 ronnywang 提交于
* [CustomDevice] fix auto_paralell * update * update * update
-
由 LoneRanger 提交于
* fix the static op generation for group_norm * fix bug of mismatch * fix bug of AssertionError * fix setting of composite
-
由 HongyuJia 提交于
* [0D-Tensor] Support elementwise_add * support elementwise_add ZeroDim2&3
-
- 22 5月, 2023 3 次提交
-
-
由 risemeup1 提交于
* update_c++14_to_c++17_on_windows * disable test_audio_logmel_feature and test_audio_mel_feature
-
由 Yuanle Liu 提交于
[Inference] add config.enable_low_precision_io api and remove rely on AnalysisConfig::Precison in trt (#52485)
-
由 Tian Zheng 提交于
* Add GPU kernel for multiclass_nms3 op * Make multiclass_nms3 gpu kernel output consistent with cpu kernel * Fix API incompatibility * Fix unittests on builds without CUDA * Fix ROCM build * Remove fluid headers; Use default atol for unittest * Change function and variable naming * Add comments; Reduce redundant code * Use paddle test framework
-
- 19 5月, 2023 3 次提交
-
-
由 warrentdrew 提交于
* add minimum grad composite rules * add public python api * fix format * fix format * update testcase * fix testcase * fix format * fix cmakelist.txt * fix format * fix param problem * fix op and composite rule * fix bf16 cpu support problem * fix bf16 cpu issue * fix axis error log * add axis for maximum * revert commit * remove .orig * fix generic problem * revert max op * fix axis error * fix maximum axis * fix test_check_output * fix cinn * fix minimum maximum axis check
-
由 limingshu 提交于
* Reorganize the forward codes of flash-attention. * Fix forward. * Remove some noused codes. * Simplify codes and fix backward. * Change all LOG(INFO) to VLOG and fix the backward. * add scale for AF2 flash_attn, much thanks to xreki and shaojie for debug these codes * decrease the effect of debug print on performance * Unify the initialize of flashattn arguments. * Rewirte the reshape of temp_mask and temp_bias. * API support use_flash_attn. * Fix compiling error on CI. * Try to crop the flash-attention lib. * Correct the condition of whether can use flash-attn. * Remove the softmax_out argument. * Remove is_causal. * Polish codes. * Fix qkv_transpose_out's shape and scaling of Q * K. * Update commit of flash-attention. --------- Co-authored-by: NLiu Yiqun <liuyiqun01@baidu.com>
-
由 Galaxy1458 提交于
-