- 31 May, 2023 1 commit
-
-
Committed by DrowFish19
* fix the strong Wolfe conditions for line search * pass the max_line_search_iters parameter into line search method (strong_wolfe)
-
- 30 May, 2023 6 commits
-
-
Committed by Yiqun Liu
* Reimplement the check_nan_inf function as check_numerics kernel. * Remove the cpu implementation to phi. * Add ifdef for the including of omp.h. * Move the use of FLAGS_check_nan_inf_level out of header file. * Implement a common PrintAndThrowError function. * Fix the erroneous use of __NVCC__, which should be replaced with __CUDA_ARCH__. * Add dependency of phi. * Polish codes and unittest.
-
Committed by zhouweiwei2014
-
Committed by xiaoguoguo626807
* modify gradOpMaker * modify concat bug * modify concat bug * delete unnecessary block * modify fill_any_like value from 1 to 0 * modify fill_any_like dtype from other to -1 * ci_bug
-
Committed by lijialin03
-
Committed by RedContritio
-
Committed by Yulong Ao
* [Auto Parallel] Reorganize the fold structure * [Auto Parallel] Fix some import errors
-
- 26 May, 2023 4 commits
-
-
Committed by Leo Chen
* Copy QAT files from PaddleSlim * Integrate QAT API into Paddle * Replace eval function * Reduce test_quant_aware run time * Apply new formatter on modified files * Remove code check for Paddle version check * Copy quant_post_quant_aware UT from PaddleSlim * Integrate test_quant_post_quant_aware UT into PaddlePaddle * Apply new formatter on modified files * Remove redundant code and add unittests * Add new unittests * Update the time limit of new unittests
-
Committed by YuanRisheng
* create phi so * fix ci bugs * fix py3 bugs * add file * fix py3 bugs * fix windows bugs * perfect so * fix py3 bugs * delete all static target in phi * fix windows bugs * fix py3 bugs * fix ci bugs * fix windows bugs * fix bugs: gflags can't be linked by dynamic and static lib * fix bugs that can not load 3rd party * fix ci bugs * fix compile bugs * fix py3 bugs * fix conflict * fix xpu bugs * fix mac compile bugs * fix psgpu bugs * fix inference failed * deal with conflict * fix LIBRARY_PATH bug * fix windows bugs * fix onednn error * fix windows compile bugs * fix windows compile bugs * fix test_cuda_graph_static_mode_error aborted * fix windows bugs * fix mac-python3 error * fix hip compile bugs * change mode to static * change to static mode * fix ci bugs * fix py3 bugs * fix windows bugs * fix bugs * add static flag * add PADDLE_API * change position of PADDLE_API * fix windows bugs * change mode to dynamic lib * fix windows static bugs * deal with conflict * fix windows unit bug * fix coverage * deal with conflict * fix windows-inference * fix py3 bugs * fix bugs when compile type_info * fix compile bugs * fix py3 bugs * fix windows bugs * fix windows openblas * fix xpu bugs * fix enforce_test in windows * update code according comment * fix windows cmake bug * fix windows bugs * fix windows bugs * delete cinn unittest * fix cinn bugs --------- Co-authored-by: lzydev <1528794076@qq.com>
-
Committed by zhaoyingli
* global view process_group * fix import * fix attr * fix tunner init comm
-
Committed by risemeup1
* fix test error * fix test_exectuor_feed_non_tensor * fix test error
-
- 25 May, 2023 5 commits
-
-
Committed by HongyuJia
-
Committed by zhangkaihuo
-
Committed by thunder95
-
Committed by zhouweiwei2014
-
Committed by houj04
-
- 24 May, 2023 3 commits
-
-
Committed by Leo Chen
-
Committed by zhangkaihuo
-
Committed by Haohongxiang
-
- 23 May, 2023 10 commits
-
-
Committed by Zhang Zheng
* [AMP OP&Test] Support float16 in selu * fix
-
Committed by Fisher
* Enable check_cinn on some tests. Tests: bitwise, compare, shape, assign_value, sum, expand_v2, lookup_table, lookup_table_v2 * Enable more CINN tests. Tests with CINN: expand_v2, matmul, matmul_v2, mul, norm, one_hot_v2. Add target select in cinn_launch_op * Revert test_mul_op * Improve op unit tests
-
Committed by co63oc
-
Committed by co63oc
-
Committed by co63oc
-
Committed by cyberslack_lee
-
Committed by Leo Chen
* add host memory stats * add ut
-
Committed by ronnywang
* [CustomDevice] fix auto_paralell * update * update * update
-
Committed by zxcd
* fix processing logic of the arange function when dtype is empty. * update commit version * fix ValueError when end is None. * add unittest for new case. * fix tensor type. * remove paddle.to_tensor(), add more unit tests. * remove useless line. * fix enable_static * add new unit test. * fix by comment.
-
Committed by HongyuJia
* [0D-Tensor] Support elementwise_add * support elementwise_add ZeroDim2&3
-
- 22 May, 2023 8 commits
-
-
Committed by zhenhailiu
* unify code * remove useless code * polish * python/paddle/distributed/fleet/meta_parallel/pipeline_parallel.py * polish * polish
-
Committed by Meteor Liu
* [dygraph]unify _non_static_mode() in_dygraph_mode() and in_dynamic_mode() * [dygraph]unify _non_static_mode() in_dygraph_mode() and in_dynamic_mode() * [dygraph]unify _non_static_mode() in_dygraph_mode() and in_dynamic_mode() * [dygraph]unify _non_static_mode() in_dygraph_mode() and in_dynamic_mode() * [dygraph]unify _non_static_mode() in_dygraph_mode() and in_dynamic_mode() * [dygraph]unify _non_static_mode() in_dygraph_mode() and in_dynamic_mode() * fixed cyclic reference that caused partial import * fixed bad change * fix bad import * fix bad import * fix bad import * fix ut failed caused by change in_dynamic_mode * fix ut failed caused by change in_dynamic_mode * fixed usage of in_dynamic_mode() or in_dygraph_mode() * revert python3 to python in .pre-commit-config.yaml * fix merge conflicts
-
Committed by Zhang Ting
-
Committed by niuliling123
-
Committed by niuliling123
-
Committed by JYChen
-
Committed by Tian Zheng
* Add GPU kernel for multiclass_nms3 op * Make multiclass_nms3 gpu kernel output consistent with cpu kernel * Fix API incompatibility * Fix unittests on builds without CUDA * Fix ROCM build * Remove fluid headers; Use default atol for unittest * Change function and variable naming * Add comments; Reduce redundant code * Use paddle test framework
-
Committed by niuliling123
Print python trace back when debugmode = CHECK_NAN_INF_AND_ABORT and backward has nan/inf (#52808)
-
- 20 May, 2023 1 commit
-
-
Committed by ShenLiang
-
- 19 May, 2023 2 commits
-
-
Committed by warrentdrew
* add minimum grad composite rules * add public python api * fix format * fix format * update testcase * fix testcase * fix format * fix cmakelist.txt * fix format * fix param problem * fix op and composite rule * fix bf16 cpu support problem * fix bf16 cpu issue * fix axis error log * add axis for maximum * revert commit * remove .orig * fix generic problem * revert max op * fix axis error * fix maximum axis * fix test_check_output * fix cinn * fix minimum maximum axis check
-
Committed by limingshu
* Reorganize the forward codes of flash-attention. * Fix forward. * Remove some unused codes. * Simplify codes and fix backward. * Change all LOG(INFO) to VLOG and fix the backward. * add scale for AF2 flash_attn, much thanks to xreki and shaojie for debug these codes * decrease the effect of debug print on performance * Unify the initialize of flashattn arguments. * Rewrite the reshape of temp_mask and temp_bias. * API support use_flash_attn. * Fix compiling error on CI. * Try to crop the flash-attention lib. * Correct the condition of whether can use flash-attn. * Remove the softmax_out argument. * Remove is_causal. * Polish codes. * Fix qkv_transpose_out's shape and scaling of Q * K. * Update commit of flash-attention. --------- Co-authored-by: NLiu Yiqun <liuyiqun01@baidu.com>
-