- 28 Jul, 2023 2 commits
- 27 Jul, 2023 1 commit
  - Committed by GGBond8488
【inplace api】batch add inplace api paddle.log_, paddle.i0_, paddle.nn.functional.leaky_relu_... (#55576) * batch add inplace api * add inplace test * add activation inplace * fix test * remove atan2 ge, gt, le, lt, nq * remove atan2 ge, gt, le, lt, nq * fix windows ci error * rerun ci * fix typro * fix bugs --------- Co-authored-by: zhangrui34 <v_zhangrui34@baidu.com>
- 25 Jul, 2023 3 commits
  - Committed by lishicheng1996
  - Committed by Jeng Bai-Cheng
* Fix LayerNormForward perf issue * Bugfix, fast_layer_norm OOB * apply pre-commit --------- Co-authored-by: Shijie Wang <jaywan@nvidia.com>
  - Committed by 傅剑寒
- 18 Jul, 2023 1 commit
  - Committed by GGBond8488
* batch add inpalce api * fix inplace fn generate * add test for new inpalce api * fix typro * fix typro * fix typro * fix test error * fix atan2 * remove atan2 * auto genereate inpalce api * fix inplace generate fn error * fix windows error * fix test error * fix test error * fix windows ci error * fix test error * fix test_error * fix test error * fix eigen aliasing error in inplace * remove elementwise_pow inplace * fix doc error * fix test error
- 17 Jul, 2023 1 commit
  - Committed by zhangyikun02
* TensorSetConstantXPU support to use xpu::constant when T is float/float16 * add xpu_wait for TensorSetConstantXPU
- 13 Jul, 2023 3 commits
  - Committed by freeliuzc
* add init value for CudaSwishFunctor * add new phi kernel fusedBiasActKernel
  - Committed by RichardWooSJTU
* add matmul int8
  - Committed by Qi Shao
* modify the accuracy checking framework of bf16 optest, including both of forward and backward
- 12 Jul, 2023 1 commit
  - Committed by Wang Xin
* [clang-tidy] enable readability-container-size-empty check * fix test_custom_kernel Failed * add clang-tid-10 in dockerfile * add clang-tidy in dockerfile * fix bug
- 11 Jul, 2023 1 commit
  - Committed by ronnywang
- 05 Jul, 2023 1 commit
  - Committed by LUZY0726
- 04 Jul, 2023 1 commit
  - Committed by ronnywang
- 03 Jul, 2023 2 commits
  - Committed by ronnywang
* [CustomDevice] release device manager in py::atexit * fix hip_version macro * update * update
  - Committed by FormlessUnit
* add linear_compress API
- 29 Jun, 2023 1 commit
  - Committed by Yiqun Liu
* Fix compiling on XPU related to MPTypeTrait. * Unify the use of MPTypeTrait. * Fix compiling error.
- 28 Jun, 2023 1 commit
  - Committed by ronnywang
* [ROCM] fix cupti, hipcub * update * update
- 20 Jun, 2023 1 commit
  - Committed by Yiqun Liu
- 15 Jun, 2023 1 commit
  - Committed by Hui Zhang
* fix for log xxx * add int32/int64 for cpu/gpu; add float16/bfloat16 for cpu forward * fix docstring * fix bug * fix bugs * fix bugs * fix bugs * fix bugs * fix bug * using cast * fix test * fix api * fix other bugs * fix ci bug for not using dygraph guard * add bfloat16 test * fix ut * bf16 * exp/expm1 support int32/int64 * fix ut * fix ut * fix ut
- 14 Jun, 2023 2 commits
  - Committed by Chitsing KUI
  - Committed by zhouweiwei2014
* [Zero-Dim] paddle.nanmedian support 0D Tensor * fix CI
- 12 Jun, 2023 1 commit
  - Committed by Hui Zhang
* fix for log xxx * add int32/int64 for cpu/gpu; add float16/bfloat16 for cpu forward * fix docstring * fix bug * fix bugs * fix bugs * fix bugs * fix bugs * fix bug * using cast * fix test * fix api * fix other bugs * fix ci bug for not using dygraph guard * add bfloat16 test * fix ut * bf16
- 09 Jun, 2023 1 commit
  - Committed by pangengzheng
* support add(x_float32, bfloa16_) or add(x_float32, y_float16) * polish * fix test
- 08 Jun, 2023 1 commit
  - Committed by Yiqun Liu
* Add outputs to check_numerics_kernel. * Add check_numerics to yaml. * Add API and unittest. * Add check_nan_inf_level as argument of check_numerics_kernel. * Add more unittests. * Fix static API implementation and unittest. * Move the implementation of check_numerics to paddle.amp. * Fix import error.
- 06 Jun, 2023 1 commit
  - Committed by Zhang Zheng
* Fix compilation error by using thrust * fix
- 03 Jun, 2023 1 commit
  - Committed by Scotty
- 02 Jun, 2023 3 commits
  - Committed by Hui Zhang
* floor div support float/double/bfloat16/float16 * add ut * fix bug * fix fft.ifftshift for floor_divide upgrade * fix comment * fix bugs * fix bug
  - Committed by Zhang Zheng
* Optimize perf of broadcast matmul * support more dtype
  - Committed by 傅剑寒
- 01 Jun, 2023 1 commit
  - Committed by ronnywang
* [ROCM] fix multihead_matmul * skip bf16 uts * update
- 31 May, 2023 1 commit
  - Committed by Charles-hit
* support activation prim op bf16 dtype * remove useless code
- 30 May, 2023 2 commits
  - Committed by risemeup1
* update_c++17 * update_c++17 * fix windows bug * solve cirle depend * solve cirle depend * solve cirle depend * solve cirle depend * solve cirle depend * fix windows bug * fix compiler error * fix compiler error * update eigen3 * update eigen3 * update eigen3 * fix mac-py3 compiler error * update C++17 * fix mac compiler error * fix compile error * fix coverage_compiler error * fix coverage_ci_problem * fix coverage_error * fix_kunlun200 compile error * fix kunlun200 compiler error * fix compile error * fix compiler error * fix py3 failed test * fix kunlun200 compiler error * test * fix test error * fix test error * fix test error * test * test * fix mac py3 error * fix mac py3 error * fix mac py3 error * fix test error * fix test error * fix compile error * fix compile error * fix compile error * test * test * fix compiler error * test * test * debug on ci * fix compiler error * fix compiler error * test * fix cinn compiler error * test * fix rocm cmpile error * fix cinn and kunlun compile error * update c++14 * Update flags.cmake
  - Committed by Yiqun Liu
* Reimplement the check_nan_inf function as check_numerics kernel. * Remove the cpu implemention to phi. * Add ifdef for the including of omp.h. * Move the use of FLAGS_check_nan_inf_level out of header file. * Implement a common PrintAndThrowError function. * Fix the error using of __NVCC__, which should be instead with __CUDA_ARCH__. * Add dependency of phi. * Polish codes and unittest.
- 26 May, 2023 1 commit
  - Committed by YuanRisheng
* create phi so * fix ci bugs * fix py3 bugs * add file * fix py3 bugs * fix windows bugs * perfect so * fix py3 bugs * delete all static target in phi * fix windows bugs * fix py3 bugs * fix ci bugs * fix windows bugs * fix bugs: gflags can't be linked by dynamic and static lib * fix bugs that can not load 3rd party * fix ci bugs * fix compile bugs * fix py3 bugs * fix conflict * fix xpu bugs * fix mac compile bugs * fix psgpu bugs * fix inference failed * deal with conflict * fix LIBRARY_PATH bug * fix windows bugs * fix onednn error * fix windows compile bugs * fix windows compile bugs * fix test_cuda_graph_static_mode_error aborted * fix windows bugs * fix mac-python3 error * fix hip compile bugs * change mode to static * change to static mode * fix ci bugs * fix py3 bugs * fix windows bugs * fix bugs * add static flag * add PADDLE_API * change position of PADDLE_API * fix windows bugs * change mode to dynamic lib * fix windows static bugs * deal with conflict * fix windows unit bug * fix coverage * deal with conflict * fix windows-inference * fix py3 bugs * fix bugs when compile type_info * fix compile bugs * fix py3 bugs * fix windows bugs * fix windows openblas * fix xpu bugs * fix enforce_test in windows * update code according comment * fix windows cmake bug * fix windows bugs * fix windows bugs * delete cinn unittest * fix cinn bugs --------- Co-authored-by: lzydev <1528794076@qq.com>
- 25 May, 2023 1 commit
  - Committed by thunder95
- 24 May, 2023 1 commit
  - Committed by Yiqun Liu
* Try to increase the repeat of autotune and fix the setting of allow_tf32_cublas. * Change the repeat of cublaslt to 10. * Use FLAGS_cublaslt_exhaustive_search_times as repeats. * Fix compiling error on CI. * Polish the key and simplify codes.
- 23 May, 2023 1 commit
  - Committed by cyberslack_lee
- 22 May, 2023 1 commit
  - Committed by risemeup1
* update_c++14_to_c++17_on_windows * disable test_audio_logmel_feature and test_audio_mel_feature