- 03 Aug, 2023 1 commit
-
-
Committed by Wang Xin
-
- 02 Aug, 2023 2 commits
-
-
Committed by gouzil
* [clang-tidy] modernize-avoid-c-arrays * rollback * [clang-tidy] fix * close modernize-avoid-c-arrays * fix PHI_DEFINE_string; add PHI_DEFINE_bool NOLINT * fix PHI_DEFINE_string * fix next_h_state and parity err * fix win32 * fix cuda_graph * fix accuracy_kernel * fix math_function * fix fused_softmax_mask_kernel.cu load_data and warp_reduce; rollback concat_and_split_functor ins_addr * fix fused_dropout_add_grad_kernel * fix * rollback cu * rollback concat_and_split_functor.cu * rollback
-
Committed by wanghuancoder
* fix security bug
-
- 01 Aug, 2023 2 commits
- 31 Jul, 2023 1 commit
-
-
Committed by wanghuancoder
support stride
-
- 28 Jul, 2023 2 commits
- 27 Jul, 2023 1 commit
-
-
Committed by GGBond8488
【inplace api】batch add inplace api paddle.log_, paddle.i0_, paddle.nn.functional.leaky_relu_... (#55576) * batch add inplace api * add inplace test * add activation inplace * fix test * remove atan2 ge, gt, le, lt, nq * remove atan2 ge, gt, le, lt, nq * fix windows ci error * rerun ci * fix typro * fix bugs --------- Co-authored-by: zhangrui34 <v_zhangrui34@baidu.com>
-
- 25 Jul, 2023 3 commits
-
-
Committed by lishicheng1996
-
Committed by Jeng Bai-Cheng
* Fix LayerNormForward perf issue * Bugfix, fast_layer_norm OOB * apply pre-commit --------- Co-authored-by: Shijie Wang <jaywan@nvidia.com>
-
Committed by 傅剑寒
-
- 18 Jul, 2023 1 commit
-
-
Committed by GGBond8488
* batch add inpalce api * fix inplace fn generate * add test for new inpalce api * fix typro * fix typro * fix typro * fix test error * fix atan2 * remove atan2 * auto genereate inpalce api * fix inplace generate fn error * fix windows error * fix test error * fix test error * fix windows ci error * fix test error * fix test_error * fix test error * fix eigen aliasing error in inplace * remove elementwise_pow inplace * fix doc error * fix test error
-
- 17 Jul, 2023 1 commit
-
-
Committed by zhangyikun02
* TensorSetConstantXPU support to use xpu::constant when T is float/float16 * add xpu_wait for TensorSetConstantXPU
-
- 13 Jul, 2023 3 commits
-
-
Committed by freeliuzc
* add init value for CudaSwishFunctor * add new phi kernel fusedBiasActKernel
-
Committed by RichardWooSJTU
* add matmul int8
-
Committed by Qi Shao
* modify the accuracy checking framework of bf16 optest, including both of forward and backward
-
- 12 Jul, 2023 1 commit
-
-
Committed by Wang Xin
* [clang-tidy] enable readability-container-size-empty check * fix test_custom_kernel Failed * add clang-tid-10 in dockerfile * add clang-tidy in dockerfile * fix bug
-
- 11 Jul, 2023 1 commit
-
-
Committed by ronnywang
-
- 05 Jul, 2023 1 commit
-
-
Committed by LUZY0726
-
- 04 Jul, 2023 1 commit
-
-
Committed by ronnywang
-
- 03 Jul, 2023 2 commits
-
-
Committed by ronnywang
* [CustomDevice] release device manager in py::atexit * fix hip_version macro * update * update
-
Committed by FormlessUnit
* add linear_compress API
-
- 29 Jun, 2023 1 commit
-
-
Committed by Yiqun Liu
* Fix compiling on XPU related to MPTypeTrait. * Unify the use of MPTypeTrait. * Fix compiling error.
-
- 28 Jun, 2023 1 commit
-
-
Committed by ronnywang
* [ROCM] fix cupti, hipcub * update * update
-
- 20 Jun, 2023 1 commit
-
-
Committed by Yiqun Liu
-
- 15 Jun, 2023 1 commit
-
-
Committed by Hui Zhang
* fix for log xxx * add int32/int64 for cpu/gpu; add float16/bfloat16 for cpu forward * fix docstring * fix bug * fix bugs * fix bugs * fix bugs * fix bugs * fix bug * using cast * fix test * fix api * fix other bugs * fix ci bug for not using dygraph guard * add bfloat16 test * fix ut * bf16 * exp/expm1 support int32/int64 * fix ut * fix ut * fix ut
-
- 14 Jun, 2023 2 commits
-
-
Committed by Chitsing KUI
-
Committed by zhouweiwei2014
* [Zero-Dim] paddle.nanmedian support 0D Tensor * fix CI
-
- 12 Jun, 2023 1 commit
-
-
Committed by Hui Zhang
* fix for log xxx * add int32/int64 for cpu/gpu; add float16/bfloat16 for cpu forward * fix docstring * fix bug * fix bugs * fix bugs * fix bugs * fix bugs * fix bug * using cast * fix test * fix api * fix other bugs * fix ci bug for not using dygraph guard * add bfloat16 test * fix ut * bf16
-
- 09 Jun, 2023 1 commit
-
-
Committed by pangengzheng
* support add(x_float32, bfloa16_) or add(x_float32, y_float16) * polish * fix test
-
- 08 Jun, 2023 1 commit
-
-
Committed by Yiqun Liu
* Add outputs to check_numerics_kernel. * Add check_numerics to yaml. * Add API and unittest. * Add check_nan_inf_level as argument of check_numerics_kernel. * Add more unittests. * Fix static API implementation and unittest. * Move the implementation of check_numerics to paddle.amp. * Fix import error.
-
- 06 Jun, 2023 1 commit
-
-
Committed by Zhang Zheng
* Fix compilation error by using thrust * fix
-
- 03 Jun, 2023 1 commit
-
-
Committed by Scotty
-
- 02 Jun, 2023 3 commits
-
-
Committed by Hui Zhang
* floor div support float/double/bfloat16/float16 * add ut * fix bug * fix fft.ifftshift for floor_divide upgrade * fix comment * fix bugs * fix bug
-
Committed by Zhang Zheng
* Optimize perf of broadcast matmul * support more dtype
-
Committed by 傅剑寒
-
- 01 Jun, 2023 1 commit
-
-
Committed by ronnywang
* [ROCM] fix multihead_matmul * skip bf16 uts * update
-
- 31 May, 2023 1 commit
-
-
Committed by Charles-hit
* support activation prim op bf16 dtype * remove useless code
-
- 30 May, 2023 1 commit
-
-
Committed by risemeup1
* update_c++17 * update_c++17 * fix windows bug * solve cirle depend * solve cirle depend * solve cirle depend * solve cirle depend * solve cirle depend * fix windows bug * fix compiler error * fix compiler error * update eigen3 * update eigen3 * update eigen3 * fix mac-py3 compiler error * update C++17 * fix mac compiler error * fix compile error * fix coverage_compiler error * fix coverage_ci_problem * fix coverage_error * fix_kunlun200 compile error * fix kunlun200 compiler error * fix compile error * fix compiler error * fix py3 failed test * fix kunlun200 compiler error * test * fix test error * fix test error * fix test error * test * test * fix mac py3 error * fix mac py3 error * fix mac py3 error * fix test error * fix test error * fix compile error * fix compile error * fix compile error * test * test * fix compiler error * test * test * debug on ci * fix compiler error * fix compiler error * test * fix cinn compiler error * test * fix rocm cmpile error * fix cinn and kunlun compile error * update c++14 * Update flags.cmake
-