- 04 7月, 2023 2 次提交
-
-
由 hong19860320 提交于
* Add XPU plugin to support the customized ops or improve the performance of the fusion ops based on hand-written xpu micro kernels. * refine README.md
-
由 ronnywang 提交于
-
- 03 7月, 2023 5 次提交
-
-
由 jiangfan06 提交于
[XPU] Fix the topk, set_value ops that using temporary tensors avoiding the memory overlaps during multi-stream inference (#54851)
-
由 ronnywang 提交于
* [CustomDevice] release device manager in py::atexit * fix hip_version macro * update * update
-
由 LoneRanger 提交于
* add lerp bf16 support * fix bug * Update test_lerp_op.py modify the input dtype * modify the test_lerp_op.py * Update test_lerp_op.py * fix bug of import * add user_defined_grads * Update test_lerp_op.py * fix bug of grad * fix bug of grad * fix bug of grad * add the check for bfloat16 dtype
-
由 FormlessUnit 提交于
* add linear_compress API
-
由 niuliling123 提交于
-
- 02 7月, 2023 1 次提交
-
-
由 hong 提交于
* fix_fetch_op_and_null_type_bug * fix compile bug * add test case
-
- 30 6月, 2023 1 次提交
-
-
由 mjp9527 提交于
-
- 29 6月, 2023 3 次提交
-
-
由 Yiqun Liu 提交于
* Fix compiling on XPU related to MPTypeTrait. * Unify the use of MPTypeTrait. * Fix compiling error.
-
由 niuliling123 提交于
* style * more * update ctest * Update legacy_backward.yaml * Update legacy_ops.yaml * Update legacy_ops.yaml * update * update * update for move
-
由 haosicheng 提交于
-
- 28 6月, 2023 4 次提交
-
-
由 lijin23 提交于
* add int_with_ll to conv * fix bugs when output_size is specified for conv2d_transpose
-
由 Siming Dai 提交于
* add pos_weight in kernel * fix unittest * fix xpu * fix bce unittest, change infermeta order
-
由 ronnywang 提交于
* [ROCM] fix cupti, hipcub * update * update
-
由 Yiqun Liu 提交于
-
- 27 6月, 2023 2 次提交
-
-
由 zhangyuqin1998 提交于
* delete swish_raw * fix * Update activation_kernel.cc * fix
-
由 TaoTao Li 提交于
* add all_to_all phi operator, kernel, api * add all_to_all ut * tinyfix
-
- 26 6月, 2023 2 次提交
-
-
由 pangengzheng 提交于
-
由 Sonder 提交于
* remove ops from OpsWithFluidKernelNeedMoveToPhi set * open static build flag * OpsWithFluidKernelNeedMoveToPhi * open new_executor_static_build * add infermate for cudnn_lstm * fix * update * fix * update * update * update * fix pow2 decay * fix pow2 decay * recover analysis_predictor.cc * fix pow2 decay * fix cudnn lstm * add output register info for svd * fix pow2_decay_with_linear_warmup_kernel * recover test lstm cudnn * recover svg register codes * fix register info * fix reduce sum register info * add output info for adadelta * add output info for adadelta * add output info for adamax * fix complex abs register info * add register info for cudnn_lstm_grad * recover * fix lstm cudnn * fix * fix xpu output registe info * remove std::cout * add backend * remove output info in pow2_decay_with_linear_warmup_kernel * add judgment in TensorShouldBeFakeInitialized * recover power_ * close new_executor_static_build * fix set_value_xpu
-
- 25 6月, 2023 2 次提交
- 21 6月, 2023 1 次提交
-
-
由 lijin23 提交于
-
- 20 6月, 2023 4 次提交
- 19 6月, 2023 2 次提交
- 16 6月, 2023 3 次提交
-
-
由 cyber-pioneer 提交于
-
由 zhiboniu 提交于
* fix lamb always_adapt * fix optest * fix all optests
-
由 cyber-pioneer 提交于
-
- 15 6月, 2023 1 次提交
-
-
由 Hui Zhang 提交于
* fix for log xxx * add int32/int64 for cpu/gpu; add float16/bfloat16 for cpu forward * fix docstring * fix bug * fix bugs * fix bugs * fix bugs * fix bugs * fix bug * using cast * fix test * fix api * fix other bugs * fix ci bug for not using dygraph guard * add bfloat16 test * fix ut * bf16 * exp/expm1 support int32/int64 * fix ut * fix ut * fix ut
-
- 14 6月, 2023 4 次提交
-
-
由 Chitsing KUI 提交于
-
由 Charles-hit 提交于
-
由 zhouweiwei2014 提交于
* [Zero-Dim] paddle.nanmedian support 0D Tensor * fix CI
-
由 sneaxiy 提交于
* fix A100 CUDA12 ut * fix ci uts * fix test_sync_batch_norm_op * fix sync bn op ut again by separating 2 files * fix codestyle ci * combine other PRs * fix codestyle * fix codestyle ci
-
- 13 6月, 2023 1 次提交
-
-
由 NetPunk 提交于
-
- 12 6月, 2023 2 次提交
-
-
由 Hui Zhang 提交于
* fix for log xxx * add int32/int64 for cpu/gpu; add float16/bfloat16 for cpu forward * fix docstring * fix bug * fix bugs * fix bugs * fix bugs * fix bugs * fix bug * using cast * fix test * fix api * fix other bugs * fix ci bug for not using dygraph guard * add bfloat16 test * fix ut * bf16
-
由 Zhang Jun 提交于
* support bias's rank equal to input's rank
-