- 22 8月, 2023 1 次提交
-
-
由 Ruibin Cheung 提交于
-
- 21 8月, 2023 3 次提交
-
-
由 ronnywang 提交于
-
由 feifei-111 提交于
-
由 Lu Qi 提交于
-
- 18 8月, 2023 2 次提交
-
-
由 huangjiyi 提交于
-
由 lzy 提交于
[Inference] Make share_external_data supports bf16 and bool; fix while_op cache_inference_while_scope when using fleet_executor. (#56055) * 1. make share_external_data supports bf16 and bool; 2. don't drop_kids when cache_inference_while_scope * fix FLAGS_cache_inference_while_scope * add unitest * add unitest * skip unitest when cudnn_version < 8100 * skip test share_external_data_bf16 when CUDA_ARCH < 80
-
- 17 8月, 2023 2 次提交
-
-
由 RichardWooSJTU 提交于
-
由 hong 提交于
* fix op translator reshape type * new ir support vector type place transfer * add test case * fix new ir bn grad bug
-
- 16 8月, 2023 4 次提交
-
-
由 huangjiyi 提交于
* update * update
-
由 zxcd 提交于
-
由 Sonder 提交于
-
由 Ruibin Cheung 提交于
* [Fluid] Migrate c_embedding to PHI * fix * add python_api * fix ut * migrate xpu kernel * fix windows compile error
-
- 15 8月, 2023 3 次提交
-
-
由 xiaoxiaohehe001 提交于
* support_mmha * add_python_api * add_api_doc * fix_doc_error * fix_infermeta * add_infermeta * add_bf16_cuda_check * add_bf16_check * fix_ci_windows * fix_ci_windows_kernel_register * fix_test_mmha * add_cumoffsets * remove_bias * delete_mmha_reshape_input_output * rename_delete_hfile * remove_fluid --------- Co-authored-by: Nyangjianfengo1 <yangjianfeng01@baidu.com>
-
由 lzydev 提交于
* Improve GC for pipeline parallel * Delete print * fix bug of nop_op and sharding --------- Co-authored-by: Nchenruibiao <chenruibiao@baidu.com>
-
由 Ruibin Cheung 提交于
* [Fluid] Migrate number_count to PHI * fix out alloc * fix ut (add python_api)
-
- 14 8月, 2023 3 次提交
-
-
由 gouzil 提交于
-
由 cyberslack_lee 提交于
-
由 Sonder 提交于
-
- 09 8月, 2023 2 次提交
-
-
由 Xinyu Chen 提交于
* onednn: rename macro to PADDLE_WITH_DNNL * onednn: rename macro to CINN_WITH_DNNL
-
由 Ruibin Cheung 提交于
-
- 08 8月, 2023 3 次提交
-
-
由 Wang Xin 提交于
* move decayed_adagrad_op to phi * fix bug
-
由 huangjiyi 提交于
-
由 Ruibiao Chen 提交于
* Improve GC for pipeline parallel * Delete print
-
- 07 8月, 2023 3 次提交
-
-
由 Yuanle Liu 提交于
* fix cudnn 8.7+ bug on cudnnConvolutionBiasActivationForward * save_optimized_model_pass support tensorrt * update * update * fix compile * update * fix ut timeout
-
由 gouzil 提交于
-
由 Ruibin Cheung 提交于
-
- 04 8月, 2023 2 次提交
-
-
由 Ruibin Cheung 提交于
* [clang-tidy] enable modernize-use-emplace * Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into modernize_use_emplace
-
由 Zhenghai Zhang 提交于
-
- 03 8月, 2023 3 次提交
-
-
由 yangguohao 提交于
-
由 Wang Xin 提交于
-
由 WangZhen 提交于
-
- 01 8月, 2023 4 次提交
-
-
由 Sonder 提交于
* move prune_gate_by_capacity to phi * fix * fix registe info * remove useless codes
-
由 gouzil 提交于
-
由 hong 提交于
* new ir support print op * fix gpu bug * fix bug * update * remove layout to string * remove usless header * polish code * fix bug * posolis code
-
由 LiYuRio 提交于
* use string as key for comm_context_manager * remove device_id from comm_context
-
- 31 7月, 2023 3 次提交
-
-
由 Sonder 提交于
-
由 wanghuancoder 提交于
support stride
-
由 zhangyuqin1998 提交于
* rename BatchNormGradFunctor * Update batch_norm_grad_kernel.cc * Update batch_norm_grad_kernel.cu * Update batch_norm_grad_kernel.cc * fix * Update batch_norm_grad_kernel.cc
-
- 28 7月, 2023 1 次提交
-
-
由 hong 提交于
* new ir support save combine * update * polish code * update * new ir support fluid op * remove depulicate op * fix ir exe test compile error * fix compile bug * update * code format * update * update * polish code
-
- 27 7月, 2023 1 次提交
-
-
由 ming1753 提交于
* Paddle-TRT support float64 in/out type, support fill_any_like_op in int64
-