- 18 8月, 2023 2 次提交
-
-
由 zhouweiwei2014 提交于
-
由 lzy 提交于
[Inference] Make share_external_data supports bf16 and bool; fix while_op cache_inference_while_scope when using fleet_executor. (#56055) * 1. make share_external_data supports bf16 and bool; 2. don't drop_kids when cache_inference_while_scope * fix FLAGS_cache_inference_while_scope * add unitest * add unitest * skip unitest when cudnn_version < 8100 * skip test share_external_data_bf16 when CUDA_ARCH < 80
-
- 17 8月, 2023 2 次提交
-
-
由 tianshuo78520a 提交于
This reverts commit fd765f61.
-
由 zhouweiwei2014 提交于
-
- 16 8月, 2023 1 次提交
-
-
由 Chen Weihang 提交于
* add phi forward api gen impl * add phi backward gen code * polish api code gen impl * polish code gen impl * remove auto_paralel namespace * add dygraph forward impl * add for_auto_parallel cond * fix code gen errors * add dygraph backward impl * resolve conflict with develop * refactor dist api gen impl * revert origin api gen impl * replace template for override func * fix dnnl marco error * revert third_party change * add with distributed marco * Update grad_tensor_holder.cc details * merge dist tensor constructor * change test tensor to replicate * fx typo * resolve conflict with develop * fix out dim error
-
- 15 8月, 2023 2 次提交
-
-
由 cyberslack_lee 提交于
-
由 LiYuRio 提交于
-
- 14 8月, 2023 3 次提交
-
-
由 cyberslack_lee 提交于
-
由 LiYuRio 提交于
-
由 zhangbo9674 提交于
* add code * fix bug * fix bug * delete unused code * refine code * fix bug * fix bug * fix bug * fix bug * fix bug
-
- 10 8月, 2023 2 次提交
- 09 8月, 2023 1 次提交
-
-
由 Xinyu Chen 提交于
* onednn: rename macro to PADDLE_WITH_DNNL * onednn: rename macro to CINN_WITH_DNNL
-
- 07 8月, 2023 1 次提交
-
-
由 LiYuRio 提交于
* make tcp store a global instance * fix windows compile error
-
- 04 8月, 2023 2 次提交
- 03 8月, 2023 2 次提交
- 02 8月, 2023 2 次提交
-
-
由 gouzil 提交于
* [clang-tidy] modernize-avoid-c-arrays * rollback * [clang-tidy] fix * close modernize-avoid-c-arrays * fix PHI_DEFINE_string; add PHI_DEFINE_bool NOLINT * fix PHI_DEFINE_string * fix next_h_state and parity err * fix win32 * fix cuda_graph * fix accuracy_kernel * fix math_function * fix fused_softmax_mask_kernel.cu load_data and warp_reduce; rollback concat_and_split_functor ins_addr * fix fused_dropout_add_grad_kernel * fix * rollback cu * rollback concat_and_split_functor.cu * rollback
-
由 zhangbo9674 提交于
* add interface * add code * add code * add code * add code * fix bug * fix bug * add var prefix * add code * add code * add code * fix compile bug * fix bug * refine code * refine code * refine code * refine code * fix bug * add code * add code * fix bug * add code * add code * refine code * refine code * fix bug * add code * fix bug in phi__kernel_utils * refine code * fix bug * open flag * refine code * fix bug * fix bug * refine code * fix bug
-
- 01 8月, 2023 2 次提交
-
-
由 YuanRisheng 提交于
-
由 LiYuRio 提交于
* use string as key for comm_context_manager * remove device_id from comm_context
-
- 31 7月, 2023 2 次提交
-
-
由 YuanRisheng 提交于
* support construct network using ir in Python * fix py3 bugs * change source return type to OpResult
-
由 wanghuancoder 提交于
support stride
-
- 26 7月, 2023 2 次提交
- 24 7月, 2023 1 次提交
-
-
由 Chen Weihang 提交于
* simplify dist tensor namespace path * fix tensor dist attr decl error
-
- 21 7月, 2023 1 次提交
-
-
由 Ruibin Cheung 提交于
-
- 20 7月, 2023 1 次提交
-
-
由 Xing-lil 提交于
* update broadcast gloo in dygraph * update * update reduce gloo in dygraph * update reduce gloo in dygraph * update * update allreduce allgather * update all * update * update * update
-
- 19 7月, 2023 1 次提交
-
-
由 Chen Weihang 提交于
* polish dist_tensor details * add unittest for coverage * revert uselesss change * skip test without dist
-
- 12 7月, 2023 1 次提交
-
-
由 Wang Xin 提交于
* [clang-tidy] enable readability-container-size-empty check * fix test_custom_kernel Failed * add clang-tid-10 in dockerfile * add clang-tidy in dockerfile * fix bug
-
- 11 7月, 2023 1 次提交
-
-
由 hong 提交于
* suport optional input in new_ir * polish code * add coverate test * update * update * add unitest * remove reduplicate code * udpate * fix assign error * revert test arg min max * update * fix bug * polish code * update * fix unique and close op bug * update * update * revert test code * revert unique test * polish code * remove useless code --------- Co-authored-by: Nzhangbo9674 <zhangbo54@baidu.com>
-
- 29 6月, 2023 1 次提交
-
-
由 hong 提交于
* update * refactor build context * fix bug * polish code * change func name
-
- 27 6月, 2023 2 次提交
-
-
由 JZ-LIANG 提交于
* base rule * add sharidng merge * add sharidng axis merge * define unified data class for inferencing dist_attr * test wrap DistTensorSpec in dygraph mode * matmul main logic done * define unified data class for inferencing dist_attr --------- Co-authored-by: NYichen Zhang <zhangyichen03@baidu.com>
-
由 hong 提交于
* add kernel dialect * change DenseTensorTypeStorage to DenseTensorType * add test case` * add first pd_op to kernel dialect * lower pd op to kernel dialect * update * update * remove useless code * add attrite print test * fix bug * update * update * update * update * polish code * fix bug * polish code and add python test * add test * fix test error * add env flag * fix bug * revert test env * change cc_test_old to cc_test * fix build_static bug * fix type test error * udpate cmake * disable test in windows * update * update * fix bug * split file * fix conflict * polish code and fix conflict * support place transformer * finish bug * add gpu flags * fix with cuda macro * update * add scope guard * polish code
-
- 20 6月, 2023 1 次提交
-
-
由 TaoTao Li 提交于
-
- 19 6月, 2023 1 次提交
-
-
由 hong 提交于
* add kernel dialect * change DenseTensorTypeStorage to DenseTensorType * add test case` * add first pd_op to kernel dialect * lower pd op to kernel dialect * update * update * remove useless code * add attrite print test * fix bug * update * update * update * update * polish code * fix bug * polish code and add python test * add test * fix test error * add env flag * fix bug * revert test env * change cc_test_old to cc_test * fix build_static bug * fix type test error * udpate cmake * disable test in windows * fix inference compile * update * support tensor attribute runtime * add result check * polish test code * fix test error * add scalar test & polish code * re-open test case
-
- 15 6月, 2023 1 次提交
-
-
由 hong 提交于
* add kernel dialect * change DenseTensorTypeStorage to DenseTensorType * add test case` * add first pd_op to kernel dialect * lower pd op to kernel dialect * update * update * remove useless code * add attrite print test * fix bug * update * update * update * update * polish code * fix bug * polish code and add python test * add test * fix test error * add env flag * fix bug * revert test env * change cc_test_old to cc_test * fix build_static bug * fix type test error * udpate cmake * disable test in windows * fix inference compile
-
- 14 6月, 2023 1 次提交
-
-
由 Charles-hit 提交于
-
- 13 6月, 2023 1 次提交
-
-
由 LiYuRio 提交于
* construct dist tensor * move constructor to header
-