- 28 8月, 2023 1 次提交
-
-
由 Chen Weihang 提交于
* simplify with dist marco * polish error message format * fix vtable error * fix cmake error * fix winsock redefined error * fix windows compile error * fix windows conpile failed * fix merge error * fix vec compile error * add port.h into test_cpu_vec * fix merge error * try to fix winsock error
-
- 26 8月, 2023 1 次提交
-
-
由 ronnywang 提交于
-
- 25 8月, 2023 2 次提交
- 24 8月, 2023 2 次提交
-
-
由 wanghuancoder 提交于
* stride flag open
-
由 liuruyan 提交于
* add symbolicDimProduct & symbolicDimMgr without method shape_constraint related * split ddim in phi, add a target ddim, used by pd_type * add pd_type.cc to ir_shape CMakeLists
-
- 22 8月, 2023 1 次提交
-
-
由 Chen Weihang 提交于
* polish dist teensor design * adjust constructor * polish details * polish details design * fix compile error * refactor init tensor impl * fix reshard test * polish details * add unittest for coverage
-
- 21 8月, 2023 1 次提交
-
-
由 LiYuRio 提交于
-
- 18 8月, 2023 3 次提交
-
-
由 wanghuancoder 提交于
-
由 zhouweiwei2014 提交于
-
由 lzy 提交于
[Inference] Make share_external_data supports bf16 and bool; fix while_op cache_inference_while_scope when using fleet_executor. (#56055) * 1. make share_external_data supports bf16 and bool; 2. don't drop_kids when cache_inference_while_scope * fix FLAGS_cache_inference_while_scope * add unitest * add unitest * skip unitest when cudnn_version < 8100 * skip test share_external_data_bf16 when CUDA_ARCH < 80
-
- 17 8月, 2023 2 次提交
-
-
由 tianshuo78520a 提交于
This reverts commit fd765f61.
-
由 zhouweiwei2014 提交于
-
- 16 8月, 2023 1 次提交
-
-
由 Chen Weihang 提交于
* add phi forward api gen impl * add phi backward gen code * polish api code gen impl * polish code gen impl * remove auto_paralel namespace * add dygraph forward impl * add for_auto_parallel cond * fix code gen errors * add dygraph backward impl * resolve conflict with develop * refactor dist api gen impl * revert origin api gen impl * replace template for override func * fix dnnl marco error * revert third_party change * add with distributed marco * Update grad_tensor_holder.cc details * merge dist tensor constructor * change test tensor to replicate * fx typo * resolve conflict with develop * fix out dim error
-
- 15 8月, 2023 2 次提交
-
-
由 cyberslack_lee 提交于
-
由 LiYuRio 提交于
-
- 14 8月, 2023 3 次提交
-
-
由 cyberslack_lee 提交于
-
由 LiYuRio 提交于
-
由 zhangbo9674 提交于
* add code * fix bug * fix bug * delete unused code * refine code * fix bug * fix bug * fix bug * fix bug * fix bug
-
- 10 8月, 2023 2 次提交
- 09 8月, 2023 1 次提交
-
-
由 Xinyu Chen 提交于
* onednn: rename macro to PADDLE_WITH_DNNL * onednn: rename macro to CINN_WITH_DNNL
-
- 07 8月, 2023 1 次提交
-
-
由 LiYuRio 提交于
* make tcp store a global instance * fix windows compile error
-
- 04 8月, 2023 2 次提交
- 03 8月, 2023 2 次提交
- 02 8月, 2023 2 次提交
-
-
由 gouzil 提交于
* [clang-tidy] modernize-avoid-c-arrays * rollback * [clang-tidy] fix * close modernize-avoid-c-arrays * fix PHI_DEFINE_string; add PHI_DEFINE_bool NOLINT * fix PHI_DEFINE_string * fix next_h_state and parity err * fix win32 * fix cuda_graph * fix accuracy_kernel * fix math_function * fix fused_softmax_mask_kernel.cu load_data and warp_reduce; rollback concat_and_split_functor ins_addr * fix fused_dropout_add_grad_kernel * fix * rollback cu * rollback concat_and_split_functor.cu * rollback
-
由 zhangbo9674 提交于
* add interface * add code * add code * add code * add code * fix bug * fix bug * add var prefix * add code * add code * add code * fix compile bug * fix bug * refine code * refine code * refine code * refine code * fix bug * add code * add code * fix bug * add code * add code * refine code * refine code * fix bug * add code * fix bug in phi__kernel_utils * refine code * fix bug * open flag * refine code * fix bug * fix bug * refine code * fix bug
-
- 01 8月, 2023 2 次提交
-
-
由 YuanRisheng 提交于
-
由 LiYuRio 提交于
* use string as key for comm_context_manager * remove device_id from comm_context
-
- 31 7月, 2023 2 次提交
-
-
由 YuanRisheng 提交于
* support construct network using ir in Python * fix py3 bugs * change source return type to OpResult
-
由 wanghuancoder 提交于
support stride
-
- 26 7月, 2023 2 次提交
- 24 7月, 2023 1 次提交
-
-
由 Chen Weihang 提交于
* simplify dist tensor namespace path * fix tensor dist attr decl error
-
- 21 7月, 2023 1 次提交
-
-
由 Ruibin Cheung 提交于
-
- 20 7月, 2023 1 次提交
-
-
由 Xing-lil 提交于
* update broadcast gloo in dygraph * update * update reduce gloo in dygraph * update reduce gloo in dygraph * update * update allreduce allgather * update all * update * update * update
-
- 19 7月, 2023 1 次提交
-
-
由 Chen Weihang 提交于
* polish dist_tensor details * add unittest for coverage * revert uselesss change * skip test without dist
-
- 12 7月, 2023 1 次提交
-
-
由 Wang Xin 提交于
* [clang-tidy] enable readability-container-size-empty check * fix test_custom_kernel Failed * add clang-tid-10 in dockerfile * add clang-tidy in dockerfile * fix bug
-