- 06 9月, 2023 1 次提交
-
-
由 Chen Weihang 提交于
* add spmd and reshard code gen * add backward reshard code gen * test matmul forward success * polish test impl * add unsafe mutable value * polish details and add test * fix unittest time out * fix typo * refactor reshard input generate impl * resolve conflict with develop * fix compile error
-
- 05 9月, 2023 1 次提交
-
-
由 Ghost Screaming 提交于
* [WIP] Support std::vector<phi::Tensor> input and output for DistTensor. Concat forward and backward are verified. * Polish code for new dist tensor implementation. * Fix bug of DistTensor upgrade. Add support functions for std::vector<Tensor> -> std::vector<Tensor>. * Add support for DistTensor type of std::vector<phi::Tensor> as input or output of operators. Following testcases are passed. 1. concat: std::vector<phi::Tensor> -> phi::Tensor 2. unbind: phi::Tensor -> std::vector<phi::Tensor> 3. broadcast_tensors: std::vector<phi::Tensor> -> std::vector<phi::Tensor> * Polish code. Remove useless comments. * Add update_loss_scaling in skip_op_lists. * Polish code.
-
- 28 8月, 2023 1 次提交
-
-
由 Chen Weihang 提交于
* simplify with dist marco * polish error message format * fix vtable error * fix cmake error * fix winsock redefined error * fix windows compile error * fix windows conpile failed * fix merge error * fix vec compile error * add port.h into test_cpu_vec * fix merge error * try to fix winsock error
-
- 16 8月, 2023 1 次提交
-
-
由 Chen Weihang 提交于
* add phi forward api gen impl * add phi backward gen code * polish api code gen impl * polish code gen impl * remove auto_paralel namespace * add dygraph forward impl * add for_auto_parallel cond * fix code gen errors * add dygraph backward impl * resolve conflict with develop * refactor dist api gen impl * revert origin api gen impl * replace template for override func * fix dnnl marco error * revert third_party change * add with distributed marco * Update grad_tensor_holder.cc details * merge dist tensor constructor * change test tensor to replicate * fx typo * resolve conflict with develop * fix out dim error
-
- 31 7月, 2023 1 次提交
-
-
由 wanghuancoder 提交于
support stride
-
- 29 6月, 2023 1 次提交
-
-
由 hong 提交于
* add kernel dialect * change DenseTensorTypeStorage to DenseTensorType * add test case` * add first pd_op to kernel dialect * lower pd op to kernel dialect * update * update * remove useless code * add attrite print test * fix bug * update * update * update * update * polish code * fix bug * polish code and add python test * add test * fix test error * add env flag * fix bug * revert test env * change cc_test_old to cc_test * fix build_static bug * fix type test error * udpate cmake * disable test in windows * update * update * fix bug * split file * fix conflict * polish code and fix conflict * support place transformer * finish bug * add gpu flags * fix with cuda macro * add fetch kernel * support fetch var in new ir * fix bug * polish code * change array equal to np.testing * support feed in new ir * update * fix bug * try to hack combine op * add scope guard * revert atan2 op * add scope guard * update * polish code * update * refactor build kernel context * fix unitest bug * polish code * use original order * remove useless code * polish code * fix bug
-
- 27 6月, 2023 1 次提交
-
-
由 hong 提交于
* add kernel dialect * change DenseTensorTypeStorage to DenseTensorType * add test case` * add first pd_op to kernel dialect * lower pd op to kernel dialect * update * update * remove useless code * add attrite print test * fix bug * update * update * update * update * polish code * fix bug * polish code and add python test * add test * fix test error * add env flag * fix bug * revert test env * change cc_test_old to cc_test * fix build_static bug * fix type test error * udpate cmake * disable test in windows * update * update * fix bug * split file * fix conflict * polish code and fix conflict * support place transformer * finish bug * add gpu flags * fix with cuda macro * update * add scope guard * polish code
-
- 09 3月, 2023 1 次提交
-
-
由 zyfncg 提交于
* register custom kernel for all type of custom device * fix bug * fix GetKernelInputArgDef * fix amp bug * fix TransToPhiPlace * adapt interpreter_util
-
- 27 10月, 2022 1 次提交
-
-
由 zyfncg 提交于
-
- 27 9月, 2022 1 次提交
-
-
由 jiahongyu 提交于
-
- 28 7月, 2022 1 次提交
-
-
由 zyfncg 提交于
* support auto fallback to cpu kernel for cusom device * fix some problem of kernel fallback
-
- 14 7月, 2022 1 次提交
-
-
由 Yuang Liu 提交于
-
- 27 5月, 2022 1 次提交
-
-
由 zyfncg 提交于
* refactor the optional tensor * remove optiona<MetaTensor> in InferMeta * fix bug * fix optional<vector<Tensor>> * fix bug * fix rmsprop * fix amp of eager_gen * polish code * fix deleted code * fix merge conflict * polish code * remove is_nullopt_ * fix merge conflict * fix merge conflict
-
- 25 3月, 2022 1 次提交
-
-
由 hong 提交于
* update * remove useless code * remove label smooth test * polish code * polish code * polish code * remove _in_eager_mode error;
-
- 28 2月, 2022 1 次提交
-
-
由 zyfncg 提交于
* fix selected_rows bug in C++ API * add optional for C++ APIO * data transform support optional * remove data transform for optional vector<Tensor> * adjust some format of funtcion * fix empyt bug
-
- 20 2月, 2022 1 次提交
-
-
由 Chen Weihang 提交于
* rename pten dir to phi * rename namespace to phi * rename infrt pten dir to phi * resolve conflict * rename pten to phi in cmake * revert all infrt change * change needed files * fix infrt failed * fix inference failed
-
- 04 2月, 2022 1 次提交
-
-
由 zyfncg 提交于
* add data_transform in pten api * support GetKernelTypeForVar * fix complie problem of bfloat16 * change error namespace * add complex type transform unittest * fix merge conflict
-