- 18 May, 2023 1 commit
-
-
Committed by zhouweiwei2014
* [Zero-Dim] update 0d tensor api en doc, test=document_fix
* [BUG] fix windows kernel dispatch of _lzcnt bug (#53728)
-
- 09 May, 2023 4 commits
-
-
Committed by zqw_1997
* fix doc errors, test=allcase
* conflict
* test=allcase (message repeated many times in a row)
* fix doc errors, test=allcase
* fix the to_tensor error
-
Committed by zhouweiwei2014
-
Committed by zhouweiwei2014
* [Zero-Dim] make functools.reduce safer with an initial value, to support empty lists (#53182)
* [Zero-Dim] support 0d tensor for shape and squeeze onednn kernel (#52832)
* support 0d tensor for shape and squeeze onednn kernel
* set python api for shape op ut
* [Zero-Dim] distributed scatter/all_to_all support input 0D tensor (#53186)
* [Zero-Dim] Support paddle.sum/mean/loss api output 0D, test=allcase (#52739)
* [CINN Support 0D-Tensor] CINN supports 0D-Tensor with trick temporarily (#53382)
* [CINN Support 0D-Tensor] CINN supports 0D-Tensor with trick temporarily
* Add unittest
* [CINN Support 0D-Tensor] CINN hack squeeze2 with trick temporarily (#53454)
* fix test_autograd_dynamic (#53473)
Co-authored-by: zhwesky2010 <zhouwei25@baidu.com>
---------
Co-authored-by: YangQun <qun.yang@intel.com>
Co-authored-by: HongyuJia <jiahongyu@baidu.com>
Co-authored-by: HydrogenSulfate <490868991@qq.com>
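The `functools.reduce` item above (#53182) concerns a general Python pitfall: reducing an empty sequence without an initial value raises `TypeError`. The sketch below is only an illustration of that pitfall, not the code from the commit; the helper name `numel` is hypothetical, and it assumes the typical use case of multiplying the entries of a shape list, where a 0-D tensor has the empty shape `[]`.

```python
from functools import reduce
import operator


def numel(shape):
    """Return the number of elements described by a shape list (hypothetical helper)."""
    # The initial value 1 keeps the empty shape [] (a 0-D tensor) well defined:
    # the product over no dimensions is 1, i.e. a single element.
    return reduce(operator.mul, shape, 1)


print(numel([2, 3, 4]))  # product of the three dimensions
print(numel([]))         # empty shape, falls back to the initial value

# Without the initial value, the empty list is an error:
try:
    reduce(operator.mul, [])
except TypeError as exc:
    print("reduce without an initial value failed:", exc)
```

Passing the identity element of the operation (1 for multiplication, 0 for addition) as the third argument is the usual way to make such reductions safe for empty inputs.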
-
Committed by JYChen
* support 0-D output and 0-D as index in __getitem__
* fix tests
* fix inference and UT
* add unittest for setitem
* fix xpu test
* fix xpu 0-d
* fix right value is 0d and index is List/Tensor
* Hack __getitem__ from 0-d to 1-d with FLAGS_set_to_1d
* change PHI_DECLARE_xxx to DECLARE_xxx since the change is not merged to 2.5
* hack 1-D tensor to Scalar
* throw warning at __getitem__, not slice_utils
-
- 06 May, 2023 1 commit
-
-
Committed by zhangkaihuo
att, cherry-pick: #52902 #53113
-
- 27 April, 2023 1 commit
-
-
Committed by zhouweiwei2014
[cherry-pick2.5] [Zero-Dim] Support all/any/min/max/prod/logsumexp/amax/amin/some loss output 0D (#53192)
-
- 24 April, 2023 2 commits
-
-
Committed by kangguangli
* fix bug: wrong match between depend and c_allreduce_sum (cherry picked from commit 327da8035bdfee3ec2f016e8cda29ec8ee89bc95)
* fix codestyle (cherry picked from commit bdb1483081adc41aa47d3f7df257f63f1cff399b)
* fix bug (cherry picked from commit 373ba5253c45ac019ffaa8d69d4ce9e02cb9ae79)
* add c_sync_calc_stream back (cherry picked from commit 9933d7533ae1f307b76f24a33bf0c59e4c8e8f01)
* fix (cherry picked from commit abc9a31beaa326f6a566c08749419bb33e209672)
* revert (cherry picked from commit 07bc98dbf7c9df43910fa6e86a6a2698731dffb2)
* use flag to control (cherry picked from commit 8e5682a4b99759cbe35a49f3f8c9db735dc8fee4)
* fix for code coverage (cherry picked from commit fe7e61bdef24fbc43e2f4e1cb67f68963c957cf1)
- 23 April, 2023 1 commit
-
-
Committed by JYChen
* support 0-D output and 0-D as index in __getitem__
* fix tests
* fix inference and UT
* add unittest for setitem
* fix xpu test
* fix xpu 0-d
-
- 20 April, 2023 1 commit
-
-
Committed by kangguangli
* fix
* fix
* fix
* fix
* fix
* fix fuse group order (cherry picked from commit 38ec37cd)
-
- 17 April, 2023 5 commits
-
-
Committed by Chitsing KUI
* add random control for fused dropout add
* add __init__
-
Committed by Kim Yann
-
Committed by 张春乔
* remove hccl in .py files
* remove ascend in setup.py.in
* remove ascend in setup.py
-
Committed by Haohongxiang
-
Committed by caozhou
* add o2 tune
* add unittest
* fix error
* set unittest timeout
-
- 14 April, 2023 2 commits
-
-
Committed by Feiyu Chan
1. modify the set_value op to use Scalars to represent the attr `values`, instead of a bunch of attributes of various types (#52408);
2. add a program converter, with the set_value op as an example, which converts `paddle::framework::ProgramDesc` between the old and new formats (the differences are mainly operators whose definitions received incompatible updates);
3. the program version and operator version map are now always saved when serializing `paddle::framework::ProgramDesc`, to identify the version;
4. provide an option `legacy_format=false` in the serialization of `paddle::framework::ProgramDesc`, which decides whether to convert the ProgramDesc back to a legacy format that paddle 2.4.2 or earlier versions can load and execute;
5. deserialization of `paddle::framework::ProgramDesc` now automatically detects whether the bytes it receives are in legacy format (contains any of the operators that has been incompatibly updated and have any attribute of type `Scalar`) and converts them to the new format. If you want a faithful deserialization without the automatic conversion, you can use protobuf's deserialization instead. It is not recommended, but it can be used for testing.
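The detect-and-convert idea in the message above can be sketched with a toy model. This is not Paddle's converter and uses no Paddle API: a "program" is modeled as a plain list of op dicts, and the legacy attribute names `int_values` and `float_values` are made up for illustration. Only the unified `values` attribute and the `set_value` op name come from the commit message.

```python
# Hypothetical names of the per-type legacy attributes (assumption, for illustration only).
LEGACY_VALUE_ATTRS = ("int_values", "float_values")


def is_legacy(op):
    """An op counts as legacy if it is a set_value op that still carries per-type value attributes."""
    return op["type"] == "set_value" and any(
        key in op["attrs"] for key in LEGACY_VALUE_ATTRS
    )


def to_new_format(op):
    """Fold the per-type value attributes into one `values` attribute."""
    if not is_legacy(op):
        return op
    attrs = dict(op["attrs"])  # copy so the input op is left untouched
    values = []
    for key in LEGACY_VALUE_ATTRS:
        values.extend(attrs.pop(key, []))
    attrs["values"] = values
    return {"type": op["type"], "attrs": attrs}


def load_program(ops):
    """Toy 'deserialization': convert legacy ops, pass everything else through."""
    return [to_new_format(op) for op in ops]


legacy_program = [
    {"type": "set_value", "attrs": {"float_values": [1.0, 2.0], "axes": [0]}},
    {"type": "relu", "attrs": {}},
]
print(load_program(legacy_program))
```

In this toy version the conversion is applied per op at load time, so a program that is already in the new format passes through unchanged.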
-
Committed by ronnywang
-
- 13 April, 2023 1 commit
-
-
Committed by TaoTao Li
* add auto parallel tuner options in launch
* add ut for launch in auto_parallel tuner fix code format
* fix ci-coverage
-
- 12 April, 2023 4 commits
-
-
Committed by ShenLiang
-
Committed by Yulong Ao
* [Auto Parallel] Speedup the completion process
* [Auto Parallel] Skip the property of dist_context when deepcopying
* [Auto Parallel] Remove the unnecessary print
* [Auto Parallel] Move some changes from 2.4 branch to develop
* Update engine.py
* [Auto Parallel] Fix a bug
-
Committed by 张春乔
* remove c_comm_init_hccl_op.cc and c_gen_hccl_id_op.cc
* remove gen_hccl_id_op.cc
-
Committed by CHANGer
-
- 11 April, 2023 3 commits
-
-
Committed by wangxiaoning
-
Committed by wuhuachaocoding
-
Committed by risemeup1
-
- 10 April, 2023 1 commit
-
-
Committed by JZ-LIANG
* unique id for mesh
* rng ctrl
* support dropout
* register op
* adopt for recompute
* update unittest
* support pp
-
- 09 April, 2023 2 commits
-
-
Committed by Chitsing KUI
-
Committed by ShenLiang
* add seed control
* fix bug
-
- 07 April, 2023 3 commits
-
-
Committed by kangguangli
* remove run_program
* remove FLAGS_USE_STANDALONE_EXECUTOR
-
Committed by TaoTao Li
fix merge conflicts
-
Committed by Roc
* fix mkdir
* update
-
- 06 April, 2023 2 commits
-
-
Committed by Nyakku Shigure
-
Committed by Kim Yann
* rem is_compiled_with_npu
* rem npu related code
* make lint happy
* rem test
* remove some tests
* Update grad_scaler.py
* fix an error
-
- 04 April, 2023 2 commits
-
-
Committed by Tian
-
Committed by LoneRanger
* relocate debugger.py
* fix bug
* fix bug
* fix bug
* fix bug
-
- 03 April, 2023 1 commit
-
-
Committed by Kim Yann
* rem is_compiled_with_mlu
* fix some mlu_place and mlu_device_count
* make lint happy
-
- 31 March, 2023 2 commits
-
-
Committed by zhenhailiu
* gather with doc
* resolve comment
* polish
* polish
* code style
* polish doc
* add_test
* polish
* polish
* add test check
* add test check
* polish
* polish
* polish
* polish
* fix_time_out
* polish
* fix timeout
* fix_timeout
* polish
* polish
* polish
* polish
* polish
-
Committed by 张春乔
* autofix
Co-authored-by: Liyulingyue <83450930+Liyulingyue@users.noreply.github.com>
* revert changes in python/paddle/distributed/fleet/utils/hybrid_parallel_util.py
* empty commit, trigger ci
* fix test_slice
---------
Co-authored-by: SigureMo <sigure.qaq@gmail.com>
-
- 30 March, 2023 1 commit
-
-
Committed by Ghost Screaming
* Support ignore_index for c_softmax_with_cross_entropy_op.
* Polish code. Remove useless comments and add Testcase.
* Polish code for TestCase.
* Polish code.
* Polish code style.
* Polish code.
* Change loss calculation formula and ignore_index dtype.
* Polish TestCase.
* Fix bug of c_softmax_with_cross_entropy_op_xpu_op. Attribute 'ignore_index' dtype is int64_t.
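The commit message does not spell out the loss formula, so the following is only a single-process sketch of what an `ignore_index` option commonly means for softmax cross entropy. It assumes that samples whose label equals `ignore_index` contribute a loss of zero. The function name and the default value of -100 are illustrative choices, not taken from the commit, and the sketch does not reproduce the operator's distributed behavior.

```python
import math


def softmax_cross_entropy(logits, labels, ignore_index=-100):
    """Per-sample softmax cross entropy; ignored labels yield 0.0 (assumed convention)."""
    losses = []
    for row, label in zip(logits, labels):
        if label == ignore_index:
            # Ignored sample: skip the computation and contribute no loss.
            losses.append(0.0)
            continue
        # Numerically stable log-sum-exp: subtract the row maximum first.
        row_max = max(row)
        log_sum_exp = row_max + math.log(sum(math.exp(v - row_max) for v in row))
        # Cross entropy for a hard label: -log softmax(row)[label].
        losses.append(log_sum_exp - row[label])
    return losses


example_losses = softmax_cross_entropy(
    logits=[[0.0, 0.0], [1.0, 2.0]],
    labels=[0, -100],
)
print(example_losses)
```

Checking the label before touching the logits also avoids indexing a row with an out-of-range label such as -100.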
-