- 22 May, 2023 1 commit
-
-
Committed by niuliling123
-
- 18 May, 2023 1 commit
-
-
Committed by shaojie_wang
* add master gradients on static graph
* add unit test for bf16 master grad static graph
* use float16 as v100 test dtype
* only skip GPUs that do not support bf16
* use linear layer to test master grad
* 1. push master grad creation before all optimizer ops; 2. remove useless unittest; 3. use a function to create master grad states
-
- 16 May, 2023 2 commits
-
-
Committed by zhouweiwei2014
-
Committed by niuliling123
-
- 15 May, 2023 1 commit
-
-
Committed by shaojie_wang
* fix embedding model weight type mismatch error
* Update fp16_utils.py
---------
Co-authored-by: Zhang Ting <zhangting_2017@163.com>
-
- 12 May, 2023 1 commit
-
-
Committed by Zhang Ting
-
- 11 May, 2023 2 commits
-
-
Committed by Kaipeng Deng
* move DataLoader to paddle.io. test=develop
-
Committed by 张春乔
-
- 10 May, 2023 1 commit
-
-
Committed by Yiqun Liu
-
- 09 May, 2023 1 commit
-
-
Committed by RedContritio
-
- 08 May, 2023 1 commit
-
-
Committed by Zhang Ting
-
- 06 May, 2023 1 commit
-
-
Committed by xiaoluomi
-
- 04 May, 2023 1 commit
-
-
Committed by Aurelius84
* [Perf] Removed useless assign op in while_loop
* refine assign
-
- 25 April, 2023 1 commit
-
-
Committed by qizhaoaoe
* fluid clean: remove print/switch from fluid to static
* remove Switch in static.__init__
* fix conflicts.
* replace Switch by case.
* fix piecewise_lr decay.
* fix typo
* fix conflicts.
* fix lr dtype
* keep Switch in paddle.static.nn.control_flow and fix piecewise_lr.
* fix conflicts.
* keep Switch in the fluid.
* fix Switch doc
* fix example in Switch doc
* fix Switch doc.
* fix static/__init__.
-
- 24 April, 2023 2 commits
-
-
Committed by Yiqun Liu
[AMP] Allow enabling multi_precision through paddle.static.amp.decorate and add documents for some apis. (#53012)
* Add document for some apis. test=docs_preview
* Allow to set master_weight in paddle.static.amp.decorate.
* Polish codes and add unittest.
* Refine docs.
* Remove the repetitive function.
-
Committed by Zhang Ting
* support promote dtype for static amp training
* unify o1 and o2
* update for unittest
* fix op_role
* add use_promote arg
* fix doc
* add promote unittest
* polish unittests
* fix control flow and test
-
- 23 April, 2023 1 commit
-
-
Committed by LoneRanger
* relocate metric_op.py
* relocate nn.py
* fix bug
* fix bug
* fix bug
* fix bug
* fix bug
* fix bug
* fix variable->tensor and fix __all__
* fix ctr_metric_bundle and sparse_embedding
* fix bug of function init
* fix bug of importing sparse_embedding and ctr_metric_bundle
* fix bug
* Update __init__.py
-
- 22 April, 2023 1 commit
-
-
Committed by zhouweiwei2014
-
- 21 April, 2023 1 commit
-
-
Committed by Zhang Zheng
* [AMP OP&Test] Add check_dtype for some API (part 2)
* fix ci
* fix ci
-
- 19 April, 2023 1 commit
-
-
Committed by zhouzj
-
- 18 April, 2023 3 commits
-
-
Committed by zhouzj
-
Committed by Meteor Liu
* rename _varbase_creator as create_tensor
* rename _varbase_creator as create_tensor
-
Committed by Yiqun Liu
* Implement a common AmpTestBase.
* Support overload of decorate.
* Change the ignore list of flake and fix an error.
-
- 14 April, 2023 3 commits
-
-
Committed by duanyanhui
-
Committed by Feiyu Chan
1. modify set_value op, use Scalars to represent attr `values`, instead of a bunch of attributes of various types; (#52408)
2. add program converter and set_value op as an example, which provides the functionality to convert `paddle::framework::ProgramDesc` between old and new formats (the differences are mainly some operators with incompatible updates in the definition);
3. program version and operator version map are now always saved when serializing `paddle::framework::ProgramDesc` to identify the version;
4. provide an option `legacy_format=false` in serialization of `paddle::framework::ProgramDesc`, which decides whether to convert ProgramDesc back to a legacy format that paddle 2.4.2 or earlier versions can load and execute;
5. deserialization of `paddle::framework::ProgramDesc` now automatically detects whether the bytes it receives are in legacy format (contains any of the operators that have been incompatibly updated and have any attribute of type `Scalar`) and converts them to the new format. If you want a faithful deserialization without the automatic conversion, you can use protobuf's deserialization instead. Though not recommended, it can be used for testing.
-
Committed by Yiqun Liu
* Unify the static amp codes of fp16 and bf16.
* Polish apis and add unittest.
* Add operator stats collecting tools for program.
* Add the check of number of bfloat16 operators in unittest.
* Add warning for operator not supported for amp.
* Add testing of BF16 O1 and O2.
-
- 12 April, 2023 1 commit
-
-
Committed by qizhaoaoe
* fix dtype cast in amp.
* add test case and update docs.
* remove set_prim.
-
- 10 April, 2023 1 commit
-
-
Committed by qizhaoaoe
* add fp16 and bf16 support for instance_norm
* fix /= operator which does not support bf16
* fix instance_norm_grad kernel and unittests.
* fix fp32 unittests.
* fix instance_norm_kernel and unittests.
* fix instance_norm_grad_kernel and unittest threshold.
* add fp16/bf16 for instance_norm_grad_grad op.
* add bf16 dtype check.
* fix conflicts.
* fix cpu support for fp32 op and fix type in instance_norm_grad_kernel.
* fix type in instance_norm_kernel.
* fix bf16 outputs in unittests and refine codes.
* fix dx computation.
* delete unused params and header includes.
* add fp16/bf16 for static graph.
* fix device condition for instance_norm op.
* fix instance_norm_grad_grad and bf16 op tests.
* fix op_test so that bf16 grads can be compared with fp32.
* remove updates.
* add self-defined grad.
-
- 09 April, 2023 1 commit
-
-
Committed by shaojie_wang
-
- 06 April, 2023 2 commits
- 05 April, 2023 1 commit
-
-
Committed by zhouweiwei2014
-
- 04 April, 2023 1 commit
-
-
Committed by JYChen
-
- 03 April, 2023 1 commit
-
-
Committed by Kim Yann
* remove is_compiled_with_mlu
* fix some mlu_place and mlu_device_count
* make lint happy
-
- 01 April, 2023 1 commit
-
-
Committed by 张春乔
-
- 31 March, 2023 1 commit
-
-
Committed by 张春乔
* autofix
  Co-authored-by: Liyulingyue <83450930+Liyulingyue@users.noreply.github.com>
* revert changes in python/paddle/distributed/fleet/utils/hybrid_parallel_util.py
* empty commit, trigger ci
* fix test_slice
---------
Co-authored-by: SigureMo <sigure.qaq@gmail.com>
-
- 30 March, 2023 3 commits
-
-
Committed by Wang Xin
* add autogen code support for spectral_norm
* bug fixed
* fix PR-CI-Static-Check fail
-
Committed by cyberslack_lee
[CodeStyle][C416][C417] rewrite unnecessary comprehension with function call and use generator instead of map (#52140)
* codestyle c416 c417
* fix error
* fix inc
* unify all C4 rules into one
* fix inc
---------
Co-authored-by: SigureMo <sigure.qaq@gmail.com>
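For readers unfamiliar with the two lint rules named in this commit title, the following is a minimal sketch of the kind of rewrite C416 and C417 (from flake8-comprehensions) ask for. The variable names and data are made up for illustration and are not taken from the Paddle change itself.

```python
# Hypothetical sample data, used only to illustrate the two rules.
pairs = [("a", 1), ("b", 2)]
values = [1, 2, 3]

# C416: a comprehension that only copies its iterable
# can be replaced by a plain constructor call.
copied_before = [p for p in pairs]
copied_after = list(pairs)

mapping_before = {k: v for k, v in pairs}
mapping_after = dict(pairs)

# C417: map() with a lambda reads more clearly as a
# comprehension or generator expression.
squares_before = list(map(lambda x: x * x, values))
squares_after = [x * x for x in values]

total_before = sum(map(lambda x: x * x, values))
total_after = sum(x * x for x in values)
```

In each pair the "before" and "after" forms produce equal results, so the rewrite is purely stylistic.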
-
Committed by wanghuancoder
* delete old dygraph op test
-
- 29 March, 2023 1 commit
-
-
Committed by zhouweiwei2014
-