- 05 1月, 2022 1 次提交
-
-
由 0x45f 提交于
Fix error when calling sublayer's non-forward func in dy2stat cherrypick: #37713、#37759、#37296、#38540、#37888
-
- 16 12月, 2021 1 次提交
-
-
由 0x45f 提交于
修复tensor_shape_transformer中的错误。 之前在类似if len(paddle.shape(x)[0]) > 0中,paddle会被当做一个变量被传入convert_var_shape函数中
-
- 15 12月, 2021 1 次提交
-
-
由 chentianyu03 提交于
Issue37932 反馈 layer.to 不支持paddle.dtype类型的问题,新增了对这类型的支持。详情见:#38018 同时,也一并cherry-pick了遗漏的 PR 36779 的commit。 修改内容: Cherry-pick #36779 cherrry-pick #38018
-
- 13 12月, 2021 2 次提交
-
-
由 0x45f 提交于
动转静时,将函数中的注释行进行删除。 有函数体外的注释行,使用gast库进行func2ast转换时会导致出错,本PR之后将注释行(#开头的行)进行了删除
-
由 zhangbo9674 提交于
优化layer.to实现逻辑,相关pr: Remove additional warnning in layer.to ( #36700) Refine param conversion logic in layer.to ( #36862) Fix Layer.to() of device bug ( #37156)
-
- 09 12月, 2021 1 次提交
-
-
由 0x45f 提交于
Polish for zip in dy2stat
-
- 25 11月, 2021 1 次提交
-
-
由 Steffy-zxf 提交于
* fix data parallel when VOCAB var in program * fix ci coverage
-
- 23 11月, 2021 1 次提交
-
-
由 0x45f 提交于
[Dy2stat]Allow users to switch eval/train mode when using @to_static to decorate a function (#37383) (#37432) 本PR之前使用@to_static装饰一个单独的function时,对于生成的Program无法切换train/eval模式,只能运行在train模式下。这也就导致动转静后用户多次调用function显存会一直增长。 本PR之后,使用@to_static装饰一个单独的function时,可以通过function.train()或者function.eval()的方式来切换train/eval模式。
-
- 19 11月, 2021 1 次提交
-
-
由 0x45f 提交于
该PR使得动转静模块能够正确转换如下的for i in [1, 2, 3]语句。
-
- 28 10月, 2021 2 次提交
- 26 10月, 2021 3 次提交
-
-
由 Steffy-zxf 提交于
* Add FasterTokenizer Operator (#34491) Add Tokenizer related functionalities for Transformer model in order that the process of training and predicting is consistent. * support the text string as an input Tensor * support the "VOCAB"unordered_map<wstring, int> as an input Tensor to lookup tokens * Tokenizer used for BERT. This tokenizer applies an end-to-end, text string to wordpiece tokenization. * It first applies basic tokenization, followed by wordpiece tokenization. * optimize fast tokenizer * remove const_cast Co-authored-by: Nzhoushunjie <zhoushunjie@baidu.com> Co-authored-by: Nwawltor <fangzeyang0904@hotmail.com>
-
由 Haohongxiang 提交于
* fix bugs in HybridParallelClipGrad of hybrid_parallel_optimizer (#36237) * fix bugs in HybridParallelClipGrad of hybrid_parallel_optimizer * update * update * fix bugs in mp_layers、pp_layers and HybridParallelClipGrad (#36144) * fix calling bug of HybridParallelClipGrad * fix bugs of HybridParallelClipGrad * add unittest of pp with HybridParallelClipGrad * fix bugs in mp_layers.py * update * fix bugs in pp_layers.py * update * [HybridParallel]Rebuild code for pipeline (#36396) * add no_sync for parameters sync * add pipeline for moe * [HybridParallel]Support fp16 in dygraph hybrid parallel (#36420) * [HybridParallel]Support fp16 in dygraph hybrid parallel * update * update * update for recompute * add unittest of pp+fp16 * add unittest of recompute+fp16 * update * modify ut * modify ut of cond (#36475) * fix bugs of ClipGradByGlobalNorm in HybridParallel (#36555) * fix bugs of ClipGradByGlobalNorm * add unittests * add unittests * [HybridParallel]fix bug of check_inf in fleet_base.py (#36651) * fix bug of check_inf * fix allreduce * support ClipGradByGlobalNorm in sharding (#36012) * support ClipGradByGlobalNorm in sharding * support ClipGradByGlobalNorm in sharding * test=allcase * Update test_linalg_cond.py * Update hybrid_parallel_util.py * Update hybrid_parallel_util.py Co-authored-by: NShenLiang <1422485404@qq.com> Co-authored-by: Nzhaoyingli <86812880+zhaoyinglia@users.noreply.github.com>
-
由 Leo Chen 提交于
* refine amp level * fix typo * update tracer._amp_level
-
- 21 10月, 2021 1 次提交
-
-
由 0x45f 提交于
* remove no_value using var.name
-
- 20 10月, 2021 1 次提交
-
-
由 xiongkun 提交于
* catch the generatorfunction and intercept it. * add test generator * add test case * refine the testcase
-
- 18 10月, 2021 1 次提交
-
-
由 0x45f 提交于
[Cherry-pick][Dy2stat]fix no_grad context error in train mode when using save/load (#36434) (#36463) 修复使用jit.save/load接口加载模型后,在train模式和no_grad上下文中,显存会一直增长的问题
-
- 13 10月, 2021 1 次提交
-
-
由 0x45f 提交于
* change time to remove static tempfile * delete remove_static_file() function
-
- 26 9月, 2021 1 次提交
-
-
由 zhangbo9674 提交于
1、Split function GradScaler::minimize() to GradScaler::step() + GradScaler::update() 2、Add GradScaler::unscale_(optimizer)
-
- 22 9月, 2021 1 次提交
-
-
由 zhangbo9674 提交于
fix bug of module paddle has no attribute fluid for python3.6.
-
- 17 9月, 2021 2 次提交
-
-
由 zhangbo9674 提交于
* add pure fp16 major function in auto_cast & tracer * support master weight in dygraph for pure fp16 * check mix dtype of fp16&fp32 for check_finite_and_unscale op * change pure fp16 funtion name * refine some bug in auto_cast * refine auto_cast interface logic * add param _casted_by_pure_fp16 for class Layer * support state_dict hook for save model by user appointed dtype in pure_fp16_decorator * refine pure_fp16_decorator as decorator * add unittest * add comment * add comment * support recompute * add comment for auto_cast and decorator * support to_static_state_dict for paddle.jit.save * unlimite models num and optimizers num * add lookup_table in black_list * fix momentum and layer state_dict * fix bug in layer state_dict * fix bug in layer state_dict_helper * refine unittest * refine test_momentun_op * refine interface and some code * refine amp_decorator interface * refine pure fp16 interface * refine master weight interface
-
由 WeiXin 提交于
-
- 16 9月, 2021 1 次提交
-
-
由 0x45f 提交于
* fix no_grad context error in dy2stat * remove useless comments * fix error by drop_kids in python * add test and fix review
-
- 15 9月, 2021 3 次提交
- 14 9月, 2021 2 次提交
-
-
由 Haohongxiang 提交于
* Add solutions to PyLayer which is unsupported in DataParallel * modify note format for parallel.py * modify docs of dataparallel * add docs of dp with pylayer * modify docs format * modify example format * change example of dp with pylayer * add unittest for dp with pylayer * modify ut * merge latest codes * update * modify for CI-Coverage * modify text-indent
-
由 0x45f 提交于
* dy2stat_error: add revise suggestion for two error cases * fix test_error * fix review
-
- 13 9月, 2021 1 次提交
-
-
由 chentianyu03 提交于
This reverts commit ae93d9c2.
-
- 10 9月, 2021 1 次提交
-
-
由 ShenLiang 提交于
-
- 08 9月, 2021 1 次提交
-
-
由 zhouweiwei2014 提交于
-
- 07 9月, 2021 1 次提交
-
-
由 Sing_chan 提交于
-
- 06 9月, 2021 1 次提交
-
-
由 WeiXin 提交于
* support numpy dtype and polish code of list index. * polish code.
-
- 03 9月, 2021 1 次提交
-
-
由 0x45f 提交于
-
- 01 9月, 2021 3 次提交
-
-
由 0x45f 提交于
* modify dy2stat error message in compile time * fix variable name
-
由 Aurelius84 提交于
* Support append method and initialized value for List in ControlFlow * polish error msg and en doc * fix code style
-
由 zyfncg 提交于
* Support getitem by Bool index * delete some debug info of bool index * support the case that the shape of bool index is different from indexed tensor * support setitem by bool index * add the unittest for throwing exception * merge conflict * add check for int tensor when index is bool
-
- 26 8月, 2021 1 次提交
-
-
由 WeiXin 提交于
* polish code * polish code. * polish code. * polish code. * polish code.
-
- 24 8月, 2021 1 次提交
-
-
由 Haohongxiang 提交于
* Add no_sync in data parallel for dynamic graph * modify UT of no_sync * delete test_parallel_dygraph_dataparallel_no_sync.py * add test_parallel_dygraph_no_sync.py * modify run_trainer_with_spawn in UTs * Add UT of complex control flow in no_sync * add specific descriptions and notes for no_sync * check code style * modify UT's TIMEOUT in CMakeLists.txt
-
- 20 8月, 2021 1 次提交
-
-
由 shangliang Xu 提交于
-