- 08 May, 2023 4 commits
-
-
Committed by GGBond8488
* add complex support for optest * add complex grad test * append one * move some debug info * move some debug info * move some debug info * move some debug info * add more complex test * Fix naming ambiguity * Revert "add more complex test" This reverts commit dbcb0516b8e53ba42e2d6089878a39b395345969. * change backward gradient, add TODO
-
Committed by 张春乔
-
Committed by zhouweiwei2014
-
Committed by JYChen
-
- 27 April, 2023 3 commits
-
-
Committed by zhupengyang
-
Committed by Zhang Ting
* support OD level and skip dynamic loss scaling for bf16
-
Committed by JYChen
-
- 25 April, 2023 2 commits
-
-
Committed by YuanRisheng
* add flags for phi * fix compile bugs * fix ci bugs * fix inference bugs * fix cinn bugs * fix cinn bugs * polish code according to review comments * fix ci bugs * fix ci bugs
-
Committed by Chitsing KUI
* print modified flags * fix ref, opt print * fix default getter * fix ut
-
- 24 April, 2023 5 commits
-
-
Committed by niuliling123
-
Committed by 张春乔
-
Committed by 张春乔
-
Committed by zqw_1997
* test=allcase * test=allcase * test=allcase * test=allcase * test=allcase * test=allcase * test=allcase * test=allcase * test=allcase * test=allcase * test=allcase * test=allcase * test=allcase * test=allcase * test=allcase * test=allcase * test=allcase * test=allcase * test=allcase * test=allcase * test=allcase * test=allcase * test=allcase * test=allcase * test=allcase * test=allcase * fix doc erros, test=allcase
-
Committed by Galaxy1458
* test,test=develop * test,test=develop * test,test=develop * test,test=develop * test,test=develop * test,test=develop * test,test=develop * test ,test=develop
-
- 23 April, 2023 3 commits
-
-
Committed by ronnywang
* [CustomDevice] add pipeline parallel support * update * update
-
Committed by risemeup1
* apply gcc12 to gpups * apply gcc12 to gpups * apply gcc12 to gpups * apply gcc12 to gpups * apply gcc12 to gpups * apply gcc12 to gpups * apply gcc12 to gpips * apply gcc12 to gpups * apply gcc12 to gpups * test * test * apply gcc12 to gpups * apply_gcc12_to_gpups * fix compiler bug * fix compiler bug * test * fix dangling-pointer compiler * fix dangling-pointer compiler * fix dangling-pointer compiler * apply_gcc12_to_gpups * apply gcc12 to gpups * Update cuda_streams_py.cc
-
Committed by niuliling123
* Delete temp param in eager_gen
-
- 21 April, 2023 1 commit
-
-
Committed by JYChen
* support 0-D output and 0-D as index in __getitem__ * fix tests * fix inference and UT * add unittest for setitem * fix xpu test * fix xpu 0-d
-
- 20 April, 2023 2 commits
- 19 April, 2023 1 commit
-
-
Committed by ronnywang
* [CustomDevice] add recompute support * update
-
- 18 April, 2023 2 commits
-
-
Committed by niuliling123
-
Committed by 张春乔
-
- 17 April, 2023 3 commits
-
-
Committed by LiYuRio
* cherry-pick fleet executor from 2.4 * fix test case
-
Committed by JingZhuangzhuang
-
Committed by 张春乔
-
- 14 April, 2023 3 commits
-
-
Committed by Feiyu Chan
1. modify the set_value op to use Scalars to represent the attr `values`, instead of a bunch of attributes of various types; (#52408) 2. add a program converter, with the set_value op as an example, which converts `paddle::framework::ProgramDesc` between the old and new formats (the differences are mainly some operators with incompatible updates in their definitions); 3. the program version and operator version map are now always saved when serializing `paddle::framework::ProgramDesc`, to identify the version; 4. provide an option `legacy_format=false` in serialization of `paddle::framework::ProgramDesc`; it decides whether to convert the ProgramDesc back to a legacy format that paddle 2.4.2 or earlier versions can load and execute; 5. deserialization of `paddle::framework::ProgramDesc` now automatically detects whether the bytes it receives are in the legacy format (contains any of the operators that have been incompatibly updated and have any attribute of type `Scalar`) and converts them to the new format. If you want a faithful deserialization without the automatic conversion, you can use protobuf's deserialization instead. This is not recommended, but it can be used for testing.
-
Committed by Kim Yann
-
Committed by ronnywang
-
- 13 April, 2023 1 commit
-
-
Committed by Yuanle Liu
-
- 12 April, 2023 1 commit
-
-
Committed by liuruyan
-
- 11 April, 2023 3 commits
-
-
Committed by Yuanle Liu
-
Committed by Xiaoxu Chen
-
Committed by wangzhen38
-
- 10 April, 2023 4 commits
-
-
Committed by Zhang Ting
* support set master_grad * move register_hook to auto_cast * update unittest * fix fp16 test * update for review comments
-
Committed by HongyuJia
* [Opt Performance] Optimize custom operator performance, reconstruct python API auto-gen, add cache and use const inference * opt AutoGradMeta implementation * remove profiler codes * fix unit test * change year, 2021->2023 * fix int64_t parse bug
-
Committed by kangguangli
* add strategy force_sequential_run * remove flag * fix * fix * fix * fix * fix * fix * fix * fix * fix
-
Committed by 张春乔
* mv WITH_ASCEND_CL * mv WITH_ASCEND * rollback * remove WITH_ASCEND * remove WITH_ASCEND
-
- 08 April, 2023 2 commits
-
-
Committed by kangguangli
* add strategy force_sequential_run * fix * fix * fix * fix * fix
-
Committed by 张春乔
* mv WITH_ASCEND_CL * mv WITH_ASCEND * rollback
-