- 01 6月, 2022 2 次提交
-
-
由 Yulong Ao 提交于
* [Auto Parallel] Add the parallel tuner * [Auto Parallel] Improve the parallel tuner and fix some bugs * upodate cost model * update import Resharder by dist op * update cost model * fix comp cost bug * update cost model * [Auto Parallel] Amend the dist attr for #processses=1 * update cost model and tuner * update cost model and tuner * update cost model and tuner * update cluster * update reshard * [Auto Parallel] Add the estimation from the cost model * [Auto Parallel] Reimplement the backup and restore functions * [Auto Parallel] Fix the bugs of the parallel tuner * [Auto Parallel] Update the engine api and dist context * [Auto Parallel] Work around the high order grad problem * [Auto Parallel] Add some miscellaneous improvements * [Auto Parallel] Add a unittest for DistributedContext Co-authored-by: Ncaozhou <caozhou@radi.ac.cn>
-
由 chentianyu03 提交于
* add conv3d yaml * add conv3d_grad, conv3d_double_grad * add final_state_conv3d test case * add conv3d double test case * add depthwise_conv2d grad yaml * add depthwise_conv2d double grad test case * modify the order of args * add depthwise_conv2d_grad_grad config
-
- 31 5月, 2022 17 次提交
-
-
由 Sławomir Siwek 提交于
* remove attrs from base op * fix typos * remove brelu * undo removing code related to matmul * remove whitespaces * undo changes in matmul * remove empty line
-
由 pangyoki 提交于
* add double_grad and triple_grad inplace info in backward.yaml * only generate inplace api in forward
-
由 BrilliantYuKaimin 提交于
-
由 cambriconhsq 提交于
-
由 yaozhixin 提交于
* [IPU] support paddle.distributed.launch with IPUs * add device_num to env_args_mapping
-
由 David Nicolas 提交于
* update RandomCrop class code annotation; test=document_fix * update adjust_brightness api in functional.py test=document_fix * udpate uniform api in random.py * update transforms.py
-
由 BrilliantYuKaimin 提交于
-
由 Chen Weihang 提交于
* polish append op using * fix var error * fix group norm impl
-
由 Aganlengzi 提交于
* fix arg_max and reduce_max * add arg_max ut
-
由 thunder95 提交于
* rrelu逻辑部分 * unregistered op kernel (unresolved) * commit before merge * 丰富测试用例 * 修复rrelu-sig的bug * 修复cpu环境测试 * 修改拼写错误 * 修改code format * 尝试优化测试用例timeout的问题 * 优化测试用例 * 移除seed, 优化随机函数 * update en doc for rrelu * fix rrelu en docs, test=document_fix * add paper link for en docs, test=document_fix * udpate en doc * add r,test=document_fix
-
由 Haohongxiang 提交于
-
由 xiongkun 提交于
* change einsum_v2 as default and add new flags: FLAG_einsum_opt=1|0 * make EInsumOP support bf16 * add unittest for BF16 * add condition for test_BF16 * fix bugs * fix
-
由 Leo Chen 提交于
Co-authored-by: NRyan Jeng <rjeng@nvidia.com>
-
由 Li Min 提交于
* replace dropout_is_test with is_test. * improve atol on a100.
-
由 Weilong Wu 提交于
* [Eager] fix collective_global_gather * fix eager_ode = 1
-
由 zyfncg 提交于
* add embedding yaml * fix infermeta bug * fix bug of selected_rows infer_meta * fix selected_rows * add unittest
-
由 jakpiase 提交于
OneDNN md-in-tensor refactoring part 5: Memory descriptor enabled for elementwises, reductions and expand_v2 ops (#43036) * enabled md in elementwises, reductions and expand_v2 * CI fix for invalid numpy copy * fixed formatting * CI rerun * changes after review
-
- 30 5月, 2022 12 次提交
-
-
由 Chenxiao Niu 提交于
-
由 Li Min 提交于
* add fused_bias_dropout_residual_ln op and layer.
-
由 pangyoki 提交于
-
由 crystal 提交于
-
由 zhaoyingli 提交于
* use original id in dist_op_context.grad_op_id_to_op_id * del assert * remove redundant map
-
由 thunder95 提交于
* nanmedian op * 修改cuda kernel的bug * 修复count_if在其他硬件平台不兼容 * 修复某些cpu硬件不兼容 * 修复某些cpu硬件不兼容 * 修复isnan判断 * 兼容numpy低版本不支持全部nan的情况 * 兼容numpy低版本不支持全部nan的情况 * fix code example * fix api comment error * 修改反向传播逻辑以及c++处理逻辑 * 完成修改建议 * typo pre_dim * update en docs, test=document_fix * remove numpy in en doc, test=document_fix * add r,test=document_fix * 添加api到all * follow advice from chenwhql
-
由 levi131 提交于
* enhance check for current block docstring for prim2orig interface * refine if else syntax
-
由 cambriconhsq 提交于
-
由 WangZhen 提交于
* Fix cond_block_grad error when handle no need grad vras * Add comment and UT
-
由 Aganlengzi 提交于
* addmm supports 1-d input * fix coverage * fix * more ut
-
由 zyfncg 提交于
* make data transform inplace when tensor is on gpupinned in new dygraph * fix unittest
-
由 zhangchunle 提交于
-
- 28 5月, 2022 1 次提交
-
-
由 ShenLiang 提交于
* fix alltoall * rename utest
-
- 27 5月, 2022 8 次提交
-
-
由 Weilong Wu 提交于
-
由 zyfncg 提交于
* refactor the optional tensor * remove optiona<MetaTensor> in InferMeta * fix bug * fix optional<vector<Tensor>> * fix bug * fix rmsprop * fix amp of eager_gen * polish code * fix deleted code * fix merge conflict * polish code * remove is_nullopt_ * fix merge conflict * fix merge conflict
-
由 Aurelius84 提交于
* [Dy2Stat]Replace paddle.jit.dy2stat with _jst * [Dy2Stat]Replace paddle.jit.dy2stat with _jst * refine code style * refine code style
-
由 zhangbo9674 提交于
* refine trunc uinttest * refine unittest * refine ut * refine fp64 grad check
-
由 wanghuancoder 提交于
-
由 xiongkun 提交于
-
由 Baibaifan 提交于
-
由 Ruibiao Chen 提交于
* Support memory stats for CPU * Add UTs * Fix typos * Fix typos
-