- 02 6月, 2022 8 次提交
-
-
由 光明和真理 提交于
Co-authored-by: Nliupeiyu <liupeiyu@cambricon.com>
-
由 ziyoujiyi 提交于
* back fl * delete ssl cert * . * make warning * . * unittest paral degree * solve unittest * heter & multi cloud commm ready * . * . * fl-ps v1.0 * . * support N + N mode * . * . * . * . * delete print * . * . * . * .
-
由 Wangzheee 提交于
* new general transformer inference support
-
由 Siming Dai 提交于
* support heter reindex * add unittest, fix bug * add comment * delete empty line * refine example * fix codestyle * add disable static
-
由 Guoxia Wang 提交于
-
由 zhaoyingli 提交于
* prepare only once
-
由 zhaoyingli 提交于
-
由 sneaxiy 提交于
* support CUDAGraph for partial graph * add ut * fix ci * fix ut again because of eager mode * fix kunlun ci * fix win ci
-
- 01 6月, 2022 8 次提交
-
-
由 YuanRisheng 提交于
* add yaml * fix infrt compile bugs
-
由 sneaxiy 提交于
* support weight transpose * add ut * add template * fix transpose error * fix transpose_comment * add api tests * add skipif * add doc
-
由 zhouweiwei2014 提交于
-
由 Guoxia Wang 提交于
* fix the bug of adamw which set the attribute in param group not working * fix undefined variable * fix api example typo * add unittest * fix unittest typo
-
由 caozhou 提交于
-
由 Yulong Ao 提交于
* [Auto Parallel] Add the parallel tuner * [Auto Parallel] Improve the parallel tuner and fix some bugs * upodate cost model * update import Resharder by dist op * update cost model * fix comp cost bug * update cost model * [Auto Parallel] Amend the dist attr for #processses=1 * update cost model and tuner * update cost model and tuner * update cost model and tuner * update cluster * update reshard * [Auto Parallel] Add the estimation from the cost model * [Auto Parallel] Reimplement the backup and restore functions * [Auto Parallel] Fix the bugs of the parallel tuner * [Auto Parallel] Update the engine api and dist context * [Auto Parallel] Work around the high order grad problem * [Auto Parallel] Add some miscellaneous improvements * [Auto Parallel] Add a unittest for DistributedContext Co-authored-by: Ncaozhou <caozhou@radi.ac.cn>
-
由 chentianyu03 提交于
* add conv3d yaml * add conv3d_grad, conv3d_double_grad * add final_state_conv3d test case * add conv3d double test case * add depthwise_conv2d grad yaml * add depthwise_conv2d double grad test case * modify the order of args * add depthwise_conv2d_grad_grad config
-
- 31 5月, 2022 11 次提交
-
-
由 Sławomir Siwek 提交于
* remove attrs from base op * fix typos * remove brelu * undo removing code related to matmul * remove whitespaces * undo changes in matmul * remove empty line
-
由 cambriconhsq 提交于
-
由 yaozhixin 提交于
* [IPU] support paddle.distributed.launch with IPUs * add device_num to env_args_mapping
-
由 Aganlengzi 提交于
* fix arg_max and reduce_max * add arg_max ut
-
由 thunder95 提交于
* rrelu逻辑部分 * unregistered op kernel (unresolved) * commit before merge * 丰富测试用例 * 修复rrelu-sig的bug * 修复cpu环境测试 * 修改拼写错误 * 修改code format * 尝试优化测试用例timeout的问题 * 优化测试用例 * 移除seed, 优化随机函数 * update en doc for rrelu * fix rrelu en docs, test=document_fix * add paper link for en docs, test=document_fix * udpate en doc * add r,test=document_fix
-
由 xiongkun 提交于
* change einsum_v2 as default and add new flags: FLAG_einsum_opt=1|0 * make EInsumOP support bf16 * add unittest for BF16 * add condition for test_BF16 * fix bugs * fix
-
由 Leo Chen 提交于
Co-authored-by: NRyan Jeng <rjeng@nvidia.com>
-
由 Li Min 提交于
* replace dropout_is_test with is_test. * improve atol on a100.
-
由 Weilong Wu 提交于
* [Eager] fix collective_global_gather * fix eager_ode = 1
-
由 zyfncg 提交于
* add embedding yaml * fix infermeta bug * fix bug of selected_rows infer_meta * fix selected_rows * add unittest
-
由 jakpiase 提交于
OneDNN md-in-tensor refactoring part 5: Memory descriptor enabled for elementwises, reductions and expand_v2 ops (#43036) * enabled md in elementwises, reductions and expand_v2 * CI fix for invalid numpy copy * fixed formatting * CI rerun * changes after review
-
- 30 5月, 2022 8 次提交
-
-
由 Chenxiao Niu 提交于
-
由 Li Min 提交于
* add fused_bias_dropout_residual_ln op and layer.
-
由 crystal 提交于
-
由 thunder95 提交于
* nanmedian op * 修改cuda kernel的bug * 修复count_if在其他硬件平台不兼容 * 修复某些cpu硬件不兼容 * 修复某些cpu硬件不兼容 * 修复isnan判断 * 兼容numpy低版本不支持全部nan的情况 * 兼容numpy低版本不支持全部nan的情况 * fix code example * fix api comment error * 修改反向传播逻辑以及c++处理逻辑 * 完成修改建议 * typo pre_dim * update en docs, test=document_fix * remove numpy in en doc, test=document_fix * add r,test=document_fix * 添加api到all * follow advice from chenwhql
-
由 cambriconhsq 提交于
-
由 WangZhen 提交于
* Fix cond_block_grad error when handle no need grad vras * Add comment and UT
-
由 Aganlengzi 提交于
* addmm supports 1-d input * fix coverage * fix * more ut
-
由 zhangchunle 提交于
-
- 28 5月, 2022 1 次提交
-
-
由 ShenLiang 提交于
* fix alltoall * rename utest
-
- 27 5月, 2022 4 次提交
-
-
由 Weilong Wu 提交于
-
由 Aurelius84 提交于
* [Dy2Stat]Replace paddle.jit.dy2stat with _jst * [Dy2Stat]Replace paddle.jit.dy2stat with _jst * refine code style * refine code style
-
由 zhangbo9674 提交于
* refine trunc uinttest * refine unittest * refine ut * refine fp64 grad check
-
由 xiongkun 提交于
-