- 01 6月, 2022 13 次提交
-
-
由 zhouweiwei2014 提交于
-
由 Sing_chan 提交于
-
由 JZ-LIANG 提交于
* adapt for 10 loss * partitioner support optimizer
-
由 BrilliantYuKaimin 提交于
-
由 houj04 提交于
* update xpu cmake: xdnn 0527. test=kunlun * update to xdnn 0531. * update to xdnn 0531. test=kunlun * update to xdnn 0601. test=kunlun
-
由 zhangchunle 提交于
unittest parallel Co-authored-by: Nzhangbo9674 <zhangbo54@baidu.com>
-
由 Ruibiao Chen 提交于
* Add pinned memory to HostMemoryStats * Add macro for WrapStatAllocator * Fix CI errors
-
由 zhiboniu 提交于
-
由 Guoxia Wang 提交于
* fix the bug of adamw which set the attribute in param group not working * fix undefined variable * fix api example typo * add unittest * fix unittest typo
-
由 huzhiqiang 提交于
-
由 caozhou 提交于
-
由 Yulong Ao 提交于
* [Auto Parallel] Add the parallel tuner * [Auto Parallel] Improve the parallel tuner and fix some bugs * upodate cost model * update import Resharder by dist op * update cost model * fix comp cost bug * update cost model * [Auto Parallel] Amend the dist attr for #processses=1 * update cost model and tuner * update cost model and tuner * update cost model and tuner * update cluster * update reshard * [Auto Parallel] Add the estimation from the cost model * [Auto Parallel] Reimplement the backup and restore functions * [Auto Parallel] Fix the bugs of the parallel tuner * [Auto Parallel] Update the engine api and dist context * [Auto Parallel] Work around the high order grad problem * [Auto Parallel] Add some miscellaneous improvements * [Auto Parallel] Add a unittest for DistributedContext Co-authored-by: Ncaozhou <caozhou@radi.ac.cn>
-
由 chentianyu03 提交于
* add conv3d yaml * add conv3d_grad, conv3d_double_grad * add final_state_conv3d test case * add conv3d double test case * add depthwise_conv2d grad yaml * add depthwise_conv2d double grad test case * modify the order of args * add depthwise_conv2d_grad_grad config
-
- 31 5月, 2022 22 次提交
-
-
由 Sławomir Siwek 提交于
* remove attrs from base op * fix typos * remove brelu * undo removing code related to matmul * remove whitespaces * undo changes in matmul * remove empty line
-
由 pangyoki 提交于
* add double_grad and triple_grad inplace info in backward.yaml * only generate inplace api in forward
-
由 wanghuancoder 提交于
* fix full zero * fix full zero * fix full zero * fix full zero * refine * refine * refine
-
由 Sing_chan 提交于
-
由 Chen Weihang 提交于
* fix assign kernel copy impl * fix test failed
-
由 BrilliantYuKaimin 提交于
-
由 cambriconhsq 提交于
-
由 yaozhixin 提交于
* [IPU] support paddle.distributed.launch with IPUs * add device_num to env_args_mapping
-
由 David Nicolas 提交于
* update RandomCrop class code annotation; test=document_fix * update adjust_brightness api in functional.py test=document_fix * udpate uniform api in random.py * update transforms.py
-
由 BrilliantYuKaimin 提交于
-
由 Chen Weihang 提交于
* polish append op using * fix var error * fix group norm impl
-
由 Aganlengzi 提交于
* fix arg_max and reduce_max * add arg_max ut
-
由 thunder95 提交于
* rrelu逻辑部分 * unregistered op kernel (unresolved) * commit before merge * 丰富测试用例 * 修复rrelu-sig的bug * 修复cpu环境测试 * 修改拼写错误 * 修改code format * 尝试优化测试用例timeout的问题 * 优化测试用例 * 移除seed, 优化随机函数 * update en doc for rrelu * fix rrelu en docs, test=document_fix * add paper link for en docs, test=document_fix * udpate en doc * add r,test=document_fix
-
由 Haohongxiang 提交于
-
由 xiongkun 提交于
* change einsum_v2 as default and add new flags: FLAG_einsum_opt=1|0 * make EInsumOP support bf16 * add unittest for BF16 * add condition for test_BF16 * fix bugs * fix
-
由 Leo Chen 提交于
Co-authored-by: NRyan Jeng <rjeng@nvidia.com>
-
由 Jiabin Yang 提交于
* support is empty * fix error * fix code error * change to fake empty * using fake empty first * using fake empty first * Support backward prune in fluid
-
由 Li Min 提交于
* replace dropout_is_test with is_test. * improve atol on a100.
-
由 Weilong Wu 提交于
* [Eager] fix collective_global_gather * fix eager_ode = 1
-
由 zyfncg 提交于
* add embedding yaml * fix infermeta bug * fix bug of selected_rows infer_meta * fix selected_rows * add unittest
-
由 Wilber 提交于
-
由 jakpiase 提交于
OneDNN md-in-tensor refactoring part 5: Memory descriptor enabled for elementwises, reductions and expand_v2 ops (#43036) * enabled md in elementwises, reductions and expand_v2 * CI fix for invalid numpy copy * fixed formatting * CI rerun * changes after review
-
- 30 5月, 2022 5 次提交
-
-
由 Chenxiao Niu 提交于
-
由 Li Min 提交于
* add fused_bias_dropout_residual_ln op and layer.
-
由 heliqi 提交于
-
由 shentanyue 提交于
* update lite compile cmake * Update delete_fill_constant_op_pass.cc * Update analysis_config.cc
-
由 pangyoki 提交于
* support backward inplace in eager fluid mode * fix * fix * optimize format * little change
-