- 08 10月, 2021 3 次提交
-
-
由 Qi Li 提交于
* [NPU] support NCL and NCL for BatchNorm, test=develop * [NPU] remove debug files, test=develop * update, test=develop
-
由 arlesniak 提交于
* Added oneDNN BF16 relu * fixed typo * refactored test, review fixes
-
由 Zeng Jinle 提交于
-
- 07 10月, 2021 1 次提交
-
-
由 Adam Osewski 提交于
* Remove unused header. * Use ConvMKLDNNHandlerT for conv2d INT8. * Use absolute module path to import.
-
- 05 10月, 2021 1 次提交
-
-
由 jakpiase 提交于
* tmp * added concat BF16/FP32 BWD oneDNN kernel * minor change * minor change * fix for CI * added formatting * Reverted deleting static keyword * added reviewers suggestions * reverted deleting concat bf16 test file * fixed concat tests
-
- 30 9月, 2021 1 次提交
-
-
由 Aganlengzi 提交于
* [NPU] modify transpose2 and index_select_grad kernels for model xlnet * add transpose2 int64_t unit test * add more transpose2 unit tests * update test_transpose_op_npu.py
-
- 29 9月, 2021 7 次提交
-
-
由 zhulei 提交于
* [npu] add box coder * [npu] add box coder
-
由 pangyoki 提交于
-
由 zhulei 提交于
* [NPU] Add group norm * [NPU] Add group norm * [NPU] Add group norm * [NPU] Add group norm * [NPU] Add group_norm op
-
由 Aganlengzi 提交于
* merge conflict of paddle_gtest_main.cc * modify FLAGS_npu_precision_mode and default not to call aclSetCompileopt
-
由 Yiqun Liu 提交于
-
由 Li Min 提交于
-
由 ronnywang 提交于
-
- 28 9月, 2021 5 次提交
-
-
由 Liu-xiandong 提交于
Add sparse_attention OPs, python api will be added in next pr
-
由 Lijunhui 提交于
* Add paddle.linalg.eig op * remove comments * remove comments * extend batch_size to the origin * add real times complex functor & destroy the backward complex output bug * terminate output diff when input real tensors * correct tiny doc errors * move functions from eig_helper to svd_helper and remove eig_helper * remove tensor.Resize * remove no longer used code * use existing lapack functions * reply review comments 21/27 * remove .cu as this op is only executed on CPU * remove const_cast & add const in argument list for read-only references * fix sample code error in CI * remove template typename Tbase and more * remove eig exposure in paddle.* * add 'name=None' in eig python implementation * handle the unittest * try to solve the unittest * solve CI coverage * remove no longer used code * polish API doc and more * reply review comments * polish unittest, commit plan B * polish unittest
-
由 ronnywang 提交于
-
由 xiayanming 提交于
* [HIP] fix op not support AMD GPU bug, the flag PADDLE_WITH_ROCM is invalid * [HIP] fix op not support AMD GPU bug, the flag PADDLE_WITH_ROCM is invalid * [HIP] fix op not support AMD GPU bug * [hybrid] seed and dropout op support force-cpu * [hybrid] seed and dropout op support force-cpu * [hybrid] seed and dropout op support force-cpu * [hybrid] seed and dropout op support force-cpu * [hybrid] seed and dropout op support force-cpu * [hybrid] fix seed ci failed issue * add AsExtra for force_cpu of seed op
-
由 Guoxia Wang 提交于
-
- 27 9月, 2021 3 次提交
-
-
由 Jiawei Wang 提交于
* fix extra op for expand, expand_as, tile, unstack * fix unique unstack dim 0 * Update expand_v2_op.cc * fix unique_op format
-
由 limingshu 提交于
* A leap of try for cudaLaunchCooperativeKernel * fix bugs * Totally replace the lar cuda kernel * Fix bugs * fix code according to comments * fix codes according to review comments * adding some function overload * relocate the power operation.
-
由 jakpiase 提交于
* refactored reshape multiop kernel and added flatten1/2 kernels * added formatting for flatten tests * CI fix * disabled reshape_kernel ops after succesful CI run * minor fix
-
- 26 9月, 2021 7 次提交
-
-
由 JZ-LIANG 提交于
-
由 JYChen 提交于
* add func/class API psroi_pool and UT * add UT in static mode * Remove redundant type checks in static mode * More detailed description for test_psroi_pool_op * fix code format of UT * fix en-doc
-
由 Yulong Ao 提交于
-
由 whs 提交于
-
由 crystal 提交于
-
由 YuanRisheng 提交于
* Add New Op: gumbel_softmax * Add New Op: gumbel_softmax * Add New Op: gumbel_softmax (amend) * add __main__ function in unit test * fix bugs when test in windows ci * update en docs * delete reletive error in unit test * delete relative error in unit test * set hard=True in unit test * Support fix seed in Python for test
-
由 niuliling123 提交于
-
- 25 9月, 2021 1 次提交
-
-
由 baoachun 提交于
-
- 24 9月, 2021 5 次提交
-
-
由 jiangcheng 提交于
* add gradient kernel of det op and slogdet op * fix CI APPROVAL problem
-
由 piotrekobiIntel 提交于
* Add elementwise_sub_mkldnn_op without grad * Add test to static_mode_white_list * Refactor code, change license years * Remove invalid grad implementation * Fix element_wise_sub_op test * Fix CI Approval error * Remove unnecessary EltwiseSubMKLDNNGradKernel class * Fix CI Approval 2 * Fix CI Approval 3 * Fix CI Approval Attempt #4 * Fix CI Approve Attempt #5 * Fix CI Approval Attempt #6 * Fix CI Approval Attemt #7 * Change test names containing add to sub * Fix old tests testing add instead of sub * Copy grad implementation from elementwise_add_mkldnn * CI test fix attempt * Revert "CI test fix attempt" This reverts commit c647cacf41e6a87c715385a185de5cbf65fc8900. * Fix CI attempt 2 * Fix elementwise_sub tests, temporary mkldnn broadcast test disable * Add working implementation of elementwise_sub grad * Fix build errors caused by pull * Fix format error * Fix format error 2 * Disable elementwise_sub_mkldnn test on GPU * Apply fix for paddle.fluid import * Revert changes of test_elementwise_sub and Fix mkldnn test * Revert "Apply fix for paddle.fluid import" This reverts commit fc3b122fec8e12f2bcb32928a2685ba4d20fd742. * fix bug of module 'paddle' has no attribute 'fluid' for python3.6 (#35862) * Add changes suggested by reviewers * Change @unittest.skipIf... to @OpTestTool.skip_if_not_cpu_bf16() to satisfy Approval CI * Remove check_dygraph=False to satisify CI Approval Co-authored-by: Nzhangbo9674 <82555433+zhangbo9674@users.noreply.github.com>
-
由 Jacek Czaja 提交于
* - candidate fix * - More fixes to #34554 * - another incosnstent fix to key * - Remvoed unneeded line * - matching the cache behaviour to other ops
-
由 Weilong Wu 提交于
* Add linalg.solve op, test=develop * Fix a bug caused by accidental deletion * updated description and fix a bug: missing a comma * Add linalg.solve op, test=develop * updated solve op backward logic * updated solve op backward logic again * Add linalg.solve Op, test=develop * Updated and modified to fit CI requirements * Fix a bug * 1)Add more test cases; 2)Fix a wrong usage in reduces operation; 3)Remove redundant code * Remove redundant comments * 1)Removed redundant code; 2)Updated to enhance code robustness * Removed redundant code * Updated API documents
-
由 wawltor 提交于
* add the shape check for the matmul * remove the test case for the linear
-
- 23 9月, 2021 3 次提交
- 22 9月, 2021 3 次提交
-
-
由 Zhang Zheng 提交于
-
由 ronnywang 提交于
* add randperm_op_npu * fix test_set_value_op_npu
-
由 TeslaZhao 提交于
* Pass compat of conv_transpose_bias_mkldnn_fuse_pass * Fix a bug of strided_slice op, about the axes parameter access memory out of bounds * Fix a bug of transpose op, about accessing memory out of bounds of the perm param * op:transpose_op supports bool type
-