- 14 9月, 2021 3 次提交
-
-
由 chenenquan 提交于
* Add empty_cache api to release idle gpu memory hold by allocator,test=develop * Add empty_cache api to release idle gpu memory hold by allocator,test=develop * Add empty_cache api to release idle gpu memory hold by allocator,test=develop * Fix test coverage problem for empty_cache * delete redundant check for empty_cache * fix the problem of empty_cache's doc * delete the nvidia-smi comment in doc of empty_cache, test=document_fix
-
由 Wilber 提交于
-
由 WeiXin 提交于
-
- 13 9月, 2021 20 次提交
-
-
由 YUNSHEN XIE 提交于
-
由 YUNSHEN XIE 提交于
* Change uts to nightly mode * remove test_trt_pool_op from parallel_UT_rule.py,test=document_fix
-
由 ceci3 提交于
* fix instance norm index error * add unittest * update * fix
-
由 xiongkun 提交于
-
由 zhulei 提交于
* [RC22] Fix linear with matmul_op replace * [RC22] Fix linear with matmul_op replace * [RC22] Fix linear with matmul_op replace * [RC22] Fix linear with matmul_op replace * [RC22] Fix linear with matmul_op replace
-
由 baoachun 提交于
* add flatten/flatten2 converter test cases * add fatten/flatten2 trt converter test cases
-
由 JZ-LIANG 提交于
* reshape support zero-input * add unitest * revise error message
-
由 李季 提交于
* upload global scatter and global gather operators related files
-
由 Zhang Zheng 提交于
-
由 baoachun 提交于
-
由 baoachun 提交于
-
由 Qi Li 提交于
-
由 Yanxing Shi 提交于
* fix github name * fix CI error * fix review and CI error * fix inf,nan error and modify unittest samples * add unittest samples * add unittest samples * fix unittest error * test=document_fix * test=document_fix * modify doc and add unittest samples * fix error newline in constant * modify doc after mentor review * modify __all__ and doc * modify doc
-
由 ShenLiang 提交于
* support grad group * fix single card condition
-
由 baoachun 提交于
* add group_norm trt converter test case * update group_norm trt converter test case
-
由 chentianyu03 提交于
This reverts commit ae93d9c2.
-
由 JYChen 提交于
-
由 Guoxia Wang 提交于
* support hybrid parallel inference helper class
-
由 zhulei 提交于
* [ROCM] fix top_k_v2 with large shape * [ROCM] fix top_k_v2 with large shape
-
由 jakpiase 提交于
* implemented clip op bf16/fp32 * added skipping if not cpu or bf16 * CI rerun after bf16 package change * added parentheses to ensure formatting
-
- 11 9月, 2021 3 次提交
- 10 9月, 2021 12 次提交
-
-
由 Leo Chen 提交于
* change metaclass of Layer from pybind11_builtins.pybind11_type to type * fix cast * add ut
-
由 Feng Xing 提交于
-
由 Feng Xing 提交于
-
由 zhiboniu 提交于
-
由 hlygit66666 提交于
* add test_cumprod_op * Revert "add test_cumprod_op" This reverts commit c96cf6dff5d09ae7d8cc72c1e8ae4369a153aa19. * recommit * add error message * test input(x) initialize * test use cpu * update test code * add test type * add test case * solve ci problem * add complex case test * add complex case test * fix review problem * fix conflict * fix some docs * change test case * change test case * fix review problems again * fix docs * fix inclusivescan bug
-
由 huangxu96 提交于
This PR supports gradient clip (ClipGradByGlobalNorm) when training with AMP(auto mixed precision).
-
由 baoachun 提交于
-
由 feng_shuai 提交于
-
由 baoachun 提交于
-
由 Zeng Jinle 提交于
* fix scatter gather bug: * fix windows ci
-
由 wenbin 提交于
* conv3d * remove const_cast * modify ut * disable dynamic shape for trt6.0 * remove trt5
-
由 pangyoki 提交于
* add asExtra for nce op * fix unittest error in macos * remove asExtra for is_test
-
- 09 9月, 2021 1 次提交
-
-
由 0x45f 提交于
* init matrix_rank op, add matrix_rank CPU code and test * add GPU kernel, remove svd_eigen.h * add CPU kernel when tol is tensor * add cpu and gpu code when tol is tensor * fix CI-ROCM error * add matrix_rank API describe, fix PR-CI-Py3 error * fix PR-CI-Windows error, add matrix_rank API test * delete useless comments * fix review * add my code in svd_helper.h * update doc commets * remove spaces
-
- 08 9月, 2021 1 次提交
-
-
由 zhouweiwei2014 提交于
-