- 06 Feb, 2021 1 commit
-
-
Committed by Jacek Czaja
-
- 05 Feb, 2021 8 commits
-
-
Committed by Chengmo
add truncated gaussian random
-
Committed by liuyuhui
-
Committed by liym27
Performance optimization for dynamic setitem: Call op set_value to speed up because the original call to TensorToPyArray will introduce unnecessary data copy. (#30817)
-
Committed by liuyuhui
-
Committed by wanghuancoder
-
Committed by taixiurong
-
Committed by Adam Osewski
* Additionally change to not throw error from inside pass.
-
Committed by tianshuo78520a
-
- 04 Feb, 2021 5 commits
-
-
Committed by Jacek Czaja
-
Committed by joanna.wozna.intel
* Update Xbyak and add bf16 fast performance verification
* Fix formatting
* Change LOG message
* Trigger an update of a new tag
-
Committed by Shang Zhizhou
* fix split trt plugin initialize
* update
-
Committed by WangXi
-
Committed by wanghuancoder
* use iwyu clean include second time, test=develop
-
- 03 Feb, 2021 9 commits
-
-
Committed by cucuzg
-
Committed by wawltor
fix the broadcast for the large second input
-
Committed by JamesLim
-
Committed by AshburnLee
-
Committed by QingshuChen
* try again if kunlun memory malloc failed
* minor
-
Committed by 石晓伟
* support xpu inference with analysis predictor, test=develop
* merge the cmake of the xpu toolchain, test=develop
* add c-apis, test=develop
* fix a bug in extern_xpu, test=develop
-
Committed by liuyuhui
-
Committed by Adam Osewski
-
Committed by WangXi
-
- 02 Feb, 2021 2 commits
-
-
Committed by joanna.wozna.intel
-
Committed by Shang Zhizhou
* fix trt plugin clone and initialize bugs
* fix unit test error
* enable trt in ci py3
* update unittest timeout
-
- 01 Feb, 2021 6 commits
-
-
Committed by Wilber
* update lite tag
* disable ut
-
Committed by Thunderbrook
* dump to cpu
* format
* format
* format
-
Committed by Chengmo
fix push sparse int64 bug
-
Committed by Qi Li
-
Committed by QingshuChen
* fix malloc L3 failed bug for kunlun
* minor
-
Committed by WangXi
-
- 29 Jan, 2021 2 commits
- 28 Jan, 2021 6 commits
-
-
Committed by Qi Li
* [ROCM] update fluid platform for rocm35 (part1), test=develop
* address review comments, test=develop
-
Committed by Wojciech Uss
-
Committed by lidanqing
* solve python3 incompatibility issues
* update checksum
-
Committed by alncat
-
Committed by Chengmo
* fix Brpc get hostname
-
Committed by taixiurong
* transformer predict
* trans bug fix
-
- 27 Jan, 2021 1 commit
-
-
Committed by jakpiase
* added external reorder to profiler
* resolved conflict
* added enable_static
* initial version of lstm, not working yet
* added lstm to operators.cmake
* added vanilla lstm mkldnn op
* added peephole weights integration
* minor changes
* added formatting
* added fusion_lstm_mkldnn to static_whitelist
* added formatting
* removed comment
* moved use_peepholes attribute inside is_cached block
* reverted wrong changes
* minor formatting change
* minor changes
* changed stream handling
* minor change
* added datatype to GetExpectedKernelType()
* added reading stream from TLS
-