- 01 6月, 2023 1 次提交
-
-
由 ronnywang 提交于
* [ROCM] fix multihead_matmul * skip bf16 uts * update
-
- 05 5月, 2023 1 次提交
-
-
由 iSerendipity 提交于
This reverts commit 9c406531.
-
- 28 4月, 2023 1 次提交
-
-
由 iSerendipity 提交于
-
- 10 1月, 2023 1 次提交
-
-
由 MarDino 提交于
-
- 20 2月, 2022 1 次提交
-
-
由 Chen Weihang 提交于
* rename pten dir to phi * rename namespace to phi * rename infrt pten dir to phi * resolve conflict * rename pten to phi in cmake * revert all infrt change * change needed files * fix infrt failed * fix inference failed
-
- 27 1月, 2022 1 次提交
-
-
由 Feiyu Chan 提交于
-
- 22 6月, 2021 1 次提交
-
-
由 zhiboniu 提交于
-
- 08 3月, 2021 1 次提交
-
-
由 Qi Li 提交于
-
- 05 3月, 2021 1 次提交
-
-
由 JamesLim 提交于
-
- 02 3月, 2021 1 次提交
-
-
由 Qi Li 提交于
* [ROCM] update fluid operators for rocm (part5), test=develop * address review comments, test=develop * fix typo, test=develop
-
- 27 11月, 2020 1 次提交
-
-
由 Shang Zhizhou 提交于
* remove -DSUPPORTS_CUDA_FP16 in cuda.cmake * comile with cuda9 * add some unittest * notest;test=coverage * add unittest for trt plugin swish && split * update ernie unittest * fix some error message * remove repeated judgement of CUDA version in mbEltwiseLayerNormOpConverter * fix comile errror when CUDA_ARCH_NAME < Pascal" * fix comile error * update unittest timeout * compile with cuda9 * update error msg * fix code style * add some comments * add define IF_CUDA_ARCH_SUPPORT_FP16 * rename IF_CUDA_ARCH_SUPPORT_FP16 to CUDA_ARCH_FP16_SUPPORTED
-
- 10 7月, 2020 1 次提交
-
-
由 zlsh80826 提交于
* add explicit specialization * add skiplayernorm vector load if available * test=develop
-
- 17 6月, 2020 1 次提交
-
-
由 zlsh80826 提交于
* blockReduce opt * launch threads align to warpSize * reduce unnecessary shared memory for broadcast reduced value * vectorize SoftmaxKernelWithEltadd * add fp16 constrain * test=develop
-
- 26 3月, 2020 1 次提交
-
-
由 Zhaolong Xing 提交于
* add dynamic plugin support. test=develop * change emb eltwise layernorm to math function test=develop * add emb eltwise layernorm test=develop * can run dynamic shape ernie test=develop * fix ci test=develop * add ut for trt ernie dynamic test=develop * refine dynamic shape c++ interface. test=develop * fix comments test=develop * fix comments test=develop
-