- 10 1月, 2020 1 次提交
-
-
由 GaoWei8 提交于
* Optimize the kernel implementation of layernorm with openmp (#20895) * Add ernie c++ inference test (#21015) * Add ernie unit test test=develop * Add ernie unit test test=develop * Add ernie unit test test=develop * remove ngraph * optimize gpu test test=develop * optimize codes test=develop * fix cmake fails on inference_download_and_uncompress (#21185) * solve cmake fails on inference_download_and_uncompress test=develop * solve cmake fails on inference_download_and_uncompress test=develop * Add fc padding to improve mkl GEMM's performance when N and K are multiple of 128. (#20972) * Add fc padding to solve mkl performance test=develop * fix gpu pass and error information test=develop * fix fc_fuse_pass_test test=develop * fix error information test=develop * fix error information test=develop * fix name and add fc op padding test test=develop * fix attributes test=develop * optimize fc padding test=develop * fix test test=develop * Polish the codes of fc when needs padding (#21378) test=develop * Add ernie large c++ inference test (#21365) * add ernie-large test test=develop * add ernie large c++ inference test test=develop * Modify padding strategy: remove weight copy in fc padding (#21650) test=develop * optimize fc jit (#21878) test=develop Co-authored-by: NYihua Xu <yihuaxu@hotmail.com>
-
- 11 10月, 2019 2 次提交
-
-
由 Aurelius84 提交于
test=release/1.6 * Add fp16 in input.dtype check test=develop * Add warning of fp16 in CPU test=develop * add unittest code for fp16 test=develop * fix float16 list error test=develop
-
由 Aurelius84 提交于
* add api check in fc test=develop * enforce shape error info of sum op test=develop * fix spelling test=develop * print x_dims info test=develop * enhance shape error info test=develop
-
- 11 9月, 2019 1 次提交
-
-
由 Yiqun Liu 提交于
* Refine the codes related to fc op. * Add GPU implementation for fc functor. * Apply fc_fuse_pass in GPU inference. test=develop * Change the cmake for fc op. * Change PADDLE_ENFORCE to PADDLE_ENFORCE_EQ. * Add an attribute to set the activation type in fc_op. * Enhance the unittest of fc_op. test=develop * Remove the declaration of FCOpGrad back to the header file. test=develop * Set default value for newly added arguments in test_fc_op. test=develop
-
- 16 8月, 2018 1 次提交
-
-
由 tensor-tang 提交于
-
- 14 8月, 2018 1 次提交
-
-
由 tensor-tang 提交于
-
- 17 4月, 2018 1 次提交
-
-
由 mozga-intel 提交于
-
- 03 4月, 2018 1 次提交
-
-
由 mozga-intel 提交于
-