[cherry-pick] Add FC padding, ernie test unit and layernorm parallel (#22198)
* Optimize the kernel implementation of layernorm with openmp (#20895)
* Add ernie c++ inference test (#21015)
* Add ernie unit test
test=develop
* Add ernie unit test
test=develop
* Add ernie unit test
test=develop
* remove ngraph
* optimize gpu test
test=develop
* optimize codes
test=develop
* fix cmake fails on inference_download_and_uncompress (#21185)
* solve cmake fails on inference_download_and_uncompress
test=develop
* solve cmake fails on inference_download_and_uncompress
test=develop
* Add fc padding to improve mkl GEMM's performance when N and K are multiple of 128. (#20972)
* Add fc padding to solve mkl performance
test=develop
* fix gpu pass and error information
test=develop
* fix fc_fuse_pass_test
test=develop
* fix error information
test=develop
* fix error information
test=develop
* fix name and add fc op padding test
test=develop
* fix attributes
test=develop
* optimize fc padding
test=develop
* fix test
test=develop
* Polish the codes of fc when needs padding (#21378)
test=develop
* Add ernie large c++ inference test (#21365)
* add ernie-large test
test=develop
* add ernie large c++ inference test
test=develop
* Modify padding strategy: remove weight copy in fc padding (#21650)
test=develop
* optimize fc jit (#21878)
test=develop
Co-authored-by: NYihua Xu <yihuaxu@hotmail.com>
Showing
想要评论请 注册 或 登录