Created by: lcy-seso
We merge the CPU implementation of the layer normalization operator. The documentation and the unittest still need to be enhanced. And the GPU implementation is required.