1). add static trt load model 2). fix bug: when device_id is not 0, the trt will have a bug test=develop
* Initialize the elementwise plugin. * Implement the basic CUDA kernel of elementwise plugin. test=develop