MUL CUDA kernel is faked, for the cublas not works Some enhancement needs for TypeSystem, unittests
- make the target wrapper for host works - code clean