Created by: jczaja
- Introducing Softmax MKLDNN grad operator to take advantage of recently introduced softmax backward primitive in MKLDNN .
- Redesigned reusing mechanism and applied it softax op and softmax grad op.
With this PR machine translation when MKLDNN is used (use_mkldnn=True, shall be added in fc layer of machine_translation.py) is speeding up significantly