Created by: xinghai-sun
resolve #5 (closed)
The model is implemented mainly according to the paper Memory-enhanced Decoder for Neural Machine Translation, with a few minor differences (will be listed in README later). And it is also slightly different from this V1 configuration.
Besides, to avoid running into this potential bug (Issue), I put write
ahead of read
(upon external memory) within one recurrent step (different from the original paper) . And it seems that such a change makes no difference (equivalent) to the final model structure, and has successfully bypassed the bug.