The LoD is pretty slow since it uses malloc/free/memcpy
Created by: reyoung
In DynamicRNN of understand_sentiment. The LoD malloc and free takes 26s, and the whole C++ computation take 66s
.
It is too slow because LoD
is not managed by our memory pool.
We should do more experiments on the current LoD
performance.