Created by: reyoung
The sample code is here.
When setting with_parallel_do=True and sparse_update=False. The cost could be NaN after several batches.
with_parallel_do=True
sparse_update=False