Add multi-process version of xmap_reader to take place of multi-thread version for a significant acceleration and add seqbin data parser for internal 1w data training. (!355) · 合并请求 · PaddlePaddle / models

Add multi-process version of xmap_reader to take place of multi-thread version for a significant acceleration and add seqbin data parser for internal 1w data training. !355

Created by: xinghai-sun

Done:

Add multi-process version of xmap_reader_mp to take place of multi-thread version of paddle.reader.xmap_reader.
Add seqbin data parser to adapt training with internal 1w English dataset.

This speeds up the training with the internal 1w dataset by more than 3X, and now the GPU utilization rises back to 81% (from 15%).