update README.md

d0319531 · qiuxuezhong · b3175bd2 · d0319531
隐藏空白更改
内联并排

Showing with 12 addition and 9 deletion

fluid/machine_reading_comprehesion/DuReader/README.md fluid/machine_reading_comprehesion/DuReader/README.md +12 -9

未找到文件。
--- a/fluid/machine_reading_comprehesion/DuReader/README.md
+++ b/fluid/machine_reading_comprehesion/DuReader/README.md
-# DuReader Dataset
-DuReader is a new large-scale real-world and human sourced MRC dataset in Chinese. DuReader focuses on real-world open-domain question answering. The advantages of DuReader over existing datasets are concluded as follows:
+# Abstract
+Dureader is an end-to-end neural networks model for machine reading comprehesion style question answering, which aims to anser questions from given passages. We first match the question and passage with a bidireactional attention flow networks to obtrain the question-aware passages represenation. Then we employ the pointer networks to locate the positions of answers from passages. Our experimental evalutions show that DuReader model achieves the state-of-the-art results in DuReader Dadaset.
+# Dataset
+DuReader Dataset is a new large-scale real-world and human sourced MRC dataset in Chinese. DuReader focuses on real-world open-domain question answering. The advantages of DuReader over existing datasets are concluded as follows:
 - Real question
 - Real article
 - Real answer
 - Real application scenario
 - Rich annotation

-# DuReader Network
-DuReader is inspired by 3 classic reading comprehension models([BiDAF](https://arxiv.org/abs/1611.01603), [Match-LSTM](https://arxiv.org/abs/1608.07905), [R_NET](https://www.microsoft.com/en-us/research/wp-content/uploads/2017/05/r-net.pdf)).
+# Network
+DuReader is inspired by 3 classic reading comprehension models([BiDAF](https://arxiv.org/abs/1611.01603), [Match-LSTM](https://arxiv.org/abs/1608.07905), [R-NET](https://www.microsoft.com/en-us/research/wp-content/uploads/2017/05/r-net.pdf)).

-Attention Flow from [BiDAF](https://arxiv.org/abs/1611.01603)
+DuReader model is a hierarchical multi_stage process adn consist of five layers

-Anwser Point Network from [Match-LSTM](https://arxiv.org/abs/1608.07905)
+- **Word Embedding Layer** maps each word to a vector space using a pre-trained word embedding model.
+- **Encoding Layer** extract context infomation for each position in question and passages with bi-directional LSTM network.
+- **Attention Flow Layer** couples the query and context vectors and produces a set of query-aware feature vectors for each word in the context. Please refer to [BiDAF](https://arxiv.org/abs/1611.01603) for more details.
+- **Fusion Layer** employs two layers of bi-directional LSTM to capture the interaction among context words independent of the query.
+- **Answer Point Network Layer with Attention Pooling** please refer to [Match-LSTM](https://arxiv.org/abs/1608.07905) and [R_NET](https://www.microsoft.com/en-us/research/wp-content/uploads/2017/05/r-net.pdf) for more details.

-Attention Pooling from [R_NET](https://www.microsoft.com/en-us/research/wp-content/uploads/2017/05/r-net.pdf)
 ## How to Run
 ### Download the Dataset
 To Download DuReader dataset:
@@ -36,8 +41,6 @@ cat data/raw/trainset/search.train.json | python utils/preprocess.py > data/prep
 ```
 The preprocessed data can be automatically downloaded by `data/download.sh`, and is stored in `data/preprocessed`, the raw data before preprocessing is under `data/raw`.

-### Run with PaddlePaddle
-
 #### Get the Vocab File

 Once the preprocessed data is ready, you can run `utils/get_vocab.py` to generate the vocabulary file, for example, if you want to train model with Baidu Search data: