Add embedding modules (#1179)

* Add embedding modules

Add embedding modules (#1179)
* Add embedding modules
08cb2a73 · KP · GitHub · d37b2d80 · 08cb2a73 · 08cb2a73
183 changed file
--- a/modules/text/embedding/fasttext_crawl_target_word-word_dim300_en/README.md
+++ b/modules/text/embedding/fasttext_crawl_target_word-word_dim300_en/README.md
+## 概述
+PaddleHub提供多个开源的预训练Embedding模型。这些Embedding模型可根据不同语料、不同训练方式和不同的维度进行区分，关于模型的具体信息可参考PaddleNLP的文档：[Embedding模型汇总](https://github.com/PaddlePaddle/models/blob/release/2.0-beta/PaddleNLP/docs/embeddings.md)
+## API
+```python
+def __init__(
+    *args,
+    **kwargs
+)
+```
+创建一个Embedding Module对象，默认无需参数。
+**参数**
+* `*args`： 用户额外指定的列表类型的参数。
+* `**kwargs`：用户额外指定的关键字字典类型的参数。
+关于额外参数的详情可参考[paddlenlp.embeddings](https://github.com/PaddlePaddle/models/tree/release/2.0-beta/PaddleNLP/paddlenlp/embeddings)
+```python
+def search(
+    words: Union[List[str], str, int],
+)
+```
+获取一个或多个词的embedding。输入可以是`str`、`List[str]`和`int`类型，分别代表获取一个词，多个词和指定词编号的embedding，词的编号和模型的词典相关，词典可通过模型实例的`vocab`属性获取。
+**参数**
+* `words`： 需要获取的词向量的词、词列表或者词编号。
+```python
+def cosine_sim(
+    word_a: str,
+    word_b: str,
+)
+```
+计算两个词embedding的余弦相似度。需要注意的是`word_a`和`word_b`都需要是词典里的单词，否则将会被认为是OOV(Out-Of-Vocabulary)，同时被替换为`unknown_token`。
+**参数**
+* `word_a`： 需要计算余弦相似度的单词a。
+* `word_b`： 需要计算余弦相似度的单词b。
+```python
+def dot(
+    word_a: str,
+    word_b: str,
+)
+```
+计算两个词embedding的内积。对于输入单词同样需要注意OOV问题。
+**参数**
+* `word_a`： 需要计算内积的单词a。
+* `word_b`： 需要计算内积的单词b。
+更多api详情和用法可参考[paddlenlp.embeddings](https://github.com/PaddlePaddle/models/tree/release/2.0-beta/PaddleNLP/paddlenlp/embeddings)
+## 代码示例
+```python
+import paddlehub as hub
+embedding = hub.Module(name='fasttext_crawl_target_word-word_dim300_en')
+# 获取单词的embedding
+embedding.search("中国")
+# 计算两个词向量的余弦相似度
+embedding.cosine_sim("中国", "美国")
+# 计算两个词向量的内积
+embedding.dot("中国", "美国")
+```
+## 部署服务
+通过PaddleHub Serving，可以部署一个在线获取两个词向量的余弦相似度的服务。
+### Step1: 启动PaddleHub Serving
+运行启动命令：
+```shell
+$ hub serving start -m fasttext_crawl_target_word-word_dim300_en
+```
+这样就完成了一个获取词向量的余弦相似度服务化API的部署，默认端口号为8866。
+**NOTE:** 如使用GPU预测，则需要在启动服务之前，请设置CUDA_VISIBLE_DEVICES环境变量，否则不用设置。
+### Step2: 发送预测请求
+配置好服务端，以下数行代码即可实现发送预测请求，获取预测结果
+```python
+import requests
+import json
+# 指定用于计算余弦相似度的单词对[[word_a, word_b], [word_a, word_b], ... ]]
+word_pairs = [["中国", "美国"], ["今天", "明天"]]
+# 以key的方式指定word_pairs传入预测方法的时的参数，此例中为"data"，对于每一对单词，调用cosine_sim进行余弦相似度的计算
+data = {"data": word_pairs}
+# 发送post请求，content-type类型应指定json方式，url中的ip地址需改为对应机器的ip
+url = "http://10.12.121.132:8866/predict/fasttext_crawl_target_word-word_dim300_en"
+# 指定post请求的headers为application/json方式
+headers = {"Content-Type": "application/json"}
+r = requests.post(url=url, headers=headers, data=json.dumps(data))
+print(r.json())
+```
+## 查看代码
+https://github.com/PaddlePaddle/models/tree/release/2.0-beta/PaddleNLP/paddlenlp/embeddings
+## 依赖
+paddlepaddle >= 2.0.0
+paddlehub >= 2.0.0
+## 更新历史
+* 1.0.0
+  初始发布
--- a/modules/text/embedding/fasttext_crawl_target_word-word_dim300_en/__init__.py
+++ b/modules/text/embedding/fasttext_crawl_target_word-word_dim300_en/__init__.py
--- a/modules/text/embedding/fasttext_crawl_target_word-word_dim300_en/module.py
+++ b/modules/text/embedding/fasttext_crawl_target_word-word_dim300_en/module.py
+# Copyright (c) 2020 PaddlePaddle Authors. All Rights Reserved.
+#
+# Licensed under the Apache License, Version 2.0 (the "License");
+# you may not use this file except in compliance with the License.
+# You may obtain a copy of the License at
+#
+#     http://www.apache.org/licenses/LICENSE-2.0
+#
+# Unless required by applicable law or agreed to in writing, software
+# distributed under the License is distributed on an "AS IS" BASIS,
+# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+# See the License for the specific language governing permissions and
+# limitations under the License.
+from typing import List
+from paddlenlp.embeddings import TokenEmbedding
+from paddlehub.module.module import moduleinfo, serving
+@moduleinfo(
+    name="fasttext_crawl_target_word-word_dim300_en",
+    version="1.0.0",
+    summary="",
+    author="paddlepaddle",
+    author_email="",
+    type="nlp/semantic_model")
+class Embedding(TokenEmbedding):
+    """
+    Embedding model
+    """
+    def __init__(self, *args, **kwargs):
+        super(Embedding, self).__init__(embedding_name="fasttext.crawl.target.word-word.dim300.en", *args, **kwargs)
+    @serving
+    def calc_similarity(self, data: List[List[str]]):
+        """
+        Calculate similarities of giving word pairs.
+        """
+        results = []
+        for word_pair in data:
+            if len(word_pair) != 2:
+                raise RuntimeError(
+                    f'The input must have two words, but got {len(word_pair)}. Please check your inputs.')
+            if not isinstance(word_pair[0], str) or not isinstance(word_pair[1], str):
+                raise RuntimeError(
+                    f'The types of text pair must be (str, str), but got'
+                    f' ({type(word_pair[0]).__name__}, {type(word_pair[1]).__name__}). Please check your inputs.')
+            for word in word_pair:
+                if self.get_idx_from_word(word) == \
+                        self.get_idx_from_word(self.vocab.unk_token):
+                    raise RuntimeError(
+                        f'Word "{word}" is not in vocab. Please check your inputs.')
+            results.append(str(self.cosine_sim(*word_pair)))
+        return results
--- a/modules/text/embedding/fasttext_wiki-news_target_word-word_dim300_en/README.md
+++ b/modules/text/embedding/fasttext_wiki-news_target_word-word_dim300_en/README.md
+## 概述
+PaddleHub提供多个开源的预训练Embedding模型。这些Embedding模型可根据不同语料、不同训练方式和不同的维度进行区分，关于模型的具体信息可参考PaddleNLP的文档：[Embedding模型汇总](https://github.com/PaddlePaddle/models/blob/release/2.0-beta/PaddleNLP/docs/embeddings.md)
+## API
+```python
+def __init__(
+    *args,
+    **kwargs
+)
+```
+创建一个Embedding Module对象，默认无需参数。
+**参数**
+* `*args`： 用户额外指定的列表类型的参数。
+* `**kwargs`：用户额外指定的关键字字典类型的参数。
+关于额外参数的详情可参考[paddlenlp.embeddings](https://github.com/PaddlePaddle/models/tree/release/2.0-beta/PaddleNLP/paddlenlp/embeddings)
+```python
+def search(
+    words: Union[List[str], str, int],
+)
+```
+获取一个或多个词的embedding。输入可以是`str`、`List[str]`和`int`类型，分别代表获取一个词，多个词和指定词编号的embedding，词的编号和模型的词典相关，词典可通过模型实例的`vocab`属性获取。
+**参数**
+* `words`： 需要获取的词向量的词、词列表或者词编号。
+```python
+def cosine_sim(
+    word_a: str,
+    word_b: str,
+)
+```
+计算两个词embedding的余弦相似度。需要注意的是`word_a`和`word_b`都需要是词典里的单词，否则将会被认为是OOV(Out-Of-Vocabulary)，同时被替换为`unknown_token`。
+**参数**
+* `word_a`： 需要计算余弦相似度的单词a。
+* `word_b`： 需要计算余弦相似度的单词b。
+```python
+def dot(
+    word_a: str,
+    word_b: str,
+)
+```
+计算两个词embedding的内积。对于输入单词同样需要注意OOV问题。
+**参数**
+* `word_a`： 需要计算内积的单词a。
+* `word_b`： 需要计算内积的单词b。
+更多api详情和用法可参考[paddlenlp.embeddings](https://github.com/PaddlePaddle/models/tree/release/2.0-beta/PaddleNLP/paddlenlp/embeddings)
+## 代码示例
+```python
+import paddlehub as hub
+embedding = hub.Module(name='fasttext_wiki-news_target_word-word_dim300_en')
+# 获取单词的embedding
+embedding.search("中国")
+# 计算两个词向量的余弦相似度
+embedding.cosine_sim("中国", "美国")
+# 计算两个词向量的内积
+embedding.dot("中国", "美国")
+```
+## 部署服务
+通过PaddleHub Serving，可以部署一个在线获取两个词向量的余弦相似度的服务。
+### Step1: 启动PaddleHub Serving
+运行启动命令：
+```shell
+$ hub serving start -m fasttext_wiki-news_target_word-word_dim300_en
+```
+这样就完成了一个获取词向量的余弦相似度服务化API的部署，默认端口号为8866。
+**NOTE:** 如使用GPU预测，则需要在启动服务之前，请设置CUDA_VISIBLE_DEVICES环境变量，否则不用设置。
+### Step2: 发送预测请求
+配置好服务端，以下数行代码即可实现发送预测请求，获取预测结果
+```python
+import requests
+import json
+# 指定用于计算余弦相似度的单词对[[word_a, word_b], [word_a, word_b], ... ]]
+word_pairs = [["中国", "美国"], ["今天", "明天"]]
+# 以key的方式指定word_pairs传入预测方法的时的参数，此例中为"data"，对于每一对单词，调用cosine_sim进行余弦相似度的计算
+data = {"data": word_pairs}
+# 发送post请求，content-type类型应指定json方式，url中的ip地址需改为对应机器的ip
+url = "http://10.12.121.132:8866/predict/fasttext_wiki-news_target_word-word_dim300_en"
+# 指定post请求的headers为application/json方式
+headers = {"Content-Type": "application/json"}
+r = requests.post(url=url, headers=headers, data=json.dumps(data))
+print(r.json())
+```
+## 查看代码
+https://github.com/PaddlePaddle/models/tree/release/2.0-beta/PaddleNLP/paddlenlp/embeddings
+## 依赖
+paddlepaddle >= 2.0.0
+paddlehub >= 2.0.0
+## 更新历史
+* 1.0.0
+  初始发布
--- a/modules/text/embedding/fasttext_wiki-news_target_word-word_dim300_en/__init__.py
+++ b/modules/text/embedding/fasttext_wiki-news_target_word-word_dim300_en/__init__.py
--- a/modules/text/embedding/fasttext_wiki-news_target_word-word_dim300_en/module.py
+++ b/modules/text/embedding/fasttext_wiki-news_target_word-word_dim300_en/module.py
+# Copyright (c) 2020 PaddlePaddle Authors. All Rights Reserved.
+#
+# Licensed under the Apache License, Version 2.0 (the "License");
+# you may not use this file except in compliance with the License.
+# You may obtain a copy of the License at
+#
+#     http://www.apache.org/licenses/LICENSE-2.0
+#
+# Unless required by applicable law or agreed to in writing, software
+# distributed under the License is distributed on an "AS IS" BASIS,
+# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+# See the License for the specific language governing permissions and
+# limitations under the License.
+from typing import List
+from paddlenlp.embeddings import TokenEmbedding
+from paddlehub.module.module import moduleinfo, serving
+@moduleinfo(
+    name="fasttext_wiki-news_target_word-word_dim300_en",
+    version="1.0.0",
+    summary="",
+    author="paddlepaddle",
+    author_email="",
+    type="nlp/semantic_model")
+class Embedding(TokenEmbedding):
+    """
+    Embedding model
+    """
+    def __init__(self, *args, **kwargs):
+        super(Embedding, self).__init__(embedding_name="fasttext.wiki-news.target.word-word.dim300.en", *args, **kwargs)
+    @serving
+    def calc_similarity(self, data: List[List[str]]):
+        """
+        Calculate similarities of giving word pairs.
+        """
+        results = []
+        for word_pair in data:
+            if len(word_pair) != 2:
+                raise RuntimeError(
+                    f'The input must have two words, but got {len(word_pair)}. Please check your inputs.')
+            if not isinstance(word_pair[0], str) or not isinstance(word_pair[1], str):
+                raise RuntimeError(
+                    f'The types of text pair must be (str, str), but got'
+                    f' ({type(word_pair[0]).__name__}, {type(word_pair[1]).__name__}). Please check your inputs.')
+            for word in word_pair:
+                if self.get_idx_from_word(word) == \
+                        self.get_idx_from_word(self.vocab.unk_token):
+                    raise RuntimeError(
+                        f'Word "{word}" is not in vocab. Please check your inputs.')
+            results.append(str(self.cosine_sim(*word_pair)))
+        return results
--- a/modules/text/embedding/glove_twitter_target_word-word_dim100_en/README.md
+++ b/modules/text/embedding/glove_twitter_target_word-word_dim100_en/README.md
+## 概述
+PaddleHub提供多个开源的预训练Embedding模型。这些Embedding模型可根据不同语料、不同训练方式和不同的维度进行区分，关于模型的具体信息可参考PaddleNLP的文档：[Embedding模型汇总](https://github.com/PaddlePaddle/models/blob/release/2.0-beta/PaddleNLP/docs/embeddings.md)
+## API
+```python
+def __init__(
+    *args,
+    **kwargs
+)
+```
+创建一个Embedding Module对象，默认无需参数。
+**参数**
+* `*args`： 用户额外指定的列表类型的参数。
+* `**kwargs`：用户额外指定的关键字字典类型的参数。
+关于额外参数的详情可参考[paddlenlp.embeddings](https://github.com/PaddlePaddle/models/tree/release/2.0-beta/PaddleNLP/paddlenlp/embeddings)
+```python
+def search(
+    words: Union[List[str], str, int],
+)
+```
+获取一个或多个词的embedding。输入可以是`str`、`List[str]`和`int`类型，分别代表获取一个词，多个词和指定词编号的embedding，词的编号和模型的词典相关，词典可通过模型实例的`vocab`属性获取。
+**参数**
+* `words`： 需要获取的词向量的词、词列表或者词编号。
+```python
+def cosine_sim(
+    word_a: str,
+    word_b: str,
+)
+```
+计算两个词embedding的余弦相似度。需要注意的是`word_a`和`word_b`都需要是词典里的单词，否则将会被认为是OOV(Out-Of-Vocabulary)，同时被替换为`unknown_token`。
+**参数**
+* `word_a`： 需要计算余弦相似度的单词a。
+* `word_b`： 需要计算余弦相似度的单词b。
+```python
+def dot(
+    word_a: str,
+    word_b: str,
+)
+```
+计算两个词embedding的内积。对于输入单词同样需要注意OOV问题。
+**参数**
+* `word_a`： 需要计算内积的单词a。
+* `word_b`： 需要计算内积的单词b。
+更多api详情和用法可参考[paddlenlp.embeddings](https://github.com/PaddlePaddle/models/tree/release/2.0-beta/PaddleNLP/paddlenlp/embeddings)
+## 代码示例
+```python
+import paddlehub as hub
+embedding = hub.Module(name='glove_twitter_target_word-word_dim100_en')
+# 获取单词的embedding
+embedding.search("中国")
+# 计算两个词向量的余弦相似度
+embedding.cosine_sim("中国", "美国")
+# 计算两个词向量的内积
+embedding.dot("中国", "美国")
+```
+## 部署服务
+通过PaddleHub Serving，可以部署一个在线获取两个词向量的余弦相似度的服务。
+### Step1: 启动PaddleHub Serving
+运行启动命令：
+```shell
+$ hub serving start -m glove_twitter_target_word-word_dim100_en
+```
+这样就完成了一个获取词向量的余弦相似度服务化API的部署，默认端口号为8866。
+**NOTE:** 如使用GPU预测，则需要在启动服务之前，请设置CUDA_VISIBLE_DEVICES环境变量，否则不用设置。
+### Step2: 发送预测请求
+配置好服务端，以下数行代码即可实现发送预测请求，获取预测结果
+```python
+import requests
+import json
+# 指定用于计算余弦相似度的单词对[[word_a, word_b], [word_a, word_b], ... ]]
+word_pairs = [["中国", "美国"], ["今天", "明天"]]
+# 以key的方式指定word_pairs传入预测方法的时的参数，此例中为"data"，对于每一对单词，调用cosine_sim进行余弦相似度的计算
+data = {"data": word_pairs}
+# 发送post请求，content-type类型应指定json方式，url中的ip地址需改为对应机器的ip
+url = "http://10.12.121.132:8866/predict/glove_twitter_target_word-word_dim100_en"
+# 指定post请求的headers为application/json方式
+headers = {"Content-Type": "application/json"}
+r = requests.post(url=url, headers=headers, data=json.dumps(data))
+print(r.json())
+```
+## 查看代码
+https://github.com/PaddlePaddle/models/tree/release/2.0-beta/PaddleNLP/paddlenlp/embeddings
+## 依赖
+paddlepaddle >= 2.0.0
+paddlehub >= 2.0.0
+## 更新历史
+* 1.0.0
+  初始发布
--- a/modules/text/embedding/glove_twitter_target_word-word_dim100_en/__init__.py
+++ b/modules/text/embedding/glove_twitter_target_word-word_dim100_en/__init__.py
--- a/modules/text/embedding/glove_twitter_target_word-word_dim100_en/module.py
+++ b/modules/text/embedding/glove_twitter_target_word-word_dim100_en/module.py
+# Copyright (c) 2020 PaddlePaddle Authors. All Rights Reserved.
+#
+# Licensed under the Apache License, Version 2.0 (the "License");
+# you may not use this file except in compliance with the License.
+# You may obtain a copy of the License at
+#
+#     http://www.apache.org/licenses/LICENSE-2.0
+#
+# Unless required by applicable law or agreed to in writing, software
+# distributed under the License is distributed on an "AS IS" BASIS,
+# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+# See the License for the specific language governing permissions and
+# limitations under the License.
+from typing import List
+from paddlenlp.embeddings import TokenEmbedding
+from paddlehub.module.module import moduleinfo, serving
+@moduleinfo(
+    name="glove_twitter_target_word-word_dim100_en",
+    version="1.0.0",
+    summary="",
+    author="paddlepaddle",
+    author_email="",
+    type="nlp/semantic_model")
+class Embedding(TokenEmbedding):
+    """
+    Embedding model
+    """
+    def __init__(self, *args, **kwargs):
+        super(Embedding, self).__init__(embedding_name="glove.twitter.target.word-word.dim100.en", *args, **kwargs)
+    @serving
+    def calc_similarity(self, data: List[List[str]]):
+        """
+        Calculate similarities of giving word pairs.
+        """
+        results = []
+        for word_pair in data:
+            if len(word_pair) != 2:
+                raise RuntimeError(
+                    f'The input must have two words, but got {len(word_pair)}. Please check your inputs.')
+            if not isinstance(word_pair[0], str) or not isinstance(word_pair[1], str):
+                raise RuntimeError(
+                    f'The types of text pair must be (str, str), but got'
+                    f' ({type(word_pair[0]).__name__}, {type(word_pair[1]).__name__}). Please check your inputs.')
+            for word in word_pair:
+                if self.get_idx_from_word(word) == \
+                        self.get_idx_from_word(self.vocab.unk_token):
+                    raise RuntimeError(
+                        f'Word "{word}" is not in vocab. Please check your inputs.')
+            results.append(str(self.cosine_sim(*word_pair)))
+        return results
--- a/modules/text/embedding/glove_twitter_target_word-word_dim200_en/README.md
+++ b/modules/text/embedding/glove_twitter_target_word-word_dim200_en/README.md
+## 概述
+PaddleHub提供多个开源的预训练Embedding模型。这些Embedding模型可根据不同语料、不同训练方式和不同的维度进行区分，关于模型的具体信息可参考PaddleNLP的文档：[Embedding模型汇总](https://github.com/PaddlePaddle/models/blob/release/2.0-beta/PaddleNLP/docs/embeddings.md)
+## API
+```python
+def __init__(
+    *args,
+    **kwargs
+)
+```
+创建一个Embedding Module对象，默认无需参数。
+**参数**
+* `*args`： 用户额外指定的列表类型的参数。
+* `**kwargs`：用户额外指定的关键字字典类型的参数。
+关于额外参数的详情可参考[paddlenlp.embeddings](https://github.com/PaddlePaddle/models/tree/release/2.0-beta/PaddleNLP/paddlenlp/embeddings)
+```python
+def search(
+    words: Union[List[str], str, int],
+)
+```
+获取一个或多个词的embedding。输入可以是`str`、`List[str]`和`int`类型，分别代表获取一个词，多个词和指定词编号的embedding，词的编号和模型的词典相关，词典可通过模型实例的`vocab`属性获取。
+**参数**
+* `words`： 需要获取的词向量的词、词列表或者词编号。
+```python
+def cosine_sim(
+    word_a: str,
+    word_b: str,
+)
+```
+计算两个词embedding的余弦相似度。需要注意的是`word_a`和`word_b`都需要是词典里的单词，否则将会被认为是OOV(Out-Of-Vocabulary)，同时被替换为`unknown_token`。
+**参数**
+* `word_a`： 需要计算余弦相似度的单词a。
+* `word_b`： 需要计算余弦相似度的单词b。
+```python
+def dot(
+    word_a: str,
+    word_b: str,
+)
+```
+计算两个词embedding的内积。对于输入单词同样需要注意OOV问题。
+**参数**
+* `word_a`： 需要计算内积的单词a。
+* `word_b`： 需要计算内积的单词b。
+更多api详情和用法可参考[paddlenlp.embeddings](https://github.com/PaddlePaddle/models/tree/release/2.0-beta/PaddleNLP/paddlenlp/embeddings)
+## 代码示例
+```python
+import paddlehub as hub
+embedding = hub.Module(name='glove_twitter_target_word-word_dim200_en')
+# 获取单词的embedding
+embedding.search("中国")
+# 计算两个词向量的余弦相似度
+embedding.cosine_sim("中国", "美国")
+# 计算两个词向量的内积
+embedding.dot("中国", "美国")
+```
+## 部署服务
+通过PaddleHub Serving，可以部署一个在线获取两个词向量的余弦相似度的服务。
+### Step1: 启动PaddleHub Serving
+运行启动命令：
+```shell
+$ hub serving start -m glove_twitter_target_word-word_dim200_en
+```
+这样就完成了一个获取词向量的余弦相似度服务化API的部署，默认端口号为8866。
+**NOTE:** 如使用GPU预测，则需要在启动服务之前，请设置CUDA_VISIBLE_DEVICES环境变量，否则不用设置。
+### Step2: 发送预测请求
+配置好服务端，以下数行代码即可实现发送预测请求，获取预测结果
+```python
+import requests
+import json
+# 指定用于计算余弦相似度的单词对[[word_a, word_b], [word_a, word_b], ... ]]
+word_pairs = [["中国", "美国"], ["今天", "明天"]]
+# 以key的方式指定word_pairs传入预测方法的时的参数，此例中为"data"，对于每一对单词，调用cosine_sim进行余弦相似度的计算
+data = {"data": word_pairs}
+# 发送post请求，content-type类型应指定json方式，url中的ip地址需改为对应机器的ip
+url = "http://10.12.121.132:8866/predict/glove_twitter_target_word-word_dim200_en"
+# 指定post请求的headers为application/json方式
+headers = {"Content-Type": "application/json"}
+r = requests.post(url=url, headers=headers, data=json.dumps(data))
+print(r.json())
+```
+## 查看代码
+https://github.com/PaddlePaddle/models/tree/release/2.0-beta/PaddleNLP/paddlenlp/embeddings
+## 依赖
+paddlepaddle >= 2.0.0
+paddlehub >= 2.0.0
+## 更新历史
+* 1.0.0
+  初始发布
--- a/modules/text/embedding/glove_twitter_target_word-word_dim200_en/__init__.py
+++ b/modules/text/embedding/glove_twitter_target_word-word_dim200_en/__init__.py
--- a/modules/text/embedding/glove_twitter_target_word-word_dim200_en/module.py
+++ b/modules/text/embedding/glove_twitter_target_word-word_dim200_en/module.py
+# Copyright (c) 2020 PaddlePaddle Authors. All Rights Reserved.
+#
+# Licensed under the Apache License, Version 2.0 (the "License");
+# you may not use this file except in compliance with the License.
+# You may obtain a copy of the License at
+#
+#     http://www.apache.org/licenses/LICENSE-2.0
+#
+# Unless required by applicable law or agreed to in writing, software
+# distributed under the License is distributed on an "AS IS" BASIS,
+# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+# See the License for the specific language governing permissions and
+# limitations under the License.
+from typing import List
+from paddlenlp.embeddings import TokenEmbedding
+from paddlehub.module.module import moduleinfo, serving
+@moduleinfo(
+    name="glove_twitter_target_word-word_dim200_en",
+    version="1.0.0",
+    summary="",
+    author="paddlepaddle",
+    author_email="",
+    type="nlp/semantic_model")
+class Embedding(TokenEmbedding):
+    """
+    Embedding model
+    """
+    def __init__(self, *args, **kwargs):
+        super(Embedding, self).__init__(embedding_name="glove.twitter.target.word-word.dim200.en", *args, **kwargs)
+    @serving
+    def calc_similarity(self, data: List[List[str]]):
+        """
+        Calculate similarities of giving word pairs.
+        """
+        results = []
+        for word_pair in data:
+            if len(word_pair) != 2:
+                raise RuntimeError(
+                    f'The input must have two words, but got {len(word_pair)}. Please check your inputs.')
+            if not isinstance(word_pair[0], str) or not isinstance(word_pair[1], str):
+                raise RuntimeError(
+                    f'The types of text pair must be (str, str), but got'
+                    f' ({type(word_pair[0]).__name__}, {type(word_pair[1]).__name__}). Please check your inputs.')
+            for word in word_pair:
+                if self.get_idx_from_word(word) == \
+                        self.get_idx_from_word(self.vocab.unk_token):
+                    raise RuntimeError(
+                        f'Word "{word}" is not in vocab. Please check your inputs.')
+            results.append(str(self.cosine_sim(*word_pair)))
+        return results
--- a/modules/text/embedding/glove_twitter_target_word-word_dim25_en/README.md
+++ b/modules/text/embedding/glove_twitter_target_word-word_dim25_en/README.md
+## 概述
+PaddleHub提供多个开源的预训练Embedding模型。这些Embedding模型可根据不同语料、不同训练方式和不同的维度进行区分，关于模型的具体信息可参考PaddleNLP的文档：[Embedding模型汇总](https://github.com/PaddlePaddle/models/blob/release/2.0-beta/PaddleNLP/docs/embeddings.md)
+## API
+```python
+def __init__(
+    *args,
+    **kwargs
+)
+```
+创建一个Embedding Module对象，默认无需参数。
+**参数**
+* `*args`： 用户额外指定的列表类型的参数。
+* `**kwargs`：用户额外指定的关键字字典类型的参数。
+关于额外参数的详情可参考[paddlenlp.embeddings](https://github.com/PaddlePaddle/models/tree/release/2.0-beta/PaddleNLP/paddlenlp/embeddings)
+```python
+def search(
+    words: Union[List[str], str, int],
+)
+```
+获取一个或多个词的embedding。输入可以是`str`、`List[str]`和`int`类型，分别代表获取一个词，多个词和指定词编号的embedding，词的编号和模型的词典相关，词典可通过模型实例的`vocab`属性获取。
+**参数**
+* `words`： 需要获取的词向量的词、词列表或者词编号。
+```python
+def cosine_sim(
+    word_a: str,
+    word_b: str,
+)
+```
+计算两个词embedding的余弦相似度。需要注意的是`word_a`和`word_b`都需要是词典里的单词，否则将会被认为是OOV(Out-Of-Vocabulary)，同时被替换为`unknown_token`。
+**参数**
+* `word_a`： 需要计算余弦相似度的单词a。
+* `word_b`： 需要计算余弦相似度的单词b。
+```python
+def dot(
+    word_a: str,
+    word_b: str,
+)
+```
+计算两个词embedding的内积。对于输入单词同样需要注意OOV问题。
+**参数**
+* `word_a`： 需要计算内积的单词a。
+* `word_b`： 需要计算内积的单词b。
+更多api详情和用法可参考[paddlenlp.embeddings](https://github.com/PaddlePaddle/models/tree/release/2.0-beta/PaddleNLP/paddlenlp/embeddings)
+## 代码示例
+```python
+import paddlehub as hub
+embedding = hub.Module(name='glove_twitter_target_word-word_dim25_en')
+# 获取单词的embedding
+embedding.search("中国")
+# 计算两个词向量的余弦相似度
+embedding.cosine_sim("中国", "美国")
+# 计算两个词向量的内积
+embedding.dot("中国", "美国")
+```
+## 部署服务
+通过PaddleHub Serving，可以部署一个在线获取两个词向量的余弦相似度的服务。
+### Step1: 启动PaddleHub Serving
+运行启动命令：
+```shell
+$ hub serving start -m glove_twitter_target_word-word_dim25_en
+```
+这样就完成了一个获取词向量的余弦相似度服务化API的部署，默认端口号为8866。
+**NOTE:** 如使用GPU预测，则需要在启动服务之前，请设置CUDA_VISIBLE_DEVICES环境变量，否则不用设置。
+### Step2: 发送预测请求
+配置好服务端，以下数行代码即可实现发送预测请求，获取预测结果
+```python
+import requests
+import json
+# 指定用于计算余弦相似度的单词对[[word_a, word_b], [word_a, word_b], ... ]]
+word_pairs = [["中国", "美国"], ["今天", "明天"]]
+# 以key的方式指定word_pairs传入预测方法的时的参数，此例中为"data"，对于每一对单词，调用cosine_sim进行余弦相似度的计算
+data = {"data": word_pairs}
+# 发送post请求，content-type类型应指定json方式，url中的ip地址需改为对应机器的ip
+url = "http://10.12.121.132:8866/predict/glove_twitter_target_word-word_dim25_en"
+# 指定post请求的headers为application/json方式
+headers = {"Content-Type": "application/json"}
+r = requests.post(url=url, headers=headers, data=json.dumps(data))
+print(r.json())
+```
+## 查看代码
+https://github.com/PaddlePaddle/models/tree/release/2.0-beta/PaddleNLP/paddlenlp/embeddings
+## 依赖
+paddlepaddle >= 2.0.0
+paddlehub >= 2.0.0
+## 更新历史
+* 1.0.0
+  初始发布
--- a/modules/text/embedding/glove_twitter_target_word-word_dim25_en/__init__.py
+++ b/modules/text/embedding/glove_twitter_target_word-word_dim25_en/__init__.py
--- a/modules/text/embedding/glove_twitter_target_word-word_dim25_en/module.py
+++ b/modules/text/embedding/glove_twitter_target_word-word_dim25_en/module.py
+# Copyright (c) 2020 PaddlePaddle Authors. All Rights Reserved.
+#
+# Licensed under the Apache License, Version 2.0 (the "License");
+# you may not use this file except in compliance with the License.
+# You may obtain a copy of the License at
+#
+#     http://www.apache.org/licenses/LICENSE-2.0
+#
+# Unless required by applicable law or agreed to in writing, software
+# distributed under the License is distributed on an "AS IS" BASIS,
+# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+# See the License for the specific language governing permissions and
+# limitations under the License.
+from typing import List
+from paddlenlp.embeddings import TokenEmbedding
+from paddlehub.module.module import moduleinfo, serving
+@moduleinfo(
+    name="glove_twitter_target_word-word_dim25_en",
+    version="1.0.0",
+    summary="",
+    author="paddlepaddle",
+    author_email="",
+    type="nlp/semantic_model")
+class Embedding(TokenEmbedding):
+    """
+    Embedding model
+    """
+    def __init__(self, *args, **kwargs):
+        super(Embedding, self).__init__(embedding_name="glove.twitter.target.word-word.dim25.en", *args, **kwargs)
+    @serving
+    def calc_similarity(self, data: List[List[str]]):
+        """
+        Calculate similarities of giving word pairs.
+        """
+        results = []
+        for word_pair in data:
+            if len(word_pair) != 2:
+                raise RuntimeError(
+                    f'The input must have two words, but got {len(word_pair)}. Please check your inputs.')
+            if not isinstance(word_pair[0], str) or not isinstance(word_pair[1], str):
+                raise RuntimeError(
+                    f'The types of text pair must be (str, str), but got'
+                    f' ({type(word_pair[0]).__name__}, {type(word_pair[1]).__name__}). Please check your inputs.')
+            for word in word_pair:
+                if self.get_idx_from_word(word) == \
+                        self.get_idx_from_word(self.vocab.unk_token):
+                    raise RuntimeError(
+                        f'Word "{word}" is not in vocab. Please check your inputs.')
+            results.append(str(self.cosine_sim(*word_pair)))
+        return results
--- a/modules/text/embedding/glove_twitter_target_word-word_dim50_en/README.md
+++ b/modules/text/embedding/glove_twitter_target_word-word_dim50_en/README.md
+## 概述
+PaddleHub提供多个开源的预训练Embedding模型。这些Embedding模型可根据不同语料、不同训练方式和不同的维度进行区分，关于模型的具体信息可参考PaddleNLP的文档：[Embedding模型汇总](https://github.com/PaddlePaddle/models/blob/release/2.0-beta/PaddleNLP/docs/embeddings.md)
+## API
+```python
+def __init__(
+    *args,
+    **kwargs
+)
+```
+创建一个Embedding Module对象，默认无需参数。
+**参数**
+* `*args`： 用户额外指定的列表类型的参数。
+* `**kwargs`：用户额外指定的关键字字典类型的参数。
+关于额外参数的详情可参考[paddlenlp.embeddings](https://github.com/PaddlePaddle/models/tree/release/2.0-beta/PaddleNLP/paddlenlp/embeddings)
+```python
+def search(
+    words: Union[List[str], str, int],
+)
+```
+获取一个或多个词的embedding。输入可以是`str`、`List[str]`和`int`类型，分别代表获取一个词，多个词和指定词编号的embedding，词的编号和模型的词典相关，词典可通过模型实例的`vocab`属性获取。
+**参数**
+* `words`： 需要获取的词向量的词、词列表或者词编号。
+```python
+def cosine_sim(
+    word_a: str,
+    word_b: str,
+)
+```
+计算两个词embedding的余弦相似度。需要注意的是`word_a`和`word_b`都需要是词典里的单词，否则将会被认为是OOV(Out-Of-Vocabulary)，同时被替换为`unknown_token`。
+**参数**
+* `word_a`： 需要计算余弦相似度的单词a。
+* `word_b`： 需要计算余弦相似度的单词b。
+```python
+def dot(
+    word_a: str,
+    word_b: str,
+)
+```
+计算两个词embedding的内积。对于输入单词同样需要注意OOV问题。
+**参数**
+* `word_a`： 需要计算内积的单词a。
+* `word_b`： 需要计算内积的单词b。
+更多api详情和用法可参考[paddlenlp.embeddings](https://github.com/PaddlePaddle/models/tree/release/2.0-beta/PaddleNLP/paddlenlp/embeddings)
+## 代码示例
+```python
+import paddlehub as hub
+embedding = hub.Module(name='glove_twitter_target_word-word_dim50_en')
+# 获取单词的embedding
+embedding.search("中国")
+# 计算两个词向量的余弦相似度
+embedding.cosine_sim("中国", "美国")
+# 计算两个词向量的内积
+embedding.dot("中国", "美国")
+```
+## 部署服务
+通过PaddleHub Serving，可以部署一个在线获取两个词向量的余弦相似度的服务。
+### Step1: 启动PaddleHub Serving
+运行启动命令：
+```shell
+$ hub serving start -m glove_twitter_target_word-word_dim50_en
+```
+这样就完成了一个获取词向量的余弦相似度服务化API的部署，默认端口号为8866。
+**NOTE:** 如使用GPU预测，则需要在启动服务之前，请设置CUDA_VISIBLE_DEVICES环境变量，否则不用设置。
+### Step2: 发送预测请求
+配置好服务端，以下数行代码即可实现发送预测请求，获取预测结果
+```python
+import requests
+import json
+# 指定用于计算余弦相似度的单词对[[word_a, word_b], [word_a, word_b], ... ]]
+word_pairs = [["中国", "美国"], ["今天", "明天"]]
+# 以key的方式指定word_pairs传入预测方法的时的参数，此例中为"data"，对于每一对单词，调用cosine_sim进行余弦相似度的计算
+data = {"data": word_pairs}
+# 发送post请求，content-type类型应指定json方式，url中的ip地址需改为对应机器的ip
+url = "http://10.12.121.132:8866/predict/glove_twitter_target_word-word_dim50_en"
+# 指定post请求的headers为application/json方式
+headers = {"Content-Type": "application/json"}
+r = requests.post(url=url, headers=headers, data=json.dumps(data))
+print(r.json())
+```
+## 查看代码
+https://github.com/PaddlePaddle/models/tree/release/2.0-beta/PaddleNLP/paddlenlp/embeddings
+## 依赖
+paddlepaddle >= 2.0.0
+paddlehub >= 2.0.0
+## 更新历史
+* 1.0.0
+  初始发布
--- a/modules/text/embedding/glove_twitter_target_word-word_dim50_en/__init__.py
+++ b/modules/text/embedding/glove_twitter_target_word-word_dim50_en/__init__.py
--- a/modules/text/embedding/glove_twitter_target_word-word_dim50_en/module.py
+++ b/modules/text/embedding/glove_twitter_target_word-word_dim50_en/module.py
+# Copyright (c) 2020 PaddlePaddle Authors. All Rights Reserved.
+#
+# Licensed under the Apache License, Version 2.0 (the "License");
+# you may not use this file except in compliance with the License.
+# You may obtain a copy of the License at
+#
+#     http://www.apache.org/licenses/LICENSE-2.0
+#
+# Unless required by applicable law or agreed to in writing, software
+# distributed under the License is distributed on an "AS IS" BASIS,
+# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+# See the License for the specific language governing permissions and
+# limitations under the License.
+from typing import List
+from paddlenlp.embeddings import TokenEmbedding
+from paddlehub.module.module import moduleinfo, serving
+@moduleinfo(
+    name="glove_twitter_target_word-word_dim50_en",
+    version="1.0.0",
+    summary="",
+    author="paddlepaddle",
+    author_email="",
+    type="nlp/semantic_model")
+class Embedding(TokenEmbedding):
+    """
+    Embedding model
+    """
+    def __init__(self, *args, **kwargs):
+        super(Embedding, self).__init__(embedding_name="glove.twitter.target.word-word.dim50.en", *args, **kwargs)
+    @serving
+    def calc_similarity(self, data: List[List[str]]):
+        """
+        Calculate similarities of giving word pairs.
+        """
+        results = []
+        for word_pair in data:
+            if len(word_pair) != 2:
+                raise RuntimeError(
+                    f'The input must have two words, but got {len(word_pair)}. Please check your inputs.')
+            if not isinstance(word_pair[0], str) or not isinstance(word_pair[1], str):
+                raise RuntimeError(
+                    f'The types of text pair must be (str, str), but got'
+                    f' ({type(word_pair[0]).__name__}, {type(word_pair[1]).__name__}). Please check your inputs.')
+            for word in word_pair:
+                if self.get_idx_from_word(word) == \
+                        self.get_idx_from_word(self.vocab.unk_token):
+                    raise RuntimeError(
+                        f'Word "{word}" is not in vocab. Please check your inputs.')
+            results.append(str(self.cosine_sim(*word_pair)))
+        return results
--- a/modules/text/embedding/glove_wiki2014-gigaword_target_word-word_dim100_en/README.md
+++ b/modules/text/embedding/glove_wiki2014-gigaword_target_word-word_dim100_en/README.md
+## 概述
+PaddleHub提供多个开源的预训练Embedding模型。这些Embedding模型可根据不同语料、不同训练方式和不同的维度进行区分，关于模型的具体信息可参考PaddleNLP的文档：[Embedding模型汇总](https://github.com/PaddlePaddle/models/blob/release/2.0-beta/PaddleNLP/docs/embeddings.md)
+## API
+```python
+def __init__(
+    *args,
+    **kwargs
+)
+```
+创建一个Embedding Module对象，默认无需参数。
+**参数**
+* `*args`： 用户额外指定的列表类型的参数。
+* `**kwargs`：用户额外指定的关键字字典类型的参数。
+关于额外参数的详情可参考[paddlenlp.embeddings](https://github.com/PaddlePaddle/models/tree/release/2.0-beta/PaddleNLP/paddlenlp/embeddings)
+```python
+def search(
+    words: Union[List[str], str, int],
+)
+```
+获取一个或多个词的embedding。输入可以是`str`、`List[str]`和`int`类型，分别代表获取一个词，多个词和指定词编号的embedding，词的编号和模型的词典相关，词典可通过模型实例的`vocab`属性获取。
+**参数**
+* `words`： 需要获取的词向量的词、词列表或者词编号。
+```python
+def cosine_sim(
+    word_a: str,
+    word_b: str,
+)
+```
+计算两个词embedding的余弦相似度。需要注意的是`word_a`和`word_b`都需要是词典里的单词，否则将会被认为是OOV(Out-Of-Vocabulary)，同时被替换为`unknown_token`。
+**参数**
+* `word_a`： 需要计算余弦相似度的单词a。
+* `word_b`： 需要计算余弦相似度的单词b。
+```python
+def dot(
+    word_a: str,
+    word_b: str,
+)
+```
+计算两个词embedding的内积。对于输入单词同样需要注意OOV问题。
+**参数**
+* `word_a`： 需要计算内积的单词a。
+* `word_b`： 需要计算内积的单词b。
+更多api详情和用法可参考[paddlenlp.embeddings](https://github.com/PaddlePaddle/models/tree/release/2.0-beta/PaddleNLP/paddlenlp/embeddings)
+## 代码示例
+```python
+import paddlehub as hub
+embedding = hub.Module(name='glove_wiki2014-gigaword_target_word-word_dim100_en')
+# 获取单词的embedding
+embedding.search("中国")
+# 计算两个词向量的余弦相似度
+embedding.cosine_sim("中国", "美国")
+# 计算两个词向量的内积
+embedding.dot("中国", "美国")
+```
+## 部署服务
+通过PaddleHub Serving，可以部署一个在线获取两个词向量的余弦相似度的服务。
+### Step1: 启动PaddleHub Serving
+运行启动命令：
+```shell
+$ hub serving start -m glove_wiki2014-gigaword_target_word-word_dim100_en
+```
+这样就完成了一个获取词向量的余弦相似度服务化API的部署，默认端口号为8866。
+**NOTE:** 如使用GPU预测，则需要在启动服务之前，请设置CUDA_VISIBLE_DEVICES环境变量，否则不用设置。
+### Step2: 发送预测请求
+配置好服务端，以下数行代码即可实现发送预测请求，获取预测结果
+```python
+import requests
+import json
+# 指定用于计算余弦相似度的单词对[[word_a, word_b], [word_a, word_b], ... ]]
+word_pairs = [["中国", "美国"], ["今天", "明天"]]
+# 以key的方式指定word_pairs传入预测方法的时的参数，此例中为"data"，对于每一对单词，调用cosine_sim进行余弦相似度的计算
+data = {"data": word_pairs}
+# 发送post请求，content-type类型应指定json方式，url中的ip地址需改为对应机器的ip
+url = "http://10.12.121.132:8866/predict/glove_wiki2014-gigaword_target_word-word_dim100_en"
+# 指定post请求的headers为application/json方式
+headers = {"Content-Type": "application/json"}
+r = requests.post(url=url, headers=headers, data=json.dumps(data))
+print(r.json())
+```
+## 查看代码
+https://github.com/PaddlePaddle/models/tree/release/2.0-beta/PaddleNLP/paddlenlp/embeddings
+## 依赖
+paddlepaddle >= 2.0.0
+paddlehub >= 2.0.0
+## 更新历史
+* 1.0.0
+  初始发布
--- a/modules/text/embedding/glove_wiki2014-gigaword_target_word-word_dim100_en/__init__.py
+++ b/modules/text/embedding/glove_wiki2014-gigaword_target_word-word_dim100_en/__init__.py
--- a/modules/text/embedding/glove_wiki2014-gigaword_target_word-word_dim100_en/module.py
+++ b/modules/text/embedding/glove_wiki2014-gigaword_target_word-word_dim100_en/module.py
+# Copyright (c) 2020 PaddlePaddle Authors. All Rights Reserved.
+#
+# Licensed under the Apache License, Version 2.0 (the "License");
+# you may not use this file except in compliance with the License.
+# You may obtain a copy of the License at
+#
+#     http://www.apache.org/licenses/LICENSE-2.0
+#
+# Unless required by applicable law or agreed to in writing, software
+# distributed under the License is distributed on an "AS IS" BASIS,
+# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+# See the License for the specific language governing permissions and
+# limitations under the License.
+from typing import List
+from paddlenlp.embeddings import TokenEmbedding
+from paddlehub.module.module import moduleinfo, serving
+@moduleinfo(
+    name="glove_wiki2014-gigaword_target_word-word_dim100_en",
+    version="1.0.0",
+    summary="",
+    author="paddlepaddle",
+    author_email="",
+    type="nlp/semantic_model")
+class Embedding(TokenEmbedding):
+    """
+    Embedding model
+    """
+    def __init__(self, *args, **kwargs):
+        super(Embedding, self).__init__(embedding_name="glove.wiki2014-gigaword.target.word-word.dim100.en", *args, **kwargs)
+    @serving
+    def calc_similarity(self, data: List[List[str]]):
+        """
+        Calculate similarities of giving word pairs.
+        """
+        results = []
+        for word_pair in data:
+            if len(word_pair) != 2:
+                raise RuntimeError(
+                    f'The input must have two words, but got {len(word_pair)}. Please check your inputs.')
+            if not isinstance(word_pair[0], str) or not isinstance(word_pair[1], str):
+                raise RuntimeError(
+                    f'The types of text pair must be (str, str), but got'
+                    f' ({type(word_pair[0]).__name__}, {type(word_pair[1]).__name__}). Please check your inputs.')
+            for word in word_pair:
+                if self.get_idx_from_word(word) == \
+                        self.get_idx_from_word(self.vocab.unk_token):
+                    raise RuntimeError(
+                        f'Word "{word}" is not in vocab. Please check your inputs.')
+            results.append(str(self.cosine_sim(*word_pair)))
+        return results
--- a/modules/text/embedding/glove_wiki2014-gigaword_target_word-word_dim200_en/README.md
+++ b/modules/text/embedding/glove_wiki2014-gigaword_target_word-word_dim200_en/README.md
+## 概述
+PaddleHub提供多个开源的预训练Embedding模型。这些Embedding模型可根据不同语料、不同训练方式和不同的维度进行区分，关于模型的具体信息可参考PaddleNLP的文档：[Embedding模型汇总](https://github.com/PaddlePaddle/models/blob/release/2.0-beta/PaddleNLP/docs/embeddings.md)
+## API
+```python
+def __init__(
+    *args,
+    **kwargs
+)
+```
+创建一个Embedding Module对象，默认无需参数。
+**参数**
+* `*args`： 用户额外指定的列表类型的参数。
+* `**kwargs`：用户额外指定的关键字字典类型的参数。
+关于额外参数的详情可参考[paddlenlp.embeddings](https://github.com/PaddlePaddle/models/tree/release/2.0-beta/PaddleNLP/paddlenlp/embeddings)
+```python
+def search(
+    words: Union[List[str], str, int],
+)
+```
+获取一个或多个词的embedding。输入可以是`str`、`List[str]`和`int`类型，分别代表获取一个词，多个词和指定词编号的embedding，词的编号和模型的词典相关，词典可通过模型实例的`vocab`属性获取。
+**参数**
+* `words`： 需要获取的词向量的词、词列表或者词编号。
+```python
+def cosine_sim(
+    word_a: str,
+    word_b: str,
+)
+```
+计算两个词embedding的余弦相似度。需要注意的是`word_a`和`word_b`都需要是词典里的单词，否则将会被认为是OOV(Out-Of-Vocabulary)，同时被替换为`unknown_token`。
+**参数**
+* `word_a`： 需要计算余弦相似度的单词a。
+* `word_b`： 需要计算余弦相似度的单词b。
+```python
+def dot(
+    word_a: str,
+    word_b: str,
+)
+```
+计算两个词embedding的内积。对于输入单词同样需要注意OOV问题。
+**参数**
+* `word_a`： 需要计算内积的单词a。
+* `word_b`： 需要计算内积的单词b。
+更多api详情和用法可参考[paddlenlp.embeddings](https://github.com/PaddlePaddle/models/tree/release/2.0-beta/PaddleNLP/paddlenlp/embeddings)
+## 代码示例
+```python
+import paddlehub as hub
+embedding = hub.Module(name='glove_wiki2014-gigaword_target_word-word_dim200_en')
+# 获取单词的embedding
+embedding.search("中国")
+# 计算两个词向量的余弦相似度
+embedding.cosine_sim("中国", "美国")
+# 计算两个词向量的内积
+embedding.dot("中国", "美国")
+```
+## 部署服务
+通过PaddleHub Serving，可以部署一个在线获取两个词向量的余弦相似度的服务。
+### Step1: 启动PaddleHub Serving
+运行启动命令：
+```shell
+$ hub serving start -m glove_wiki2014-gigaword_target_word-word_dim200_en
+```
+这样就完成了一个获取词向量的余弦相似度服务化API的部署，默认端口号为8866。
+**NOTE:** 如使用GPU预测，则需要在启动服务之前，请设置CUDA_VISIBLE_DEVICES环境变量，否则不用设置。
+### Step2: 发送预测请求
+配置好服务端，以下数行代码即可实现发送预测请求，获取预测结果
+```python
+import requests
+import json
+# 指定用于计算余弦相似度的单词对[[word_a, word_b], [word_a, word_b], ... ]]
+word_pairs = [["中国", "美国"], ["今天", "明天"]]
+# 以key的方式指定word_pairs传入预测方法的时的参数，此例中为"data"，对于每一对单词，调用cosine_sim进行余弦相似度的计算
+data = {"data": word_pairs}
+# 发送post请求，content-type类型应指定json方式，url中的ip地址需改为对应机器的ip
+url = "http://10.12.121.132:8866/predict/glove_wiki2014-gigaword_target_word-word_dim200_en"
+# 指定post请求的headers为application/json方式
+headers = {"Content-Type": "application/json"}
+r = requests.post(url=url, headers=headers, data=json.dumps(data))
+print(r.json())
+```
+## 查看代码
+https://github.com/PaddlePaddle/models/tree/release/2.0-beta/PaddleNLP/paddlenlp/embeddings
+## 依赖
+paddlepaddle >= 2.0.0
+paddlehub >= 2.0.0
+## 更新历史
+* 1.0.0
+  初始发布
--- a/modules/text/embedding/glove_wiki2014-gigaword_target_word-word_dim200_en/__init__.py
+++ b/modules/text/embedding/glove_wiki2014-gigaword_target_word-word_dim200_en/__init__.py
--- a/modules/text/embedding/glove_wiki2014-gigaword_target_word-word_dim200_en/module.py
+++ b/modules/text/embedding/glove_wiki2014-gigaword_target_word-word_dim200_en/module.py
+# Copyright (c) 2020 PaddlePaddle Authors. All Rights Reserved.
+#
+# Licensed under the Apache License, Version 2.0 (the "License");
+# you may not use this file except in compliance with the License.
+# You may obtain a copy of the License at
+#
+#     http://www.apache.org/licenses/LICENSE-2.0
+#
+# Unless required by applicable law or agreed to in writing, software
+# distributed under the License is distributed on an "AS IS" BASIS,
+# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+# See the License for the specific language governing permissions and
+# limitations under the License.
+from typing import List
+from paddlenlp.embeddings import TokenEmbedding
+from paddlehub.module.module import moduleinfo, serving
+@moduleinfo(
+    name="glove_wiki2014-gigaword_target_word-word_dim200_en",
+    version="1.0.0",
+    summary="",
+    author="paddlepaddle",
+    author_email="",
+    type="nlp/semantic_model")
+class Embedding(TokenEmbedding):
+    """
+    Embedding model
+    """
+    def __init__(self, *args, **kwargs):
+        super(Embedding, self).__init__(embedding_name="glove.wiki2014-gigaword.target.word-word.dim200.en", *args, **kwargs)
+    @serving
+    def calc_similarity(self, data: List[List[str]]):
+        """
+        Calculate similarities of giving word pairs.
+        """
+        results = []
+        for word_pair in data:
+            if len(word_pair) != 2:
+                raise RuntimeError(
+                    f'The input must have two words, but got {len(word_pair)}. Please check your inputs.')
+            if not isinstance(word_pair[0], str) or not isinstance(word_pair[1], str):
+                raise RuntimeError(
+                    f'The types of text pair must be (str, str), but got'
+                    f' ({type(word_pair[0]).__name__}, {type(word_pair[1]).__name__}). Please check your inputs.')
+            for word in word_pair:
+                if self.get_idx_from_word(word) == \
+                        self.get_idx_from_word(self.vocab.unk_token):
+                    raise RuntimeError(
+                        f'Word "{word}" is not in vocab. Please check your inputs.')
+            results.append(str(self.cosine_sim(*word_pair)))
+        return results
--- a/modules/text/embedding/glove_wiki2014-gigaword_target_word-word_dim300_en/README.md
+++ b/modules/text/embedding/glove_wiki2014-gigaword_target_word-word_dim300_en/README.md
+## 概述
+PaddleHub提供多个开源的预训练Embedding模型。这些Embedding模型可根据不同语料、不同训练方式和不同的维度进行区分，关于模型的具体信息可参考PaddleNLP的文档：[Embedding模型汇总](https://github.com/PaddlePaddle/models/blob/release/2.0-beta/PaddleNLP/docs/embeddings.md)
+## API
+```python
+def __init__(
+    *args,
+    **kwargs
+)
+```
+创建一个Embedding Module对象，默认无需参数。
+**参数**
+* `*args`： 用户额外指定的列表类型的参数。
+* `**kwargs`：用户额外指定的关键字字典类型的参数。
+关于额外参数的详情可参考[paddlenlp.embeddings](https://github.com/PaddlePaddle/models/tree/release/2.0-beta/PaddleNLP/paddlenlp/embeddings)
+```python
+def search(
+    words: Union[List[str], str, int],
+)
+```
+获取一个或多个词的embedding。输入可以是`str`、`List[str]`和`int`类型，分别代表获取一个词，多个词和指定词编号的embedding，词的编号和模型的词典相关，词典可通过模型实例的`vocab`属性获取。
+**参数**
+* `words`： 需要获取的词向量的词、词列表或者词编号。
+```python
+def cosine_sim(
+    word_a: str,
+    word_b: str,
+)
+```
+计算两个词embedding的余弦相似度。需要注意的是`word_a`和`word_b`都需要是词典里的单词，否则将会被认为是OOV(Out-Of-Vocabulary)，同时被替换为`unknown_token`。
+**参数**
+* `word_a`： 需要计算余弦相似度的单词a。
+* `word_b`： 需要计算余弦相似度的单词b。
+```python
+def dot(
+    word_a: str,
+    word_b: str,
+)
+```
+计算两个词embedding的内积。对于输入单词同样需要注意OOV问题。
+**参数**
+* `word_a`： 需要计算内积的单词a。
+* `word_b`： 需要计算内积的单词b。
+更多api详情和用法可参考[paddlenlp.embeddings](https://github.com/PaddlePaddle/models/tree/release/2.0-beta/PaddleNLP/paddlenlp/embeddings)
+## 代码示例
+```python
+import paddlehub as hub
+embedding = hub.Module(name='glove_wiki2014-gigaword_target_word-word_dim300_en')
+# 获取单词的embedding
+embedding.search("中国")
+# 计算两个词向量的余弦相似度
+embedding.cosine_sim("中国", "美国")
+# 计算两个词向量的内积
+embedding.dot("中国", "美国")
+```
+## 部署服务
+通过PaddleHub Serving，可以部署一个在线获取两个词向量的余弦相似度的服务。
+### Step1: 启动PaddleHub Serving
+运行启动命令：
+```shell
+$ hub serving start -m glove_wiki2014-gigaword_target_word-word_dim300_en
+```
+这样就完成了一个获取词向量的余弦相似度服务化API的部署，默认端口号为8866。
+**NOTE:** 如使用GPU预测，则需要在启动服务之前，请设置CUDA_VISIBLE_DEVICES环境变量，否则不用设置。
+### Step2: 发送预测请求
+配置好服务端，以下数行代码即可实现发送预测请求，获取预测结果
+```python
+import requests
+import json
+# 指定用于计算余弦相似度的单词对[[word_a, word_b], [word_a, word_b], ... ]]
+word_pairs = [["中国", "美国"], ["今天", "明天"]]
+# 以key的方式指定word_pairs传入预测方法的时的参数，此例中为"data"，对于每一对单词，调用cosine_sim进行余弦相似度的计算
+data = {"data": word_pairs}
+# 发送post请求，content-type类型应指定json方式，url中的ip地址需改为对应机器的ip
+url = "http://10.12.121.132:8866/predict/glove_wiki2014-gigaword_target_word-word_dim300_en"
+# 指定post请求的headers为application/json方式
+headers = {"Content-Type": "application/json"}
+r = requests.post(url=url, headers=headers, data=json.dumps(data))
+print(r.json())
+```
+## 查看代码
+https://github.com/PaddlePaddle/models/tree/release/2.0-beta/PaddleNLP/paddlenlp/embeddings
+## 依赖
+paddlepaddle >= 2.0.0
+paddlehub >= 2.0.0
+## 更新历史
+* 1.0.0
+  初始发布
--- a/modules/text/embedding/glove_wiki2014-gigaword_target_word-word_dim300_en/__init__.py
+++ b/modules/text/embedding/glove_wiki2014-gigaword_target_word-word_dim300_en/__init__.py
--- a/modules/text/embedding/glove_wiki2014-gigaword_target_word-word_dim300_en/module.py
+++ b/modules/text/embedding/glove_wiki2014-gigaword_target_word-word_dim300_en/module.py
+# Copyright (c) 2020 PaddlePaddle Authors. All Rights Reserved.
+#
+# Licensed under the Apache License, Version 2.0 (the "License");
+# you may not use this file except in compliance with the License.
+# You may obtain a copy of the License at
+#
+#     http://www.apache.org/licenses/LICENSE-2.0
+#
+# Unless required by applicable law or agreed to in writing, software
+# distributed under the License is distributed on an "AS IS" BASIS,
+# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+# See the License for the specific language governing permissions and
+# limitations under the License.
+from typing import List
+from paddlenlp.embeddings import TokenEmbedding
+from paddlehub.module.module import moduleinfo, serving
+@moduleinfo(
+    name="glove_wiki2014-gigaword_target_word-word_dim300_en",
+    version="1.0.0",
+    summary="",
+    author="paddlepaddle",
+    author_email="",
+    type="nlp/semantic_model")
+class Embedding(TokenEmbedding):
+    """
+    Embedding model
+    """
+    def __init__(self, *args, **kwargs):
+        super(Embedding, self).__init__(embedding_name="glove.wiki2014-gigaword.target.word-word.dim300.en", *args, **kwargs)
+    @serving
+    def calc_similarity(self, data: List[List[str]]):
+        """
+        Calculate similarities of giving word pairs.
+        """
+        results = []
+        for word_pair in data:
+            if len(word_pair) != 2:
+                raise RuntimeError(
+                    f'The input must have two words, but got {len(word_pair)}. Please check your inputs.')
+            if not isinstance(word_pair[0], str) or not isinstance(word_pair[1], str):
+                raise RuntimeError(
+                    f'The types of text pair must be (str, str), but got'
+                    f' ({type(word_pair[0]).__name__}, {type(word_pair[1]).__name__}). Please check your inputs.')
+            for word in word_pair:
+                if self.get_idx_from_word(word) == \
+                        self.get_idx_from_word(self.vocab.unk_token):
+                    raise RuntimeError(
+                        f'Word "{word}" is not in vocab. Please check your inputs.')
+            results.append(str(self.cosine_sim(*word_pair)))
+        return results
--- a/modules/text/embedding/glove_wiki2014-gigaword_target_word-word_dim50_en/README.md
+++ b/modules/text/embedding/glove_wiki2014-gigaword_target_word-word_dim50_en/README.md
+## 概述
+PaddleHub提供多个开源的预训练Embedding模型。这些Embedding模型可根据不同语料、不同训练方式和不同的维度进行区分，关于模型的具体信息可参考PaddleNLP的文档：[Embedding模型汇总](https://github.com/PaddlePaddle/models/blob/release/2.0-beta/PaddleNLP/docs/embeddings.md)
+## API
+```python
+def __init__(
+    *args,
+    **kwargs
+)
+```
+创建一个Embedding Module对象，默认无需参数。
+**参数**
+* `*args`： 用户额外指定的列表类型的参数。
+* `**kwargs`：用户额外指定的关键字字典类型的参数。
+关于额外参数的详情可参考[paddlenlp.embeddings](https://github.com/PaddlePaddle/models/tree/release/2.0-beta/PaddleNLP/paddlenlp/embeddings)
+```python
+def search(
+    words: Union[List[str], str, int],
+)
+```
+获取一个或多个词的embedding。输入可以是`str`、`List[str]`和`int`类型，分别代表获取一个词，多个词和指定词编号的embedding，词的编号和模型的词典相关，词典可通过模型实例的`vocab`属性获取。
+**参数**
+* `words`： 需要获取的词向量的词、词列表或者词编号。
+```python
+def cosine_sim(
+    word_a: str,
+    word_b: str,
+)
+```
+计算两个词embedding的余弦相似度。需要注意的是`word_a`和`word_b`都需要是词典里的单词，否则将会被认为是OOV(Out-Of-Vocabulary)，同时被替换为`unknown_token`。
+**参数**
+* `word_a`： 需要计算余弦相似度的单词a。
+* `word_b`： 需要计算余弦相似度的单词b。
+```python
+def dot(
+    word_a: str,
+    word_b: str,
+)
+```
+计算两个词embedding的内积。对于输入单词同样需要注意OOV问题。
+**参数**
+* `word_a`： 需要计算内积的单词a。
+* `word_b`： 需要计算内积的单词b。
+更多api详情和用法可参考[paddlenlp.embeddings](https://github.com/PaddlePaddle/models/tree/release/2.0-beta/PaddleNLP/paddlenlp/embeddings)
+## 代码示例
+```python
+import paddlehub as hub
+embedding = hub.Module(name='glove_wiki2014-gigaword_target_word-word_dim50_en')
+# 获取单词的embedding
+embedding.search("中国")
+# 计算两个词向量的余弦相似度
+embedding.cosine_sim("中国", "美国")
+# 计算两个词向量的内积
+embedding.dot("中国", "美国")
+```
+## 部署服务
+通过PaddleHub Serving，可以部署一个在线获取两个词向量的余弦相似度的服务。
+### Step1: 启动PaddleHub Serving
+运行启动命令：
+```shell
+$ hub serving start -m glove_wiki2014-gigaword_target_word-word_dim50_en
+```
+这样就完成了一个获取词向量的余弦相似度服务化API的部署，默认端口号为8866。
+**NOTE:** 如使用GPU预测，则需要在启动服务之前，请设置CUDA_VISIBLE_DEVICES环境变量，否则不用设置。
+### Step2: 发送预测请求
+配置好服务端，以下数行代码即可实现发送预测请求，获取预测结果
+```python
+import requests
+import json
+# 指定用于计算余弦相似度的单词对[[word_a, word_b], [word_a, word_b], ... ]]
+word_pairs = [["中国", "美国"], ["今天", "明天"]]
+# 以key的方式指定word_pairs传入预测方法的时的参数，此例中为"data"，对于每一对单词，调用cosine_sim进行余弦相似度的计算
+data = {"data": word_pairs}
+# 发送post请求，content-type类型应指定json方式，url中的ip地址需改为对应机器的ip
+url = "http://10.12.121.132:8866/predict/glove_wiki2014-gigaword_target_word-word_dim50_en"
+# 指定post请求的headers为application/json方式
+headers = {"Content-Type": "application/json"}
+r = requests.post(url=url, headers=headers, data=json.dumps(data))
+print(r.json())
+```
+## 查看代码
+https://github.com/PaddlePaddle/models/tree/release/2.0-beta/PaddleNLP/paddlenlp/embeddings
+## 依赖
+paddlepaddle >= 2.0.0
+paddlehub >= 2.0.0
+## 更新历史
+* 1.0.0
+  初始发布
--- a/modules/text/embedding/glove_wiki2014-gigaword_target_word-word_dim50_en/__init__.py
+++ b/modules/text/embedding/glove_wiki2014-gigaword_target_word-word_dim50_en/__init__.py
--- a/modules/text/embedding/glove_wiki2014-gigaword_target_word-word_dim50_en/module.py
+++ b/modules/text/embedding/glove_wiki2014-gigaword_target_word-word_dim50_en/module.py
+# Copyright (c) 2020 PaddlePaddle Authors. All Rights Reserved.
+#
+# Licensed under the Apache License, Version 2.0 (the "License");
+# you may not use this file except in compliance with the License.
+# You may obtain a copy of the License at
+#
+#     http://www.apache.org/licenses/LICENSE-2.0
+#
+# Unless required by applicable law or agreed to in writing, software
+# distributed under the License is distributed on an "AS IS" BASIS,
+# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+# See the License for the specific language governing permissions and
+# limitations under the License.
+from typing import List
+from paddlenlp.embeddings import TokenEmbedding
+from paddlehub.module.module import moduleinfo, serving
+@moduleinfo(
+    name="glove_wiki2014-gigaword_target_word-word_dim50_en",
+    version="1.0.0",
+    summary="",
+    author="paddlepaddle",
+    author_email="",
+    type="nlp/semantic_model")
+class Embedding(TokenEmbedding):
+    """
+    Embedding model
+    """
+    def __init__(self, *args, **kwargs):
+        super(Embedding, self).__init__(embedding_name="glove.wiki2014-gigaword.target.word-word.dim50.en", *args, **kwargs)
+    @serving
+    def calc_similarity(self, data: List[List[str]]):
+        """
+        Calculate similarities of giving word pairs.
+        """
+        results = []
+        for word_pair in data:
+            if len(word_pair) != 2:
+                raise RuntimeError(
+                    f'The input must have two words, but got {len(word_pair)}. Please check your inputs.')
+            if not isinstance(word_pair[0], str) or not isinstance(word_pair[1], str):
+                raise RuntimeError(
+                    f'The types of text pair must be (str, str), but got'
+                    f' ({type(word_pair[0]).__name__}, {type(word_pair[1]).__name__}). Please check your inputs.')
+            for word in word_pair:
+                if self.get_idx_from_word(word) == \
+                        self.get_idx_from_word(self.vocab.unk_token):
+                    raise RuntimeError(
+                        f'Word "{word}" is not in vocab. Please check your inputs.')
+            results.append(str(self.cosine_sim(*word_pair)))
+        return results
--- a/modules/text/embedding/w2v_baidu_encyclopedia_context_word-character_char1-1_dim300/README.md
+++ b/modules/text/embedding/w2v_baidu_encyclopedia_context_word-character_char1-1_dim300/README.md
+## 概述
+PaddleHub提供多个开源的预训练Embedding模型。这些Embedding模型可根据不同语料、不同训练方式和不同的维度进行区分，关于模型的具体信息可参考PaddleNLP的文档：[Embedding模型汇总](https://github.com/PaddlePaddle/models/blob/release/2.0-beta/PaddleNLP/docs/embeddings.md)
+## API
+```python
+def __init__(
+    *args,
+    **kwargs
+)
+```
+创建一个Embedding Module对象，默认无需参数。
+**参数**
+* `*args`： 用户额外指定的列表类型的参数。
+* `**kwargs`：用户额外指定的关键字字典类型的参数。
+关于额外参数的详情可参考[paddlenlp.embeddings](https://github.com/PaddlePaddle/models/tree/release/2.0-beta/PaddleNLP/paddlenlp/embeddings)
+```python
+def search(
+    words: Union[List[str], str, int],
+)
+```
+获取一个或多个词的embedding。输入可以是`str`、`List[str]`和`int`类型，分别代表获取一个词，多个词和指定词编号的embedding，词的编号和模型的词典相关，词典可通过模型实例的`vocab`属性获取。
+**参数**
+* `words`： 需要获取的词向量的词、词列表或者词编号。
+```python
+def cosine_sim(
+    word_a: str,
+    word_b: str,
+)
+```
+计算两个词embedding的余弦相似度。需要注意的是`word_a`和`word_b`都需要是词典里的单词，否则将会被认为是OOV(Out-Of-Vocabulary)，同时被替换为`unknown_token`。
+**参数**
+* `word_a`： 需要计算余弦相似度的单词a。
+* `word_b`： 需要计算余弦相似度的单词b。
+```python
+def dot(
+    word_a: str,
+    word_b: str,
+)
+```
+计算两个词embedding的内积。对于输入单词同样需要注意OOV问题。
+**参数**
+* `word_a`： 需要计算内积的单词a。
+* `word_b`： 需要计算内积的单词b。
+更多api详情和用法可参考[paddlenlp.embeddings](https://github.com/PaddlePaddle/models/tree/release/2.0-beta/PaddleNLP/paddlenlp/embeddings)
+## 代码示例
+```python
+import paddlehub as hub
+embedding = hub.Module(name='w2v_baidu_encyclopedia_context_word-character_char1-1_dim300')
+# 获取单词的embedding
+embedding.search("中国")
+# 计算两个词向量的余弦相似度
+embedding.cosine_sim("中国", "美国")
+# 计算两个词向量的内积
+embedding.dot("中国", "美国")
+```
+## 部署服务
+通过PaddleHub Serving，可以部署一个在线获取两个词向量的余弦相似度的服务。
+### Step1: 启动PaddleHub Serving
+运行启动命令：
+```shell
+$ hub serving start -m w2v_baidu_encyclopedia_context_word-character_char1-1_dim300
+```
+这样就完成了一个获取词向量的余弦相似度服务化API的部署，默认端口号为8866。
+**NOTE:** 如使用GPU预测，则需要在启动服务之前，请设置CUDA_VISIBLE_DEVICES环境变量，否则不用设置。
+### Step2: 发送预测请求
+配置好服务端，以下数行代码即可实现发送预测请求，获取预测结果
+```python
+import requests
+import json
+# 指定用于计算余弦相似度的单词对[[word_a, word_b], [word_a, word_b], ... ]]
+word_pairs = [["中国", "美国"], ["今天", "明天"]]
+# 以key的方式指定word_pairs传入预测方法的时的参数，此例中为"data"，对于每一对单词，调用cosine_sim进行余弦相似度的计算
+data = {"data": word_pairs}
+# 发送post请求，content-type类型应指定json方式，url中的ip地址需改为对应机器的ip
+url = "http://10.12.121.132:8866/predict/w2v_baidu_encyclopedia_context_word-character_char1-1_dim300"
+# 指定post请求的headers为application/json方式
+headers = {"Content-Type": "application/json"}
+r = requests.post(url=url, headers=headers, data=json.dumps(data))
+print(r.json())
+```
+## 查看代码
+https://github.com/PaddlePaddle/models/tree/release/2.0-beta/PaddleNLP/paddlenlp/embeddings
+## 依赖
+paddlepaddle >= 2.0.0
+paddlehub >= 2.0.0
+## 更新历史
+* 1.0.0
+  初始发布
--- a/modules/text/embedding/w2v_baidu_encyclopedia_context_word-character_char1-1_dim300/__init__.py
+++ b/modules/text/embedding/w2v_baidu_encyclopedia_context_word-character_char1-1_dim300/__init__.py
--- a/modules/text/embedding/w2v_baidu_encyclopedia_context_word-character_char1-1_dim300/module.py
+++ b/modules/text/embedding/w2v_baidu_encyclopedia_context_word-character_char1-1_dim300/module.py
+# Copyright (c) 2020 PaddlePaddle Authors. All Rights Reserved.
+#
+# Licensed under the Apache License, Version 2.0 (the "License");
+# you may not use this file except in compliance with the License.
+# You may obtain a copy of the License at
+#
+#     http://www.apache.org/licenses/LICENSE-2.0
+#
+# Unless required by applicable law or agreed to in writing, software
+# distributed under the License is distributed on an "AS IS" BASIS,
+# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+# See the License for the specific language governing permissions and
+# limitations under the License.
+from typing import List
+from paddlenlp.embeddings import TokenEmbedding
+from paddlehub.module.module import moduleinfo, serving
+@moduleinfo(
+    name="w2v_baidu_encyclopedia_context_word-character_char1-1_dim300",
+    version="1.0.0",
+    summary="",
+    author="paddlepaddle",
+    author_email="",
+    type="nlp/semantic_model")
+class Embedding(TokenEmbedding):
+    """
+    Embedding model
+    """
+    def __init__(self, *args, **kwargs):
+        super(Embedding, self).__init__(embedding_name="w2v.baidu_encyclopedia.context.word-character.char1-1.dim300", *args, **kwargs)
+    @serving
+    def calc_similarity(self, data: List[List[str]]):
+        """
+        Calculate similarities of giving word pairs.
+        """
+        results = []
+        for word_pair in data:
+            if len(word_pair) != 2:
+                raise RuntimeError(
+                    f'The input must have two words, but got {len(word_pair)}. Please check your inputs.')
+            if not isinstance(word_pair[0], str) or not isinstance(word_pair[1], str):
+                raise RuntimeError(
+                    f'The types of text pair must be (str, str), but got'
+                    f' ({type(word_pair[0]).__name__}, {type(word_pair[1]).__name__}). Please check your inputs.')
+            for word in word_pair:
+                if self.get_idx_from_word(word) == \
+                        self.get_idx_from_word(self.vocab.unk_token):
+                    raise RuntimeError(
+                        f'Word "{word}" is not in vocab. Please check your inputs.')
+            results.append(str(self.cosine_sim(*word_pair)))
+        return results
--- a/modules/text/embedding/w2v_baidu_encyclopedia_context_word-character_char1-2_dim300/README.md
+++ b/modules/text/embedding/w2v_baidu_encyclopedia_context_word-character_char1-2_dim300/README.md
+## 概述
+PaddleHub提供多个开源的预训练Embedding模型。这些Embedding模型可根据不同语料、不同训练方式和不同的维度进行区分，关于模型的具体信息可参考PaddleNLP的文档：[Embedding模型汇总](https://github.com/PaddlePaddle/models/blob/release/2.0-beta/PaddleNLP/docs/embeddings.md)
+## API
+```python
+def __init__(
+    *args,
+    **kwargs
+)
+```
+创建一个Embedding Module对象，默认无需参数。
+**参数**
+* `*args`： 用户额外指定的列表类型的参数。
+* `**kwargs`：用户额外指定的关键字字典类型的参数。
+关于额外参数的详情可参考[paddlenlp.embeddings](https://github.com/PaddlePaddle/models/tree/release/2.0-beta/PaddleNLP/paddlenlp/embeddings)
+```python
+def search(
+    words: Union[List[str], str, int],
+)
+```
+获取一个或多个词的embedding。输入可以是`str`、`List[str]`和`int`类型，分别代表获取一个词，多个词和指定词编号的embedding，词的编号和模型的词典相关，词典可通过模型实例的`vocab`属性获取。
+**参数**
+* `words`： 需要获取的词向量的词、词列表或者词编号。
+```python
+def cosine_sim(
+    word_a: str,
+    word_b: str,
+)
+```
+计算两个词embedding的余弦相似度。需要注意的是`word_a`和`word_b`都需要是词典里的单词，否则将会被认为是OOV(Out-Of-Vocabulary)，同时被替换为`unknown_token`。
+**参数**
+* `word_a`： 需要计算余弦相似度的单词a。
+* `word_b`： 需要计算余弦相似度的单词b。
+```python
+def dot(
+    word_a: str,
+    word_b: str,
+)
+```
+计算两个词embedding的内积。对于输入单词同样需要注意OOV问题。
+**参数**
+* `word_a`： 需要计算内积的单词a。
+* `word_b`： 需要计算内积的单词b。
+更多api详情和用法可参考[paddlenlp.embeddings](https://github.com/PaddlePaddle/models/tree/release/2.0-beta/PaddleNLP/paddlenlp/embeddings)
+## 代码示例
+```python
+import paddlehub as hub
+embedding = hub.Module(name='w2v_baidu_encyclopedia_context_word-character_char1-2_dim300')
+# 获取单词的embedding
+embedding.search("中国")
+# 计算两个词向量的余弦相似度
+embedding.cosine_sim("中国", "美国")
+# 计算两个词向量的内积
+embedding.dot("中国", "美国")
+```
+## 部署服务
+通过PaddleHub Serving，可以部署一个在线获取两个词向量的余弦相似度的服务。
+### Step1: 启动PaddleHub Serving
+运行启动命令：
+```shell
+$ hub serving start -m w2v_baidu_encyclopedia_context_word-character_char1-2_dim300
+```
+这样就完成了一个获取词向量的余弦相似度服务化API的部署，默认端口号为8866。
+**NOTE:** 如使用GPU预测，则需要在启动服务之前，请设置CUDA_VISIBLE_DEVICES环境变量，否则不用设置。
+### Step2: 发送预测请求
+配置好服务端，以下数行代码即可实现发送预测请求，获取预测结果
+```python
+import requests
+import json
+# 指定用于计算余弦相似度的单词对[[word_a, word_b], [word_a, word_b], ... ]]
+word_pairs = [["中国", "美国"], ["今天", "明天"]]
+# 以key的方式指定word_pairs传入预测方法的时的参数，此例中为"data"，对于每一对单词，调用cosine_sim进行余弦相似度的计算
+data = {"data": word_pairs}
+# 发送post请求，content-type类型应指定json方式，url中的ip地址需改为对应机器的ip
+url = "http://10.12.121.132:8866/predict/w2v_baidu_encyclopedia_context_word-character_char1-2_dim300"
+# 指定post请求的headers为application/json方式
+headers = {"Content-Type": "application/json"}
+r = requests.post(url=url, headers=headers, data=json.dumps(data))
+print(r.json())
+```
+## 查看代码
+https://github.com/PaddlePaddle/models/tree/release/2.0-beta/PaddleNLP/paddlenlp/embeddings
+## 依赖
+paddlepaddle >= 2.0.0
+paddlehub >= 2.0.0
+## 更新历史
+* 1.0.0
+  初始发布
--- a/modules/text/embedding/w2v_baidu_encyclopedia_context_word-character_char1-2_dim300/__init__.py
+++ b/modules/text/embedding/w2v_baidu_encyclopedia_context_word-character_char1-2_dim300/__init__.py
--- a/modules/text/embedding/w2v_baidu_encyclopedia_context_word-character_char1-2_dim300/module.py
+++ b/modules/text/embedding/w2v_baidu_encyclopedia_context_word-character_char1-2_dim300/module.py
+# Copyright (c) 2020 PaddlePaddle Authors. All Rights Reserved.
+#
+# Licensed under the Apache License, Version 2.0 (the "License");
+# you may not use this file except in compliance with the License.
+# You may obtain a copy of the License at
+#
+#     http://www.apache.org/licenses/LICENSE-2.0
+#
+# Unless required by applicable law or agreed to in writing, software
+# distributed under the License is distributed on an "AS IS" BASIS,
+# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+# See the License for the specific language governing permissions and
+# limitations under the License.
+from typing import List
+from paddlenlp.embeddings import TokenEmbedding
+from paddlehub.module.module import moduleinfo, serving
+@moduleinfo(
+    name="w2v_baidu_encyclopedia_context_word-character_char1-2_dim300",
+    version="1.0.0",
+    summary="",
+    author="paddlepaddle",
+    author_email="",
+    type="nlp/semantic_model")
+class Embedding(TokenEmbedding):
+    """
+    Embedding model
+    """
+    def __init__(self, *args, **kwargs):
+        super(Embedding, self).__init__(embedding_name="w2v.baidu_encyclopedia.context.word-character.char1-2.dim300", *args, **kwargs)
+    @serving
+    def calc_similarity(self, data: List[List[str]]):
+        """
+        Calculate similarities of giving word pairs.
+        """
+        results = []
+        for word_pair in data:
+            if len(word_pair) != 2:
+                raise RuntimeError(
+                    f'The input must have two words, but got {len(word_pair)}. Please check your inputs.')
+            if not isinstance(word_pair[0], str) or not isinstance(word_pair[1], str):
+                raise RuntimeError(
+                    f'The types of text pair must be (str, str), but got'
+                    f' ({type(word_pair[0]).__name__}, {type(word_pair[1]).__name__}). Please check your inputs.')
+            for word in word_pair:
+                if self.get_idx_from_word(word) == \
+                        self.get_idx_from_word(self.vocab.unk_token):
+                    raise RuntimeError(
+                        f'Word "{word}" is not in vocab. Please check your inputs.')
+            results.append(str(self.cosine_sim(*word_pair)))
+        return results
--- a/modules/text/embedding/w2v_baidu_encyclopedia_context_word-character_char1-4_dim300/README.md
+++ b/modules/text/embedding/w2v_baidu_encyclopedia_context_word-character_char1-4_dim300/README.md
+## 概述
+PaddleHub提供多个开源的预训练Embedding模型。这些Embedding模型可根据不同语料、不同训练方式和不同的维度进行区分，关于模型的具体信息可参考PaddleNLP的文档：[Embedding模型汇总](https://github.com/PaddlePaddle/models/blob/release/2.0-beta/PaddleNLP/docs/embeddings.md)
+## API
+```python
+def __init__(
+    *args,
+    **kwargs
+)
+```
+创建一个Embedding Module对象，默认无需参数。
+**参数**
+* `*args`： 用户额外指定的列表类型的参数。
+* `**kwargs`：用户额外指定的关键字字典类型的参数。
+关于额外参数的详情可参考[paddlenlp.embeddings](https://github.com/PaddlePaddle/models/tree/release/2.0-beta/PaddleNLP/paddlenlp/embeddings)
+```python
+def search(
+    words: Union[List[str], str, int],
+)
+```
+获取一个或多个词的embedding。输入可以是`str`、`List[str]`和`int`类型，分别代表获取一个词，多个词和指定词编号的embedding，词的编号和模型的词典相关，词典可通过模型实例的`vocab`属性获取。
+**参数**
+* `words`： 需要获取的词向量的词、词列表或者词编号。
+```python
+def cosine_sim(
+    word_a: str,
+    word_b: str,
+)
+```
+计算两个词embedding的余弦相似度。需要注意的是`word_a`和`word_b`都需要是词典里的单词，否则将会被认为是OOV(Out-Of-Vocabulary)，同时被替换为`unknown_token`。
+**参数**
+* `word_a`： 需要计算余弦相似度的单词a。
+* `word_b`： 需要计算余弦相似度的单词b。
+```python
+def dot(
+    word_a: str,
+    word_b: str,
+)
+```
+计算两个词embedding的内积。对于输入单词同样需要注意OOV问题。
+**参数**
+* `word_a`： 需要计算内积的单词a。
+* `word_b`： 需要计算内积的单词b。
+更多api详情和用法可参考[paddlenlp.embeddings](https://github.com/PaddlePaddle/models/tree/release/2.0-beta/PaddleNLP/paddlenlp/embeddings)
+## 代码示例
+```python
+import paddlehub as hub
+embedding = hub.Module(name='w2v_baidu_encyclopedia_context_word-character_char1-4_dim300')
+# 获取单词的embedding
+embedding.search("中国")
+# 计算两个词向量的余弦相似度
+embedding.cosine_sim("中国", "美国")
+# 计算两个词向量的内积
+embedding.dot("中国", "美国")
+```
+## 部署服务
+通过PaddleHub Serving，可以部署一个在线获取两个词向量的余弦相似度的服务。
+### Step1: 启动PaddleHub Serving
+运行启动命令：
+```shell
+$ hub serving start -m w2v_baidu_encyclopedia_context_word-character_char1-4_dim300
+```
+这样就完成了一个获取词向量的余弦相似度服务化API的部署，默认端口号为8866。
+**NOTE:** 如使用GPU预测，则需要在启动服务之前，请设置CUDA_VISIBLE_DEVICES环境变量，否则不用设置。
+### Step2: 发送预测请求
+配置好服务端，以下数行代码即可实现发送预测请求，获取预测结果
+```python
+import requests
+import json
+# 指定用于计算余弦相似度的单词对[[word_a, word_b], [word_a, word_b], ... ]]
+word_pairs = [["中国", "美国"], ["今天", "明天"]]
+# 以key的方式指定word_pairs传入预测方法的时的参数，此例中为"data"，对于每一对单词，调用cosine_sim进行余弦相似度的计算
+data = {"data": word_pairs}
+# 发送post请求，content-type类型应指定json方式，url中的ip地址需改为对应机器的ip
+url = "http://10.12.121.132:8866/predict/w2v_baidu_encyclopedia_context_word-character_char1-4_dim300"
+# 指定post请求的headers为application/json方式
+headers = {"Content-Type": "application/json"}
+r = requests.post(url=url, headers=headers, data=json.dumps(data))
+print(r.json())
+```
+## 查看代码
+https://github.com/PaddlePaddle/models/tree/release/2.0-beta/PaddleNLP/paddlenlp/embeddings
+## 依赖
+paddlepaddle >= 2.0.0
+paddlehub >= 2.0.0
+## 更新历史
+* 1.0.0
+  初始发布
--- a/modules/text/embedding/w2v_baidu_encyclopedia_context_word-character_char1-4_dim300/__init__.py
+++ b/modules/text/embedding/w2v_baidu_encyclopedia_context_word-character_char1-4_dim300/__init__.py
--- a/modules/text/embedding/w2v_baidu_encyclopedia_context_word-character_char1-4_dim300/module.py
+++ b/modules/text/embedding/w2v_baidu_encyclopedia_context_word-character_char1-4_dim300/module.py
+# Copyright (c) 2020 PaddlePaddle Authors. All Rights Reserved.
+#
+# Licensed under the Apache License, Version 2.0 (the "License");
+# you may not use this file except in compliance with the License.
+# You may obtain a copy of the License at
+#
+#     http://www.apache.org/licenses/LICENSE-2.0
+#
+# Unless required by applicable law or agreed to in writing, software
+# distributed under the License is distributed on an "AS IS" BASIS,
+# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+# See the License for the specific language governing permissions and
+# limitations under the License.
+from typing import List
+from paddlenlp.embeddings import TokenEmbedding
+from paddlehub.module.module import moduleinfo, serving
+@moduleinfo(
+    name="w2v_baidu_encyclopedia_context_word-character_char1-4_dim300",
+    version="1.0.0",
+    summary="",
+    author="paddlepaddle",
+    author_email="",
+    type="nlp/semantic_model")
+class Embedding(TokenEmbedding):
+    """
+    Embedding model
+    """
+    def __init__(self, *args, **kwargs):
+        super(Embedding, self).__init__(embedding_name="w2v.baidu_encyclopedia.context.word-character.char1-4.dim300", *args, **kwargs)
+    @serving
+    def calc_similarity(self, data: List[List[str]]):
+        """
+        Calculate similarities of giving word pairs.
+        """
+        results = []
+        for word_pair in data:
+            if len(word_pair) != 2:
+                raise RuntimeError(
+                    f'The input must have two words, but got {len(word_pair)}. Please check your inputs.')
+            if not isinstance(word_pair[0], str) or not isinstance(word_pair[1], str):
+                raise RuntimeError(
+                    f'The types of text pair must be (str, str), but got'
+                    f' ({type(word_pair[0]).__name__}, {type(word_pair[1]).__name__}). Please check your inputs.')
+            for word in word_pair:
+                if self.get_idx_from_word(word) == \
+                        self.get_idx_from_word(self.vocab.unk_token):
+                    raise RuntimeError(
+                        f'Word "{word}" is not in vocab. Please check your inputs.')
+            results.append(str(self.cosine_sim(*word_pair)))
+        return results
--- a/modules/text/embedding/w2v_baidu_encyclopedia_context_word-ngram_1-2_dim300/README.md
+++ b/modules/text/embedding/w2v_baidu_encyclopedia_context_word-ngram_1-2_dim300/README.md
+## 概述
+PaddleHub提供多个开源的预训练Embedding模型。这些Embedding模型可根据不同语料、不同训练方式和不同的维度进行区分，关于模型的具体信息可参考PaddleNLP的文档：[Embedding模型汇总](https://github.com/PaddlePaddle/models/blob/release/2.0-beta/PaddleNLP/docs/embeddings.md)
+## API
+```python
+def __init__(
+    *args,
+    **kwargs
+)
+```
+创建一个Embedding Module对象，默认无需参数。
+**参数**
+* `*args`： 用户额外指定的列表类型的参数。
+* `**kwargs`：用户额外指定的关键字字典类型的参数。
+关于额外参数的详情可参考[paddlenlp.embeddings](https://github.com/PaddlePaddle/models/tree/release/2.0-beta/PaddleNLP/paddlenlp/embeddings)
+```python
+def search(
+    words: Union[List[str], str, int],
+)
+```
+获取一个或多个词的embedding。输入可以是`str`、`List[str]`和`int`类型，分别代表获取一个词，多个词和指定词编号的embedding，词的编号和模型的词典相关，词典可通过模型实例的`vocab`属性获取。
+**参数**
+* `words`： 需要获取的词向量的词、词列表或者词编号。
+```python
+def cosine_sim(
+    word_a: str,
+    word_b: str,
+)
+```
+计算两个词embedding的余弦相似度。需要注意的是`word_a`和`word_b`都需要是词典里的单词，否则将会被认为是OOV(Out-Of-Vocabulary)，同时被替换为`unknown_token`。
+**参数**
+* `word_a`： 需要计算余弦相似度的单词a。
+* `word_b`： 需要计算余弦相似度的单词b。
+```python
+def dot(
+    word_a: str,
+    word_b: str,
+)
+```
+计算两个词embedding的内积。对于输入单词同样需要注意OOV问题。
+**参数**
+* `word_a`： 需要计算内积的单词a。
+* `word_b`： 需要计算内积的单词b。
+更多api详情和用法可参考[paddlenlp.embeddings](https://github.com/PaddlePaddle/models/tree/release/2.0-beta/PaddleNLP/paddlenlp/embeddings)
+## 代码示例
+```python
+import paddlehub as hub
+embedding = hub.Module(name='w2v_baidu_encyclopedia_context_word-ngram_1-2_dim300')
+# 获取单词的embedding
+embedding.search("中国")
+# 计算两个词向量的余弦相似度
+embedding.cosine_sim("中国", "美国")
+# 计算两个词向量的内积
+embedding.dot("中国", "美国")
+```
+## 部署服务
+通过PaddleHub Serving，可以部署一个在线获取两个词向量的余弦相似度的服务。
+### Step1: 启动PaddleHub Serving
+运行启动命令：
+```shell
+$ hub serving start -m w2v_baidu_encyclopedia_context_word-ngram_1-2_dim300
+```
+这样就完成了一个获取词向量的余弦相似度服务化API的部署，默认端口号为8866。
+**NOTE:** 如使用GPU预测，则需要在启动服务之前，请设置CUDA_VISIBLE_DEVICES环境变量，否则不用设置。
+### Step2: 发送预测请求
+配置好服务端，以下数行代码即可实现发送预测请求，获取预测结果
+```python
+import requests
+import json
+# 指定用于计算余弦相似度的单词对[[word_a, word_b], [word_a, word_b], ... ]]
+word_pairs = [["中国", "美国"], ["今天", "明天"]]
+# 以key的方式指定word_pairs传入预测方法的时的参数，此例中为"data"，对于每一对单词，调用cosine_sim进行余弦相似度的计算
+data = {"data": word_pairs}
+# 发送post请求，content-type类型应指定json方式，url中的ip地址需改为对应机器的ip
+url = "http://10.12.121.132:8866/predict/w2v_baidu_encyclopedia_context_word-ngram_1-2_dim300"
+# 指定post请求的headers为application/json方式
+headers = {"Content-Type": "application/json"}
+r = requests.post(url=url, headers=headers, data=json.dumps(data))
+print(r.json())
+```
+## 查看代码
+https://github.com/PaddlePaddle/models/tree/release/2.0-beta/PaddleNLP/paddlenlp/embeddings
+## 依赖
+paddlepaddle >= 2.0.0
+paddlehub >= 2.0.0
+## 更新历史
+* 1.0.0
+  初始发布
--- a/modules/text/embedding/w2v_baidu_encyclopedia_context_word-ngram_1-2_dim300/__init__.py
+++ b/modules/text/embedding/w2v_baidu_encyclopedia_context_word-ngram_1-2_dim300/__init__.py
--- a/modules/text/embedding/w2v_baidu_encyclopedia_context_word-ngram_1-2_dim300/module.py
+++ b/modules/text/embedding/w2v_baidu_encyclopedia_context_word-ngram_1-2_dim300/module.py
+# Copyright (c) 2020 PaddlePaddle Authors. All Rights Reserved.
+#
+# Licensed under the Apache License, Version 2.0 (the "License");
+# you may not use this file except in compliance with the License.
+# You may obtain a copy of the License at
+#
+#     http://www.apache.org/licenses/LICENSE-2.0
+#
+# Unless required by applicable law or agreed to in writing, software
+# distributed under the License is distributed on an "AS IS" BASIS,
+# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+# See the License for the specific language governing permissions and
+# limitations under the License.
+from typing import List
+from paddlenlp.embeddings import TokenEmbedding
+from paddlehub.module.module import moduleinfo, serving
+@moduleinfo(
+    name="w2v_baidu_encyclopedia_context_word-ngram_1-2_dim300",
+    version="1.0.0",
+    summary="",
+    author="paddlepaddle",
+    author_email="",
+    type="nlp/semantic_model")
+class Embedding(TokenEmbedding):
+    """
+    Embedding model
+    """
+    def __init__(self, *args, **kwargs):
+        super(Embedding, self).__init__(embedding_name="w2v.baidu_encyclopedia.context.word-ngram.1-2.dim300", *args, **kwargs)
+    @serving
+    def calc_similarity(self, data: List[List[str]]):
+        """
+        Calculate similarities of giving word pairs.
+        """
+        results = []
+        for word_pair in data:
+            if len(word_pair) != 2:
+                raise RuntimeError(
+                    f'The input must have two words, but got {len(word_pair)}. Please check your inputs.')
+            if not isinstance(word_pair[0], str) or not isinstance(word_pair[1], str):
+                raise RuntimeError(
+                    f'The types of text pair must be (str, str), but got'
+                    f' ({type(word_pair[0]).__name__}, {type(word_pair[1]).__name__}). Please check your inputs.')
+            for word in word_pair:
+                if self.get_idx_from_word(word) == \
+                        self.get_idx_from_word(self.vocab.unk_token):
+                    raise RuntimeError(
+                        f'Word "{word}" is not in vocab. Please check your inputs.')
+            results.append(str(self.cosine_sim(*word_pair)))
+        return results
--- a/modules/text/embedding/w2v_baidu_encyclopedia_context_word-ngram_1-3_dim300/README.md
+++ b/modules/text/embedding/w2v_baidu_encyclopedia_context_word-ngram_1-3_dim300/README.md
+## 概述
+PaddleHub提供多个开源的预训练Embedding模型。这些Embedding模型可根据不同语料、不同训练方式和不同的维度进行区分，关于模型的具体信息可参考PaddleNLP的文档：[Embedding模型汇总](https://github.com/PaddlePaddle/models/blob/release/2.0-beta/PaddleNLP/docs/embeddings.md)
+## API
+```python
+def __init__(
+    *args,
+    **kwargs
+)
+```
+创建一个Embedding Module对象，默认无需参数。
+**参数**
+* `*args`： 用户额外指定的列表类型的参数。
+* `**kwargs`：用户额外指定的关键字字典类型的参数。
+关于额外参数的详情可参考[paddlenlp.embeddings](https://github.com/PaddlePaddle/models/tree/release/2.0-beta/PaddleNLP/paddlenlp/embeddings)
+```python
+def search(
+    words: Union[List[str], str, int],
+)
+```
+获取一个或多个词的embedding。输入可以是`str`、`List[str]`和`int`类型，分别代表获取一个词，多个词和指定词编号的embedding，词的编号和模型的词典相关，词典可通过模型实例的`vocab`属性获取。
+**参数**
+* `words`： 需要获取的词向量的词、词列表或者词编号。
+```python
+def cosine_sim(
+    word_a: str,
+    word_b: str,
+)
+```
+计算两个词embedding的余弦相似度。需要注意的是`word_a`和`word_b`都需要是词典里的单词，否则将会被认为是OOV(Out-Of-Vocabulary)，同时被替换为`unknown_token`。
+**参数**
+* `word_a`： 需要计算余弦相似度的单词a。
+* `word_b`： 需要计算余弦相似度的单词b。
+```python
+def dot(
+    word_a: str,
+    word_b: str,
+)
+```
+计算两个词embedding的内积。对于输入单词同样需要注意OOV问题。
+**参数**
+* `word_a`： 需要计算内积的单词a。
+* `word_b`： 需要计算内积的单词b。
+更多api详情和用法可参考[paddlenlp.embeddings](https://github.com/PaddlePaddle/models/tree/release/2.0-beta/PaddleNLP/paddlenlp/embeddings)
+## 代码示例
+```python
+import paddlehub as hub
+embedding = hub.Module(name='w2v_baidu_encyclopedia_context_word-ngram_1-3_dim300')
+# 获取单词的embedding
+embedding.search("中国")
+# 计算两个词向量的余弦相似度
+embedding.cosine_sim("中国", "美国")
+# 计算两个词向量的内积
+embedding.dot("中国", "美国")
+```
+## 部署服务
+通过PaddleHub Serving，可以部署一个在线获取两个词向量的余弦相似度的服务。
+### Step1: 启动PaddleHub Serving
+运行启动命令：
+```shell
+$ hub serving start -m w2v_baidu_encyclopedia_context_word-ngram_1-3_dim300
+```
+这样就完成了一个获取词向量的余弦相似度服务化API的部署，默认端口号为8866。
+**NOTE:** 如使用GPU预测，则需要在启动服务之前，请设置CUDA_VISIBLE_DEVICES环境变量，否则不用设置。
+### Step2: 发送预测请求
+配置好服务端，以下数行代码即可实现发送预测请求，获取预测结果
+```python
+import requests
+import json
+# 指定用于计算余弦相似度的单词对[[word_a, word_b], [word_a, word_b], ... ]]
+word_pairs = [["中国", "美国"], ["今天", "明天"]]
+# 以key的方式指定word_pairs传入预测方法的时的参数，此例中为"data"，对于每一对单词，调用cosine_sim进行余弦相似度的计算
+data = {"data": word_pairs}
+# 发送post请求，content-type类型应指定json方式，url中的ip地址需改为对应机器的ip
+url = "http://10.12.121.132:8866/predict/w2v_baidu_encyclopedia_context_word-ngram_1-3_dim300"
+# 指定post请求的headers为application/json方式
+headers = {"Content-Type": "application/json"}
+r = requests.post(url=url, headers=headers, data=json.dumps(data))
+print(r.json())
+```
+## 查看代码
+https://github.com/PaddlePaddle/models/tree/release/2.0-beta/PaddleNLP/paddlenlp/embeddings
+## 依赖
+paddlepaddle >= 2.0.0
+paddlehub >= 2.0.0
+## 更新历史
+* 1.0.0
+  初始发布
--- a/modules/text/embedding/w2v_baidu_encyclopedia_context_word-ngram_1-3_dim300/__init__.py
+++ b/modules/text/embedding/w2v_baidu_encyclopedia_context_word-ngram_1-3_dim300/__init__.py
--- a/modules/text/embedding/w2v_baidu_encyclopedia_context_word-ngram_1-3_dim300/module.py
+++ b/modules/text/embedding/w2v_baidu_encyclopedia_context_word-ngram_1-3_dim300/module.py
+# Copyright (c) 2020 PaddlePaddle Authors. All Rights Reserved.
+#
+# Licensed under the Apache License, Version 2.0 (the "License");
+# you may not use this file except in compliance with the License.
+# You may obtain a copy of the License at
+#
+#     http://www.apache.org/licenses/LICENSE-2.0
+#
+# Unless required by applicable law or agreed to in writing, software
+# distributed under the License is distributed on an "AS IS" BASIS,
+# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+# See the License for the specific language governing permissions and
+# limitations under the License.
+from typing import List
+from paddlenlp.embeddings import TokenEmbedding
+from paddlehub.module.module import moduleinfo, serving
+@moduleinfo(
+    name="w2v_baidu_encyclopedia_context_word-ngram_1-3_dim300",
+    version="1.0.0",
+    summary="",
+    author="paddlepaddle",
+    author_email="",
+    type="nlp/semantic_model")
+class Embedding(TokenEmbedding):
+    """
+    Embedding model
+    """
+    def __init__(self, *args, **kwargs):
+        super(Embedding, self).__init__(embedding_name="w2v.baidu_encyclopedia.context.word-ngram.1-3.dim300", *args, **kwargs)
+    @serving
+    def calc_similarity(self, data: List[List[str]]):
+        """
+        Calculate similarities of giving word pairs.
+        """
+        results = []
+        for word_pair in data:
+            if len(word_pair) != 2:
+                raise RuntimeError(
+                    f'The input must have two words, but got {len(word_pair)}. Please check your inputs.')
+            if not isinstance(word_pair[0], str) or not isinstance(word_pair[1], str):
+                raise RuntimeError(
+                    f'The types of text pair must be (str, str), but got'
+                    f' ({type(word_pair[0]).__name__}, {type(word_pair[1]).__name__}). Please check your inputs.')
+            for word in word_pair:
+                if self.get_idx_from_word(word) == \
+                        self.get_idx_from_word(self.vocab.unk_token):
+                    raise RuntimeError(
+                        f'Word "{word}" is not in vocab. Please check your inputs.')
+            results.append(str(self.cosine_sim(*word_pair)))
+        return results
--- a/modules/text/embedding/w2v_baidu_encyclopedia_context_word-ngram_2-2_dim300/README.md
+++ b/modules/text/embedding/w2v_baidu_encyclopedia_context_word-ngram_2-2_dim300/README.md
+## 概述
+PaddleHub提供多个开源的预训练Embedding模型。这些Embedding模型可根据不同语料、不同训练方式和不同的维度进行区分，关于模型的具体信息可参考PaddleNLP的文档：[Embedding模型汇总](https://github.com/PaddlePaddle/models/blob/release/2.0-beta/PaddleNLP/docs/embeddings.md)
+## API
+```python
+def __init__(
+    *args,
+    **kwargs
+)
+```
+创建一个Embedding Module对象，默认无需参数。
+**参数**
+* `*args`： 用户额外指定的列表类型的参数。
+* `**kwargs`：用户额外指定的关键字字典类型的参数。
+关于额外参数的详情可参考[paddlenlp.embeddings](https://github.com/PaddlePaddle/models/tree/release/2.0-beta/PaddleNLP/paddlenlp/embeddings)
+```python
+def search(
+    words: Union[List[str], str, int],
+)
+```
+获取一个或多个词的embedding。输入可以是`str`、`List[str]`和`int`类型，分别代表获取一个词，多个词和指定词编号的embedding，词的编号和模型的词典相关，词典可通过模型实例的`vocab`属性获取。
+**参数**
+* `words`： 需要获取的词向量的词、词列表或者词编号。
+```python
+def cosine_sim(
+    word_a: str,
+    word_b: str,
+)
+```
+计算两个词embedding的余弦相似度。需要注意的是`word_a`和`word_b`都需要是词典里的单词，否则将会被认为是OOV(Out-Of-Vocabulary)，同时被替换为`unknown_token`。
+**参数**
+* `word_a`： 需要计算余弦相似度的单词a。
+* `word_b`： 需要计算余弦相似度的单词b。
+```python
+def dot(
+    word_a: str,
+    word_b: str,
+)
+```
+计算两个词embedding的内积。对于输入单词同样需要注意OOV问题。
+**参数**
+* `word_a`： 需要计算内积的单词a。
+* `word_b`： 需要计算内积的单词b。
+更多api详情和用法可参考[paddlenlp.embeddings](https://github.com/PaddlePaddle/models/tree/release/2.0-beta/PaddleNLP/paddlenlp/embeddings)
+## 代码示例
+```python
+import paddlehub as hub
+embedding = hub.Module(name='w2v_baidu_encyclopedia_context_word-ngram_2-2_dim300')
+# 获取单词的embedding
+embedding.search("中国")
+# 计算两个词向量的余弦相似度
+embedding.cosine_sim("中国", "美国")
+# 计算两个词向量的内积
+embedding.dot("中国", "美国")
+```
+## 部署服务
+通过PaddleHub Serving，可以部署一个在线获取两个词向量的余弦相似度的服务。
+### Step1: 启动PaddleHub Serving
+运行启动命令：
+```shell
+$ hub serving start -m w2v_baidu_encyclopedia_context_word-ngram_2-2_dim300
+```
+这样就完成了一个获取词向量的余弦相似度服务化API的部署，默认端口号为8866。
+**NOTE:** 如使用GPU预测，则需要在启动服务之前，请设置CUDA_VISIBLE_DEVICES环境变量，否则不用设置。
+### Step2: 发送预测请求
+配置好服务端，以下数行代码即可实现发送预测请求，获取预测结果
+```python
+import requests
+import json
+# 指定用于计算余弦相似度的单词对[[word_a, word_b], [word_a, word_b], ... ]]
+word_pairs = [["中国", "美国"], ["今天", "明天"]]
+# 以key的方式指定word_pairs传入预测方法的时的参数，此例中为"data"，对于每一对单词，调用cosine_sim进行余弦相似度的计算
+data = {"data": word_pairs}
+# 发送post请求，content-type类型应指定json方式，url中的ip地址需改为对应机器的ip
+url = "http://10.12.121.132:8866/predict/w2v_baidu_encyclopedia_context_word-ngram_2-2_dim300"
+# 指定post请求的headers为application/json方式
+headers = {"Content-Type": "application/json"}
+r = requests.post(url=url, headers=headers, data=json.dumps(data))
+print(r.json())
+```
+## 查看代码
+https://github.com/PaddlePaddle/models/tree/release/2.0-beta/PaddleNLP/paddlenlp/embeddings
+## 依赖
+paddlepaddle >= 2.0.0
+paddlehub >= 2.0.0
+## 更新历史
+* 1.0.0
+  初始发布
--- a/modules/text/embedding/w2v_baidu_encyclopedia_context_word-ngram_2-2_dim300/__init__.py
+++ b/modules/text/embedding/w2v_baidu_encyclopedia_context_word-ngram_2-2_dim300/__init__.py
--- a/modules/text/embedding/w2v_baidu_encyclopedia_context_word-ngram_2-2_dim300/module.py
+++ b/modules/text/embedding/w2v_baidu_encyclopedia_context_word-ngram_2-2_dim300/module.py
+# Copyright (c) 2020 PaddlePaddle Authors. All Rights Reserved.
+#
+# Licensed under the Apache License, Version 2.0 (the "License");
+# you may not use this file except in compliance with the License.
+# You may obtain a copy of the License at
+#
+#     http://www.apache.org/licenses/LICENSE-2.0
+#
+# Unless required by applicable law or agreed to in writing, software
+# distributed under the License is distributed on an "AS IS" BASIS,
+# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+# See the License for the specific language governing permissions and
+# limitations under the License.
+from typing import List
+from paddlenlp.embeddings import TokenEmbedding
+from paddlehub.module.module import moduleinfo, serving
+@moduleinfo(
+    name="w2v_baidu_encyclopedia_context_word-ngram_2-2_dim300",
+    version="1.0.0",
+    summary="",
+    author="paddlepaddle",
+    author_email="",
+    type="nlp/semantic_model")
+class Embedding(TokenEmbedding):
+    """
+    Embedding model
+    """
+    def __init__(self, *args, **kwargs):
+        super(Embedding, self).__init__(embedding_name="w2v.baidu_encyclopedia.context.word-ngram.2-2.dim300", *args, **kwargs)
+    @serving
+    def calc_similarity(self, data: List[List[str]]):
+        """
+        Calculate similarities of giving word pairs.
+        """
+        results = []
+        for word_pair in data:
+            if len(word_pair) != 2:
+                raise RuntimeError(
+                    f'The input must have two words, but got {len(word_pair)}. Please check your inputs.')
+            if not isinstance(word_pair[0], str) or not isinstance(word_pair[1], str):
+                raise RuntimeError(
+                    f'The types of text pair must be (str, str), but got'
+                    f' ({type(word_pair[0]).__name__}, {type(word_pair[1]).__name__}). Please check your inputs.')
+            for word in word_pair:
+                if self.get_idx_from_word(word) == \
+                        self.get_idx_from_word(self.vocab.unk_token):
+                    raise RuntimeError(
+                        f'Word "{word}" is not in vocab. Please check your inputs.')
+            results.append(str(self.cosine_sim(*word_pair)))
+        return results
--- a/modules/text/embedding/w2v_baidu_encyclopedia_context_word-wordLR_dim300/README.md
+++ b/modules/text/embedding/w2v_baidu_encyclopedia_context_word-wordLR_dim300/README.md
+## 概述
+PaddleHub提供多个开源的预训练Embedding模型。这些Embedding模型可根据不同语料、不同训练方式和不同的维度进行区分，关于模型的具体信息可参考PaddleNLP的文档：[Embedding模型汇总](https://github.com/PaddlePaddle/models/blob/release/2.0-beta/PaddleNLP/docs/embeddings.md)
+## API
+```python
+def __init__(
+    *args,
+    **kwargs
+)
+```
+创建一个Embedding Module对象，默认无需参数。
+**参数**
+* `*args`： 用户额外指定的列表类型的参数。
+* `**kwargs`：用户额外指定的关键字字典类型的参数。
+关于额外参数的详情可参考[paddlenlp.embeddings](https://github.com/PaddlePaddle/models/tree/release/2.0-beta/PaddleNLP/paddlenlp/embeddings)
+```python
+def search(
+    words: Union[List[str], str, int],
+)
+```
+获取一个或多个词的embedding。输入可以是`str`、`List[str]`和`int`类型，分别代表获取一个词，多个词和指定词编号的embedding，词的编号和模型的词典相关，词典可通过模型实例的`vocab`属性获取。
+**参数**
+* `words`： 需要获取的词向量的词、词列表或者词编号。
+```python
+def cosine_sim(
+    word_a: str,
+    word_b: str,
+)
+```
+计算两个词embedding的余弦相似度。需要注意的是`word_a`和`word_b`都需要是词典里的单词，否则将会被认为是OOV(Out-Of-Vocabulary)，同时被替换为`unknown_token`。
+**参数**
+* `word_a`： 需要计算余弦相似度的单词a。
+* `word_b`： 需要计算余弦相似度的单词b。
+```python
+def dot(
+    word_a: str,
+    word_b: str,
+)
+```
+计算两个词embedding的内积。对于输入单词同样需要注意OOV问题。
+**参数**
+* `word_a`： 需要计算内积的单词a。
+* `word_b`： 需要计算内积的单词b。
+更多api详情和用法可参考[paddlenlp.embeddings](https://github.com/PaddlePaddle/models/tree/release/2.0-beta/PaddleNLP/paddlenlp/embeddings)
+## 代码示例
+```python
+import paddlehub as hub
+embedding = hub.Module(name='w2v_baidu_encyclopedia_context_word-wordLR_dim300')
+# 获取单词的embedding
+embedding.search("中国")
+# 计算两个词向量的余弦相似度
+embedding.cosine_sim("中国", "美国")
+# 计算两个词向量的内积
+embedding.dot("中国", "美国")
+```
+## 部署服务
+通过PaddleHub Serving，可以部署一个在线获取两个词向量的余弦相似度的服务。
+### Step1: 启动PaddleHub Serving
+运行启动命令：
+```shell
+$ hub serving start -m w2v_baidu_encyclopedia_context_word-wordLR_dim300
+```
+这样就完成了一个获取词向量的余弦相似度服务化API的部署，默认端口号为8866。
+**NOTE:** 如使用GPU预测，则需要在启动服务之前，请设置CUDA_VISIBLE_DEVICES环境变量，否则不用设置。
+### Step2: 发送预测请求
+配置好服务端，以下数行代码即可实现发送预测请求，获取预测结果
+```python
+import requests
+import json
+# 指定用于计算余弦相似度的单词对[[word_a, word_b], [word_a, word_b], ... ]]
+word_pairs = [["中国", "美国"], ["今天", "明天"]]
+# 以key的方式指定word_pairs传入预测方法的时的参数，此例中为"data"，对于每一对单词，调用cosine_sim进行余弦相似度的计算
+data = {"data": word_pairs}
+# 发送post请求，content-type类型应指定json方式，url中的ip地址需改为对应机器的ip
+url = "http://10.12.121.132:8866/predict/w2v_baidu_encyclopedia_context_word-wordLR_dim300"
+# 指定post请求的headers为application/json方式
+headers = {"Content-Type": "application/json"}
+r = requests.post(url=url, headers=headers, data=json.dumps(data))
+print(r.json())
+```
+## 查看代码
+https://github.com/PaddlePaddle/models/tree/release/2.0-beta/PaddleNLP/paddlenlp/embeddings
+## 依赖
+paddlepaddle >= 2.0.0
+paddlehub >= 2.0.0
+## 更新历史
+* 1.0.0
+  初始发布
--- a/modules/text/embedding/w2v_baidu_encyclopedia_context_word-wordLR_dim300/__init__.py
+++ b/modules/text/embedding/w2v_baidu_encyclopedia_context_word-wordLR_dim300/__init__.py
--- a/modules/text/embedding/w2v_baidu_encyclopedia_context_word-wordLR_dim300/module.py
+++ b/modules/text/embedding/w2v_baidu_encyclopedia_context_word-wordLR_dim300/module.py
+# Copyright (c) 2020 PaddlePaddle Authors. All Rights Reserved.
+#
+# Licensed under the Apache License, Version 2.0 (the "License");
+# you may not use this file except in compliance with the License.
+# You may obtain a copy of the License at
+#
+#     http://www.apache.org/licenses/LICENSE-2.0
+#
+# Unless required by applicable law or agreed to in writing, software
+# distributed under the License is distributed on an "AS IS" BASIS,
+# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+# See the License for the specific language governing permissions and
+# limitations under the License.
+from typing import List
+from paddlenlp.embeddings import TokenEmbedding
+from paddlehub.module.module import moduleinfo, serving
+@moduleinfo(
+    name="w2v_baidu_encyclopedia_context_word-wordLR_dim300",
+    version="1.0.0",
+    summary="",
+    author="paddlepaddle",
+    author_email="",
+    type="nlp/semantic_model")
+class Embedding(TokenEmbedding):
+    """
+    Embedding model
+    """
+    def __init__(self, *args, **kwargs):
+        super(Embedding, self).__init__(embedding_name="w2v.baidu_encyclopedia.context.word-wordLR.dim300", *args, **kwargs)
+    @serving
+    def calc_similarity(self, data: List[List[str]]):
+        """
+        Calculate similarities of giving word pairs.
+        """
+        results = []
+        for word_pair in data:
+            if len(word_pair) != 2:
+                raise RuntimeError(
+                    f'The input must have two words, but got {len(word_pair)}. Please check your inputs.')
+            if not isinstance(word_pair[0], str) or not isinstance(word_pair[1], str):
+                raise RuntimeError(
+                    f'The types of text pair must be (str, str), but got'
+                    f' ({type(word_pair[0]).__name__}, {type(word_pair[1]).__name__}). Please check your inputs.')
+            for word in word_pair:
+                if self.get_idx_from_word(word) == \
+                        self.get_idx_from_word(self.vocab.unk_token):
+                    raise RuntimeError(
+                        f'Word "{word}" is not in vocab. Please check your inputs.')
+            results.append(str(self.cosine_sim(*word_pair)))
+        return results
--- a/modules/text/embedding/w2v_baidu_encyclopedia_context_word-wordPosition_dim300/README.md
+++ b/modules/text/embedding/w2v_baidu_encyclopedia_context_word-wordPosition_dim300/README.md
+## 概述
+PaddleHub提供多个开源的预训练Embedding模型。这些Embedding模型可根据不同语料、不同训练方式和不同的维度进行区分，关于模型的具体信息可参考PaddleNLP的文档：[Embedding模型汇总](https://github.com/PaddlePaddle/models/blob/release/2.0-beta/PaddleNLP/docs/embeddings.md)
+## API
+```python
+def __init__(
+    *args,
+    **kwargs
+)
+```
+创建一个Embedding Module对象，默认无需参数。
+**参数**
+* `*args`： 用户额外指定的列表类型的参数。
+* `**kwargs`：用户额外指定的关键字字典类型的参数。
+关于额外参数的详情可参考[paddlenlp.embeddings](https://github.com/PaddlePaddle/models/tree/release/2.0-beta/PaddleNLP/paddlenlp/embeddings)
+```python
+def search(
+    words: Union[List[str], str, int],
+)
+```
+获取一个或多个词的embedding。输入可以是`str`、`List[str]`和`int`类型，分别代表获取一个词，多个词和指定词编号的embedding，词的编号和模型的词典相关，词典可通过模型实例的`vocab`属性获取。
+**参数**
+* `words`： 需要获取的词向量的词、词列表或者词编号。
+```python
+def cosine_sim(
+    word_a: str,
+    word_b: str,
+)
+```
+计算两个词embedding的余弦相似度。需要注意的是`word_a`和`word_b`都需要是词典里的单词，否则将会被认为是OOV(Out-Of-Vocabulary)，同时被替换为`unknown_token`。
+**参数**
+* `word_a`： 需要计算余弦相似度的单词a。
+* `word_b`： 需要计算余弦相似度的单词b。
+```python
+def dot(
+    word_a: str,
+    word_b: str,
+)
+```
+计算两个词embedding的内积。对于输入单词同样需要注意OOV问题。
+**参数**
+* `word_a`： 需要计算内积的单词a。
+* `word_b`： 需要计算内积的单词b。
+更多api详情和用法可参考[paddlenlp.embeddings](https://github.com/PaddlePaddle/models/tree/release/2.0-beta/PaddleNLP/paddlenlp/embeddings)
+## 代码示例
+```python
+import paddlehub as hub
+embedding = hub.Module(name='w2v_baidu_encyclopedia_context_word-wordPosition_dim300')
+# 获取单词的embedding
+embedding.search("中国")
+# 计算两个词向量的余弦相似度
+embedding.cosine_sim("中国", "美国")
+# 计算两个词向量的内积
+embedding.dot("中国", "美国")
+```
+## 部署服务
+通过PaddleHub Serving，可以部署一个在线获取两个词向量的余弦相似度的服务。
+### Step1: 启动PaddleHub Serving
+运行启动命令：
+```shell
+$ hub serving start -m w2v_baidu_encyclopedia_context_word-wordPosition_dim300
+```
+这样就完成了一个获取词向量的余弦相似度服务化API的部署，默认端口号为8866。
+**NOTE:** 如使用GPU预测，则需要在启动服务之前，请设置CUDA_VISIBLE_DEVICES环境变量，否则不用设置。
+### Step2: 发送预测请求
+配置好服务端，以下数行代码即可实现发送预测请求，获取预测结果
+```python
+import requests
+import json
+# 指定用于计算余弦相似度的单词对[[word_a, word_b], [word_a, word_b], ... ]]
+word_pairs = [["中国", "美国"], ["今天", "明天"]]
+# 以key的方式指定word_pairs传入预测方法的时的参数，此例中为"data"，对于每一对单词，调用cosine_sim进行余弦相似度的计算
+data = {"data": word_pairs}
+# 发送post请求，content-type类型应指定json方式，url中的ip地址需改为对应机器的ip
+url = "http://10.12.121.132:8866/predict/w2v_baidu_encyclopedia_context_word-wordPosition_dim300"
+# 指定post请求的headers为application/json方式
+headers = {"Content-Type": "application/json"}
+r = requests.post(url=url, headers=headers, data=json.dumps(data))
+print(r.json())
+```
+## 查看代码
+https://github.com/PaddlePaddle/models/tree/release/2.0-beta/PaddleNLP/paddlenlp/embeddings
+## 依赖
+paddlepaddle >= 2.0.0
+paddlehub >= 2.0.0
+## 更新历史
+* 1.0.0
+  初始发布
--- a/modules/text/embedding/w2v_baidu_encyclopedia_context_word-wordPosition_dim300/__init__.py
+++ b/modules/text/embedding/w2v_baidu_encyclopedia_context_word-wordPosition_dim300/__init__.py
--- a/modules/text/embedding/w2v_baidu_encyclopedia_context_word-wordPosition_dim300/module.py
+++ b/modules/text/embedding/w2v_baidu_encyclopedia_context_word-wordPosition_dim300/module.py
+# Copyright (c) 2020 PaddlePaddle Authors. All Rights Reserved.
+#
+# Licensed under the Apache License, Version 2.0 (the "License");
+# you may not use this file except in compliance with the License.
+# You may obtain a copy of the License at
+#
+#     http://www.apache.org/licenses/LICENSE-2.0
+#
+# Unless required by applicable law or agreed to in writing, software
+# distributed under the License is distributed on an "AS IS" BASIS,
+# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+# See the License for the specific language governing permissions and
+# limitations under the License.
+from typing import List
+from paddlenlp.embeddings import TokenEmbedding
+from paddlehub.module.module import moduleinfo, serving
+@moduleinfo(
+    name="w2v_baidu_encyclopedia_context_word-wordPosition_dim300",
+    version="1.0.0",
+    summary="",
+    author="paddlepaddle",
+    author_email="",
+    type="nlp/semantic_model")
+class Embedding(TokenEmbedding):
+    """
+    Embedding model
+    """
+    def __init__(self, *args, **kwargs):
+        super(Embedding, self).__init__(embedding_name="w2v.baidu_encyclopedia.context.word-wordPosition.dim300", *args, **kwargs)
+    @serving
+    def calc_similarity(self, data: List[List[str]]):
+        """
+        Calculate similarities of giving word pairs.
+        """
+        results = []
+        for word_pair in data:
+            if len(word_pair) != 2:
+                raise RuntimeError(
+                    f'The input must have two words, but got {len(word_pair)}. Please check your inputs.')
+            if not isinstance(word_pair[0], str) or not isinstance(word_pair[1], str):
+                raise RuntimeError(
+                    f'The types of text pair must be (str, str), but got'
+                    f' ({type(word_pair[0]).__name__}, {type(word_pair[1]).__name__}). Please check your inputs.')
+            for word in word_pair:
+                if self.get_idx_from_word(word) == \
+                        self.get_idx_from_word(self.vocab.unk_token):
+                    raise RuntimeError(
+                        f'Word "{word}" is not in vocab. Please check your inputs.')
+            results.append(str(self.cosine_sim(*word_pair)))
+        return results
--- a/modules/text/embedding/w2v_baidu_encyclopedia_context_word-word_dim300/README.md
+++ b/modules/text/embedding/w2v_baidu_encyclopedia_context_word-word_dim300/README.md
+## 概述
+PaddleHub提供多个开源的预训练Embedding模型。这些Embedding模型可根据不同语料、不同训练方式和不同的维度进行区分，关于模型的具体信息可参考PaddleNLP的文档：[Embedding模型汇总](https://github.com/PaddlePaddle/models/blob/release/2.0-beta/PaddleNLP/docs/embeddings.md)
+## API
+```python
+def __init__(
+    *args,
+    **kwargs
+)
+```
+创建一个Embedding Module对象，默认无需参数。
+**参数**
+* `*args`： 用户额外指定的列表类型的参数。
+* `**kwargs`：用户额外指定的关键字字典类型的参数。
+关于额外参数的详情可参考[paddlenlp.embeddings](https://github.com/PaddlePaddle/models/tree/release/2.0-beta/PaddleNLP/paddlenlp/embeddings)
+```python
+def search(
+    words: Union[List[str], str, int],
+)
+```
+获取一个或多个词的embedding。输入可以是`str`、`List[str]`和`int`类型，分别代表获取一个词，多个词和指定词编号的embedding，词的编号和模型的词典相关，词典可通过模型实例的`vocab`属性获取。
+**参数**
+* `words`： 需要获取的词向量的词、词列表或者词编号。
+```python
+def cosine_sim(
+    word_a: str,
+    word_b: str,
+)
+```
+计算两个词embedding的余弦相似度。需要注意的是`word_a`和`word_b`都需要是词典里的单词，否则将会被认为是OOV(Out-Of-Vocabulary)，同时被替换为`unknown_token`。
+**参数**
+* `word_a`： 需要计算余弦相似度的单词a。
+* `word_b`： 需要计算余弦相似度的单词b。
+```python
+def dot(
+    word_a: str,
+    word_b: str,
+)
+```
+计算两个词embedding的内积。对于输入单词同样需要注意OOV问题。
+**参数**
+* `word_a`： 需要计算内积的单词a。
+* `word_b`： 需要计算内积的单词b。
+更多api详情和用法可参考[paddlenlp.embeddings](https://github.com/PaddlePaddle/models/tree/release/2.0-beta/PaddleNLP/paddlenlp/embeddings)
+## 代码示例
+```python
+import paddlehub as hub
+embedding = hub.Module(name='w2v_baidu_encyclopedia_context_word-word_dim300')
+# 获取单词的embedding
+embedding.search("中国")
+# 计算两个词向量的余弦相似度
+embedding.cosine_sim("中国", "美国")
+# 计算两个词向量的内积
+embedding.dot("中国", "美国")
+```
+## 部署服务
+通过PaddleHub Serving，可以部署一个在线获取两个词向量的余弦相似度的服务。
+### Step1: 启动PaddleHub Serving
+运行启动命令：
+```shell
+$ hub serving start -m w2v_baidu_encyclopedia_context_word-word_dim300
+```
+这样就完成了一个获取词向量的余弦相似度服务化API的部署，默认端口号为8866。
+**NOTE:** 如使用GPU预测，则需要在启动服务之前，请设置CUDA_VISIBLE_DEVICES环境变量，否则不用设置。
+### Step2: 发送预测请求
+配置好服务端，以下数行代码即可实现发送预测请求，获取预测结果
+```python
+import requests
+import json
+# 指定用于计算余弦相似度的单词对[[word_a, word_b], [word_a, word_b], ... ]]
+word_pairs = [["中国", "美国"], ["今天", "明天"]]
+# 以key的方式指定word_pairs传入预测方法的时的参数，此例中为"data"，对于每一对单词，调用cosine_sim进行余弦相似度的计算
+data = {"data": word_pairs}
+# 发送post请求，content-type类型应指定json方式，url中的ip地址需改为对应机器的ip
+url = "http://10.12.121.132:8866/predict/w2v_baidu_encyclopedia_context_word-word_dim300"
+# 指定post请求的headers为application/json方式
+headers = {"Content-Type": "application/json"}
+r = requests.post(url=url, headers=headers, data=json.dumps(data))
+print(r.json())
+```
+## 查看代码
+https://github.com/PaddlePaddle/models/tree/release/2.0-beta/PaddleNLP/paddlenlp/embeddings
+## 依赖
+paddlepaddle >= 2.0.0
+paddlehub >= 2.0.0
+## 更新历史
+* 1.0.0
+  初始发布
--- a/modules/text/embedding/w2v_baidu_encyclopedia_context_word-word_dim300/__init__.py
+++ b/modules/text/embedding/w2v_baidu_encyclopedia_context_word-word_dim300/__init__.py
--- a/modules/text/embedding/w2v_baidu_encyclopedia_context_word-word_dim300/module.py
+++ b/modules/text/embedding/w2v_baidu_encyclopedia_context_word-word_dim300/module.py
+# Copyright (c) 2020 PaddlePaddle Authors. All Rights Reserved.
+#
+# Licensed under the Apache License, Version 2.0 (the "License");
+# you may not use this file except in compliance with the License.
+# You may obtain a copy of the License at
+#
+#     http://www.apache.org/licenses/LICENSE-2.0
+#
+# Unless required by applicable law or agreed to in writing, software
+# distributed under the License is distributed on an "AS IS" BASIS,
+# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+# See the License for the specific language governing permissions and
+# limitations under the License.
+from typing import List
+from paddlenlp.embeddings import TokenEmbedding
+from paddlehub.module.module import moduleinfo, serving
+@moduleinfo(
+    name="w2v_baidu_encyclopedia_context_word-word_dim300",
+    version="1.0.0",
+    summary="",
+    author="paddlepaddle",
+    author_email="",
+    type="nlp/semantic_model")
+class Embedding(TokenEmbedding):
+    """
+    Embedding model
+    """
+    def __init__(self, *args, **kwargs):
+        super(Embedding, self).__init__(embedding_name="w2v.baidu_encyclopedia.context.word-word.dim300", *args, **kwargs)
+    @serving
+    def calc_similarity(self, data: List[List[str]]):
+        """
+        Calculate similarities of giving word pairs.
+        """
+        results = []
+        for word_pair in data:
+            if len(word_pair) != 2:
+                raise RuntimeError(
+                    f'The input must have two words, but got {len(word_pair)}. Please check your inputs.')
+            if not isinstance(word_pair[0], str) or not isinstance(word_pair[1], str):
+                raise RuntimeError(
+                    f'The types of text pair must be (str, str), but got'
+                    f' ({type(word_pair[0]).__name__}, {type(word_pair[1]).__name__}). Please check your inputs.')
+            for word in word_pair:
+                if self.get_idx_from_word(word) == \
+                        self.get_idx_from_word(self.vocab.unk_token):
+                    raise RuntimeError(
+                        f'Word "{word}" is not in vocab. Please check your inputs.')
+            results.append(str(self.cosine_sim(*word_pair)))
+        return results
--- a/modules/text/embedding/w2v_baidu_encyclopedia_target_bigram-char_dim300/README.md
+++ b/modules/text/embedding/w2v_baidu_encyclopedia_target_bigram-char_dim300/README.md
+## 概述
+PaddleHub提供多个开源的预训练Embedding模型。这些Embedding模型可根据不同语料、不同训练方式和不同的维度进行区分，关于模型的具体信息可参考PaddleNLP的文档：[Embedding模型汇总](https://github.com/PaddlePaddle/models/blob/release/2.0-beta/PaddleNLP/docs/embeddings.md)
+## API
+```python
+def __init__(
+    *args,
+    **kwargs
+)
+```
+创建一个Embedding Module对象，默认无需参数。
+**参数**
+* `*args`： 用户额外指定的列表类型的参数。
+* `**kwargs`：用户额外指定的关键字字典类型的参数。
+关于额外参数的详情可参考[paddlenlp.embeddings](https://github.com/PaddlePaddle/models/tree/release/2.0-beta/PaddleNLP/paddlenlp/embeddings)
+```python
+def search(
+    words: Union[List[str], str, int],
+)
+```
+获取一个或多个词的embedding。输入可以是`str`、`List[str]`和`int`类型，分别代表获取一个词，多个词和指定词编号的embedding，词的编号和模型的词典相关，词典可通过模型实例的`vocab`属性获取。
+**参数**
+* `words`： 需要获取的词向量的词、词列表或者词编号。
+```python
+def cosine_sim(
+    word_a: str,
+    word_b: str,
+)
+```
+计算两个词embedding的余弦相似度。需要注意的是`word_a`和`word_b`都需要是词典里的单词，否则将会被认为是OOV(Out-Of-Vocabulary)，同时被替换为`unknown_token`。
+**参数**
+* `word_a`： 需要计算余弦相似度的单词a。
+* `word_b`： 需要计算余弦相似度的单词b。
+```python
+def dot(
+    word_a: str,
+    word_b: str,
+)
+```
+计算两个词embedding的内积。对于输入单词同样需要注意OOV问题。
+**参数**
+* `word_a`： 需要计算内积的单词a。
+* `word_b`： 需要计算内积的单词b。
+更多api详情和用法可参考[paddlenlp.embeddings](https://github.com/PaddlePaddle/models/tree/release/2.0-beta/PaddleNLP/paddlenlp/embeddings)
+## 代码示例
+```python
+import paddlehub as hub
+embedding = hub.Module(name='w2v_baidu_encyclopedia_target_bigram-char_dim300')
+# 获取单词的embedding
+embedding.search("中国")
+# 计算两个词向量的余弦相似度
+embedding.cosine_sim("中国", "美国")
+# 计算两个词向量的内积
+embedding.dot("中国", "美国")
+```
+## 部署服务
+通过PaddleHub Serving，可以部署一个在线获取两个词向量的余弦相似度的服务。
+### Step1: 启动PaddleHub Serving
+运行启动命令：
+```shell
+$ hub serving start -m w2v_baidu_encyclopedia_target_bigram-char_dim300
+```
+这样就完成了一个获取词向量的余弦相似度服务化API的部署，默认端口号为8866。
+**NOTE:** 如使用GPU预测，则需要在启动服务之前，请设置CUDA_VISIBLE_DEVICES环境变量，否则不用设置。
+### Step2: 发送预测请求
+配置好服务端，以下数行代码即可实现发送预测请求，获取预测结果
+```python
+import requests
+import json
+# 指定用于计算余弦相似度的单词对[[word_a, word_b], [word_a, word_b], ... ]]
+word_pairs = [["中国", "美国"], ["今天", "明天"]]
+# 以key的方式指定word_pairs传入预测方法的时的参数，此例中为"data"，对于每一对单词，调用cosine_sim进行余弦相似度的计算
+data = {"data": word_pairs}
+# 发送post请求，content-type类型应指定json方式，url中的ip地址需改为对应机器的ip
+url = "http://10.12.121.132:8866/predict/w2v_baidu_encyclopedia_target_bigram-char_dim300"
+# 指定post请求的headers为application/json方式
+headers = {"Content-Type": "application/json"}
+r = requests.post(url=url, headers=headers, data=json.dumps(data))
+print(r.json())
+```
+## 查看代码
+https://github.com/PaddlePaddle/models/tree/release/2.0-beta/PaddleNLP/paddlenlp/embeddings
+## 依赖
+paddlepaddle >= 2.0.0
+paddlehub >= 2.0.0
+## 更新历史
+* 1.0.0
+  初始发布
--- a/modules/text/embedding/w2v_baidu_encyclopedia_target_bigram-char_dim300/__init__.py
+++ b/modules/text/embedding/w2v_baidu_encyclopedia_target_bigram-char_dim300/__init__.py
--- a/modules/text/embedding/w2v_baidu_encyclopedia_target_bigram-char_dim300/module.py
+++ b/modules/text/embedding/w2v_baidu_encyclopedia_target_bigram-char_dim300/module.py
+# Copyright (c) 2020 PaddlePaddle Authors. All Rights Reserved.
+#
+# Licensed under the Apache License, Version 2.0 (the "License");
+# you may not use this file except in compliance with the License.
+# You may obtain a copy of the License at
+#
+#     http://www.apache.org/licenses/LICENSE-2.0
+#
+# Unless required by applicable law or agreed to in writing, software
+# distributed under the License is distributed on an "AS IS" BASIS,
+# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+# See the License for the specific language governing permissions and
+# limitations under the License.
+from typing import List
+from paddlenlp.embeddings import TokenEmbedding
+from paddlehub.module.module import moduleinfo, serving
+@moduleinfo(
+    name="w2v_baidu_encyclopedia_target_bigram-char_dim300",
+    version="1.0.0",
+    summary="",
+    author="paddlepaddle",
+    author_email="",
+    type="nlp/semantic_model")
+class Embedding(TokenEmbedding):
+    """
+    Embedding model
+    """
+    def __init__(self, *args, **kwargs):
+        super(Embedding, self).__init__(embedding_name="w2v.baidu_encyclopedia.target.bigram-char.dim300", *args, **kwargs)
+    @serving
+    def calc_similarity(self, data: List[List[str]]):
+        """
+        Calculate similarities of giving word pairs.
+        """
+        results = []
+        for word_pair in data:
+            if len(word_pair) != 2:
+                raise RuntimeError(
+                    f'The input must have two words, but got {len(word_pair)}. Please check your inputs.')
+            if not isinstance(word_pair[0], str) or not isinstance(word_pair[1], str):
+                raise RuntimeError(
+                    f'The types of text pair must be (str, str), but got'
+                    f' ({type(word_pair[0]).__name__}, {type(word_pair[1]).__name__}). Please check your inputs.')
+            for word in word_pair:
+                if self.get_idx_from_word(word) == \
+                        self.get_idx_from_word(self.vocab.unk_token):
+                    raise RuntimeError(
+                        f'Word "{word}" is not in vocab. Please check your inputs.')
+            results.append(str(self.cosine_sim(*word_pair)))
+        return results
--- a/modules/text/embedding/w2v_baidu_encyclopedia_target_word-character_char1-1_dim300/README.md
+++ b/modules/text/embedding/w2v_baidu_encyclopedia_target_word-character_char1-1_dim300/README.md
+## 概述
+PaddleHub提供多个开源的预训练Embedding模型。这些Embedding模型可根据不同语料、不同训练方式和不同的维度进行区分，关于模型的具体信息可参考PaddleNLP的文档：[Embedding模型汇总](https://github.com/PaddlePaddle/models/blob/release/2.0-beta/PaddleNLP/docs/embeddings.md)
+## API
+```python
+def __init__(
+    *args,
+    **kwargs
+)
+```
+创建一个Embedding Module对象，默认无需参数。
+**参数**
+* `*args`： 用户额外指定的列表类型的参数。
+* `**kwargs`：用户额外指定的关键字字典类型的参数。
+关于额外参数的详情可参考[paddlenlp.embeddings](https://github.com/PaddlePaddle/models/tree/release/2.0-beta/PaddleNLP/paddlenlp/embeddings)
+```python
+def search(
+    words: Union[List[str], str, int],
+)
+```
+获取一个或多个词的embedding。输入可以是`str`、`List[str]`和`int`类型，分别代表获取一个词，多个词和指定词编号的embedding，词的编号和模型的词典相关，词典可通过模型实例的`vocab`属性获取。
+**参数**
+* `words`： 需要获取的词向量的词、词列表或者词编号。
+```python
+def cosine_sim(
+    word_a: str,
+    word_b: str,
+)
+```
+计算两个词embedding的余弦相似度。需要注意的是`word_a`和`word_b`都需要是词典里的单词，否则将会被认为是OOV(Out-Of-Vocabulary)，同时被替换为`unknown_token`。
+**参数**
+* `word_a`： 需要计算余弦相似度的单词a。
+* `word_b`： 需要计算余弦相似度的单词b。
+```python
+def dot(
+    word_a: str,
+    word_b: str,
+)
+```
+计算两个词embedding的内积。对于输入单词同样需要注意OOV问题。
+**参数**
+* `word_a`： 需要计算内积的单词a。
+* `word_b`： 需要计算内积的单词b。
+更多api详情和用法可参考[paddlenlp.embeddings](https://github.com/PaddlePaddle/models/tree/release/2.0-beta/PaddleNLP/paddlenlp/embeddings)
+## 代码示例
+```python
+import paddlehub as hub
+embedding = hub.Module(name='w2v_baidu_encyclopedia_target_word-character_char1-1_dim300')
+# 获取单词的embedding
+embedding.search("中国")
+# 计算两个词向量的余弦相似度
+embedding.cosine_sim("中国", "美国")
+# 计算两个词向量的内积
+embedding.dot("中国", "美国")
+```
+## 部署服务
+通过PaddleHub Serving，可以部署一个在线获取两个词向量的余弦相似度的服务。
+### Step1: 启动PaddleHub Serving
+运行启动命令：
+```shell
+$ hub serving start -m w2v_baidu_encyclopedia_target_word-character_char1-1_dim300
+```
+这样就完成了一个获取词向量的余弦相似度服务化API的部署，默认端口号为8866。
+**NOTE:** 如使用GPU预测，则需要在启动服务之前，请设置CUDA_VISIBLE_DEVICES环境变量，否则不用设置。
+### Step2: 发送预测请求
+配置好服务端，以下数行代码即可实现发送预测请求，获取预测结果
+```python
+import requests
+import json
+# 指定用于计算余弦相似度的单词对[[word_a, word_b], [word_a, word_b], ... ]]
+word_pairs = [["中国", "美国"], ["今天", "明天"]]
+# 以key的方式指定word_pairs传入预测方法的时的参数，此例中为"data"，对于每一对单词，调用cosine_sim进行余弦相似度的计算
+data = {"data": word_pairs}
+# 发送post请求，content-type类型应指定json方式，url中的ip地址需改为对应机器的ip
+url = "http://10.12.121.132:8866/predict/w2v_baidu_encyclopedia_target_word-character_char1-1_dim300"
+# 指定post请求的headers为application/json方式
+headers = {"Content-Type": "application/json"}
+r = requests.post(url=url, headers=headers, data=json.dumps(data))
+print(r.json())
+```
+## 查看代码
+https://github.com/PaddlePaddle/models/tree/release/2.0-beta/PaddleNLP/paddlenlp/embeddings
+## 依赖
+paddlepaddle >= 2.0.0
+paddlehub >= 2.0.0
+## 更新历史
+* 1.0.0
+  初始发布
--- a/modules/text/embedding/w2v_baidu_encyclopedia_target_word-character_char1-1_dim300/__init__.py
+++ b/modules/text/embedding/w2v_baidu_encyclopedia_target_word-character_char1-1_dim300/__init__.py
--- a/modules/text/embedding/w2v_baidu_encyclopedia_target_word-character_char1-1_dim300/module.py
+++ b/modules/text/embedding/w2v_baidu_encyclopedia_target_word-character_char1-1_dim300/module.py
+# Copyright (c) 2020 PaddlePaddle Authors. All Rights Reserved.
+#
+# Licensed under the Apache License, Version 2.0 (the "License");
+# you may not use this file except in compliance with the License.
+# You may obtain a copy of the License at
+#
+#     http://www.apache.org/licenses/LICENSE-2.0
+#
+# Unless required by applicable law or agreed to in writing, software
+# distributed under the License is distributed on an "AS IS" BASIS,
+# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+# See the License for the specific language governing permissions and
+# limitations under the License.
+from typing import List
+from paddlenlp.embeddings import TokenEmbedding
+from paddlehub.module.module import moduleinfo, serving
+@moduleinfo(
+    name="w2v_baidu_encyclopedia_target_word-character_char1-1_dim300",
+    version="1.0.0",
+    summary="",
+    author="paddlepaddle",
+    author_email="",
+    type="nlp/semantic_model")
+class Embedding(TokenEmbedding):
+    """
+    Embedding model
+    """
+    def __init__(self, *args, **kwargs):
+        super(Embedding, self).__init__(embedding_name="w2v.baidu_encyclopedia.target.word-character.char1-1.dim300", *args, **kwargs)
+    @serving
+    def calc_similarity(self, data: List[List[str]]):
+        """
+        Calculate similarities of giving word pairs.
+        """
+        results = []
+        for word_pair in data:
+            if len(word_pair) != 2:
+                raise RuntimeError(
+                    f'The input must have two words, but got {len(word_pair)}. Please check your inputs.')
+            if not isinstance(word_pair[0], str) or not isinstance(word_pair[1], str):
+                raise RuntimeError(
+                    f'The types of text pair must be (str, str), but got'
+                    f' ({type(word_pair[0]).__name__}, {type(word_pair[1]).__name__}). Please check your inputs.')
+            for word in word_pair:
+                if self.get_idx_from_word(word) == \
+                        self.get_idx_from_word(self.vocab.unk_token):
+                    raise RuntimeError(
+                        f'Word "{word}" is not in vocab. Please check your inputs.')
+            results.append(str(self.cosine_sim(*word_pair)))
+        return results
--- a/modules/text/embedding/w2v_baidu_encyclopedia_target_word-character_char1-2_dim300/README.md
+++ b/modules/text/embedding/w2v_baidu_encyclopedia_target_word-character_char1-2_dim300/README.md
+## 概述
+PaddleHub提供多个开源的预训练Embedding模型。这些Embedding模型可根据不同语料、不同训练方式和不同的维度进行区分，关于模型的具体信息可参考PaddleNLP的文档：[Embedding模型汇总](https://github.com/PaddlePaddle/models/blob/release/2.0-beta/PaddleNLP/docs/embeddings.md)
+## API
+```python
+def __init__(
+    *args,
+    **kwargs
+)
+```
+创建一个Embedding Module对象，默认无需参数。
+**参数**
+* `*args`： 用户额外指定的列表类型的参数。
+* `**kwargs`：用户额外指定的关键字字典类型的参数。
+关于额外参数的详情可参考[paddlenlp.embeddings](https://github.com/PaddlePaddle/models/tree/release/2.0-beta/PaddleNLP/paddlenlp/embeddings)
+```python
+def search(
+    words: Union[List[str], str, int],
+)
+```
+获取一个或多个词的embedding。输入可以是`str`、`List[str]`和`int`类型，分别代表获取一个词，多个词和指定词编号的embedding，词的编号和模型的词典相关，词典可通过模型实例的`vocab`属性获取。
+**参数**
+* `words`： 需要获取的词向量的词、词列表或者词编号。
+```python
+def cosine_sim(
+    word_a: str,
+    word_b: str,
+)
+```
+计算两个词embedding的余弦相似度。需要注意的是`word_a`和`word_b`都需要是词典里的单词，否则将会被认为是OOV(Out-Of-Vocabulary)，同时被替换为`unknown_token`。
+**参数**
+* `word_a`： 需要计算余弦相似度的单词a。
+* `word_b`： 需要计算余弦相似度的单词b。
+```python
+def dot(
+    word_a: str,
+    word_b: str,
+)
+```
+计算两个词embedding的内积。对于输入单词同样需要注意OOV问题。
+**参数**
+* `word_a`： 需要计算内积的单词a。
+* `word_b`： 需要计算内积的单词b。
+更多api详情和用法可参考[paddlenlp.embeddings](https://github.com/PaddlePaddle/models/tree/release/2.0-beta/PaddleNLP/paddlenlp/embeddings)
+## 代码示例
+```python
+import paddlehub as hub
+embedding = hub.Module(name='w2v_baidu_encyclopedia_target_word-character_char1-2_dim300')
+# 获取单词的embedding
+embedding.search("中国")
+# 计算两个词向量的余弦相似度
+embedding.cosine_sim("中国", "美国")
+# 计算两个词向量的内积
+embedding.dot("中国", "美国")
+```
+## 部署服务
+通过PaddleHub Serving，可以部署一个在线获取两个词向量的余弦相似度的服务。
+### Step1: 启动PaddleHub Serving
+运行启动命令：
+```shell
+$ hub serving start -m w2v_baidu_encyclopedia_target_word-character_char1-2_dim300
+```
+这样就完成了一个获取词向量的余弦相似度服务化API的部署，默认端口号为8866。
+**NOTE:** 如使用GPU预测，则需要在启动服务之前，请设置CUDA_VISIBLE_DEVICES环境变量，否则不用设置。
+### Step2: 发送预测请求
+配置好服务端，以下数行代码即可实现发送预测请求，获取预测结果
+```python
+import requests
+import json
+# 指定用于计算余弦相似度的单词对[[word_a, word_b], [word_a, word_b], ... ]]
+word_pairs = [["中国", "美国"], ["今天", "明天"]]
+# 以key的方式指定word_pairs传入预测方法的时的参数，此例中为"data"，对于每一对单词，调用cosine_sim进行余弦相似度的计算
+data = {"data": word_pairs}
+# 发送post请求，content-type类型应指定json方式，url中的ip地址需改为对应机器的ip
+url = "http://10.12.121.132:8866/predict/w2v_baidu_encyclopedia_target_word-character_char1-2_dim300"
+# 指定post请求的headers为application/json方式
+headers = {"Content-Type": "application/json"}
+r = requests.post(url=url, headers=headers, data=json.dumps(data))
+print(r.json())
+```
+## 查看代码
+https://github.com/PaddlePaddle/models/tree/release/2.0-beta/PaddleNLP/paddlenlp/embeddings
+## 依赖
+paddlepaddle >= 2.0.0
+paddlehub >= 2.0.0
+## 更新历史
+* 1.0.0
+  初始发布
--- a/modules/text/embedding/w2v_baidu_encyclopedia_target_word-character_char1-2_dim300/__init__.py
+++ b/modules/text/embedding/w2v_baidu_encyclopedia_target_word-character_char1-2_dim300/__init__.py
--- a/modules/text/embedding/w2v_baidu_encyclopedia_target_word-character_char1-2_dim300/module.py
+++ b/modules/text/embedding/w2v_baidu_encyclopedia_target_word-character_char1-2_dim300/module.py
+# Copyright (c) 2020 PaddlePaddle Authors. All Rights Reserved.
+#
+# Licensed under the Apache License, Version 2.0 (the "License");
+# you may not use this file except in compliance with the License.
+# You may obtain a copy of the License at
+#
+#     http://www.apache.org/licenses/LICENSE-2.0
+#
+# Unless required by applicable law or agreed to in writing, software
+# distributed under the License is distributed on an "AS IS" BASIS,
+# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+# See the License for the specific language governing permissions and
+# limitations under the License.
+from typing import List
+from paddlenlp.embeddings import TokenEmbedding
+from paddlehub.module.module import moduleinfo, serving
+@moduleinfo(
+    name="w2v_baidu_encyclopedia_target_word-character_char1-2_dim300",
+    version="1.0.0",
+    summary="",
+    author="paddlepaddle",
+    author_email="",
+    type="nlp/semantic_model")
+class Embedding(TokenEmbedding):
+    """
+    Embedding model
+    """
+    def __init__(self, *args, **kwargs):
+        super(Embedding, self).__init__(embedding_name="w2v.baidu_encyclopedia.target.word-character.char1-2.dim300", *args, **kwargs)
+    @serving
+    def calc_similarity(self, data: List[List[str]]):
+        """
+        Calculate similarities of giving word pairs.
+        """
+        results = []
+        for word_pair in data:
+            if len(word_pair) != 2:
+                raise RuntimeError(
+                    f'The input must have two words, but got {len(word_pair)}. Please check your inputs.')
+            if not isinstance(word_pair[0], str) or not isinstance(word_pair[1], str):
+                raise RuntimeError(
+                    f'The types of text pair must be (str, str), but got'
+                    f' ({type(word_pair[0]).__name__}, {type(word_pair[1]).__name__}). Please check your inputs.')
+            for word in word_pair:
+                if self.get_idx_from_word(word) == \
+                        self.get_idx_from_word(self.vocab.unk_token):
+                    raise RuntimeError(
+                        f'Word "{word}" is not in vocab. Please check your inputs.')
+            results.append(str(self.cosine_sim(*word_pair)))
+        return results
--- a/modules/text/embedding/w2v_baidu_encyclopedia_target_word-character_char1-4_dim300/README.md
+++ b/modules/text/embedding/w2v_baidu_encyclopedia_target_word-character_char1-4_dim300/README.md
+## 概述
+PaddleHub提供多个开源的预训练Embedding模型。这些Embedding模型可根据不同语料、不同训练方式和不同的维度进行区分，关于模型的具体信息可参考PaddleNLP的文档：[Embedding模型汇总](https://github.com/PaddlePaddle/models/blob/release/2.0-beta/PaddleNLP/docs/embeddings.md)
+## API
+```python
+def __init__(
+    *args,
+    **kwargs
+)
+```
+创建一个Embedding Module对象，默认无需参数。
+**参数**
+* `*args`： 用户额外指定的列表类型的参数。
+* `**kwargs`：用户额外指定的关键字字典类型的参数。
+关于额外参数的详情可参考[paddlenlp.embeddings](https://github.com/PaddlePaddle/models/tree/release/2.0-beta/PaddleNLP/paddlenlp/embeddings)
+```python
+def search(
+    words: Union[List[str], str, int],
+)
+```
+获取一个或多个词的embedding。输入可以是`str`、`List[str]`和`int`类型，分别代表获取一个词，多个词和指定词编号的embedding，词的编号和模型的词典相关，词典可通过模型实例的`vocab`属性获取。
+**参数**
+* `words`： 需要获取的词向量的词、词列表或者词编号。
+```python
+def cosine_sim(
+    word_a: str,
+    word_b: str,
+)
+```
+计算两个词embedding的余弦相似度。需要注意的是`word_a`和`word_b`都需要是词典里的单词，否则将会被认为是OOV(Out-Of-Vocabulary)，同时被替换为`unknown_token`。
+**参数**
+* `word_a`： 需要计算余弦相似度的单词a。
+* `word_b`： 需要计算余弦相似度的单词b。
+```python
+def dot(
+    word_a: str,
+    word_b: str,
+)
+```
+计算两个词embedding的内积。对于输入单词同样需要注意OOV问题。
+**参数**
+* `word_a`： 需要计算内积的单词a。
+* `word_b`： 需要计算内积的单词b。
+更多api详情和用法可参考[paddlenlp.embeddings](https://github.com/PaddlePaddle/models/tree/release/2.0-beta/PaddleNLP/paddlenlp/embeddings)
+## 代码示例
+```python
+import paddlehub as hub
+embedding = hub.Module(name='w2v_baidu_encyclopedia_target_word-character_char1-4_dim300')
+# 获取单词的embedding
+embedding.search("中国")
+# 计算两个词向量的余弦相似度
+embedding.cosine_sim("中国", "美国")
+# 计算两个词向量的内积
+embedding.dot("中国", "美国")
+```
+## 部署服务
+通过PaddleHub Serving，可以部署一个在线获取两个词向量的余弦相似度的服务。
+### Step1: 启动PaddleHub Serving
+运行启动命令：
+```shell
+$ hub serving start -m w2v_baidu_encyclopedia_target_word-character_char1-4_dim300
+```
+这样就完成了一个获取词向量的余弦相似度服务化API的部署，默认端口号为8866。
+**NOTE:** 如使用GPU预测，则需要在启动服务之前，请设置CUDA_VISIBLE_DEVICES环境变量，否则不用设置。
+### Step2: 发送预测请求
+配置好服务端，以下数行代码即可实现发送预测请求，获取预测结果
+```python
+import requests
+import json
+# 指定用于计算余弦相似度的单词对[[word_a, word_b], [word_a, word_b], ... ]]
+word_pairs = [["中国", "美国"], ["今天", "明天"]]
+# 以key的方式指定word_pairs传入预测方法的时的参数，此例中为"data"，对于每一对单词，调用cosine_sim进行余弦相似度的计算
+data = {"data": word_pairs}
+# 发送post请求，content-type类型应指定json方式，url中的ip地址需改为对应机器的ip
+url = "http://10.12.121.132:8866/predict/w2v_baidu_encyclopedia_target_word-character_char1-4_dim300"
+# 指定post请求的headers为application/json方式
+headers = {"Content-Type": "application/json"}
+r = requests.post(url=url, headers=headers, data=json.dumps(data))
+print(r.json())
+```
+## 查看代码
+https://github.com/PaddlePaddle/models/tree/release/2.0-beta/PaddleNLP/paddlenlp/embeddings
+## 依赖
+paddlepaddle >= 2.0.0
+paddlehub >= 2.0.0
+## 更新历史
+* 1.0.0
+  初始发布
--- a/modules/text/embedding/w2v_baidu_encyclopedia_target_word-character_char1-4_dim300/__init__.py
+++ b/modules/text/embedding/w2v_baidu_encyclopedia_target_word-character_char1-4_dim300/__init__.py
--- a/modules/text/embedding/w2v_baidu_encyclopedia_target_word-character_char1-4_dim300/module.py
+++ b/modules/text/embedding/w2v_baidu_encyclopedia_target_word-character_char1-4_dim300/module.py
+# Copyright (c) 2020 PaddlePaddle Authors. All Rights Reserved.
+#
+# Licensed under the Apache License, Version 2.0 (the "License");
+# you may not use this file except in compliance with the License.
+# You may obtain a copy of the License at
+#
+#     http://www.apache.org/licenses/LICENSE-2.0
+#
+# Unless required by applicable law or agreed to in writing, software
+# distributed under the License is distributed on an "AS IS" BASIS,
+# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+# See the License for the specific language governing permissions and
+# limitations under the License.
+from typing import List
+from paddlenlp.embeddings import TokenEmbedding
+from paddlehub.module.module import moduleinfo, serving
+@moduleinfo(
+    name="w2v_baidu_encyclopedia_target_word-character_char1-4_dim300",
+    version="1.0.0",
+    summary="",
+    author="paddlepaddle",
+    author_email="",
+    type="nlp/semantic_model")
+class Embedding(TokenEmbedding):
+    """
+    Embedding model
+    """
+    def __init__(self, *args, **kwargs):
+        super(Embedding, self).__init__(embedding_name="w2v.baidu_encyclopedia.target.word-character.char1-4.dim300", *args, **kwargs)
+    @serving
+    def calc_similarity(self, data: List[List[str]]):
+        """
+        Calculate similarities of giving word pairs.
+        """
+        results = []
+        for word_pair in data:
+            if len(word_pair) != 2:
+                raise RuntimeError(
+                    f'The input must have two words, but got {len(word_pair)}. Please check your inputs.')
+            if not isinstance(word_pair[0], str) or not isinstance(word_pair[1], str):
+                raise RuntimeError(
+                    f'The types of text pair must be (str, str), but got'
+                    f' ({type(word_pair[0]).__name__}, {type(word_pair[1]).__name__}). Please check your inputs.')
+            for word in word_pair:
+                if self.get_idx_from_word(word) == \
+                        self.get_idx_from_word(self.vocab.unk_token):
+                    raise RuntimeError(
+                        f'Word "{word}" is not in vocab. Please check your inputs.')
+            results.append(str(self.cosine_sim(*word_pair)))
+        return results
--- a/modules/text/embedding/w2v_baidu_encyclopedia_target_word-ngram_1-2_dim300/README.md
+++ b/modules/text/embedding/w2v_baidu_encyclopedia_target_word-ngram_1-2_dim300/README.md
+## 概述
+PaddleHub提供多个开源的预训练Embedding模型。这些Embedding模型可根据不同语料、不同训练方式和不同的维度进行区分，关于模型的具体信息可参考PaddleNLP的文档：[Embedding模型汇总](https://github.com/PaddlePaddle/models/blob/release/2.0-beta/PaddleNLP/docs/embeddings.md)
+## API
+```python
+def __init__(
+    *args,
+    **kwargs
+)
+```
+创建一个Embedding Module对象，默认无需参数。
+**参数**
+* `*args`： 用户额外指定的列表类型的参数。
+* `**kwargs`：用户额外指定的关键字字典类型的参数。
+关于额外参数的详情可参考[paddlenlp.embeddings](https://github.com/PaddlePaddle/models/tree/release/2.0-beta/PaddleNLP/paddlenlp/embeddings)
+```python
+def search(
+    words: Union[List[str], str, int],
+)
+```
+获取一个或多个词的embedding。输入可以是`str`、`List[str]`和`int`类型，分别代表获取一个词，多个词和指定词编号的embedding，词的编号和模型的词典相关，词典可通过模型实例的`vocab`属性获取。
+**参数**
+* `words`： 需要获取的词向量的词、词列表或者词编号。
+```python
+def cosine_sim(
+    word_a: str,
+    word_b: str,
+)
+```
+计算两个词embedding的余弦相似度。需要注意的是`word_a`和`word_b`都需要是词典里的单词，否则将会被认为是OOV(Out-Of-Vocabulary)，同时被替换为`unknown_token`。
+**参数**
+* `word_a`： 需要计算余弦相似度的单词a。
+* `word_b`： 需要计算余弦相似度的单词b。
+```python
+def dot(
+    word_a: str,
+    word_b: str,
+)
+```
+计算两个词embedding的内积。对于输入单词同样需要注意OOV问题。
+**参数**
+* `word_a`： 需要计算内积的单词a。
+* `word_b`： 需要计算内积的单词b。
+更多api详情和用法可参考[paddlenlp.embeddings](https://github.com/PaddlePaddle/models/tree/release/2.0-beta/PaddleNLP/paddlenlp/embeddings)
+## 代码示例
+```python
+import paddlehub as hub
+embedding = hub.Module(name='w2v_baidu_encyclopedia_target_word-ngram_1-2_dim300')
+# 获取单词的embedding
+embedding.search("中国")
+# 计算两个词向量的余弦相似度
+embedding.cosine_sim("中国", "美国")
+# 计算两个词向量的内积
+embedding.dot("中国", "美国")
+```
+## 部署服务
+通过PaddleHub Serving，可以部署一个在线获取两个词向量的余弦相似度的服务。
+### Step1: 启动PaddleHub Serving
+运行启动命令：
+```shell
+$ hub serving start -m w2v_baidu_encyclopedia_target_word-ngram_1-2_dim300
+```
+这样就完成了一个获取词向量的余弦相似度服务化API的部署，默认端口号为8866。
+**NOTE:** 如使用GPU预测，则需要在启动服务之前，请设置CUDA_VISIBLE_DEVICES环境变量，否则不用设置。
+### Step2: 发送预测请求
+配置好服务端，以下数行代码即可实现发送预测请求，获取预测结果
+```python
+import requests
+import json
+# 指定用于计算余弦相似度的单词对[[word_a, word_b], [word_a, word_b], ... ]]
+word_pairs = [["中国", "美国"], ["今天", "明天"]]
+# 以key的方式指定word_pairs传入预测方法的时的参数，此例中为"data"，对于每一对单词，调用cosine_sim进行余弦相似度的计算
+data = {"data": word_pairs}
+# 发送post请求，content-type类型应指定json方式，url中的ip地址需改为对应机器的ip
+url = "http://10.12.121.132:8866/predict/w2v_baidu_encyclopedia_target_word-ngram_1-2_dim300"
+# 指定post请求的headers为application/json方式
+headers = {"Content-Type": "application/json"}
+r = requests.post(url=url, headers=headers, data=json.dumps(data))
+print(r.json())
+```
+## 查看代码
+https://github.com/PaddlePaddle/models/tree/release/2.0-beta/PaddleNLP/paddlenlp/embeddings
+## 依赖
+paddlepaddle >= 2.0.0
+paddlehub >= 2.0.0
+## 更新历史
+* 1.0.0
+  初始发布
--- a/modules/text/embedding/w2v_baidu_encyclopedia_target_word-ngram_1-2_dim300/__init__.py
+++ b/modules/text/embedding/w2v_baidu_encyclopedia_target_word-ngram_1-2_dim300/__init__.py
--- a/modules/text/embedding/w2v_baidu_encyclopedia_target_word-ngram_1-2_dim300/module.py
+++ b/modules/text/embedding/w2v_baidu_encyclopedia_target_word-ngram_1-2_dim300/module.py
+# Copyright (c) 2020 PaddlePaddle Authors. All Rights Reserved.
+#
+# Licensed under the Apache License, Version 2.0 (the "License");
+# you may not use this file except in compliance with the License.
+# You may obtain a copy of the License at
+#
+#     http://www.apache.org/licenses/LICENSE-2.0
+#
+# Unless required by applicable law or agreed to in writing, software
+# distributed under the License is distributed on an "AS IS" BASIS,
+# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+# See the License for the specific language governing permissions and
+# limitations under the License.
+from typing import List
+from paddlenlp.embeddings import TokenEmbedding
+from paddlehub.module.module import moduleinfo, serving
+@moduleinfo(
+    name="w2v_baidu_encyclopedia_target_word-ngram_1-2_dim300",
+    version="1.0.0",
+    summary="",
+    author="paddlepaddle",
+    author_email="",
+    type="nlp/semantic_model")
+class Embedding(TokenEmbedding):
+    """
+    Embedding model
+    """
+    def __init__(self, *args, **kwargs):
+        super(Embedding, self).__init__(embedding_name="w2v.baidu_encyclopedia.target.word-ngram.1-2.dim300", *args, **kwargs)
+    @serving
+    def calc_similarity(self, data: List[List[str]]):
+        """
+        Calculate similarities of giving word pairs.
+        """
+        results = []
+        for word_pair in data:
+            if len(word_pair) != 2:
+                raise RuntimeError(
+                    f'The input must have two words, but got {len(word_pair)}. Please check your inputs.')
+            if not isinstance(word_pair[0], str) or not isinstance(word_pair[1], str):
+                raise RuntimeError(
+                    f'The types of text pair must be (str, str), but got'
+                    f' ({type(word_pair[0]).__name__}, {type(word_pair[1]).__name__}). Please check your inputs.')
+            for word in word_pair:
+                if self.get_idx_from_word(word) == \
+                        self.get_idx_from_word(self.vocab.unk_token):
+                    raise RuntimeError(
+                        f'Word "{word}" is not in vocab. Please check your inputs.')
+            results.append(str(self.cosine_sim(*word_pair)))
+        return results
--- a/modules/text/embedding/w2v_baidu_encyclopedia_target_word-ngram_1-3_dim300/README.md
+++ b/modules/text/embedding/w2v_baidu_encyclopedia_target_word-ngram_1-3_dim300/README.md
+## 概述
+PaddleHub提供多个开源的预训练Embedding模型。这些Embedding模型可根据不同语料、不同训练方式和不同的维度进行区分，关于模型的具体信息可参考PaddleNLP的文档：[Embedding模型汇总](https://github.com/PaddlePaddle/models/blob/release/2.0-beta/PaddleNLP/docs/embeddings.md)
+## API
+```python
+def __init__(
+    *args,
+    **kwargs
+)
+```
+创建一个Embedding Module对象，默认无需参数。
+**参数**
+* `*args`： 用户额外指定的列表类型的参数。
+* `**kwargs`：用户额外指定的关键字字典类型的参数。
+关于额外参数的详情可参考[paddlenlp.embeddings](https://github.com/PaddlePaddle/models/tree/release/2.0-beta/PaddleNLP/paddlenlp/embeddings)
+```python
+def search(
+    words: Union[List[str], str, int],
+)
+```
+获取一个或多个词的embedding。输入可以是`str`、`List[str]`和`int`类型，分别代表获取一个词，多个词和指定词编号的embedding，词的编号和模型的词典相关，词典可通过模型实例的`vocab`属性获取。
+**参数**
+* `words`： 需要获取的词向量的词、词列表或者词编号。
+```python
+def cosine_sim(
+    word_a: str,
+    word_b: str,
+)
+```
+计算两个词embedding的余弦相似度。需要注意的是`word_a`和`word_b`都需要是词典里的单词，否则将会被认为是OOV(Out-Of-Vocabulary)，同时被替换为`unknown_token`。
+**参数**
+* `word_a`： 需要计算余弦相似度的单词a。
+* `word_b`： 需要计算余弦相似度的单词b。
+```python
+def dot(
+    word_a: str,
+    word_b: str,
+)
+```
+计算两个词embedding的内积。对于输入单词同样需要注意OOV问题。
+**参数**
+* `word_a`： 需要计算内积的单词a。
+* `word_b`： 需要计算内积的单词b。
+更多api详情和用法可参考[paddlenlp.embeddings](https://github.com/PaddlePaddle/models/tree/release/2.0-beta/PaddleNLP/paddlenlp/embeddings)
+## 代码示例
+```python
+import paddlehub as hub
+embedding = hub.Module(name='w2v_baidu_encyclopedia_target_word-ngram_1-3_dim300')
+# 获取单词的embedding
+embedding.search("中国")
+# 计算两个词向量的余弦相似度
+embedding.cosine_sim("中国", "美国")
+# 计算两个词向量的内积
+embedding.dot("中国", "美国")
+```
+## 部署服务
+通过PaddleHub Serving，可以部署一个在线获取两个词向量的余弦相似度的服务。
+### Step1: 启动PaddleHub Serving
+运行启动命令：
+```shell
+$ hub serving start -m w2v_baidu_encyclopedia_target_word-ngram_1-3_dim300
+```
+这样就完成了一个获取词向量的余弦相似度服务化API的部署，默认端口号为8866。
+**NOTE:** 如使用GPU预测，则需要在启动服务之前，请设置CUDA_VISIBLE_DEVICES环境变量，否则不用设置。
+### Step2: 发送预测请求
+配置好服务端，以下数行代码即可实现发送预测请求，获取预测结果
+```python
+import requests
+import json
+# 指定用于计算余弦相似度的单词对[[word_a, word_b], [word_a, word_b], ... ]]
+word_pairs = [["中国", "美国"], ["今天", "明天"]]
+# 以key的方式指定word_pairs传入预测方法的时的参数，此例中为"data"，对于每一对单词，调用cosine_sim进行余弦相似度的计算
+data = {"data": word_pairs}
+# 发送post请求，content-type类型应指定json方式，url中的ip地址需改为对应机器的ip
+url = "http://10.12.121.132:8866/predict/w2v_baidu_encyclopedia_target_word-ngram_1-3_dim300"
+# 指定post请求的headers为application/json方式
+headers = {"Content-Type": "application/json"}
+r = requests.post(url=url, headers=headers, data=json.dumps(data))
+print(r.json())
+```
+## 查看代码
+https://github.com/PaddlePaddle/models/tree/release/2.0-beta/PaddleNLP/paddlenlp/embeddings
+## 依赖
+paddlepaddle >= 2.0.0
+paddlehub >= 2.0.0
+## 更新历史
+* 1.0.0
+  初始发布
--- a/modules/text/embedding/w2v_baidu_encyclopedia_target_word-ngram_1-3_dim300/__init__.py
+++ b/modules/text/embedding/w2v_baidu_encyclopedia_target_word-ngram_1-3_dim300/__init__.py
--- a/modules/text/embedding/w2v_baidu_encyclopedia_target_word-ngram_1-3_dim300/module.py
+++ b/modules/text/embedding/w2v_baidu_encyclopedia_target_word-ngram_1-3_dim300/module.py
+# Copyright (c) 2020 PaddlePaddle Authors. All Rights Reserved.
+#
+# Licensed under the Apache License, Version 2.0 (the "License");
+# you may not use this file except in compliance with the License.
+# You may obtain a copy of the License at
+#
+#     http://www.apache.org/licenses/LICENSE-2.0
+#
+# Unless required by applicable law or agreed to in writing, software
+# distributed under the License is distributed on an "AS IS" BASIS,
+# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+# See the License for the specific language governing permissions and
+# limitations under the License.
+from typing import List
+from paddlenlp.embeddings import TokenEmbedding
+from paddlehub.module.module import moduleinfo, serving
+@moduleinfo(
+    name="w2v_baidu_encyclopedia_target_word-ngram_1-3_dim300",
+    version="1.0.0",
+    summary="",
+    author="paddlepaddle",
+    author_email="",
+    type="nlp/semantic_model")
+class Embedding(TokenEmbedding):
+    """
+    Embedding model
+    """
+    def __init__(self, *args, **kwargs):
+        super(Embedding, self).__init__(embedding_name="w2v.baidu_encyclopedia.target.word-ngram.1-3.dim300", *args, **kwargs)
+    @serving
+    def calc_similarity(self, data: List[List[str]]):
+        """
+        Calculate similarities of giving word pairs.
+        """
+        results = []
+        for word_pair in data:
+            if len(word_pair) != 2:
+                raise RuntimeError(
+                    f'The input must have two words, but got {len(word_pair)}. Please check your inputs.')
+            if not isinstance(word_pair[0], str) or not isinstance(word_pair[1], str):
+                raise RuntimeError(
+                    f'The types of text pair must be (str, str), but got'
+                    f' ({type(word_pair[0]).__name__}, {type(word_pair[1]).__name__}). Please check your inputs.')
+            for word in word_pair:
+                if self.get_idx_from_word(word) == \
+                        self.get_idx_from_word(self.vocab.unk_token):
+                    raise RuntimeError(
+                        f'Word "{word}" is not in vocab. Please check your inputs.')
+            results.append(str(self.cosine_sim(*word_pair)))
+        return results
--- a/modules/text/embedding/w2v_baidu_encyclopedia_target_word-ngram_2-2_dim300/README.md
+++ b/modules/text/embedding/w2v_baidu_encyclopedia_target_word-ngram_2-2_dim300/README.md
+## 概述
+PaddleHub提供多个开源的预训练Embedding模型。这些Embedding模型可根据不同语料、不同训练方式和不同的维度进行区分，关于模型的具体信息可参考PaddleNLP的文档：[Embedding模型汇总](https://github.com/PaddlePaddle/models/blob/release/2.0-beta/PaddleNLP/docs/embeddings.md)
+## API
+```python
+def __init__(
+    *args,
+    **kwargs
+)
+```
+创建一个Embedding Module对象，默认无需参数。
+**参数**
+* `*args`： 用户额外指定的列表类型的参数。
+* `**kwargs`：用户额外指定的关键字字典类型的参数。
+关于额外参数的详情可参考[paddlenlp.embeddings](https://github.com/PaddlePaddle/models/tree/release/2.0-beta/PaddleNLP/paddlenlp/embeddings)
+```python
+def search(
+    words: Union[List[str], str, int],
+)
+```
+获取一个或多个词的embedding。输入可以是`str`、`List[str]`和`int`类型，分别代表获取一个词，多个词和指定词编号的embedding，词的编号和模型的词典相关，词典可通过模型实例的`vocab`属性获取。
+**参数**
+* `words`： 需要获取的词向量的词、词列表或者词编号。
+```python
+def cosine_sim(
+    word_a: str,
+    word_b: str,
+)
+```
+计算两个词embedding的余弦相似度。需要注意的是`word_a`和`word_b`都需要是词典里的单词，否则将会被认为是OOV(Out-Of-Vocabulary)，同时被替换为`unknown_token`。
+**参数**
+* `word_a`： 需要计算余弦相似度的单词a。
+* `word_b`： 需要计算余弦相似度的单词b。
+```python
+def dot(
+    word_a: str,
+    word_b: str,
+)
+```
+计算两个词embedding的内积。对于输入单词同样需要注意OOV问题。
+**参数**
+* `word_a`： 需要计算内积的单词a。
+* `word_b`： 需要计算内积的单词b。
+更多api详情和用法可参考[paddlenlp.embeddings](https://github.com/PaddlePaddle/models/tree/release/2.0-beta/PaddleNLP/paddlenlp/embeddings)
+## 代码示例
+```python
+import paddlehub as hub
+embedding = hub.Module(name='w2v_baidu_encyclopedia_target_word-ngram_2-2_dim300')
+# 获取单词的embedding
+embedding.search("中国")
+# 计算两个词向量的余弦相似度
+embedding.cosine_sim("中国", "美国")
+# 计算两个词向量的内积
+embedding.dot("中国", "美国")
+```
+## 部署服务
+通过PaddleHub Serving，可以部署一个在线获取两个词向量的余弦相似度的服务。
+### Step1: 启动PaddleHub Serving
+运行启动命令：
+```shell
+$ hub serving start -m w2v_baidu_encyclopedia_target_word-ngram_2-2_dim300
+```
+这样就完成了一个获取词向量的余弦相似度服务化API的部署，默认端口号为8866。
+**NOTE:** 如使用GPU预测，则需要在启动服务之前，请设置CUDA_VISIBLE_DEVICES环境变量，否则不用设置。
+### Step2: 发送预测请求
+配置好服务端，以下数行代码即可实现发送预测请求，获取预测结果
+```python
+import requests
+import json
+# 指定用于计算余弦相似度的单词对[[word_a, word_b], [word_a, word_b], ... ]]
+word_pairs = [["中国", "美国"], ["今天", "明天"]]
+# 以key的方式指定word_pairs传入预测方法的时的参数，此例中为"data"，对于每一对单词，调用cosine_sim进行余弦相似度的计算
+data = {"data": word_pairs}
+# 发送post请求，content-type类型应指定json方式，url中的ip地址需改为对应机器的ip
+url = "http://10.12.121.132:8866/predict/w2v_baidu_encyclopedia_target_word-ngram_2-2_dim300"
+# 指定post请求的headers为application/json方式
+headers = {"Content-Type": "application/json"}
+r = requests.post(url=url, headers=headers, data=json.dumps(data))
+print(r.json())
+```
+## 查看代码
+https://github.com/PaddlePaddle/models/tree/release/2.0-beta/PaddleNLP/paddlenlp/embeddings
+## 依赖
+paddlepaddle >= 2.0.0
+paddlehub >= 2.0.0
+## 更新历史
+* 1.0.0
+  初始发布
--- a/modules/text/embedding/w2v_baidu_encyclopedia_target_word-ngram_2-2_dim300/__init__.py
+++ b/modules/text/embedding/w2v_baidu_encyclopedia_target_word-ngram_2-2_dim300/__init__.py
--- a/modules/text/embedding/w2v_baidu_encyclopedia_target_word-ngram_2-2_dim300/module.py
+++ b/modules/text/embedding/w2v_baidu_encyclopedia_target_word-ngram_2-2_dim300/module.py
+# Copyright (c) 2020 PaddlePaddle Authors. All Rights Reserved.
+#
+# Licensed under the Apache License, Version 2.0 (the "License");
+# you may not use this file except in compliance with the License.
+# You may obtain a copy of the License at
+#
+#     http://www.apache.org/licenses/LICENSE-2.0
+#
+# Unless required by applicable law or agreed to in writing, software
+# distributed under the License is distributed on an "AS IS" BASIS,
+# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+# See the License for the specific language governing permissions and
+# limitations under the License.
+from typing import List
+from paddlenlp.embeddings import TokenEmbedding
+from paddlehub.module.module import moduleinfo, serving
+@moduleinfo(
+    name="w2v_baidu_encyclopedia_target_word-ngram_2-2_dim300",
+    version="1.0.0",
+    summary="",
+    author="paddlepaddle",
+    author_email="",
+    type="nlp/semantic_model")
+class Embedding(TokenEmbedding):
+    """
+    Embedding model
+    """
+    def __init__(self, *args, **kwargs):
+        super(Embedding, self).__init__(embedding_name="w2v.baidu_encyclopedia.target.word-ngram.2-2.dim300", *args, **kwargs)
+    @serving
+    def calc_similarity(self, data: List[List[str]]):
+        """
+        Calculate similarities of giving word pairs.
+        """
+        results = []
+        for word_pair in data:
+            if len(word_pair) != 2:
+                raise RuntimeError(
+                    f'The input must have two words, but got {len(word_pair)}. Please check your inputs.')
+            if not isinstance(word_pair[0], str) or not isinstance(word_pair[1], str):
+                raise RuntimeError(
+                    f'The types of text pair must be (str, str), but got'
+                    f' ({type(word_pair[0]).__name__}, {type(word_pair[1]).__name__}). Please check your inputs.')
+            for word in word_pair:
+                if self.get_idx_from_word(word) == \
+                        self.get_idx_from_word(self.vocab.unk_token):
+                    raise RuntimeError(
+                        f'Word "{word}" is not in vocab. Please check your inputs.')
+            results.append(str(self.cosine_sim(*word_pair)))
+        return results
--- a/modules/text/embedding/w2v_baidu_encyclopedia_target_word-wordLR_dim300/README.md
+++ b/modules/text/embedding/w2v_baidu_encyclopedia_target_word-wordLR_dim300/README.md
+## 概述
+PaddleHub提供多个开源的预训练Embedding模型。这些Embedding模型可根据不同语料、不同训练方式和不同的维度进行区分，关于模型的具体信息可参考PaddleNLP的文档：[Embedding模型汇总](https://github.com/PaddlePaddle/models/blob/release/2.0-beta/PaddleNLP/docs/embeddings.md)
+## API
+```python
+def __init__(
+    *args,
+    **kwargs
+)
+```
+创建一个Embedding Module对象，默认无需参数。
+**参数**
+* `*args`： 用户额外指定的列表类型的参数。
+* `**kwargs`：用户额外指定的关键字字典类型的参数。
+关于额外参数的详情可参考[paddlenlp.embeddings](https://github.com/PaddlePaddle/models/tree/release/2.0-beta/PaddleNLP/paddlenlp/embeddings)
+```python
+def search(
+    words: Union[List[str], str, int],
+)
+```
+获取一个或多个词的embedding。输入可以是`str`、`List[str]`和`int`类型，分别代表获取一个词，多个词和指定词编号的embedding，词的编号和模型的词典相关，词典可通过模型实例的`vocab`属性获取。
+**参数**
+* `words`： 需要获取的词向量的词、词列表或者词编号。
+```python
+def cosine_sim(
+    word_a: str,
+    word_b: str,
+)
+```
+计算两个词embedding的余弦相似度。需要注意的是`word_a`和`word_b`都需要是词典里的单词，否则将会被认为是OOV(Out-Of-Vocabulary)，同时被替换为`unknown_token`。
+**参数**
+* `word_a`： 需要计算余弦相似度的单词a。
+* `word_b`： 需要计算余弦相似度的单词b。
+```python
+def dot(
+    word_a: str,
+    word_b: str,
+)
+```
+计算两个词embedding的内积。对于输入单词同样需要注意OOV问题。
+**参数**
+* `word_a`： 需要计算内积的单词a。
+* `word_b`： 需要计算内积的单词b。
+更多api详情和用法可参考[paddlenlp.embeddings](https://github.com/PaddlePaddle/models/tree/release/2.0-beta/PaddleNLP/paddlenlp/embeddings)
+## 代码示例
+```python
+import paddlehub as hub
+embedding = hub.Module(name='w2v_baidu_encyclopedia_target_word-wordLR_dim300')
+# 获取单词的embedding
+embedding.search("中国")
+# 计算两个词向量的余弦相似度
+embedding.cosine_sim("中国", "美国")
+# 计算两个词向量的内积
+embedding.dot("中国", "美国")
+```
+## 部署服务
+通过PaddleHub Serving，可以部署一个在线获取两个词向量的余弦相似度的服务。
+### Step1: 启动PaddleHub Serving
+运行启动命令：
+```shell
+$ hub serving start -m w2v_baidu_encyclopedia_target_word-wordLR_dim300
+```
+这样就完成了一个获取词向量的余弦相似度服务化API的部署，默认端口号为8866。
+**NOTE:** 如使用GPU预测，则需要在启动服务之前，请设置CUDA_VISIBLE_DEVICES环境变量，否则不用设置。
+### Step2: 发送预测请求
+配置好服务端，以下数行代码即可实现发送预测请求，获取预测结果
+```python
+import requests
+import json
+# 指定用于计算余弦相似度的单词对[[word_a, word_b], [word_a, word_b], ... ]]
+word_pairs = [["中国", "美国"], ["今天", "明天"]]
+# 以key的方式指定word_pairs传入预测方法的时的参数，此例中为"data"，对于每一对单词，调用cosine_sim进行余弦相似度的计算
+data = {"data": word_pairs}
+# 发送post请求，content-type类型应指定json方式，url中的ip地址需改为对应机器的ip
+url = "http://10.12.121.132:8866/predict/w2v_baidu_encyclopedia_target_word-wordLR_dim300"
+# 指定post请求的headers为application/json方式
+headers = {"Content-Type": "application/json"}
+r = requests.post(url=url, headers=headers, data=json.dumps(data))
+print(r.json())
+```
+## 查看代码
+https://github.com/PaddlePaddle/models/tree/release/2.0-beta/PaddleNLP/paddlenlp/embeddings
+## 依赖
+paddlepaddle >= 2.0.0
+paddlehub >= 2.0.0
+## 更新历史
+* 1.0.0
+  初始发布
--- a/modules/text/embedding/w2v_baidu_encyclopedia_target_word-wordLR_dim300/__init__.py
+++ b/modules/text/embedding/w2v_baidu_encyclopedia_target_word-wordLR_dim300/__init__.py
--- a/modules/text/embedding/w2v_baidu_encyclopedia_target_word-wordLR_dim300/module.py
+++ b/modules/text/embedding/w2v_baidu_encyclopedia_target_word-wordLR_dim300/module.py
+# Copyright (c) 2020 PaddlePaddle Authors. All Rights Reserved.
+#
+# Licensed under the Apache License, Version 2.0 (the "License");
+# you may not use this file except in compliance with the License.
+# You may obtain a copy of the License at
+#
+#     http://www.apache.org/licenses/LICENSE-2.0
+#
+# Unless required by applicable law or agreed to in writing, software
+# distributed under the License is distributed on an "AS IS" BASIS,
+# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+# See the License for the specific language governing permissions and
+# limitations under the License.
+from typing import List
+from paddlenlp.embeddings import TokenEmbedding
+from paddlehub.module.module import moduleinfo, serving
+@moduleinfo(
+    name="w2v_baidu_encyclopedia_target_word-wordLR_dim300",
+    version="1.0.0",
+    summary="",
+    author="paddlepaddle",
+    author_email="",
+    type="nlp/semantic_model")
+class Embedding(TokenEmbedding):
+    """
+    Embedding model
+    """
+    def __init__(self, *args, **kwargs):
+        super(Embedding, self).__init__(embedding_name="w2v.baidu_encyclopedia.target.word-wordLR.dim300", *args, **kwargs)
+    @serving
+    def calc_similarity(self, data: List[List[str]]):
+        """
+        Calculate similarities of giving word pairs.
+        """
+        results = []
+        for word_pair in data:
+            if len(word_pair) != 2:
+                raise RuntimeError(
+                    f'The input must have two words, but got {len(word_pair)}. Please check your inputs.')
+            if not isinstance(word_pair[0], str) or not isinstance(word_pair[1], str):
+                raise RuntimeError(
+                    f'The types of text pair must be (str, str), but got'
+                    f' ({type(word_pair[0]).__name__}, {type(word_pair[1]).__name__}). Please check your inputs.')
+            for word in word_pair:
+                if self.get_idx_from_word(word) == \
+                        self.get_idx_from_word(self.vocab.unk_token):
+                    raise RuntimeError(
+                        f'Word "{word}" is not in vocab. Please check your inputs.')
+            results.append(str(self.cosine_sim(*word_pair)))
+        return results
--- a/modules/text/embedding/w2v_baidu_encyclopedia_target_word-wordPosition_dim300/README.md
+++ b/modules/text/embedding/w2v_baidu_encyclopedia_target_word-wordPosition_dim300/README.md
--- a/modules/text/embedding/w2v_baidu_encyclopedia_target_word-wordPosition_dim300/__init__.py
+++ b/modules/text/embedding/w2v_baidu_encyclopedia_target_word-wordPosition_dim300/__init__.py
--- a/modules/text/embedding/w2v_baidu_encyclopedia_target_word-wordPosition_dim300/module.py
+++ b/modules/text/embedding/w2v_baidu_encyclopedia_target_word-wordPosition_dim300/module.py
+# Copyright (c) 2020 PaddlePaddle Authors. All Rights Reserved.
+#
+# Licensed under the Apache License, Version 2.0 (the "License");
+# you may not use this file except in compliance with the License.
+# You may obtain a copy of the License at
+#
+#     http://www.apache.org/licenses/LICENSE-2.0
+#
+# Unless required by applicable law or agreed to in writing, software
+# distributed under the License is distributed on an "AS IS" BASIS,
+# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+# See the License for the specific language governing permissions and
+# limitations under the License.
+from typing import List
+from paddlenlp.embeddings import TokenEmbedding
+from paddlehub.module.module import moduleinfo, serving
+@moduleinfo(
+    name="w2v_baidu_encyclopedia_target_word-wordPosition_dim300",
+    version="1.0.0",
+    summary="",
+    author="paddlepaddle",
+    author_email="",
+    type="nlp/semantic_model")
+class Embedding(TokenEmbedding):
+    """
+    Embedding model
+    """
+    def __init__(self, *args, **kwargs):
+        super(Embedding, self).__init__(embedding_name="w2v.baidu_encyclopedia.target.word-wordPosition.dim300", *args, **kwargs)
+    @serving
+    def calc_similarity(self, data: List[List[str]]):
+        """
+        Calculate similarities of giving word pairs.
+        """
+        results = []
+        for word_pair in data:
+            if len(word_pair) != 2:
+                raise RuntimeError(
+                    f'The input must have two words, but got {len(word_pair)}. Please check your inputs.')
+            if not isinstance(word_pair[0], str) or not isinstance(word_pair[1], str):
+                raise RuntimeError(
+                    f'The types of text pair must be (str, str), but got'
+                    f' ({type(word_pair[0]).__name__}, {type(word_pair[1]).__name__}). Please check your inputs.')
+            for word in word_pair:
+                if self.get_idx_from_word(word) == \
+                        self.get_idx_from_word(self.vocab.unk_token):
+                    raise RuntimeError(
+                        f'Word "{word}" is not in vocab. Please check your inputs.')
+            results.append(str(self.cosine_sim(*word_pair)))
+        return results
--- a/modules/text/embedding/w2v_baidu_encyclopedia_target_word-word_dim300/README.md
+++ b/modules/text/embedding/w2v_baidu_encyclopedia_target_word-word_dim300/README.md
--- a/modules/text/embedding/w2v_baidu_encyclopedia_target_word-word_dim300/__init__.py
+++ b/modules/text/embedding/w2v_baidu_encyclopedia_target_word-word_dim300/__init__.py
--- a/modules/text/embedding/w2v_baidu_encyclopedia_target_word-word_dim300/module.py
+++ b/modules/text/embedding/w2v_baidu_encyclopedia_target_word-word_dim300/module.py
--- a/modules/text/embedding/w2v_financial_target_bigram-char_dim300/README.md
+++ b/modules/text/embedding/w2v_financial_target_bigram-char_dim300/README.md
--- a/modules/text/embedding/w2v_financial_target_bigram-char_dim300/__init__.py
+++ b/modules/text/embedding/w2v_financial_target_bigram-char_dim300/__init__.py
--- a/modules/text/embedding/w2v_financial_target_bigram-char_dim300/module.py
+++ b/modules/text/embedding/w2v_financial_target_bigram-char_dim300/module.py
--- a/modules/text/embedding/w2v_financial_target_word-bigram_dim300/README.md
+++ b/modules/text/embedding/w2v_financial_target_word-bigram_dim300/README.md
--- a/modules/text/embedding/w2v_financial_target_word-bigram_dim300/__init__.py
+++ b/modules/text/embedding/w2v_financial_target_word-bigram_dim300/__init__.py
--- a/modules/text/embedding/w2v_financial_target_word-bigram_dim300/module.py
+++ b/modules/text/embedding/w2v_financial_target_word-bigram_dim300/module.py
--- a/modules/text/embedding/w2v_financial_target_word-char_dim300/README.md
+++ b/modules/text/embedding/w2v_financial_target_word-char_dim300/README.md
--- a/modules/text/embedding/w2v_financial_target_word-char_dim300/__init__.py
+++ b/modules/text/embedding/w2v_financial_target_word-char_dim300/__init__.py
--- a/modules/text/embedding/w2v_financial_target_word-char_dim300/module.py
+++ b/modules/text/embedding/w2v_financial_target_word-char_dim300/module.py
--- a/modules/text/embedding/w2v_financial_target_word-word_dim300/README.md
+++ b/modules/text/embedding/w2v_financial_target_word-word_dim300/README.md
--- a/modules/text/embedding/w2v_financial_target_word-word_dim300/__init__.py
+++ b/modules/text/embedding/w2v_financial_target_word-word_dim300/__init__.py
--- a/modules/text/embedding/w2v_financial_target_word-word_dim300/module.py
+++ b/modules/text/embedding/w2v_financial_target_word-word_dim300/module.py
--- a/modules/text/embedding/w2v_literature_target_bigram-char_dim300/README.md
+++ b/modules/text/embedding/w2v_literature_target_bigram-char_dim300/README.md
--- a/modules/text/embedding/w2v_literature_target_bigram-char_dim300/__init__.py
+++ b/modules/text/embedding/w2v_literature_target_bigram-char_dim300/__init__.py
--- a/modules/text/embedding/w2v_literature_target_bigram-char_dim300/module.py
+++ b/modules/text/embedding/w2v_literature_target_bigram-char_dim300/module.py
--- a/modules/text/embedding/w2v_literature_target_word-bigram_dim300/README.md
+++ b/modules/text/embedding/w2v_literature_target_word-bigram_dim300/README.md
--- a/modules/text/embedding/w2v_literature_target_word-bigram_dim300/__init__.py
+++ b/modules/text/embedding/w2v_literature_target_word-bigram_dim300/__init__.py
--- a/modules/text/embedding/w2v_literature_target_word-bigram_dim300/module.py
+++ b/modules/text/embedding/w2v_literature_target_word-bigram_dim300/module.py
--- a/modules/text/embedding/w2v_literature_target_word-char_dim300/README.md
+++ b/modules/text/embedding/w2v_literature_target_word-char_dim300/README.md
--- a/modules/text/embedding/w2v_literature_target_word-char_dim300/__init__.py
+++ b/modules/text/embedding/w2v_literature_target_word-char_dim300/__init__.py
--- a/modules/text/embedding/w2v_literature_target_word-char_dim300/module.py
+++ b/modules/text/embedding/w2v_literature_target_word-char_dim300/module.py
--- a/modules/text/embedding/w2v_literature_target_word-word_dim300/README.md
+++ b/modules/text/embedding/w2v_literature_target_word-word_dim300/README.md
--- a/modules/text/embedding/w2v_literature_target_word-word_dim300/__init__.py
+++ b/modules/text/embedding/w2v_literature_target_word-word_dim300/__init__.py
--- a/modules/text/embedding/w2v_literature_target_word-word_dim300/module.py
+++ b/modules/text/embedding/w2v_literature_target_word-word_dim300/module.py
--- a/modules/text/embedding/w2v_mixed-large_target_word-char_dim300/README.md
+++ b/modules/text/embedding/w2v_mixed-large_target_word-char_dim300/README.md
--- a/modules/text/embedding/w2v_mixed-large_target_word-char_dim300/__init__.py
+++ b/modules/text/embedding/w2v_mixed-large_target_word-char_dim300/__init__.py
--- a/modules/text/embedding/w2v_mixed-large_target_word-char_dim300/module.py
+++ b/modules/text/embedding/w2v_mixed-large_target_word-char_dim300/module.py
--- a/modules/text/embedding/w2v_mixed-large_target_word-word_dim300/README.md
+++ b/modules/text/embedding/w2v_mixed-large_target_word-word_dim300/README.md
--- a/modules/text/embedding/w2v_mixed-large_target_word-word_dim300/__init__.py
+++ b/modules/text/embedding/w2v_mixed-large_target_word-word_dim300/__init__.py
--- a/modules/text/embedding/w2v_mixed-large_target_word-word_dim300/module.py
+++ b/modules/text/embedding/w2v_mixed-large_target_word-word_dim300/module.py
--- a/modules/text/embedding/w2v_people_daily_target_bigram-char_dim300/README.md
+++ b/modules/text/embedding/w2v_people_daily_target_bigram-char_dim300/README.md
--- a/modules/text/embedding/w2v_people_daily_target_bigram-char_dim300/__init__.py
+++ b/modules/text/embedding/w2v_people_daily_target_bigram-char_dim300/__init__.py
--- a/modules/text/embedding/w2v_people_daily_target_bigram-char_dim300/module.py
+++ b/modules/text/embedding/w2v_people_daily_target_bigram-char_dim300/module.py
--- a/modules/text/embedding/w2v_people_daily_target_word-bigram_dim300/README.md
+++ b/modules/text/embedding/w2v_people_daily_target_word-bigram_dim300/README.md
--- a/modules/text/embedding/w2v_people_daily_target_word-bigram_dim300/__init__.py
+++ b/modules/text/embedding/w2v_people_daily_target_word-bigram_dim300/__init__.py
--- a/modules/text/embedding/w2v_people_daily_target_word-bigram_dim300/module.py
+++ b/modules/text/embedding/w2v_people_daily_target_word-bigram_dim300/module.py
--- a/modules/text/embedding/w2v_people_daily_target_word-char_dim300/README.md
+++ b/modules/text/embedding/w2v_people_daily_target_word-char_dim300/README.md
--- a/modules/text/embedding/w2v_people_daily_target_word-char_dim300/__init__.py
+++ b/modules/text/embedding/w2v_people_daily_target_word-char_dim300/__init__.py
--- a/modules/text/embedding/w2v_people_daily_target_word-char_dim300/module.py
+++ b/modules/text/embedding/w2v_people_daily_target_word-char_dim300/module.py
--- a/modules/text/embedding/w2v_people_daily_target_word-word_dim300/README.md
+++ b/modules/text/embedding/w2v_people_daily_target_word-word_dim300/README.md
--- a/modules/text/embedding/w2v_people_daily_target_word-word_dim300/__init__.py
+++ b/modules/text/embedding/w2v_people_daily_target_word-word_dim300/__init__.py
--- a/modules/text/embedding/w2v_people_daily_target_word-word_dim300/module.py
+++ b/modules/text/embedding/w2v_people_daily_target_word-word_dim300/module.py
--- a/modules/text/embedding/w2v_sikuquanshu_target_word-bigram_dim300/README.md
+++ b/modules/text/embedding/w2v_sikuquanshu_target_word-bigram_dim300/README.md
--- a/modules/text/embedding/w2v_sikuquanshu_target_word-bigram_dim300/__init__.py
+++ b/modules/text/embedding/w2v_sikuquanshu_target_word-bigram_dim300/__init__.py
--- a/modules/text/embedding/w2v_sikuquanshu_target_word-bigram_dim300/module.py
+++ b/modules/text/embedding/w2v_sikuquanshu_target_word-bigram_dim300/module.py
--- a/modules/text/embedding/w2v_sikuquanshu_target_word-word_dim300/README.md
+++ b/modules/text/embedding/w2v_sikuquanshu_target_word-word_dim300/README.md
--- a/modules/text/embedding/w2v_sikuquanshu_target_word-word_dim300/__init__.py
+++ b/modules/text/embedding/w2v_sikuquanshu_target_word-word_dim300/__init__.py
--- a/modules/text/embedding/w2v_sikuquanshu_target_word-word_dim300/module.py
+++ b/modules/text/embedding/w2v_sikuquanshu_target_word-word_dim300/module.py
--- a/modules/text/embedding/w2v_sogou_target_bigram-char_dim300/README.md
+++ b/modules/text/embedding/w2v_sogou_target_bigram-char_dim300/README.md
--- a/modules/text/embedding/w2v_sogou_target_bigram-char_dim300/__init__.py
+++ b/modules/text/embedding/w2v_sogou_target_bigram-char_dim300/__init__.py
--- a/modules/text/embedding/w2v_sogou_target_bigram-char_dim300/module.py
+++ b/modules/text/embedding/w2v_sogou_target_bigram-char_dim300/module.py
--- a/modules/text/embedding/w2v_sogou_target_word-bigram_dim300/README.md
+++ b/modules/text/embedding/w2v_sogou_target_word-bigram_dim300/README.md
--- a/modules/text/embedding/w2v_sogou_target_word-bigram_dim300/__init__.py
+++ b/modules/text/embedding/w2v_sogou_target_word-bigram_dim300/__init__.py
--- a/modules/text/embedding/w2v_sogou_target_word-bigram_dim300/module.py
+++ b/modules/text/embedding/w2v_sogou_target_word-bigram_dim300/module.py
--- a/modules/text/embedding/w2v_sogou_target_word-char_dim300/README.md
+++ b/modules/text/embedding/w2v_sogou_target_word-char_dim300/README.md
--- a/modules/text/embedding/w2v_sogou_target_word-char_dim300/__init__.py
+++ b/modules/text/embedding/w2v_sogou_target_word-char_dim300/__init__.py
--- a/modules/text/embedding/w2v_sogou_target_word-char_dim300/module.py
+++ b/modules/text/embedding/w2v_sogou_target_word-char_dim300/module.py
--- a/modules/text/embedding/w2v_sogou_target_word-word_dim300/README.md
+++ b/modules/text/embedding/w2v_sogou_target_word-word_dim300/README.md
--- a/modules/text/embedding/w2v_sogou_target_word-word_dim300/__init__.py
+++ b/modules/text/embedding/w2v_sogou_target_word-word_dim300/__init__.py
--- a/modules/text/embedding/w2v_sogou_target_word-word_dim300/module.py
+++ b/modules/text/embedding/w2v_sogou_target_word-word_dim300/module.py
--- a/modules/text/embedding/w2v_weibo_target_bigram-char_dim300/README.md
+++ b/modules/text/embedding/w2v_weibo_target_bigram-char_dim300/README.md
--- a/modules/text/embedding/w2v_weibo_target_bigram-char_dim300/__init__.py
+++ b/modules/text/embedding/w2v_weibo_target_bigram-char_dim300/__init__.py
--- a/modules/text/embedding/w2v_weibo_target_bigram-char_dim300/module.py
+++ b/modules/text/embedding/w2v_weibo_target_bigram-char_dim300/module.py
--- a/modules/text/embedding/w2v_weibo_target_word-bigram_dim300/README.md
+++ b/modules/text/embedding/w2v_weibo_target_word-bigram_dim300/README.md
--- a/modules/text/embedding/w2v_weibo_target_word-bigram_dim300/__init__.py
+++ b/modules/text/embedding/w2v_weibo_target_word-bigram_dim300/__init__.py
--- a/modules/text/embedding/w2v_weibo_target_word-bigram_dim300/module.py
+++ b/modules/text/embedding/w2v_weibo_target_word-bigram_dim300/module.py
--- a/modules/text/embedding/w2v_weibo_target_word-char_dim300/README.md
+++ b/modules/text/embedding/w2v_weibo_target_word-char_dim300/README.md
--- a/modules/text/embedding/w2v_weibo_target_word-char_dim300/__init__.py
+++ b/modules/text/embedding/w2v_weibo_target_word-char_dim300/__init__.py
--- a/modules/text/embedding/w2v_weibo_target_word-char_dim300/module.py
+++ b/modules/text/embedding/w2v_weibo_target_word-char_dim300/module.py
--- a/modules/text/embedding/w2v_weibo_target_word-word_dim300/README.md
+++ b/modules/text/embedding/w2v_weibo_target_word-word_dim300/README.md
--- a/modules/text/embedding/w2v_weibo_target_word-word_dim300/__init__.py
+++ b/modules/text/embedding/w2v_weibo_target_word-word_dim300/__init__.py
--- a/modules/text/embedding/w2v_weibo_target_word-word_dim300/module.py
+++ b/modules/text/embedding/w2v_weibo_target_word-word_dim300/module.py
--- a/modules/text/embedding/w2v_wiki_target_bigram-char_dim300/README.md
+++ b/modules/text/embedding/w2v_wiki_target_bigram-char_dim300/README.md
--- a/modules/text/embedding/w2v_wiki_target_bigram-char_dim300/__init__.py
+++ b/modules/text/embedding/w2v_wiki_target_bigram-char_dim300/__init__.py
--- a/modules/text/embedding/w2v_wiki_target_bigram-char_dim300/module.py
+++ b/modules/text/embedding/w2v_wiki_target_bigram-char_dim300/module.py
--- a/modules/text/embedding/w2v_wiki_target_word-bigram_dim300/README.md
+++ b/modules/text/embedding/w2v_wiki_target_word-bigram_dim300/README.md
--- a/modules/text/embedding/w2v_wiki_target_word-bigram_dim300/__init__.py
+++ b/modules/text/embedding/w2v_wiki_target_word-bigram_dim300/__init__.py
--- a/modules/text/embedding/w2v_wiki_target_word-bigram_dim300/module.py
+++ b/modules/text/embedding/w2v_wiki_target_word-bigram_dim300/module.py
--- a/modules/text/embedding/w2v_wiki_target_word-char_dim300/README.md
+++ b/modules/text/embedding/w2v_wiki_target_word-char_dim300/README.md
--- a/modules/text/embedding/w2v_wiki_target_word-char_dim300/__init__.py
+++ b/modules/text/embedding/w2v_wiki_target_word-char_dim300/__init__.py
--- a/modules/text/embedding/w2v_wiki_target_word-char_dim300/module.py
+++ b/modules/text/embedding/w2v_wiki_target_word-char_dim300/module.py
--- a/modules/text/embedding/w2v_wiki_target_word-word_dim300/README.md
+++ b/modules/text/embedding/w2v_wiki_target_word-word_dim300/README.md
--- a/modules/text/embedding/w2v_wiki_target_word-word_dim300/__init__.py
+++ b/modules/text/embedding/w2v_wiki_target_word-word_dim300/__init__.py
--- a/modules/text/embedding/w2v_wiki_target_word-word_dim300/module.py
+++ b/modules/text/embedding/w2v_wiki_target_word-word_dim300/module.py
--- a/modules/text/embedding/w2v_zhihu_target_bigram-char_dim300/README.md
+++ b/modules/text/embedding/w2v_zhihu_target_bigram-char_dim300/README.md
--- a/modules/text/embedding/w2v_zhihu_target_bigram-char_dim300/__init__.py
+++ b/modules/text/embedding/w2v_zhihu_target_bigram-char_dim300/__init__.py
--- a/modules/text/embedding/w2v_zhihu_target_bigram-char_dim300/module.py
+++ b/modules/text/embedding/w2v_zhihu_target_bigram-char_dim300/module.py
--- a/modules/text/embedding/w2v_zhihu_target_word-bigram_dim300/README.md
+++ b/modules/text/embedding/w2v_zhihu_target_word-bigram_dim300/README.md
--- a/modules/text/embedding/w2v_zhihu_target_word-bigram_dim300/__init__.py
+++ b/modules/text/embedding/w2v_zhihu_target_word-bigram_dim300/__init__.py
--- a/modules/text/embedding/w2v_zhihu_target_word-bigram_dim300/module.py
+++ b/modules/text/embedding/w2v_zhihu_target_word-bigram_dim300/module.py
--- a/modules/text/embedding/w2v_zhihu_target_word-char_dim300/README.md
+++ b/modules/text/embedding/w2v_zhihu_target_word-char_dim300/README.md
--- a/modules/text/embedding/w2v_zhihu_target_word-char_dim300/__init__.py
+++ b/modules/text/embedding/w2v_zhihu_target_word-char_dim300/__init__.py
--- a/modules/text/embedding/w2v_zhihu_target_word-char_dim300/module.py
+++ b/modules/text/embedding/w2v_zhihu_target_word-char_dim300/module.py
--- a/modules/text/embedding/w2v_zhihu_target_word-word_dim300/README.md
+++ b/modules/text/embedding/w2v_zhihu_target_word-word_dim300/README.md
--- a/modules/text/embedding/w2v_zhihu_target_word-word_dim300/__init__.py
+++ b/modules/text/embedding/w2v_zhihu_target_word-word_dim300/__init__.py
--- a/modules/text/embedding/w2v_zhihu_target_word-word_dim300/module.py
+++ b/modules/text/embedding/w2v_zhihu_target_word-word_dim300/module.py