- 18 Dec, 2020 1 commit
Committed by Jack Zhou
* add more English embedding names * fix doc bug * delete useless description * add comments for TokenEmbedding * add embedding model info
-
- 17 Dec, 2020 1 commit
Committed by Jack Zhou
fix url bug for word2vec
-
- 14 Dec, 2020 2 commits
Committed by wawltor
update the readme for the word_embedding (#5050)
-
Committed by Jack Zhou
* add all wiki embedding and part of baidu encyclopedia embedding. * add embedding example * add people_daily, weibo, sougou pretrained embedding * add zhihu, financial, literature embedding * Add embedding model readme; add embedding train example and readme * fix README example * fix embedding doc
-
- 12 Dec, 2020 1 commit
Committed by Jack Zhou
* 1. move PAD to the end 2. Add Readme.md * optimize readme doc
-
- 10 Dec, 2020 1 commit
Committed by Jack Zhou
* Add TokenEmbedding * download corpus embedding data * load embedding data by specifying corpus name * extend the vocab of tokenizer from corpus embedding data * add unk token setting * modify tokenizer * add extend vocab * move jieba tokenizer and rename corpus_name->embedding_name * use bos url instead of localhost * add log when loading data * add token dot computation; add __repr__ of TokenEmbedding * add color logging * use paddlenlp.utils.log * adjust repr * update pretrained embedding table * fix padding idx
-