提交 · egs · PaddlePaddle / DeepSpeech

21 5月, 2021 5 次提交
- H
  
  fix ds2 default config · 759b2e0b
  由 Hui Zhang 提交于 5月 21, 2021
  
  759b2e0b
- H
  remove sequnce_mask and change ds2 export audio shape to [B,T,D] (#639) · b3bc4513
  由 Hui Zhang 提交于 5月 21, 2021
```
* remove sequnce_mask

* format

* fix ds2 export audio shape from B,D,T to B,T,D
```
  b3bc4513
- H
  add Cedict egs & pypinyin using jieba as wordseg & phkit using local pypinyin package (#637) · 749a1130
  由 Hui Zhang 提交于 5月 21, 2021
```
* down and parse cedict

* remove useless

* using third party python pinyin

* jieba as default wordseg

* remove useless

* remove pinyin dict

* using jieba.lcut

* remove doc of cedict egs

* add fan2jian test

* add description for say_digit
```
  749a1130
- H
  
  more docs (#636) · f22f6819
  由 Hui Zhang 提交于 5月 21, 2021
  
  f22f6819
- H
  
  chinese to phoneme (#633) · fd521310
  由 Hui Zhang 提交于 5月 21, 2021
  
  fd521310
20 5月, 2021 1 次提交
- H
  refactor g2p egs (#630) · 1b373bfc
  由 Hui Zhang 提交于 5月 20, 2021
```
* refactor g2p egs

* add sha-bone; remove avg.sh from egs;
```
  1b373bfc
19 5月, 2021 8 次提交

由 Feiyu Chan 提交于 5月 19, 2021

* add an example to convert transcription to pinyin with pypinyin and jieba

* format code

* 1. remove script for data downloading, since Baker dataset is not easily downloaded via terminal;
2. remove pypinyin as an extra requirement; it is alreay required by the main project;
3. clean code.

* change output format

075635d2

H

add praat and texgrid doc (#628) · 9cc750bf
由 Hui Zhang 提交于 5月 19, 2021

9cc750bf
H

fix doc link (#627) · 2ff726a6
由 Hui Zhang 提交于 5月 19, 2021

2ff726a6
H

add tarball utils (#626) · 0a7958b3
由 Hui Zhang 提交于 5月 19, 2021

0a7958b3
F
Merge pull request #625 from PaddlePaddle/rsl · 8697d422
由 Feiyu Chan 提交于 5月 19, 2021
```
fix doc representation
```
8697d422
H

fix result; add feature list · 37c53241
由 Hui Zhang 提交于 5月 19, 2021

37c53241

more decoding method (#618) · 0a3a840b

由 Hui Zhang 提交于 5月 19, 2021

* more decoding method

* all decode method test scripts; result readme

* exp libri confi

* parallel data scripts; more mask test; need pybind11 repo

* speed perturb config

* libri conf test set

0a3a840b

train ds2 model (#622) · 295f8bda

由 Hui Zhang 提交于 5月 19, 2021

* default cmvn compute config; more log of grad clip; diff ds2 cmvn compute and conf; ds2 lr step by epoch;

* fix ds2 config

* fix install and egs link

* sox speed pertrub shape (T, C), float64, process using int32

* fix libri ds2 scripts; add ngram and spm doc

* aishell ds2 cer7.86

* fix ds2 result

295f8bda

18 5月, 2021 1 次提交
- H
  add text normlization (#620) · 48537615
  由 Hui Zhang 提交于 5月 18, 2021
```
* add text normlization

* add space
```
  48537615
17 5月, 2021 3 次提交
- H
  
  using soxbinddings (#619) · d0635c65
  由 Hui Zhang 提交于 5月 17, 2021
  
  d0635c65
- H
  ctc decoding weight 0.5 (#614) · d777edc6
  由 Hui Zhang 提交于 5月 17, 2021
```
* ctc decoding weight 0.5

* tiny decoding conf

* more label of mergify

* format doc
```
  d777edc6
- H
  chinese char/word ngram lm (#613) · 538bf271
  由 Hui Zhang 提交于 5月 17, 2021
```
* add ngram lm egs

* add zhon repo

* install kenlm, zhon

* format

* add chinese_text_normalization repo

* add ngram lm egs
```
  538bf271
14 5月, 2021 1 次提交
- H
  
  fix image link (#612) · 2bdf4c94
  由 Hui Zhang 提交于 5月 14, 2021
  
  2bdf4c94
13 5月, 2021 3 次提交
- H
  
  fix doc (#611) · db022fac
  由 Hui Zhang 提交于 5月 13, 2021
  
  db022fac
- H
  
  Fix readme install link (#610) · 90512c39
  由 Hui Zhang 提交于 5月 13, 2021
  
  90512c39
- H
  speech text process docs (#607) · a12b1678
  由 Hui Zhang 提交于 5月 13, 2021
```
* add more speech doc

* fix doc path and mergify

* format doc
```
  a12b1678
12 5月, 2021 5 次提交

more speech docs (#606) · 7bbe1d66

由 Hui Zhang 提交于 5月 12, 2021

* add speech related docs: tts, text front end, ngram lm, corrector

* format doc

* mergify with doc

7bbe1d66

H

add mergify config (#605) · e9a0e178
由 Hui Zhang 提交于 5月 12, 2021

e9a0e178
H

add stale config (#604) · 6e57a789
由 Hui Zhang 提交于 5月 12, 2021

6e57a789
H
update doc (#603) · c6ae9857
由 Hui Zhang 提交于 5月 12, 2021
```
* fix doc format

* format doc
```
c6ae9857

E2E/Streaming Transformer/Conformer ASR (#578) · 71e046b0

由 Hui Zhang 提交于 5月 12, 2021

* add cmvn and label smoothing loss layer

* add layer for transformer

* add glu and conformer conv

* add torch compatiable hack, mask funcs

* not hack size since it exists

* add test; attention

* add attention, common utils, hack paddle

* add audio utils

* conformer batch padding mask bug fix #223

* fix typo, python infer fix rnn mem opt name error and batchnorm1d, will be available at 2.0.2

* fix ci

* fix ci

* add encoder

* refactor egs

* add decoder

* refactor ctc, add ctc align, refactor ckpt, add warmup lr scheduler, cmvn utils

* refactor docs

* add fix

* fix readme

* fix bugs, refactor collator, add pad_sequence, fix ckpt bugs

* fix docstring

* refactor data feed order

* add u2 model

* refactor cmvn, test

* add utils

* add u2 config

* fix bugs

* fix bugs

* fix autograd maybe has problem when using inplace operation

* refactor data, build vocab; add format data

* fix text featurizer

* refactor build vocab

* add fbank, refactor feature of speech

* refactor audio feat

* refactor data preprare

* refactor data

* model init from config

* add u2 bins

* flake8

* can train

* fix bugs, add coverage, add scripts

* test can run

* fix data

* speed perturb with sox

* add spec aug

* fix for train

* fix train logitc

* fix logger

* log valid loss, time dataset process

* using np for speed perturb, remove some debug log of grad clip

* fix logger

* fix build vocab

* fix logger name

* using module logger as default

* fix

* fix install

* reorder imports

* fix board logger

* fix logger

* kaldi fbank and mfcc

* fix cmvn and print prarams

* fix add_eos_sos and cmvn

* fix cmvn compute

* fix logger and cmvn

* fix subsampling, label smoothing loss, remove useless

* add notebook test

* fix log

* fix tb logger

* multi gpu valid

* fix log

* fix log

* fix config

* fix compute cmvn, need paddle 2.1

* add cmvn notebook

* fix layer tools

* fix compute cmvn

* add rtf

* fix decoding

* fix layer tools

* fix log, add avg script

* more avg and test info

* fix dataset pickle problem; using 2.1 paddle; num_workers can > 0; ckpt save in exp dir;fix setup.sh;

* add vimrc

* refactor tiny script, add transformer and stream conf

* spm demo; librisppech scripts and confs

* fix log

* add librispeech scripts

* refactor data pipe; fix conf; fix u2 default params

* fix bugs

* refactor aishell scripts

* fix test

* fix cmvn

* fix s0 scripts

* fix ds2 scripts and bugs

* fix dev & test dataset filter

* fix dataset filter

* filter dev

* fix ckpt path

* filter test, since librispeech will cause OOM, but all test wer will be worse, since mismatch train with test

* add comment

* add syllable doc

* fix ds2 configs

* add doc

* add pypinyin tools

* fix decoder using blank_id=0

* mmseg with pybind11

* format code

71e046b0

14 4月, 2021 1 次提交
- H
  Fix (#594) · 3a2de9e4
  由 Hui Zhang 提交于 4月 14, 2021
```
* fix install

* rm feature request
```
  3a2de9e4
30 3月, 2021 1 次提交
- H
  
  fix install (#580) · a9d0117c
  由 Hui Zhang 提交于 3月 30, 2021
  
  a9d0117c
24 3月, 2021 4 次提交
- H
  
  disscusion for questions, issue only for bug report (#573) · 9ac99f7c
  由 Hui Zhang 提交于 3月 24, 2021
  
  9ac99f7c
- H
  
  Update issue templates · 18567ced
  由 Hui Zhang 提交于 3月 24, 2021
  
  18567ced
- H
  
  Update issue templates · e0c9bddb
  由 Hui Zhang 提交于 3月 24, 2021
  
  e0c9bddb
- H
  
  Update issue templates · 52b9c8fa
  由 Hui Zhang 提交于 3月 24, 2021
  
  52b9c8fa
23 3月, 2021 1 次提交
- H
  fix doc link and enhance install (#570) · d4e84f9b
  由 Hui Zhang 提交于 3月 23, 2021
```
* fix doc link

* fix install

* fix install doc

* fix typo

* fix lm doc
```
  d4e84f9b
22 3月, 2021 1 次提交

batch average ctc loss (#567) · e0a87a5a

由 Hui Zhang 提交于 3月 22, 2021

* when loss div batchsize, change lr, more epoch, loss can reduce more and cer lower than before

* since loss reduce more when loss div batchsize,  less lm alpha can be better.

* less lm alpha, more cer reduce

* alpha 2.2, cer 0.077478

* alpha 1.9, cer 0.077249

* large librispeech lr for batch_average ctc loss

* since loss reduce and model more confidence, then less lm alpha

e0a87a5a

17 3月, 2021 1 次提交
- H
  fix egs bugs (#552) · 258307df
  由 Hui Zhang 提交于 3月 17, 2021
```
* fix egs

* fix log
```
  258307df
11 3月, 2021 4 次提交
- Z
  
  Update README_cn.md · 4c8c2178
  由 Zeyu Chen 提交于 3月 11, 2021
  
  4c8c2178
- Z
  
  Update README.md · aaafe141
  由 Zeyu Chen 提交于 3月 11, 2021
  
  aaafe141
- Z
  
  add pr template (#550) · d3a5c6d5
  由 Zeyu Chen 提交于 3月 11, 2021
  
  d3a5c6d5
- H
  Refactor CTC module, add embedding and fix log (#549) · 1539f3e0
  由 Hui Zhang 提交于 3月 11, 2021
```
* add acts, refactor ctc, add pos embed

* fix export, dataloader time log

* fix egs

* fix libri readme
```
  1539f3e0

PaddlePaddle / DeepSpeech 大约 2 年 前同步成功

PaddlePaddle / DeepSpeech
大约 2 年前同步成功