Commits · 0a3a840beef768da54db66d26ccb8476300833a3 · PaddlePaddle / DeepSpeech

19 May, 2021 2 commits

Committed by Hui Zhang on May 19, 2021

* more decoding method

* all decode method test scripts; result readme

* exp libri confi

* parallel data scripts; more mask test; need pybind11 repo

* speed perturb config

* libri conf test set

0a3a840b

train ds2 model (#622) · 295f8bda

Committed by Hui Zhang on May 19, 2021

* default cmvn compute config; more log of grad clip; diff ds2 cmvn compute and conf; ds2 lr step by epoch;

* fix ds2 config

* fix install and egs link

* sox speed perturb shape (T, C), float64, process using int32

* fix libri ds2 scripts; add ngram and spm doc

* aishell ds2 cer7.86

* fix ds2 result

295f8bda

18 May, 2021 1 commit
- add text normalization (#620) · 48537615
  Committed by Hui Zhang on May 18, 2021
```
* add text normalization

* add space
```
  48537615
17 May, 2021 3 commits
- using soxbindings (#619) · d0635c65
  Committed by Hui Zhang on May 17, 2021
  
  d0635c65
- ctc decoding weight 0.5 (#614) · d777edc6
  Committed by Hui Zhang on May 17, 2021
```
* ctc decoding weight 0.5

* tiny decoding conf

* more label of mergify

* format doc
```
  d777edc6
- chinese char/word ngram lm (#613) · 538bf271
  Committed by Hui Zhang on May 17, 2021
```
* add ngram lm egs

* add zhon repo

* install kenlm, zhon

* format

* add chinese_text_normalization repo

* add ngram lm egs
```
  538bf271
14 May, 2021 1 commit
- fix image link (#612) · 2bdf4c94
  Committed by Hui Zhang on May 14, 2021
  
  2bdf4c94
13 May, 2021 3 commits
- fix doc (#611) · db022fac
  Committed by Hui Zhang on May 13, 2021
  
  db022fac
- Fix readme install link (#610) · 90512c39
  Committed by Hui Zhang on May 13, 2021
  
  90512c39
- speech text process docs (#607) · a12b1678
  Committed by Hui Zhang on May 13, 2021
```
* add more speech doc

* fix doc path and mergify

* format doc
```
  a12b1678
12 May, 2021 5 commits

more speech docs (#606) · 7bbe1d66

Committed by Hui Zhang on May 12, 2021

* add speech related docs: tts, text front end, ngram lm, corrector

* format doc

* mergify with doc

7bbe1d66

add mergify config (#605) · e9a0e178
Committed by Hui Zhang on May 12, 2021

e9a0e178
add stale config (#604) · 6e57a789
Committed by Hui Zhang on May 12, 2021

6e57a789
update doc (#603) · c6ae9857
Committed by Hui Zhang on May 12, 2021
```
* fix doc format

* format doc
```
c6ae9857

E2E/Streaming Transformer/Conformer ASR (#578) · 71e046b0

Committed by Hui Zhang on May 12, 2021

* add cmvn and label smoothing loss layer

* add layer for transformer

* add glu and conformer conv

* add torch compatible hack, mask funcs

* not hack size since it exists

* add test; attention

* add attention, common utils, hack paddle

* add audio utils

* conformer batch padding mask bug fix #223

* fix typo, python infer fix rnn mem opt name error and batchnorm1d, will be available at 2.0.2

* fix ci

* fix ci

* add encoder

* refactor egs

* add decoder

* refactor ctc, add ctc align, refactor ckpt, add warmup lr scheduler, cmvn utils

* refactor docs

* add fix

* fix readme

* fix bugs, refactor collator, add pad_sequence, fix ckpt bugs

* fix docstring

* refactor data feed order

* add u2 model

* refactor cmvn, test

* add utils

* add u2 config

* fix bugs

* fix bugs

* fix autograd maybe has problem when using inplace operation

* refactor data, build vocab; add format data

* fix text featurizer

* refactor build vocab

* add fbank, refactor feature of speech

* refactor audio feat

* refactor data prepare

* refactor data

* model init from config

* add u2 bins

* flake8

* can train

* fix bugs, add coverage, add scripts

* test can run

* fix data

* speed perturb with sox

* add spec aug

* fix for train

* fix train logic

* fix logger

* log valid loss, time dataset process

* using np for speed perturb, remove some debug log of grad clip

* fix logger

* fix build vocab

* fix logger name

* using module logger as default

* fix

* fix install

* reorder imports

* fix board logger

* fix logger

* kaldi fbank and mfcc

* fix cmvn and print params

* fix add_eos_sos and cmvn

* fix cmvn compute

* fix logger and cmvn

* fix subsampling, label smoothing loss, remove useless

* add notebook test

* fix log

* fix tb logger

* multi gpu valid

* fix log

* fix log

* fix config

* fix compute cmvn, need paddle 2.1

* add cmvn notebook

* fix layer tools

* fix compute cmvn

* add rtf

* fix decoding

* fix layer tools

* fix log, add avg script

* more avg and test info

* fix dataset pickle problem; using 2.1 paddle; num_workers can > 0; ckpt save in exp dir;fix setup.sh;

* add vimrc

* refactor tiny script, add transformer and stream conf

* spm demo; librispeech scripts and confs

* fix log

* add librispeech scripts

* refactor data pipe; fix conf; fix u2 default params

* fix bugs

* refactor aishell scripts

* fix test

* fix cmvn

* fix s0 scripts

* fix ds2 scripts and bugs

* fix dev & test dataset filter

* fix dataset filter

* filter dev

* fix ckpt path

* filter the test set, since librispeech will cause OOM; but all test WER will be worse, since train and test are mismatched

* add comment

* add syllable doc

* fix ds2 configs

* add doc

* add pypinyin tools

* fix decoder using blank_id=0

* mmseg with pybind11

* format code

71e046b0

14 April, 2021 1 commit
- Fix (#594) · 3a2de9e4
  Committed by Hui Zhang on April 14, 2021
```
* fix install

* rm feature request
```
  3a2de9e4
30 March, 2021 1 commit
- fix install (#580) · a9d0117c
  Committed by Hui Zhang on March 30, 2021
  
  a9d0117c
24 March, 2021 4 commits
- discussion for questions, issue only for bug report (#573) · 9ac99f7c
  Committed by Hui Zhang on March 24, 2021
  
  9ac99f7c
- Update issue templates · 18567ced
  Committed by Hui Zhang on March 24, 2021
  
  18567ced
- Update issue templates · e0c9bddb
  Committed by Hui Zhang on March 24, 2021
  
  e0c9bddb
- Update issue templates · 52b9c8fa
  Committed by Hui Zhang on March 24, 2021
  
  52b9c8fa
23 March, 2021 1 commit
- fix doc link and enhance install (#570) · d4e84f9b
  Committed by Hui Zhang on March 23, 2021
```
* fix doc link

* fix install

* fix install doc

* fix typo

* fix lm doc
```
  d4e84f9b
22 March, 2021 1 commit

batch average ctc loss (#567) · e0a87a5a

Committed by Hui Zhang on March 22, 2021

* when loss div batchsize, change lr, more epoch, loss can reduce more and cer lower than before

* since loss reduce more when loss div batchsize,  less lm alpha can be better.

* less lm alpha, more cer reduce

* alpha 2.2, cer 0.077478

* alpha 1.9, cer 0.077249

* large librispeech lr for batch_average ctc loss

* since loss reduce and model more confidence, then less lm alpha

e0a87a5a

17 March, 2021 1 commit
- fix egs bugs (#552) · 258307df
  Committed by Hui Zhang on March 17, 2021
```
* fix egs

* fix log
```
  258307df
11 March, 2021 4 commits
- Update README_cn.md · 4c8c2178
  Committed by Zeyu Chen on March 11, 2021
  
  4c8c2178
- Update README.md · aaafe141
  Committed by Zeyu Chen on March 11, 2021
  
  aaafe141
- add pr template (#550) · d3a5c6d5
  Committed by Zeyu Chen on March 11, 2021
  
  d3a5c6d5
- Refactor CTC module, add embedding and fix log (#549) · 1539f3e0
  Committed by Hui Zhang on March 11, 2021
```
* add acts, refactor ctc, add pos embed

* fix export, dataloader time log

* fix egs

* fix libri readme
```
  1539f3e0
10 March, 2021 3 commits
- add decoder reference doc (#547) · 00889bfa
  Committed by Hui Zhang on March 10, 2021
  
  00889bfa
- Fix doc format (#546) · 19e0f2ac
  Committed by Hui Zhang on March 10, 2021
  
  19e0f2ac
- Fix Doc (#544) · 57ed5cd2
  Committed by Hui Zhang on March 10, 2021
  
  57ed5cd2
08 March, 2021 1 commit

Support paddle 2.x (#538) · d7e75354

Committed by Hui Zhang on March 08, 2021

* 2.x model

* model test pass

* fix data

* fix soundfile with flac support

* one thread dataloader test pass

* export feature size
add trainer and utils
add setup model and dataloader
update travis using Bionic dist

* add venv; test under venv

* fix unittest; train and valid

* add train and config

* add config and train script

* fix ctc cuda memcopy error

* fix imports

* fix train valid log

* fix dataset batch shuffle shift start from 1
fix rank_zero_only decorator error
close tensorboard when train over
add decoding config and code

* test process can run

* test with decoding

* test and infer with decoding

* fix infer

* fix ctc loss
lr schedule
sortagrad
logger

* aishell egs

* refactor train
add aishell egs

* fix dataset batch shuffle and add batch sampler log
print model parameter

* fix model and ctc

* sequence_mask make all inputs zeros, which cause grad be zero, this is a bug of LessThanOp
add grad clip by global norm
add model train test notebook

* ctc loss
remove run prefix
using ord value as text id

* using unk when training
compute_loss need text ids
ord id using in test mode, which compute wer/cer

* fix tester

* add lr_decay
refactor code

* fix tools

* fix ci
add tune
fix gru model bugs
add dataset and model test

* fix decoding

* refactor repo
fix decoding

* fix musan and rir dataset

* refactor io, loss, conv, rnn, gradclip, model, utils

* fix ci and import

* refactor model
add export jit model

* add deploy bin and test it

* rm useless egs

* add layer tools

* refactor socket server
new model from pretrain

* remove useless

* fix instability loss and grad nan or inf for librispeech training

* fix sampler

* fix libri train.sh

* fix doc

* add license on cpp

* fix doc

* fix libri script

* fix install

* clip 5 wer 7.39, clip 400 wer 7.54, 1.8 clip 400 baseline 7.49

d7e75354

06 February, 2021 1 commit
- Refactor egs & add licences · 054d795d
  Committed by Zeyu Chen on February 06, 2021
```
refactor egs & add licences
```
  054d795d
04 February, 2021 7 commits
- update cn readme · ed5ebc91
  Committed by Hui Zhang on February 04, 2021
  
  ed5ebc91
- update en readme · a43ee0ff
  Committed by Hui Zhang on February 04, 2021
  
  a43ee0ff
- update aishell egs · 141109b4
  Committed by Hui Zhang on February 04, 2021
  
  141109b4
- update en readme · c246d315
  Committed by Hui Zhang on February 04, 2021
  
  c246d315
- fix egs · 7357e521
  Committed by Hui Zhang on February 04, 2021
  
  7357e521
- fix dataset dir · 9cfca9fc
  Committed by Hui Zhang on February 04, 2021
  
  9cfca9fc
- exclude decoders cpp · 0838e2ce
  Committed by Hui Zhang on February 04, 2021
  
  0838e2ce

PaddlePaddle / DeepSpeech · last synced successfully over 1 year ago