提交 · bf929ed8e1b1b10702bffd8cf55d43f0413b1c43 · PaddlePaddle / DeepSpeech

23 6月, 2021 1 次提交
- H
  
  ds2 default using 4gpu; new result of ds2 · 133a522f
  由 Hui Zhang 提交于 6月 23, 2021
  
  133a522f
22 6月, 2021 2 次提交
- H
  
  fix config for new datapipeline · 68149cb9
  由 Hui Zhang 提交于 6月 22, 2021
  
  68149cb9
- H
  
  fix chunk default config; tarball ckpt prfix dir; · 5a3a9e1f
  由 Hui Zhang 提交于 6月 22, 2021
  
  5a3a9e1f
21 6月, 2021 1 次提交
- H
  
  fix ds2 conf for new data pipeline · 8c1bf1a7
  由 Hui Zhang 提交于 6月 21, 2021
  
  8c1bf1a7
18 6月, 2021 1 次提交
- H
  
  move redundant params · 55742773
  由 Haoxin Ma 提交于 6月 18, 2021
  
  55742773
17 6月, 2021 1 次提交
- H
  
  move batch_size, work_nums, shuffle_method, sortagrad to collator · 698d7a9b
  由 Haoxin Ma 提交于 6月 17, 2021
  
  698d7a9b
16 6月, 2021 1 次提交
- H
  
  finish aishell/s0 · 6ee3033c
  由 Haoxin Ma 提交于 6月 16, 2021
  
  6ee3033c
15 6月, 2021 1 次提交
- H
  
  revise example/ting/s1 · 7bae32f3
  由 Haoxin Ma 提交于 6月 15, 2021
  
  7bae32f3
11 6月, 2021 1 次提交
- H
  
  feat_dim, vocab_size · b9110af9
  由 Haoxin Ma 提交于 6月 11, 2021
  
  b9110af9
09 6月, 2021 2 次提交
- H
  
  fix bugs · b4bda290
  由 Haoxin Ma 提交于 6月 09, 2021
  
  b4bda290
- H
  
  result with specaug · 255a6f00
  由 Hui Zhang 提交于 6月 09, 2021
  
  255a6f00
08 6月, 2021 3 次提交
- H
  
  move process utt to collator · 279348d7
  由 Haoxin Ma 提交于 6月 08, 2021
  
  279348d7
- H
  
  fix export and run.sh · 8781ab58
  由 Haoxin Ma 提交于 6月 08, 2021
  
  8781ab58
- H
  
  add result output · a58b1cb3
  由 Haoxin Ma 提交于 6月 08, 2021
  
  a58b1cb3
07 6月, 2021 2 次提交
- H
  
  fix clip conf · 9bb18062
  由 Hui Zhang 提交于 6月 07, 2021
  
  9bb18062
- H
  
  add stream conf · b3d23e6a
  由 Hui Zhang 提交于 6月 07, 2021
  
  b3d23e6a
04 6月, 2021 2 次提交
- H
  
  utt datapipeline · c8368410
  由 Haoxin Ma 提交于 6月 04, 2021
  
  c8368410
- H
  
  fix mask for bool type; fix other · 69dfc2a5
  由 Hui Zhang 提交于 6月 04, 2021
  
  69dfc2a5
01 6月, 2021 1 次提交
- H
  
  add crf · 34689bd1
  由 Hui Zhang 提交于 6月 01, 2021
  
  34689bd1
28 5月, 2021 1 次提交
- H
  
  add libri ds2 exp result · de780a0c
  由 Hui Zhang 提交于 5月 28, 2021
  
  de780a0c
25 5月, 2021 1 次提交
- C
  1. use space as separator; · 4c3c5546
  由 chenfeiyu 提交于 5月 25, 2021
```
2. add docstring for some functions.
```
  4c3c5546
21 5月, 2021 3 次提交
- H
  
  fix ds2 default config · 759b2e0b
  由 Hui Zhang 提交于 5月 21, 2021
  
  759b2e0b
- H
  remove sequnce_mask and change ds2 export audio shape to [B,T,D] (#639) · b3bc4513
  由 Hui Zhang 提交于 5月 21, 2021
```
* remove sequnce_mask

* format

* fix ds2 export audio shape from B,D,T to B,T,D
```
  b3bc4513
- H
  add Cedict egs & pypinyin using jieba as wordseg & phkit using local pypinyin package (#637) · 749a1130
  由 Hui Zhang 提交于 5月 21, 2021
```
* down and parse cedict

* remove useless

* using third party python pinyin

* jieba as default wordseg

* remove useless

* remove pinyin dict

* using jieba.lcut

* remove doc of cedict egs

* add fan2jian test

* add description for say_digit
```
  749a1130
20 5月, 2021 1 次提交
- H
  refactor g2p egs (#630) · 1b373bfc
  由 Hui Zhang 提交于 5月 20, 2021
```
* refactor g2p egs

* add sha-bone; remove avg.sh from egs;
```
  1b373bfc
19 5月, 2021 5 次提交

由 Feiyu Chan 提交于 5月 19, 2021

* add an example to convert transcription to pinyin with pypinyin and jieba

* format code

* 1. remove script for data downloading, since Baker dataset is not easily downloaded via terminal;
2. remove pypinyin as an extra requirement; it is alreay required by the main project;
3. clean code.

* change output format

075635d2

H

add tarball utils (#626) · 0a7958b3
由 Hui Zhang 提交于 5月 19, 2021

0a7958b3
H

fix result; add feature list · 37c53241
由 Hui Zhang 提交于 5月 19, 2021

37c53241

more decoding method (#618) · 0a3a840b

由 Hui Zhang 提交于 5月 19, 2021

* more decoding method

* all decode method test scripts; result readme

* exp libri confi

* parallel data scripts; more mask test; need pybind11 repo

* speed perturb config

* libri conf test set

0a3a840b

train ds2 model (#622) · 295f8bda

由 Hui Zhang 提交于 5月 19, 2021

* default cmvn compute config; more log of grad clip; diff ds2 cmvn compute and conf; ds2 lr step by epoch;

* fix ds2 config

* fix install and egs link

* sox speed pertrub shape (T, C), float64, process using int32

* fix libri ds2 scripts; add ngram and spm doc

* aishell ds2 cer7.86

* fix ds2 result

295f8bda

17 5月, 2021 2 次提交

ctc decoding weight 0.5 (#614) · d777edc6

由 Hui Zhang 提交于 5月 17, 2021

* ctc decoding weight 0.5

* tiny decoding conf

* more label of mergify

* format doc

d777edc6

chinese char/word ngram lm (#613) · 538bf271

由 Hui Zhang 提交于 5月 17, 2021

* add ngram lm egs

* add zhon repo

* install kenlm, zhon

* format

* add chinese_text_normalization repo

* add ngram lm egs

538bf271

12 5月, 2021 1 次提交

E2E/Streaming Transformer/Conformer ASR (#578) · 71e046b0

由 Hui Zhang 提交于 5月 12, 2021

* add cmvn and label smoothing loss layer

* add layer for transformer

* add glu and conformer conv

* add torch compatiable hack, mask funcs

* not hack size since it exists

* add test; attention

* add attention, common utils, hack paddle

* add audio utils

* conformer batch padding mask bug fix #223

* fix typo, python infer fix rnn mem opt name error and batchnorm1d, will be available at 2.0.2

* fix ci

* fix ci

* add encoder

* refactor egs

* add decoder

* refactor ctc, add ctc align, refactor ckpt, add warmup lr scheduler, cmvn utils

* refactor docs

* add fix

* fix readme

* fix bugs, refactor collator, add pad_sequence, fix ckpt bugs

* fix docstring

* refactor data feed order

* add u2 model

* refactor cmvn, test

* add utils

* add u2 config

* fix bugs

* fix bugs

* fix autograd maybe has problem when using inplace operation

* refactor data, build vocab; add format data

* fix text featurizer

* refactor build vocab

* add fbank, refactor feature of speech

* refactor audio feat

* refactor data preprare

* refactor data

* model init from config

* add u2 bins

* flake8

* can train

* fix bugs, add coverage, add scripts

* test can run

* fix data

* speed perturb with sox

* add spec aug

* fix for train

* fix train logitc

* fix logger

* log valid loss, time dataset process

* using np for speed perturb, remove some debug log of grad clip

* fix logger

* fix build vocab

* fix logger name

* using module logger as default

* fix

* fix install

* reorder imports

* fix board logger

* fix logger

* kaldi fbank and mfcc

* fix cmvn and print prarams

* fix add_eos_sos and cmvn

* fix cmvn compute

* fix logger and cmvn

* fix subsampling, label smoothing loss, remove useless

* add notebook test

* fix log

* fix tb logger

* multi gpu valid

* fix log

* fix log

* fix config

* fix compute cmvn, need paddle 2.1

* add cmvn notebook

* fix layer tools

* fix compute cmvn

* add rtf

* fix decoding

* fix layer tools

* fix log, add avg script

* more avg and test info

* fix dataset pickle problem; using 2.1 paddle; num_workers can > 0; ckpt save in exp dir;fix setup.sh;

* add vimrc

* refactor tiny script, add transformer and stream conf

* spm demo; librisppech scripts and confs

* fix log

* add librispeech scripts

* refactor data pipe; fix conf; fix u2 default params

* fix bugs

* refactor aishell scripts

* fix test

* fix cmvn

* fix s0 scripts

* fix ds2 scripts and bugs

* fix dev & test dataset filter

* fix dataset filter

* filter dev

* fix ckpt path

* filter test, since librispeech will cause OOM, but all test wer will be worse, since mismatch train with test

* add comment

* add syllable doc

* fix ds2 configs

* add doc

* add pypinyin tools

* fix decoder using blank_id=0

* mmseg with pybind11

* format code

71e046b0

22 3月, 2021 1 次提交

batch average ctc loss (#567) · e0a87a5a

由 Hui Zhang 提交于 3月 22, 2021

* when loss div batchsize, change lr, more epoch, loss can reduce more and cer lower than before

* since loss reduce more when loss div batchsize,  less lm alpha can be better.

* less lm alpha, more cer reduce

* alpha 2.2, cer 0.077478

* alpha 1.9, cer 0.077249

* large librispeech lr for batch_average ctc loss

* since loss reduce and model more confidence, then less lm alpha

e0a87a5a

17 3月, 2021 1 次提交
- H
  fix egs bugs (#552) · 258307df
  由 Hui Zhang 提交于 3月 17, 2021
```
* fix egs

* fix log
```
  258307df
11 3月, 2021 1 次提交
- H
  Refactor CTC module, add embedding and fix log (#549) · 1539f3e0
  由 Hui Zhang 提交于 3月 11, 2021
```
* add acts, refactor ctc, add pos embed

* fix export, dataloader time log

* fix egs

* fix libri readme
```
  1539f3e0
10 3月, 2021 2 次提交
- H
  
  Fix doc format (#546) · 19e0f2ac
  由 Hui Zhang 提交于 3月 10, 2021
  
  19e0f2ac
- H
  
  Fix Doc (#544) · 57ed5cd2
  由 Hui Zhang 提交于 3月 10, 2021
  
  57ed5cd2
08 3月, 2021 1 次提交

Support paddle 2.x (#538) · d7e75354

由 Hui Zhang 提交于 3月 08, 2021

* 2.x model

* model test pass

* fix data

* fix soundfile with flac support

* one thread dataloader test pass

* export feasture size
add trainer and utils
add setup model and dataloader
update travis using Bionic dist

* add venv; test under venv

* fix unittest; train and valid

* add train and config

* add config and train script

* fix ctc cuda memcopy error

* fix imports

* fix train valid log

* fix dataset batch shuffle shift start from 1
fix rank_zero_only decreator error
close tensorboard when train over
add decoding config and code

* test process can run

* test with decoding

* test and infer with decoding

* fix infer

* fix ctc loss
lr schedule
sortagrad
logger

* aishell egs

* refactor train
add aishell egs

* fix dataset batch shuffle and add batch sampler log
print model parameter

* fix model and ctc

* sequence_mask make all inputs zeros, which cause grad be zero, this is a bug of LessThanOp
add grad clip by global norm
add model train test notebook

* ctc loss
remove run prefix
using ord value as text id

* using unk when training
compute_loss need text ids
ord id using in test mode, which compute wer/cer

* fix tester

* add lr_deacy
refactor code

* fix tools

* fix ci
add tune
fix gru model bugs
add dataset and model test

* fix decoding

* refactor repo
fix decoding

* fix musan and rir dataset

* refactor io, loss, conv, rnn, gradclip, model, utils

* fix ci and import

* refactor model
add export jit model

* add deploy bin and test it

* rm uselss egs

* add layer tools

* refactor socket server
new model from pretrain

* remve useless

* fix instability loss and grad nan or inf for librispeech training

* fix sampler

* fix libri train.sh

* fix doc

* add license on cpp

* fix doc

* fix libri script

* fix install

* clip 5 wer 7.39, clip 400 wer 7.54, 1.8 clip 400 baseline 7.49

d7e75354

04 2月, 2021 1 次提交
- H
  
  update aishell egs · 141109b4
  由 Hui Zhang 提交于 2月 04, 2021
  
  141109b4

PaddlePaddle / DeepSpeech 大约 1 年 前同步成功

PaddlePaddle / DeepSpeech
大约 1 年前同步成功