提交 · 69055698a2f2af98ac147e02f25f55c51a6a5b2e · PaddlePaddle / DeepSpeech

05 11月, 2021 1 次提交
- H
  
  transformer using batch data loader · 69055698
  由 Hui Zhang 提交于 11月 05, 2021
  
  69055698
29 7月, 2021 1 次提交
- H
  
  修复了librispeech和mini_libirspeech · b13630b1
  由 huangyuxin 提交于 7月 29, 2021
  
  b13630b1
27 7月, 2021 1 次提交
- H
  
  fix dataset meta path · 03eb343d
  由 Hui Zhang 提交于 7月 27, 2021
  
  03eb343d
13 7月, 2021 1 次提交
- H
  
  fix librispeech meta path · 99e28b8a
  由 Hui Zhang 提交于 7月 13, 2021
  
  99e28b8a
29 6月, 2021 1 次提交
- H
  
  add thchs30, aidatatang; · 9e99f99b
  由 Hui Zhang 提交于 6月 29, 2021
  
  9e99f99b
12 5月, 2021 1 次提交

E2E/Streaming Transformer/Conformer ASR (#578) · 71e046b0

由 Hui Zhang 提交于 5月 12, 2021

* add cmvn and label smoothing loss layer

* add layer for transformer

* add glu and conformer conv

* add torch compatiable hack, mask funcs

* not hack size since it exists

* add test; attention

* add attention, common utils, hack paddle

* add audio utils

* conformer batch padding mask bug fix #223

* fix typo, python infer fix rnn mem opt name error and batchnorm1d, will be available at 2.0.2

* fix ci

* fix ci

* add encoder

* refactor egs

* add decoder

* refactor ctc, add ctc align, refactor ckpt, add warmup lr scheduler, cmvn utils

* refactor docs

* add fix

* fix readme

* fix bugs, refactor collator, add pad_sequence, fix ckpt bugs

* fix docstring

* refactor data feed order

* add u2 model

* refactor cmvn, test

* add utils

* add u2 config

* fix bugs

* fix bugs

* fix autograd maybe has problem when using inplace operation

* refactor data, build vocab; add format data

* fix text featurizer

* refactor build vocab

* add fbank, refactor feature of speech

* refactor audio feat

* refactor data preprare

* refactor data

* model init from config

* add u2 bins

* flake8

* can train

* fix bugs, add coverage, add scripts

* test can run

* fix data

* speed perturb with sox

* add spec aug

* fix for train

* fix train logitc

* fix logger

* log valid loss, time dataset process

* using np for speed perturb, remove some debug log of grad clip

* fix logger

* fix build vocab

* fix logger name

* using module logger as default

* fix

* fix install

* reorder imports

* fix board logger

* fix logger

* kaldi fbank and mfcc

* fix cmvn and print prarams

* fix add_eos_sos and cmvn

* fix cmvn compute

* fix logger and cmvn

* fix subsampling, label smoothing loss, remove useless

* add notebook test

* fix log

* fix tb logger

* multi gpu valid

* fix log

* fix log

* fix config

* fix compute cmvn, need paddle 2.1

* add cmvn notebook

* fix layer tools

* fix compute cmvn

* add rtf

* fix decoding

* fix layer tools

* fix log, add avg script

* more avg and test info

* fix dataset pickle problem; using 2.1 paddle; num_workers can > 0; ckpt save in exp dir;fix setup.sh;

* add vimrc

* refactor tiny script, add transformer and stream conf

* spm demo; librisppech scripts and confs

* fix log

* add librispeech scripts

* refactor data pipe; fix conf; fix u2 default params

* fix bugs

* refactor aishell scripts

* fix test

* fix cmvn

* fix s0 scripts

* fix ds2 scripts and bugs

* fix dev & test dataset filter

* fix dataset filter

* filter dev

* fix ckpt path

* filter test, since librispeech will cause OOM, but all test wer will be worse, since mismatch train with test

* add comment

* add syllable doc

* fix ds2 configs

* add doc

* add pypinyin tools

* fix decoder using blank_id=0

* mmseg with pybind11

* format code

71e046b0

08 3月, 2021 1 次提交

Support paddle 2.x (#538) · d7e75354

由 Hui Zhang 提交于 3月 08, 2021

* 2.x model

* model test pass

* fix data

* fix soundfile with flac support

* one thread dataloader test pass

* export feasture size
add trainer and utils
add setup model and dataloader
update travis using Bionic dist

* add venv; test under venv

* fix unittest; train and valid

* add train and config

* add config and train script

* fix ctc cuda memcopy error

* fix imports

* fix train valid log

* fix dataset batch shuffle shift start from 1
fix rank_zero_only decreator error
close tensorboard when train over
add decoding config and code

* test process can run

* test with decoding

* test and infer with decoding

* fix infer

* fix ctc loss
lr schedule
sortagrad
logger

* aishell egs

* refactor train
add aishell egs

* fix dataset batch shuffle and add batch sampler log
print model parameter

* fix model and ctc

* sequence_mask make all inputs zeros, which cause grad be zero, this is a bug of LessThanOp
add grad clip by global norm
add model train test notebook

* ctc loss
remove run prefix
using ord value as text id

* using unk when training
compute_loss need text ids
ord id using in test mode, which compute wer/cer

* fix tester

* add lr_deacy
refactor code

* fix tools

* fix ci
add tune
fix gru model bugs
add dataset and model test

* fix decoding

* refactor repo
fix decoding

* fix musan and rir dataset

* refactor io, loss, conv, rnn, gradclip, model, utils

* fix ci and import

* refactor model
add export jit model

* add deploy bin and test it

* rm uselss egs

* add layer tools

* refactor socket server
new model from pretrain

* remve useless

* fix instability loss and grad nan or inf for librispeech training

* fix sampler

* fix libri train.sh

* fix doc

* add license on cpp

* fix doc

* fix libri script

* fix install

* clip 5 wer 7.39, clip 400 wer 7.54, 1.8 clip 400 baseline 7.49

d7e75354

04 2月, 2021 2 次提交
- H
  
  add copyright · ba7cf078
  由 Hui Zhang 提交于 2月 04, 2021
  
  ba7cf078
- H
  
  refactor tiny egs · 75c8018e
  由 Hui Zhang 提交于 2月 04, 2021
  
  75c8018e
03 2月, 2021 1 次提交
- H
  
  support py3 · 126677a3
  由 Hui Zhang 提交于 2月 03, 2021
  
  126677a3
02 2月, 2021 1 次提交
- H
  
  using china data source · 457323e2
  由 Hui Zhang 提交于 2月 02, 2021
  
  457323e2
17 10月, 2019 1 次提交
- L
  
  update deepspeech to fluid api · d74f4ff3
  由 lfchener 提交于 10月 17, 2019
  
  d74f4ff3
19 9月, 2017 2 次提交
- Y
  
  Add data preparing for Aishell. · e9a42044
  由 yangyaming 提交于 9月 19, 2017
  
  e9a42044
- Y
  
  Extract common utility functions. · d1420d12
  由 yangyaming 提交于 9月 19, 2017
  
  d1420d12
13 9月, 2017 1 次提交
- X
  
  Update RAEDME.md and librispeech.py by following Yaming's review. · 351f61e3
  由 Xinghai Sun 提交于 9月 13, 2017
  
  351f61e3
12 9月, 2017 2 次提交
- X
  
  Add bash code highlight to README.md for DS2. · 35caf5e0
  由 Xinghai Sun 提交于 9月 12, 2017
  
  35caf5e0
- X
  
  Update examples scripts and REAME.md for DS2. · e11b735d
  由 Xinghai Sun 提交于 9月 12, 2017
  
  e11b735d
10 9月, 2017 1 次提交
- X
  
  Rename some folders and update examples. · ae7ef792
  由 Xinghai Sun 提交于 9月 10, 2017
  
  ae7ef792
06 9月, 2017 1 次提交
- X
  
  Re-organize folder structure and hierarchy for DS2. · 0bbb9c3e
  由 Xinghai Sun 提交于 9月 05, 2017
  
  0bbb9c3e
09 8月, 2017 2 次提交
- Y
  
  Add more test cases and make DP more clear. · 04970705
  由 yangyaming 提交于 8月 09, 2017
  
  04970705
- Y
  
  Unify encoding to 'utf-8' and optimize error rate calculation. · 14d2fb79
  由 yangyaming 提交于 8月 09, 2017
  
  14d2fb79
01 8月, 2017 1 次提交
- L
  
  change the wget method in run.sh of deep_speech2 · 5e20dfd4
  由 Luo Tao 提交于 8月 01, 2017
  
  5e20dfd4
15 6月, 2017 1 次提交
- X
  
  Add shuffle type of instance_shuffle and batch_shuffle_clipped. · ed5f04af
  由 Xinghai Sun 提交于 6月 15, 2017
  
  ed5f04af
13 6月, 2017 1 次提交
- X
  
  Add function, class and module docs for data parts in DS2. · b07ee84a
  由 Xinghai Sun 提交于 6月 13, 2017
  
  b07ee84a
12 6月, 2017 1 次提交

Refactor whole data preprocessor for DS2 (re-design classes, re-organize dir,... · cd3617ae

由 Xinghai Sun 提交于 6月 12, 2017

Refactor whole data preprocessor for DS2 (re-design classes, re-organize dir, add augmentaion interfaces etc.).

1. Refactor data preprocessor with new added class AudioSegment, SpeechSegment, TextFeaturizer, AudioFeaturizer, SpeechFeaturizer.
2. Add data augmentation interfaces and class AugmentorBase, AugmentationPipeline, VolumnPerturbAugmentor etc..
3. Seperate normalizer's mean and std computing from training, by adding FeatureNormalizer and a seperate tool compute_mean_std.py.
4. Re-organize directory.

cd3617ae

08 6月, 2017 1 次提交
- X
  
  Remove manifest's line number check from librispeech.py and update README.md. · 06e9f713
  由 Xinghai Sun 提交于 6月 08, 2017
  
  06e9f713
07 6月, 2017 1 次提交

Refine librispeech.py for DeepSpeech2. · d3eeb7fd

由 Xinghai Sun 提交于 6月 07, 2017

Summary:
1. Add manifest line check.
2. Avoid re-unpacking if unpacked data already exists.
3. Add full_download (download all 7 sub-datasets of LibriSpeech).

d3eeb7fd

03 6月, 2017 2 次提交
- X
  
  Update DS2 README.md and fix bug in librispeech.py · 730d5c4d
  由 Xinghai Sun 提交于 6月 03, 2017
  
  730d5c4d
- X
  
  Refactor decoder interfaces and add ./data directory. · 2a834865
  由 Xinghai Sun 提交于 6月 03, 2017
  
  2a834865
02 6月, 2017 1 次提交

1. Fix incorrect decoder result printing. · 8313895e

由 Xinghai Sun 提交于 6月 02, 2017

2. Fix incorrect batch-norm usage in RNN.
3. Fix overlapping train/dev/test manfests.
4. Update README.md and requirements.txt.
5. Expose more arguments to users in argparser.
6. Update all other details.

8313895e

25 5月, 2017 3 次提交
- X
  
  Add function docs. · 0babc5c4
  由 Xinghai Sun 提交于 5月 25, 2017
  
  0babc5c4
- X
  
  Add infererence and add SortaGrad for only first pass. · 70a343a4
  由 Xinghai Sun 提交于 5月 25, 2017
  
  70a343a4
- X
  Add librispeech dataset, audio data provider and simplfied DeepSpeech2 model configuration. · 3fc94427
  由 Xinghai Sun 提交于 5月 25, 2017
```
Bug exists when run training.
```
  3fc94427

PaddlePaddle / DeepSpeech 12 个月 前同步成功

PaddlePaddle / DeepSpeech
12 个月前同步成功