Commits · bb5f9e7ad7bc8be4d905abcc7a94bdfdfb7b3b30 · PaddlePaddle / DeepSpeech

03 Nov, 2021 1 commit
- short paddleaudio to audio · bb5f9e7a
  Committed by Hui Zhang on Nov 03, 2021
28 Oct, 2021 1 commit
- Rename to paddleaudio. · ee9972a0
  Committed by KP on Oct 28, 2021
27 Oct, 2021 1 commit
- Merge PaddleAudio into PaddleSpeech. · e6756194
  Committed by KP on Oct 27, 2021
12 Oct, 2021 1 commit
- refactor raw ctc decoder into ctcdecoder · 69bd17dc
  Committed by Hui Zhang on Oct 12, 2021
08 Mar, 2021 1 commit

Committed by Hui Zhang on Mar 08, 2021

* 2.x model

* model test pass

* fix data

* fix soundfile with flac support

* one thread dataloader test pass

* export feature size
add trainer and utils
add setup model and dataloader
update travis using Bionic dist

* add venv; test under venv

* fix unittest; train and valid

* add train and config

* add config and train script

* fix ctc cuda memcopy error

* fix imports

* fix train valid log

* fix dataset batch shuffle shift start from 1
fix rank_zero_only decorator error
close tensorboard when train over
add decoding config and code

* test process can run

* test with decoding

* test and infer with decoding

* fix infer

* fix ctc loss
lr schedule
sortagrad
logger

* aishell egs

* refactor train
add aishell egs

* fix dataset batch shuffle and add batch sampler log
print model parameter

* fix model and ctc

* sequence_mask makes all inputs zero, which causes the grad to be zero; this is a bug in LessThanOp
add grad clip by global norm
add model train test notebook

* ctc loss
remove run prefix
using ord value as text id

* using unk when training
compute_loss need text ids
ord id is used in test mode, which computes wer/cer

* fix tester

* add lr_decay
refactor code

* fix tools

* fix ci
add tune
fix gru model bugs
add dataset and model test

* fix decoding

* refactor repo
fix decoding

* fix musan and rir dataset

* refactor io, loss, conv, rnn, gradclip, model, utils

* fix ci and import

* refactor model
add export jit model

* add deploy bin and test it

* rm useless egs

* add layer tools

* refactor socket server
new model from pretrain

* remove useless

* fix instability loss and grad nan or inf for librispeech training

* fix sampler

* fix libri train.sh

* fix doc

* add license on cpp

* fix doc

* fix libri script

* fix install

* clip 5 wer 7.39, clip 400 wer 7.54, 1.8 clip 400 baseline 7.49

d7e75354
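
The message above adds "grad clip by global norm" and closes with WER figures for "clip 5" and "clip 400". Assuming those numbers are global-norm clipping thresholds (the log does not say so explicitly), the technique can be sketched in plain Python as follows. The function name `clip_by_global_norm` and the toy gradients are illustrative only and are not taken from the repository.

```python
import math


def clip_by_global_norm(grads, clip_norm):
    """Rescale every gradient by one shared factor so that the L2 norm
    taken over all gradients together does not exceed clip_norm.

    grads: list of flat gradient vectors (hypothetical toy layout).
    Returns the rescaled gradients and the norm measured before clipping.
    """
    global_norm = math.sqrt(sum(g * g for grad in grads for g in grad))
    # The factor is 1.0 when the norm is already within the threshold.
    scale = clip_norm / max(global_norm, clip_norm)
    clipped = [[g * scale for g in grad] for grad in grads]
    return clipped, global_norm


# Toy gradients whose global norm is sqrt(9 + 16 + 144) = 13.
grads = [[3.0, 4.0], [12.0]]
tight, norm_before = clip_by_global_norm(grads, clip_norm=5.0)
loose, _ = clip_by_global_norm(grads, clip_norm=400.0)
```

With a threshold of 5 the toy gradients are scaled down by 5/13, while a threshold of 400 leaves them unchanged, which is why a very large threshold behaves almost like no clipping at all.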

04 Feb, 2021 1 commit
- add copyright · ba7cf078
  Committed by Hui Zhang on Feb 04, 2021
14 Jun, 2017 1 commit
- Enable min_batch_num in train.py and update train info print. · 04a225ae
  Committed by Xinghai Sun on Jun 14, 2017
12 Jun, 2017 1 commit

Refactor whole data preprocessor for DS2 (re-design classes, re-organize dir,... · cd3617ae

Committed by Xinghai Sun on Jun 12, 2017

Refactor whole data preprocessor for DS2 (re-design classes, re-organize dir, add augmentation interfaces etc.).

1. Refactor the data preprocessor with the newly added classes AudioSegment, SpeechSegment, TextFeaturizer, AudioFeaturizer and SpeechFeaturizer.
2. Add data augmentation interfaces and the classes AugmentorBase, AugmentationPipeline, VolumnPerturbAugmentor, etc.
3. Separate the normalizer's mean and std computation from training by adding FeatureNormalizer and a separate tool, compute_mean_std.py.
4. Re-organize directory.
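
Item 3 above moves mean/std estimation out of the training loop. A minimal sketch of that split is shown below, assuming features are per-utterance lists of frames. The helper names `estimate_mean_std` and `apply_normalizer` are hypothetical and do not reflect the actual FeatureNormalizer or compute_mean_std.py interfaces.

```python
import math


def estimate_mean_std(utterances):
    """Offline step: per-dimension mean and std over every frame of
    every utterance (each utterance is a list of feature frames)."""
    dim = len(utterances[0][0])
    count = 0
    total = [0.0] * dim
    total_sq = [0.0] * dim
    for frames in utterances:
        for frame in frames:
            count += 1
            for d, value in enumerate(frame):
                total[d] += value
                total_sq[d] += value * value
    mean = [s / count for s in total]
    # Population variance E[x^2] - E[x]^2, floored at zero against rounding.
    std = [math.sqrt(max(sq / count - m * m, 0.0))
           for sq, m in zip(total_sq, mean)]
    return mean, std


def apply_normalizer(frames, mean, std, eps=1e-20):
    """Training-time step: reuse the precomputed statistics."""
    return [[(v - m) / (s + eps) for v, m, s in zip(frame, mean, std)]
            for frame in frames]


# Toy data: two utterances with 2-dimensional features.
utterances = [[[1.0, 10.0], [3.0, 30.0]], [[5.0, 50.0]]]
mean, std = estimate_mean_std(utterances)
normalized = apply_normalizer(utterances[0], mean, std)
```

Computing the statistics once, ahead of time, means the training script only has to load and apply them, rather than recomputing them on every run.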


PaddlePaddle / DeepSpeech: last synced successfully about 1 year ago