1. 14 6月, 2022 1 次提交
  2. 22 4月, 2022 1 次提交
  3. 02 3月, 2022 1 次提交
    • K
      refactor · c52f0f80
      KP 提交于
      c52f0f80
  4. 25 2月, 2022 2 次提交
  5. 17 9月, 2021 1 次提交
    • H
      Kaldi (#839) · 16e60160
      Hui Zhang 提交于
      * can do frames, real stft
      
      * format
      
      * stft complex, powspec, magspec
      
      * add common utils
      
      * add window process func
      
      * using frames and matmul as stft
      
      * read with 2d; window process
      
      * test with dither, remove dc offset, preermphs
      
      * add doc string
      
      * more frontend utils
      
      * add logspec
      
      * fix typing
      
      * add delpoy mergify label
      16e60160
  6. 12 5月, 2021 1 次提交
    • H
      E2E/Streaming Transformer/Conformer ASR (#578) · 71e046b0
      Hui Zhang 提交于
      * add cmvn and label smoothing loss layer
      
      * add layer for transformer
      
      * add glu and conformer conv
      
      * add torch compatiable hack, mask funcs
      
      * not hack size since it exists
      
      * add test; attention
      
      * add attention, common utils, hack paddle
      
      * add audio utils
      
      * conformer batch padding mask bug fix #223
      
      * fix typo, python infer fix rnn mem opt name error and batchnorm1d, will be available at 2.0.2
      
      * fix ci
      
      * fix ci
      
      * add encoder
      
      * refactor egs
      
      * add decoder
      
      * refactor ctc, add ctc align, refactor ckpt, add warmup lr scheduler, cmvn utils
      
      * refactor docs
      
      * add fix
      
      * fix readme
      
      * fix bugs, refactor collator, add pad_sequence, fix ckpt bugs
      
      * fix docstring
      
      * refactor data feed order
      
      * add u2 model
      
      * refactor cmvn, test
      
      * add utils
      
      * add u2 config
      
      * fix bugs
      
      * fix bugs
      
      * fix autograd maybe has problem when using inplace operation
      
      * refactor data, build vocab; add format data
      
      * fix text featurizer
      
      * refactor build vocab
      
      * add fbank, refactor feature of speech
      
      * refactor audio feat
      
      * refactor data preprare
      
      * refactor data
      
      * model init from config
      
      * add u2 bins
      
      * flake8
      
      * can train
      
      * fix bugs, add coverage, add scripts
      
      * test can run
      
      * fix data
      
      * speed perturb with sox
      
      * add spec aug
      
      * fix for train
      
      * fix train logitc
      
      * fix logger
      
      * log valid loss, time dataset process
      
      * using np for speed perturb, remove some debug log of grad clip
      
      * fix logger
      
      * fix build vocab
      
      * fix logger name
      
      * using module logger as default
      
      * fix
      
      * fix install
      
      * reorder imports
      
      * fix board logger
      
      * fix logger
      
      * kaldi fbank and mfcc
      
      * fix cmvn and print prarams
      
      * fix add_eos_sos and cmvn
      
      * fix cmvn compute
      
      * fix logger and cmvn
      
      * fix subsampling, label smoothing loss, remove useless
      
      * add notebook test
      
      * fix log
      
      * fix tb logger
      
      * multi gpu valid
      
      * fix log
      
      * fix log
      
      * fix config
      
      * fix compute cmvn, need paddle 2.1
      
      * add cmvn notebook
      
      * fix layer tools
      
      * fix compute cmvn
      
      * add rtf
      
      * fix decoding
      
      * fix layer tools
      
      * fix log, add avg script
      
      * more avg and test info
      
      * fix dataset pickle problem; using 2.1 paddle; num_workers can > 0; ckpt save in exp dir;fix setup.sh;
      
      * add vimrc
      
      * refactor tiny script, add transformer and stream conf
      
      * spm demo; librisppech scripts and confs
      
      * fix log
      
      * add librispeech scripts
      
      * refactor data pipe; fix conf; fix u2 default params
      
      * fix bugs
      
      * refactor aishell scripts
      
      * fix test
      
      * fix cmvn
      
      * fix s0 scripts
      
      * fix ds2 scripts and bugs
      
      * fix dev & test dataset filter
      
      * fix dataset filter
      
      * filter dev
      
      * fix ckpt path
      
      * filter test, since librispeech will cause OOM, but all test wer will be worse, since mismatch train with test
      
      * add comment
      
      * add syllable doc
      
      * fix ds2 configs
      
      * add doc
      
      * add pypinyin tools
      
      * fix decoder using blank_id=0
      
      * mmseg with pybind11
      
      * format code
      71e046b0