1. 02 May, 2023 3 commits
  2. 26 Apr, 2023 2 commits
  3. 18 Apr, 2023 1 commit
  4. 20 Feb, 2023 1 commit
  5. 18 Feb, 2023 4 commits
  6. 04 Feb, 2023 1 commit
  7. 07 Jan, 2023 2 commits
  8. 26 Dec, 2022 1 commit
    • Fix variable name (#4815) · aa17817f
      Committed by Tanmay Jain
      Fix the misspelled "glossaries_opt" variable name on line 39. Because of the typo, words in the glossaries weren't reserved while creating BPE.
  9. 13 Dec, 2022 1 commit
    • [src] Make word alignment optional (#4802) · be22248e
      Committed by Daniel Galvez
      * Remove unused variable.
      
      * cudadecoder: Make word alignment optional.
      
      For CTC models using word pieces or graphemes, there is not enough
      positional information to use the word alignment.
      
      I tried marking every unit as "singleton" in word_boundary.txt, but this
      explodes the state space very, very often. See:
      
      https://github.com/nvidia-riva/riva-asrlib-decoder/issues/3
      
      With the "_" character in CTC models predicting word pieces, we at the
      very least know which word pieces begin a word and which ones are
      either in the middle of the word or the end of a word, but the
      algorithm would still need to be rewritten, especially since "blank"
      is not a silence phoneme (it can appear between).
      
      I did look into using the lexicon-based word alignment. I don't have a
      specific complaint about it, but I did get a weird error where it
      couldn't create a final state at all in the output lattice, which
      caused Connect() to output an empty lattice. This may be because I
      wasn't quite sure how to handle the blank token. I treat it as its own
      phoneme, because of limitations in TransitionInformation, but this
      doesn't really make any sense.
      
      Needless to say, while the CTM outputs of the cuda decoder will be
      correct from a WER point of view, their time stamps won't be correct,
      but they probably never were in the first place, for CTC models.
  10. 11 Dec, 2022 1 commit
  11. 07 Dec, 2022 1 commit
  12. 26 Sep, 2022 1 commit
  13. 20 Sep, 2022 2 commits
  14. 14 Sep, 2022 1 commit
  15. 01 Sep, 2022 2 commits
  16. 28 Aug, 2022 1 commit
  17. 18 Aug, 2022 2 commits
  18. 17 Aug, 2022 4 commits
  19. 16 Aug, 2022 3 commits
  20. 12 Aug, 2022 2 commits
    • Kaldi recipe for SPGISpeech (#4772) · 0cf557ed
      Committed by Jan "yenda" Trmal
      * Kaldi recipe for SPGISpeech
      
      * adding readme to spgispeech
      
      * fixing some style issues
      
      * more syntax checks cleared
      
      * more style fixes
      
      * yaml format fix
      
      * fix issues reported by @desh2608
      
      * remove the conda calls
      
      * remove more shellcheck errors
      
      * fix one more grumpy shellcheck
    • fix portaudio install script, closes #4755 (#4773) · 7087c4fc
      Committed by Jan "yenda" Trmal
      fixed portaudio archive location
      fixed case when the directory portaudio exists
      fixed whitespace formatting
  21. 20 Jul, 2022 1 commit
  22. 15 Jul, 2022 2 commits
  23. 25 Jun, 2022 1 commit