Commits · master · shareinfo2018 / kaldi

02 May, 2023 3 commits
- D
  Merge pull request #4850 from danpovey/fix_liblbfgs · 71f38e62
  Committed by Daniel Povey on May 02, 2023
```
Fix download location in install_liblbfgs.sh
```
  71f38e62
- D
  
  Merge · 5ef3962a
  Committed by Daniel Povey on May 02, 2023
  
  5ef3962a
- D
  
  Fix download location in install_liblbfgs.sh · 0d7f17f3
  Committed by Daniel Povey on May 02, 2023
  
  0d7f17f3
26 Apr, 2023 2 commits
- N
  Fix matrix data offset for large matrices (#4823) · 19185083
  Committed by Nickolay V. Shmyrev on Apr 26, 2023
```
* Fix matrix data offset for large matrices

* Fix overflow in cudamatrix too
```
  19185083
- Y
  More fixes of unwanted ADL usage of std algos (#4828) · 9a8588ac
  Committed by Yuriy Chernyshov on Apr 26, 2023
```
This continues the work started in #4809.
```
  9a8588ac
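As a rough illustration of the kind of problem the large-matrix fix above (#4823) addresses: assuming a row offset was computed as a 32-bit product of row index and stride (an assumption; the commit message does not show the code), the product can leave the signed 32-bit range for large matrices. The Python sketch below only emulates that arithmetic. The helper names `wrap_int32`, `offset_int32`, and `offset_wide` are hypothetical and are not Kaldi functions.

```python
def wrap_int32(value: int) -> int:
    """Reduce an integer to the signed 32-bit range (two's-complement wraparound)."""
    value &= 0xFFFFFFFF
    return value - 0x100000000 if value >= 0x80000000 else value


def offset_int32(row: int, stride: int) -> int:
    """Emulate computing row * stride in a 32-bit signed integer."""
    return wrap_int32(row * stride)


def offset_wide(row: int, stride: int) -> int:
    """Compute the same offset in a wider type, which cannot wrap at this size."""
    return row * stride


# A 50,000 x 50,000 matrix: the last rows sit past the 2**31 - 1 element mark.
row, stride = 50_000, 50_000
narrow = offset_int32(row, stride)  # wrapped, negative value
wide = offset_wide(row, stride)     # the intended offset
```

In C++, signed overflow is undefined behaviour rather than guaranteed wraparound, so the emulation shows only one typical outcome. The usual remedy is to widen the operands before multiplying.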
18 Apr, 2023 1 commit
- N
  No need for atomicAdd for float2, conflicts with CUDA 12.1 (#4838) · ab8fa9e4
  Committed by Nickolay V. Shmyrev on Apr 17, 2023
```
function is not used anyway
```
  ab8fa9e4
20 Feb, 2023 1 commit
- Y
  
  Fix for issue#4801 (#4826) · 59299d1c
  Committed by Yifan Yang on Feb 20, 2023
  
  59299d1c
18 Feb, 2023 4 commits
- D
  Merge pull request #4825 from georgthegreat/patch-2 · 362ecc2b
  Committed by Daniel Povey on Feb 18, 2023
```
Fix -Wdeprecated-copy from c++11
```
  362ecc2b
- Y
  
  Fix -Wdeprecated-copy from c++11 · ed910d60
  Committed by Yuriy Chernyshov on Feb 18, 2023
  
  ed910d60
- D
  Merge pull request #4810 from daanzu/pr/srilm · ab6c168a
  Committed by Daniel Povey on Feb 18, 2023
```
SRILM: allow bypassing download/extraction during automated installation
```
  ab6c168a
- D
  Merge pull request #4809 from georgthegreat/patch-1 · 561d8c28
  Committed by Daniel Povey on Feb 18, 2023
```
Do not use ADL to invoke std::binary_search
```
  561d8c28
04 Feb, 2023 1 commit
- K
  egs/ami: Fix BUT path to wavs in AMI scripts, add beamformer config (#4820) · e4eb4f67
  Committed by Karel Vesely on Feb 03, 2023
```
- the audio data no longer exist in that path
- the beamformer config was missing in 'ami/s5b', it's taken from 'ami/s5'
```
  e4eb4f67
07 Jan, 2023 2 commits

Committed by Daniel Galvez on Dec 13, 2022

This is to fix a CI error.

It appears that this is from using "ubuntu-latest" in the CI
workflow. It got upgraded to ubuntu 22.04 automatically, and this
doesn't have python2.7 by default.

ae8cbe88

D

Add support for CUDA 12 and Hopper. · 0785b665
Committed by Daniel Galvez on Jan 05, 2023

0785b665

26 Dec, 2022 1 commit

Fix variable name (#4815) · aa17817f

Committed by Tanmay Jain on Dec 26, 2022

Fix the misspelled "glossaries_opt" variable name at line 39. Because of the misspelling, words in the glossaries weren't reserved while creating BPE.

aa17817f

13 Dec, 2022 1 commit

[src] Make word alignment optional (#4802) · be22248e

Committed by Daniel Galvez on Dec 13, 2022

* Remove unused variable.

* cudadecoder: Make word alignment optional.

For CTC models using word pieces or graphemes, there is not enough
positional information to use the word alignment.

I tried marking every unit as "singleton" word_boundary.txt, but this
explodes the state space very, very often. See:

https://github.com/nvidia-riva/riva-asrlib-decoder/issues/3

With the "_" character in CTC models predicting word pieces, we at the
very least know which word pieces begin a word and which ones are
either in the middle of the word or the end of a word, but the
algorithm would still need to be rewritten, especially since "blank"
is not a silence phoneme (it can appear between).

I did look into using the lexicon-based word alignment. I don't have a
specific complaint about it, but I did get a weird error where it
couldn't create a final state at all in the output lattice, which
caused Connect() to output an empty lattice. This may be because I
wasn't quite sure how to handle the blank token. I treat it as its own
phoneme, because of limitations in TransitionInformation, but this
doesn't really make any sense.

Needless to say, while the CTM outputs of the cuda decoder will be
correct from a WER point of view, their time stamps won't be correct,
but they probably never were in the first place, for CTC models.

be22248e
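The commit message above notes that a word-start marker at least tells us which word pieces begin a word. A minimal sketch of that observation follows, assuming the marker is a leading "_" on the piece (the message names the character but not its position). The function name `group_word_pieces` is hypothetical, and this is not the cudadecoder's algorithm.

```python
from typing import List


def group_word_pieces(pieces: List[str], marker: str = "_") -> List[str]:
    """Join word pieces into words, starting a new word at each marked piece."""
    words: List[str] = []
    for piece in pieces:
        if piece.startswith(marker):
            # A marked piece opens a new word; drop the marker itself.
            words.append(piece[len(marker):])
        elif not words:
            # An unmarked first piece still has to start some word.
            words.append(piece)
        else:
            # Unmarked pieces continue (or end) the current word.
            words[-1] += piece
    return words


example = group_word_pieces(["_he", "llo", "_wor", "ld"])
```

This recovers word boundaries in the token sequence only. It carries no timing information, which is consistent with the caveat above that CTM time stamps will not be correct for CTC models.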

11 Dec, 2022 1 commit
- D
  
  SRILM: allow bypassing download/extraction during automated installation · a023f3fe
  Committed by daanzu on Dec 11, 2022
  
  a023f3fe
07 Dec, 2022 1 commit
- Y
  
  Do not use ADL to invoke std::binary_search · 99101e8d
  Committed by Yuriy Chernyshov on Dec 06, 2022
  
  99101e8d
26 Sep, 2022 1 commit

Update run_blstm.sh (#4790) · f6f4ccaf

Committed by xu-gaopeng on Sep 26, 2022

* Update run_blstm.sh

fix bug in aspire run_blstm.sh

* Update egs/aspire/s5/local/nnet3/run_blstm.sh
Co-authored-by: Cy 'kkm' Katsnelson <kkm@pobox.com>

f6f4ccaf

20 Sep, 2022 2 commits
- K
  Provide provision to pass subword separator as argument to... · 716f558f
  Committed by Kavya Manohar on Sep 20, 2022
```
Provide provision to pass subword separator as argument to make_position_dependent_subword_lexicon.py (#4794)
```
  716f558f
- N
  change [utter] to [utterance.] in data_prep.dox (#4795) · 299dadcf
  Committed by npovey on Sep 19, 2022
```
Co-authored-by: npovey <you@example.com>
```
  299dadcf
14 Sep, 2022 1 commit
- B
  
  [egs] Kill feral dupes of --num-threads in few local eg scripts (#4792) · 727e4548
  Committed by bhuang on Sep 14, 2022
  
  727e4548
01 Sep, 2022 2 commits
- T
  
  [scripts] Move otherwise misleading comment in mkgraph.sh (#4787) · 7bc53ef8
  Committed by Taku Takamatsu on Aug 31, 2022
  
  7bc53ef8
- J
  [scripts] Copy utt2lang in copy_data_dir.sh (#4789) · a754c189
  Committed by Jonghwan Hyeon on Sep 01, 2022
```
Coauthored-By: Jonghwan Hyeon <hyeon0145@gmail.com>
```
  a754c189
28 Aug, 2022 1 commit
- A
  Fix: ali-to-post piping in post-to-tacc example (#4788) · 0fb502d1
  Committed by Agrover112 on Aug 28, 2022
```
The post-to-tacc example fails as written, but with the corrected `ark:- |` there is no piping error
```
  0fb502d1
18 Aug, 2022 2 commits
- J
  
  [infra] add cpu-only docker build (#4783) · 4592c547
  Committed by Jan "yenda" Trmal on Aug 18, 2022
  
  4592c547
- J
  [infra] docker images automatically using gh (#4782) · 3dd259bc
  Committed by Jan "yenda" Trmal on Aug 18, 2022
```
* [infra] docker images automatically using gh
* minor change
```
  3dd259bc
17 Aug, 2022 4 commits

J

remove the c++17 removed function random_shuffle · 09a63063
Committed by Jan "yenda" Trmal on Aug 12, 2022

09a63063
J

remove obsoleted and C++17 incompatible unary_function<>, resolves #4732 · 85c120a4
Committed by Jan "yenda" Trmal on Aug 12, 2022

85c120a4

[egs] Set PYTHONUNBUFFERED=1 in all recipes (#4770) · ae2efea0

Committed by Sourya Kakarla on Aug 17, 2022

* Set PYTHONUNBUFFERED=TRUE for wsj,tedlium

To force stdout, stderr to be unbuffered for python scripts.
Without this setting parts of the stream might be lost in a few cases.
Relevant mainly for python versions <3.7.

* Set PYTHONUNBUFFERED=TRUE for all recipes

Add to path.sh in all except the already updated wsj, tedlium.
To force stdout, stderr to be unbuffered for python scripts.
Relevant mainly for python versions <3.7.

* Set PYTHONUNBUFFERED=TRUE in remaining path.sh

These files were missed by mistake in earlier commits.

* Use 1 instead of TRUE for PYTHONUNBUFFERED

ae2efea0
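To see what the setting above does, one can check that a Python interpreter started with `PYTHONUNBUFFERED=1` in its environment reports unbuffered standard streams. The sketch below is only an illustration and is not part of the recipes. The helper name `child_unbuffered_flag` is hypothetical; it uses the standard `subprocess` module and `sys.flags.unbuffered`.

```python
import os
import subprocess
import sys
from typing import Optional


def child_unbuffered_flag(value: Optional[str] = None) -> int:
    """Start a child interpreter and return its sys.flags.unbuffered value.

    If `value` is given, it is exported as PYTHONUNBUFFERED for the child;
    otherwise the variable is removed from the child's environment.
    """
    env = dict(os.environ)
    env.pop("PYTHONUNBUFFERED", None)
    if value is not None:
        env["PYTHONUNBUFFERED"] = value
    result = subprocess.run(
        [sys.executable, "-c", "import sys; print(int(sys.flags.unbuffered))"],
        env=env,
        capture_output=True,
        text=True,
        check=True,
    )
    return int(result.stdout.strip())
```

Calling `child_unbuffered_flag("1")` mirrors what a recipe's path.sh does when it exports the variable before launching Python scripts.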

M

Fix typo in gen_cmake_skeleton (#4779) · 6c86e03e
Committed by Michael McAuliffe on Aug 16, 2022

6c86e03e

16 Aug, 2022 3 commits
- J
  
  use last segment of the CXX as the ccbin compiler for cuda (#4778) · b8f30acc
  Committed by Jan "yenda" Trmal on Aug 16, 2022
  
  b8f30acc
- J
  Windows conda fixes (#4777) · e786efa3
  Committed by Jan "yenda" Trmal on Aug 16, 2022
```
* Fix missing cblas and lapack external symbols for netlib

* Remaining Conda-forge changes

* Fix indent

* usleep using C++11 constructs

* make codefactor happy
Co-authored-by: Michael McAuliffe <michael.e.mcauliffe@gmail.com>
```
  e786efa3
- J
  Replace TravisCI with github actions (#4776) · dd97e1ba
  Committed by Jan "yenda" Trmal on Aug 16, 2022
```
* Github action for kaldi
* add ccache
* remove spaces
```
  dd97e1ba
12 Aug, 2022 2 commits

Kaldi recipe for SPGISpeech (#4772) · 0cf557ed

Committed by Jan "yenda" Trmal on Aug 12, 2022

* Kaldi recipe for SPGISpeech

* adding readme to spgispeech

* fixing some style issues

* more syntax checks cleared

* more style fixes

* yaml format fix

* fix issues reported by @desh2608

* remove the conda calls

* remove more shellcheck errors

* fix one more grumpy shellcheck

0cf557ed

J
fix portaudio install script, closes #4755 (#4773) · 7087c4fc
Committed by Jan "yenda" Trmal on Aug 12, 2022
```
fixed portaudio archive location
fixed case when the directory portaudio exists
fixed whitespace formatting
```
7087c4fc

20 Jul, 2022 1 commit
- C
  
  [scripts] egs/chime6/s5c_track2/local/ts-vad/diarize_TS-VAD_itX.sh: use python3 (#4766) · 9af2c5c1
  Committed by Christoph Boeddeker on Jul 20, 2022
  
  9af2c5c1
15 Jul, 2022 2 commits
- C
  
  [egs] chime6/s5c_track2: example code for pretrained model (#4763) · 05f66603
  Committed by Christoph Boeddeker on Jul 15, 2022
  
  05f66603
- C
  
  [egs] chime6: missing --cmd argument (#4764) · 503a622a
  Committed by Christoph Boeddeker on Jul 15, 2022
  
  503a622a
25 Jun, 2022 1 commit
- Z
  [src] Fix Windows build - Add cctype import to kaldi-utils.cc, remove mkl_intel_c.lib (#4761) · 3dd90feb
  Committed by Zohan on Jun 24, 2022
```
* Fix build for Windows

* Remove double cctype
```
  3dd90feb

shareinfo2018 / kaldi is in sync with the fork's source project