PaddlePaddle / DeepSpeech
1 年多前同步成功

代码
- 文件
- 提交
- 分支
- Tags
- 贡献者
- 分支图
- Diff
Issue 245
- 列表
- 看板
- 标记
- 里程碑
合并请求 3
Wiki 0
- Wiki
分析
- 仓库
- DevOps
项目成员
Pages

中文识别时间问题

Created by: jiaozhusos

使用aishell模型进行中文识别，感觉非常慢，而且效果也不好。另外，想基于baidu_1.2k_model进行识别，但是模型里没有vocab.text，此外，相应的warmup_manifest也没有，需要怎么进行设置呢? （待识别的数据本来是一条3-5min的，但是实在太慢，就换成aishell的其中一条wav数据了，只有几秒钟，还是非常慢，等了很久也没有最终的结果出来，是我的设置有问题吗还是参数需要调整一下？） ----------- Configuration Arguments ----------- alpha: 2.5 beam_size: 500 beta: 0.3 cutoff_prob: 0.99 cutoff_top_n: 40 decoding_method: ctc_beam_search host_ip: localhost host_port: 8086 lang_model_path: models/lm/zh_giga.no_cna_cmn.prune01244.klm mean_std_path: models/aishell/mean_std.npz model_path: models/aishell/params.tar.gz num_conv_layers: 2 num_rnn_layers: 3 rnn_layer_size: 2048 share_rnn_weights: True specgram_type: linear speech_save_dir: my_data use_gpu: False use_gru: 1 vocab_path: data/aishell/vocab.txt warmup_manifest: data/aishell/manifest.test

I0506 15:04:53.478561 3892 Util.cpp:166] commandline: --use_gpu=False --trainer_count=1 [INFO 2018-05-06 15:04:53,587 layers.py:2689] output for conv_0: c = 32, h = 81, w = 54, size = 139968 [INFO 2018-05-06 15:04:53,588 layers.py:3251] output for batch_norm_0: c = 32, h = 81, w = 54, size = 139968 [INFO 2018-05-06 15:04:53,589 layers.py:7409] output for scale_sub_region_0: c = 32, h = 81, w = 54, size = 139968 [INFO 2018-05-06 15:04:53,590 layers.py:2689] output for conv_1: c = 32, h = 41, w = 54, size = 70848 [INFO 2018-05-06 15:04:53,591 layers.py:3251] output for batch_norm_1: c = 32, h = 41, w = 54, size = 70848 [INFO 2018-05-06 15:04:53,591 layers.py:7409] output for scale_sub_region_1: c = 32, h = 41, w = 54, size = 70848 [INFO 2018-05-06 15:05:00,766 model.py:243] begin to initialize the external scorer for decoding [INFO 2018-05-06 15:05:06,010 model.py:253] language model: is_character_based = 1, max_order = 5, dict_size = 0 [INFO 2018-05-06 15:05:06,010 model.py:254] end initializing scorer

Warming up ... ('Warm-up Test Case %d: %s', 0, u'/home/hwja/.cache/paddle/dataset/speech/Aishell/data_aishell/wav/test/S0913/BAC009S0913W0441.wav') Response Time: 51.677608, Transcript: 篮 ('Warm-up Test Case %d: %s', 1, u'/home/hwja/.cache/paddle/dataset/speech/Aishell/data_aishell/wav/test/S0912/BAC009S0912W0243.wav') Response Time: 46.002436, Transcript: 篮 ('Warm-up Test Case %d: %s', 2, u'/home/hwja/.cache/paddle/dataset/speech/Aishell/data_aishell/wav/test/S0902/BAC009S0902W0189.wav') Response Time: 40.113678, Transcript: 篮

ASR Server Started.