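This project is split into three staged branches: beginner, intermediate, and final. This is the final branch, which is the actively maintained version. PPASR (PaddlePaddle Automatic Speech Recognition; Chinese name: PaddlePaddle中文语音识别) is a speech recognition framework built on PaddlePaddle. PPASR aims to be a simple, practical speech recognition project. It can be deployed on servers and Nvidia Jetson devices, and support for Android and other mobile devices is planned.
Environment used for this project: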
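Dataset | Model | Test-set character error rate | Download |
---|---|---|---|
aishell (179 hours) | deepspeech2 | 0.077042 | Click to download |
free_st_chinese_mandarin_corpus (109 hours) | deepspeech2 | 0.137442 | Click to download |
thchs_30 (34 hours) | deepspeech2 | 0.062654 | Click to download |
Very large dataset (1,600+ hours of real data) + (1,300+ hours of synthetic data) | deepspeech2 | 0.056835 | Click to download |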
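Notes:
The character error rates above were computed by the eval.py program using the beam search decoding method ctc_beam_search.
The downloads come with mean_std.npz and vocabulary.txt; copy all of the extracted files into the project root directory.
If you run into problems, feel free to open an issue to discuss.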
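For readers unfamiliar with the metric in the table, the following is a minimal sketch of how a character error rate is commonly computed: the Levenshtein (edit) distance between the reference and the recognized text, divided by the reference length. The function names `edit_distance` and `cer` are hypothetical and chosen for illustration; this is not taken from eval.py, whose exact implementation may differ.

```python
def edit_distance(ref: str, hyp: str) -> int:
    """Levenshtein distance between two strings, counted in characters."""
    # prev[j] holds the distance between the processed prefix of ref and hyp[:j].
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]  # turning ref[:i] into an empty string costs i deletions
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def cer(ref: str, hyp: str) -> float:
    """Character error rate: edit distance divided by reference length."""
    if not ref:
        raise ValueError("reference text must not be empty")
    return edit_distance(ref, hyp) / len(ref)


if __name__ == "__main__":
    # One substituted character out of six.
    print(cer("今天天气很好", "今天天汽很好"))
```

Read this way, a value such as 0.077042 in the table means roughly 7.7 character errors per 100 reference characters.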
python infer_path.py --wav_path=./dataset/test.wav
Output (the last line reports the elapsed time, the recognized text, and a score):
----------- Configuration Arguments -----------
alpha: 1.2
beam_size: 10
beta: 0.35
cutoff_prob: 1.0
cutoff_top_n: 40
decoding_method: ctc_greedy
enable_mkldnn: False
is_long_audio: False
lang_model_path: ./lm/zh_giga.no_cna_cmn.prune01244.klm
mean_std_path: ./dataset/mean_std.npz
model_dir: ./models/infer/
to_an: True
use_gpu: True
use_tensorrt: False
vocab_path: ./dataset/zh_vocab.txt
wav_path: ./dataset/test.wav
------------------------------------------------
消耗时间:132, 识别结果: 近几年不但我用书给女儿儿压岁也劝说亲朋不要给女儿压岁钱而改送压岁书, 得分: 94
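The configuration above uses decoding_method: ctc_greedy. As a rough illustration of what greedy CTC decoding generally does, the sketch below takes the most likely label in each frame, collapses consecutive repeats, and drops the blank label. The function name `ctc_greedy_decode`, the toy vocabulary, and the choice of index 0 as the blank are assumptions made for this example; PPASR's own decoder and blank index may differ.

```python
from itertools import groupby


def ctc_greedy_decode(probs, vocab, blank=0):
    """Greedy (best-path) CTC decoding.

    probs: list of frames, each a list of per-label probabilities.
    vocab: list mapping label index to character (hypothetical layout).
    blank: index of the CTC blank label (assumed to be 0 here).
    """
    # 1. Pick the most likely label index in every frame.
    best = [max(range(len(frame)), key=frame.__getitem__) for frame in probs]
    # 2. Collapse runs of identical consecutive labels into one.
    collapsed = [idx for idx, _ in groupby(best)]
    # 3. Remove blanks and map the remaining indices to characters.
    return "".join(vocab[idx] for idx in collapsed if idx != blank)


if __name__ == "__main__":
    vocab = ["<blank>", "你", "好"]  # toy vocabulary for illustration only
    probs = [
        [0.1, 0.8, 0.1],  # 你
        [0.2, 0.7, 0.1],  # 你 (repeat, collapsed)
        [0.9, 0.05, 0.05],  # blank
        [0.1, 0.1, 0.8],  # 好
        [0.1, 0.1, 0.8],  # 好 (repeat, collapsed)
    ]
    print(ctc_greedy_decode(probs, vocab))
```

Unlike ctc_beam_search, this approach keeps only a single best path and does not consult a language model, which is why it is usually faster but can be less accurate.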
python infer_path.py --wav_path=./dataset/test_vad.wav --is_long_audio=True
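The file name test_vad.wav hints at voice activity detection, so one plausible reading of the long-audio mode is that the recording is cut at silent stretches and each piece is recognized separately. The source does not describe the mechanism, so the sketch below is only an assumption about the general idea, using a simple energy threshold. The function name `split_on_silence` and all of its parameters are hypothetical and are not part of PPASR.

```python
def split_on_silence(samples, frame_len, threshold, min_silence_frames):
    """Return (start, end) sample-index pairs of voiced regions.

    samples: sequence of audio sample values.
    frame_len: number of samples per analysis frame.
    threshold: mean-square energy at or above which a frame counts as voiced.
    min_silence_frames: consecutive silent frames needed to close a segment.
    A trailing partial frame is ignored.
    """
    segments = []
    start = None      # start sample of the segment currently being built
    silent_run = 0    # silent frames seen since the last voiced frame
    n_frames = len(samples) // frame_len
    for f in range(n_frames):
        frame = samples[f * frame_len:(f + 1) * frame_len]
        energy = sum(x * x for x in frame) / frame_len
        if energy >= threshold:
            if start is None:
                start = f * frame_len
            silent_run = 0
        elif start is not None:
            silent_run += 1
            if silent_run >= min_silence_frames:
                # Close the segment at the end of the last voiced frame.
                segments.append((start, (f - silent_run + 1) * frame_len))
                start = None
                silent_run = 0
    if start is not None:
        # Audio ended while a segment was still open.
        segments.append((start, (n_frames - silent_run) * frame_len))
    return segments


if __name__ == "__main__":
    audio = [0] * 4 + [1] * 4 + [0] * 8 + [1] * 4 + [0] * 2
    print(split_on_silence(audio, frame_len=2, threshold=0.5,
                           min_silence_frames=2))
```

Each returned pair could then be sliced out of the recording and passed to the recognizer on its own, with the partial results joined in order.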