您好,欢迎来到星星旅游。
搜索
您的当前位置:首页Kaldi 提取MFCC40维的参数设置

Kaldi 提取MFCC40维的参数设置

来源:星星旅游

1. 数据准备:wave文件,,获取wav.scp,spk2utt,utt2spk三个文件

find /*/16kwav -name '*.wav' | awk -F '/' '{print $NF " " $0}' > ./data/wav.scp

find /*/16kwav -name '*.wav' | awk -F '/' '{print $NF " " $NF}' > ./data/spk2utt

find /*/16kwav -name '*.wav' | awk -F '/' '{print $NF " " $NF}' > ./data/utt2spk

2. 特征提取

首先需要更改conf/mfcc.conf文件参数,更改如下:

# config for high-resolution MFCC features, intended for neural network training.

# Note: we keep all cepstra, so it has the same info as filterbank features,

# but MFCC is more easily compressible (because less correlated) which is why

# we prefer this method.

--use-energy=false       # use average of log energy, not energy.

--sample-frequency=16000 # AISHELL-2 is sampled at 16kHz

--num-mel-bins=40        # similar to Google's setup.

--num-ceps=40            # there is no dimensionality reduction.

--low-freq=20            # low cutoff frequency for mel bins

--high-freq=-400         # high cutoff frequency, relative to Nyquist of 8000 (=7600)

接下来运行如下命令:

utils/fix_data_dir.sh       /*/data
./steps/make_mfcc.sh  /*/data  ./ts_log /*/data/mfcc

 

因篇幅问题不能全部显示,请点此查看更多更全内容

Copyright © 2019- stra.cn 版权所有 赣ICP备2024042791号-4

违法及侵权请联系:TEL:199 1889 7713 E-MAIL:2724546146@qq.com

本站由北京市万商天勤律师事务所王兴未律师提供法律服务