HCLG.fst由四部分构成
1.G:语言游段模型WFST,输拆磨掘入输出符号相同,实际是一个WFSA(acceptor接受机),为了方便与其它三个WFST进行 *** 作,将其视为一个输入输出相同的WFST。
2.L:发音词典WFST,输入符号:monophone,输出符号:词
3.C:上下文相关WFST,输入符号:triphone(上下文相关),输出符号:monophnoe
4.H:HMM声学旅核模型WFST,输入符号:HMMtransitions-ids,输出符号:triphone。
L.fst: The Phonetic Dictionary FST
L maps monophone sequences to words.
The file L.fst is the Finite State Transducer form of the lexicon with phone symbols on the input and word symbols on the output.
L_disambig.fst: The Phonetic Dictionary with Disambiguation Symbols FST
A lexicon with disambiguation symbols
G.fst:The Language Model FST
FSA grammar (can be built from an n-gram grammar).
C.fst:The Context FST
C maps triphone sequences to monophones.
Expands the phones into context-dependent phones.
H.fst:The HMM FST
H maps multiple HMM states (a.k.a. transition-ids in Kaldi-speak) to context-dependent triphones.
Expands out the HMMs. On the right are the context-dependent phones and on the left are the pdf-ids.
HCLG.fst: final graph
总结一下:
构图过程 G ->L ->C ->H
G: 作为 acceptor (输入吵告誉 symbol 与输出相同),用于对grammar 或者 language model进行编码
L:Lexicon, 其输出 symbol 是 words, 输入 symbol 是 phones
C:context-dependency其输出 symbol 是 phones, 其输入 symbol 为表升段示context-dependencyphones
H: 包括HMM definitions,其输出 symbol 为context-dependencyphones, 其输入 symbol 为transitions-ids(即 对 pdf-id 和 其它信息编码后的友敏 id)
欢迎分享,转载请注明来源:内存溢出
评论列表(0条)