hclg.fst_cn是什么文件可以删除吗_教程

可以删除的。

HCLG.fst由四部分构成

1.G：语言游段模型WFST，输拆磨掘入输出符号相同，实际是一个WFSA（acceptor接受机），为了方便与其它三个WFST进行 *** 作，将其视为一个输入输出相同的WFST。

2.L：发音词典WFST，输入符号：monophone，输出符号：词

3.C:上下文相关WFST，输入符号：triphone（上下文相关），输出符号：monophnoe

4.H:HMM声学旅核模型WFST，输入符号：HMMtransitions-ids，输出符号：triphone。

L.fst: The Phonetic Dictionary FST

L maps monophone sequences to words.

The file L.fst is the Finite State Transducer form of the lexicon with phone symbols on the input and word symbols on the output.

L_disambig.fst: The Phonetic Dictionary with Disambiguation Symbols FST

A lexicon with disambiguation symbols

G.fst:The Language Model FST

FSA grammar (can be built from an n-gram grammar).

C.fst:The Context FST

C maps triphone sequences to monophones.

Expands the phones into context-dependent phones.

H.fst:The HMM FST

H maps multiple HMM states (a.k.a. transition-ids in Kaldi-speak) to context-dependent triphones.

Expands out the HMMs. On the right are the context-dependent phones and on the left are the pdf-ids.

HCLG.fst: final graph

总结一下：

构图过程 G ->L ->C ->H

G: 作为 acceptor (输入吵告誉 symbol 与输出相同)，用于对grammar 或者 language model进行编码

L:Lexicon, 其输出 symbol 是 words, 输入 symbol 是 phones

C:context-dependency其输出 symbol 是 phones, 其输入 symbol 为表升段示context-dependencyphones

H: 包括HMM definitions,其输出 symbol 为context-dependencyphones, 其输入 symbol 为transitions-ids(即对 pdf-id 和其它信息编码后的友敏 id)

欢迎分享，转载请注明来源：内存溢出

hclg.fst_cn是什么文件可以删除吗