张丽青, 寿永熙, 马志强. 最大熵算法在汉语拼音标注中的研究与实现[J]. 微电子学与计算机, 2012, 29(8): 120-122,126.
ZHANG Li-qing, SHOU Yong-xi, MA Zhi-qiang. The Research and Implementation of Maximum Entropy Algorithm in Phonetic Annotation[J]. Microelectronics & Computer, 2012, 29(8): 120-122,126.

最大熵算法在汉语拼音标注中的研究与实现

The Research and Implementation of Maximum Entropy Algorithm in Phonetic Annotation

  • Abstract: Based on a study of the maximum entropy model, this paper identifies a maximum entropy algorithm suited to Chinese pinyin annotation. The algorithm resolves the case in which a polyphonic character forms a word on its own, so that every remaining word containing a polyphonic character is a two-character or multi-character word. In an annotation experiment on an article randomly selected from Reader's Digest (读者文摘), the pinyin annotation accuracy reached 96.6% or higher.
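The abstract does not give the paper's feature templates or training procedure, so the following is only a minimal sketch of how a conditional maximum entropy model could pick the reading of a polyphonic character from its context. Everything in it is an assumption made for illustration: the toy training set for 行 (háng / xíng), the previous/next-character features, the `hang2`/`xing2` label names, and the use of plain gradient ascent in place of whatever estimation method the authors used.

```python
import math
from collections import defaultdict

# Hypothetical toy data (not from the paper): the (previous char, next char)
# context of the polyphonic character 行 and its reading.
# "^" and "$" mark a word boundary.
TRAIN = [
    (("银", "$"), "hang2"),  # 银行
    (("^", "业"), "hang2"),  # 行业
    (("^", "列"), "hang2"),  # 行列
    (("^", "走"), "xing2"),  # 行走
    (("旅", "$"), "xing2"),  # 旅行
    (("^", "动"), "xing2"),  # 行动
]
LABELS = ["hang2", "xing2"]


def features(context, label):
    """Binary features pairing each context clue with a candidate reading."""
    prev_ch, next_ch = context
    return [("prev", prev_ch, label), ("next", next_ch, label), ("bias", label)]


def probs(weights, context):
    """p(label | context) = exp(sum of active weights) / normaliser."""
    scores = [sum(weights[f] for f in features(context, y)) for y in LABELS]
    top = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def train(data, epochs=200, lr=0.5):
    """Gradient ascent on the conditional log-likelihood.

    The gradient for each feature is its observed count minus its
    expected count under the current model.
    """
    weights = defaultdict(float)
    for _ in range(epochs):
        grad = defaultdict(float)
        for context, gold in data:
            p = probs(weights, context)
            for y, p_y in zip(LABELS, p):
                for f in features(context, y):
                    grad[f] += (1.0 if y == gold else 0.0) - p_y
        for f, g in grad.items():
            weights[f] += lr * g / len(data)
    return weights


def predict(weights, context):
    """Return the most probable reading for the given context."""
    p = probs(weights, context)
    return LABELS[p.index(max(p))]


WEIGHTS = train(TRAIN)
print(predict(WEIGHTS, ("银", "$")))  # context of 行 in 银行
print(predict(WEIGHTS, ("^", "走")))  # context of 行 in 行走
```

In this sketch a single-character word would supply only boundary markers as features, which is one way to see why the abstract treats that case separately from multi-character words.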

     
