Resolution of overlapping ambiguity strings based on maximum entropy model

ZHANG Feng, FAN Xiao-zhong

PDF(321 KB)
PDF(321 KB)
Front. Electr. Electron. Eng. ›› 2006, Vol. 1 ›› Issue (3) : 273-276. DOI: 10.1007/s11460-006-0037-9

Resolution of overlapping ambiguity strings based on maximum entropy model

  • ZHANG Feng, FAN Xiao-zhong
Author information +
History +

Abstract

The resolution of overlapping ambiguity strings (OAS) is studied based on the maximum entropy model. There are two model outputs, where either the first two characters form a word or the last two characters form a word. The features of the model include one word in context of OAS, the current OAS and word probability relation of two kinds of segmentation results. OAS in training text is found by the combination of the FMM and BMM segmentation method. After feature tagging they are used to train the maximum entropy model. The People Daily corpus of January 1998 is used in training and testing. Experimental results show a closed test precision of 98.64 % and an open test precision of 95.01 %. The open test precision is 3.76 % better compared with that of the precision of common word probability method.

Cite this article

Download citation ▾
ZHANG Feng, FAN Xiao-zhong. Resolution of overlapping ambiguity strings based on maximum entropy model. Front. Electr. Electron. Eng., 2006, 1(3): 273‒276 https://doi.org/10.1007/s11460-006-0037-9
PDF(321 KB)

Accesses

Citations

Detail

Sections
Recommended

/