A constraint-based topic modeling approach for name disambiguation

Feng WANG,Jie TANG,Juanzi LI,Kehong WANG,

PDF(316 KB)
PDF(316 KB)
Front. Comput. Sci. ›› 2010, Vol. 4 ›› Issue (1) : 100-111. DOI: 10.1007/s11704-009-0064-9
Research articles

A constraint-based topic modeling approach for name disambiguation

  • Feng WANG,Jie TANG,Juanzi LI,Kehong WANG,
Author information +
History +

Abstract

Name ambiguity refers to a problem that different people might be referenced with an identical name. This problem has become critical in many applications, particularly in online bibliography systems, such as DBLP and CiterSeer. Although much work has been conducted to address this problem, there still exist many challenges. In this paper, a general framework of constraint-based topic modeling is proposed, which can make use of user-defined constraints to enhance the performance of name disambiguation. A Gibbs sampling algorithm that integrates the constraints has been proposed to do the inference of the topic model. Experimental results on a real-world dataset show that significant improvements can be obtained by taking the proposed approach.

Keywords

name disambiguation / constraint / topic model

Cite this article

Download citation ▾
Feng WANG, Jie TANG, Juanzi LI, Kehong WANG,. A constraint-based topic modeling approach for name disambiguation. Front. Comput. Sci., 2010, 4(1): 100‒111 https://doi.org/10.1007/s11704-009-0064-9
AI Summary AI Mindmap
PDF(316 KB)

Accesses

Citations

Detail

Sections
Recommended

/