Disambiguating named entitieswith deep supervised learning via crowd labels

Le-kui ZHOU; Si-liang TANG; Jun XIAO; Fei WU; Yue-ting ZHUANG

doi:10.1631/FITEE.1601835

PDF(451 KB)

Front. Inform. Technol. Electron. Eng ›› 2017, Vol. 18 ›› Issue (1) : 97-106. DOI: 10.1631/FITEE.1601835

Article

Disambiguating named entitieswith deep supervised learning via crowd labels

Author information +

History +

Abstract

Named entity disambiguation (NED) is the task of linking mentions of ambiguous entities to their referenced entities in a knowledge base such as Wikipedia. We propose an approach to effectively disentangle the discriminative features in the manner of collaborative utilization of collective wisdom (via human-labeled crowd labels) and deep learning (via human-generated data) for the NED task. In particular, we devise a crowd model to elicit the underlying features (crowd features) from crowd labels that indicate a matching candidate for each mention, and then use the crowd features to fine-tune a dynamic convolutional neural network (DCNN). The learned DCNN is employed to obtain deep crowd features to enhance traditional hand-crafted features for the NED task. The proposed method substantially benefits from the utilization of crowd knowledge (via crowd labels) into a generic deep learning for the NED task. Experimental analysis demonstrates that the proposed approach is superior to the traditional hand-crafted features when enough crowd labels are gathered.

Keywords

Named entity disambiguation / Crowdsourcing / Deep learning

Cite this article

EndNote

Ris (Procite)

Bibtex

Download citation ▾

Le-kui ZHOU, Si-liang TANG, Jun XIAO, Fei WU, Yue-ting ZHUANG. Disambiguating named entitieswith deep supervised learning via crowd labels. Front. Inform. Technol. Electron. Eng, 2017, 18(1): 97‒106 https://doi.org/10.1631/FITEE.1601835

References

Publishing order | Descend order by publishing year | Descend order by cited within

[1]	Alhelbawy, A., Gaizauskas, R., 2014. Graph ranking for collective named entity disambiguation. Proc. 52nd Annual Meeting of the Association for Computational Linguistics, p.75–80.

[2]	Bagga, A., Baldwin, B., 1998. Entity-based cross-document coreferencing using the vector space model. Proc. 36th Annual Meeting of the Association for Computational Linguistics and 17th Int. Conf. on Computational Linguistics, p.79–85. http://dx.doi.org/10.3115/980845.980859

[3]	Blei, D.M., Ng, A.Y., Jordan, M.I., 2003. Latent Dirichlet allocation. J. Mach. Learn. Res., 3:993–1022.

[4]	Bunescu, R., Paşca, M., 2006. Using encyclopedic knowledge for named entity disambiguation. Proc. 11th Conf. of the European Chapter of the Association for Computational Linguistics, p.3–7.

[5]	Cucerzan, S., 2007. Large-scale named entity disambiguation based on Wikipedia data. Proc. Joint Conf. on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, p.708–716.

[6]	Demartini, G., Difallah, D.E., Cudre-Mauroux, P., 2012. ZenCrowd: leveraging probabilistic reasoning and crowdsourcing techniques for large-scale entity linking. Proc. 21st Int. Conf. on World Wide Web, p.469–478. http://dx.doi.org/10.1145/2187836.2187900

[7]	Gabrilovich, E., Markovitch, S., 2007. Computing semantic relatedness using Wikipedia-based explicit semantic analysis. Proc. Int. Joint Conf. on Artificial Intelligence, p.1606–1611. http://dx.doi.org/10.1145/2063576.2063865

[8]	Gentile, A.L., Zhang, Z.Q.,Xia, L., , 2009. Graph-based semantic relatedness for named entity disambiguation. Proc. Int. Conf. on Software, Services & Semantic Technologies, p.13–20.

[9]	Haas, K., Mika, P., Tarjan, P., , 2011. Enhanced results for web search. Proc. 34th Int. ACM SIGIR Conf. on Research and Development in Information Retrieval, p.725–734. http://dx.doi.org/10.1145/2009916.2010014

[10]	Hakimov, S., Oto, S.A., Dogdu, E., 2012. Named entity recognition and disambiguation using linked data and graph-based centrality scoring. Proc. 4th Int. Workshop on Semantic Web Information Management, Article 4. http://dx.doi.org/10.1145/2237867.2237871

[11]	Hochreiter, S., Schmidhuber, J., 1997. Long short-term memory. Neur. Comput., 9(8):1735–1780.

[12]	Kalchbrenner, N., Grefenstette, E., Blunsom, P., 2014. A convolutional neural network for modelling sentences. Proc. 52nd Annual Meeting of the Association for Computational Linguistics, p.655–665.

[13]	Mikolov, T., Sutskever, I., Chen, K., , 2013. Distributed representations of words and phrases and their compositionality. Advances in Neural Information Processing Systems, p.3111–3119.

[14]	Pan, Y.H., 2016. Heading toward artificial intelligence 2.0. Engineering, 2(4):409–413. http://dx.doi.org/10.1016/J.ENG.2016.04.018

[15]	Wu, F., Wang, Z.H., Zhang, Z.F., , 2015. Weakly semi-supervised deep learning for multi-label image annotation. IEEE Trans. Big Data, 1(3):109–122. http://dx.doi.org/10.1109/TBDATA.2015.2497270