Disambiguating named entitieswith deep supervised learning via crowd labels

Le-kui ZHOU , Si-liang TANG , Jun XIAO , Fei WU , Yue-ting ZHUANG

Front. Inform. Technol. Electron. Eng ›› 2017, Vol. 18 ›› Issue (1) : 97 -106.

PDF (451KB)
Front. Inform. Technol. Electron. Eng ›› 2017, Vol. 18 ›› Issue (1) : 97 -106. DOI: 10.1631/FITEE.1601835
Article
Article

Disambiguating named entitieswith deep supervised learning via crowd labels

Author information +
History +
PDF (451KB)

Abstract

Named entity disambiguation (NED) is the task of linking mentions of ambiguous entities to their referenced entities in a knowledge base such as Wikipedia. We propose an approach to effectively disentangle the discriminative features in the manner of collaborative utilization of collective wisdom (via human-labeled crowd labels) and deep learning (via human-generated data) for the NED task. In particular, we devise a crowd model to elicit the underlying features (crowd features) from crowd labels that indicate a matching candidate for each mention, and then use the crowd features to fine-tune a dynamic convolutional neural network (DCNN). The learned DCNN is employed to obtain deep crowd features to enhance traditional hand-crafted features for the NED task. The proposed method substantially benefits from the utilization of crowd knowledge (via crowd labels) into a generic deep learning for the NED task. Experimental analysis demonstrates that the proposed approach is superior to the traditional hand-crafted features when enough crowd labels are gathered.

Keywords

Named entity disambiguation / Crowdsourcing / Deep learning

Cite this article

Download citation ▾
Le-kui ZHOU, Si-liang TANG, Jun XIAO, Fei WU, Yue-ting ZHUANG. Disambiguating named entitieswith deep supervised learning via crowd labels. Front. Inform. Technol. Electron. Eng, 2017, 18(1): 97-106 DOI:10.1631/FITEE.1601835

登录浏览全文

4963

注册一个新账户 忘记密码

References

[1]

Alhelbawy, A., Gaizauskas, R., 2014. Graph ranking for collective named entity disambiguation. Proc. 52nd Annual Meeting of the Association for Computational Linguistics, p.75–80.

[2]

Bagga, A., Baldwin, B., 1998. Entity-based cross-document coreferencing using the vector space model. Proc. 36th Annual Meeting of the Association for Computational Linguistics and 17th Int. Conf. on Computational Linguistics, p.79–85.

[3]

Blei, D.M., Ng, A.Y., Jordan, M.I., 2003. Latent Dirichlet allocation. J. Mach. Learn. Res., 3:993–1022.

[4]

Bunescu, R., Paşca, M., 2006. Using encyclopedic knowledge for named entity disambiguation. Proc. 11th Conf. of the European Chapter of the Association for Computational Linguistics, p.3–7.

[5]

Cucerzan, S., 2007. Large-scale named entity disambiguation based on Wikipedia data. Proc. Joint Conf. on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, p.708–716.

[6]

Demartini, G., Difallah, D.E., Cudre-Mauroux, P., 2012. ZenCrowd: leveraging probabilistic reasoning and crowdsourcing techniques for large-scale entity linking. Proc. 21st Int. Conf. on World Wide Web, p.469–478.

[7]

Gabrilovich, E., Markovitch, S., 2007. Computing semantic relatedness using Wikipedia-based explicit semantic analysis. Proc. Int. Joint Conf. on Artificial Intelligence, p.1606–1611.

[8]

Gentile, A.L., Zhang, Z.Q.,Xia, L., , 2009. Graph-based semantic relatedness for named entity disambiguation. Proc. Int. Conf. on Software, Services & Semantic Technologies, p.13–20.

[9]

Haas, K., Mika, P., Tarjan, P., , 2011. Enhanced results for web search. Proc. 34th Int. ACM SIGIR Conf. on Research and Development in Information Retrieval, p.725–734.

[10]

Hakimov, S., Oto, S.A., Dogdu, E., 2012. Named entity recognition and disambiguation using linked data and graph-based centrality scoring. Proc. 4th Int. Workshop on Semantic Web Information Management, Article 4.

[11]

Hochreiter, S., Schmidhuber, J., 1997. Long short-term memory. Neur. Comput., 9(8):1735–1780.

[12]

Kalchbrenner, N., Grefenstette, E., Blunsom, P., 2014. A convolutional neural network for modelling sentences. Proc. 52nd Annual Meeting of the Association for Computational Linguistics, p.655–665.

[13]

Mikolov, T., Sutskever, I., Chen, K., , 2013. Distributed representations of words and phrases and their compositionality. Advances in Neural Information Processing Systems, p.3111–3119.

[14]

Pan, Y.H., 2016. Heading toward artificial intelligence 2.0. Engineering, 2(4):409–413.

[15]

Wu, F., Wang, Z.H., Zhang, Z.F., , 2015. Weakly semi-supervised deep learning for multi-label image annotation. IEEE Trans. Big Data, 1(3):109–122.

RIGHTS & PERMISSIONS

Zhejiang University and Springer-Verlag Berlin Heidelberg

AI Summary AI Mindmap
PDF (451KB)

Supplementary files

FITEE-0097-17008-LKZ_suppl_1

FITEE-0097-17008-LKZ_suppl_2

2985

Accesses

0

Citation

Detail

Sections
Recommended

AI思维导图

/