Information networks fusion based on multi-task coordination

Dong LI; Derong SHEN; Yue KOU; Tiezheng NIE

doi:10.1007/s11704-020-9195-9

PDF(852 KB)

Front. Comput. Sci. ›› 2021, Vol. 15 ›› Issue (4) : 154608. DOI: 10.1007/s11704-020-9195-9

RESEARCH ARTICLE

Information networks fusion based on multi-task coordination

Author information +

History +

Abstract

Information networks provide a powerful representation of entities and the relationships between them. Information networks fusion is a technique for information fusion that jointly reasons about entities, links and relations in the presence of various sources. However, existing methods for information networks fusion tend to rely on a single task which might not get enough evidence for reasoning. In order to solve this issue, in this paper, we present a novel model called MC-INFM (information networks fusion model based on multi-task coordination). Different from traditional models, MC-INFM casts the fusion problem as a probabilistic inference problem, and collectively performs multiple tasks (including entity resolution, link prediction and relation matching) to infer the final result of fusion. First, we define the intra-features and the inter-features respectively and model them as factor graphs, which can provide abundant evidence to infer. Then, we use conditional random field (CRF) to learn the weight of each feature and infer the results of these tasks simultaneously by performing the maximum probabilistic inference. Experiments demonstrate the effectiveness of our proposed model.

Keywords

information networks fusion / multi-task coordination / conditional random field / inference

Cite this article

EndNote

Ris (Procite)

Bibtex

Download citation ▾

Dong LI, Derong SHEN, Yue KOU, Tiezheng NIE. Information networks fusion based on multi-task coordination. Front. Comput. Sci., 2021, 15(4): 154608 https://doi.org/10.1007/s11704-020-9195-9

This is a preview of subscription content, contact us for subscripton.

References

Publishing order | Descend order by publishing year | Descend order by cited within

[1]	Zhang J. Social network fusion and mining: a survey. 2018, arXiv preprint arXiv:1804.09874

[2]	Namata G, Kok S, Getoor L. Collective graph identification. In: Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 2011, 87–95 CrossRef Google scholar

[3]	Lacoste-Julien S, Palla K, Davies A, Kasneci G, Graepel T. SIGMa: simple greedy matching for aligning large knowledge bases. In: Proceedings of the 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 2012, 572–580 CrossRef Google scholar

[4]	Suchanek F, Abiteboul S, Senellart P. PARIS: probabilistic alignment of relations, instances, and schema. Proceedings of the VLDB Endowment, 2011, 5(3): 157–168 CrossRef Google scholar

[5]	Niu F, Re C, Doan A, Shavlik J. Tuffy: scaling up statistical inference in Markov logic networks using an RDBMS. Proceedings of the VLDB Endowment, 2011, 4(6): 373–384 CrossRef Google scholar

[6]	Lao N, Mitchell T, Cohen W. Random walk inference and learning in a large scale knowledge base. In: Proceedings of Conference on Empirical Methods in Natural Language Processing. 2011, 27–31

[7]	Kong X, Zhang J, Yu P. Inferring anchor links across multiple heterogeneous social networks. In: Proceedings of ACM International Conference on Information and Knowledge Management. 2013, 179–188 CrossRef Google scholar

[8]	Koutra D, Tong H, Lubensky D. Big-align: fast bipartite graph alignment. In: Proceedings of International Conference on Data Mining. 2013, 389–398 CrossRef Google scholar

[9]	Zafarani R, Liu H. Connecting users across social media sites: a behavioral-modeling approach. In: Proceedings of the 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 2013, 41–49 CrossRef Google scholar

[10]	Zhang J, Shao W, Wang S, Kong X, Yu P. PNA: partial network alignment with generic stable matching. In: Proceedings of IEEE International Conference on Information Reuse and Integration. 2015, 166–173 CrossRef Google scholar

[11]	Zhang J, Yu P. Integrated anchor and social link predictions across partially aligned social networks. In: Proceedings of International Joint Conference on Artificial Intelligence. 2015, 1620–1626

[12]	Zhang J, Yu P. Multiple anonymized social networks alignment. In: Proceedings of International Conference on Data Mining. 2015, 599–608 CrossRef Google scholar

[13]	Zhang J, Yu P. PCT: partial co-alignment of social networks. In: Proceedings of International World Wide Web Conference. 2016, 749–759 CrossRef Google scholar

[14]	Zhang J, Kong X, Yu P. Predicting social links for new users across aligned heterogeneous social networks. In: Proceedings of International Conference on Data Mining. 2013, 1289–1294 CrossRef Google scholar

[15]	Zhang J, Kong X, Yu P. Transfer heterogeneous links across locationbased social networks. In: Proceedings of ACM International Conference on Web Search and Data Mining. 2014, 303–312 CrossRef Google scholar

[16]	Zhang J. Link prediction across heterogeneous social networks: a survey. Dissertation, University of Illinois at Chicago, US. 2014

[17]	Zhang J, Yu P, Zhou Z. Meta-path based multi-network collective link prediction. In: Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 2014, 1286–1295 CrossRef Google scholar

[18]	Richardson M, Domingos P. Markov logic networks. Machine Learning, 2006, 62: 107–136 CrossRef Google scholar

[19]	Lafferty J, McCallum A, Pereira F. Conditional random fields: probabilistic models for segmenting and labeling sequence data. In: Proceedings of International Conference on Machine Learning. 2001, 282–289

[20]	Zhou T, Lv L, Zhang Y. Predicting missing links via local information. The European Physical Journal B, 2009, 71(4): 623–630 CrossRef Google scholar

[21]	Lv L, Zhou T. Link prediction in complex networks: a survey. Physica A: Statistical Mechanics and its Applications, 2011, 390: 1150–1170 CrossRef Google scholar

[22]	Lee J Y, Tukhvatov R. Evaluations of similarity measures on VK for link prediction. Data Science and Engineering, 2018, 3(3): 277–289 CrossRef Google scholar

[23]	Hasan M, Chaoji V, Salem S, Zaki M. Link prediction using supervised learning. In: Proceedings of SIAM International Conference on Data Mining. 2006

[24]	Aditya K, Menon A, Elkan C. Link prediction via matrix factorization. In: Proceedings of European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases. 2011, 437–452 CrossRef Google scholar

[25]	Dunlavy D, Kolda T, Acar E. Temporal link prediction using matrix and tensor factorizations. ACM Transactions on Knowledge Discovery from Data, 2011, 5(2): 10 CrossRef Google scholar

[26]	Tang J, Gao H, Hu X, Liu H. Exploiting homophily effect for trust prediction. In: Proceedings of ACM International Conference on Web Search and Data Mining. 2013, 53–62 CrossRef Google scholar

[27]	Yates A, Etzioni O. Unsupervised methods for determining object and relation synonyms on the web. Journal of Artificial Intelligence Research, 2009, 34(1): 255–296 CrossRef Google scholar

[28]	Dong X, Srivastava D. Knowledge curation and knowledge fusion: challenges, models and applications. In: Proceedings of the ACM SIGMDD International Conference on Management of Data. 2015, 2063–2066 CrossRef Google scholar

[29]	Galarraga L, Heitz G. Canonicalizing open knowledge bases. In: Proceedings of ACM International Conference on Information and Knowledge Management. 2014, 1679–1688 CrossRef Google scholar

[30]	Cohen W, Ravikumar P, Fienberg S. A comparison of string distance metrics for name-matching tasks. In: Proceedings of International Joint Conference on Artificial Intelligence. 2003, 73–78

[31]	Chen Y, Wang D. Knowledge expansion over probabilistic knowledge bases. In: Proceedings of International Conference on Management of Data. 2014, 649–660 CrossRef Google scholar

[32]	Rossi R J. Mathematical Statistics: an Introduction to Likelihood Based Inference. New York: John Wiley & Sons, 2018 CrossRef Google scholar

[33]	Fader A, Soderland S, Etzioni O. Identifying relations for open information extraction. In: Proceedings of Conference on Empirical Methods in Natural Language Processing. 2011, 1535–1545

[34]	Suchanek F, Kasneci G, Weikum G. Yago: a core of semantic knowledge. In: Proceedings of International World Wide Web Conference. 2007, 697–706 CrossRef Google scholar

[35]	Lehmann J, Isele R, Jakob M, Jentzsch A, Kontokostas D, et al. DBpedia — a large-scale, multilingual knowledge base extracted from Wikipedia. Semantic Web, 2013, 6(2): 167–195 CrossRef Google scholar