MDJOSC: matching digital talents and job titles in open source communities

Xin LIU , Hang SU , Shuo WANG , Xuesong LU , Aoying ZHOU

Front. Comput. Sci. ›› 2026, Vol. 20 ›› Issue (8) : 2008614

PDF (2367KB)
Front. Comput. Sci. ›› 2026, Vol. 20 ›› Issue (8) : 2008614 DOI: 10.1007/s11704-025-50084-x
Information Systems
RESEARCH ARTICLE

MDJOSC: matching digital talents and job titles in open source communities

Author information +
History +
PDF (2367KB)

Abstract

Open source communities have a wealth of digital talents, who are urgently needed by various industries under the digitalization process of the entire society. However, barriers exist between digital talents in open source communities and employers. On one hand, open source contributors wonder whether their expertise matches the requirements of specific jobs; On the other hand, developers working on small open source projects are less likely to get recognition from employers, compared with those contributing to well-known projects. To bridge this gap, we propose a new task, matching digital talents and job titles in open source communities, which measures the matching degrees between digital talents with open source experience and job titles requiring digital skills. To solve the task, we construct a heterogeneous information network connecting open source communities and job markets, and propose a semi-supervised network alignment model to augment the connectivity of the network. Then we employ a graph neural network to learn the representations of the digital talents and the job titles from the augmented network, based on which we measure the matching degrees between them. Experimental results demonstrate that our method achieves improvements of at least 5.34, 3.52, 2.37, 2.93, and 8.21 in accuracy, precision, recall, F1, and AUC compared to other possible solutions.

Graphical abstract

Keywords

open source community / matching digital talents and job titles / heterogeneous information network / network alignment

Cite this article

Download citation ▾
Xin LIU, Hang SU, Shuo WANG, Xuesong LU, Aoying ZHOU. MDJOSC: matching digital talents and job titles in open source communities. Front. Comput. Sci., 2026, 20(8): 2008614 DOI:10.1007/s11704-025-50084-x

登录浏览全文

4963

注册一个新账户 忘记密码

References

[1]

Bergson-Shilcock A, Taylor R. Closing the digital skill divide: the payoff for workers, business, and the economy. See t.newsletterext.worldbank.org/r/?id=h23ec8764,c882dba,c88543a website, 2023

[2]

OpenUK. State of open: the UK in 2023 phase three “skills or bust”. See openuk.uk/wp-content/uploads/2023/11/State-of-Open-The-UK-in-2023-Phase-Three.pdf website, 2023

[3]

Geyik S C, Dialani V, Meng M, Smith R. In-session personalization for talent search. In: Proceedings of the 27th ACM International Conference on Information and Knowledge Management. 2018, 2107–2115

[4]

Ozcaglar C, Geyik S, Schmitz B, Sharma P, Shelkovnykov A, Ma Y, Buchanan E. Entity personalized talent search models with tree interaction features. In: Proceedings of World Wide Web Conference. 2019, 3116–3122

[5]

Luo Y, Zhang H, Wen Y, Zhang X. ResumeGAN: an optimized deep representation learning framework for talent-job fit via adversarial learning. In: Proceedings of the 28th ACM International Conference on Information and Knowledge Management. 2019, 1101–1110

[6]

Qin C, Zhu H, Xu T, Zhu C, Jiang L, Chen E, Xiong H. Enhancing person-job fit for talent recruitment: an ability-aware neural network approach. In: Proceedings of the 41st International ACM SIGIR Conference on Research & Development in Information Retrieval. 2018, 25–34

[7]

Yao K, Zhang J, Qin C, Wang P, Zhu H, Xiong H. Knowledge enhanced person-job fit for talent recruitment. In: Proceedings of the 38th IEEE International Conference on Data Engineering. 2022, 3467–3480

[8]

Gong Z, Song Y, Zhang T, Wen J R, Zhao D, Yan R. Your career path matters in person-job fit. In: Proceedings of the 38th AAAI Conference on Artificial Intelligence. 2024, 8427–8435

[9]

Li L, Jing H, Tong H, Yang J, He Q, Chen B C. NEMO: next career move prediction with contextual embedding. In: Proceedings of the 26th International Conference on World Wide Web Companion. 2017, 505–513

[10]

Wang C, Zhu H, Hao Q, Xiao K, Xiong H. Variable interval time sequence modeling for career trajectory prediction: deep collaborative perspective. In: Proceedings of Web Conference 2021. 2021, 612–623

[11]

Zhang L, Zhou D, Zhu H, Xu T, Zha R, Chen E, Xiong H. Attentive heterogeneous graph embedding for job mobility prediction. In: Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining. 2021, 2192–2201

[12]

Meng Q, Zhu H, Xiao K, Zhang L, Xiong H. A hierarchical career-path-aware neural network for job mobility prediction. In: Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. 2019, 14–24

[13]

Fang C, Qin C, Zhang Q, Yao K, Zhang J, Zhu H, Zhuang F, Xiong H. RecruitPro: a pretrained language model with skill-aware prompt learning for intelligent recruitment. In: Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining. 2023, 3991–4002

[14]

Hemamou L, Felhi G, Vandenbussche V, Martin J C, Clavel C. HireNet: a hierarchical attention model for the automatic analysis of asynchronous video job interviews. In: Proceedings of the 33rd AAAI Conference on Artificial Intelligence. 2019, 573–581

[15]

Shen D, Qin C, Zhu H, Xu T, Chen E, Xiong H . Joint representation learning with relation-enhanced topic models for intelligent job interview assessment. ACM Transactions on Information Systems (TOIS), 2021, 40( 1): 15

[16]

Hang J, Dong Z, Zhao H, Song X, Wang P, Zhu H. Outside in: market-aware heterogeneous graph neural network for employee turnover prediction. In: Proceedings of the 15th ACM International Conference on Web Search and Data Mining. 2022, 353–362

[17]

Wang C, Zhu H, Zhu C, Zhang X, Chen E, Xiong H. Personalized employee training course recommendation with career development awareness. In: Proceedings of Web Conference 2020. 2020, 1648–1659

[18]

Wang C, Zhu H, Wang P, Zhu C, Zhang X, Chen E, Xiong H . Personalized and explainable employee training course recommendations: a Bayesian variational approach. ACM Transactions on Information Systems (TOIS), 2021, 40( 4): 70

[19]

Dey T, Karnauch A, Mockus A. Representation of developer expertise in open source software. In: Proceedings of the 43rd IEEE/ACM International Conference on Software Engineering. 2021, 995–1007

[20]

Hu Z, Dong Y, Wang K, Sun Y. Heterogeneous graph transformer. In: Proceedings of Web Conference 2020. 2020, 2704–2710

[21]

Qin C, Zhang L, Cheng Y, Zha R, Shen D, Zhang Q, Chen X, Sun Y, Zhu C, Zhu H, Xiong H. A comprehensive survey of artificial intelligence techniques for talent analytics. Proceedings of the IEEE, doi: 10.1109/JPROC.2025.3572744

[22]

Yao K, Zhang J, Qin C, Song X, Wang P, Zhu H, Xiong H. ResuFormer: semantic structure understanding for resumes via multi-modal pre-training. In: Proceedings of 2023 IEEE 39th International Conference on Data Engineering. 2023, 3154–3167

[23]

Wei M, He Y, Zhang Q. Robust layout-aware IE for visually rich documents with pre-trained language models. In: Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval. 2020, 2367–2376

[24]

Jiang F, Qin C, Zhang J, Yao K, Chen X, Shen D, Zhu C, Zhu H, Xiong H. Towards efficient resume understanding: A multi-granularity multi-modal pre-training approach. In: Proceedings of 2024 IEEE International Conference on Multimedia and Expo. 2024, 1–6

[25]

Ramanath R, Inan H, Polatkan G, Hu B, Guo Q, Ozcaglar C, Wu X, Kenthapadi K, Geyik S C. Towards deep and representation learning for talent search at linkedin. In: Proceedings of the 27th ACM International Conference on Information and Knowledge Management. 2018, 2253–2261

[26]

Chen H, Du L, Lu Y, Fu Q, Chen X, Han S, Kang Y, Lu G, Li Z. Professional network matters: connections empower person-job fit. In: Proceedings of the 17th ACM International Conference on Web Search and Data Mining. 2024, 96–105

[27]

Zhu C, Zhu H, Xiong H, Ma C, Xie F, Ding P, Li P . Person-job fit: adapting the right talent for the right job with joint representation learning. ACM Transactions on Management Information Systems (TMIS), 2018, 9( 3): 12

[28]

Yang C, Hou Y, Song Y, Zhang T, Wen J R, Zhao W X. Modeling two-way selection preference for person-job fit. In: Proceedings of the 16th ACM Conference on Recommender Systems. 2022, 102–112

[29]

Liu X, Wang Y, Dong Q, Lu X. Job title prediction as a dual task of expertise prediction in open source software. In: Proceedings of Joint European Conference on Machine Learning and Knowledge Discovery in Databases. 2024, 381–396

[30]

Man T, Shen H, Liu S, Jin X, Cheng X. Predict anchor links across social networks via an embedding approach. In: Proceedings of the 25th International Joint Conference on Artificial Intelligence. 2016, 1823–1829

[31]

Zhang S, Tong H. FINAL: fast attributed network alignment. In: Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 2016, 1345–1354

[32]

Zhou F, Liu L, Zhang K, Trajcevski G, Wu J, Zhong T. DeepLink: a deep learning approach for user identity linkage. In: Proceedings of IEEE INFOCOM 2018 - IEEE Conference on Computer Communications. 2018, 1313–1321

[33]

Chu X, Fan X, Yao D, Zhu Z, Huang J, Bi J. Cross-Network embedding for multi-network alignment. In: Proceedings of World Wide Web Conference. 2019, 273–284

[34]

Jiao P, Liu Y, Wang Y, Zhang G. CINA: curvature-based integrated network alignment with hypergraph. In: Proceedings of 2024 IEEE 40th International Conference on Data Engineering. 2024, 2709–2722

[35]

Du X, Yan J, Zha H. Joint link prediction and network alignment via cross-graph embedding. In: Proceedings of the 28th International Joint Conference on Artificial Intelligence. 2019, 2251–2257

[36]

Gao J, Huang X, Li J. Unsupervised graph alignment with Wasserstein distance discriminator. In: Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining. 2021, 426–435

[37]

Peng J, Xiong F, Pan S, Wang L, Xiong X. Robust network alignment with the combination of structure and attribute embeddings. In: Proceedings of 2023 IEEE International Conference on Data Mining. 2023, 498–507

[38]

Chen C, Xie W, Xu T, Rong Y, Huang W, Ding X, Huang Y, Huang J. Unsupervised adversarial graph alignment with graph embedding. 2019, arXiv preprint arXiv: 1907.00544

[39]

Trung H T, Van Vinh T, Tam N T, Yin H, Weidlich M, Hung N Q V. Adaptive network alignment with unsupervised and multi-order convolutional networks. In: Proceedings of 2020 IEEE 36th International Conference on Data Engineering. 2020, 85–96

[40]

Sun Q, Lin X, Zhang Y, Zhang W, Chen C. Towards higher-order topological consistency for unsupervised network alignment. In: Proceedings of the 39th IEEE International Conference on Data Engineering. 2023, 177–190

[41]

Brito G, Mombach T, Valente M T. Migrating to GraphQL: a practical assessment. In: Proceedings of the 26th IEEE International Conference on Software Analysis, Evolution and Reengineering. 2019, 140–150

[42]

Zhang M, Jensen K, Sonniks S, Plank B. SkillSpan: hard and soft skill extraction from English job postings. In: Proceedings of 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. 2022, 4962–4984

[43]

De Smedt J, Le Vrang M, Papantoniou A. ESCO: towards a semantic web for the European labor market. In: Proceedings of Workshop on Linked Data on the Web. 2015

[44]

Nesta. Skills Extractor Library. See nestauk.github.io/ojd_daps_skills/, 2024

[45]

Saxena S, Chandra J. A survey on network alignment: approaches, applications and future directions. In: Proceedings of the 33rd International Joint Conference on Artificial Intelligence. 2024, 8216–8224

[46]

Kipf T N, Welling M. Semi-supervised classification with graph convolutional networks. In: Proceedings of the 5th International Conference on Learning Representations. 2017

[47]

Velicković P, Cucurull G, Casanova A, Romero A, Liò P, Bengio Y. Graph attention networks. In: Proceedings of the 6th International Conference on Learning Representations. 2018

[48]

Kleinbaum D G, Klein M. Logistic regression: a self-learning text. Springer, 2002

[49]

John G H, Langley P. Estimating continuous distributions in Bayesian classifiers. In: Proceedings of the 11th Annual Conference on Uncertainty in Artificial Intelligence. 1995, 338–345

[50]

Friedman J H . Greedy function approximation: a gradient boosting machine. The Annals of Statistics, 2001, 29( 5): 1189–1232

[51]

Freund Y, Schapire R E. Large margin classification using the perceptron algorithm. In: Proceedings of the 11th Annual Conference on Computational Learning Theory. 1998, 209–217

[52]

Rendle S, Freudenthaler C, Gantner Z, Schmidt-Thieme L. BPR: Bayesian personalized ranking from implicit feedback. In: Proceedings of the 25th Conference on Uncertainty in Artificial Intelligence. 2009, 452–461

[53]

He X, Liao L, Zhang H, Nie L, Hu X, Chua T S. Neural collaborative filtering. In: Proceedings of the 26th International Conference on World Wide Web. 2017, 173–182

[54]

He X, Deng K, Wang X, Li Y, Zhang Y, Wang M. LightGCN: simplifying and powering graph convolution network for recommendation. In: Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval. 2020, 639–648

[55]

Cai X, Huang C, Xia L, Ren X. LightGCL: simple yet effective graph contrastive learning for recommendation. In: Proceedings of the 11th International Conference on Learning Representations. 2023

[56]

Grattafiori A, Dubey A, Jauhri A, Pandey A, Kadian A, , . The llama 3 herd of models. 2024, arXiv preprint arXiv: 2407.21783

[57]

DeepSeek-AI. DeepSeek-R1: incentivizing reasoning capability in LLMs via reinforcement learning. 2025, arXiv preprint arXiv: 2501.12948

[58]

OpenAI. Gpt-4 technical report. 2024, arXiv preprint arXiv: 2303.08774

[59]

OpenAI. Learning to reason with LLMs. See openai.com/index/learning-to-reason-with-llms/ website, 2024

[60]

Wolf T, Debut L, Sanh V, Chaumond J, Delangue C, , . Transformers: state-of-the-art natural language processing. In: Proceedings of 2020 Conference on Empirical Methods in Natural Language Processing: System Demonstrations. 2020, 38–45

[61]

OpenAI. OpenAI developer platform. See platform.openai.com/docs/overview website, 2023

[62]

Ying C, Cai T, Luo S, Zheng S, Ke G, He D, Shen Y, Liu T Y. Do transformers really perform bad for graph representation? In: Proceedings of the 35th International Conference on Neural Information Processing Systems. 2021, 28877–28888

[63]

Maaten v. d L, Hinton G . Visualizing data using t-SNE. Journal of Machine Learning Research, 2008, 9( 86): 2579–2605

RIGHTS & PERMISSIONS

Higher Education Press

AI Summary AI Mindmap
PDF (2367KB)

Supplementary files

Highlights

601

Accesses

0

Citation

Detail

Sections
Recommended

AI思维导图

/