Artificial intelligence in spine surgery

Cheng Zhang , Shanshan Liu , Jialin Shi , Xingyu Zhou , Peter Passias , Nanfang Xu , Weishi Li

Spine Research ›› 2025, Vol. 1 ›› Issue (1) : 13 -22.

PDF (1271KB)
Spine Research ›› 2025, Vol. 1 ›› Issue (1) : 13 -22. DOI: 10.1097/br9.0000000000000005
Review

Artificial intelligence in spine surgery

Author information +
History +
PDF (1271KB)

Abstract

Artificial intelligence (AI) technology has rapidly advanced in recent years, particularly in fields such as computer vision and natural language processing, where significant breakthroughs have been made. The emergence of large language models has greatly enhanced AI’s ability to understand and generate text, accelerating its application across various domains. The AI-generated content has maintained a trend of rapid growth, with ChatGPT (OpenAI, USA) and DeepSeek-V3 (DeepSeek, China) gaining global attention due to their outstanding performance. AI development in spinal surgery is still in its early stages. Although some hospitals have pioneered the deployment of deep learning models in imaging and surgical assistance systems, AI tools that are widely adopted and routinely integrated into the daily practice of most spinal surgeons remain scarce. Developing models and tools with high accuracy, strong interpretability, and trustworthiness remains one of the primary goals for AI development in spinal surgery. This review summarizes the recent advancements in AI within the field of spinal surgery, exploring the current challenges, transformations, and future opportunities of AI in spinal surgery. The aim of this review is to enhance the understanding of AI’s role in spinal care among clinicians, clinical researchers, AI scientists, and patients. Our goal is to promote interdisciplinary collaboration and development, thereby fostering a comprehensive understanding of AI’s potential in improving spinal care.

Keywords

artificial intelligence / ChatGPT / DeepSeek / large language models / spine

Cite this article

Download citation ▾
Cheng Zhang, Shanshan Liu, Jialin Shi, Xingyu Zhou, Peter Passias, Nanfang Xu, Weishi Li. Artificial intelligence in spine surgery. Spine Research, 2025, 1(1): 13-22 DOI:10.1097/br9.0000000000000005

登录浏览全文

4963

注册一个新账户 忘记密码

References

[1]

Rajpurkar P, Chen E, Banerjee O, Topol EJ. AI in health and medicine. Nat Med. 2022;28:31–8.

[2]

Huang T, Xu H, Wang H, et al. Artificial intelligence for medicine: progress, challenges, and perspectives. Innov Med. 2023;1:100030.

[3]

Topol EJ. Learning the language of life with AI. Science. 2025;387:eadv4414.

[4]

Machine learning helps to determine the diverse conformations of RNA molecules [published online ahead of print December 18, 2024]. Nature. 2024.

[5]

Liu Y, Chen Y, Han L. Bioinformatics: advancing biomedical discovery and innovation in the era of big data and artificial intelligence. Innov Med. 2023;1:100012.

[6]

Varghese C, Harrison EM, O’Grady G, Topol EJ. Artificial intelligence in surgery. Nat Med. 2024;30:1257–68.

[7]

Webster P. How AI-powered handheld devices are boosting disease diagnostics -from cancer to dermatology. Nat Med. 2024;30:914–5.

[8]

Daneshjou R, Vodrahalli K, Novoa RA, et al. Disparities in dermatology AI performance on a diverse, curated clinical image set. Sci Adv. 2022;8:eabq6147.

[9]

Kose K, Rotemberg V. The promise and drawbacks of federated learning for dermatology AI. JAMA Dermatol. 2024;160:269–70.

[10]

Cen LP, Ji J, Lin JW, et al. Automatic detection of 39 fundus diseases and conditions in retinal photographs using deep neural networks. Nat Commun. 2021;12:4828.

[11]

Varadarajan AV, Bavishi P, Ruamviboonsuk P, et al. Predicting optical coherence tomography-derived diabetic macular edema grades from fundus photographs using deep learning. Nat Commun. 2020;11:130.

[12]

Teo ZL, Ting DSW. AI telemedicine screening in ophthalmology: health economic considerations. Lancet Glob Health. 2023;11:e318–20.

[13]

Lu MY, Chen TY, Williamson DFK, et al. AI-based pathology predicts origins for cancers of unknown primary. Nature. 2021;594:106–10.

[14]

Abdurrachim D, Lek S, Ong CZL, et al. Utility of AI digital pathology as an aid for pathologists scoring fibrosis in MASH. J Hepatol. 2025;82:898–908.

[15]

Pulaski H, Harrison SA, Mehta SS, et al. Clinical validation of an AI-based pathology tool for scoring of metabolic dysfunction-associated steatohepatitis. Nat Med. 2025;31:315–22.

[16]

Rajpurkar P, Lungren MP. The current and future state of AI interpretation of medical images. N Engl J Med. 2023;388:1981–90.

[17]

Achiam OJ, Adler S, Agarwal S, et al. GPT-4 Technical Report; 2023.

[18]

Hurst OA, Lerer A, Goucher AP, et al. GPT-4o system card. ArXiv. 2024;abs/2410.21276.

[19]

Liu A, Feng B, et al.; DeepSeek-AI. DeepSeek-V3 technical report ArXiv. 2024;abs/2412.19437.

[20]

Benjamens S, Dhunnoo P, Meskó B. The state of artificial intelligence-based FDA-approved medical devices and algorithms: an online database. NPJ Digit Med. 2020;3:118.

[21]

GBD 2017 Disease and Injury Incidence and Prevalence Collaborators. Global, regional, and national incidence, prevalence, and years lived with disability for 354 diseases and injuries for 195 countries and territories, 1990-2017: a systematic analysis for the Global Burden of Disease Study 2017. Lancet. 2018;392:1789–858.

[22]

Knezevic NN, Candido KD, Vlaeyen JWS, Van Zundert J, Cohen SP. Low back pain. Lancet. 2021;398:78–92.

[23]

Sounderajah V, Ashrafian H, Aggarwal R, et al. Developing specific reporting guidelines for diagnostic accuracy studies assessing AI interventions: the STARD-AI Steering Group. Nat Med. 2020;26:807–8.

[24]

Sounderajah V, Ashrafian H, Golub RM, et al; STARD-AI Steering Committee. Developing a reporting guideline for artificial intelligence-centred diagnostic test accuracy studies: the STARD-AI protocol. BMJ Open. 2021;11:e047709.

[25]

Li J, Guan Z, Wang J, et al. Integrated image-based deep learning and language models for primary diabetes care. Nat Med. 2024;30:2886–96.

[26]

Wan P, Huang Z, Tang W, et al. Outpatient reception via collaboration between nurses and a large language model: a randomized controlled trial. Nat Med. 2024;30:2878–85.

[27]

Shi X, Cui Y, Wang S, Pan Y, Wang B, Lei M. Development and validation of a web-based artificial intelligence prediction model to assess massive intraoperative blood loss for metastatic spinal disease using machine learning techniques. Spine J. 2024;24:146–60.

[28]

Phellan Aro R, Hachem B, Clin J, Mac-Thiong J-M, Duong L. Real-time prediction of postoperative spinal shape with machine learning models trained on finite element biomechanical simulations. Int J Comput Assist Radiol Surg. 2024;19:1983–90.

[29]

Jiang F, Li X, Liu L, Xie Z, Wu X, Wang Y. Automated machine learning-based model for the prediction of pedicle screw loosening after degenerative lumbar fusion surgery. Biosci Trends. 2024;18:83–93.

[30]

Wu X, Wang Z, Zheng L, et al. Construction and verification of a machine learning-based prediction model of deep vein thrombosis formation after spinal surgery. Int J Med Inform. 2024;192:105609.

[31]

Karabacak M, Schupper AJ, Carr MT, Bhimani AD, Steinberger J, Margetis K. Development and internal validation of machine learning models for personalized survival predictions in spinal cord glioma patients. Spine J. 2024;24:1065–76.

[32]

Fan G, Yang S, Qin J, et al. Machine learning predict survivals of spinal and Pelvic Ewing’s Sarcoma with the SEER database. Global Spine J. 2024;14:1125–36.

[33]

Cui YP, Shi XD, Qin Y, et al. Establishment and validation of an interactive artificial intelligence platform to predict postoperative ambulatory status for patients with metastatic spinal disease: a multicenter analysis. Int J Surg. 2024;110:2738–56.

[34]

Fan GX, Liu HQ, Yang S, et al. Early prognostication of critical patients with spinal cord injury a machine learning study with 1485 case. Spine. 2024;49:754–62.

[35]

Guo ZJ, Wang PY, Ye SH, et al. Interpretable machine learning models based on shapley additive explanations for predicting the risk of cerebrospinal fluid leakage in lumbar fusion surgery. Spine. 2024;49:1281–93.

[36]

Al-Shawwa A, Ost K, Anderson D, et al. Advanced MRI metrics improve the prediction of baseline disease severity for individuals with degenerative cervical myelopathy. Spine J. 2024;24:1605–14.

[37]

Mohanty S, Hassan FM, Lenke LG, et al; International Spine Study Group. Machine learning clustering of adult spinal deformity patients identifies four prognostic phenotypes: a multicenter prospective cohort analysis with single surgeon external validation. Spine J. 2024;24:1095–108.

[38]

Schönnagel L, Caffard T, Vu-Han TL, et al. Predicting postoperative outcomes in lumbar spinal fusion: development of a machine learning model. Spine J. 2024;24:239–49.

[39]

Schönnagel L, Tani S, Vu-Han TL, et al. Predicting conversion of ambulatory ACDF patients to inpatient: a machine learning approach. Spine J. 2024;24:563–71.

[40]

Berg B, Gorosito MA, Fjeld O, et al. Machine learning models for predicting disability and pain following lumbar disc herniation surgery. JAMA Netw Open. 2024;7:e2355024.

[41]

Pedro KM, Alvi MA, Hejrati N, Quddusi AI, Singh A, Fehlings MG. Machine learning-based cluster analysis identifies four unique phenotypes of patients with degenerative cervical myelopathy with distinct clinical profiles and long-term functional and neurological outcomes. Ebiomedicine. 2024;106:105226.

[42]

Thirunavukarasu AJ, Ting DSJ, Elangovan K, Gutierrez L, Tan TF, Ting DSW. Large language models in medicine. Nat Med. 2023;29:1930–40.

[43]

Zhao WX, Zhou K, Li J, et al. A survey of large language models. ArXiv. 2023;abs/2303.18223.

[44]

Wei J, Tay Y, Bommasani R, et al. Emergent abilities of large language models. ArXiv. 2022;abs/2206.07682.

[45]

Roberts K. Large language models for reducing clinicians’ documentation burden. Nat Med. 2024;30:942–3.

[46]

Omiye JA, Gui H, Rezaei SJ, Zou J, Daneshjou R. Large language models in medicine: the potentials and pitfalls: a narrative review. Ann Intern Med. 2024;177:210–20.

[47]

Singhal K, Azizi S, Tu T, et al. Large language models encode clinical knowledge. Nature. 2023;620:172–80.

[48]

Zhang C, Liu S, Zhou X, et al. Examining the role of large language models in orthopedics: systematic review. J Med Internet Res. 2024;26:e59607.

[49]

Shrestha N, Shen Z, Zaidat B, et al. Performance of ChatGPT on NASS clinical guidelines for the diagnosis and treatment of low back pain: a comparison study. Spine (Phila Pa 1976). 2024;49:640–51.

[50]

Gianola S, Bargeri S, Castellini G, et al. Performance of ChatGPT compared to clinical practice guidelines in making informed decisions for lumbosacral radicular pain: a cross-sectional study. J Orthop Sports Phys Ther. 2024;54:222–8.

[51]

Kasthuri VS, Glueck J, Pham H, et al. Assessing the accuracy and reliability of AI-generated responses to patient questions regarding spine surgery. J Bone Joint Surg Am. 2024;106:1136–42.

[52]

Temel MH, Erden Y, Bagcier F. Information quality and readability: ChatGPT’s responses to the most common questions about spinal cord injury. World Neurosurg. 2024;181:e1138–44.

[53]

Kirchner GJ, Kim RY, Weddle JB, Bible JE. Can artificial intelligence improve the readability of patient education materials? Clin Orthop Relat Res. 2023;481:2260–7.

[54]

Fabijan A, Polis B, Fabijan R, Zakrzewski K, Nowosławska E, Zawadzka-Fabijan A. Artificial intelligence in scoliosis classification: an investigation of language-based models. J. Pers. Med. 2023;13:1695.

[55]

Coraci D, Maccarone MC, Regazzo G, Accordi G, Papathanasiou JV, Masiero S. ChatGPT in the development of medical questionnaires. The example of the low back pain. Eur J Transl Myol. 2023;33:12114.

[56]

Stroop A, Stroop T, Alsofy SZ, et al. Large language models: are artificial intelligence-based chatbots a reliable source of patient information for spinal surgery? Eur Spine J. 2024;33:4135–43.

[57]

Kokabu T, Kanai S, Kawakami N, et al. An algorithm for using deep learning convolutional neural networks with three dimensional depth sensor imaging in scoliosis detection. Spine J. 2021;21:980–7.

[58]

Mandel W, Oulbacha R, Roy-Beaudry M, Parent S, Kadoury S. Image-guided tethering spine surgery with outcome prediction using spatio-temporal dynamic networks. IEEE Trans Med Imaging. 2021;40:491–502.

[59]

Zhang L, Pei BQ, Zhang SJ, et al. A new method for scoliosis screening incorporating deep learning with back images. Glob. Spine J. 2024;15:2062–74.

[60]

Won D, Lee HJ, Lee SJ, Park SH. Spinal stenosis grading in magnetic resonance imaging using deep convolutional neural networks. Spine (Phila Pa 1976). 2020;45:804–12.

[61]

Li KY, Weng JJ, Li HL, Ye H-B, Xiang J-W, Tian N-F. Development of a deep-learning model for diagnosing lumbar spinal stenosis based on CT images. Spine. 2024;49:884–91.

[62]

Suzuki H, Kokabu T, Yamada K, et al. Deep learning-based detection of lumbar spinal canal stenosis using convolutional neural networks. Spine J. 2024;24:2086–101.

[63]

Won D, Lee HJ, Lee SJ, Park SH. Lumbar spinal stenosis grading in multiple level magnetic resonance imaging using deep convolutional neural networks. Glob. Spine J. 2024;15:2309–17.

[64]

Löffler MT, Jacob A, Scharr A, et al. Automatic opportunistic osteoporosis screening in routine CT: improved prediction of patients with prevalent vertebral fractures compared to DXA. Eur Radiol. 2021;31:6069–77.

[65]

Nian SQ, Zhao YY, Li CJ, et al. Development and validation of a radiomics-based model for predicting osteoporosis in patients with lumbar compression fractures. Spine J. 2024;24:1625–34.

[66]

Oh J, Kim B, Oh G, Hwangbo Y, Ye JC. End-to-end semisupervised opportunistic osteoporosis screening using computed tomography. Endocrinol Metab (Seoul). 2024;39:500–10.

[67]

Oh S, Kang WY, Park H, et al. Evaluation of deep learning-based quantitative computed tomography for opportunistic osteoporosis screening. Sci Rep. 2024;14:9.

[68]

Wu Y, Yang XP, Wang MY, et al. Artificial intelligence assisted automatic screening of opportunistic osteoporosis in computed tomography images from different scanners. Eur Radiol. 2024;35:2287–95.

[69]

Zhang J, Xia L, Zhang XL, et al. Development and validation of a predictive model for vertebral fracture risk in osteoporosis patients. Eur Spine J. 2024;33:3242–60.

[70]

Li YC, Chen HH, Horng-Shing Lu H, Hondar Wu H-T, Chang M-C, Chou P-H. Can a deep-learning model for the automated detection of vertebral fractures approach the performance level of human subspecialists? Clin Orthop Relat Res. 2021;479:1598–612.

[71]

Ma S, Huang Y, Che X, Gu R. Faster RCNN-based detection of cervical spinal cord injury and disc degeneration. J Appl Clin Med Phys. 2020;21:235–43.

[72]

Kishikawa J, Kobayakawa K, Saiwai H, et al. Verification of the accuracy of cervical spinal cord injury prognosis prediction using clinical data-based artificial neural networks. J. Clin. Med. 2024;13:253.

[73]

Zhang ZF, Li N, Ding Y, Cheng H. An integrative nomogram based on MRI radiomics and clinical characteristics for prognosis prediction in cervical spinal cord injury. Eur Spine J. 2024;34:1164–76.

[74]

Zhang K, Xu N, Guo C, Wu J. MPF-net: an effective framework for automated cobb angle estimation. Med Image Anal. 2022;75:102277.

[75]

Zhang D, Chen B, Li S. Sequential conditional reinforcement learning for simultaneous vertebral body detection and segmentation with modeling the spine anatomy. Med Image Anal. 2021;67:101861.

[76]

Fang Y, Li W, Chen X, et al. Opportunistic osteoporosis screening in multi-detector CT images using deep convolutional neural networks. Eur Radiol. 2021;31:1831–42.

[77]

Zheng HD, Sun YL, Kong DW, et al. Deep learning-based highaccuracy quantitation for lumbar intervertebral disc degeneration from MRI. Nat Commun. 2022;13:12.

[78]

Liebmann F, von Atzigen M, Stütz D, et al. Automatic registration with continuous pose updates for marker-less surgical navigation in spine surgery. Med Image Anal. 2024;91:103027.

[79]

Pang SM, Pang CL, Su ZH, et al. DGMSNet: spine segmentation for MR image by a detection-guided mixed-supervised segmentation network. Med Image Anal. 2022;75:102261.

[80]

Xie L, Wisse LEM, Wang JC, et al. Deep label fusion: a generalizable hybrid multi-atlas and deep convolutional neural network for medical image segmentation. Med Image Anal. 2023;83:17.

[81]

Huang MY, Zhou SL, Chen XM, Lai H, Feng Q. Semi-supervised hybrid spine network for segmentation of spine MR images. Comput Med Imaging Graph. 2023;107:102245.

[82]

Serrador L, Villani FP, Moccia S, Santos CP. Knowledge distillation on individual vertebrae segmentation exploiting 3D U-Net. Comput Med Imaging Graph. 2024;113:102350.

[83]

Saravi B, Guzel HE, Zink A, et al. Synthetic 3D spinal vertebrae reconstruction from biplanar X-rays utilizing generative adversarial networks. J Pers Med. 2023;13:1642.

[84]

He Z, Lu N, Chen Y, et al. Conditional generative adversarial networkassisted system for radiation-free evaluation of scoliosis using a single smartphone photograph: a model development and validation study. Eclinicalmedicine. 2024;75:102779.

[85]

Meng N, Wong KK, Zhao M, Cheung JPY, Zhang T. Radiographcomparable image synthesis for spine alignment analysis using deep learning with prospective clinical validation. EClinicalMedicine. 2023;61:102050.

[86]

Gafencu MA, Velikova Y, Saleh M, et al. Shape completion in the dark: completing vertebrae morphology from 3D ultrasound. Int. J. Comput. Assist. Radiol. Surg. 2024;19:1339–47.

[87]

Wong JS, Reformat M, Parent E, Lou E. Validity and accuracy of automatic cobb angle measurement on 3D spinal ultrasonographs for children with adolescent idiopathic scoliosis: SOSORT 2024 award winner. Eur Spine J. 202534:1622–30.

[88]

Ward M, Maity A, Brown EDL, et al. Analysis of ChatGPT in the triage of common spinal complaints. World Neurosurg. 2024;192:e273–80.

[89]

Subramanian T, Araghi K, Amen TB, et al. Chat generative pretraining transformer answers patient-focused questions in cervical spine surgery. Clin. Spine Surg. 2024;37:E278–81.

[90]

Nian PP, Saleet J, Magruder M, et al. ChatGPT as a source of patient information for lumbar spinal fusion and laminectomy. Clin. Spine Surg. 2024;37:E394–403.

[91]

Ito S, Nakashima H, Yoshii T, et al. Deep learning-based prediction model for postoperative complications of cervical posterior longitudinal ligament ossification. Eur Spine J. 2023;32:3797–806.

[92]

Lee CH, Jo DJ, Oh JK, et al. Development and validation of an online calculator to predict proximal junctional kyphosis after adult spinal deformity surgery using machine learning. Neurospine. 2023;20:1272–80.

[93]

Zhang Y, Wan DH, Chen M, et al. Automated machine learning-based model for the prediction of delirium in patients after surgery for degenerative spinal disease. CNS Neurosci Ther. 2023;29:282–95.

[94]

Gonzalez-Suarez AD, Rezaii PG, Herrick D, et al. Using machine learning models to identify factors associated with 30-day readmissions after posterior cervical fusions: a longitudinal cohort study. Neurospine. 2024;21:620–32.

[95]

Shahrestani S, Reardon T, Brown NJ, et al. Developing mixed-effects models to compare the predictive ability of various comorbidity indices in a contemporary cohort of patients undergoing lumbar fusion. Neurosurgery. 2024;94:711–20.

[96]

Wang SK, Wang P, Li ZE, Li X-Y, Kong C, Lu S-B. Development and external validation of a nomogram for predicting postoperative adverse events in elderly patients undergoing lumbar fusion surgery: comparison of three predictive models. J. Orthop. Surg. Res. 2024;19:11.

[97]

Li Z, Jiang S, Song X, et al. Collaborative spinal robot system for laminectomy: a preliminary study. Neurosurg Focus. 2022;52:E11.

[98]

Li Z, Wang C, Song X, et al. Accuracy evaluation of a novel spinal robotic system for autonomous laminectomy in thoracic and lumbar vertebrae: a cadaveric study. J Bone Joint Surg Am. 2023;105:943–50.

[99]

Li X, Chen J, Wang B, et al. Evaluating the status and promising potential of robotic spinal surgery systems. Orthop Surg. 2024;16:2620–32.

[100]

Poore GD, Kopylova E, Zhu Q, et al. RETRACTED ARTICLE: Microbiome analyses of blood and tissues suggest cancer diagnostic approach. Nature. 2020;579:567–74.

[101]

Chen P, Ye J, Wang G, et al. GMAI-MMBench: a comprehensive multimodal evaluation benchmark towards general medical AI. ArXiv. 2024;abs/2408.03361.

[102]

Kochanski RB, Lombardi JM, Laratta JL, Lehman RA, O’Toole JE. Image-guided navigation and robotics in spine surgery. Neurosurgery. 2019;84:1179–89.

[103]

Móga K, Hölgyesi A, Zrubka Z, Péntek M, Haidegger T. Augmented or mixed reality enhanced head-mounted display navigation for in vivo spine surgery: a systematic review of clinical outcomes. J Clin Med. 2023;12:3788.

[104]

Azad TD, Warman A, Tracz JA, Hughes LP, Judy BF, Witham TF. Augmented reality in spine surgery -past, present, and future. Spine J. 2024;24:1–13.

RIGHTS & PERMISSIONS

2025 the Author(s). published by Wolters Kluwer Health, Inc. on behalf of Higher Education Press.

AI Summary AI Mindmap
PDF (1271KB)

1071

Accesses

0

Citation

Detail

Sections
Recommended

AI思维导图

/