Distinguishing Rectal Cancer from Colon Cancer Based on the Support Vector Machine Method and RNA-sequencing Data

Yan Zhang , Yuan Wu , Zi-ying Gong , Hai-dan Ye , Xiao-kai Zhao , Jie-yi Li , Xiao-mei Zhang , Sheng Li , Wei Zhu , Mei Wang , Ge-yu Liang , Yun Liu , Xin Guan , Dao-yun Zhang , Bo Shen

Current Medical Science ›› 2021, Vol. 41 ›› Issue (2) : 368 -374.

PDF
Current Medical Science ›› 2021, Vol. 41 ›› Issue (2) : 368 -374. DOI: 10.1007/s11596-021-2356-8
Article

Distinguishing Rectal Cancer from Colon Cancer Based on the Support Vector Machine Method and RNA-sequencing Data

Author information +
History +
PDF

Abstract

Colorectal cancer (CRC) is the third most commonly diagnosed cancer worldwide. Several studies have indicated that rectal cancer is significantly different from colon cancer in terms of treatment, prognosis, and metastasis. Recently, the differential mRNA expression of colon cancer and rectal cancer has received a great deal of attention. The current study aimed to identify significant differences between colon cancer and rectal cancer based on RNA sequencing (RNA-seq) data via support vector machines (SVM). Here, 393 CRC samples from the The Cancer Genome Atlas (TCGA) database were investigated, including 298 patients with colon cancer and 95 with rectal cancer. Following the random forest (RF) analysis of the mRNA expression data, 96 genes such as HOXB13, PRAC, and BCLAF1 were identified and utilized to build the SVM classification model with the Leave-One-Out Cross-validation (LOOCV) algorithm. In the training (n=196) and the validation cohorts (n=197), the accuracy (82.1 % and 82.2 %, respectively) and the AUC (0.87 and 0.91, respectively) indicated that the established optimal SVM classification model distinguished colon cancer from rectal cancer reasonably. However, additional experiments are required to validate the predicted gene expression levels and functions.

Keywords

colon cancer / rectal cancer / support vector machine / classification / gene selection

Cite this article

Download citation ▾
Yan Zhang, Yuan Wu, Zi-ying Gong, Hai-dan Ye, Xiao-kai Zhao, Jie-yi Li, Xiao-mei Zhang, Sheng Li, Wei Zhu, Mei Wang, Ge-yu Liang, Yun Liu, Xin Guan, Dao-yun Zhang, Bo Shen. Distinguishing Rectal Cancer from Colon Cancer Based on the Support Vector Machine Method and RNA-sequencing Data. Current Medical Science, 2021, 41(2): 368-374 DOI:10.1007/s11596-021-2356-8

登录浏览全文

4963

注册一个新账户 忘记密码

References

[1]

BrayF, FerlayJ, SoerjomataramI, et al.. Global cancer statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J Clin, 2018, 68(6): 394-424

[2]

PietrzykL, TorresA, MaciejewskiR, et al.. Obesity and Obese-related Chronic Low-grade Inflammation in Promotion of Colorectal Cancer Development. Asian Pac J Cancer Prev, 2015, 16(10): 4161-4168

[3]

KeumN, GiovannucciE. Global burden of colorectal cancer: emerging trends, risk factors and prevention strategies. Nat Rev Gastroenterol Hepatol, 2019, 16(12): 713-732

[4]

SiegelRL, MillerKD, Goding SauerA, et al.. Colorectal cancer statistics, 2020. CA Cancer J Clin, 2020, 70(3): 145-164

[5]

LiM, LiJY, ZhaoAL, et al.. Colorectal cancer or colon and rectal cancer: Clinicopathological comparison between colon and rectal carcinomas. Oncology, 2007, 73(1–2): 52-57

[6]

MinooP, ZlobecI, PetersonM, et al.. Characterization of rectal, proximal and distal colon cancers based on clinicopathological, molecular and protein profiles. Int J Oncol, 2010, 37(3): 707-718

[7]

IchimasaK, KudoSE, MiyachiH, et al.. Comparative clinicopathological characteristics of colon and rectal T1 carcinoma. Oncol Lett, 2017, 13(2): 805-810

[8]

O’Rahilly R, Müller F. Basic human anatomy: A regional study of human structure. W.B. Saunders company, 1983.

[9]

PihlE, HughesES, McDermottFT, et al.. Lung recurrence after 24 curative surgery for colorectal cancer. Dis Colon Rectum, 1987, 30(6): 417-419

[10]

KapiteijnE, MarijnenCA, NagtegaalID, et al.. Preoperative radiotherapy combined with total mesorectal excision for resectable rectal cancer. N Engl J Med, 2001, 345(9): 638-646

[11]

SauerR, BeckerH, HohenbergerW, et al.. Preoperative versus postoperative chemoradiotherapy for rectal cancer. N Engl J Med, 2004, 351(17): 1731-1740

[12]

TamasK, WalenkampAM, de VriesEG, et al.. Rectal and colon cancer: Not just a different anatomic site. Cancer Treat Rev, 2015, 41(8): 671-679

[13]

JimenezIG, LadronAL, PancorboDM, et al.. Influence of the localization of the primary tumor in the survival of patients with metastatic colon-rectal cancer treated with bevacizumab. J Clin Onco, 2014, 32: 546

[14]

LoudJT, MurphyJ. Cancer Screening and Early Detection in the 21(st) Century. Semin Oncol Nurs, 2017, 33(2): 121-128

[15]

DekkerE, TanisPJ, VleugelsJLA, et al.. Colorectal cancer. Lancet, 2019, 394(10207): 1467-1480

[16]

KandothC, McLellanMD, VandinF, et al.. Mutational landscape and significance across 12 major cancer types. Nature, 2013, 502(7471): 333-339

[17]

VargasAJ, HarrisCC. Biomarker development in the precision medicine era: lung cancer as a case study. Nat Rev Cancer, 2016, 16(8): 525-537

[18]

Cancer Genome Atlas Network. Comprehensive Molecular Characterization of Human Colon and Rectal Cancer. Nature, 2012, 487(7407): 330-337

[19]

BosmanFT, YanP, TejparS, et al.. Tissue biomarker development in a multicentre trial context: a feasibility study on the PETACC3 stage II and III colon cancer adjuvant treatment trial. Clin Cancer Res, 2009, 15(17): 5528-5533

[20]

Sanz-PamplonaR, CorderoD, BerenguerA, et al.. Gene expression differences between colon and rectum tumors. Clin Cancer Res, 2011, 17(23): 7303-7312

[21]

YamadaA, YuP, LinW, et al.. A RNA-Sequencing approach for the identification of novel long non-coding RNA biomarkers in colorectal cancer. Sci Rep, 2018, 8(1): 575

[22]

OhBY, ChoJ, HongHK, et al.. Exome and transcriptome sequencing identifies loss of PDLIM2 in metastatic colorectal cancers. Cancer Manag Res, 2017, 9: 581-589

[23]

LeeJR, KwonCH, ChoiY, et al.. Transcriptome analysis of paired primary colorectal carcinoma and liver metastases reveals fusion transcripts and similar gene expression profiles in primary carcinoma and liver metastases. BMC Cancer, 2016, 16: 539

[24]

WeitschekE, LauroSD, CappelliE, et al.. CamurWeb: a classification software and a large knowledge base for gene expression data of cancer. BMC Bioinformatics, 2018, 19: 354

[25]

BrownMPS, GrundyWN, LinD, et al.Support Vector Machine Classification of Microarray Gene Expression Data, 1999, Santa Cruz, University of CaliforniaTechnical Report UCSC-CRL-99–09

[26]

LiawA, WienerM. Classification and Regression by randomForest. R News, 2002, 2(3): 18-22

[27]

VapnikVN. The Nature of Statistical Learning Theory. IEEE T Neural Network, 1997, 8: 1564

[28]

BeckerN, ToedtG, LichterP, et al.. Elastic SCAD as a novel penalization method for SVM classification tasks in high-dimensional data. BMC Bioinformatics, 2011, 12: 138

[29]

BreimanL. Random Forests. Machine Learning, 2001, 45: 5-32

[30]

GabereMN, HusseinMA, AzizMA. Filtered selection coupled with support vector machines generate a functionally relevant prediction model for colorectal cancer. Onco Targets Ther, 2016, 9: 3313-3325

[31]

ZhiJ, SunJ, WangZ, et al.. Support vector machine classifier for prediction of the metastasis of colorectal cancer. Int J Mol Med, 2018, 41(3): 1419-1426

[32]

LiJN, ZhaoL, WuJ, et al.. Differences in gene expression profiles and carcinogenesis pathways between colon and rectal cancer. J Dig Dis, 2012, 13: 24-32

[33]

SlatteryML, CurtinK, WolffRK, et al.. A comparison of colon and rectal somatic DNA alterations. Dis Colon Rectum, 2009, 52: 1304-1311

[34]

SlatteryML, PellattDF, MullanyLE, et al.. Differential gene expression in colon tissue associated with diet, lifestyle, and related oxidative stress. PLoS One, 2015, 10(7): e0134406

[35]

SlatteryML, FriedmanGD, PotterJD, et al.. Adescription of age, sex, and site distributions of colon carcinoma in three geographic areas. Cancer, 1996, 78(8): 1666-1670

[36]

FleshnerP, SlaterG, AufsesAHJr.. Age and sex distribution of patients with colorectal cancer. Dis Colon Rectum, 1989, 32(2): 107-111

[37]

LuZ, ChenH, JiaoX, et al.. Prediction of immune checkpoint inhibition with immune oncology-related gene expression in gastrointestinal cancer using a machine learning classifier. J Immunother Cancer, 2020, 8(2): e000631

[38]

YangWL, LeeYE, ChenMH, et al.. In-silico drug screening and potential target identification for hepatocellular carcinoma using Support Vector Machines based on drug screening result. Gene, 2013, 518(1): 201-208

[39]

KristiansenI, StephanC, JungK, et al.. Sensitivity of HOXB13 as a diagnostic immunohistochemical marker of prostatic origin in prostate cancer metastases: comparison to PSA, Prostein. Androgen Receptor, ERG, NKX3.1, PSAP, and PSMA. Int J Mol Sci, 2017, 18(6): E1151

[40]

Cantile M, Pettinato G, Procino A, et al. In vivo expression of the whole HOX gene network in human breast cancer, Eur J Cancer, 39(2003):257–264

[41]

MaedaK, HamadaJ, TakahashiY, et al.. Altered expressions of HOX genes in human cutaneous malignant melanoma. Int J Cancer, 2005, 114(3): 436-441

[42]

ZhaoY, YamashitaT, IshikawaM. Regulation of tumor invasion by HOXB13 gene over expressed in human endometrial cancer. Oncol Rep, 2005, 13(4): 721-726

[43]

JungC, KimRS, ZhangH, et al.. HOXB13 is downregulated in colorectal cancer to confer TCF4-mediated transactivation. Br J Cancer, 2005, 92(12): 2233-2239

[44]

BarrosE, SavageKS, HarkinDP. 314: BCLAF1; a multi-faceted protein involved DNA repair, apoptosis and autophagy. Eur J Cancer, 2014, 50: S75

AI Summary AI Mindmap
PDF

105

Accesses

0

Citation

Detail

Sections
Recommended

AI思维导图

/