1 Introduction
In resectable non-small cell lung cancer (NSCLC), some patients experience disease recurrence post-radical surgical resection [
1,
2], highlighting the need for enhanced risk stratification and personalized monitoring strategies. Molecular residual disease (MRD) detection via circulating tumor DNA (ctDNA) has emerged as a promising approach to identify potentially cured populations [
3] and individuals at high-risk of disease recurrence [
4,
5] as well as potentially guide adjuvant therapy decisions [
6,
7]. Although ctDNA assays are extensively used for detecting various gene variations, they often exhibit reduced sensitivity to structural variations, copy number alterations (CNA), and splice site variants [
8]. Given ctDNA’s extremely low presence in plasma samples, especially post-curative treatment, and its potential origin in multiple tumors or clones, the detection of structural variations, CNA and splice site variants relies on sufficient read coverage [
9]. Studies that reported the clinical potential of perioperative ctDNA in informing disease recurrence of lung cancer included a small number of patients, 15 patients in Zhang
et al.’s [
3] and 16 patients in Fu
et al.’s study [
4], with gene fusions,
MET exon 14 skipping, or
MET amplification. Some studies excluded structural variations or CNA from ctDNA detection during MRD testing [
5,
10]. As such, no studies have described the clinical detection efficacy of ctDNA-based MRD testing in operable NSCLC with oncogenic gene fusions,
MET exon skipping, or
MET amplification, separately and comprehensively. To this end, we conducted a prospective observational study to elucidate the role of MRD detection in patients with operable NSCLC harboring oncogenic fusions,
MET exon skipping, or
MET amplification following surgical resection and explore the association between postoperative MRD mutations and clinical outcomes.
2 Materials and methods
2.1 Study design and patients
This prospective observational study was designed to evaluate the clinical utility of MRD testing in operable NSCLC with oncogenic actionable gene fusions, MET exon 14 skipping, or de novo MET amplification without other driver mutations. A total of 49 NSCLC patients who harbored next-generation sequencing (NGS)-confirmed actionable gene fusions, MET exon 14 skipping, or de novo MET amplification and underwent surgery from January 2018 to March 2023 at Guangdong Provincial People’s Hospital, China. Six patients were excluded because of a lack of perioperative blood samples (n = 4) or loss of follow-up (n = 2). Ultimately, 43 patients were included in this study. Of these, 37 early-stage patients underwent radical surgery, and six patients with oligopersistent NSCLC underwent systemic treatment and received surgery as a local consolidative therapy. Lung cancer and disease stages were classified according to the World Health Organization criteria and the American Joint Committee on Cancer staging system (eighth edition), respectively.
2.2 Sample collection
Tumor tissues were obtained via needle biopsy or surgery. Peripheral blood samples (20 mL) were collected in Streck tubes before and/or after surgery and during follow-up. The landmark time points were samples collected in the time window of 3–37 days after surgery. Patients were then scheduled to be followed up every 3 or 6 months with CT scans and ctDNA tests until disease recurrence was determined by CT scan results. The longitudinal time point was defined as the serial postoperative time points after surgery until disease recurrence (including the disease recurrence time point). The surveillance time point was defined as within 6 months before clinical recurrence (including the disease recurrence time point).
2.3 Sample processing and DNA extraction
Tumor DNA was extracted from formalin-fixed, paraffin-embedded tumor tissue specimens using the ReliaPrep™ FFPE gDNA Miniprep System (Promega, Madison, WI, USA). Peripheral blood samples were separated by concentration at 1600× g for 10 min, and the supernatant was transferred to microcentrifuge tubes and centrifuged at 16 000× g for 10 min to remove cell debris. cfDNA was isolated using the QIAamp Circulating Nucleic Acid Kit (Qiagen, Hilden, Germany). Peripheral blood leukocytes (PBLs) were collected in the first centrifugation step to extract germline genomic DNA. To avoid contamination by ctDNA, the junction between the plasma and PBLs was first discarded and then the PBLs were carefully transferred. The QIAamp DNA Blood Mini Kit (Qiagen, Hilden, Germany) was used to extract the germline genomic DNA from the PBLs. The concentration and fragment distribution of cfDNA were assessed using an Agilent 2100 Bioanalyzer (Agilent Technologies, Santa Clara, CA, USA).
2.4 Targeted next-generation sequencing
Genomic DNA (400–800 ng) extracted from PBLs and tumor specimens was sheared into fragments at a 200–250 bp peak using a Covaris S2 ultrasonicator (Covaris, Woburn, MA, USA). A volume of 20–80 ng DNA from the plasma was used for library construction, and unique identifiers were tagged onto each double-stranded DNA to distinguish authentic somatic mutations from artifacts. Indexed Illumina NGS libraries were prepared from PBLs, tumors, and plasma DNA using a KAPA Library Preparation Kit (Kapa Biosystems, Wilmington, MA, USA).
DNA libraries of the tumor and its paired germline were hybridized to a previously reported custom-designed panel, which covers ~1.5 Mbp of the genome and targets 1021 cancer-related genes [
3,
4]. DNA libraries of plasma and paired germline DNA were hybridized to a custom-designed biotinylated oligonucleotide probe covering 550 kbp of the genome, and targeting 338 cancer-related genes [
3,
4]. The hybridized libraries were sequenced using a 100-bp paired-end configuration on a Gene
+Seq-2000 sequencing system (GenePlus-Suzhou, Suzhou, China). The mean depth of coverage for tissue DNA and ctDNA were 1808× and 38 756× respectively.
2.5 Tumor somatic variant calling
After removing terminal adaptor sequences and low-quality reads using fastp [
11], the remaining reads were mapped to the reference human genome (hg19) and aligned using Burrows-Wheel Aligner (v0.7.12-r1039) with default parameters. Duplicate reads were removed using the MarkDuplicates tool in Picard (v4.0.4.0; Broad Institute, Cambridge, MA, USA). RealDcaller (v1.8.1, Geneplus-Beijing, Beijing, China, in-house) [
3] and GATK (v3.6-0-g89b7209, Broad Institute) were employed to call the tumor somatic single nucleotide variants (SNVs) and small insertions and deletions (indels). CNVkit was used to identify copy number variations [
12]. Copy number alterations of
MET with copy number ≥ 2.6 were considered the potential amplification, alterations were manually confirmed with a CNA plot. NCsv (v0.2.3, Geneplus-Beijing, Beijing, China, in-house) was applied to detect structural variants [
3,
13]. All the candidate variants were manually confirmed using an integrative genomics viewer browser. Variants were filtered to exclude germline mutations in dbSNP and those that occur at a population frequency > 1% in ExAc (v0.3.1) or 1000 Genomes Project. Variants captured by PBL sequencing that are canonically associated with hematologic malignancies but do not meet the leukemia diagnostic criteria are considered to have clonal hematopoiesis of indeterminate potential. An in-house database of clonal hematopoiesis variants > 10 000 pan-cancer patients and healthy individuals was used to filter clonal hematopoiesis-related variants [
14].
2.6 ctDNA-based MRD detection
The ctDNA-based MRD detection method has been previously reported [
3,
4] and is detailed below. Duplicated reads were removed, and sequencing errors were polished using unique identifiers and realSeq (v3.1.0, Geneplus-Beijing, in-house) [
3]. SNV and indels were called using realDcaller and GATK. TNscope (Sentieon Inc., San Jose, CA, USA) was used as an auxiliary software to improve the detection of long indels. Variants meeting the following criteria were filtered out: (1) variants present in matched germline DNA, (2) variants occurring at a population frequency > 1% in ExAc (v0.3.1) or the 1000 Genomes Project, (3) variants with positional depth less than 300×, and (4) for sequencing error removal, a set of ~500 healthy plasma samples was sequenced to construct a sequencing background for each targeted SNV.
Depending on whether the plasma variants occurred in the matched tumor tissue, the resulting plasma variants were classified as tissue-derived and ctDNA-private. In other words, tissue-derived ctDNA mutations occurred in matched tumor tissues, whereas ctDNA-private mutations did not. The tissue-derived variants showed statistically significant differences in background characteristics and were considered reliable. When the following conditions were met, true somatic mutations were identified as follows: (1) for tissue-specific driver mutations, ≥ 2 high-quality support reads; and (2) for other tissue-specific non-recurrent mutations, ≥ 4 high-quality support reads. For ctDNA-private variants, the reliable somatic mutations were identified if they met the following stringent conditions: (1) for hotspot mutations, ≥ 4 high-quality support reads; (2) for non-hotspots, ≥ 8 high-quality support reads; and (3) clonal hematopoiesis were filtered through deep sequencing of PBLs. Positive ctDNA was defined as the detection of at least one variant [
3,
4].
2.7 Statistical analysis
The primary outcome measure was disease-free survival (DFS), which was assessed using standard radiological criteria. DFS was defined as the time from the date of surgery to the first radiological recurrence and was censored at the last follow-up. Statistical analyses were performed using the R statistical software (v4.1.3 for Windows). Analysis of the sensitivity, specificity, positive predictive value (PPV), and negative predictive value (NPV) was conducted in patients with disease recurrence and patients without disease recurrence who were followed up for at least 6 months after the first MRD test. Survival analysis was performed using the Kaplan–Meier method and compared using the log-rank test. The Mann–Whiney U test and Student’s t-test were used for non-normally and normally distributed continuous variables, respectively. A comparison of categorical variables was conducted with Pearson’s χ2 test or Fisher’s exact tests. A two-sided P value < 0.05 was considered significant. Univariate and multivariate analyses were performed using the Cox proportional hazards regression analysis and included all factors that were significant in the univariate analysis.
3 Results
3.1 Patient characteristics
Forty-nine NSCLC patients with actionable gene fusions, MET exon 14 skipping, or de novo MET amplification who underwent surgery were included in this study. Four patients had no perioperative ctDNA test and two patients lost to follow-up were excluded; the remaining 43 patients were included for analysis of the clinical performance of perioperative ctDNA, including 22 patients with ALK fusion, 9 with ROS1 fusion, 6 with RET fusion, 1 with FGFR1 fusion, 3 with MET exon 14 skipping and 2 with de novo MET amplification (Fig.1). Notably, patients with de novo MET amplification did not harbor any other driver mutations. The median age at diagnosis was 53 years (range, 32–73), with 55.8% females and 16.3% smokers. Most patients had adenocarcinoma (n = 38, 88.4%). Nineteen patients had stage III disease (44.2%), followed by stage I (n = 14, 32.5%) and stage II (n = 4, 9.3%). Six patients (14.0%) with oligopersistent NSCLC who underwent systemic therapy and received surgery as a local consolidative therapy were also included. Lymph node metastasis was in 28 (65.1%) patients. With a median follow-up of 9.9 months (range, 0.1–54.2), 13 patients experienced disease recurrence (Tables S1 and S2).
3.2 MRD predicts prognosis in operable NSCLC with fusions, MET exon 14 skipping or MET amplification
A total of 111 perioperative blood samples (including four preoperative and 107 postoperative samples) were collected in our study (Fig.2). Four patients had preoperative blood samples, and all patients had postoperative samples. Thirty-four patients and 41 blood samples were collected 3–37 days after surgery and used for landmark MRD analysis (Table S3). Three patients (P008, P014, and P040) had landmark ctDNA samples collected 37 days after surgery, and one of them (P008) also had landmark ctDNA sample collected 5 days after surgery. Two of these three patients had negative landmark MRD (P008 and P040) and remained disease-free at 6.9 and 17.9 months of follow-up, respectively, while one patient (P014) had a positive landmark MRD and experienced disease recurrence 1.8 months after the landmark MRD test (Fig.2). These results indicated that clinical outcomes in patients with NSCLC could be predicted 37 days post-surgery. Among the 34 patients who underwent a landmark MRD test, 10 had a detectable landmark MRD, and its proportion was similar across patients with different fusion/MET alterations (Fig. S1A). Importantly, patients with positive landmark MRD had significantly reduced DFS compared with those with negative landmark MRD (HR, 8.02; 95% CI, 1.82–35.36; P < 0.001; Fig.2). Univariate Cox proportional hazards regression analysis showed that advanced tumor stage (III/IV), lymph node involvement, and positive landmark MRD were associated with worse DFS, while positive landmark MRD was the only independent risk factor in the multivariate analysis (Fig. S2A).
All 43 patients and 107 blood samples were enrolled for longitudinal MRD analysis. Fourteen patients had positive longitudinal MRD, and their proportions were comparable across patients with different fusion/MET alterations (Fig. S1B). Expectably, patients with positive longitudinal MRD had markedly worse DFS than those with negative longitudinal MRD (HR, 8.65; 95% CI, 2.57–29.1; P < 0.001; Fig.2), and positive longitudinal MRD was independently associated with worse DFS (Fig. S2B).
3.3 Clinical efficiency of MRD in operable NSCLC with fusions, MET exon 14 skipping or MET amplification
To eliminate the effect of a short follow-up time on the performance of MRD in predicting disease recurrence, we excluded recurrence-free patients < 6 months after the first postoperative MRD test. Patients with a positive landmark MRD had a higher recurrence rate (100% (7/7)) than those with a negative landmark MRD (17.7% (3/17), P < 0.001), with a sensitivity of 70%, specificity of 100%, PPV of 100% and NPV of 82.4% (Fig. S3A and S3B). Moreover, the recurrence rate was significantly higher in patients with positive longitudinal MRD (positive vs. negative, 90.9% (10/11) vs. 13.6% (3/22), P < 0.001), the sensitivity and NPV increased to 76.9% and 86.4%, respectively, and the specificity and PPV were 95% and 90.9%, respectively (Fig. S3A and S3B). Of the patients in the surveillance MRD test, seven had positive ctDNA, producing a sensitivity of 70% (Fig. S3A).
Of the 13 patients with radiological recurrence, positive postoperative MRD was found in 10 patients, and detectable ctDNA was seen on the median of 4.2 months (range, 1.0–27.0 months) before radiological recurrence (Fig.3). Three patients (P003, P005, and P037) showed undetectable ctDNA before recurrence (Fig.2). Two of them (P003 and P037) had brain-only recurrence, and patient P005 had pleural recurrence, supporting the low sensitivity of the MRD test for brain-only and intrathoracic-only metastases [
3,
8,
10].
Clinical surveillance after curative-intent surgery using diagnostic blood biomarkers such as carcinoembryonic antigen (CEA) is recommended in current treatment regimens to inform recurrence early. However, their sensitivity is limited [
15,
16]. Indeed, four patients with disease recurrence in our cohort had successive CEA detection during follow-up; none of them had positive CEA (< 5 ng/mL) before recurrence, while positive longitudinal ctDNA identified 10 out of 13 (76.9%) disease recurrence (
P = 0.01). For example, patient P009 with stage IIB adenocarcinoma underwent postoperative MRD and CEA monitoring. CEA was negative before recurrence and turned positive at relapse, whereas ctDNA was detected 2.8 months before imaging recurrence (Fig.3). Our data suggested that ctDNA-MRD is superior to CEA for predicting disease recurrence.
3.4 ctDNA-MRD mutation types are associated with patient outcome
We then explored the relationship between the mutation types in postoperative MRD and prognosis. Nineteen positive postoperative samples from 14 patients were included in this exploration. ROS1 and TP53 were the two most commonly altered genes in detectable longitudinal ctDNA (Fig.4). ctDNA variants were grouped into tissue-derived and ctDNA-private variants according to whether they occurred in their matched tumor tissues; the former occurred in their matched tumor tissues, whereas the latter did not. Eleven and three patients had tissue-derived and ctDNA-private mutations, respectively. Fusion/MET alterations detected in the tumor tissues were found in the longitudinal ctDNA of the six patients (Fig.4). The detection rates of fusion/MET alterations at the landmark and longitudinal points were comparable across patients with different fusion/MET alterations (Fig. S4A and S4B). Patients with fusion/MET alterations in longitudinal ctDNA had clinical features similar to those of patients without fusion/MET alterations (Table S4). Intriguingly, patients with fusion/MET alterations of longitudinal ctDNA had reduced DFS, although the difference is at the very edge of significance (HR, 2.84; 95% CI, 0.69–11.64; P = 0.053; Fig.4). Moreover, we grouped patients into those with fusion/MET alterations in their longitudinal ctDNA, those with other tissue-derived alterations except fusion/MET alterations and those with ctDNA-private alterations. We observed that patients with fusion/MET alterations in their longitudinal ctDNA tended to have worse DFS than those in the other two groups and that patients with ctDNA-private alterations tended to have better DFS than those in the other two groups, although this difference was not statistically significant (P > 0.05, Fig.4).
4 Discussion
Oncogenic driver gene fusions comprised ~10% of Chinese patients with NSCLC [
17].
MET exon skipping and
de novo MET amplification occurred in 2% [
18] and 1%–5% [
19] NSCLC patients, respectively. However, owing to the limited sensitivity of ctDNA assays for detecting gene fusions, copy number changes, and splicing variations in plasma samples, expert consensus recommends prioritizing tissue-based testing for these tumor types [
8,
20]. Nevertheless, the potent prognostic value of ctDNA-based MRD detection in operable lung cancer cannot be discounted. Hence, in this prospective observational study, we examined the clinical performance of MRD testing in operable NSCLC patients with actionable gene fusions,
MET exon 14 skipping, and
de novo MET amplification for the first time. Our results demonstrated that detectable postoperative MRD is significantly associated with a high risk of recurrence and poor clinical outcomes. Moreover, besides postoperative MRD status, the mutation types of postoperative plasma samples, classified as whether they are tissue-derived gene fusions or
MET alterations, also inform prognosis.
Our results suggest that MRD serves as a reliable predictor of disease recurrence in operable NSCLC harboring oncogenic actionable fusion mutations,
MET exon skipping, or
de novo MET amplification, with a PPV of 100% and 90.9% at the landmark and longitudinal time points, respectively, comparable with previous NSCLC cohorts [
3,
21]. However, longitudinal ctDNA detection produced an NPV of 86.4%, which was lower than those reported by Zhang (96.8%) [
3] and Chen (96.6%) [
22]. This disparity might be due to our smaller cohort size (
n = 43) compared with 261 in Zhang’s study and 181 in Chen’s study, as well as a shorter median follow-up time (9.9 months) versus 19.7 in Zhang’s study and 1071 days in Chen’s study. Postoperative longitudinal MRD detection demonstrated the potential to identify recurrence earlier than imaging modalities with a median lead time of 4.2 months, similar to that in previously reported cohorts [
3,
23]. Collectively, this study provides evidence for the use of ctDNA-based MRD detection in postoperative recurrence risk stratification in operable NSCLC with actionable fusion mutations,
MET exon skipping or
de novo MET amplification.
The comparable clinical performance between our cohort with actionable gene fusions,
MET exon skipping, or
de novo MET amplification-driven NSCLC patients and other cohorts with SNV/indels mutation-driven NSCLC patients may be due to our tumor-informed MRD assay. Having prior knowledge of tumor mutations can maximize ctDNA detection sensitivity [
24]. Notably, among patients with undetectable MRD but experienced recurrence, the primary sites of recurrence were identified as the brain or small-volume intrathoracic tumors. This finding aligns with previous reports and highlights the limitations of ctDNA-based MRD, presenting false negatives [
3,
8,
10]. Thus, it emphasizes the necessity of imaging examinations such as brain magnetic resonance imaging in follow-up surveillance to address this specific limitation.
Diseases with fusion variants exhibit unique tumor biological behavior characterized by a potentially brief window of early-stage residence and rapid progression to advanced stages [
25]. Therefore, the detection of fusion or non-fusion mutations in the blood may indicate different prognoses. We observed worse prognosis in patients with detectable driver fusions, splices, and amplification mutations in longitudinal ctDNA than in those with other tissue-derived mutations or ctDNA-private mutations. The absence of statistical significance in DFS between them may be explained by the limited number of patients enrolled in this analysis (
n = 14). We speculate that a worse prognosis in patients with detectable driver fusions, splices, and amplification mutations in longitudinal plasma samples may be attributed to the sustained oncogenic effects of driver mutations on promoting tumor growth.
This study has several limitations. First, due to the relatively low incidence of fusion mutations, MET splicing variants and de novo MET amplification without other driver mutations in the NSCLC population, our cohort had a limited sample size, primarily comprising adenocarcinoma (88.4%). Second, the correlation between postoperative MRD status and the treatment course was not explored in this study. Therefore, the generalizability of our findings to larger NSCLC populations with comprehensive histology, treatment data and long-term follow-up should be confirmed in future prospective studies with larger sample sizes.