-
Abbreviations
- CI
- confidence interval
- FDR
- false discovery rate
- HR
- hazard ratio
- LUAD
- lung adenocarcinoma
- LUSC
- lung squamous cell carcinoma
- NSCLC
- non-small-cell lung cancer
- QC
- quality control
- SD
- standard deviation
- TCGA
- The Cancer Genome Atlas
Lung cancer is a leading cause of cancer mortality worldwide. In the United States, lung cancer was estimated as likely to account for 154 050 deaths in 2018, or one-fourth of all cancer deaths (Siegel et al., ). A large proportion of lung cancer cases are attributed to smoking, a well-known risk factor (Flanders et al., ), and smoking cessation prolongs survival and decreases mortality of lung cancer patients (Balduyck et al., ; Parsons et al., ). However, the underlying mechanisms of these benefits remain largely unclear (Bhatt et al., ; Parsons et al., ).
DNA methylation, a reversible epigenetic modification, regulates gene expression and provides potential cancer biomarkers and therapeutic targets (Egger et al., ; Feinberg and Tycko, ), including for non-small-cell lung cancer (NSCLC) (Guo et al., ; Shen et al., ; Wei et al., ). Furthermore, as a potential mechanistic link between cigarette smoking and disease, DNA methylation changes can result from various environmental exposures and may explain part of the association between smoking and cancer recurrence or mortality (Lee and Pausova, ; Shui et al., ).
Progression of complex diseases, such as cancer, results from interactions between clinical, environmental, genetic, and epigenetic factors (Lacombe et al., ; Mcnerney et al., ). However, most epigenome-wide association studies are designed to identify main effects using a standard marginal test (Karlsson et al., ) while ignoring epigenetic–environment interactions. These traditional mining procedures may reduce the power to identify new epigenomic biomarkers (Slade and Kraft, ).
In this study, we hypothesized that epigenetic and smoking cessation interactions may affect NSCLC survival. Epigenome-wide DNA methylation data composed of four study cohorts containing lung adenocarcinoma (LUAD) and lung squamous cell carcinoma (LUSC) cases were used for discovery, and the findings were independently validated in The Cancer Genome Atlas (TCGA) data.
Materials and methods Study populationEarly-stage (stage I–II) LUAD and LUSC patients who were former or current smokers were included in the study. Never smokers were defined as those who smoked ≤ 100 cigarettes over a lifetime. Current smokers were defined as those who were smoking within 1 year of diagnosis. Former smokers were defined as smokers who quit > 1 year before diagnosis or interview (Suk et al., ). We encoded the variable smoking cessation as ‘yes’ for former smokers and ‘no’ for current smokers. Data were harmonized from five international study centers, which have been previously described (Guo et al., ; Shen et al., ; Wei et al., ; Zhang et al., ). All patients provided written informed consent, and the study methodologies conformed to the standards set by the Declaration of Helsinki and received approval by its respective institutional review board.
HarvardThe Harvard Lung Cancer Study cohort was described previously (Suk et al., ). Briefly, all patients were recruited at Massachusetts General Hospital (MGH) from 1992 to present and had newly diagnosed, histologically confirmed primary NSCLC. We included 133 early-stage LUAD and LUSC patients who were former or current smokers for the current study. DNA was extracted from tumor specimens that were evaluated by an MGH pathologist for amount (tumor cellularity > 70%) and quality of tumor cells and histologically classified using World Health Organization criteria. The study protocol was approved by the Institutional Review Boards at Harvard School of Public Health and MGH.
SpainThe Spain study population was reported previously (Sandoval et al., ) and included 196 LUAD and LUSC patients recruited at eight subcenters from 1991 to 2009. In brief, tumor DNA was extracted from fresh-frozen tumor specimens that were collected by surgical resection, and the median clinical follow-up was 7.2 years. The study was approved by the Bellvitge Biomedical Research Institute institutional review board.
NorwayThe Norway cohort consisted of 116 LUAD patients with operable lung cancer tumors who were seen at Oslo University Hospital, Rikshospitalet, Norway, in 2006–2011 (Bjaanæs et al., ). Tumor tissues obtained during surgery were snap-frozen in liquid nitrogen and stored at −80 °C until DNA isolation. The project was approved by Oslo University institutional review board and regional ethics committee (S-05307).
SwedenThe Sweden cohort included 85 LUAD and LUSC patients. Tumor DNA was collected from early-stage lung cancer patients who underwent an operation at the Skane University Hospital, Lund, Sweden (Karlsson et al., ). The study was approved by the Regional Ethical Review Board in Lund, Sweden (Registration no. 2004/762 and 2008/702).
TCGAThe TCGA database contains 562 early-stage LUAD and LUSC patients who have full information of survival time and covariates. Level 1 HumanMethylation450 DNA methylation data (image data) for each patient were downloaded on October 1, 2015.
Quality control proceduresDNA methylation was profiled using Infinium HumanMethylation450 BeadChips (Illumina Inc., San Diego, CA, USA) for all patients. Raw image data were imported into GenomeStudio Methylation Module V1.8 (Illumina Inc.) to calculate methylation signals and to perform normalization, background subtraction, and quality control (QC). Beta values, which range from 0% (unmethylated) to 100% (methylated), were used to measure methylation level of each probe. Unqualified probes were excluded if they met any one of the following QC criteria: (a) failed detection (P > 0.05) in > 5% samples; (b) coefficient of variance of < 5%; (c) methylated or unmethylated in all samples; (d) common single nucleotide polymorphisms located in probe sequence or 10-bp flanking regions; (e) cross-reactive or cross-hybridizing probes (Chen et al., ); or (f) did not pass QC in all centers. Samples with > 5% undetectable probes were excluded. Methylation signals were further processed for quantile normalization, design bias correction for type I and II probes, and batch effect adjustment using ComBat correction (Marabita et al., ). We performed QC procedures above in each center separately and then merged all data together before association analysis. Details of QC processes are described in Fig. S1.
Gene expression dataExpression and mRNA sequencing data were available for 281 LUAD and 277 LUSC patients of the TCGA dataset (Table S1). TCGA mRNA sequencing data processing and QC were done by the TCGA workgroup. Raw counts were normalized using RNA sequencing by expectation maximization. Level 3 gene quantification data were downloaded from the TCGA data portal (
Analysis flow is described in Fig. . Patients from the first four study centers (Harvard, Spain, Norway, and Sweden) were assigned into the discovery phase. A histology-stratified Cox proportional hazards model was used to test the interaction item, which was the interaction effect between DNA methylation of each CpG probe and smoking cessation (CpG probe × smoking cessation) on overall survival. The model was adjusted for age, sex, smoking cessation, clinical stage, and study center. Hazard ratio (HR) and 95% confidence interval (CI) were described per 1% methylation increment. Multiple testing corrections were performed using the false discovery rate method (FDR, measured by FDR-q value) by the Benjamini–Hochberg procedure. CpG probes with interaction FDR-q ≤ 0.05 were replicated in the validation phase using the TCGA dataset. Robustly significant probes were retained if they met all criteria: (a) interaction P ≤ 0.05 in the validation phase; and (b) consistent effect direction in both discovery and validation phases. We performed stratified analysis for robustly significant CpG probes in LUAD and LUSC patients. Finally, CpG probes with a significant interaction with smoking cessation in both phases were identified as histology-specific probes.
Sensitivity analysis for significant CpG probesDue to the complex tumor microenvironment—including noncancerous components, which might alter analysis of tumor samples (Aran et al., )—we assessed tumor purity with InfiniumPurify (Zhang et al., ) using methylation array data from TCGA samples. Tumor purity was included as an additional covariate in the Cox regression model for sensitivity analysis.
Genome-wide methylation transcription analysisFor robustly significant histology-specific prognostic CpG probes, we also performed genome-wide methylation transcription analysis using mRNA sequencing data from TCGA. The correlation between DNA methylation and gene expression was tested using a linear regression model adjusted for the same covariates mentioned above. Association with FDR-q ≤ 0.05 was considered significant. Additionally, we tested the association between gene expression and overall survival using a Cox proportional hazards model adjusted for the same covariates. Genes involved in significant associations with both methylation and NSCLC survival were filtered.
Statistical analysisContinuous variables were summarized as mean ± standard deviation (SD), and categorized variables were described by frequency (n) and proportion (%). Kaplan–Meier survival curves were used to compare survival difference among subgroups. Statistical analyses were performed using r version 3.4.4 (The R Foundation for Statistical Computing).
ResultsAfter QC, epigenome-wide DNA methylation data including 311 891 CpG sites from 1092 tumor samples of early-stage (stage I–II) NSCLC patients were retained. There were 530 patients (NLUAD = 413 and NLUSC = 117) in the discovery phase and 562 patients (NLUAD = 285 and NLUSC = 277) in the validation phase. Table details demographic and clinical information for the study population. There were 37% and 27% current smokers in the discovery and validation phases, respectively.
Demographic and clinical characteristics of former and current smokers with early-stage NSCLC in five study centers
Variable | Discovery phase | Validation phase | Combined data | ||||
Harvard (N = 133) | Spain (N = 196) | Norway (N = 116) | Sweden (N = 85) | Discovery: All (N = 530) | TCGA (N = 562) | Overall (N = 1092) | |
Age (years), mean ± SD | 68.04 ± 9.39 | 65.69 ± 10.41 | 64.89 ± 8.94 | 65.65 ± 9.69 | 66.10 ± 9.78 | 66.55 ± 9.39 | 66.33 ± 9.58 |
Sex, n (%) | |||||||
Female | 56 (42.11) | 78 (39.80) | 59 (50.86) | 42 (49.41) | 235 (44.34) | 217 (38.61) | 452 (41.39) |
Male | 77 (57.89) | 118 (60.20) | 57 (49.14) | 43 (50.59) | 295 (55.66) | 345 (61.39) | 640 (58.61) |
Smoking cessation, n (%) | |||||||
No | 52 (39.10) | 71 (36.22) | 42 (36.21) | 31 (36.47) | 196 (36.98) | 168 (26.89) | 364 (33.33) |
Yes | 81 (60.90) | 120 (61.22) | 74 (63.79) | 54 (63.53) | 329 (62.08) | 376 (66.90) | 705 (64.56) |
Unknown | 0 | 5 | 0 | 0 | 5 | 18 | 23 |
Race, n (%) | |||||||
White | 133 (100) | 196 (100) | 116 (100) | 85 (100) | 530 (100) | 444 (79.00) | 974 (89.19) |
Black or African American | 0 | 0 | 0 | 0 | 0 | 54 (9.61) | 54 (4.95) |
Asian | 0 | 0 | 0 | 0 | 0 | 5 (0.89) | 5 (0.46) |
Unknown | 0 | 0 | 0 | 0 | 0 | 59 | 59 |
TNM stage, n (%) | |||||||
I | 87 (65.41) | 160 (81.63) | 80 (68.97) | 79 (92.94) | 406 (76.60) | 358 (63.70) | 764 (69.96) |
II | 46 (34.59) | 36 (18.37) | 36 (31.03) | 6 (7.06) | 124 (23.40) | 204 (36.30) | 328 (30.04) |
Histology, n (%) | |||||||
LUAD | 79 (59.40) | 155 (79.08) | 116 (100) | 63 (74.12) | 413 (77.92) | 285 (50.71) | 698 (63.92) |
LUSC | 54 (40.60) | 41 (20.92) | 0 | 22 (25.88) | 117 (22.08) | 277 (49.29) | 394 (36.08) |
Chemotherapy, n (%) | |||||||
No | 125 (93.98) | 157 (80.10) | 86 (74.14) | 51 (60.00) | 419 (79.06) | 178 (31.67) | 597 (54.67) |
Yes | 8 (6.02) | 15 (7.65) | 30 (25.86) | 6 (7.06) | 59 (11.13) | 53 (9.43) | 112 (10.26) |
Unknown | 0 | 24 | 0 | 28 | 52 | 331 | 383 |
Radiotherapy, n (%) | |||||||
No | 115 (86.47) | 161 (82.14) | 115 (99.14) | 57 (67.06) | 448 (84.53) | 220 (39.15) | 668 (61.17) |
Yes | 18 (13.53) | 11 (5.61) | 1 (0.86) | 0 | 30 (5.66) | 11 (1.96) | 41 (3.75) |
Unknown | 0 | 24 | 0 | 28 | 52 | 331 | 383 |
Adjuvant therapy, n (%) | |||||||
No | 110 (82.71) | 148 (75.51) | 85 (73.28) | 51 (60.00) | 394 (74.34) | 172 (30.60) | 556 (51.83) |
Yes | 23 (17.29) | 24 (12.24) | 31 (26.72) | 6 (7.06) | 84 (15.85) | 59 (10.50) | 143 (13.10) |
Unknown | 0 | 24 | 0 | 28 | 52 | 331 | 383 |
Survival year | |||||||
Median (95% CI) | 6.44 (4.93–7.44) | 6.53 (5.06–8.70) | 7.34 (6.70–7.98) | 5.89 (4.18–8.82) | 7.12 (6.23–7.95) | 4.54 (3.69–5.41) | 6.23 (5.55–7.20) |
Censored rate, % | 14.29 | 54.59 | 68.97 | 40.00 | 45.30 | 76.16 | 61.17 |
3Spain is a collaborative study center, containing samples from Spain, Italy, the UK, France, and the United States.
4Censored rate is the proportion of samples lost to follow-up or alive at the study end.
5Restricted mean survival time is provided because median was not available.
*Statistically significant difference (P ≤ 0.05) was observed between combined discovery set and validation set (TCGA).
In the discovery phase, 15 methylation–smoking cessation interactions were identified with FDR-q ≤ 0.05 (Fig. S2A), and the Manhattan plot also showed the results for main effect additionally (Fig. S2B). Only 1 interaction remained statistically significant in the validation phase under the most stringent criteria (Table S2). This site, cg02268510, is located in signal-induced proliferation-associated 1-like 3 (SIPA1L3). Further histology-stratified analysis showed that cg02268510SIPA1L3 is a LUAD-specific CpG probe that interacts with smoking cessation to affect patient survival in the discovery phase (HRinteraction = 1.10; 95% CI: 1.05–1.16; P = 2.95 × 10–5), the validation phase (HRinteraction = 1.17; 95% CI: 1.02–1.35; P = 0.0255), and the combined data (HRinteraction = 1.12; 95% CI: 1.07–1.16; P = 4.30 × 10–7). Moreover, fixed-effect meta-analysis of five centers also remained significant (HRinteraction = 1.09; 95% CI: 1.05–1.13; P = 6.66 × 10–6; Fig. S3). As presented in Fig. A, with decreased methylation level of cg02268510SIPA1L3, there was an elevated benefit effect size of smoking cessation on LUAD survival. Thus, there was a modification effect of cg02268510SIPA1L3 on the association between smoking cessation and survival.
DNA methylation and smoking cessation interaction on survival of LUAD patients. (A) HR of smoking cessation estimated based on methylation level of cg02268510. The shallow area represents 95% CI, with red, gray and blue areas indicating low, medium and high methylation, respectively. Histogram on the top shows the distribution of methylation. (B) Forest plots of the effects of smoking cessation among combined LUAD populations with low, medium, or high methylation of cg02268510. Pheterogeneity was used to evaluate heterogeneity of HRs across groups. (C) Kaplan–Meier survival curves of current and former smokers among LUAD patients with varying methylation levels.
After including tumor purity as an additional covariate in sensitivity analysis, DNA methylation at cg02268510SIPA1L3 retained a significant interaction with smoking cessation on LUAD survival (HRinteraction = 1.18; 95% CI: 1.02–1.36; P = 0.024). The interaction P-value was still significant but slightly inflated due to (a) the smaller sample size (51% of original) of the sensitivity analysis, which was only performed in TCGA; and (b) low tumor purity (~ 60%) for NSCLC samples in TCGA due to mixed cell types (Zheng et al., ).
To better illustrate the interaction pattern between DNA methylation and smoking cessation, patients were categorized into low, medium, and high groups based on tertiles of cg02268510SIPA1L3 methylation. The effect of smoking cessation varied across LUAD patients with different DNA methylation levels. Smoking cessation only benefited LUAD patients with low methylation of cg02268510SIPA1L3 (HRlow = 0.53; 95% CI: 0.34–0.82; P = 4.61 × 10–3). However, there was no significant association between smoking cessation and survival in LUAD patients with medium–high methylation of cg02268510SIPA1L3 (HRmedium = 1.12; 95% CI: 0.67–1.87; P = 0.665; HRhigh = 1.29; 95% CI: 0.80–2.07; P = 0.293; HRmedium–high = 1.21; 95% CI: 0.86–1.70; P = 0.266). We observed significant heterogeneity of smoking cessation effect across the three groups (P = 0.014; Fig. B), and Kaplan–Meier curves confirmed these results (Fig. C).
These results also indicated that LUAD patients who did not quit smoking (current smokers) had the poorest prognosis if their methylation of cg02268510SIPA1L3 was in a low level. So we combined the medium and high methylation groups and performed further analysis. Current smokers in the low methylation group had 1.94 times the mortality risk compared with the medium or high methylation group (Fig. A), but there was no statistically significant difference between groups for former smokers (Fig. B). The results also indicated that smoking cessation was quite urgent for LUAD patients with low methylation of cg02268510SIPA1L3.
Kaplan–Meier survival curves of LUAD patients categorized into low and medium–high methylation groups according to tertiles in different smoking cessation groups: (A) current smokers (No) and (B) former smokers (Yes). HR, 95% CI, and P-value were derived from the Cox proportional hazards model adjusted for age, sex, clinical stage, and study center.
In addition, we evaluated the joint effect of CpG methylation level (medium–high vs low) and smoking cessation (Yes vs No) on LUAD survival (Table ). We used the poorest-prognosis group (current smokers with low methylation) as the reference to evaluate effect of elevated methylation level, smoking cessation, and their interaction. In the combined dataset, the effect of smoking cessation was HR = 0.5506 (95% CI: 0.36–0.84; P = 5.62 × 10−3) and the effect of medium–high methylation of cg02268510SIPA1L3 was HR = 0.5214 (95% CI: 0.34–0.81; P = 3.48 × 10−3). However, the joint effect was HR = 0.6268 (95% CI: 0.43–0.92; P = 1.84 × 10−2), which was greater than the product of the two individual protective effects (0.5506 × 0.5214 = 0.2871). The joint effect of two protective factors was less protective than expected, indicating an antagonistic interaction between elevated methylation of cg02268510SIPA1L3 and smoking cessation (HRinteraction = 2.1835; 95% CI: 1.27–3.74; P = 4.46 × 10−3).
Joint effect and interaction of elevated methylation and smoking cessation on the prognosis of early-stage LUAD
Effect type | Medium–high methylation | Smoking cessation | Number | HR (95% CI) | P |
No | No | 71 | Ref. | ||
Main effect1 | No | Yes | 157 | 0.5506 (0.3609, 0.8400) | 5.62 × 10−3 |
Main effect2 | Yes | No | 158 | 0.5214 (0.3369, 0.8070) | 3.48 × 10−3 |
Joint effect | Yes | Yes | 299 | 0.6268 (0.4251, 0.9243) | 1.84 × 10−2 |
Interaction | 2.1835 (1.2747, 3.7401) | 4.46 × 10−3 |
7 Patients were categorized into two groups (medium–high vs low) by tertiles of cg02268510SIPA1L3 methylation level.
8 Main effects of elevated methylation and smoking cessation and their joint effect and interaction were derived from the Cox proportional hazards model adjusted for covariates.
9 Interaction = Joint effect ÷ (main effect1 × main effect2). 2.1835 = 0.6268 ÷ (0.5506 × 0.5214).
A growing body of research has reported potential associations of DNA methylation with age and smoking (Fraga and Esteller, ; Wan et al., ; Zaghlool et al., ). Therefore, we also tested the association between methylation of cg02268510SIPA1L3 and age, as well as smoking-related variables: pack-year of smoking, years of smoking, and years of smoking cessation using a linear regression model adjusted for age, sex, clinical stage, and study centers. Smoking-related characteristics of former and current smokers in early-stage LUAD are described in Table S3. There was no significant association between methylation of cg02268510SIPA1L3 and age (β = −0.01; P = 0.521) or years of smoking (β = 0.03; P = 0.210), but pack-year of smoking (β = 0.02; P = 3.42 × 10−3) as well as years of smoking cessation (β = −0.06; P = 5.08 × 10−3) in former smoker LUAD patients (Fig. S4).
Further, because cg02268510SIPA1L3 maps to SIPA1L3, the association between cg02268510SIPA1L3 and SIPA1L3 expression was evaluated using the TCGA dataset. We observed a significant association between cg02268510SIPA1L3 and SIPA1L3 expression (β = −0.02; P = 0.015) in LUAD patients (Fig. ), indicating that cg02268510SIPA1L3 cis-regulates gene expression. Moreover, genome-wide methylation transcription analysis revealed that expression of 633 genes was significantly correlated with methylation level of cg02268510SIPA1L3 (Fig. S5A). Among them, expression of only seven genes was significantly associated with overall survival: growth arrest and DNA damage-inducible gamma (GADD45G), maturin (MTURN), TMEM200B, RGS20, RELT-like 1 (RELL1), PGM2, and receptor-interacting serine/threonine kinase 2 (RIPK2; Fig. S5B–H).
Association between DNA methylation of cg02268510 and expression of corresponding gene SIPA1L3. The β coefficient and P-value were based on linear regression analysis adjusted for age, sex, smoking status, and clinical stage. Gene expression was log2-transformed before analysis.
In this study, we systematically evaluated all pairwise DNA methylation–smoking cessation interactions on an epigenome-wide scale and further confirmed these interactions in an independent population. To our knowledge, this is the first study with a large sample size to investigate interactions between DNA methylation and smoking behavior on lung cancer survival, and it provides new evidence to account for the missing heritability of complex diseases (Trerotola et al., ). Our results show that the effect of smoking cessation on early-stage LUAD patient survival varies with methylation level of cg02268510SIPA1L3. Smoking cessation only benefits LUAD patients with low methylation, rather than medium or high methylation, of cg02268510SIPA1L3. Further, there is an antagonistic interaction between elevated methylation of cg02268510SIPA1L3 and smoking cessation.
We found that in LUAD patients with low methylation of cg02268510SIPA1L3, current smokers with more accumulative exposure had worse survival than former smokers. However, for a population with medium–high methylation, the prognosis of current smokers was similar to that of former smokers. The effect of smoking cessation is therefore modified by DNA methylation level, indicating opportunities for epi-drug intervention due to the inherent reversibility of epigenetic events (Wright, ).
Up to 50% of lung cancer patients are estimated to keep smoking after diagnosis or to frequently relapse after smoking cessation (Park et al., ; Walker et al., ). Our results indicated that smoking cessation was urgent especially for LUAD patients with low methylation of cg02268510SIPA1L3. On the other hand, reduced methylation of cg02268510SIPA1L3 might strengthen the protective effect of smoking cessation on survival.
Many studies have reported significant associations between smoking cessation and overall survival (Koshiaris et al., ; Nia et al., ), while other studies have reported negative results (Baser et al., ; Parsons et al., ). Based on our interaction analysis, we suspected that epigenetic modifications might account for this inconsistent phenomenon. Because the effect of smoking cessation varies across populations with different methylation levels of cg02268510SIPA1L3, the effect could be neutralized in a population of patients with mixed cg02268510SIPA1L3 methylation levels. Thus, the traditional marginal test for association between smoking cessation and cancer survival inherently loses statistical power to report significant findings due to complex association patterns.
SIPA1L3, the gene in which cg02268510 is located, encodes GTPase-activating proteins (GAPs) specific for the GTP-binding protein Ras-associated protein-1 (RAP1), which is implicated in regulation of cell adhesion, cell polarity, and cytoskeletal organization (Kooistra et al., ). SIPA1L3 is a member of the SPA1 family of RapGAPs, which play a crucial role in spatiotemporal control of Rap1 activation in cells (Mochizuki et al., ). Rap1 plays many roles during cell invasion and metastasis in different cancers (Zhang et al., ). Additionally, overexpression of RAP1 may desensitize NSCLC cells to cisplatin, a first-line drug to treat NSCLC (Besse et al., ). Our results suggest that low methylation at cg02268510SIPA1L3 might promote SIPA1L3 expression, further leading to Rap1 activation and resulting in poor prognosis (Fig. ).
Diagram for pathway of DNA methylation–smoking cessation interaction effect on survival for LUAD patients.
Many of the deleterious effects of smoking are due to induction of inflammatory responses that contribute to lung cancer progression (Crusz and Balkwill, ; Walser et al., ). In vitro experiments in human umbilical vein endothelial cells demonstrate that nicotine stimulates cellular inflammatory responses by activating the NF-κB transcription factor axis by a second messenger pathway (Ueno et al., ). Activation of NF-κB, one of the most investigated transcription factors, controls multiple cellular processes in cancer, including inflammation, transformation, proliferation, angiogenesis, invasion, metastasis, chemoresistance, and radioresistance (Chaturvedi et al., ). Nicotine protects NSCLC cells against chemotherapy-induced apoptosis and serum deprivation-induced apoptosis through NF-κB, and NF-κB activity is also directly stimulated by nicotine (Anto et al., ; Tsurutani et al., ). Therefore, for current smokers, nicotine in tobacco stimulates activation of NF-κB, induces inflammatory responses, and is relevant to poor patient prognosis (Fig. ).
Moreover, Rap1 is an essential modulator of NF-κB-mediated pathways. NF-κB is induced by ectopic expression of Rap1, whereas its activity is inhibited by Rap1 depletion (Teo et al., ). Furthermore, levels of Rap1 are positively regulated by NF-κB, and human breast cancers with NF-κB hyperactivity show elevated levels of cytoplasmic Rap1 (Teo et al., ). Thus, positive feedback mechanisms might exist between Rap1 expression and NF-κB activation (Fig. ). In terms of cg02268510SIPA1L3 and smoking cessation interaction, keeping smoking was associated with poor prognosis only in LUAD patients with low methylation, rather than medium or high methylation, possibly because high activation of both Rap1 and NF-κB may only occur in patients with low methylation.
We also found that methylation level of cg02268510SIPA1L3 increased along with long pack-year of smoking, but decreased with long years of smoking cessation. As presented in Fig. , low methylation and keeping smoking resulted in the worst prognosis, which might be due to the positive feedback in Rap1 and NF-κB. However, methylation of cg02268510SIPA1L3 increased with the cumulative amount of smoking. But, high methylation of cg02268510SIPA1L3 resulted in low SIPA1L3 expression that was hard to active Rap1 and NF-κB, and then might weaken the harmful effect of smoking, which also indicated an antagonistic effect. It is implied that there might be a self-protective mechanism in the human body that prevents the body from receiving excessive damage from exposure. As reported, smoking increases reactive oxygen species (ROS) production and is a significant source of oxidative stress (Athanasios et al., ), but in vivo, there is a variety of antioxidant defense mechanisms existed to counteract the detrimental effects of ROS by regulating the production of free radicals and their metabolites (Deponte, ; He et al., ). It may be an adaptive defense mechanism to counteract the increased ROS production that superoxide dismutase enzyme levels in blood and salivary were increased in smokers (Jenifer et al., ). Moreover, a previous study has found that activation of Rap1 serves to attenuate ROS production (Remans et al., ) and there is a potential interrelationship between Rap1, ROS, and NF-κB activation (Moon et al., ). But further functional studies are warranted to elucidate the mechanism of cg02268510SIPA1L3 and smoking cessation interaction on LUAD survival.
Meanwhile, we observed that some genes trans-regulated by cg02268510SIPA1L3 are involved in DNA damage response and cell growth (GADD45G) (Guo et al., ), immune cell functions (MTURN) (Sun et al., ), tumor cell migration [regulator of G protein signaling 20 (RGS20)] (Yang et al., ), apoptosis (RELL1 and RIPK2) (Chin et al., ; Cusick et al., ), and innate and adaptive immunity (RIPK2) (Jaafar et al., ).
GADD45G is a member of the GADD45 family, which plays an essential role in cellular stress response, survival, senescence, and apoptosis regulation (Liebermann et al., ). GADD45G has been reported to be a tumor suppressor in multiple cancer types and can inhibit cell growth and induce apoptosis (Ying et al., ). Patients with high GADD45G expression had a better prognosis in our study. MTURN is a neural progenitor differentiation regulator homolog. 12-O-tetradecanoylphorbol-13-acetate (TPA) is an effective cancer therapeutic reagent for myelocytic leukemia patients (Han et al., ), and MTURN is TPA-responsive and may promote both leukemic and normal megakaryocyte differentiation (Sun et al., ). Indeed, differentiation therapy by forced differentiation of cancer cells has been successful in curing acute promyeloid leukemia (Chen et al., ). Similarly, LUAD patients with high MTURN expression had favorable survival in our study. RGS20 is suggested to promote cellular characteristics that contribute to metastasis, including enhanced cell aggregation, motility, and invasion. Selective inhibition of RGS20 expression may represent an alternative means to suppress metastasis (Yang et al., ). Its high expression is significantly associated with progression and prognosis of triple-negative breast cancer (Li et al., ). Additionally, our study showed similar results in LUAD patients. Though there is a lack of explicit evidence of relevance between these genes and smoking, what we found may inspire functional studies of these potential genes and further help to complete a picture of the mechanism pathway of cg02268510SIPA1L3 and smoking cessation interaction on LUAD survival.
Our study has some significant strengths. First, this is the first study to investigate the interaction between DNA methylation and smoking cessation on lung cancer survival on an epigenome-wide scale, which provides new evidence to account for the missing heritability of complex diseases (Trerotola et al., ). Second, the two-stage study design we used to exhaustively search for interactions, as well as the sensitivity analysis, is quite conservative in controlling for false positives. Third, our study included a large sample size to analyze DNA methylation–smoking cessation interactions of early-stage NSCLC prognosis, providing an opportunity to identify complex associations with small–medium effect size.
Despite the strengths of our study, we acknowledge some limitations. First, data measured categorical smoking cessation rather than smoking pack-years, which may render less power in the study. Second, smoking cessation was collected at the time of diagnosis and was not reassessed during follow-up. Previous studies have found that ‘former smokers’ might more accurately represent a mixed exposure status, since quitters are more likely to relapse (Hughes et al., ; Walker et al., ). Thus, we likely underestimated the benefits of smoking cessation. Third, the association between cg02268510SIPA1L3 and expression of several genes requires more biological evidence, though methylation is believed to play a crucial role in regulating gene expression (Bird, ) and further influence disease gene function (Schübeler, ), cell differentiation, or reprogramming (Khavari et al., ). Thus, functional experiments are warranted to confirm these associations, so our findings should be biologically interpreted with caution thus far. In addition, our study consisted mainly of a Caucasian population (89.19%), since TCGA data contained only ~ 10% non-Caucasian samples. Our results should therefore be translated with caution for other populations. Lastly, the censored rate of survival time for the TCGA population is relatively high, since early-stage NSCLC patients need longer follow-up time. Thus, the validation phase using TCGA population had low statistical power. However, we still successfully replicated one significant interaction, indicating a quite conservative and robust result (Leung et al., ; Watt et al., ).
ConclusionThis epigenome-wide DNA methylation–smoking cessation interaction analysis of early-stage NSCLC identified one LUAD-specific CpG probe, cg02268510SIPA1L3, which could significantly modify effects of smoking cessation on lung cancer survival. Smoking cessation benefited survival of LUAD patients with low methylation at cg02268510SIPA1L3. These results have implications for not only smoking cessation after diagnosis, but also possible methylation-specific drug targeting.
AcknowledgementsThe authors thank all participants enrolled in this study. This study was supported by the National Key Research and Development Program of China (2016YFE0204900 to FC); the National Natural Science Foundation of China (81530088 and 81473070 to FC, 81602940 to LLiu); the US National Institutes of Health (CA209414, CA092824, and ES000002 to DCC); the Natural Science Foundation of the Jiangsu Higher Education Institutions of China (18KJB310011 to RZ and 14KJA310002 to FC); China Postdoctoral Science Foundation (2018M633767 to RZ); and the Priority Academic Program Development of Jiangsu Higher Education Institutions. YW and RZ were partially supported by the Outstanding Young Teachers Training Program of Nanjing Medical University.
Conflict of interestThe authors have no conflicts to declare.
Author contributionsRZ, LLai, YW, and FC contributed to the study design. RZ, SM, TF, MMB, AK, MP, JS, AH, ME, AS, and LS contributed to data collection. RZ, LLai, and YW performed statistical analysis and interpretation and drafted the manuscript. XD, JH, DY, CC, LLin, YZ, HH, SS, LW, XC, YG, and LLiu revised the manuscript. All authors contributed to critical revision of the manuscript and approved its final version. Financial support and study supervision were provided by RZ, FC, and DCC.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
© 2019. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Abstract
Smoking cessation prolongs survival and decreases mortality of patients with non-small-cell lung cancer (
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
Details


1 Department of Biostatistics, Center for Global Health, School of Public Health, Nanjing Medical University, China; Department of Environmental Health, Harvard T.H. Chan School of Public Health, Boston, MA, USA; China International Cooperation Center for Environment and Human Health, Nanjing Medical University, China
2 Department of Biostatistics, Center for Global Health, School of Public Health, Nanjing Medical University, China
3 Department of Biostatistics, Center for Global Health, School of Public Health, Nanjing Medical University, China; Department of Epidemiology and Biostatistics, School of Public Health, Southeast University, Nanjing, China
4 Department of Environmental Health, Harvard T.H. Chan School of Public Health, Boston, MA, USA; Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA, USA
5 Department of Preventive Medicine, Medical School of Ningbo University, China
6 Department of Environmental Health, Harvard T.H. Chan School of Public Health, Boston, MA, USA; China International Cooperation Center for Environment and Human Health, Nanjing Medical University, China
7 Pulmonary and Critical Care Division, Department of Medicine, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA
8 Bellvitge Biomedical Research Institute, Institucio Catalana de Recerca i Estudis Avançats, University of Barcelona, Barcelona, Spain
9 Department of Cancer Genetics, Institute for Cancer Research, Oslo University Hospital, Norway
10 Division of Oncology and Pathology, Department of Clinical Sciences Lund, CREATE Health Strategic Center for Translational Cancer Research, Lund University, Sweden
11 Department of Cancer Genetics, Institute for Cancer Research, Oslo University Hospital, Norway; Institute of Clinical Medicine, University of Oslo, Norway
12 Department of Biostatistics, Center for Global Health, School of Public Health, Nanjing Medical University, China; China International Cooperation Center for Environment and Human Health, Nanjing Medical University, China; Jiangsu Key Lab of Cancer Biomarkers, Prevention and Treatment, Cancer Center, Collaborative Innovation Center for Cancer Personalized Medicine, Nanjing Medical University, China
13 Department of Environmental Health, Harvard T.H. Chan School of Public Health, Boston, MA, USA; China International Cooperation Center for Environment and Human Health, Nanjing Medical University, China; Pulmonary and Critical Care Division, Department of Medicine, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA