This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
1. Introduction
Hepatocellular carcinoma (HCC) is the sixth most commonly diagnosed cancer and the fourth leading cause of cancer-related mortality globally [1, 2] and the second in China [3]. More than 80% of all HCC causes are associated with infection with hepatitis B virus (HBV), hepatitis C virus (HCV), and hepatitis delta virus (HDV) [4]. Approximately 292 million people worldwide are chronically infected with HBV, which causes liver injury that can progress to cirrhosis, resulting in HCC, liver failure, and eventually death [5, 6]. The HDV is known as the satellite of HBV and affects 15–20 million people in the world [7]. Concurrent HBV and HDV infections significantly increase both the incidence and mortality of HCC among patients with chronic hepatitis B (CHB) [8]. HDV is a kind of defective RNA virus which uses HBV envelope protein for successful spread in hepatocytes [9]. Although the risk of HCC is thought to be higher when a HBV-infected patient is superinfected with HDV, the molecular mechanisms of carcinogenesis remain unclear [10]. Chronic hepatitis D (CHD) is more severe than any other type of hepatitis, but its carcinogenesis mechanism remains poorly understood. Additionally, it has been found that the intrahepatic HBV DNA levels in patients with HDV-associated HCC and non-HCC cirrhosis are significantly reduced [11]. This phenomenon of HDV-mediated inhibition of HBV replication suggests that the effects of HDV are mediated through a unique molecular mechanism. While HBV and HCV are both included in International Agency for Research on Cancer (IARC) group 1 (high evidence of carcinogenicity to humans), HDV was assigned several years ago to group 3 (not sufficient evidence of carcinogenicity) [12], due to inadequacy to support the contribution of HDV to HBV-induced HCC. Due to the dependency of HDV on HBV, there are still controversies regarding the increased risk of HCC development in chronically HDV-infected patients [4], and the available data on the particular mechanism by which HDV contributes to HCC are sparse. With the development of genomics and other “-omics” disciplines, substantial omics data from HCC specimens have been accumulated [13]. Therefore, researchers have taken advantage of gene ontology (GO) and signal pathway analysis tools to identify and characterize many differentially expressed genes (DEGs).
In the present study, the GSE55092 and GSE98383 mRNA expression profile datasets were retrieved from Gene Expression Omnibus (GEO) online [14, 15]. And the RNA-sequencing data of 371 HCC patients and their clinical data were from the Cancer Genome Atlas (TCGA). We performed DEG analysis between cancerous specimens of HBV-associated HCC and HDV-associated HCC patients and their respective paracancerous specimens using Linear Models for Microarray Data (LIMMA) [16] and other packages implemented in R/Bioconductor [17]. We aimed to investigate potential HDV carcinogenesis mechanisms by PPI network, Gene Expression Profiling Interactive Analysis (GEPIA), and Weighted Gene Coexpression Network Analysis (WGCNA), particularly to screen and identify key genes and pathways to determine their possible role in HCC pathogenesis, and to help determine their mechanism of inhibition of HBV replication and their effects in diagnosis, treatment and prognosis.
2. Methods and Materials
2.1. Acquisition of Data and Preprocessing
The RNA-sequencing data and clinical data of 371 HCC patients were downloaded from TCGA (http://cancergenome.nih.gov/. http://cancergenome.nih.gov/). The expression of genes was represented by fragments per kilobase of exon per million fragments mapped (FPKM). Microarray data were available at the National Center for Biotechnology Information (NCBI) Gene Expression Omnibus (GEO, https://www.ncbi.nlm.nih.gov/geo/) database. The inclusion criteria for selection GEO datasets in this study were as follows: (1) hepatocellular carcinoma containing cancer and paracancer tissue; (2) HBsAg positive at least 6 months with serum HBV DNA positive; (3) anti-HDAg positive with serum HDV RNA positive (applies only to filter HDV-associated HCC dataset); and (4) sample size more than 10 with data unbiased. The exclusion criteria were as follows: (1) the second liver cancer, (2) HCV-related HCC, (3) dataset biased, and (4) no paracancer tissue and carcinoma tissue present at the same patients. We searched the GEO database using “Hepatitis D Virus,” “Hepatocellular Carcinoma,” and “Homo sapiens” as keywords, and there were only 2 results, between which only GSE98383 met our criteria for HDV-associated HCC. Similarly, we searched the keywords “Hepatitis B Virus,” “Hepatocellular Carcinoma,” and “Homo sapiens” and obtained 581 search results. The flow chart of screening HBV-associated HCC could be seen in Figure 1, and only GSE55092 was suited for an in-depth study.
[figure omitted; refer to PDF]
Microarray data GSE55092 [18] and GSE98383 [11] were generated using the GPL570 Affymetrix HG-U133 Plus 2.0 Array platform, and we performed the analysis on the data from whole liver tissue. GSE55092 contains 39 cancer specimens and 81 paracancer specimens from 11 HBV-associated HCC patients (
Table 1
Comparison of baseline characteristics between patients with HBV-associated HCC and HDV-associated HCC.
HBV-associated HCC | HDV-associated HCC | ||
Patients | 11 | 5 | — |
Age (years old) | 0.849 | ||
Male (%) | 10 [90.9] | 5 [100] | 1.000 |
ALT (U/L) | 0.000 | ||
AST (U/L) | 0.001 | ||
GGT (U/L) | 0.917 | ||
PT (INR) | 0.001 | ||
TB (mg/dL) | 0.005 | ||
PLT (103/mL) | 0.000 | ||
Liver pathology | |||
Activity grade | 0.034 | ||
Fibrosis stage | 0.226 | ||
F5/F6 | 9 | 5 | |
Tumor grade | 0.107 | ||
G2 | 7 | 1 | |
G3 | 3 | 4 | |
G4 | 1 | 0 | |
Tumor size | 0.407 | ||
<2 cm | 4 | 0 | |
≥2 and ≤3 cm | 4 | 3 | |
>3 cm | 3 | 2 | |
Serum HDV RNA positive, no. | 0 | 5 | — |
Serum HBV DNA positive, no. | 11 | 5 | — |
ALT: alanine aminotransferase; AST: aspartate aminotransferase; GGT: γ-glutamyl transferase; TB: total bilirubin; PT: prothrombin time; PLT: Platelets.
We downloaded the GSE55092 and GSE98383 datasets, normalized them by the Affy package of the R Bioconductor, and then converted the gene expression profile at the probe level into gene symbol level and removed the duplicated symbols. When numerous probes were mapped to one gene, the average value was defined as the expression level of that gene. According to the description of the uploader, an unsupervised multidimensional scaling (MDS) of all specimens obtained from HBV-associated HCC patients showed a clear separation between two distinct clusters that corresponded to cancer areas and paracancerous areas. A similar separation between two clusters that corresponded to cancerous areas and paracancerous areas was observed for the specimens from HDV-associated HCC patients. Therefore, all the data are available for the identification of DEGs.
2.2. Identification of DEGs
DEG analysis refers to the identification of genes with significantly different expression levels between two groups through multiple analysis modes [19]. We performed differential expression analysis using Bayes
2.3. GO Enrichment and Pathway Analysis
DAVID program (https://david.ncifcrf.gov/) [22] is a bioinformatics resource comprising a biological database and a set of annotation and analytical tool that intuitively integrates functional genomic annotations with graphics. In this study, the DEGs were submitted to DAVID for GO [23] and KEGG [24] enrichment analyses, which included biological process (BP), cellular component (CC), molecular function (MF), and related biological metabolic pathways. A
2.4. Weighted Gene Coexpression Network Construction and Module-Clinical Characteristic Associations
We compared DEGs with the genes of 371 HCC patients downloaded from TCGA website. Expression data of the matched genes from TCGA were applied to find gene modules significantly associated with clinical trait (stage of HCC) by WGCNA [25]. In this analysis mode, the soft thresholding acts as the lowest power based on the criterion of approximate scale-free topology [26], and analogous modules would be merged together due to the similarity. The heatmap of module-clinical characteristic relationship could reveal modules significantly associated with clinical characteristics.
2.5. Construction of PPI Networks and Module Analysis
The visual protein-to-protein interaction (PPI) networks of DEGs were predicted using the web resource Search Tool for the Retrieval of Interacting Genes (STRING) [27] to search the STRING database (https://string-db.org/), which contains over 5,000 organisms as well as their over 24.6 million proteins and over 2 billion interactions. We correlated the target DEGs with the STRING database and set the significant threshold to the highest confidence level (interaction
2.6. Validation of Module Gene
First, we uploaded the potential genes identified by PPI-network analysis to GEPIA [29] (http://gepia.cancer-pku.cn/, an online server containing TCGA/GTEx datasets) to validate the gene expression consistency between the microarray datasets (GSE55092 and GSE98383) and TCGA/GTEx HCC dataset, setting the threshold parameters as follows: |log2FC|
2.7. Univariate and Multivariate Cox Proportional Hazards Model Analysis
We performed univariate and multivariate Cox proportional hazards model analysis in patients from TCGA, to find independent factors associated with the overall survival of HCC.
3. Result
3.1. Comparison of Baseline Characteristics
GSE98383 was the only dataset of HDV-associated HCC that met our criteria, as to the screen of HBV-associated HCC dataset, overall 581 datasets were enrolled and screened for eligibility and 3 datasets met inclusion criteria. Of these 3 datasets, the data processing platform of GSE55092 was GPL570, which was the same with GSE98383, while GSE22058 and GSE94660 were different, so we took GSE55092 to stand for HBV-associated HCC for study. The number of datasets and reasons for exclusion are shown in Figure 1. The HDV-associated HCC patients in GSE98383 had higher levels of alanine aminotransferase (ALT), aspartate aminotransferase (AST), total bilirubin (TB), prothrombin time (PT), platelet counts (PLT), and inflammatory activity grade than the HBV-associated HCC patients in GSE55092, which is consistent with the characteristics of CHD of the most severe hepatitis. On the other hand, there was no significant difference in sex, age, tumor grade, and tumor size between the two groups.
3.2. Identification of DEGs
We identified DEGs from the microarray GSE55092 and GSE98383 datasets using the LIMMA package, setting |log2-FC| to ≥1.0 and adjusted
[figures omitted; refer to PDF]
3.3. GO Enrichment and Pathway Analysis
In order to further screen HDV-associated HCC potential target genes among these DEGs, GO and pathway analysis were performed on HDV-associated HCC DEGs using
Table 2
The top five annotations in GO and KEGG enrichment analysis of the DEGs.
Category | Term | Count | |
GOTERM_BP_FAT | GO:0070887~cellular response to chemical stimulus | 208 | |
GOTERM_BP_FAT | GO:0071310~cellular response to organic substance | 180 | |
GOTERM_BP_FAT | GO:0010033~response to organic substance | 210 | |
GOTERM_BP_FAT | GO:0006952~defense response | 129 | |
GOTERM_BP_FAT | GO:0007155~cell adhesion | 139 | |
GOTERM_CC_FAT | GO:0009986~cell surface | 69 | |
GOTERM_CC_FAT | GO:0098552~side of membrane | 43 | |
GOTERM_CC_FAT | GO:0031988~membrane-bounded vesicle | 225 | |
GOTERM_CC_FAT | GO:0005578~proteinaceous extracellular matrix | 37 | |
GOTERM_CC_FAT | GO:0009897~external side of plasma membrane | 28 | |
GOTERM_MF_FAT | GO:0001948~glycoprotein binding | 16 | |
GOTERM_MF_FAT | GO:0098772~molecular function regulator | 97 | |
GOTERM_MF_FAT | GO:0019955~cytokine binding | 15 | |
GOTERM_MF_FAT | GO:0008201~heparin binding | 20 | |
GOTERM_MF_FAT | GO:0005539~glycosaminoglycan binding | 23 | |
KEGG_PATHWAY | hsa04062: Chemokine signaling pathway | 25 | |
KEGG_PATHWAY | hsa05150: Staphylococcus aureus infection | 12 | |
KEGG_PATHWAY | hsa05202: Transcriptional misregulation in cancer | 22 | |
KEGG_PATHWAY | hsa04510: Focal adhesion | 25 | |
KEGG_PATHWAY | hsa04670: Leukocyte transendothelial migration | 17 |
3.4. Weighted Gene Coexpression Network Construction and Module-Clinical Characteristics
We compared 948 DEGs with the genes of 371 samples downloaded from TCGA datasets, matched a total of 883 genes, and performed WGCNA. As shown in Figures 3(a), 3(b), and 3(c), the soft thresholding power
[figures omitted; refer to PDF]
3.5. Construction of PPI Networks and Gene Module Analysis
We uploaded the DEGs onto the STRING online tool and analyzed them with the Cytoscape software. We then selected 353 nodes and 939 edges with the highest confidence (
[figures omitted; refer to PDF]
Table 3
Functional and pathway enrichment of module 1 genes.
Category | Term | Count | Genes | |
GOTERM_BP_FAT | G protein-coupled receptor signaling pathway | 14 | ADCY7, ADRA2A, CCL21, CCL4, CCR2, CCR7, CXCL16, CXCL5, CXCR4, FPR1, GNG2, P2RY12, P2RY14, PNOC | |
GOTERM_BP_FAT | Cell chemotaxis | 8 | CCL21, CCL4, CCR2, CCR7, CXCL16, CXCL5, CXCR4, FPR1 | |
GOTERM_BP_FAT | Chemokine-mediated signaling pathway | 6 | CCL21, CCL4, CCR2, CCR7, CXCL5, CXCR4 | |
GOTERM_CC_FAT | Plasma membrane | 11 | 0.0205 | ADCY7, ADRA2A, CCR2, CCR7, CXCL16, CXCR4, FPR1, GNG2, P2RY12, P2RY14, PNOC |
GOTERM_CC_FAT | External side of plasma membrane | 3 | 0.0205 | CCR2, CCR7, P2RY12 |
GOTERM_CC_FAT | Side of membrane | 4 | 0.0205 | CCR2, CCR7, GNG2, P2RY12 |
GOTERM_MF_FAT | G protein-coupled receptor binding | 7 | ADRA2A, CCL21, CCL4, CCR2, CXCL16, CXCL5, PNOC | |
GOTERM_MF_FAT | Chemokine receptor binding | 5 | CCL21, CCL4, CCR2, CXCL16, CXCL5 | |
GOTERM_MF_FAT | Chemokine activity | 4 | CCL21, CCL4, CXCL16, CXCL5 | |
KEGG_PATHWAY | Chemokine signaling pathway | 10 | ADCY7, CCL21, CCL4, CCR2, CCR7, CXCL16, CXCL5, CXCR4, GNB4, GNG2 | |
KEGG_PATHWAY | Cytokine-cytokine receptor interaction | 7 | CCL21, CCL4, CCR2, CCR7, CXCL16, CXCL5, CXCR4 | |
KEGG_PATHWAY | Circadian entrainment | 3 | 0.00092 | ADCY7, GNB4, GNG2 |
Table 4
Functional and pathway enrichment of module 2 genes.
Category | Term | Count | Genes | |
GOTERM_BP_FAT | Type I interferon signaling pathway | 12 | BST2, IFI27, IFI6, IFIT1, IRF7, IRF9, ISG15, MX1, MX2, OAS1, OAS2, XAF1 | |
GOTERM_BP_FAT | Defense response to virus | 9 | BST2, IFIT1, IRF7, IRF9, ISG15, MX1, MX2, OAS1, OAS2 | |
GOTERM_BP_FAT | Negative regulation of viral genome replication | 5 | BST2, IFIT1, ISG15, MX1, OAS1 | |
GOTERM_CC_FAT | Cytosol | 10 | 0.004 | BST2, IFIT1, IRF7, IRF9, ISG15, MX1, MX2, OAS1, OAS2, XAF1 |
GOTERM_CC_FAT | Mitochondrion | 6 | 0.0067 | IFI27, IFI6, MX1, MX2, OAS1, XAF1 |
GOTERM_CC_FAT | Cytoplasmic part | 12 | 0.0067 | BST2, IFI27, IFI6, IFIT1, IRF7, IRF9, ISG15, MX1, MX2, OAS1, OAS2, XAF1 |
GOTERM_MF_FAT | 2 | 2 | 0.00028 | OAS1, OAS2 |
GOTERM_MF_FAT | Double-stranded RNA binding | 2 | 0.0229 | OAS1, OAS2 |
KEGG_PATHWAY | Hepatitis C | 5 | IFIT1, IRF7, IRF9, OAS1, OAS2 | |
KEGG_PATHWAY | Measles | 5 | IRF7, IRF9, MX1, OAS1, OAS2 | |
KEGG_PATHWAY | Influenza A | 5 | IRF7, IRF9, MX1, OAS1, OAS2 |
Table 5
Functional and pathway enrichment of module 3 genes.
Category | Term | Count | Genes | |
GOTERM_BP_FAT | Mitotic cell cycle process | 16 | CDC45, CDC6, CDCA5, CDCA8, CENPE, DBF4, ESPL1, KIF18A, MCM10, MCM4, MCM7, ORC6, POLE2, PPP2R5C, SKP2, ZWILCH | |
GOTERM_BP_FAT | Cell cycle | 17 | CDC45, CDC6, CDCA5, CDCA8, CENPE, DBF4, ESPL1, KIF18A, KLHL13, MCM10, MCM4, MCM7, ORC6, POLE2, PPP2R5C, SKP2, ZWILCH | |
GOTERM_BP_FAT | G1/S transition of mitotic cell cycle | 9 | CDC45, CDC6, DBF4, MCM10, MCM4, MCM7, ORC6, POLE2, SKP2 | |
GOTERM_CC_FAT | Chromosomal part | 13 | CDC45, CDCA5, CDCA8, CENPE, CENPH, CENPI, KIF18A, MCM10, MCM7, ORC6, POLE2, PPP2R5C, ZWILCH | |
GOTERM_CC_FAT | Chromosome, centromeric region | 8 | CDCA5, CDCA8, CENPE, CENPH, CENPI, KIF18A, PPP2R5C, ZWILCH | |
GOTERM_CC_FAT | Intracellular nonmembrane-bounded organelle | 18 | CDC45, CDC6, CDCA5, CDCA8, CENPE, CENPH, CENPI, ESPL1, KCTD6, KIF18A, MCM10, MCM7, ORC6, POLE2, PPP2R5C, RNF213, SKP2, ZWILCH | |
GOTERM_MF_FAT | DNA replication origin binding | 3 | 0.00011 | CDC45, MCM10, ORC6 |
GOTERM_MF_FAT | Single-stranded DNA binding | 4 | 0.00037 | CDC45, MCM10, MCM4, MCM7 |
GOTERM_MF_FAT | DNA helicase activity | 3 | 0.00069 | CDC45, MCM4, MCM7 |
KEGG_PATHWAY | Cell cycle | 8 | CDC45, CDC6, DBF4, ESPL1, MCM4, MCM7, ORC6, SKP2 | |
KEGG_PATHWAY | DNA replication | 3 | 0.00022 | MCM4, MCM7, POLE2 |
KEGG_PATHWAY | Ubiquitin-mediated proteolysis | 4 | 0.00024 | KLHL13, SKP2, TCEB1, TRIM37 |
3.6. Validation of Module Genes
We compared the gene expression changes between the three module genes of HDV-associated HCC (a total of 52 genes) and the validation HCC (TCGA/GTEx) datasets in the GEPIA website to verify whether their expression in both datasets is consistent. We noticed that CCL21 and FPR1 in module 1 as well as XAF1 in module 2 were downregulated in tumors compared to normal specimens in the HCC datasets, which is in accordance with the HDV-associated HCC specimens. However, IFI6, IFI27, and ISG15 in module 2, which were downregulated in cancerous specimens compared to paracancerous normal specimens, were conversely expressed in HCC datasets. The genes in module 3, including CDC6, CDC45, CDCA5, CDCA8, CENPH, MCM4, MCM7, and TCEB1, were upregulated in tumor compared to normal specimens in the HCC datasets, which is consistent with the HDV-associated HCC patients. All the box plots comparing gene expression are shown in Figure 5(a). For further verification, 11 genes whose gene expression trends are consistent with HCC datasets were selected and used to conduct overall survival analysis. In Figure 5(b), there were the upregulated genes (CDC6, CDC45, CDCA5, CDCA8, CENPH, MCM4, MCM7, and TCEB1) which are associated with a lower survival rate in the high expression group than in the low expression group.
[figures omitted; refer to PDF]
The reason for excluding CCL21, FPR1, IFI6, IFI27, ISG15, and XAF1 is that their
3.7. Identification of Independent Factors of Overall Survival of HCC
We performed the univariate analysis in 371 patients from TCGA and found that CDCA8, stage, CDC45, CDC6, CDCA5, MCM4, CENPH, MCM7, sex, and age were significantly associated with OS of HCC. The multivariate Cox proportional hazards model showed that CDCA8 and stage of HCC were independent factors of OS of HCC (Table 6).
Table 6
Univariate and Multivariate Cox proportional hazards model analysis of overall survival of HCC.
Variables | Univariate analysis | Multivariate analysis | ||||
HR | 95% CI | HR | 95% CI | |||
CDCA8 | 1.11 | 1.08-1.15 | <0.001 | 1.10 | 1.06-1.14 | <0.001 |
Stage I | <0.001 | 0.019 | ||||
Stage II | 1.43 | 0.88-2.34 | 0.151 | 1.23 | 0.75-2.02 | 0.416 |
Stage IIIA | 2.68 | 1.70-4.21 | <0.001 | 2.11 | 1.31-3.39 | 0.002 |
Stage IIIB | 2.87 | 1.02-8.04 | 0.045 | 1.84 | 0.62-5.44 | 0.273 |
CDC45 | 1.13 | 1.07-1.19 | <0.001 | |||
CDC6 | 1.10 | 1.05-1.15 | <0.001 | |||
CDCA5 | 1.09 | 1.05-1.13 | <0.001 | |||
MCM4 | 1.06 | 1.04-1.09 | <0.001 | |||
CENPH | 1.18 | 1.08-1.28 | <0.001 | |||
MCM7 | 1.01 | 1.00-1.01 | 0.029 | |||
Gender (F vs. M) | 0.82 | 0.57-1.16 | 0.257 | |||
Age (years) | 1.01 | 0.99-1.02 | 0.403 |
F: female; M: male.
4. Discussion
As the virus causing the most severe type of hepatitis, HDV affects 15-20 million people worldwide, but its specific pathogenic mechanism remains unclear. Accordingly, we undertook to find potential genes and pathways involved in the pathogenesis of this disease through text mining to help explain the underlying carcinogenic mechanism of HDV as well as the HBV inhibitory mechanism.
In this study, we compared cancerous and paracancerous specimens of patients suffering from HBV or HDV-associated HCC with the aim of identifying potential genes closely related to HDV-associated HCC. The study identified 373 upregulated DEGs and 582 downregulated DEGs. These DEGs were subjected to GO and KEGG annotation and enrichment analyses. In addition, we constructed PPI networks and sorted out 353 nodes with 939 edges, from which the three most significant modules were selected and 52 central nodes/genes were selected for validation using the GEPIA database. In the module confirmed by the WGCNA that was significantly associated with clinical features including event, OS, stage, grade, and age, only CDC6, CDC45, CDCA5, CDCA8, CENPH, MCM4, and MCM7 were consistent with genes identified by the PPI network, which were found to be significantly correlated with nucleoplasm, cell cycle, DNA replication, and mitotic cell cycle. Univariate and multivariate Cox proportional hazards model analysis showed the stage of HCC and CDCA8 are the independent factors associated with the OS of HCC.
Cell division cycle 6 (CDC6) is thought to be significantly associated with pancreatic cancer and colorectal cancer (CRC) [30]. It has a pivotal role in regulating the process of DNA replication as well as tumorigenesis; its overexpression could interfere with the expression of tumor suppressor genes (INK4/ARF) through the mechanism of epigenetic modification [31]. During the S phase of DNA replication in eukaryotic cells, cell division cycle 45 (CDC45) is an essential component of CMG (CDC45–MCM–GINS) helicase. CDC45 acts as a hubprotein, significantly upregulated in cancerous tissues from CRC and non-small-cell lung cancer (NSCLC) patients, and promotes tumor progression [32]. It is established that the MCM4/6/7 (minichromosome maintenance complex component 4/6/7) hexamer complex acts as a DNA helicase. Additionally, in endometrial cancer and skin cancer studies, it was found that MCM4 mutations may affect the interaction with MCM7, thereby disrupting the stability of the MCM4/6/7 complex [33]. In addition, MCM4 is also a member of significant predictors of poor prognosis in CRC patients [34]. Moreover, MCM7 is also a promising biomarker for early diagnosis of gastric cancer and even a predictor of meningioma recurrence after surgery [35, 36]. Another study found that high expression of MCM7 may be involved in the progression of HCC through the MCM7-cyclin D1 pathway, and MCM7 may serve as a prognostic marker for patients with HCC [37]. The cell division cycle-associated protein 5 (CDCA5) is a member of the CDCA family that comprises CDCA1-8. It plays a crucial role as a regulator of sister-chromatid cohesion and separation during cell division, and its upregulation has been shown to be associated with various cancers, including breast cancer, esophageal squamous cell carcinoma, CRC, and HCC [38, 39]. Also, a study found that the activation of the ERK and AKT pathways may be involved in the regulation of HCC cell proliferation by CDCA45 [39]. CDCA8 is an essential regulator of mitosis, and its overexpression is significantly associated with bladder cancer, cutaneous melanoma, and the progression and prognosis of breast cancer [40, 41]. Centromere protein H (CENPH) is considered to be an essential part of the active centromere complex, and its overexpression is highly related to poor prognosis in renal cell carcinoma, nasopharyngeal carcinoma, CRC, and HCC [42, 43]. Another study found that CENPH may promote the proliferation of HCC through the mitochondrial apoptosis pathway [43].
Most of the abovementioned genes are significantly associated with the cell cycle and DNA replication, and their overexpression may affect the replication of HBV DNA, thereby promoting the unique phenomenon of HDV inhibits HBV replication. All the findings related to these genes may also help us understand the mechanisms of HDV-induced liver injury and HCC. In the future, we will further verify those genes’ function by performing animals, cells, and clinical trials.
5. Conclusions
In summary, 7 potential candidate genes closely related to HDV-associated HCC were identified in this study. Through comparative analysis with previous studies, these genes were found to be involved in many pathways related to tumorigenesis which provided clues to elucidate the mechanism of hepatitis D virus-induced HCC or its unique molecular mechanism in the inhibition of HBV replication. However, additional in-depth molecular biological research on these candidate genes closely related to HDV-associated HCC is necessary to confirm their functions.
Authors’ Contributions
Zhe Yu, Xuemei Ma, and Wei Zhang contributed equally to this work.
[1] F. Bray, J. Ferlay, I. Soerjomataram, R. L. Siegel, L. A. Torre, A. Jemal, "Global cancer statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries," CA: a Cancer Journal for Clinicians, vol. 68 no. 6, pp. 394-424, DOI: 10.3322/caac.21492, 2018.
[2] A. Villanueva, "Hepatocellular carcinoma," The New England Journal of Medicine, vol. 380 no. 15, pp. 1450-1462, DOI: 10.1056/NEJMra1713263, 2019.
[3] W. Chen, R. Zheng, P. D. Baade, S. Zhang, H. Zeng, F. Bray, A. Jemal, X. Q. Yu, J. He, "Cancer statistics in China, 2015," CA: a Cancer Journal for Clinicians, vol. 66 no. 2, pp. 115-132, DOI: 10.3322/caac.21338, 2016.
[4] R. Romeo, A. Petruzziello, E. I. Pecheur, F. Facchetti, R. Perbellini, E. Galmozzi, N. U. Khan, L. di Capua, R. Sabatino, G. Botti, G. Loquercio, "Hepatitis delta virus and hepatocellular carcinoma: an update," Epidemiology and Infection, vol. 146 no. 13, pp. 1612-1618, DOI: 10.1017/S0950268818001942, 2018.
[5] W. K. Seto, Y. R. Lo, J. M. Pawlotsky, M. F. Yuen, "Chronic hepatitis B virus infection," Lancet, vol. 392 no. 10161, pp. 2313-2324, DOI: 10.1016/S0140-6736(18)31865-8, 2018.
[6] L. S. Y. Tang, E. Covert, E. Wilson, S. Kottilil, "Chronic hepatitis B infection," JAMA, vol. 319 no. 17, pp. 1802-1813, DOI: 10.1001/jama.2018.3795, 2018.
[7] Y. Ni, Z. Zhang, L. Engelskircher, G. Verch, T. Tu, F. A. Lempp, S. Urban, "Generation and characterization of a stable cell line persistently replicating and secreting the human hepatitis delta virus," Scientific Reports, vol. 9 no. 1,DOI: 10.1038/s41598-019-46493-1, 2019.
[8] G. Brancaccio, M. Fasano, A. Grossi, T. A. Santantonio, G. B. Gaeta, "Clinical outcomes in patients with hepatitis D, cirrhosis and persistent hepatitis B virus replication, and receiving long-term tenofovir or entecavir," Alimentary Pharmacology & Therapeutics, vol. 49 no. 8, pp. 1071-1076, DOI: 10.1111/apt.15188, 2019.
[9] M. T. Binh, N. X. Hoan, H. van Tong, D. P. Giang, B. T. Sy, N. L. Toan, L. H. Song, M. H. Bang, H. Wedemeyer, C. G. Meyer, P. G. Kremsner, C. T. Bock, T. P. Velavan, "HDV infection rates in northern Vietnam," Scientific Reports, vol. 8 no. 1,DOI: 10.1038/s41598-018-26446-w, 2018.
[10] C. Sureau, F. Negro, "The hepatitis delta virus: replication and pathogenesis," Journal of Hepatology, vol. 64 no. 1, pp. S102-S116, DOI: 10.1016/j.jhep.2016.02.013, 2016.
[11] G. Diaz, R. E. Engle, A. Tice, M. Melis, S. Montenegro, J. Rodriguez-Canales, J. Hanson, M. R. Emmert-Buck, K. W. Bock, I. N. Moore, F. Zamboni, S. Govindarajan, D. E. Kleiner, P. Farci, "Molecular signature and mechanisms of hepatitis D virus-associated hepatocellular carcinoma," Molecular Cancer Research, vol. 16 no. 9, pp. 1406-1419, DOI: 10.1158/1541-7786.MCR-18-0012, 2018.
[12] Humans, I.W.G.o.t.E.o.C.R.t, I.A.f.R.o. Cancer, Hepatitis viruses, vol. 59, 1994.
[13] H. Lee, B. T. Lau, H. P. Ji, "Targeted sequencing strategies in cancer research," Next Generation Sequencing in Cancer Research, pp. 137-163, 2013.
[14] T. Barrett, D. B. Troup, S. E. Wilhite, P. Ledoux, D. Rudnev, C. Evangelista, I. F. Kim, A. Soboleva, M. Tomashevsky, K. A. Marshall, K. H. Phillippy, P. M. Sherman, R. N. Muertter, R. Edgar, "NCBI GEO: archive for high-throughput functional genomic data," Nucleic Acids Research, vol. 37, pp. D885-D890, DOI: 10.1093/nar/gkn764, 2009.
[15] R. Edgar, M. Domrachev, A. E. Lash, "Gene expression omnibus: NCBI gene expression and hybridization array data repository," Nucleic Acids Research, vol. 30 no. 1, pp. 207-210, DOI: 10.1093/nar/30.1.207, 2002.
[16] G. K. Smyth, "limma: linear models for microarray data," Bioinformatics and computational biology solutions using R and Bioconductor, pp. 397-420, 2015.
[17] R. B. Dessau, C. B. Pipper, "“R”--project for statistical computing," Ugeskrift for Laeger, vol. 170 no. 5, pp. 328-330, 2008.
[18] M. Melis, G. Diaz, D. E. Kleiner, F. Zamboni, J. Kabat, J. Lai, G. Mogavero, A. Tice, R. E. Engle, S. Becker, C. R. Brown, J. C. Hanson, J. Rodriguez-Canales, M. Emmert-Buck, S. Govindarajan, M. Kew, P. Farci, "Viral expression and molecular profiling in liver tissue versus microdissected hepatocytes in hepatitis B virus-associated hepatocellular carcinoma," Journal of Translational Medicine, vol. 12 no. 1,DOI: 10.1186/s12967-014-0230-1, 2014.
[19] E. Korpelainen, J. Tuimala, P. Somervuo, RNA-Seq Data Analysis - a Practical Approach, 2014.
[20] Y. Benjamini, Y. Hochberg, "Controlling the false discovery rate: a practical and powerful approach to multiple testing," Journal of the Royal statistical society: series B (Methodological), vol. 57 no. 1, pp. 289-300, 1995.
[21] J. C. Oliveros, Venny. An interactive tool for comparing lists with Venn's diagrams, 2007. Available from: https://bioinfogp.cnb.csic.es/tools/venny/index.html
[22] G. Dennis, B. T. Sherman, D. A. Hosack, J. Yang, W. Gao, H. C. Lane, R. A. Lempicki, "DAVID: database for annotation, visualization, and integrated discovery," Genome Biology, vol. 4 no. 5,DOI: 10.1186/gb-2003-4-5-p3, 2003.
[23] M. Ashburner, C. A. Ball, J. A. Blake, D. Botstein, H. Butler, J. M. Cherry, A. P. Davis, K. Dolinski, S. S. Dwight, J. T. Eppig, M. A. Harris, D. P. Hill, L. Issel-Tarver, A. Kasarskis, S. Lewis, J. C. Matese, J. E. Richardson, M. Ringwald, G. M. Rubin, G. Sherlock, "Gene Ontology: tool for the unification of biology," Nature Genetics, vol. 25 no. 1, pp. 25-29, DOI: 10.1038/75556, 2000.
[24] H. Ogata, S. Goto, K. Sato, W. Fujibuchi, H. Bono, M. Kanehisa, "KEGG: Kyoto Encyclopedia of Genes and Genomes," Nucleic Acids Research, vol. 27 no. 1, pp. 29-34, DOI: 10.1093/nar/27.1.29, 1999.
[25] P. Langfelder, S. Horvath, "WGCNA: an R package for weighted correlation network analysis," BMC Bioinformatics, vol. 9 no. 1,DOI: 10.1186/1471-2105-9-559, 2008.
[26] B. Zhang, S. Horvath, "A general framework for weighted gene co-expression network analysis," Statistical Applications in Genetics and Molecular Biology, vol. 4 no. 1,DOI: 10.2202/1544-6115.1128, 2005.
[27] A. Franceschini, D. Szklarczyk, S. Frankild, M. Kuhn, M. Simonovic, A. Roth, J. Lin, P. Minguez, P. Bork, C. von Mering, L. J. Jensen, "STRING v9.1: protein-protein interaction networks, with increased coverage and integration," Nucleic Acids Research, vol. 41, pp. D808-D815, DOI: 10.1093/nar/gks1094, 2013.
[28] M. Kohl, S. Wiese, B. Warscheid, "Cytoscape: software for visualization and analysis of biological networks," Methods in Molecular Biology, vol. 696, pp. 291-303, DOI: 10.1007/978-1-60761-987-1_18, 2011.
[29] Z. Tang, C. Li, B. Kang, G. Gao, C. Li, Z. Zhang, "GEPIA: a web server for cancer and normal gene expression profiling and interactive analyses," Nucleic Acids Research, vol. 45 no. W1, pp. W98-W102, DOI: 10.1093/nar/gkx247, 2017.
[30] Y. Hu, L. Wang, Z. Li, Z. Wan, M. Shao, S. Wu, G. Wang, "Potential prognostic and diagnostic values of CDC6, CDC45, ORC6 and SNHG7 in colorectal cancer," Oncotargets and Therapy, vol. Volume 12, pp. 11609-11621, DOI: 10.2147/OTT.S231941, 2019.
[31] L. R. Borlado, J. Mendez, "CDC6: from DNA replication to cell cycle checkpoints and oncogenesis," Carcinogenesis, vol. 29 no. 2, pp. 237-243, DOI: 10.1093/carcin/bgm268, 2008.
[32] J. Huang, Y. Li, Z. Lu, Y. Che, S. Sun, S. Mao, Y. Lei, R. Zang, N. Li, S. Zheng, C. Liu, X. Wang, N. Sun, J. He, "Analysis of functional hub genes identifies CDC45 as an oncogene in non-small cell lung cancer - a short report," Cellular Oncology (Dordrecht), vol. 42 no. 4, pp. 571-578, DOI: 10.1007/s13402-019-00438-y, 2019.
[33] R. Tatsumi, Y. Ishimi, "An MCM4 mutation detected in cancer cells affects MCM4/6/7 complex formation," Journal of Biochemistry, vol. 161 no. 3, pp. 259-268, DOI: 10.1093/jb/mvw065, 2017.
[34] P. Ahluwalia, A. K. Mondal, C. Bloomer, S. Fulzele, K. Jones, S. Ananth, G. K. Gahlay, S. Heneidi, A. M. Rojiani, V. Kota, R. Kolhe, "Identification and clinical validation of a novel 4 gene-signature with prognostic utility in colorectal cancer," International Journal of Molecular Sciences, vol. 20 no. 15,DOI: 10.3390/ijms20153818, 2019.
[35] T. L. Winther, S. H. Torp, "MCM7 expression is a promising predictor of recurrence in patients surgically resected for meningiomas," Journal of Neuro-Oncology, vol. 131 no. 3, pp. 575-583, DOI: 10.1007/s11060-016-2329-0, 2017.
[36] J. Y. Yang, D. Li, Y. Zhang, B. X. Guan, P. Gao, X. C. Zhou, C. J. Zhou, "The expression of MCM7 is a useful biomarker in the early diagnostic of gastric cancer," Pathology Oncology Research, vol. 24 no. 2, pp. 367-372, DOI: 10.1007/s12253-017-0251-1, 2018.
[37] K. Qu, Z. Wang, H. Fan, J. Li, J. Liu, P. Li, Z. Liang, H. An, Y. Jiang, Q. Lin, X. Dong, P. Liu, C. Liu, "MCM7 promotes cancer progression through cyclin D1-dependent signaling and serves as a prognostic marker for patients with hepatocellular carcinoma," Cell Death & Disease, vol. 8 no. 2,DOI: 10.1038/cddis.2016.352, 2017.
[38] N. N. Phan, C. Y. Wang, K. L. Li, C. F. Chen, C. C. Chiao, H. G. Yu, P. L. Huang, Y. C. Lin, "Distinct expression of CDCA3, CDCA5, and CDCA8 leads to shorter relapse free survival in breast cancer patient," Oncotarget, vol. 9 no. 6, pp. 6977-6992, DOI: 10.18632/oncotarget.24059, 2018.
[39] J. Wang, C. Xia, M. Pu, B. Dai, X. Yang, R. Shang, Z. Yang, R. Zhang, K. Tao, K. Dou, "Silencing of CDCA5 inhibits cancer progression and serves as a prognostic biomarker for hepatocellular carcinoma," Oncology Reports, vol. 40 no. 4, pp. 1875-1884, 2018.
[40] C. Ci, B. Tang, D. Lyu, W. Liu, D. Qiang, X. Ji, X. Qiu, L. Chen, W. Ding, "Overexpression of CDCA8 promotes the malignant progression of cutaneous melanoma and leads to poor prognosis," International Journal of Molecular Medicine, vol. 43 no. 1, pp. 404-412, DOI: 10.3892/ijmm.2018.3985, 2019.
[41] Y. Bi, S. Chen, J. Jiang, J. Yao, G. Wang, Q. Zhou, S. Li, "CDCA8 expression and its clinical relevance in patients with bladder cancer," Medicine (Baltimore), vol. 97 no. 34,DOI: 10.1097/MD.0000000000011899, 2018.
[42] W. Wu, F. Wu, Z. Wang, J. di, J. Yang, P. Gao, B. Jiang, X. Su, "CENPH inhibits rapamycin sensitivity by regulating GOLPH3-dependent mTOR signaling pathway in colorectal cancer," Journal of Cancer, vol. 8 no. 12, pp. 2163-2172, DOI: 10.7150/jca.19940, 2017.
[43] G. Lu, H. Hou, X. Lu, X. Ke, X. Wang, D. Zhang, Y. Zhao, J. Zhang, M. Ren, S. He, "CENP-H regulates the cell growth of human hepatocellular carcinoma cells through the mitochondrial apoptotic pathway," Oncology Reports, vol. 37 no. 6, pp. 3484-3492, DOI: 10.3892/or.2017.5602, 2017.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
Copyright © 2021 Zhe Yu et al. This is an open access article distributed under the Creative Commons Attribution License (the “License”), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. https://creativecommons.org/licenses/by/4.0/
Abstract
Several studies have demonstrated that chronic hepatitis delta virus (HDV) infection is associated with a worsening of hepatitis B virus (HBV) infection and increased risk of hepatocellular carcinoma (HCC). However, there is limited data on the role of HDV in the oncogenesis of HCC. This study is aimed at assessing the potential mechanisms of HDV-associated hepatocarcinogenesis, especially to screen and identify key genes and pathways possibly involved in the pathogenesis of HCC. We selected three microarray datasets: GSE55092 contains 39 cancer specimens and 81 paracancer specimens from 11 HBV-associated HCC patients, GSE98383 contains 11 cancer specimens and 24 paracancer specimens from 5 HDV-associated HCC patients, and 371 HCC patients with the RNA-sequencing data combined with their clinical data from the Cancer Genome Atlas (TCGA). Afterwards, 948 differentially expressed genes (DEGs) closely related to HDV-associated HCC were obtained using the R package and filtering with a Venn diagram. We then performed gene ontology (GO) annotation and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment analysis to determine the biological processes (BP), cellular component (CC), molecular function (MF), and KEGG signaling pathways most enriched for DEGs. Additionally, we performed Weighted Gene Coexpression Network Analysis (WGCNA) and protein-to-protein interaction (PPI) network construction with 948 DEGs, from which one module was identified by WGCNA and three modules were identified by the PPI network. Subsequently, we validated the expression of 52 hub genes from the PPI network with an independent set of HCC dataset stored in the Gene Expression Profiling Interactive Analysis (GEPIA) database. Finally, seven potential key genes were identified by intersecting with key modules from WGCNA, including 3 reported genes, namely, CDCA5, CENPH, and MCM7, and 4 novel genes, namely, CDC6, CDC45, CDCA8, and MCM4, which are associated with nucleoplasm, cell cycle, DNA replication, and mitotic cell cycle. The CDCA8 and stage of HCC were the independent factors associated with overall survival of HDV-associated HCC. All the related findings of these genes can help gain a better understanding of the role of HDV in the underlying mechanism of HCC carcinogenesis.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
Details








1 Peking University 302 Clinical Medical School, Beijing 100039, China
2 Department of Liver Disease of Chinese PLA General Hospital, The Fifth Medical Center of Chinese PLA General Hospital, Beijing 100039, China
3 China Military Institute of Chinese Medicine, The Fifth Medical Centre of Chinese PLA General Hospital, Beijing 100039, China
4 Peking University 302 Clinical Medical School, Beijing 100039, China; Department of Liver Disease of Chinese PLA General Hospital, The Fifth Medical Center of Chinese PLA General Hospital, Beijing 100039, China