This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
1. Introduction
Lymphoma, as one of the major cancer subtypes involving the lymphatic system, is a severe subgroup of malignancies in human beings [1, 2]. The lymphatic system is the center of the circulation immune system, thereby regulating the immune response against external antigens, germs, virus, and even cancers [3]. As a system throughout the body, the lymphatic system includes multiple levels of organs, including lymph nodes, spleen, thymus, and bone marrow [1, 3, 4]. Considering that the lymphatic system can affect the whole body, the malignant transformation of the lymphatic system, also known as lymphoma, is also a severe kind of malignancy affecting the whole body.
Multiple subtypes of lymphoma, including chronic lymphocytic leukemia [5], cutaneous B-cell lymphoma [6], Hodgkin’s lymphoma [7], and non-Hodgkin’s lymphoma [7, 8], are identified in clinics. Despite the diversity of lymphoma, most patients with lymphoma share similar symptoms, including painless swelling of lymph nodes, persistent fatigue, fever, and itchy skin [7]. Although the detailed pathogenesis of lymphoma remains unclear, external environmental effects, like Epstein–Barr virus [9] and Helicobacter pylori [10] infection, and genetic variations are shown to be associated with the disease. In recent years, with the development of sequencing techniques, genetic variations from multiple functional genes including CASP10 [11], ATM [12], RAD54L [13], BRAF [14], and CARD11 [15] have been shown to be associated with lymphoma. However, revealing the pathogenic mechanisms based on only a group of genes remains challenging. Further functional exploration, like gene ontology (GO) and pathway enrichment analyses, may help explore the biological foundation for the initiation and progression of lymphoma.
In this study, we summarized and compared the functional enrichment patterns of lymphoma-associated and lymphoma-nonassociated genes for the first time. By using Boruta, max-relevance and min-redundancy (mRMR), and incremental feature selection (IFS) methods and decision tree (DT) algorithms, we attempt to identify key functional enrichment terms (GO terms [16] or KEGG pathways [17]) contributing to the identification of lymphoma-associated genes. The identified functional enrichment terms are associated with the pathogenesis of lymphoma. Overall, our study has identified lymphoma-associated GO terms and KEGG pathways for the first time, thereby helping validate previous reports on the key biological effects of identified lymphoma biomarkers and establishing a novel approach to explore disease-associated pathogenesis at the functional level.
2. Materials and Methods
In this study, we investigated the functional enrichment patterns of lymphoma genes by using machine learning methods. The procedures are shown in Figure 1.
[figure(s) omitted; refer to PDF]
2.1. Dataset
In this study, we summarized 1548 lymphoma-associated genes from the DisGeNET database (https://www.disgenet.org/, v7.0) [18]. These genes were termed as positive samples, whereas the rest of the human genes were termed as negative samples. Given that the purpose of this study was to analyze the functional terms of lymphoma-associated genes, positive and negative samples without GO or KEGG pathway information were discarded. A total of 1330 positive and 16338 negative samples remained.
2.2. Feature Construction
Certain informative features should be used to express genes and identify distinctions between positive and negative samples. In this study, GO and KEGG enrichment scores were used as features of each gene.
GO enrichment denotes the association between a gene and a GO term. The
Similarly, the KEGG enrichment score
Certainly, a high enrichment score of a gene with one GO term or KEGG pathway indicated a strong relationship. In this study, 20681 GO and 297 KEGG enrichment scores were obtained for each gene. Thus, these 20978 features might be used to represent gene
2.3. Feature Selection
As shown in Figure 1, we used the Boruta [20], mRMR [21], and incremental feature selection (IFS) [22] algorithms to perform feature selection. The Boruta method eliminated nonrelevant features, the mRMR method sorted the features into a feature list, and the IFS combined specific classifiers to determine the optimal number of features.
2.3.1. Boruta Feature Selection
The presence of a large number of features in the dataset could cause some technical problems. Thus, we applied the Boruta algorithm to assess the importance of features and eventually selected significant features. As a wrapper feature selection method, the Boruta algorithm was designed on the basis of the random forest classification algorithm. The algorithm randomly created shadow features from original features and operated a random forest classifier on the collection of original and shadow features to filter important and unimportant features. In accordance with the outcomes of statistical tests (e.g.,
2.3.2. mRMR Feature Selection
To evaluate the degree of importance for each feature, we used the mRMR algorithm to sort features in terms of their importance. The informative features selected by this method had the maximum relevance to class labels and the minimum redundancy with each other. The method calculated the relationship between features or classified labels by using mutual information (MI). The MI values of variables
2.3.3. IFS
Although the mRMR method ranked the features by importance, which features were essential in the feature list remained a problem. The IFS method was used to determine the optimum features in the feature list. In the first step, IFS had output a set of feature subsets from the list. For example, when the step size was set to 5, the 1st and 2nd feature subsets were composed of the top 5 and top 10 features, respectively, in the list. The training samples represented by features in each subset were next trained with the desired classifier. The classifier was assessed by 10-fold crossvalidation and synthetic minority oversampling technique (SMOTE) to obtain the performance metrics of the classification model, and the best classification model could be determined by performance metrics.
2.4. DT
Different from other algorithms, such as the support vector machine (SVM) [23] and random forest (RF) [24], DT [25] is a white box model that constructs classification or regression models that are easy to interpret. DT creates a tree structure in the IF–THEN format and generates rules that can be understood, thereby further enhancing the knowledge of the model prediction mechanism. This study adopted the DT program implemented by python in Scikit-learn (https://scikit-learn.org/stable/) [26]. Such program implements the CART tree with the Gini index to expand the tree.
2.5. SMOTE
An imbalance problem is present between the sizes of positive and negative samples in the abovementioned constructed dataset, where the positive sample size is much smaller than the negative sample size. To address this issue, we used the SMOTE [27] algorithm in this research. The SMOTE algorithm analyzes and simulates a minority class of samples by using the kNN technique and adds the newly synthesized samples to the dataset to produce a new training set. The SMOTE program that was run in this work was sourced from https://github.com/scikit-learn-contrib/imbalanced-learn, and parameters were set to default.
2.6. Performance Measurements
Evaluation metrics, such as accuracy (ACC), sensitivity (SN) (same as recall), specificity (SP), precision, F1-measure, and MCC [28–31], were used in the 10-fold crossvalidation [32–38] process to assess the performance of prediction models. The formulas of these evaluation metrics are shown as follows:
3. Results
3.1. Results of Boruta and mRMR Methods on the Dataset
The Boruta and mRMR feature selection methods were adopted to analyze the dataset and select key features. A total of 1075 features were retained after processing the original dataset by using the Boruta method. These preserved features are listed in Table S1. These features are composed of 1034 GO terms and 41 KEGG pathways. Further, these features were sorted by the mRMR method to evaluate their importance. Results are also listed in Table S1.
3.2. Results of the IFS Method with DT
A series of feature subsets was generated when the step size was set to 5 from the mRMR feature list and subjected to the IFS method to acquire the best features for classifying lymphoma-related genes and other genes and obtain the best number of features. The classification results using the different number of features are provided in Table S2. IFS curves were plotted by setting the number of features as the
[figure(s) omitted; refer to PDF]
805 features were used in the best DT classifier, which are the top 805 features in Table S1. Among them, 41 features were related to KEGG pathways, whereas the remaining 764 features were about GO terms. It is known that all GO terms can be divided into three groups: biological process (BP), cellular component (CC), and molecular function (MF). The distribution of 764 GO features on three groups is illustrated in Figure 3. It can be observed that BP GO terms were the most, followed by MF and CC GO terms.
[figure(s) omitted; refer to PDF]
3.3. Results of Classification Rules by Using the Optimal DT Classifier
The DT is a white-box model that provides clear decision rules and is beneficial for further analysis. Thus, we used these 805 features to construct a DT by using all samples. From such DT, 799 decision rules were extracted (Table S3). A detailed description of these rules is provided in “Discussion.”
4. Discussion
The functional enrichment annotations of lymphoma-associated genes were used to identify a group of functional enrichment terms, i.e., GO and KEGG pathway terms, and reveal the key biological effects distinguishing lymphoma-associated genes and other genes. On the basis of machine learning models, we identified a group of terms associated with lymphoma. The detailed discussion on these terms are shown as follows.
4.1. Functional Enrichment Terms Associated with Lymphoma-Associated Genes
The first identified functional enrichment term is GO:0042113, describing B-cell activation. Early in 2002, researchers from the University of California, Los Angeles, confirmed that the activation of B cells participates in the initiation and progression of lymphoma, such as in patients with HIV [39]. Further, similar results are validated in South Africa by researchers from the University of the Western Cape in 2018, indicating that B-cell activation is associated with the pathogenesis and progression of lymphoma [40]. Therefore, B-cell activation is an effective biological process associated with lymphoma.
The next identified functional enrichment term is GO:0044424, which describes a cellular component as the obsolete intracellular part and is now named as the intracellular anatomical structure. Although no direct report confirmed that any intracellular structure is specifically associated with the pathogenesis of lymphoma, structural variants associated with programmed cell death-associated proteins are specifically associated with Epstein–Barr virus-associated lymphomas [41]. This finding is consistent with our prediction.
The next identified GO term is the general GO term GO:0010955 that describes the negative regulation of protein processing. Such GO term summarizes any process associated with peptide bond cleavage frequency and protein maturation efficacy. According to recent publications, BAFF has been regarded as an important driver for B-cell non-Hodgkin lymphoma. BAFF and its pathway BAFF/BAFF-R pathway are processed by cleavage from the plasma membrane and transformation into a soluble form [42]. Therefore, functional protein processing, like bond cleavage, may also be essential to trigger lymphoma.
The next two identified GO terms are GO:0032764 (negative regulation of mast cell cytokine production) and GO:0002643 (regulation of tolerance induction). Early in 1982, a long-term in vitro culture of mast cells in mouse models confirmed that mast cell cytokines are associated with the growth and maturation of mast cells and that such cytokines are validated in T-cell lymphoma [43], confirming the correlations between mast cell cytokines and T-cell lymphoma. As for the regulation of tolerance induction, tolerance induction describes a physiological status in which immune cells do not react against antigens or external stimulations. During the pathogenesis of lymphoma, tolerance is quite common in multiple lymphoma subtypes especially for B-cell lymphoma [44–46].
Similarly, although not in the top, we identified KEGG pathways, like hsa04650 (natural killer cell-mediated cytotoxicity) and hsa05202 (transcriptional misregulation in cancer). Early in 1995, T lymphomas are reported to induce optimal natural killer cell-mediated cytotoxicity [47], revealing the correlations between such pathway and lymphoma. Recent reports [48–50] also validated that natural killer cells play an irreplaceable role during lymphoma pathogenesis. As for another KEGG pathway, i.e., transcriptional misregulation in cancer, researchers from the Massachusetts Institute of Technology summarized disease-associated transcriptional regulation and validated that T-cell lymphoma is associated with transcriptional regulations in 2013 [51], thereby validating our prediction.
4.2. Quantitative Rules for Functional Enrichment Terms Associated with Lymphoma-Associated Genes
Apart from the identification of lymphoma-associated biological processes, we established quantitative rules by using enriched functional terms. The detailed analyses on top features from three optimal rules are shown as follows.
The first rule involves 59 features. Here, we selected two features for discussion. The first selected feature is the GO term GO:0042113, describing B-cell activation. As we have discussed, such GO term is associated with lymphoma pathogenesis, thereby validating our prediction. The next feature is GO:0007568 (aging). Aging has been shown to be associated with the initiation and progression of lymphoma [52]. Therefore, predicting that aging is a determinative biological process associated with lymphoma is reasonable.
The next rule involves 44 features. Apart from B-cell activation (GO:0042113) and aging (GO:0007568), the GO term GO:0002705, the positive regulation of leukocyte-mediated immunity as a candidate to be associated with lymphoma, is identified. In 2019, researchers from Shanghai Rui Jin Hospital reported that leukocyte-mediated immune responses are associated with the pathogenesis of lymphoma, thereby validating our prediction. In addition, the negative regulation of mitophagy (GO:1901525) has been reported to be associated with lymphoma and this finding has also been supported by recent publications [53, 54]. Overall, such quantitative rule can help the identification of lymphoma-associated genes.
The third rule also includes multiple lymphoma-associated functional terms (GO terms and KEGG pathways). Except for shared GO terms with the abovementioned two rules, like GO:0042113 and GO:0007568, GO:0048539, as another predicted GO term, describes the bone marrow development and has been shown to be associated with lymphoma. According to recent publications, bone marrow biopsy has been regarded as one of the major methods for clinical diagnosis on lymphoma [55]. The development of bone marrow is tightly associated with the initiation and progression of lymphoma, thereby validating our prediction.
Overall, by using machine learning models, we identified a group of functional enrichment terms associated with lymphoma and established quantitative rules for lymphoma prediction. The prediction results that we presented can help promote the exploration on the fundamental pathological mechanisms for lymphoma and provide us a new tool to analyze the functional characteristics of complex diseases.
5. Conclusion
This study is aimed at identifying key GO terms and KEGG pathways for lymphoma-associated genes. A total of 805 key features and 799 quantitative rules were identified using a machine learning approach, which has been validated by research results in recent years. This study contributes to a deep understanding of the underlying pathological mechanisms of lymphoma and provides us with new tools to analyze the functional characteristics of the disease.
Authors’ Contributions
Qian Sun and Lin Bai contributed equally to this work.
[1] E. McLafferty, C. Hendry, A. Farley, "The lymphatic system," Nursing Standard, vol. 27 no. 15, pp. 37-42, DOI: 10.7748/ns2012.12.27.15.37.c9482, 2012.
[2] A. Carbone, C. Tripodo, C. Carlo-Stella, A. Santoro, A. Gloghini, "The role of inflammation in lymphoma," Advances in Experimental Medicine and Biology, vol. 816, pp. 315-333, DOI: 10.1007/978-3-0348-0837-8_12, 2014.
[3] G. J. Randolph, S. Ivanov, B. H. Zinselmeyer, J. P. Scallan, "The lymphatic system: integral roles in immunity," Annual Review of Immunology, vol. 35 no. 1, pp. 31-52, DOI: 10.1146/annurev-immunol-041015-055354, 2017.
[4] M. A. Swartz, "The physiology of the lymphatic system," Advanced Drug Delivery Reviews, vol. 50 no. 1-2,DOI: 10.1016/S0169-409X(01)00150-8, 2001.
[5] M. Hallek, T. D. Shanafelt, B. Eichhorst, "Chronic lymphocytic leukaemia," Lancet, vol. 391 no. 10129, pp. 1524-1537, DOI: 10.1016/S0140-6736(18)30422-7, 2018.
[6] A. Goyal, R. E. LeBlanc, J. B. Carter, "Cutaneous B-cell lymphoma," Hematology/Oncology Clinics of North America, vol. 33 no. 1, pp. 149-161, DOI: 10.1016/j.hoc.2018.08.006, 2019.
[7] J. O. Armitage, R. D. Gascoyne, M. A. Lunning, F. Cavalli, "Non-Hodgkin lymphoma," Lancet, vol. 390 no. 10091, pp. 298-310, DOI: 10.1016/S0140-6736(16)32407-2, 2017.
[8] C. Argyrou, K. Hatziagapiou, M. Theodorakidou, O. A. Nikola, S. Vlahopoulos, G. I. Lambrou, "The role of adiponectin, LEPTIN, and ghrelin in the progress and prognosis of childhood acute lymphoblastic leukemia," Leukemia & Lymphoma, vol. 60 no. 9, pp. 2158-2169, DOI: 10.1080/10428194.2019.1569230, 2019.
[9] M. Vockerodt, L. F. Yap, C. Shannon‐Lowe, H. Curley, W. Wei, K. Vrzalikova, P. G. Murray, "The Epstein–Barr virus and the pathogenesis of lymphoma," The Journal of Pathology, vol. 235 no. 2, pp. 312-322, DOI: 10.1002/path.4459, 2015.
[10] A. Salar, "Gastric MALT lymphoma and _Helicobacter pylori_," Medicina Clínica (Barcelona), vol. 152 no. 2, pp. 65-71, DOI: 10.1016/j.medcli.2018.09.006, 2019.
[11] M. S. Shin, H. S. Kim, C. S. Kang, W. S. Park, S. Y. Kim, S. N. Lee, J. H. Lee, J. Y. Park, J. J. Jang, C. W. Kim, S. H. Kim, J. Y. Lee, N. J. Yoo, S. H. Lee, "Inactivating mutations of CASP10 gene in non-Hodgkin lymphomas," Blood, vol. 99 no. 11, pp. 4094-4099, DOI: 10.1182/blood.V99.11.4094, 2002.
[12] J. Boultwood, "Ataxia telangiectasia gene mutations in leukaemia and lymphoma," Journal of Clinical Pathology, vol. 54 no. 7, pp. 512-516, DOI: 10.1136/jcp.54.7.512, 2001.
[13] P. E. Leone, M. Mendiola, J. Alonso, C. Paz-y-Miño, A. Pestaña, "Implications of a RAD54L polymorphism (2290C/T) in human meningiomas as a risk factor and/or a genetic marker," BMC Cancer, vol. 3 no. 1,DOI: 10.1186/1471-2407-3-6, 2003.
[14] R. Shi, S. N. Martins Filho, M. Li, A. Fares, J. Weiss, N. A. Pham, O. Ludkovski, V. Raghavan, Q. Li, D. Ravi, M. Cabanero, "BRAF V600E mutation and MET amplification as resistance pathways of the second-generation anaplastic lymphoma kinase (ALK) inhibitor alectinib in lung cancer," Lung Cancer, vol. 146, pp. 78-85, DOI: 10.1016/j.lungcan.2020.05.018, 2020.
[15] G. Lenz, R. E. Davis, V. N. Ngo, L. Lam, T. C. George, G. W. Wright, S. S. Dave, H. Zhao, W. Xu, A. Rosenwald, G. Ott, "OncogenicCARD11mutations in human diffuse large B cell lymphoma," Science, vol. 319 no. 5870, pp. 1676-1679, DOI: 10.1126/science.1153629, 2008.
[16] Gene Ontology Consortium, "Gene ontology consortium: going forward," Nucleic Acids Research, vol. 43 no. D1, pp. D1049-D1056, DOI: 10.1093/nar/gku1179, 2015.
[17] M. Kanehisa, Y. Sato, M. Kawashima, M. Furumichi, M. Tanabe, "KEGG as a reference resource for gene and protein annotation," Nucleic Acids Research, vol. 44 no. D1, pp. D457-D462, DOI: 10.1093/nar/gkv1070, 2016.
[18] J. Piñero, N. Queralt-Rosinach, A. Bravo, J. Deu-Pons, A. Bauer-Mehren, M. Baron, F. Sanz, L. I. Furlong, "DisGeNET: a discovery platform for the dynamical exploration of human diseases and their genes," Database: The Journal of Biological Databases and Curation, vol. 2015, 2015.
[19] P. Carmona-Saez, M. Chagoyen, F. Tirado, J. M. Carazo, A. Pascual-Montano, "GENECODIS: a web-based tool for finding significant concurrent annotations in gene lists," Genome Biology, vol. 8 no. 1,DOI: 10.1186/gb-2007-8-1-r3, 2007.
[20] M. B. Kursa, W. R. Rudnicki, "Feature selection with theBorutapackage," Journal of Statistical Software, vol. 36 no. 11,DOI: 10.18637/jss.v036.i11, 2010.
[21] H. Peng, F. Long, C. Ding, "Feature selection based on mutual information: criteria of max-dependency, max-relevance, and min-redundancy," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 27 no. 8, pp. 1226-1238, DOI: 10.1109/TPAMI.2005.159, 2005.
[22] H. Liu, R. Setiono, "Incremental feature selection," Applied Intelligence, vol. 9 no. 3, pp. 217-230, DOI: 10.1023/A:1008363719778, 1998.
[23] C. Cortes, V. Vapnik, "Support-vector networks," Machine Learning, vol. 20 no. 3, pp. 273-297, DOI: 10.1007/BF00994018, 1995.
[24] L. Breiman, "Random forests," Machine Learning, vol. 45 no. 1,DOI: 10.1023/A:1010933404324, 2001.
[25] S. R. Safavian, D. Landgrebe, "A survey of decision tree classifier methodology," Man and Cybernetics., vol. 21 no. 3, pp. 660-674, DOI: 10.1109/21.97458, 1991.
[26] F. Pedregosa, G. Varoquaux, A. Gramfort, V. Michel, B. Thirion, O. Grisel, M. Blondel, P. Prettenhofer, R. Weiss, V. Dubourg, J. Vanderplas, "Scikit-learn: machine learning in python," Journal of Machine Learning Research, vol. 12 no. 85, pp. 2825-2830, 2011.
[27] N. V. Chawla, K. W. Bowyer, L. O. Hall, W. P. Kegelmeyer, "SMOTE: synthetic minority over-sampling technique," Journal of Artificial Intelligence Research, vol. 16, pp. 321-357, DOI: 10.1613/jair.953, 2002.
[28] B. Matthews, "Comparison of the predicted and observed secondary structure of T4 phage lysozyme," Structure, vol. 405 no. 2, pp. 442-451, DOI: 10.1016/0005-2795(75)90109-9, 1975.
[29] X. Zhao, L. Chen, J. Lu, "A similarity-based method for prediction of drug side effects with heterogeneous information," Mathematical Biosciences, vol. 306, pp. 136-144, DOI: 10.1016/j.mbs.2018.09.010, 2018.
[30] L. Chen, S. Wang, Y. H. Zhang, J. Li, Z. H. Xing, J. Yang, T. Huang, Y. D. Cai, "Identify key sequence features to improve CRISPR sgRNA efficacy," IEEE Access, vol. 5, pp. 26582-26590, DOI: 10.1109/ACCESS.2017.2775703, 2017.
[31] Y. Yang, L. Chen, "Identification of drug–disease associations by using multiple drug and disease networks," Current Bioinformatics, vol. 17 no. 1, pp. 48-59, DOI: 10.2174/1574893616666210825115406, 2022.
[32] R. Kohavi, A study of cross-validation and bootstrap for accuracy estimation and model selection, in Proceedings of the 14th international joint conference on Artificial intelligence - Volume 2, Morgan Kaufmann Publishers Inc.: Montreal, 1995.
[33] L. Chen, Z. Li, S. Zhang, Y. H. Zhang, T. Huang, Y. D. Cai, "Predicting RNA 5-methylcytosine sites by using essential sequence features and distributions," BioMed Research International, vol. 2022,DOI: 10.1155/2022/4035462, 2022.
[34] S. Ding, D. Wang, X. Zhou, L. Chen, K. Feng, X. Xu, T. Huang, Z. Li, Y. Cai, "Predicting heart cell types by using transcriptome profiles and a machine learning method," Life, vol. 12 no. 2,DOI: 10.3390/life12020228, 2022.
[35] Z. Li, D. Wang, H. Liao, S. Zhang, W. Guo, L. Chen, L. Lu, T. Huang, Y. D. Cai, "Exploring the genomic patterns in human and mouse cerebellums via single-cell sequencing and machine learning method," Frontiers in Genetics, vol. 13, article 857851,DOI: 10.3389/fgene.2022.857851, 2022.
[36] W. Chen, L. Chen, Q. Dai, "iMPT-FDNPL: identification of membrane protein types with functional domains and a natural language processing approach," Computational and Mathematical Methods in Medicine, vol. 2021,DOI: 10.1155/2021/7681497, 2021.
[37] X. Li, L. Lu, L. Chen, College of Information Engineering, Shanghai Maritime University, Shanghai 201306, China, Department of Radiology, Columbia University Medical Center, New York 10032, USA, "Identification of protein functions in mouse with a label space partition method," Mathematical Biosciences and Engineering, vol. 19 no. 4, pp. 3820-3842, DOI: 10.3934/mbe.2022176, 2022.
[38] S. Tang, L. Chen, "iATC-NFMLP: identifying classes of anatomical therapeutic chemicals based on drug networks, fingerprints and multilayer perceptron," Current Bioinformatics, vol. 17,DOI: 10.2174/1574893617666220318093000, 2022.
[39] O. Martínez-Maza, E. C. Breen, "B-cell activation and lymphoma in patients with HIV," Current Opinion in Oncology, vol. 14 no. 5, pp. 528-532, DOI: 10.1097/00001622-200209000-00009, 2002.
[40] B. T. Flepisi, P. Bouic, G. Sissolak, B. Rosenkranz, "B-cell and T-cell activation in South African HIV-1-positive non-Hodgkin’s lymphoma patients," Southern African Journal of HIV Medicine, vol. 19 no. 1, article 809,DOI: 10.4102/sajhivmed.v19i1.809, 2018.
[41] K. Kataoka, H. Miyoshi, S. Sakata, A. Dobashi, L. Couronné, Y. Kogure, Y. Sato, K. Nishida, Y. Gion, Y. Shiraishi, H. Tanaka, "Frequent structural variations involving programmed death ligands in Epstein-Barr virus-associated lymphomas," Leukemia, vol. 33 no. 7, pp. 1687-1699, DOI: 10.1038/s41375-019-0380-5, 2019.
[42] S. Yang, J. Y. Li, W. Xu, "Role of BAFF/BAFF-R axis in B-cell non-Hodgkin lymphoma," Critical Reviews in Oncology/Hematology, vol. 91 no. 2, pp. 113-122, DOI: 10.1016/j.critrevonc.2014.02.004, 2014.
[43] Y. P. Yung, M. A. Moore, "Long-term In Vitro culture of murine mast cells. III. Discrimination of mast cells growth factor and granulocyte-CSF," Journal of Immunology, vol. 129 no. 3, pp. 1256-1261, 1982.
[44] P. Serafini, S. Mgebroff, K. Noonan, I. Borrello, "Myeloid-derived suppressor cells promote cross-tolerance in B-cell lymphoma by expanding regulatory T cells," Cancer Research, vol. 68 no. 13, pp. 5439-5449, DOI: 10.1158/0008-5472.CAN-07-6621, 2008.
[45] K. Wang, G. Wei, D. Liu, "CD19: a biomarker for B cell development, lymphoma diagnosis and therapy," Experimental Hematology & Oncology, vol. 1 no. 1, article 36,DOI: 10.1186/2162-3619-1-36, 2012.
[46] D. Nemazee, "Mechanisms of central tolerance for B cells," Nature Reviews Immunology, vol. 17 no. 5, pp. 281-294, DOI: 10.1038/nri.2017.19, 2017.
[47] A. B. Geldhof, G. Raes, M. Bakkus, S. Devos, K. Thielemans, P. de Baetselier, "Expression of B7-1 by highly metastatic mouse T lymphomas induces optimal natural killer cell-mediated cytotoxicity," Cancer Research, vol. 55 no. 13, pp. 2730-2733, 1995.
[48] S. E. Street, Y. Hayakawa, Y. Zhan, A. M. Lew, D. MacGregor, A. M. Jamieson, A. Diefenbach, H. Yagita, D. I. Godfrey, M. J. Smyth, "Innate immune surveillance of spontaneous B cell lymphomas by natural killer cells and gammadelta T cells," The Journal of Experimental Medicine, vol. 199 no. 6, pp. 879-884, DOI: 10.1084/jem.20031981, 2004.
[49] H. E. Kohrt, A. Thielens, A. Marabelle, I. Sagiv-Barfi, C. Sola, F. Chanuc, N. Fuseri, C. Bonnafous, D. Czerwinski, A. Rajapaksa, E. Waller, "Anti-KIR antibody enhancement of anti-lymphoma activity of natural killer cells as monotherapy and in combination with anti-CD20 antibodies," Blood, vol. 123 no. 5, pp. 678-686, DOI: 10.1182/blood-2013-08-519199, 2014.
[50] E. Liu, D. Marin, P. Banerjee, H. A. Macapinlac, P. Thompson, R. Basar, L. Nassif Kerbauy, B. Overman, P. Thall, M. Kaplan, V. Nandivada, "Use of CAR-transduced natural killer cells in CD19-positive lymphoid tumors," The New England Journal of Medicine, vol. 382 no. 6, pp. 545-553, DOI: 10.1056/NEJMoa1910607, 2020.
[51] T. I. Lee, R. A. Young, "Transcriptional regulation and its misregulation in disease," Cell, vol. 152 no. 6, pp. 1237-1251, DOI: 10.1016/j.cell.2013.02.014, 2013.
[52] C. Sarkozy, G. Salles, C. Falandry, "The biology of aging and lymphoma: a complex interplay," Current Oncology Reports, vol. 17 no. 7,DOI: 10.1007/s11912-015-0457-x, 2015.
[53] J. Xiong, L. Wang, X. C. Fei, X. F. Jiang, Z. Zheng, Y. Zhao, C. F. Wang, B. Li, S. J. Chen, A. Janin, R. P. Gale, "MYC is a positive regulator of choline metabolism and impedes mitophagy- dependent necroptosis in diffuse large B-cell lymphoma," Blood Cancer Journal, vol. 7 no. 7, article e582,DOI: 10.1038/bcj.2017.61, 2017.
[54] A. Sarkar, V. Gandhi, "Activation of ATM kinase by ROS generated during ionophore-induced mitophagy in human T and B cell malignancies," Molecular and Cellular Biochemistry, vol. 476 no. 1, pp. 417-423, DOI: 10.1007/s11010-020-03917-1, 2021.
[55] S. Lakhwani, D. Cabello-García, A. Allende-Riera, C. Cárdenas-Negro, J. M. Raya, M. T. Hernández-Garcia, "Bone marrow trephine biopsy in Hodgkin’s lymphoma. Comparison with PET-CT scan in 65 patients," Medicina Clínica (Barcelona), vol. 150 no. 3, pp. 104-106, DOI: 10.1016/j.medcli.2017.06.060, 2018.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
Copyright © 2022 Qiao Sun et al. This is an open access article distributed under the Creative Commons Attribution License (the “License”), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. https://creativecommons.org/licenses/by/4.0/
Abstract
Lymphoma is a serious malignant tumor that contains more than 70 different types and seriously endangers the body’s lymphatic system. The lymphatic system is the regulatory center of the immune system and is important in the immune response to foreign antigens and tumors. Studies showed that multiple genetic variants are associated with lymphoma but determining the pathogenic mechanisms remains a challenge. In the present study, we first applied the Gene Ontology (GO) and KEGG pathway enrichment analyses of lymphoma-associated and lymphoma-nonassociated genes. Next, the Boruta and max-relevance and min-redundancy feature selection methods were performed to filter and rank features. Then, features preselected and ranked using the incremental feature selection method were applied for the decision tree model to identify the best GO terms and KEGG pathways and extract classification rules. Results indicate that our predicted features, such as B-cell activation, negative regulation of protein processing, negative regulation of mast cell cytokine production, and natural killer cell-mediated cytotoxicity, are associated with the biological process of lymphoma, consistent with those of recent publications. This study provides a new perspective for future research on the molecular mechanisms of lymphoma.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
Details



1 Department of Ophthalmology, Shanghai General Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai 200080, China; National Clinical Research Center for Eye Diseases, Shanghai 20080, China; Shanghai Key Laboratory of Ocular Fundus Diseases, Shanghai 200080, China; Shanghai Engineering Center for Visual Science and Photomedicine, Shanghai 200080, China; Shanghai Engineering Center for Precise Diagnosis and Treatment of Eye Diseases, Shanghai 20080, China
2 Eye School of Chengdu University of TCM, Chengdu 611137, China; Key Laboratory of Sichuan Province Ophthalmopathy Prevention & Cure and Visual Function Protection, Chengdu 611137, China
3 School of Life Sciences, Shanghai University, Shanghai 200444, China