1. Introduction
1.1. Structure–Property Modeling and Topological Descriptors
Quantitative structure–property relationship (QSPR) studies [1] play a vital role in theoretical chemistry, enabling the estimation of thermodynamic and physicochemical properties of molecular structures, and particularly, organic compounds of these predictive models are built upon advanced mathematical and computational methodologies [2]. The origins can be traced back to Harold Wiener [3], who introduced the concept of the “path number”, representing the sum of pairwise distances, to estimate the boiling points of alkanes. This measure was later formalized as the Wiener index in graph theory. Today, QSPR modeling relies on structure-based molecular descriptors [4], which provide the necessary mathematical frameworks. Among these descriptors, topological indices—also referred to as graph-related molecular invariants—have been widely studied [5]. These invariants transform the molecular structure [6], excluding hydrogen atoms, into numerical values that capture essential chemical characteristics. To accurately predict physicochemical properties such as boiling point and heat of formation [7], graphical invariants utilize regression equations [8] that encode the structural characteristics of a compound and its underlying chemical information. For diverse applicability of graph theory and network analysis in other areas such as communication and mobile networks, we refer to [9,10,11,12].
Graphical invariants can be categorized into various types based on the structural properties they incorporate, including degree-related descriptors [13,14], distance-based indices [15], spectral descriptors derived from graph matrices [16], and counting-related polynomials and invariants [17]. With the continuous introduction of new invariants [18], many of them lack substantial chemical relevance [4]. Addressing this issue requires strict criteria for evaluating newly proposed descriptors. Gutman and Tošović [19] highlighted that the proliferation of graphical invariants without rigorous assessment has resulted in an excessive number of descriptors, many of which are unnecessary. This underscores the need to systematically examine emerging families of graphical descriptors for their effectiveness in structure–property modeling, ensuring that only the most efficient ones are advanced while eliminating those that do not contribute meaningfully.
Recent research in mathematical chemistry has increasingly focused on evaluating graphical invariants to determine their effectiveness in predicting physicochemical and thermodynamic properties. This process involves filtering out less reliable descriptors while highlighting those with superior predictive power. The groundwork for this comparative approach was laid by Gutman and Tošović [19], whose study emphasized the importance of systematically assessing molecular descriptors. Their methodology was later extended by Malik et al. [20], who applied it to benzenoid hydrocarbons (BHs), expanding beyond the initial work on isomeric octanes. Additionally, Hayat et al. [21] explored degree-based invariants for their potential to estimate the total -electronic energy in BHs. This line of research was further developed in the context of distance-based descriptors [22,23], and subsequent investigations have examined the predictive power of eigenvalue-related graphical invariants [24].
This paper studies a novel family of graphical invariants known as connection-based graphical invariants and conduct a comparative testing to computer their potential to predict PCPs of BHs. Following Gutman and Tošović [19], the standard heat of formation and normal boiling point were chosen to represent PCPs. Some new connection-based indices have also been proposed. A computer-dependent computational method is presented to calculate all existing connection-based invariants and contemporary statistical tools such as multiple regression analysis are employed afterwards to rule out insignificant indices while putting forward the ones which deserve further attention in QSPR models. The experimental analysis showcased that the general sum-connectivity connection index outperforms all the connection-based invariants, whereas the well-studied two Zagreb connection indices deliver poor performance. The results in this paper contribute to the avoidance of proliferation of graphical invariants.
1.2. Coumarin-Related Compounds
Coumarins are crystalline, colorless polyphenolic compounds classified under oxygenated heterocyclic substances. These compounds were first discovered by Vogel in 1820, who isolated them from the Fabaceae plant Dipteryx odorata Willd., commonly known as “coumaroun” [25]. Oxygenated heterocyclic compounds are a group that includes furan derivatives, containing four carbon atoms, and pyran derivatives, which have five carbon atoms. Furan derivatives are uncommon in plants, but pyran derivatives are more widespread, forming the backbone of numerous compounds like ketones, including -pyrones and -pyrones. When pyran derivatives fuse with benzene in plants, they yield secondary metabolites known as benzo--pyrones (coumarins) and benzo--pyrones (chromones) [26].
Coumarin (1,2-benzopyrone or 2H-1-benzopyran-2-one) and its derivatives are plant-derived compounds that can be found as glycosides (heterosides) or in their unbound form. To date, nearly 800 naturally occurring coumarin derivatives have been discovered across approximately 600 genera within over 100 plant families [27]. These compounds are commonly found in seeds, roots, and leaves [28], with significant concentrations in plant families like Rutaceae and Apiaceae, which belong to the Dicotyledonae class of the Spermatophyta division. While the majority of natural coumarins are produced by vascular plants, certain types, such as novobiocin, coumermycin, and aflatoxin, are generated by microbial sources [29].
Due to their diverse biological activities, coumarin derivatives have attracted significant attention in recent years. Research has highlighted their potential in various therapeutic areas, including antitumor applications [30], photochemotherapy, anti-HIV activity [31], antibacterial and antifungal properties [32], anti-inflammatory effects [33], and anticoagulant activity through the inhibition of VKOR (vitamin K epoxide reductase) [34]. They also exhibit triglyceride-lowering effects [35] and act as central nervous system stimulants [36]. Hydroxycoumarins, in particular, are noted for their strong antioxidant capabilities, protecting against oxidative stress by neutralizing reactive oxygen species [37]. Furthermore, coumarins with reduced estrogenic activity have been identified, enabling their use in alleviating menopausal symptoms [38]. On the other hand, some coumarin derivatives are employed as flavor enhancers in tobacco products, as noted in [39].
Recently, Timmanaikar et al. [40] employed certain graphical indices in structure–property modeling of coumarin and related compounds. Using several degree-based molecular descriptors, including the Balban index and connective eccentric index (CEI), the study models various physicochemical properties of these compounds, such as boiling point and vapor pressure. Findings reveal that the Balban index and CEI are particularly effective in predicting these properties with high accuracy, highlighting their potential as robust tools in the design of anti-cancer drugs. This work underscores the relevance of topological indices in drug discovery, facilitating efficient and cost-effective molecular analysis. One of the limitations of their work is that their study’s reliance on specific molecular descriptors, like the Balban index and CEI, limits its applicability to broader descriptor classes or additional pharmacological properties.
In this paper, we essentially extended their work to address the aforementioned limitation.
2. Results and Discussion
This section delivers a detailed analysis of MLCs in Section 3.3.
The very first unexpected outcome that we observe is that the two multiplicative Zagreb connection descriptors, i.e., , showcase a considerably poor performance as the MLC for both is <, which in structure–property studies is considered very poor. Thus, the study by Javaid et al. [41], who proposed the two multiplicative Zagreb connection invariants, has no meaningful applicative potential from a structure–property studies perspective. We discourage authors from further investigating these two descriptors. Furthermore, the MLC value for the 2nd Zagreb connection invariant is <, which from the perspective of mathematical investigation is feasible; however, it does not find popularity among researchers.
Notably, the performance of the 1st Zagreb connection invariant with MLC seems reasonable.
Secondly, among the newly proposed connection-based invariants, we observe that the Randić and the general Randić connection descriptors with are less-efficient. However, the general Randić connection descriptor delivers a reasonable performance with . The only other newly proposed connection invariant is the augmented Zagreb connection invariant, i.e., with MLC . Apart from these connection invariants that deliver poor performance, all other newly proposed connection invariants showcase significantly improved efficiency. For instance, the general sum-connectivity connection index with records the strongest MLC of and this connection index is among the newly proposed graphical connection-dependent invariants.
The strong potential of the newly proposed connection invariants, such as , motivates us to look for other potential connection invariants. Our experiment shows that the general sum-connectivity connection index with , the sum-connectivity connection descriptor , the atom-bond-connectivity connection index, the geometric-arithmetic connection invariant, and the arithmetic–geometric connection index are among the top five connection-based graphical invariants for estimating the physicochemical characteristics of BHs. It is noteworthy to observe that all of these five best connection descriptors are the newly proposed connection-based invariants, which ultimately justify considering introducing new connection graphical invariants. Table 1 depicts the list of the five best connection-related graphical invariants.
Next, we conduct a detailed statistical analysis of the five best connection-related graphical invariants. First, we put forward appropriate data-fitting multiple linear regression (MLR) models between the PCPs and and the five best connection-related graphical invariants. Table 2 delivers the most appropriate data-fitting MLRs with 95% confidence intervals for the intercept and the two X-variables. Moreover, Table 2 computes the standard error of estimation and the determination coefficient for the top five connection-related graphical invariants.
We performed the leave-one-out cross validation (LOOCV) method on the data and generated LOOCV root mean squared errors corresponding to the predictions in Table 2. Next, we deliver these cross-validation results.
Table 3 reports for the predictions by the top five connection indices in Table 2. Note that in Table 3 are fairly close to s values in Table 2 which shows the efficiency of our predictive models.
Figure 1 present the plots for scattering data between the PCPs and and the five best connection-related graphical invariants.
Note that the general sum-connectivity connection index () has been identified as the best predictor among the tested topological descriptors. This superior predictive power can be attributed to several key theoretical and structural factors:
-
Balanced Sensitivity to Molecular Connectivity
Unlike degree-based or distance-based indices, incorporates both local and global molecular connectivity by summing the inverse square roots of vertex connections. This ensures a smooth variation across different molecular structures, preventing excessive dependence on extreme values (as seen in multiplicative indices like and ).
-
Mathematical Robustness and Stability
The formula for
allows for flexible tuning via the parameter , enabling it to capture nonlinear structure–property relationships more effectively than rigid indices like the Zagreb connection indices. Empirical results indicate that the case provides optimal correlation, likely due to its ability to moderate the influence of high-degree vertices while maintaining significant contributions from lower-degree vertices.
-
Strong Correlation with van der Waals Interactions
The sum-connectivity framework aligns well with intramolecular interactions, particularly van der Waals forces and dispersion interactions. These weak but cumulative effects influence key physicochemical properties like boiling points and heat of formation, which were the primary test variables in our study.
-
Empirical Evidence from Regression Models
-
achieved the highest multiple correlation coefficient () across all tested indices.
-
Regression analysis demonstrated that quadratic and cubic models using provided the best predictive accuracy, suggesting that its mathematical form aligns well with nonlinear structure–property relationships.
2.1. Structure–Property Modeling of Coumarin-Related Anti-Cancer Drugs
2.1.1. Coumarin-Related Compounds as Potential Anti-Cancer Drugs
Coumarin-related compounds, derived from the benzopyrone family, have garnered significant attention in medicinal chemistry for their diverse biological activities [29], particularly their anti-cancer properties. These compounds exhibit a range of pharmacological actions, including anti-proliferative, pro-apoptotic, and anti-angiogenic effects, which make them promising candidates for cancer therapy [30]. The anti-cancer potential of coumarins is attributed to their ability to modulate various molecular targets, such as inhibiting tyrosine kinases, disrupting cell cycle progression, and inducing oxidative stress in cancer cells. Additionally, coumarins can act as chemosensitizers, enhancing the efficacy of conventional chemotherapy drugs and overcoming drug resistance in certain cancer types.
Coumarins also show potential for selective toxicity, targeting cancer cells while sparing healthy tissues. Their versatility allows for structural modifications, enabling the development of derivatives with enhanced potency and specificity against different cancer types [30]. Examples include esculetin and umbelliferone, which exhibit notable anti-cancer activity through the inhibition of cell signaling pathways and the suppression of metastasis. Furthermore, coumarin derivatives have demonstrated synergistic effects when combined with other anti-cancer agents, highlighting their utility in combination therapy strategies. These compounds represent a promising area of research for developing novel and effective treatments for a wide range of malignancies.
Coumarins exhibit a variety of structural types that contribute to their diverse biological activities [26]. Simple coumarins, the most basic form, consist of a benzopyrone core and are widely found in nature. Furanocoumarins, characterized by a fused furan ring, are known for their photoreactive properties and are often studied for their anti-cancer and anti-inflammatory effects. Pyranocoumarins, with an additional pyran ring, exhibit enhanced lipophilicity and improved bioavailability [28], making them suitable for pharmaceutical applications. Pyrone-substituted coumarins, in which the lactone moiety is modified, demonstrate unique biochemical interactions that broaden their therapeutic potential. These structural variations allow for extensive functional diversity, enabling tailored applications in the design of anti-cancer drugs.
In this paper, we consider 25 contemporary anti-cancer drugs and conduct structure–property modeling of their physicochemical properties. These drugs and their transformed molecular graphs are delivered in Table 4, Table 5 and Table 6. Moreover, Table 4 shows simple coumarin-related compounds, whereas Table 5 (resp. Table 6) records furanocoumarins (resp. pyranocoumarins and pyrone-substituted coumarins) considered in this work. The data of these coumarin compounds were taken from Küpeli et al. [42] who conducted a structure–property study on these compounds. For more on the chemistry of these compounds, we refer to [43,44].
For our structure–property predictive modeling, we consider a diverse range of physicochemical properties, including boiling point (BP) in °C at 760 mmHg, molar volume (MV) in m3/mol, enthalpy of vaporization (E) in kJ/mol, density (D) in g/cm3, surface tension (ST) cm3, vapor pressure (VP) in mmHg at 25 °C, molar refractivity (MR) in A2, index of refraction (IR) in cm3, flash point (FP) in °C, polarizability (P) in dyne/cm, and polar surface area (PSA) cm3. We retrieved the experimental data of these physicochemical properties from the open source
2.1.2. Structure–Property Modeling of Physicochemical Properties
Now, we conduct a detailed correlation and regression analysis of the coumarin-related drugs from Table 4, Table 5 and Table 6 with their physicochemical properties in Table 7. We employ the top five connection-based topological descriptors from Table 1 as our predictors.
With three types of regression models, linear, quadratic, and cubic, we evaluated the relationship between molecular descriptors and their hyper-counterparts with 11 essential physicochemical properties of anti-cancer drugs derived from coumarins. The objective of this study was to determine how well these descriptors could predict properties that are crucial to the effectiveness and stability of anti-cancer drugs. In order to evaluate the quality of each model, two key statistical parameters were used: the correlation coefficient (r), a measure of the strength and direction of linear relationships between variables, and the standard error of estimate (s), a measure of how accurate regression models are. As linear models evolved into cubic models, their complexity and accuracy increased.
The linear regression model provided insight into the relationship between molecular descriptors and drug properties, with molar refractivity (MR) and polarizability (P) showing the highest correlation coefficients. According to Table 8, these properties have a direct and significant linear relationship, suggesting their predictability through linear modeling.
Quadratic regression improves fitting compared to linear models for several properties, such as molar volume (MV) and polarizability (P). According to Table 9, these improvements suggest that the physicochemical properties of these drugs change nonlinearly with changes in molecular descriptors. By including squared terms, the model can capture a wider range of dynamics and variances in data that are not captured by linear models.
As shown in Table 10, the cubic regression model, which incorporates third-degree terms, provides the highest level of precision. It was particularly effective in capturing intricate dependencies on flash point (FP) and molar volume (MV), where higher-order interactions between molecular descriptors are evident. Based on its advanced fit for such properties, the cubic model suggests that some physicochemical traits are influenced by complex interactions that are only captured by higher-order models.
Gradually moving from linear to cubic regressions provides a deeper understanding of drug characteristics. They provide crucial insight into drug efficacy and stability, which is essential for designing optimized anti-cancer drugs. Advances in statistical techniques make it possible to comprehend and predict pharmaceutical behaviors more accurately. This methodology enhances drug development by exploring molecular descriptors and drug properties. It demonstrates the importance of applying advanced statistical tools to pharmaceutical research to improve drug efficacy and patient outcomes by predicting and refining drug properties based on molecular descriptors.
2.1.3. Regression Models for FP, MR, MV, and P
The best cubic regression model is obtained with the geometric-arithmetic connection index:
This model is based on the arithmetic–geometric connection index: Coumarins and their related compounds represent the best overall model for anti-cancer drugs. They are shown in Figure 2.The best cubic regression model is obtained with the sum-connectivity connection index:
The best linear and quadratic regression model is obtained with the sum-connectivity connection index: Coumarins and their related compounds represent the best overall model for anti-cancer drugs, as shown in Figure 3.The best cubic regression model is obtained with the sum-connectivity connection index:
The best linear and quadratic regression model is obtained with the sum-connectivity connection index: Coumarins and their related compounds represent the best overall model for anti-cancer drugs.They are shown in Figure 4.
The best cubic, linear, and quadratic regression model is obtained with the sum-connectivity connection index, as follows:
Coumarins and their related compounds represent the best overall model for anti-cancer drugs.The regression models are shown in Figure 5.
Although applying these results in drug discovery pipelines demands a separate study, we recall a seminal work by Estrada et al. [45], which addresses this gap of structure–property modeling by graphical descriptors and drug discovery research. The reader is referred to this work for an illustration of the role of graphical descriptors in drug discovery research.
3. Materials and Methods
3.1. Mathematical Preliminaries
A graph is a pair in which is the vertex set and . The valency/degree of a vertex is defined as . The distance/geodesic between two vertices has the definition , where is the length (number of edges traversing) by the path (chain of vertices connecting x to z). Based on the geodesic, we define the connection of vertex as , i.e., the number of vertices at distance two from x.
A graphical invariant is said to be connection-based if it is structured on the vertices’ connection. The next subsection surveys all the existing connection invariants. Some new connection-based indices have also been put forward.
Connection-Based Graphical Invariants
Based on the connection of vertices, the first two connection-based graphical descriptors were proposed by Ali and Trinajstić [46]. They defined the first Zagreb connection index as follows:
(1)
They also introduced a degree-connection-based descriptor called the modified first Zagreb connection index. It is defined as Ali et al. [47] studied of certain T-sum graphs. Ali and Trinajstić [46] also studied the applicability of these descriptors in cheminformatics. Immediately after its conception, Ducoffe et al. [48] derived extremal graphs corresponding to . Tang et al. [49] in 2019 put forward the second Zagreb connection index as follows:(2)
They proved some results corresponding to for certain derived graphs, such as semi-total point/line graph etc. Cao et al. [50] studied molecular graphs with respect to some operation for Zagreb connection indices. Javaid et al. [41] delivered the multiplicative version of two Zagreb connection invariants. They are defined as follows:(3)
(4)
Moreover, these multiplicative Zagreb connection invariants were further investigated for wheel-related graphs.Note that Diudea et al. ([8], Chapter 4) presented a rationale to be followed while proposing a graphical descriptor. It includes a list of the following desirable attributes for constructing a graphical descriptor:
1. . Direct structural interpretation;
2. . Good correlation with at least one property;
3. . Good discrimination of isomers;
4. . Locally defined;
5. . Generalizable to higher analogs;
6. . Linearly independent;
7. . Simplicity;
8. . Not trivially related to other indices;
9. . Efficiency of construction;
10.. Based on familiar structural concepts;
11.. Show a correct size dependence;
12.. Gradual change with gradual change in structures.
We observed that most of the existing connection-based graphical descriptors failed to comply with these aforementioned attributes. For instance, the modified first Zagreb connection index and the two multiplicative Zagreb connection indices fail to comply with attributes such as numbers 4, 5, and 6. Building upon this limitation, we introduce novel connection graphical indices efficiently complying with the theoretical foundation delivered by Diudea et al. ([8], Chapter 4).
Note that connection-based descriptors are introduced based on their degree-based counterparts; see [46]. Following this, we further enhanced this study by proposing connection descriptors based on other degree-based graphical invariants. Note that the references cited here are regarding the corresponding degree-based graphical invariants.
The following expression delivers the Randić [51] connection index of .
(5)
For , the general Randić [52] connection index of has the following mathematical formula:(6)
Note that .Next, we put forward the sum-connectivity [53] connection index
(7)
and the general sum-connectivity [54] connection index(8)
where . Note that .The atom-bond connectivity [55] connection index possesses the defining structure:
(9)
Next, we introduce the augmented Zagreb [56] connection index as follows:(10)
The geometric–arithmetic [57] and the arithmetic–geometric [58] connection index has the mathematical expression
(11)
and(12)
respectively.The reduced Randić [59] connection and the reduced reciprocal Randić [60] connection index are defined as
(13)
and(14)
respectively.Finally, the Sombor [61] connection index is defined as
(15)
In order to explain these connection-based indices, we consider the example of a chemical graph H in Figure 6.
The graph H in Figure 6 has order 8. The connection of the vertex , for instance, is calculated as
That is, there are three vertices which are at distance 2 from . Thus, we have . In a similar fashion, by calculating the connections of all the vertices of H, let us calculate the connection index as follows: For graph H in Figure 6, its AZI index can be calculated as follows:Figure 7 explains the workflow of employing graphical descriptors in structure–property modeling.
Next, we deliver computational details applied in this study.
3.2. Computational Methods
This section is dedicated to presenting a computer-dependent computing technique to calculate connection-based graphical indices presented in Section 3.1.
The method makes use of three software packages simultaneously. This includes a computational chemistry software called HyperChem (version 8.0) [62], a mathematical platform to conduct matrix analysis, i.e., MatLab (version R2024b) [63], and TopoCluj (version 1.1) [64], a molecular topology platform. HyperChem is a comprehensive molecular modeling and computational chemistry software widely used in academic and industrial research. Its functionalities span various aspects of molecular modeling, including molecular mechanics, quantum chemistry, molecular dynamics, and visualization. For instance, regarding molecular modeling (resp. quantum chemistry), it performs model building, 3D visualization, etc. (resp. Density Functional Theory (DFT), ab initio methods, etc.). On the other hand, for molecular mechanics (resp. spectroscopy), its ability to conduct energy minimization (resp. UV-Vis and IR spectra, NMR spectroscopy, etc.) possesses significant efficiency. TopoCluj is a specialized software designed for calculating topological descriptors from topological matrices and polynomials. These descriptors are essential in the study of molecular characteristics as well as structures, particularly in the field of computational chemistry and molecular graph theory. MATLAB (short for “Matrix Laboratory”) is an interactive user-friendly environment and a high-level programming language delivered by MathWorks, primarily used for algorithmic development, data analysis, numerical visualization, and numerical computing.
Next, we provide a rationale for selecting the three platforms, i.e., HyperChem, MATLAB, and TopoCluj:
-
HyperChem: HyperChem is a widely used computational chemistry software known for its robust molecular modeling and visualization tools. Unlike other molecular modeling software (such as Gaussian (version 16) or Spartan (version 9)), HyperChem offers the following:
-
User-friendly interface for building and optimizing molecular structures.
-
Real-time visualization of molecular properties and transformations.
-
Efficient quantum mechanics and molecular mechanics calculations, which are particularly useful for cheminformatics applications.
-
MATLAB: MATLAB was chosen for its advanced numerical computing and matrix operations, which are essential for processing topological descriptors. Compared to alternatives like Python 3.13.2 (NumPy, SciPy) or R 4.4.2, MATLAB R2024b offers:
-
Highly optimized built-in matrix operations, crucial for computing large-scale graph-based descriptors.
-
Statistical and regression modeling capabilities, enabling precise correlation analysis.
-
Seamless integration with other scientific tools, ensuring flexibility in extending the analysis.
-
TopoCluj: TopoCluj is a specialized molecular topology software designed for calculating topological indices from molecular graphs. It was preferred over general-purpose tools like ChemOffice or Open Babel due to the following:
-
Dedicated algorithms for computing topological descriptors, reducing computational complexity.
-
Efficient processing of molecular graphs, making it highly suited for cheminformatics applications.
-
Compatibility with standard cheminformatics workflows, ensuring consistency in descriptor computations.
Here, we deliver our proposed 3-step computational method to compute connection-related descriptors for a given molecular graph :
Step 1. Use the HyperChem drawing module to construct a 3D molecular graph of . It delivers a file with the .hin extension.
Step 2. Feed the .hin file to TopoCluj to compute the distance matrix of and generate the .m file corresponding to the .hin file.
Step 3. Compute all connection-related descriptors (from Section 3.1) by inputting .m to MatLab and employing our code written in MatLab.
Our step-by-step computational method is depicted in Figure 8.
We have made our MatLab code public by employing GitHub platform. Click the GitHub link in order to access all the data.
3.3. Data Analysis
This section delivers all the data, their usage, and implications in structure–property modeling.
The first step is to select test properties as representatives of physicochemical properties. Following the seminal work of Gutman and Tošović [19], the normal boiling point and standard heat-of-formation were selected as test physicochemical characteristics. The selection of is justified, as it represents van der Waals/intramolecular-type reciprocations. Moreover, the justification for opting is that the standard heat-of-formation delivers representation for thermochemical characteristics. Note that of a substance is the temperature at which its vapor pressure equals atmospheric pressure (1 atmosphere or 101.3 kPa) at sea level. Each substance has a unique normal boiling point depending on its molecular properties. Moreover, is the change in enthalpy when one mole of a compound is formed from its constituent elements in their standard states under standard conditions (usually 298 K and 1 atm pressure).
Next, as representatives of benzenoid hydrocarbons (BHs), we select 22 lower BHs as our test molecules. This has previously been performed by Hayat and Khan [65] and Hayat et al. [22]. Note that selecting lower BHs for this kind of testing is motivated by Lučić et al. [66], who considered 30 lower BHs for determining the predictive ability of two degree-based graphical descriptors for the total -electronic energy of BHs. We have considered the lower 22 derivatives because of the limitation of public availability of the experimental data of and . Hayat and Khan selected lower 22 BHs in their comparative analysis to test the quality of eigenvalue-related degree descriptors. In addition, Hayat et al. [22] opted for 22 lower BHs to investigate the prediction ability of distance-related graph-theoretic invariants for physicochemical characteristics of BHs. The close correlation values of both and for the initial 22 members of BHs strengthen the justification for their selection in this study. Moreover, we find the number 22 sufficient in order to validate our statistical inferences. The experimental data of and for some BHs were retrieved from NIST’s standard data repository [67], while we consulted Allison and Burgess [7], Dias [68], and Nikolić et al. [69] for the remaining BHs.
Figure 9 showcased the graphical structures of the 22 initial members of BHs selected as test molecules. Since these are just the graphical representations of the 3D molecular structure of BHs, the aromaticity is omitted. The next step is to compute numerical values of the connection-related graphical invariants in Section 3.1. In order to do that, the computational method explained in Section 3.2 has been employed. Moreover, Table 11 delivers experimental values of both and for lower 22 BHs. The implementation of the proposed method in Section 3.2 for the first/second Zagreb and multiplicative Zagreb connection invariants delivers the data in Columns 4–7. Although we omit the data for the other connection invariants, it is noteworthy to say that those indices can be computed similarly.
The final step of this section is to employ the multiple linear correlation (MLC) between a given connection index and the two chosen PCPs, i.e., and . We compute , i.e., the multiple linear correlation between , , and a connection invariant by using the data in Table 11. The data analysis toolpack of MS Excel is utilized for this computation. Table 12 delivers the data of MLC values. Note that the general sum-connectivity and Randić connection indices has a generic parameter . Thus, for a meaningful analysis, we select as test values of .
In the next section, we deliver a detailed analysis of the data in Table 12 and mention the top five best connection-based graphical invariants for predicting PCPs of BHs, meanwhile mentioning the ones that do not deserve further attention from the researchers.
4. Conclusions
4.1. Contributions
-
Introduced novel connection-based graphical invariants for predicting physicochemical properties (PCPs).
-
Demonstrated superior performance of the general sum-connectivity index over traditional indices.
-
Applied models to benzenoid hydrocarbons (BHs) and coumarin-related anti-cancer drugs, advancing QSPR modeling methods.
4.2. Study Implications
-
Enhanced predictive accuracy for boiling points, heat-of-formation, and key PCPs.
-
Offered scalable tools for drug development and molecular screening.
-
Provided a foundation for cheminformatics applications in material science and pharmaceuticals.
4.3. Limitations
-
Descriptor Selection Bias: The focus on connection-based descriptors may underrepresent quantum chemical and steric effects, requiring hybrid models for broader applicability.
-
Dataset Constraints: This study is limited to benzenoid hydrocarbons and coumarins, restricting generalizability to other chemical classes.
-
Modeling Assumptions: Standard environmental conditions and smooth regression models may not fully capture real-world molecular behavior.
-
Potential Overfitting: The small dataset may lead to overfitting in statistical models, necessitating regularization techniques.
4.4. Future Study
-
Expand datasets to include diverse molecular structures.
-
Explore additional properties like toxicity, solubility, and pharmacokinetics.
-
Integrate machine learning for enhanced predictive performance.
-
Investigate real-world applications in drug discovery and advanced material design.
Conceptualization, S.H. and S.W.; methodology, S.H. and S.W.; software, S.W.; validation, S.H.; formal analysis, S.W.; investigation, S.H.; resources, S.W.; data curation, S.H. and S.W.; writing—original draft preparation, S.H.; writing—review and editing, S.W.; visualization, S.W.; supervision, S.W. and S.H.; project administration, S.W. and S.W.; funding acquisition, S.W. All authors have read and agreed to the published version of the manuscript.
Not applicable.
Not applicable.
Datasets generated or analyzed during the current study are publicly available at
The authors are grateful to all the reviewers for their remarks and constructive criticism which significantly improved the submitted version of the paper.
The authors declare no conflicts of interest.
Footnotes
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
Figure 1. Plot for scattering data between the PCPs [Forumla omitted. See PDF.] and [Forumla omitted. See PDF.] and the five best connection-related graphical invariants.
Figure 6. A graph [Forumla omitted. See PDF.] with the vertex set [Forumla omitted. See PDF.] and edge set [Forumla omitted. See PDF.].
Figure 9. The 2D graphs of the 22 initial members of BHs selected as test molecules.
The top five connection-related graphical invariants.
Position | Connection-Based Index | MLC Value |
---|---|---|
1 | | |
2 | | |
3 | | |
4 | | |
5 | |
The most appropriate data-fitting MLR models for the top 5 connection-related graphical invariants and the chosen PCPs
Connection Index | MLR Model | Statistics |
---|---|---|
| | |
| | |
| | |
| | |
| | |
The leave-one-out cross validation (LOOCV) root mean squared errors for the top five connection indices.
Descriptor | |
---|---|
| 0.2800 |
| 0.84530 |
| 1.5162 |
| 2.5931 |
| 2.6490 |
Chemical structures and their corresponding molecular graphs for certain simple coumarins.
Cancer Drug | Molecular Graph | Cancer Drug | Molecular Graph |
---|---|---|---|
[Image omitted. Please see PDF.] | [Image omitted. Please see PDF.] | [Image omitted. Please see PDF.] | [Image omitted. Please see PDF.] |
[Image omitted. Please see PDF.] | [Image omitted. Please see PDF.] | [Image omitted. Please see PDF.] | [Image omitted. Please see PDF.] |
[Image omitted. Please see PDF.] | [Image omitted. Please see PDF.] | [Image omitted. Please see PDF.] | [Image omitted. Please see PDF.] |
[Image omitted. Please see PDF.] | [Image omitted. Please see PDF.] | [Image omitted. Please see PDF.] | [Image omitted. Please see PDF.] |
Some furanocoumarins and their transformed molecular graphs.
Cancer Drug | Molecular Graph | Cancer Drug | Molecular Graph |
---|---|---|---|
[Image omitted. Please see PDF.] | [Image omitted. Please see PDF.] | [Image omitted. Please see PDF.] | [Image omitted. Please see PDF.] |
[Image omitted. Please see PDF.] | [Image omitted. Please see PDF.] | [Image omitted. Please see PDF.] | [Image omitted. Please see PDF.] |
[Image omitted. Please see PDF.] | [Image omitted. Please see PDF.] | [Image omitted. Please see PDF.] | [Image omitted. Please see PDF.] |
[Image omitted. Please see PDF.] | [Image omitted. Please see PDF.] | [Image omitted. Please see PDF.] | [Image omitted. Please see PDF.] |
Chemical structures of some pyranocoumarins and pyrone-substituted coumarins.
Cancer Drug | Molecular Graph | Cancer Drug | Molecular Graph |
---|---|---|---|
[Image omitted. Please see PDF.] | [Image omitted. Please see PDF.] | [Image omitted. Please see PDF.] | [Image omitted. Please see PDF.] |
[Image omitted. Please see PDF.] | [Image omitted. Please see PDF.] | [Image omitted. Please see PDF.] | [Image omitted. Please see PDF.] |
[Image omitted. Please see PDF.] | [Image omitted. Please see PDF.] | [Image omitted. Please see PDF.] | [Image omitted. Please see PDF.] |
[Image omitted. Please see PDF.] | [Image omitted. Please see PDF.] | [Image omitted. Please see PDF.] | [Image omitted. Please see PDF.] |
[Image omitted. Please see PDF.] | [Image omitted. Please see PDF.] |
The physicochemical properties of the selected coumarin-related anti-cancer drugs.
Drugs | BP | MV | E | D | ST | VP | MR | IR | FP | P | PSA |
---|---|---|---|---|---|---|---|---|---|---|---|
Alternariol | 384.6 | 186.7 | 63.3 | 1.2 | 42.7 | 0.9 | 62.5 | 1.584 | 161.9 | 24.8 | 36 |
Angelicin | 362.6 | 134.0 | 60.9 | 1.4 | 55.5 | 0.8 | 49.9 | 1.667 | 173.1 | 19.8 | 39 |
Bergapten | 412.4 | 158.0 | 66.5 | 1.4 | 52.0 | 1.0 | 56.6 | 1.635 | 203.2 | 22.4 | 49 |
Coumestrol | 406.0 | 167.4 | 68.3 | 1.6 | 79.9 | 1.0 | 69.4 | 1.768 | 199.3 | 27.5 | 80 |
Daphnetin | 430.4 | 114.0 | 71.2 | 1.6 | 75.6 | 1.1 | 43.5 | 1.689 | 184.5 | 17.3 | 67 |
Daphnin | 670.0 | 202.6 | 103.4 | 1.7 | 92.6 | 2.2 | 77.4 | 1.689 | 252.4 | 30.7 | 146 |
Dicumarol | 620.7 | 213.8 | 96.7 | 1.6 | 80.7 | 1.9 | 85.4 | 1.731 | 231.9 | 33.9 | 93 |
Esculetin | 469.7 | 114.0 | 76.0 | 1.6 | 75.6 | 1.2 | 43.5 | 1.689 | 201.5 | 17.3 | 67 |
Esculin | 697.7 | 202.6 | 107.3 | 1.7 | 92.6 | 2.3 | 77.4 | 1.689 | 262.8 | 30.7 | 146 |
Gravelliferone | 454.3 | 267.1 | 74.1 | 1.1 | 43.3 | 1.2 | 87.5 | 1.569 | 184.9 | 34.7 | 47 |
Herniarin | 335.3 | 141.1 | 57.8 | 1.2 | 44.4 | 0.7 | 46.4 | 1.572 | 138.6 | 18.4 | 36 |
Imperatorin | 448.3 | 217.5 | 70.7 | 1.2 | 46.7 | 1.1 | 75.0 | 1.606 | 224.9 | 29.7 | 49 |
Isobergapten | 412.4 | 158.0 | 66.5 | 1.4 | 52.0 | 1.0 | 56.6 | 1.635 | 203.2 | 22.4 | 49 |
Isopimpinellin | 448.7 | 182.0 | 70.7 | 1.4 | 49.6 | 1.1 | 63.3 | 1.612 | 225.1 | 25.1 | 58 |
Limettin | 388.1 | 165.1 | 63.7 | 1.2 | 43.0 | 0.9 | 53.1 | 1.557 | 176.3 | 21.1 | 45 |
Novobiocin | 876.2 | 431.0 | 133.4 | 1.4 | 70.6 | 0.0 | 155.3 | 1.640 | 483.7 | 61.6 | 196 |
Pimpinellin | 441.0 | 182.0 | 69.8 | 1.4 | 49.6 | 1.1 | 63.3 | 1.612 | 220.5 | 25.1 | 58 |
Psoralen | 362.6 | 134.0 | 60.9 | 1.4 | 55.5 | 0.8 | 49.9 | 1.667 | 173.1 | 19.8 | 39 |
Seselin | 403.0 | 186.7 | 65.4 | 1.2 | 42.7 | 0.9 | 62.5 | 1.584 | 170.5 | 24.8 | 36 |
Skimmin | 632.0 | 204.2 | 98.2 | 1.6 | 81.4 | 1.9 | 75.5 | 1.661 | 239.3 | 29.9 | 126 |
Umbelliferon | 382.1 | 115.5 | 65.5 | 1.4 | 59.5 | 0.9 | 41.6 | 1.640 | 181.2 | 16.5 | 47 |
Visnadin | 477.7 | 307.2 | 74.2 | 1.3 | 49.5 | 1.2 | 99.3 | 1.560 | 206.9 | 39.4 | 88 |
Warfarin | 515.2 | 235.8 | 82.9 | 1.3 | 58.7 | 1.4 | 84.4 | 1.635 | 188.8 | 33.5 | 64 |
Xanthotoxin | 414.8 | 158.0 | 66.8 | 1.4 | 52.0 | 1.0 | 56.6 | 1.635 | 204.7 | 22.4 | 49 |
Xanthyletin | 340.0 | 133.9 | 58.4 | 1.3 | 46.4 | 0.7 | 45.4 | 1.593 | 159.5 | 18.0 | 29 |
The
Descriptor | D | BP | VP | E | FP | IR | MR | PSA | P | ST | MV |
---|---|---|---|---|---|---|---|---|---|---|---|
| 0.004 | 0.648 | 0.000 | 0.638 | 0.707 | 0.003 | 0.967 | 0.594 | 0.967 | 0.079 | 0.909 |
| 0.016 | 0.670 | 0.000 | 0.662 | 0.712 | 0.014 | 0.950 | 0.631 | 0.950 | 0.112 | 0.870 |
| 0.024 | 0.678 | 0.001 | 0.671 | 0.711 | 0.022 | 0.937 | 0.646 | 0.937 | 0.129 | 0.847 |
| 0.033 | 0.674 | 0.002 | 0.666 | 0.703 | 0.032 | 0.919 | 0.652 | 0.919 | 0.142 | 0.821 |
| 0.026 | 0.668 | 0.001 | 0.660 | 0.709 | 0.024 | 0.931 | 0.640 | 0.931 | 0.127 | 0.840 |
The
Descriptor | D | BP | VP | E | FP | IR | MR | PSA | P | ST | MV |
---|---|---|---|---|---|---|---|---|---|---|---|
| 0.008 | 0.649 | 0.508 | 0.641 | 0.826 | 0.005 | 0.967 | 0.599 | 0.967 | 0.086 | 0.909 |
| 0.024 | 0.672 | 0.513 | 0.667 | 0.840 | 0.020 | 0.951 | 0.637 | 0.951 | 0.123 | 0.872 |
| 0.035 | 0.681 | 0.514 | 0.677 | 0.844 | 0.030 | 0.939 | 0.652 | 0.939 | 0.142 | 0.850 |
| 0.043 | 0.678 | 0.508 | 0.675 | 0.843 | 0.040 | 0.924 | 0.660 | 0.924 | 0.154 | 0.827 |
| 0.036 | 0.673 | 0.503 | 0.668 | 0.843 | 0.031 | 0.935 | 0.648 | 0.935 | 0.137 | 0.844 |
The
Descriptor | D | BP | VP | E | FP | IR | MR | PSA | P | ST | MV |
---|---|---|---|---|---|---|---|---|---|---|---|
| 0.050 | 0.649 | 0.537 | 0.642 | 0.849 | 0.035 | 0.968 | 0.604 | 0.967 | 0.151 | 0.910 |
| 0.134 | 0.675 | 0.572 | 0.672 | 0.854 | 0.108 | 0.952 | 0.661 | 0.952 | 0.274 | 0.875 |
| 0.172 | 0.686 | 0.588 | 0.686 | 0.857 | 0.143 | 0.941 | 0.686 | 0.941 | 0.329 | 0.856 |
| 0.209 | 0.682 | 0.589 | 0.682 | 0.858 | 0.184 | 0.928 | 0.696 | 0.928 | 0.368 | 0.840 |
| 0.175 | 0.677 | 0.579 | 0.675 | 0.857 | 0.151 | 0.936 | 0.681 | 0.936 | 0.328 | 0.851 |
Experimental values of both
Molecule | | | | | | |
---|---|---|---|---|---|---|
Benzene | 80.1 | 75.2 | 24 | 24 | 4096 | 4096 |
Naphthalene | 218 | 141 | 64 | 96 | 192,080,000 | 6,879,707,136 |
Phenanthrene | 338 | 202.7 | 106 | 184 | 7.46807 | 8.70713 |
Anthracene | 340 | 222.6 | 104 | 176 | 6.29408 | 7.2139 |
Chrysene | 431 | 271.1 | 148 | 273 | 2.86774 | 1.102 |
Benzo[a]anthracene | 425 | 277.1 | 146 | 265 | 2.4089 | 9.13009 |
Triphenylene | 429 | 275.1 | 150 | 288 | 2.62144 | 8.30377 |
Tetracene | 440 | 310.5 | 144 | 256 | 2.06244 | 7.56432 |
Benzo[a]pyrene | 496 | 296 | 184 | 376 | 5.35488 | 5.14147 |
Benzo[e]pyrene | 493 | 289.9 | 182 | 363 | 5.15237 | 5.39122 |
Perylene | 497 | 319.2 | 184 | 374 | 5.62449 | 5.14147 |
Anthanthrene | 547 | 323 | 104 | 176 | 6.29408 | 7.2139 |
Benzo[ghi]perylene | 542 | 301.2 | 218 | 466 | 1.04142 | 3.18346 |
Dibenzo[a,c]anthracene | 535 | 348 | 190 | 370 | 8.3236 | 8.70713 |
Dibenzo[a,h]anthracene | 535 | 335 | 188 | 354 | 9.21947 | 1.15553 |
Dibenzo[a,j]anthracene | 531 | 336.3 | 188 | 354 | 9.21947 | 1.15553 |
Picene | 519 | 336.9 | 190 | 362 | 1.10121 | 1.39471 |
Coronene | 590 | 296.7 | 252 | 558 | 1.92829 | 1.97112 |
Dibenzo(a,h)pyrene | 596 | 375.6 | 224 | 456 | 1.70556 | 5.39122 |
Dibenzo(a,i)pyrene | 594 | 366 | 224 | 456 | 1.70556 | 5.39122 |
Dibenzo(a,l)pyrene | 595 | 393.3 | 226 | 473 | 1.62857 | 4.54885 |
Pyrene | 393 | 221.3 | 140 | 270 | 1.5565 | 5.39122 |
MLCs of connection-related graphical invariants with
Connection-Based Index | |
---|---|
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
|
References
1. Katritzky, A.R.; Petrukhin, R.; Tatham, D.; Basak, S.; Benfenati, E.; Karelson, M.; Maran, M. Interpretation of quantitative structure- property and- activity relationships. J. Chem. Inf. Comput. Sci.; 2001; 41, pp. 679-685. [DOI: https://dx.doi.org/10.1021/ci000134w]
2. Basak, S.C.; Mills, D. Quantitative structure- property relationships (QSPRS) for the estimation of vapor pressure: A hierarchical approach using mathematical structural descriptors. J. Chem. Inf. Comput. Sci.; 2001; 41, pp. 692-701. [DOI: https://dx.doi.org/10.1021/ci000165r]
3. Wiener, H. Structural determination of the paraffin boiling points. J. Am. Chem. Soc.; 1947; 69, pp. 17-20. [DOI: https://dx.doi.org/10.1021/ja01193a005] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/20291038]
4. Gutman, I.; Furtula, B. Novel Molecular Structure Descriptors—Theory and Applications; University of Kragujevac: Kragujevac, Serbia, 2010; Volume 1–2.
5. Balaban, A.T.; Motoc, I.; Bonchev, D.; Mekenyan, O. Topological indices for structure-activity corrections. Top. Curr. Chem.; 1983; 114, pp. 21-55.
6. Gutman, I.; Polansky, O.E. Mathematical Concepts in Organic Chemistry; Springer: New York, NY, USA, 1986.
7. Allison, T.C.; Burgess Jr, D.R. First-principles prediction of enthalpies of formation for polycyclic aromatic hydrocarbons and derivatives. J. Phys. Chem. A; 2015; 119, pp. 11329-11365. [DOI: https://dx.doi.org/10.1021/acs.jpca.5b07908]
8. Diudea, M.V.; Gutman, I.; Lorentz, J. Molecular Topology; Nova, Huntington: Butler, PA, USA, 2001.
9. Cai, D.; Fan, P.; Zou, Q.; Xu, Y.; Ding, Z.; Liu, Z. Active device detection and performance analysis of massive non-orthogonal transmissions in cellular Internet of Things. Sci. China Inf. Sci.; 2022; 65, 182301. [DOI: https://dx.doi.org/10.1007/s11432-021-3328-y]
10. Guo, Y.; Zhao, R.; Lai, S.; Fan, L.; Lei, X.; Karagiannidis, G.K. Distributed machine learning for multiuser mobile edge computing systems. IEEE J. Sel. Top. Signal Process.; 2022; 16, pp. 460-473. [DOI: https://dx.doi.org/10.1109/JSTSP.2022.3140660]
11. Tan, C.; Cai, D.; Fang, F.; Ding, Z.; Fan, P. Federated unfolding learning for CSI feedback in distributed edge networks. IEEE Trans. Commun.; 2025; 73, pp. 410-424. [DOI: https://dx.doi.org/10.1109/TCOMM.2024.3429170]
12. Zheng, S.; Shen, C.; Chen, X. Design and analysis of uplink and downlink communications for federated learning. IEEE J. Sel. Areas Commun.; 2021; 39, pp. 2150-2167. [DOI: https://dx.doi.org/10.1109/JSAC.2020.3041388]
13. Gutman, I. Degree-based topological indices. Croat. Chem. Acta; 2013; 86, pp. 351-361. [DOI: https://dx.doi.org/10.5562/cca2294]
14. Hayat, S.; Imran, M. On topological properties of nanocones CNCk[n]. Stud. UBB Chem.; 2014; 59, pp. 113-128.
15. Xu, K.; Liu, M.; Das, K.C.; Gutman, I.; Furtula, B. A survey on graphs extremal with respect to distance-based topological indices. MATCH Commun. Math. Comput. Chem.; 2014; 71, pp. 461-508.
16. Consonni, V.; Todeschini, R. New spectral indices for molecular description. MATCH Commun. Math. Comput. Chem.; 2008; 60, pp. 3-14.
17. Hosoya, H. On some counting polynomials in chemistry. Discret. Appl. Math.; 1988; 19, pp. 239-257. [DOI: https://dx.doi.org/10.1016/0166-218X(88)90017-0]
18. Todeschini, R.; Consonni, V. Molecular Descriptors for Chemoinformatics; Wiley-VCH: Weinheim, Germany, 2009; Volume 1–2.
19. Gutman, I.; Tošović, J. Testing the quality of molecular structure descriptors. Vertex-degree-based topological indices. J. Serb. Chem. Soc.; 2013; 78, pp. 805-810. [DOI: https://dx.doi.org/10.2298/JSC121002134G]
20. Malik, M.Y.H.; Binyamin, M.A.; Hayat, S. Correlation ability of degree-based topological indices for physicochemical properties of polycyclic aromatic hydrocarbons with applications. Polycycl. Aromat. Compd.; 2022; 42, pp. 6267-6281. [DOI: https://dx.doi.org/10.1080/10406638.2021.1977349]
21. Hayat, S.; Khan, S.; Khan, A.; Liu, J.-B. Valency-based molecular descriptors for measuring the π-electronic energy of lower polycyclic aromatic hydrocarbons. Polycycl. Aromat. Compd.; 2022; 42, pp. 1113-1129. [DOI: https://dx.doi.org/10.1080/10406638.2020.1768414]
22. Hayat, S.; Khan, S.; Imran, M.; Liu, J.-B. Quality testing of distance-based molecular descriptors for benzenoid hydrocarbons. J. Mol. Struct.; 2020; 1222, pp. 128927-128935. [DOI: https://dx.doi.org/10.1016/j.molstruc.2020.128927]
23. Hayat, S.; Khan, S.; Khan, A.; Imran, M. Distance-based topological descriptors for measuring the π-electronic energy of benzenoid hydrocarbons with applications to carbon nanotubes. Math. Methods Appl. Sci.; 2020; early view [DOI: https://dx.doi.org/10.1002/mma.6668]
24. Hayat, S.; Khan, S.; Khan, A.; Imran, M. A computer-based method to determine predictive potential of distance-spectral descriptors for measuring the π-electronic energy of benzenoid hydrocarbons with applications. IEEE Access; 2021; 9, pp. 19238-19253. [DOI: https://dx.doi.org/10.1109/ACCESS.2021.3053270]
25. Bruneton, J. Immunotoxicity of Epicutaneously Applied Anti-Coagulant Rodenticide Warfarin; Intercept Ltd.: Hampshire, UK, 1999; pp. 245-263.
26. Lacy, A.; O’kennedy, R. Studies on coumarins and coumarin-related compounds to determine their therapeutic role in the treatment of cancer. Curr. Pharm. Des.; 2004; 10, pp. 3797-3811. [DOI: https://dx.doi.org/10.2174/1381612043382693] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/15579072]
27. Murray, R.D.H.; Mendez, J.; Brown, S.A. The Natural Coumarins Occurrence, Chemistry and Biochemistry; John Wiley and Sons Ltd.: New York, NY, USA, Chichester, UK, 1982.
28. Lake, B. Synthesis & pharmacological investigation of 4-hydroxy coumarin derivatives & shown as anti-coagulant. Food Chem. Tox.; 1999; 3, pp. 412-423.
29. Cooke, D.; Fitzpatrick, B.; O’Kennedy, R.; McCormack, T.; Egan, D. Coumarin: Biochemical Profile and Recent Developments; John Wiley & Sons: New York, NY, USA, 1997; Volume 3, pp. 311-322.
30. Fylaktakidou, K.C.; Hadipavlou-Litina, D.J.; Litinas, K.E.; Nicolaides, D.N. Natural and synthetic coumarin derivatives with anti-inflammatory/antioxidant activities. Curr. Pharm. Des.; 2004; 10, pp. 3813-3833. [DOI: https://dx.doi.org/10.2174/1381612043382710] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/15579073]
31. Harvey, R.G.; Cortex, C.; Ananthanarayan, T.P.; Schmolka, S. A new coumarin synthesis and its utilization for the synthesis of polycyclic coumarin compounds with anticarcinogenic properties. J. Org. Chem.; 1988; 53, pp. 3936-3943. [DOI: https://dx.doi.org/10.1021/jo00252a011]
32. Al-Haiza, M.A.; Mostafa, M.S.; El-Kady, M.Y. Synthesis and biological evaluation of some new coumarin derivatives. Molecules; 2003; 8, pp. 275-286.
33. Tosun, A.; Kupeli, E.; Yesilada, E. Anti-inflammatory and antinociceptive activity of coumarins from seseli gummiferum subsp. corymbosum (Apiaceae). Z. Nat. C; 2009; 64, pp. 56-62. [DOI: https://dx.doi.org/10.1515/znc-2009-1-210] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/19323267]
34. Hoult, J.R.; Paya, M. Pharmacological and biochemical actions of simple coumarins: Natural products with therapeutic potential. Gen. Pharmacol.; 1996; 27, pp. 713-722. [DOI: https://dx.doi.org/10.1016/0306-3623(95)02112-4]
35. Madhavan, G.R.; Balraju, V.; Malleshasm, B.; Chakrabarti, R.; Lohray, V.B. Novel coumarin derivatives of heterocyclic compounds as lipid-lowering agents. Bioorg. Med. Chem. Lett.; 2003; 13, pp. 2547-2551. [DOI: https://dx.doi.org/10.1016/S0960-894X(03)00490-6] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/12852963]
36. Moffet, R.S. Central nervous system depressants. VII. Pyridyl coumarins. J. Med. Chem.; 1964; 7, pp. 446-449. [DOI: https://dx.doi.org/10.1021/jm00334a010] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/14221122]
37. Paya, M.; Halliwell, B.; Hoult, J.R. Interactions of a series of coumarins with reactive oxygen species, Scavenging of superoxide, hypochlorous acid and hydroxyl radicals. Biochem. Pharmacol.; 1992; 44, pp. 205-214. [DOI: https://dx.doi.org/10.1016/0006-2952(92)90002-Z] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/1322662]
38. Murray, R.D. The naturally occurring coumarins. Fortschr. Chem. Org. Naturst.; 2002; 83, 1.
39. Fentem, J.H.; Fry, J.R. Species differences in the metabolism and hepatotoxicity of coumarin. Comp. Biochem. Physiol. C; 1993; 104, pp. 1-8. [DOI: https://dx.doi.org/10.1016/0742-8413(93)90102-Q]
40. Timmanaikar, S.T.; Hayat, S.; Hosamani, S.M.; Banu, S. Structure-property modeling of coumarins and coumarin-related compounds in pharmacotherapy of cancer by employing graphical topological indices. Eur. Phys. J. E; 2024; 47, 31. [DOI: https://dx.doi.org/10.1140/epje/s10189-024-00427-6] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/38735010]
41. Javaid, M.; Ali, U.; Siddiqui, K. Novel connection based Zagreb indices of several wheel-related graphs. Comput. J. Combinat. Math.; 2021; 1, pp. 1-28.
42. Küpeli, A.E.; Genç, Y.; Karpuz, B.; Sobarzo-Sánchez, E.; Capasso, R. Coumarins and coumarin-related compounds in pharmacotherapy of cancer. Cancers; 2020; 12, 1959. [DOI: https://dx.doi.org/10.3390/cancers12071959] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/32707666]
43. Basanagouda, M.; Jambagi, V.B.; Barigidad, N.N.; Laxmeshwar, S.S.; Devaru, V. Synthesis, structure–activity relationship of iodinated-4-aryloxymethyl-coumarins as potential anti-cancer and anti-mycobacterial agents. Eur. J. Med. Chem.; 2014; 74, pp. 225-233. [DOI: https://dx.doi.org/10.1016/j.ejmech.2013.12.061]
44. Makandar, S.B.N.; Basanagouda, M.; Kulkarni, M.V.; Pranesha,; Rasal, V.P. Synthesis, antimicrobial and DNA cleavage studies of some 4-aryloxymethylcoumarins obtained by reaction of 4-bromomethylcoumarins with bidentate nucleophiles. Med. Chem. Res.; 2012; 21, pp. 2603-2614. [DOI: https://dx.doi.org/10.1007/s00044-011-9785-z]
45. Estrada, E.; Uriarte, E. Recent advances on the role of topological indices in drug discovery research. Curr. Med. Chem.; 2001; 8, pp. 1573-1588. [DOI: https://dx.doi.org/10.2174/0929867013371923] [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/11562286]
46. Ali, A.; Trinajstić, N. A novel/old modification of the first Zagreb index. Mol. Inform.; 2018; 37, 1800008. [DOI: https://dx.doi.org/10.1002/minf.201800008]
47. Ali, U.; Javaid, M.; Kashif, A. Modified Zagreb connection indices of the T-sum graphs. Main Group Met. Chem.; 2020; 43, pp. 43-55. [DOI: https://dx.doi.org/10.1515/mgmc-2020-0005]
48. Ducoffe, G.; Marinescu-Ghemeci, R.; Obreja, C.; Popa, A.; Tache, R.M. Extremal graphs with respect to the modified first Zagreb connection index. Proceedings of the 2018 20th International Symposium on Symbolic and Numeric Algorithms for Scientific Computing (SYNASC); Timisoara, Romania, 20–23 September 2018; pp. 141-148.
49. Tang, J.H.; Ali, U.; Javaid, M.; Shabbir, K. Zagreb connection indices of subdivision and semi-total point operations on graphs. J. Chem.; 2019; 2019, 9846913. [DOI: https://dx.doi.org/10.1155/2019/9846913]
50. Cao, J.; Ali, U.; Javaid, M.; Huang, C. Zagreb connection indices of molecular graphs based on operations. Complexity; 2020; 2020, 7385682. [DOI: https://dx.doi.org/10.1155/2020/7385682]
51. Randić, M. On characterization of molecular branching. J. Am. Chem. Soc.; 1975; 97, pp. 6609-6615. [DOI: https://dx.doi.org/10.1021/ja00856a001]
52. Amić, D.; Bešlo, D.; Lucić, B.; Nikolić, S.; Trinajstić, N. The vertex-connectivity index revisited. J. Chem. Inf. Comput. Sci.; 1998; 38, pp. 819-822. [DOI: https://dx.doi.org/10.1021/ci980039b]
53. Zhou, B.; Trinajstić, N. On a novel connectivity index. J. Math. Chem.; 2009; 46, pp. 1252-1270. [DOI: https://dx.doi.org/10.1007/s10910-008-9515-z]
54. Zhou, B.; Trinajstić, N. On general sum-connectivity index. J. Math. Chem.; 2010; 47, pp. 210-218. [DOI: https://dx.doi.org/10.1007/s10910-009-9542-4]
55. Estrada, E.; Torres, L.; Rodríguez, L.; Gutman, I. An atom-bond connectivity index: Modelling the enthalpy of formation of alkanes. Indian J. Chem.; 1998; 37A, pp. 849-855.
56. Furtula, B.; Graovac, A.; Vukičević, D. Augmented Zagreb index. J. Math. Chem.; 2010; 48, pp. 370-380. [DOI: https://dx.doi.org/10.1007/s10910-010-9677-3]
57. Vukičević, D.; Furtula, B. Topological index based on the ratios of geometrical and arithmetical means of end-vertex degrees of edges. J. Math. Chem.; 2009; 46, pp. 1369-1376. [DOI: https://dx.doi.org/10.1007/s10910-009-9520-x]
58. Shegehalli, V.S.; Kanabur, R. Arithmetic-geometric indices of path graph. J. Math. Comput. Sci.; 2015; 16, pp. 19-24.
59. Favaron, O.; Mahéo, M.; Saclé, J.F. Some eigenvalue properties of graphs (conjectures of Grafitti-II). Discret. Math.; 1993; 111, pp. 197-220. [DOI: https://dx.doi.org/10.1016/0012-365X(93)90156-N]
60. Manso, F.C.; Júnior, H.S.; Bruns, R.E.; Rubira, A.F.; Muniz, E.C. Development of a new topological index for the prediction of normal boiling point temperatures of hydrocarbons: The Fi index. J. Mol. Liq.; 2012; 165, pp. 125-132. [DOI: https://dx.doi.org/10.1016/j.molliq.2011.10.019]
61. Gutman, I. Geometric approach to degree-based topological indices: Sombor indices. MATCH Commun. Math. Comput. Chem.; 2021; 86, pp. 11-16.
62. HyperChem Package Release 7.5 for Windows; Hypercube Inc.: Gainesville, FL, USA, 2002.
63. MATLAB 8.0 and Statistics Toolbox 8.1; The MathWorks, Inc.: Natick, MA, USA, 2023.
64. Diudea, M.V.; Ursu, O.; Nagy, C.L. Topocluj; Babes-Bolyai University: Cluj, Romania, 2002.
65. Hayat, S.; Khan, S. Quality testing of spectrum-based valency descriptors for polycyclic aromatic hydrocarbons with applications. J. Mol. Struc.; 2021; 1228, 129789. [DOI: https://dx.doi.org/10.1016/j.molstruc.2020.129789]
66. Lučić, B.; Trinajstić, N.; Zhou, B. Comparison between the sum-connectivity index and product- connectivity index for benzenoid hydrocarbons. Chem. Phys. Lett.; 2009; 475, pp. 146-148. [DOI: https://dx.doi.org/10.1016/j.cplett.2009.05.022]
67. NIST Standard Reference Database. Available online: http://webbook.nist.gov/chemistry/ (accessed on 15 February 2025).
68. Dias, J.R. Handbook of Polycyclic Hydrocarbons. Part A: Benzenoid Hydrocarbons; Elsevier: Amsterdam, The Netherlands, 1987.
69. Nikolić, S.; Trinajstić, N.; Baučić, I. Comparison between the vertex- and edge–connectivity indices for benzenoid hydrocarbons. J. Chem. Inf. Comput. Sci.; 1998; 38, pp. 42-46. [DOI: https://dx.doi.org/10.1021/ci970031m]
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Abstract
Cheminformatics bridges chemistry, computer science, and information technology to predict chemical behaviors using quantitative structure–property relationships (QSPRs). This study advances QSPR modeling by introducing novel connection-based graphical invariants, specifically designed to enhance the predictive accuracy for physicochemical properties (PCPs) of benzenoid hydrocarbons (BHs). Employing cutting-edge computational methods, we evaluate these invariants against established descriptors in modeling the normal boiling point and standard heat of formation. The findings reveal superior predictive performance by newly proposed invariants, such as the sum-connectivity connection index, outperforming traditional indices like the Zagreb connection indices. Furthermore, we extend these methods to model the physicochemical properties of coumarin-related anti-cancer drugs, demonstrating their potential in drug development. The statistical analysis suggests that the most appropriate structure–property models are nonlinear. This work not only proposes robust tools for PCP estimation but also advocates for rigorous testing of descriptors to ensure relevance in cheminformatics.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
Details


1 Faculty of Science, Universiti Brunei Darussalam, Jln Tungku Link, Gadong BE1410, Brunei
2 Department of Mathematics, Science Faculty, King Abdulaziz University, Jeddah 21589, Saudi Arabia;