1. Introduction
Naturally occurring nanoscale particles (nanoparticles, NPs) have existed throughout Earth’s and subsequently human history. They can be produced from biological, anthropogenic and geological phenomena, such as erosion, volcanic eruptions and forest fires, charcoal burning, industrial operations and more [1]. In recent years, the increased production of engineered NPs, with tuneable physical, chemical and biological properties, has led to them being widely used, among others, in the automotive, electronics, optics, food technology, cosmetics and healthcare industries [1,2]. Metal oxide (MexOy) engineered NPs are among the highest produced per volume and are used widely in technological applications like gas sensors, photovoltaics, adsorbents, catalysis and fuel cells [3,4], due to their unique superparamagnetic, piezoelectric, optical, etc. properties [5,6,7,8,9]. MexOy NPs are also being used in various consumer products, such as food, cosmetics (sunscreen) and electronic and medical devices [10].
Due to their small size and high biochemical activity, and despite their useful and beneficial properties, there are concerns that NPs are able to cross biological barriers and access a wide number of organs and tissues in the human body, including (to a limited extent) the blood–brain barrier [11], which can lead to toxic side-effects [11,12,13,14,15,16,17,18,19]. As a result, the hazard and risk assessment of NPs is key to ensure their safe use for humans and the environment. So far, in vivo studies have been the main source of information regarding NP effects on the physiology of biological organisms [20,21,22]. Furthermore, the requirements for a reduction in animal testing as per the 3Rs (Replace, Reduce, Refine) principles [23] and European Commission (EC) legislation [24] have led to a push for development of alternative testing strategies (ATS). These, so far, have mostly translated to in vitro models, followed by in silico modelling [25,26]. According to a study by Törnqvist et al. (2014), data from 36 relevant projects demonstrated major (i.e., 53%) reductions in the use of animals in pharmaceutical development [27]. Taking into account the cost and working hours associated with in vivo and in vitro experiments and that traditional “wet-lab” toxicology cannot keep up with diversity and increasing abundance of engineered NPs, computational modelling has the potential to act as a high-throughput alternative [28,29] and is becoming increasingly accepted in regulatory testing as model validation and documentation improves.
Several studies have been published assessing chemical and NP toxicity using computational methods [30,31,32,33,34,35,36,37,38,39,40,41,42,43,44,45], the majority of which are based on machine learning techniques and the development of Quantitative Nanostructure-Activity relationships (QNARs) [30,31,36,37,38,41,42,44,45]. The goal of these studies was to use computational methods to understand the mode of action (MOA) of the NPs, and the NP characteristics driving toxicity. Furthermore, it was possible to elucidate the relevant Adverse Outcome Pathways (AOPs), as part of integrated approaches to testing and assessment (IATA). Previous findings have demonstrated that MexOy NPs toxicity can, e.g., originate from dissolution and metal ion “shedding” from the NP shell area [40]. In some cases, the experimental datasets used were complemented with computational descriptors, which were based on atomistic approaches [40,41,45], Simplified Molecular-Input Line-Entry System (SMILES) codes [42,43,44,46,47] and calculation of descriptors using image analysis [48,49]. However, promising, computational studies present certain limitations when it comes to the calculation of atomistic descriptors. Taking into account that NPs with a diameter of 5 nm contain approximately 20,000 atoms, this means that a 30 nm NP will contain over 106 atoms [50]. Therefore, with the current computational power available, calculation of full-particle descriptors for nanoscale particles using ab initio or semiempirical methods is impossible in the near future. Thus, workarounds including approximations and simplified NP models are used [40,41,42,43,51]. Early approaches employed the design of molecular descriptors using SMILES structures and took into account the NPs’ chemical composition and could also include information about the experimental conditions, such as the dispersion medium [42,43,51]. Later studies were based on the quantum-chemical calculation of small atomic clusters [41,52], from which quantum-chemical descriptors such as the HOMO-LUMO gap (also called band gap) and enthalpy of formation, which is currently not possible for full-sized NPs, can be calculated. These descriptors can be directly used to model the toxic properties of NPs without taking into account the size dependency of the descriptors [41], or can be extrapolated to find descriptor values for specific NP sizes [52]. More recent studies [40,53,54] employed molecular mechanics to calculate full descriptors and energetics (such as potential energies and coordination numbers) to model MexOy NPs. Further approaches, implemented NPs surface modification characteristics (e.g., coating polymers and proteins absorption) [55,56,57] to describe more complex systems like competitive protein absorption and NPs exposure to biological systems, thus creating more complete QNAR models.
In this work, we present a meta-analysis of a dataset by Zhang et al. (2012) [45] retrieved from the S2NANO (
2. Materials and Methods
2.1. Toxicological Data from Metal Oxide NPs
The dataset used for descriptor calculation and model development was retrieved from the S2NANO (
The cytotoxicity experiments carried out by Zhang et. al. (2012) [45] demonstrated the possibility to use Ec levels to delineate the toxicological potential of MexOy NPs at the cellular and whole animal level. In vitro toxicological analyses were carried out both in single- and multiparameter toxicity assays. Single-parameter ATP and LDH assays in human bronchial epithelial (BEAS-2B) and murine myeloid (RAW 264.7) cell lines were included as they are commonly used to assess engineered NPs cytotoxicity without reference to a specific MOA [61,62]. Results from these single-parameter toxicity analyses were compared with results from an in-house multiparameter high-throughput screening assay containing 5 parameters measuring oxidative stress [45]. There was strong correlation between the multiparameter and single-parameter responses confirming that the same 7 NPs are potentially more hazardous in general toxicity assays, as well as during comparative analysis of their oxidative stress effects in the multiparameter assays. These findings were further confirmed in an in vivo mouse model, where generation of acute neutrophilic inflammation and cytokine responses in the lungs of C57 BL/6 mice was measured.
In the current study, we used a subset of the data for modelling from the aforementioned single-parameter ATP and LDH assays carried out in BEAS-2B and RAW 264.7 cell lines. The type of assay (ATP or LDH) was included as an extra parameter, since significant correlation was identified in various instances [60,63,64,65] between the type of assay and cytotoxicity results. In total, 15 descriptors (independent variables) originating from Zhang et al. (2012) were included in the analysis: 6 physicochemical (chemical formula, core size, specific surface area, total surface area, hydrodynamic size and ζ-potential), 6 molecular (EV, EC, Eg, χcation, χoxide and EΔH) and 3 assay-related (assay type, cell species and NP exposure dose) descriptors. The biological endpoint (dependent variable) was % cell viability 24 h postexposure.
2.2. Dataset Enrichment with Computational Descriptors
The dataset resulting from the above analysis was further enriched with 62 computational descriptors for each NP, which are directly related with its stability. We have previously reported a methodology to calculate a set of full particle nanodescriptors, based on the atomic structure of NPs, which requires as input the crystal structure of the respective bulk materials [40,53]. The NP structures were derived from the most thermodynamically stable crystal structures of the respective bulk metal oxides. The unit cells of the metal oxides were replicated in all three dimensions using in-house developed Python scripts and the molecular dynamics software LAMMPS. Full details of the simulation approach are provided in the Modelling Data (MODA) reporting template provided in the electronic Supplementary Information (ESI S2). The resulting spherical NPs were generated by deleting all atoms outside of the set radius of the produced NPs, while performing energy minimisation (using the Lennard-Jones parameters [53]) and maintaining the electroneutrality of the final NPs. Subsequently, the NPs were subjected to energy minimisation using the Polak–Ribiere version of the conjugate gradient (CG) algorithm [66].
Potential energies of atoms were calculated based on the Buckingham [67] and Coulomb potentials using the force field presented in Ref. [68]. Coulombic interactions were calculated using the Wolf summation [68], which is much more computationally affordable than the standard Ewald summation [69]. Cut-off radii for the Wolf summation were derived by matching energies of infinite crystals with small clusters of unit cells (2 × 2 × 2 unit cells). The calculations were performed under periodic boundary conditions in all three cartesian directions employing the LAMMPS software [70]. The length of the simulation box in each direction was much larger than the NP diameter and the cut-off values of the Buckingham potentials and Wolf summation. Thus, it was ensured that there are no interactions of the NP atoms with their periodic images. The derivation of nanodescriptors was based on the core and shell models for spherical NPs. The shell region refers to the atoms located within a depth of 1 nm from the surface of the NP, with the rest forming the NP’s core. This model allows the construction of a number of nanodescriptors that quantify the special features of the surface atoms based on different parameters such as potential energies and coordination numbers.
In total, 62 descriptors were derived from chemical composition (9 descriptors), potential energy (9 descriptors), topology (9 descriptors), lattice energy (5 descriptors), size (3 descriptors) and force vectors (27 descriptors). The atomistic descriptors calculations were performed for the entire atom, the MexOy core and the MexOy shell and can be divided into 3 categories: (1) related to all metal and oxygen atoms contained in the MexOy, (2) related to all metal atoms contained into the MexOy and (3) all oxygen atoms contained into the MexOy. In all three cases, the computed descriptors included the number of atoms present, the MexOy size, volume and surface area, the average potential energies of the atoms, the lattice energies, force-related descriptors applied on the atoms and the average coordination parameters of the MexOy atoms.
Together with the 15 descriptors from the original dataset, the final dataset used for modelling included a total of 77 descriptors (independent variables), which were correlated with cell viability (dependent variable). The list with all descriptors can be found in Table S1 of the supplementary information file.
2.3. NanoPharos Database and Data Management
The complete dataset was cleaned, structured and uploaded into the NanoPharos database (
The NanoPharos database is designed to enable further development and use for additional relevant purposes. This is achieved by adding appropriate data structures and more interfaces in a modular fashion. The design of this relational database is performed in a standard fashion to allow its expansion and incorporation of additional datasets of varying composition. NanoPharos database offers the possibility for submitted datasets to be enriched automatically with relevant bibliographic, molecular (e.g., crystal structure, electronegativity) and computational descriptors.
The full enriched dataset used for model development can be accessed through:
2.4. Model Development, Validation, Read Across and Domain of Applicability
The Isalos Analytics Platform, powered with the Enalos+ nodes [71], was used for the development and validation of the produced cytotoxicity QNAR model [72,73,74]. To decrease the risk of low-variance data distracting the modelling algorithm, all double-compatible descriptors, which had low variance and did not significantly contribute to the discrimination power of the model, were removed using a low-variance filter [75]. The removal criterion was set to 0.2, meaning that a descriptor was excluded from the analysis if it contained 20% or more values equal to those of another descriptor. All remaining descriptors were normalised using Z-score normalisation to follow a Gaussian distribution with a mean value of 0.0 and a standard deviation of 1.0 [76]. The dataset was then randomly portioned into training and test sets using a ratio of 70%:30%, respectively. The descriptors with the highest significant contribution were identified using the Correlation-based Feature Selection (CfsSubset) algorithm combined with the BestFirst evaluator (see Section S1 in the Experimental Supplementary Information for a short introduction of the two algorithms) [77,78,79].
The Enalos implementation of the k-nearest neighbours (EnaloskNN, Enalos Chem/Nanoinformatics tools) methodology was applied to the dataset to produce the cytotoxicity (% cell viability) predictive model. kNN is an instance-based (lazy) method that predicts the dependent variable based on the distance of the k (k = 1, 2, 3, …) nearest neighbours, in the features space Rn, where n is the total number of descriptors used for the prediction. In our case, the EnaloskNN was used in regression mode and cytotoxicity prediction was based on the Euclidian distances (similarity measure) of the target variable from its k closest neighbours [78]. In the case of nominal descriptors, the EnaloskNN node sets the Euclidian distance to 0 if individual values are the same and 1 otherwise [80].
The kNN algorithm can be used according to ECHA’s read across framework [81] for NPs as long as the following criteria are fulfilled:
Gathering of the required descriptors (physicochemical, molecular and atomistic) for each NP.
Construction of a data matrix including properties and endpoints.
Development of an initial grouping hypothesis that correlates an endpoint, to different behaviour and reactivity properties. Assignment of the samples to groups.
Assessment of the applicability of the approach using computational techniques and data gap filling. If no regular pattern emerged, an alternative grouping hypothesis must be proposed.
If the grouping hypothesis is robust, but adequate data are not available, additional testing should be considered.
Justification of the method.
The Euclidean distance can, thus, be used as a metric to identify the dependent’s variable neighbours and predict NP cytotoxicity. By identifying groups of neighbours, it is also possible to divide the entire dataspace into subgroups as per ECHA’s read across framework requirements. The EnaloskNN node was built to provide the specific neighbours and Euclidian distances along with the respective predictive results.
The produced model was validated and documented following the OECD’s principles for the validation of predictive models for regulatory purposes [82]. Internal and external validation took place using the goodness-of-fit, robustness and predictivity metrics [83,84]. Statistical evaluation of the model’s performance took place using Trophsa’s tests, i.e., the coefficient of determination between experimental values and model predictions (R2), validation through an external test set, leave-many-out cross validation procedure and Quality of Fit and Predictive Ability of a continuous predictive Model [85]. To perform the evaluation, the Enalos Model Acceptability Criteria were used, where the following equations to calculate Tropsha’s tests were implemented:
(1)
(2)
(3)
where ntest is the number of NPs in the test set, is the average cytotoxicity for the training set; , , i = 1, 2, …, ntest are the experimentally measured and the predicted cytotoxicity values for the validation set, respectively and is the average predicted cytotoxicity over all of the predictions , i = 1, 2, …, ntest.Furthermore, according to Tropsha et al. [85], a QSAR model is considered predictive if all of the below conditions are satisfied:
(4)
(5)
(6)
(7)
To ensure that the produced model was not a result of chance correlation and confirm its statistical significance and robustness, we performed Y-randomisation [86]. Ten different datasets were produced following random shuffling of the cytotoxicity predictions and using all original descriptors. The calculations were then repeated several times, and the model acceptability criteria, as described above, were recalculated. For the model to be valid, the recalculated criteria were expected to reduce when compared to the original model.
Finally, to ensure the applicability of the produced model to external datasets and to launch the webservice, the reliability limits, i.e., the domain of applicability (APD), for future predictions was identified. Any predictions made outside the defined limits will be flagged as unreliable [72]. Using the Euclidian distances of all NPs in the training set, the APD can be calculated using:
APD = <d> + Zσ,(8)
where <d> and σ are the average and standard deviation of all Euclidian distances in the training set, respectively. Z is an empirical cut-off value, which was set to 0.5 [83].To fully demonstrate that the produced cytotoxicity model meets the OECD criteria, as listed above, we have included in S3 of the ESI a completed QSAR Model Reporting Format (QMRF) template.
3. Results and Discussion
The goal of this study is to test whether NP cytotoxicity can be predicted using a combination of physicochemical, molecular and whole NP computational descriptors. The dataset used, by Zhang et al. (2012) [45], contained 24 different MexOy NPs (17 commercial and 7 synthesised in house) along with their physicochemical and structural characterisation (core size, specific surface area, total surface area, hydrodynamic size, ζ-potential, point of zero ζ-potential, metal dissolution and crystal structure) and enriched with a number of molecular descriptors (EV, EC, Eg, χcation, χoxide and EΔH). The dataset was enriched further with 62 computational descriptors for each NP, which were derived from chemical composition (9 descriptors), potential energy (9 descriptors), topology (9 descriptors), lattice energy (5 descriptors), size (3 descriptors) and force vectors (27 descriptors). The final dataset included a total of 77 descriptors (independent variables), which were studied in terms of their correlation to % cell viability (dependent variable) of BEAS-2B and RAW 264.7 cell lines exposed to the NPs for 24 h. One of the descriptors used during model development was the assay type (ATP or LDH) used to measure cell viability. The reason for including the assay type was its statistical significance on the produced results, as demonstrated in previous meta-analysis studies [60,63,64,65]. As a result, extra care needs to be taken when dataset combination takes place regarding their interoperability to flag sources of potential variability. This also emphasises the need for sufficient metadata implementation [87] with published datasets to increase their FAIRness score and thus reusability [88].
The produced predictive model was developed following a random division of the dataset into training and test sets (70%:30% respectively). Out of the 77 descriptors used (for a full list see Table S1 in the Supplementary Information) in the dataset, 9 double-compatible (see Table S1 in ESI for list) were removed for having low variance, which risked distracting the used kNN algorithm. Following Z-score normalisation of the training set, the retrieved normalisation parameters were used to normalise the test set. The CfsSubset algorithm combined with the BestFirst evaluator [77,78] were then used to identify the descriptors contributing the most to dataset variability (which correlates with model predictivity). From the 61 remaining descriptors 7 were identified as the most significant. These were: NP core size, NP hydrodynamic size, type of assay, NP exposure dose, conduction band energy (Ec), average coordination number of metal atoms in the surface region of the NP (Avg. C.N. Me atoms surface) and average length of surface normal component of force vector of atoms in the surface region of the NP (v⊥ Me atoms surface).
As can be derived from the above, a good balance exists between the physicochemical, assay-related and molecular/atomic descriptors, i.e., 2:2:3, respectively, as drivers of the NP cytotoxicity. More specifically, the physicochemical descriptors are both size related and combine the size of the pristine NP core (as measured with electron microscopy) with the NP’s behaviour within a specific medium (hydrodynamic diameter). NP core size was linked with cytotoxicity in the past [89,90,91,92,93,94] with decreasing size related to higher toxicity. This was usually in conjunction with the assay parameter of exposure dose, since on a constant mass basis there will be much higher numbers of smaller particles relative to larger ones [95]. Studies on BEAS-2B cells [89,90], which included LDH activity, demonstrated that SiO2 NPs with a nominal size of 10 nm had higher ability to induce the pro-inflammatory cytokines CXCL8 and IL-6 compared to NPs with a nominal size of 50 nm [89]. Similarly, Li et al. (2016) demonstrated that NPs had higher cytotoxicity and autophagy dysfunction in human bronchial epithelial BEAS-2B cells when compared to an equivalent mass of microscale particles [90], with both end-points varying in a size- and dose-dependent manner. RAW 264.7 cells [91,93] also demonstrated a size- and dose-dependent relation with cytotoxicity, with smaller particles related to higher toxicity. Makama et al. (2018) found that the Ag NP size-dependent toxicity was evident for the production or reactive oxygen species (ROS) [91], while Loan et al. (2018) demonstrated higher toxicity of Au NPs (5 vs. 30 nm nominal sizes) [93].
The fact that exposure dose also plays a significant role in MexOy NP cytotoxicity is expected, since dose–response relationships are at the heart of toxicity evaluation, although in the case of NPs, the relationships are not always linear. High NP concentrations can lead to particle agglomeration which changes the uptake potential and impacts processes such as dissolution. There has also been significant debate as to the most relevant dose metric for NPs, with particle number/cell or particle number/mm2 proposed as alternatives to mass [95,96]. Au NPs inactivated the DNA repair system, generating dose-dependent DNA ladder bands on agarose gel electrophoresis [93]. Similar results regarding a range of NPs, including Ag, Al, carbon black, carbon-coated Ag and Au NPs, were also observed by Nishanth et al. (2011) [94].
Similarly, previous studies found correlations between cytotoxicity and the NPs’ hydrodynamic size [97,98,99,100,101]. In most cases, the hydrodynamic size of the NPs is larger than the core size. The hydrodynamic size (RH) is calculated using the Stokes–Einstein radius equation (Equation (9)) [102]:
(9)
where kB is the Boltzmann constant, T is the temperature, η is the liquid’s viscosity and D is an ion’s diffusion coefficient, which is proportional to the ion’s mobility, μ (Equation (10)):(10)
where q is the ion charge and μ is correlated (Equation (11)) with ze, which is the ionic charge in integer multiples of electron charges and the medium’s frictional coefficient f (Equation (12)):(11)
(12)
where FDrag is the drag force applied to a perfect sphere travelling through a viscous liquid of frictional coefficient f, s is the sphere’s drift speed and α is its radius.Equations (8)–(11) demonstrate the effect of the exposure medium characteristics on the NPs properties and behaviour. In complex media, the ionic charge and diffusion coefficient will attract free radicals and biomolecules leading to the formation of protein corona [97,103,104]. In this way, the NP’s hydrodynamic size and, subsequently, the way in which the NP will interact with biological organisms will change [100]. Dassler et al. showed that decreasing Fe2O3 NPs hydrodynamic size significantly increased the blood half-life time and biodistribution of these NPs, leading to an alteration of their toxic effects on different organs [98]. This is probably due to the increased potential for NP degradation inside the organism, leading to the release of ions and increased intracellular ROS levels, as shown by Abakumov et al. [99]. Furthermore, increased hydrodynamic size may correspond with NP agglomeration, which can reduce uptake and result in decreased NP toxicity [105].
From the theoretical descriptors, Ec has been correlated with NP toxicity due to its connection to the energy band gap (EG). Zhang et al. (2012) demonstrated that the overlap of EG with the cellular redox potential was strongly correlated to the ability of NPs to induce oxidative stress and acute pulmonary inflammation in mice [45]. Ec represents the lowest unoccupied molecular orbital (LUMO) that participates in electron transfers from and to the MexOy surface, while the valence band (EV) is usually occupied. Thus, if the cellular redox potential is higher than the conduction band edge of the MexOy NP, direct electron transfer from the aqueous electron donor to the conduction band can take place. Alternatively, electrons injected from an aqueous donor could be transferred to the NP and from there to a series of ambient electron acceptors inside the cell until a steady state is reached [45].
The coordination number of metal atoms in the shell region (Avg. C.N. Me atoms surface) defines the number of all atoms lying within radius R of a spherical particle [40]:
(13)
where RM and RO are the ionic radii of the metal and oxygen ions, respectively. The coordination number of metal atoms is, thus, related to the distance between the metal atoms on the surface of the NP and is directly related to the metal atoms’ stability and dissociation potential. The coordination number corresponds to the chemical bonding (ionic or covalent) in the NP, which is directly correlated to the potential mode of toxicity associated with NP dissolution and ion release. Smaller values of this descriptor indicate that metal atoms are prone to dissociate from the NP surface, releasing ions into the surrounding environment and thus having the potential to cause toxicological effects [40,53]. Similarly, the force vector surface normal component of atoms (metals and oxygens) in the shell region (v⊥ Me atoms surface) describes the strength of bonds between the surface atoms and the NP core, as shown in Figure 1, and can help distinguish between the regular (bulk) NP atoms and those demonstrating the high surface activity properties of NPs [53]. Hence, these properties describe the potential energy (stability and activity) of the atoms on the surface of NPs, with smaller force vector values indicating a more thermodynamically stable surface [53]. As seen in Figure 1, the average length V of the surface normal component of the force vector of an atom in the shell region and at distance d from the centre of the NP is calculated using the atomic coordinates relative to the centre-of-mass of the NP (x, y and z) and the respective components of the force vector (fx, fy and fz) using Equation (14):(14)
The workflow developed through the Isalos Analytics Platform provided the opportunity to test different predictive algorithms (e.g., J48, random forest and kNN) and allowed us to pick kNN as the best performing, with k = 2 (i.e., three neighbours). In this case, the coefficient of determination () of the experimental values versus the model prediction on the test set was R2 = 0.91. Model validation took place following the OECD’s guidelines and successfully passed Tropsha’s tests [85] (Table 1) demonstrating the robustness and predictivity of the model. As mentioned in the experimental section, the calculation of the test parameters took place using the regression results of the experimentally measured and the predicted values of BEAS-2B and RAW 264.7 cell viability and vice versa. In good agreement, Y-randomisation demonstrated the model’s robustness and validity. Based on the calculated APD value of 2.645, all predictions in our case were classified as reliable (normalised test set range: 0–0.591). These results can act as a guide for potential limitations of the model and provide future users with an indication of their predictions’ reliability.
The acquired results provided a good picture of the neighbouring space relative to the Euclidian distance between the NP descriptors (specific examples in Figure 2), which were provided by the EnaloskNN node. Results demonstrated grouping patterns among NPs with the same core material, indicating the significance of the theoretical descriptors, which are element (and thus NP) specific, and the usage of only three neighbours (k = 2) to perform the predictions.
Taking into account the significance of NP toxicity for hazard and risk assessment and for the safe by design of NPs, the proposed model has been made available through the NanoSolveIT Cloud Platform [106]. The corresponding webservice can be found at:
The produced model is complemented with a REST API to make it available and easy to use programmatically, i.e., to implement into a workflow, e.g., in KNIME nodes. The REST API is used to communicate with the Isalos analytics platform and request the data submission and exchange necessary to run the model. As the model requires a large amount of data to be transferred, the API has been implemented using the POST Request Method. This method includes in the body of the request the user submitted data to be used to make the prediction. These can be either typed into the webservice’s GUI, as shown in Figure 3a, or uploaded using a .csv file (see the webservice tutorial for more information, which is available at
To use the MexOy NP cytotoxicity API, the user is required to form a tuple of data (i.e., a collection of data, which is ordered and immutable) in JSON format. Hence, the tuple should contain “User row ID,” “Core size (nm),” “Hydro size (nm),” “Ec (eV),” “Assay,” “Exp. dose (ug/mL),” “Avg. C.N. Me atoms surface” and “v⊥ Me atoms surface,” which is the same order of input as in the GUI (Figure 3a). Assuming that the first NP shown in Figure 3a (CuO) is submitted, the request will translate, in correspondence to the GUI presented in Figure 3a, to:
{
“id”: “CuO”,
“coreSize”: 25,
“hydroSize”: 45,
“ecEv”: -5.17,
“assay”: “ATP”,
“exposureDose”: 3.2,
“des306”: 3.579,
“des606”: -0.243,
}
To use the web API, the user can make a request (submit the desired information) using a data transfer software like Client URL (cURL):
curl -d ‘[{“id”:”CuO”,”coreSize”:25,”hydroSize”:45,”ecEv”: -5.17,”assay”: “ATP”,”exposureDose”:3.2,”des306”:3.579,”des606”:-0.243]’ -H ‘Content-Type: application/json’
Based on the submitted information, the model will use existing settings to normalise the data to meet the model’s needs and compute the closest neighbours, corresponding Euclidian distances and model reliability (APD). The normalised response along with the results, in this case, will be:
[
{“id”:”CuO”,
“assay”:”ATP”,
“apdPrediction”:”reliable”,
“nn1ID”:”ZnO”,
“nn2ID”:”ZnO”,
“nn3ID”:”ZnO”,
“nn4ID”:”ZnO”,
“coreSize”:-0.11951770136760874,
“hydroSize”:-2.642180718449538,
“ecEv”:-1.6267973637452349,
“exposureDose”:-0.6247745075500192,
“des306”:-1.2112713652065334,
“des606”:0.44321784950783655,
“nn1Distance”:2.222865548947517,
“nn2Distance”:2.2232594323735833,
“nn3Distance”:2.227550196215906,
“nn4Distance”:2.2491214514422158,
“knnprediction”:1.6576248278753694}
]
Similarly, the user can use the REST API to submit more than one request (tuples). To do so, the user needs to separate each tuple with commas. Assuming the user would like to submit the first two lines in Figure 3a, the JSON request would be:
curl -d ‘[{“id”:”CuO”,”coreSize”:25,”hydroSize”:45,”ecEv”: -5.17,”assay”: “ATP”,”exposureDose”:3.2,”des306”:3.579,”des606”:-0.243},{“id”:”CeO2”,”coreSize”:64,”hydroSize”:120,”ecEv”: -3.8,”assay”: “LDH”,”exposureDose”:0.8,”des306”:7.011,”des606”:-0.1108}]’ -H ‘Content-Type: application/json’
and the corresponding results returned would be:
[
{“id”:”CuO”,”assay”:”ATP”,”apdPrediction”:”reliable”,”nn1ID”:”ZnO”,”nn2ID”:”ZnO”,”nn3ID”:”ZnO”,”nn4ID”:”ZnO”,”coreSize”:-0.11951770136760874,”hydroSize”:-2.642180718449538,”ecEv”:-1.6267973637452349,”exposureDose”:-0.6247745075500192,”des306”:-1.2112713652065334,”des606”:0.44321784950783655,”nn1Distance”:2.222865548947517,”nn2Distance”:2.2232594323735833,”nn3Distance”:2.227550196215906,”nn4Distance”:2.2491214514422158,”knnprediction”:1.6576248278753694},
{“id”:”CeO2”,”assay”:”LDH”,”apdPrediction”:”reliable”,”nn1ID”:”CoO”,”nn2ID”:”CoO”,”nn3ID”:”CoO”,”nn4ID”:”CoO”,”coreSize”:1.9605633643433438,”hydroSize”:-1.505175370072615,”ecEv”:-0.3275969616463464,”exposureDose”:-0.6626005904429065,”des306”:1.6555169751247294,”des606”:0.6598914572620428,”nn1Distance”:1.9192283066219589,”nn2Distance”:1.9208125182623592,”nn3Distance”:1.927695680036072,”nn4Distance”:2.0696148079478345,”knnprediction”:1.9327703185014482}
]
4. Conclusions
In this study, we presented a robust, validated and easily applicable model for the prediction of the cytotoxicity of MexOy NPs. The model was developed using a dataset containing 15 physicochemical and structural descriptors, enriched with 62 atomic computational descriptors using the Isalos Analytics Platform and the Enalos+ nodes. Out of the 77 total descriptors used as input, 7 were deemed statistically significant. These are two experimental parameters (core and hydrodynamic size of NPs), two assay-related parameters (assay type (LDH or ATP) and exposure dose) and three computational descriptors: the energy of the conduction band (EC), the coordination number of surface metal atoms (Avg. C.N. Me atoms surface) and the force vector surface normal component of metal atoms (v⊥ Me atoms surface). EC can be found from libraries of physicochemical descriptors (e.g.,
Supplementary Materials
The following are available online at
Author Contributions
A.G.P., A.A., I.L., E.V.-J. and G.M. developed and implemented the predictive model. J.J., E.V., A.A., L.S., J.B., P.B. and K.T. performed the calculations of the atomistic descriptors. M.K.H. and T.H.Y. performed the S2NANO data curation. M.K.H. and T.H.Y. provided the initial dataset. A.G.P., A.A., G.M., I.L. and A.T. performed the NanoPharos data curation and dataset enrichment. A.T. and A.A. performed the webservice and API development. All authors contributed to drafting and reviewing the manuscript. All authors have read and agreed to the published version of the manuscript.
Funding
This work received funding from the European Union’s Horizon 2020 research and innovation programme via NanoSolveIT Project under grant agreement number 814572. A.G.P., A.A., E.V.J. and I.L. acknowledge support from the POST-DOC/0718/0070 project, co-funded by the European Regional Development Fund and the Republic of Cyprus through the Research and Innovation Foundation. T.H.Y. and M.K.H. also acknowledge support by the Korean Ministry of Science & ICT (MSIT) and the National Research Foundation (NRF) through the International Cooperative R&D Program (No. 2019K1A3A1A78112959), as a part of the European Union’s Horizon 2020 NanoSolveIT Project.
Conflicts of Interest
The authors declare no conflict of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript or in the decision to publish the results.
Figures and Table
Figure 1. Normal component of the force vector of a single atom (green) in the shell region of a metal oxide (MexOy) nanoparticle (NP) (grey spheres: metal atoms and red spheres: oxygen atoms). The average length V of the surface normal component of force vector of all atoms in the shell region of the NP is calculated for all atoms using x, y and z coordinates of atom and fx, fy and fz components of the force vector using the formula V = (x*fx + y*fy + z*fz)/d, where x, y and z are atom coordinates relative to the NP centre-of-mass, fx, fy and fz are the x, y and z components of atoms force vector and d is the distance of the atom from the centre of the NP.
Figure 2. Indicative results for MexOy (CoO, ZnO, In2O3, NiO, HfO2 and Sb2O3) NPs from the normalised k-nearest neighbours (kNN) space of the produced cytotoxicity predictive model. The NPs are placed based on their Euclidian distances. Red and green spheres correspond to NPs from the training and test sets, respectively.
Figure 3. (a) Cell viability webservice for the prediction of MexOy NP cytotoxicity. The user inputs the required parameters and the calculation is performed automatically. (b) The results provided from the webservice include the prediction, the distances of the closest neighbours and whether the prediction falls within the model’s domain of applicability (APD) (reliable prediction) or not (unreliable prediction).
Model validation criteria and acquired results.
Criterion | Result | Assessment |
---|---|---|
R2 > 0.6 | 0.91 | Pass |
Rcvext > 0.5 | 0.904 | Pass |
0.022 | Pass | |
0.002 | Pass | |
0.018 | Pass | |
0.85 < k < 1.15 | 0.994 | Pass |
0.85 < k’ < 1.15 | 1.005 | Pass |
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
© 2020 by the authors.
Abstract
A literature curated dataset containing 24 distinct metal oxide (MexOy) nanoparticles (NPs), including 15 physicochemical, structural and assay-related descriptors, was enriched with 62 atomistic computational descriptors and exploited to produce a robust and validated in silico model for prediction of NP cytotoxicity. The model can be used to predict the cytotoxicity (cell viability) of MexOy NPs based on the colorimetric lactate dehydrogenase (LDH) assay and the luminometric adenosine triphosphate (ATP) assay, both of which quantify irreversible cell membrane damage. Out of the 77 total descriptors used, 7 were identified as being significant for induction of cytotoxicity by MexOy NPs. These were NP core size, hydrodynamic size, assay type, exposure dose, the energy of the MexOy conduction band (EC), the coordination number of the metal atoms on the NP surface (Avg. C.N. Me atoms surface) and the average force vector surface normal component of all metal atoms (v⊥ Me atoms surface). The significance and effect of these descriptors is discussed to demonstrate their direct correlation with cytotoxicity. The produced model has been made publicly available by the Horizon 2020 (H2020) NanoSolveIT project and will be added to the project’s Integrated Approach to Testing and Assessment (IATA).
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
Details











1 NovaMechanics Ltd., Nicosia 1065, Cyprus;
2 Institute of Chemistry, University of Tartu, 50411 Tartu, Estonia;
3 NovaMechanics Ltd., Nicosia 1065, Cyprus;
4 Department of Chemistry, College of Natural Sciences, Hanyang University, Seoul 04763, Korea;
5 Department of Chemistry, College of Natural Sciences, Hanyang University, Seoul 04763, Korea;
6 School of Geography, Earth and Environmental Sciences, University of Birmingham, Birmingham B15 2TT, UK;
7 Division of Physical Sciences and Applications, Hellenic Military Academy, 16672 Vari, Greece;