1. Introduction
The history of investigations devoted to the analysis of networks between chemical composition, molecular, and crystal structures and numerous properties of compounds dates back to 1960s. The development of the X-ray diffraction technique and computational routines allowed to collect information about crystal structures of plenty of inorganic, organic, organometallic, and macromolecular compounds. Early findings in the field of composition–structure–properties networks of these solids gave us knowledge about the typical molecular geometry [1,2], steric, and electronic effects of functional groups [3,4], principles of molecular packing [5], role and energetic of numerous intermolecular interactions [6,7]. Longstanding efforts of the crystallographic community to present crystallographic data in a machine-processed format, and to collect these data in crystallographic databases combined with recent progress in software development promotes further insights into the synthesis of novel solids with desired physicochemical properties (including optical, magnetic, electrical, mechanical, and others).
The literature contains numerous review articles devoted to recent advances in crystal engineering [8,9], design of functional organic (see, for example, Refs. [10,11,12,13]) and inorganic [14,15] materials, and the development of software for analysis of molecular crystals [16,17,18]. However, in my opinion knowledge-based analysis and corresponding software are still insufficiently used by chemists, who often analyze relations between functional properties and intermolecular interactions on the level of bond geometry and Figures of crystal packing. On the other hand, analysis of applications of the software by the end users of software can help software developers to find and overcome limitations of their algorithms, and to propose lines for further development. Thus, in the present paper, to demonstrate the effectiveness of knowledge-based analysis of structure–property relations in crystals, some of the relations, possible application of data–knowledge studies, and predictions to analyze them, and a brief description of procedures will be described. Among a huge number of papers published in these fields, the manuscript cites only (i) recent works devoted to the analysis of correlations between intermolecular interactions and properties of a small molecule (for example its inclination to form polymorphs, solvates, and co-crystals), or a corresponding solid (from a well-known requirement for non-linear optical materials to crystallize in acentric space groups, to recent studies devoted to the effect of solvent presence on mechanical properties), and (ii) the corresponding software developed to investigate these correlations and to design novel solid forms with desired physicochemical properties. As the description of structure–property networks describes mainly the papers published in the last 10 years in the field of organic, organometallic, and coordination crystals, then the software under discussion will be limited with those developed by the Cambridge Crystallographic Data Centre (CCDC) for material chemistry and crystallography. Various examples of applications of the CCDC software to functional materials including their combination with other software, and restrictions found, will be given.
First, the Cambridge Structural Database (CSD) and components of the CSD-Enterprise will be described in Section 2. Then, some properties related to the appearance of a given supramolecular associate, and the tools to search for an associate in the CSD will be reported in Section 3. The properties of solids dependent on the crystal morphology, the Bravais, Friedel, Donnay, and Harker (BFDH) tool for crystal morphology prediction and its’ application to affect crystal morphology, polymorphism, and solvatomorphism will be reported in Section 4. Knowledge-based predictions of H-bonded polymorphism, co-crystal formation, mapping of likely intermolecular interactions, and conformer generation for the synthesis of novel functional materials will be discussed in Section 5, and the analysis of local connectivity and whole architectures of solvent molecules in Section 6. Finally, Section 7 contains information about Python API algorithms compatible with the CSD-Enterprise and about some examples of the successful combination of the CSD-Enterprise tools with external software applicable for investigations in the field of structure–property networks.
2. The Cambridge Structural Database (CSD) and its’ Libraries and Modules
The Cambridge Structural Database [19,20] is one of several databases containing crystal structures of various compounds whose structures have been determined using crystallographic techniques. Particularly, the CSD contains crystal structures of organic, and organoelement compounds, and metal–organic complexes. These data are collected from publications all other the world combined with structural determinations with no accompanying manuscript. Single-crystal data are included to the CSD even if no coordinates are available, while powder studies are included only if cell parameters, atomic coordinates, and refinement details were reported. The original value of the CSD was to simplify access to individual structures, to help crystallographers to avoid redetermination of previously reported structures, and to allow easy sharing of work within the chemical and crystallographic community. In 2019, the number of the CSD individual entries overpassed 1,000,000! This value can be compared with ca. 200,000 of inorganic crystal structures from the Inorganic Crystal Structures Database (ICSD) [21], or 150,000 entries from the Protein Data Bank (PDB) [22].
The second benefit of the database comes from data–knowledge analysis derived from the entire collection. Since the late 1960s, analysis of crystallographic data allowed us to demonstrate a non-uniform distribution of crystal systems and space groups, to estimate ionic and van-der-Waals radii of various elements, or to propose close packing of molecules within a crystal. Analysis of intermolecular distances to reveal structure-forming interactions has been known for tens of years; the role of hydrogen bonds, π...π stacking, and halogen...halogen interactions can not be overestimated [6,7]. Nowadays, information about the crystal structures of discrete organic and organometallic compounds allows us to reformulate principles of their packing [23,24], and to investigate intermolecular interactions more thoroughly. Not only information about geometry and frequency of occurrence of an unusual interaction, synthon, or tecton is extracted from the CSD to support a conclusion about its’ role in crystal packing and electronic effects that cause its’ appearance [25,26,27,28,29]; but it is also recommended to additionally compare interaction occurrence relative to what would be expected at random [30,31,32,33]. Data about molecular geometry—from bond lengths and angles to torsions and ring geometry—form the basis of our understanding of the energetics of molecular conformation. Analysis of these data allows us to understand not only the most stable conformations, but also to rationalize steric and electronic effects of substitutents, or co-formers on the geometry, or to link it with a reaction pathway [34,35,36].
Taking the value of the conformational preferences exhibited by molecular fragments, and intermolecular interactions into account, two dynamic knowledge-based libraries were derived from the CSD. These are: Mogul [37], which stores intramolecular information, and IsoStar [38], which collects information about intermolecular interactions. Mogul contains information about millions of bond lengths, angles, and torsions, each relating to a specific chemical environment. Data from this library can be organized into distributions providing a click-of-a-button access to structures used to construct the distribution. Besides, one can check the geometry of a novel structure (this is of importance for structure refinement from powder X-Ray diffraction data, or for understanding the environmental effects of combinative factors and steric exclusion) or to check the geometry of a protein–ligand docking pose. Data for Isostar were collected not only from the CSD, but also from protein–ligand complexes in the PDB having a resolution better than 2 Å, and ab initio data of the key interactions. Isostar provides the searchable distribution of contact functional groups around a central one (such distributions can be obtained for either the CSD or the PDB collections), and data about geometry (and sometimes energy) of various interactions.
Thus, the CSD and associated libraries serve also as sources of supramolecular knowledge for applications software that addresses specific problems in structural chemistry, rational drug design and crystal engineering. In accord with these tasks, various modules comprising the CSD-Enterprise suite are under constant development (Figure 1). The CSD-System contains the CSD, Mogul, and Isostar databases; ConQuest and webCSD provide off-line and on-line searches within the CSD; Mercury and Hermes are the CCDC’s 3D vizualizers of, respectively, small and macromolecular crystal structures. Currently, both Hermes and Mercury are linked with some CSD-Materials and the CSD-Discovery tools of the CSD-Enterprise. The CSD-Materials module serves for use in crystal engineering and materials design, as well as for the structure solution from powder XRD data with DASH package. Various tools and programs for drag analysis and discovery are combined within the CSD-Discovery module. Note, that the Full Interaction Map (FIM) tool and Conformer Generator tool are intensively used for both material chemistry and drug design. At last, every scientist from this community is welcome to install previously published or to write (and distribute) his/her own CSD Python API scripts.
The CSD-Materials Module
The main CSD module applicable for investigation and comparison of solid forms and intermolecular interactions, which govern packing of these solids, is the CSD-Materials module. Its’ components in the CSD-2019 version allow us to perform:
Analysis of H-bond motifs (searching of motifs and statistics of their occurrences; assessing the risk of polymorphism via H-bond propensities; prediction of co-crystal formation).
Analysis of crystal packing features (searching on selected motifs, analysis of packing similarity and building of a packing similarity tree diagram).
Calculation of theoretical crystal morphology, gas phase MOPAC (Molecular Orbital PACkage) calculation and ‘UNI’ (UNIversal) force field intermolecular energy calculations.
Mapping of interaction preferences around an isolated molecule.
Analysis of solvate and hydrate crystal structures (searching and classification of H-bond motifs, calculation of the volume occupied in a unit cell with solvent molecules, mapping of interaction preferences around them).
Generation of conformers based on geometrical statistics from the CSD.
Solution of crystal structures from powder diffraction data using DASH.
Although this software was developed and attested mainly for pharmaceutical crystal forms, numerous examples of structure–property networks found for other organic and organometallic compounds with valuable optical, electrochemical, mechanical, and other properties are given below. These studies demonstrate that the possible range of applications of the CSD-materials module can be much expanded, although its application to coordination polymers is less diverse.
3. Search and Analysis of Supramolecular Associates
3.1. Search on Hydrogen-Bond Motifs
The utility of big-data analysis of intermolecular interactions extracted from crystallographic databases is based on the fact that the frequency distributions of functional group contacts in the CSD are directly related to the corresponding interaction energies in solution [39]. Analysis of crystal packing in pharmaceuticals, their homologues, polymorphs, co-crystals, and solvates gives one information about preferable synthons that could appear in a binding pocket occupied with this drug (although some weak interactions can be over-, or under-estimated). Data knowledge about typical water associates in crystals of small molecules is important for biochemistry as similar hydrate architectures were found in crystals of inorganic, organic, and macromolecular compounds [40,41].
Among all contacts, H-bonds are the strongest and the most directional interactions that play the dominant role in the crystallization and stability of organic solids, thus their analysis is a central theme of crystal engineering [42,43,44]. For example, stable H-bonding motifs are used widely to fix olefins in photoreactive positions for [2+2] cycloaddition [45,46]. Analysis of H-bonding is important for understanding proton conductivity [47] and high-temperature ferroelectricity caused by proton tautomerism in polar space groups [48,49,50,51]. Reversal of an electric field in the latter solids causes a switch of O-H...O or N-H...N bonds to O...H-O or N...H-N ones accompanied with ketone/enol [48] or neutral/zwitterion [49] isomerisation or imidazole tautomerization [50,51] and the polarity reversal.
Comparison of various H-bonded motifs found between similar functional groups allows one not only to estimate which of them is more stable and abundant to occur but also to shed light on stereoelectronic effects stabilizing various associates [29,52,53]. For example, analysis of 23 phenylglicine amide benzaldimines revealed only five types of H-bonded motifs; the choice of the particular motif depends on the number of H-bonding donors and acceptors, the ease with which the motif is formed, and the possibility of the motif to accommodate additional substituents [54]. The stability of polymorphs of a multifunctional molecule is known to be determined by the energy of an H-bonded motif [55]—that is of importance for drug and food industry, production of high-energy compounds, and dyes and pigments. For these materials the presence of uncommon weak H-bonds in an observed solid form can be indicative of the potential for alternative packing to form another polymorph with stronger interactions [56]. Instead, for high-energy compounds strong H-bonded interactions are disadvantageous as these can prevent molecules from dense packing [57,58,59]. The appearance of particular H-bonded motifs (spiral chains, bilayers, and others) is characteristic for spontaneous resolution of racemic and nonracemic mixtures of some chiral compounds [60,61,62] at crystallizations. Other applications of the analysis of H-bonded associates include salt formation via evaluating synthon competition [63,64], assessing putative structures from crystal structure prediction [65], and analysis of H-bonding in drugs and vitamins [66].
Thus, the search for all representatives of a given H-bonded motif, or for all H-bonded motifs found between some functional groups within the CSD, as well as their classification and statistics of occurrence, become of interest. All these possibilities are realized within the CSD Motif Search (Figure 2). A user can select a pre-defined motif from a special library or generate a new motif. To generate a motif, one should sketch a functional group, select the atoms of the functional group which define the contact(s) of the motif, define interatomic distance, and select the type of motif (a ring, an infinite, or a discrete chain) and the number of contacts within the motif. The search can be carried out in the current version of the CSD, in individual refcodes and families of refcodes or in files of structures. The results can be viewed by motifs with the number of hits found and the frequency of occurrence or by structures with the motifs found for each hit. As more than one motif can be found in a crystal structure, their combination can give not only ring and chain motifs but also H-bonded layers and frameworks. Unfortunately, classification of all matches in accord with all non-equivalent H-bonding associates is unavailable in the current version. Besides, the notation used to describe these motifs [67] do not correspond to that recommended to describe underlying nets of crystalline networks and clusters [68,69,70,71]. Analysis of H-bonded motifs is part of the H-bond propensity tool, the Co-Crystal Formation tool, and the Hydrate Analyzer described in Section 5.1, Section 5.2 and Section 6, respectively.
3.2. StudyingCrystal Packing Features
Besides robust and directed intermolecular interactions, such packing features as π...π stacking, dipole...dipole or halogen, pnicogen, and carbon bonding as well as molecular size-shape regularities can become investigation objects. Some properties of solids are associated with molecular packing instead of intermolecular interactions. These are such properties as luminescent properties of co-crystals of anticrowns with aromatic molecules and hydrocarbons [72,73,74], electroconductivity of layered donor-acceptor complexes [75,76], optical properties of π-conjugated molecules [77,78], dense packing of high-energetic compounds associated with detonation, and stability properties [59,79]. In this context, it becomes of interest how polytopic molecules or molecules with a lack of limited H-bonding functionalities pack, and which factors govern their packing. A comparison of polymorphs, solvates, and co-crystals of such compounds gives a key for better understanding of these factors. For example, an analysis of 88 crystal structures containing trimeric perfluoro-ortho-phenylen mercury (TPPM) revealed only four motifs of their packing with Lewis bases, governed mainly by Hg...C and π...π interactions [80]. It was shown that the type of supramolecular motif strongly correlates with the nature of a co-former. Another example of the role of weak interactions is the co-crystallization of diphenyl dichalcogenides Ph2E2 (E = Se, Te) with diiodotetrafluorobenzene that is incorporated during crystallization between stable tectonic chain Ph2E2 architectures, which were also found in the crystals of pure Ph2E2 [81]. The co-crystallization of this molecule can be considered as a replacement of E–E and Te–π(Ph) chalcogen bonds with I–E and I–π(Ph) halogen bonds via insertion of stacks of halogen bond donors between tectons of halogen acceptors. Analysis of intermolecular interactions by means of the energy framework diagrams revealed that the energies of corresponding chalcogen and halogen bonds were lower than those between the stacks of pure Ph2E2 and diiodotetrafluorobenzene, and illustrates an approach to binary crystals formed by matching tectons (that can be found with the Crystal Packing Features tool).
Analysis of mutual disposition of molecules, in this case, can be carried out with the Crystal Packing Features tool of the CSD-Materials module. In contrast with the CSD Motif Search, a “packing feature” can be generated only from a displayed structure by selecting the atoms and bonds to define the feature. A search query is constructed without sketching any functional groups and atoms based on selected atoms, bonds, and intermolecular distances. Then the CSD is searched to identify crystal structures that contain a similar mutual disposition of atoms. The hits are automatically overlaid with the original geometry, and root-mean-square distance (RMSD) value is reported as a measure of packing similarity. For example, the binary stacks of TPPM and aromatic molecules can be found in a series of co-crystals with luminescent properties [72]. This motif, one of four typical for TPPM co-crystals [80], can be constructed from a TPPM molecule and two aromatic C6 rings situated above and below the TPPM meanplane (Figure 3a). The Crystal Packing Feature search allows us to extract from the CSD other examples of such potentially luminescent co-crystals, and to compare the distances between the meanplanes of planar molecules as a measure of charge transfer from electron-rich aromatic molecules to anticrowns.
A strategy to co-crystal formation based on packing similarity between various co-formers was suggested in Ref. [82]. In this paper, the authors suggested replacing the anion in the crystal of lamaviudine saccharinate with another anion having a similar disposition of acceptor groups (the nature of atoms within acceptors was allowed to vary). Crystal structure and intermolecular interactions of lamaviudine maleate taken as a hit of the CSD search, indeed, were very similar to that of saccharinate. This tool can also be applied to intramolecular interactions or mutial disposition of more than two molecules. For example, the analysis of semicarbazone conformations distinguished that the syn–anti–syn–syn conformation predominates the syn–anti–anti–anti one [83].
3.3. Crystal Packing Similarity Tool
While the Crystal Packing Features described above allows one to select from the CSD all supramolecular associates with a user defined mutual disposition of molecules, the Crystal Packing Similarity tool can be used to compare a large set of solids containing one co-former and to measure similarity between them, as well as to obtain similarly overlaid molecular packing automatically. This tool was used for comparison of the experimental crystal structures with those predicted during the sixth blind test of organic crystal structure prediction methods [84]. Such investigations are also of great importance for the analysis of isostructural compounds [85] and some properties that occur only within a given group of supramolecular associates. For example, tetraphenylborates of N-salicylideneanilines form isostructural series of photochromic solids due to the packing of cations within a cavity formed by phenyl groups of anions, while more dense packing in other salts gives non-photochromic solids [86]. The effect of substituents on isostructurality of compounds was widely analyzed for both rigid [87,88,89,90,91,92] and flexible molecules [93,94,95,96,97,98]. Not only long-accepted isostructurality of Cl- and Me- or -Cl, -Br, and -I substituted molecules were revealed, but also the equivalence of ethylene and azo-bridges [92], or that of azide and iodide substituents [85] were demonstrated. Besides the Crystal Packing Similarity tool, XPac [99] and Crycom [100] are also among the software appropriate to carry out such a comparison.
The Crystal Packing Similarity tool is applicable to compare multiple structures containing the same compound, e.g., its polymorphs, hydrates, solvates, co-crystals, and salts. Within this method molecular clusters (typically containing a central molecule and its’ 14 closest neighbors) are built for each structure, and the clusters are compared with some geometric tolerance to define whether packing is similar or not. Small differences in packings (variation of halogen atoms, absence of some hydrogen atoms, presence of few independent molecules and small co-formers) can be ignored. The results of the comparison are grouped in accord with the number of neighbors forming molecular clusters (from 1 to 2 for different structures to all 14 for pseudo-isostructural compounds), and the groups can also be compared with each other to reveal synthons and associates common for different groups. Such grouping reflects the fact that similar motifs found in polymorphs and solvates typically keep similar intermolecular connectivity and energy of pair interactions [80,101]. The result of such analysis is the dendrogram visualizing crystal packing similarity between different groups of crystal structures. For example, analysis of 50 crystal structures, containing carbamazepin revealed three main motifs (“translation stacks”, “inversion cup”, and “co-former pairing”, Figure 4), and none of these appears via H-bonding [102]. Instead, these motifs seem to represent the most efficient methods for packing of carbamazepin molecules while leaving the carboxamide group free to form hydrogen bonds. Note, that these results coincide with those obtained for carbamazepin-containing solids using the XPac method [103].One of the carbamazepin pseudo-isostructural groups is represented by metastable polymorph II and its solvates and hydrates situated inside the channels formed by the hydrophobic aromatic surfaces of this molecule, which were found to act as stabilizers of this solid [104]. These solvent molecules within the channels can even be replaced with hydrophobic polymer guest molecules [105].
Comparison of an orthorhombic polymorph of trimeric perfluoro-ortho-phenylen mercury with its’ three monoclinic polymorphs gives up to six similarly packed molecules. Figure 3b visualizes that in crystals of {MOXMAN02} and {MOXMAN03} perfluoro-ortho-phenylen mercury form similar stacks, but these stacks are packed in different ways. The interplay of packing motifs and hydrogen bonds were demonstrated on the example of 37 enantiopure and racemic salts of methylethedrine, where the molecule forms six “isostructural” groups, two of which were characterized by alternating motifs of H-bonding [106]. Analysis of co-crystals of meloxicam with carboxylic acids reveal the acids, which are able to break H-bonded meloxicam dimers, and, therefore, to compare the effect of crystal packing on dissolution behavior [107]. The comparison of crystal structures of hydrates of furosemide: Nicotinamide co-crystal suggests the mechanism of dehydration [108]. Galcera, et al. demonstrated by using this tool that the isostructurality in co-crystals of lamotrigin with flexible dicarboxilic acids appears at the presence of solvents able to mimic the difference in conformation and volume of co-formers [109]. The analysis of 42 tyrammonium salts demonstrated that isostructural cation packing can occur even with structurally different anions, with different hydration states and with different hydrogen bonding; realization of different packing was associated with various conformations of the ethylammonium group of the cation [110]. Martins, et al. demonstrated that similarity between supramolecular clusters occurs only for molecules able to exhibit similarity in molecular shape-size parameters (thus called isostructural molecules), contact area, and energy of intermolecular interactions [111]. Fenamate co-crystals with 4,4’-bipyridine keep the packing arrangement of the initial solid connected by the 4,4’-bipyridine [112]. The Crystal Packing Similarity tool was also used to investigate 16 trospium chloride containing structures. This molecule realizes similar conformation in the majority of solvates including a sesquihydrate, while its conformation in dihydrate significantly varies from that in other solvates. All solvates but water belong to the same group as two trospium chloride polymorphs [113]. Dehydration and desolvation of lenalidomide follow “isostructurality” in crystal packing: All solvates at heating convert to its’ thermodynamically stable anhydrous form, whereas all hydrates upon dehydration convert to a metastable anhydrous polymorph able to transform upon further heating to the stable polymorph [114]. Analysis of H-bonding and crystal packing similarity for the crystal structures of vitamin D analogs showed that various conformations of its A-ring are predetermined by H-bonding of hydroxyl group both in crystals and in the binding pocket of the vitamin D receptor and that the exocyclic methylene group also influences H-bonding pattern in solids [66].
To sum up “isostructurality” found by means of the CSD Crystal Packing Similarity tool ignores the requirement for identical symmetry of compounds, and unit cell parameters. Instead, the families of structures with matching molecular clusters when superimposed can be found to reveal crystal packing relationships, motif stability, and hydrogen-bond competition for a variety of solid forms of a molecule (and its’ homologues or structural analogs).
3.4. CSD-Crossminer
The CSD-Crossminer is the novel software of the Cambridge Crystallographic Data Centre first appeared as part of the CSD-Enterprise in 2018. Its utility is based on ideas previously demonstrated on the example of carboxylate/tetrazolate [115], hydrate/peroxosovate [116], and maleate/saccharinate [82] behavior in solids and potential of 3D macrocyclic analogues of small-molecules to serve as drugs [117]. Particularly, molecules which form similar intermolecular interactions and have comparable size and shape tend to form similar supramolecular associates, and hence are able to form isotypical solvates, or bind with similar functional groups within a binding pocket of a macromolecule. A search defined within this software does not contain any chemical formulas or functional groups but uses such chemical feature as “H-bond donor”, “H-bond acceptor”, “hydrophobe”, and others. Starting from a real molecule, mutual disposition of these chemical features can be defined, as well as the disposition of complementary functional features, which may come from other molecules, co-formers, solvates, or proteins and tolerance. The search is carried out within the CSD and the PDB databases and gives a number of hits potentially able to replace the reference molecule in solids or binding pockets. This software was developed as a pharmacophore query tool able to identify common protein binding sites in macromolecules, to determine structural motifs that are able to interact with similar binding sites, to estimate which ligand modifications are tolerated in a binding pocket, and others. More details about this tool are given in a white paper at
4. Solid Form Calculations
Calculations included within the CSD-Materials module of Mercury allows one to simulate crystal morphology using the BFDH (Bravais, Friedel, Donnay, and Harker) method, to perform semi-empirical MOPAC calculations and to evaluate force-field intermolecular energy calculations using the ‘UNI’ intermolecular potentials. The MOPAC software is a semi-empirical quantum chemistry program based on NDDO (neglect of diatomic differential overlap) approximation, which allows a user to perform some calculations (geometry optimization, bond order calculations, molecular electrostatic potential visualization) for isolated molecules. The 2007 or 2009 version can be obtained free for academic users at
4.1. BFDH Morphology Prediction
Some of the macroscopic properties of solids depend on supramolecular architectures of symmetry elements formed by molecules and ions. Thus, if a supramolecular architecture is anisotropic, then, properties measured become anisotropic too; and their understanding and prediction require knowledge of disposition of functional groups, supramolecular synthons, or symmetry elements as compared with crystal faces. Having experimental data about Miller indices of crystal faces, one can examine corresponding planes using the free Packing/Slicing tool of Mercury. Particularly, indexing of the crystal phases becomes of practical interest for nonlinear optics [120], organic photonics [121], piezoelectrics [122,123,124], and organic electronics [125,126,127,128]. Knowledge of functional groups or supramolecular synthons forming crystal faces allowed in some cases to rationalize mechanical effects in dynamic crystals [129,130,131,132] such as self-healing, jumping, bending, twisting at heat, humidity, force or light, and to describe pressure-induced phase transitions (see Refs. [133,134,135] for analysis of such transitions in amino acids). Particle size and shape also affect material properties, such as tabletability [136], thus, their control is of special interest for the pharmaceutical industry.
Taking into account that the solvent can affect crystal morphology [137,138], it becomes of interest to rationalize solvent selection and to predict its’ effect on crystal size and shape. Analysis of functional groups forming each surface gives a clue to solvent choice to prevent crystal growth in some directions, and vice versa, to rationalize the dissolution of a single crystal over various crystal faces [139]. For example, long-chain alkylboron capped tris-pyrazoloximates and clathrochelates readily form thin plate crystals with their main faces formed by hydrophobic alkyl groups (Figure 5) [140,141]. Their X-ray quality single crystals are obtained from polar solvents able to bind with small crystal faces formed by polar groups via hydrogen or halogen interactions, while crystallization from hydrocarbons typically results in twinned and turbostratic conglomerates of crystals. Similarly, ibuprofen [142] single crystals are faceted with (1 0 0), (0 0 2), and (0 1 1) faces, formed by hydrophobic, van der Waals and H-bonded interactions, respectively. As follows from the strength of intermolecular interactions in a crystal, and the strength of interactions between a solute and a solid [143], a polar protic solvent should bind to (0 0 2) faces to prevent the formation of needle morphology. This strategy based on the type of functional groups forming crystal faces was successfully used to optimize crystal morphology of lovastatin [144], 1-hydroxypyrene [145], isoniazid [146], tolbutamide [147] and N-benzyl-2-methyl-4-nitroaniline [148]. For phenacetin single crystals, the crash-cooling experiments resulted in the crystallization of needle-like crystals, while slow growth yielded similar single crystals with a hexagonal-like BFDH-predicted morphology affected by the solvent used [149]. Application of additive molecules able to bind with selected crystal faces also allows the of the controlling crystal morphology [150,151,152,153]. Yunqi Liu and coworkers suggested an interesting modification of such technique to affect the morphology and morphology-dependent breathing effect of metal–organic frameworks via the addition of a small amount of slightly modified ligands to the reaction mixture [154,155]. The hydrolysis of selected ligand decreased the growth of an anisotropic coordination polymer in some directions; the size of a crystal face containing pores was found to affect gas adsorption.
Having access to CSD-Materials, one can predict the morphology using the Bravais, Friedel, Donnay, and Harker (BFDH) crystal morphology tool (Figure 5). This method implies the idea that the crystals are preferable to grow along the direction with strong intermolecular interactions. Not only simulated but also experimental morphology can be depicted and analyzed to evaluate functional groups which form a crystal face and to estimate preferable interactions which occur between this face, a solvent, or an additive. Crystal morphology prediction can be used to estimate the direction of a preferred orientation for an experimental powder XRD patterns. Photomicrographs were taken during isothermal dehydration combined with calculated BFDH morphology to help distinguish concomitant polymorphs [156], to understand the faces, and directions of solvent loss [157], while atomic force microscopy supported with BFDH calculations was used to follow phase changes in situ [158]. Analysis of intermolecular interactions over molecular surfaces allows the rationalization of concomitant polymorphism of substances where single crystals of two crystal forms share faces of their crystals [159]. The case of intergrowth polymorphism [160,161] referred to the existence of two solid forms within one single crystal, which is a special case of such concomitant polymorphism. On the contrary, absence of similar surfaces on dominant phases of 1:1 and 3:2 co-crystals of p-toulenesulfonamide and triphenylphosphine oxide allowed Croker, et al. to conclude that solvent mediated phase transition in this system occurs through dissolving of one phase and re-crystallization of the other [162]. Analysis of polymorphs, solvates, and hydrates of trospium chloride containing structures demonstrated that these compounds can be divided into three main structural groups, and the predicted and experimental crystal morphology of various forms within one structural group was found to be very similar [113]. The BFDH morphology tool was applied to rationalize silicon oil induced spontaneous phase transition in ethynyl-substituted benzamides [163] and unusual mechanical response from a crystal undergoing topochemical dimerization [164].
More complex applications of BFDH morphology tool include synthesis of desired solid forms or prediction of desired properties for previously published compounds. It was shown that self-assembled monolayers of rigid biphenyl thiols [165,166,167] or silioxane-based monolayers [168,169] can be used as templates to exclude concomitant polymorphysm (Figure 6a) or to obtain a metastable polymorph. First, analysis of crystal faces of the desired solid form was carried out to reveal functional groups on one of the main faces; then, the monolayer containing functional groups are able to form stable synthons with the above functional groups, which was used during crystallization (Figure 6b). Not only predesigned monolayers but also polymers [170,171] and additives (Figure 6c) [172,173] can be used as templates for crystallization. For example, β-1,4-saccharides act as templates to produce the metastable form III of paracetamol at crystallization from melt readily forming H-bonds with the (010) surface of form III [172], while the presence of sulfamides in solution promotes crystallization of γ-pyrazinamide from aqueous solutions [173]. Pyrazineamide usually nucleates from solutions in its’ α form that contains molecular H-bonded dimers; amide of pyrazineamides probably forms heterodimers with sulfamide groups of templates on one of the crystal surfaces and then forms N-H...N connected chains with neighboring molecules. These chains are found only in γ-polymorph. Patel, Nguyen & Chadwick [171] not only used polymers to promote heterogeneous nucleation but also suggested to use H-bonding propensities tool to rank polymer surfaces towards heterogeneous nucleation of benzocaine and 1,1′-bi-2-naphthol.
Applications of some anisotropic (needle, plate) crystals are related with anisotropy of chemical bonds within such solids. Anisotropic crystals with predominated faces formed by hydrophobic groups can be used as superhydrophobic porous materials [174]. Analysis of mechanical stimulus applied along three axes of unit cells to some layered energetic materials allows us to conclude that these explosives can convert kinetic energy into layer sliding to prevent the formation of hot spots [175]. This should also be the rationale for why high-performance insensitive energetic materials can be used as desensitizers versus mechanical stimuli. Vice versa, a corrugated molecular structure of celecoxib without any slip-planes and numerous weak interactions in orthogonal directions were suggested to be the reasons for exceptionally high elasticity of its needle singe crystals [176]. Flat naphthalene diimine derivatives were found to have comparable crystal packing governed by π...π stacking, similar crystal morphology, but exhibited various mechanical flexibility attributed to close packing or interlock of their terminal alkyl chains [177]. Investigation of sulfa drug crystals [178], co-crystals of vanillin isomers [179], and amino acids [180] demonstrated that H-bonded layered structures with orthogonal distribution of strong and weak interactions attain the feasibility of cleaving a crystal along a given crystallographic plane parallel with these robust synthons.Particularly, single crystals with molecular surfaces formed by hydrophobic interactions can be applied as clean surfaces for molecular beam epitaxy. The presence of slip planes is also thought to be associated with low elastic recovery upon compression, and greater plasticity [181,182]. Crystal morphology prediction was applied to discover new gelators by focusing on scaffolds with predicted high aspect ratio crystals [183].
Thus, analysis of crystal morphology and functional groups on main crystal faces cover many areas of crystal design and are important for many areas of industry. The tools implemented to the Mercury package can be helpful in these areas, but their application seems to be limited with anisotropic crystals or solids faceted with different functional groups. In other cases, this approach can be improved by DFT calculations of the interaction effect between the growing faces and the solvent molecules, or molecular electrostatic potential on crystal faces.
5. Knowlede-Based Prediction of Supramolecular Associates and Solid Forms
5.1. Polymorph Assessment
Understanding and prediction of polymorphism—the ability of a solid material to exist in multiple crystal forms known as polymorphs—is vital for the industry as polymorphs of drugs, explosives, and pigments exhibit different properties [55]. It can be associated with various molecular conformations of flexible molecules, competing intermolecular interactions of polytopic molecules, various packing of molecules or supramolecular synthons, et cetera. While some paths to control polymorphism are described in Section 4.1, here, the approach to estimate the likelihoods of H-bonded polymorphism is described. It is of special interest for the analysis of compounds containing few donors (D) and acceptors (A) of H-bonding, since competition of donors or acceptors to take part in a hydrogen bond, and restrictions to form all highly likely bonds simultaneously may cause the appearance of polymorphs [184]. Taking into account that various polymorphs exhibit different properties including solubility, bioavailiability, tabletability, et cetera, the pharmaceutical industry has prompted the development of computational tools that are able to predict the likelihoods of H-bonded polymorphs. Thus, a methodology has been developed [185] to estimate the likelihood of H-bond formation for each pair of donors and acceptors in a molecule taking the environment of the D and A groups into account.
The H-bond propensities tool uses only a 2D molecular formula (Figure 7) and involves four stages: Data sampling, model fitting, model validation, and target assessment. For each functional group, it is either automatically assigned from the 2D formula, taken from the functional group library, or sketched manually by using a fitting data set, which is generated by loading in an existing set of structural data, or from the CSD. The H-bonds within the fitting data set are identified to collate statistics and descriptors of H-bonding formation. The logistic analysis is performed to generate a statistical model to determine the likelihood of H-bond formation; the model should be analyzed to include for the final model only significant variables and be good enough to proceed (area under ROC curve (receiver operating curve) above 0.8 indicates good discrimination). The results are ranked by propensity to allow inspection of the most and the least likely D-A pairs. Based on this knowledge-based approach, a structure that adopts the highest propensity H-bonds displays a low likelihood of appearance of a more stable polymorph, while for solid forms with lower propensities of H-bonds, a high risk of polymorphism caused by H-bonding is expected. For example, the most likely interaction for a “molecule” depicted in Figure 7 includes a donor atom of Group 3 and an acceptor atom of Group 1. However, in solid {Refcode} the second likely hydrogen bond between groups 1 and 2 is observed, thus, indicating possibility of appearance of another polymorph with the most likely hydrogen bond.
The tool has proved to be highly valuable in estimating the relative stability of known crystal forms [186]. Particularly, the kinetically more favored Form I of ritonavir displays statistically unlikely hydroxy-thiazoyl and ureido-ureido interactions but exhibits more favorable conformation of the carbamate moiety, while Form II realizes stronger hydrogen bonds as estimated in Ref. [186]. Two X-rayed polymorphs of 4-aminobenzoic acid realize different H-bonding, and the more stable form realizes more likely sets of H-bonding [187]. The approach was also successfully used to rationalize polymorphism for N2-(indol-3-acetyl)-L-aspargin [186], lamotrigine [188], and crizotinib [189,190] (see Figure 8 for chemical formulas of these compounds). Not only strong donors and acceptors of H-bonding can be used for analysis; a competition of hydroxide O-H and ethynyl C−C≡C−H groups to form hydrogen bonds [191] and the role of CHCl3 and CH2Cl2 molecules in crystal packing [192] were demonstrated using this tool.
Known forms of paracetamol both exhibit H-bonding between OH groups, that is a less likely donor to form a bond than an amide fragment. It was assumed that there was a conflict between a good donor group to form a bond disfavoring a poor acceptor to form bonding, which could be overcome by co-crystal formation with molecules containing the same D and A groups [187]. Indeed, co-crystals of paracetamol with diamino- or bis(4-pyridyl)-containing co-formers contain molecules of paracetamol connected by more likely amide/hydroxo or amide/amide pairs of interactions [193]. A good correlation between H-bonding charge analysis and H-bond propensities was also demonstrated for crizotinib [189], and heterocycle-1-carbohydrazoneamides [194].
Note, that the high probability for exhibiting polymorphism does not necessarily allow one to obtain novel polymorphs and that different polymorphs can realize the same H-bonds [195]. For example, probenecid was predicted to realize various H-bonded polymorphs, and differential scanning calorimetry indeed revealed three polymorphs, but XRD experiments showed that all of them realized the same hydrogen bonding [196]. For bufexamac, the propensity tool suggests the possible existence of three crystal forms, while Nauha and Bernstein found two polymorphs with similar H-bonds [197]. Two experimentally observed forms of meglumine realize slightly different H-bonding, which, however, are absent on putative structure landscape as these contain highly unlikely bifurcate acceptor OH groups [197]. For axitinib [189], five anhydrous polymorphs realize the most likely H-bonding, thus, the approach can not distinguish the most stable among them.
To sum up, the propensity tool neither predicts polymorphism, nor guarantees that all sets of theoretically possible H-bonding combinations can be obtained or guidelines can be given on how to obtain any of these polymorphs. It does not describe inclinations of a molecule to form concomitant polymorphs, conformational polymorphs, or packing polymorphs, or solid forms with Z’ >1. Instead, it indicates the possibility of solid forms to organize various H-bonded architectures and provides some guidelines on the amount of affords that one can spend in the experimental search for various crystal forms. Abramov also notes such a limitation of the approach as incapability to distinguish more stable polymorph among crystal forms with similar H-bonded networks, which however can be overcome with additional charge-density analysis of known polymorphs [189]. At the same time, its application is not limited by monomolecular systems. Note, that any desired functional groups, additional co-formers, and solvent molecules can be included in the statistical model. Thus, although this method is not able to predict the ratio of components in a co-crystal, it can be applied for co-crystal design.
5.2. Co-Crystals Design
Co-crystals are solids that consist of two and more components (co-formers) that form a unique crystalline structure having unique properties. Thus, co-crystallization is an approach to optimize the physical properties of solid materials. For example, pharmaceutical co-crystals can be obtained to modulate dissolution rate and physical stability of drugs [198,199]. Schultheiss and Neuman reported [200] the effect of coformers on melting points, stability towards humidity, solubility, bioavailiability, and some other properties of pharmaceutical co-crystals. Karki, et al. [201] demonstrated that the tabletability of pharmaceuticals can be tuned up on the example of paracetamol co-crystals with various co-formers. The dependence of mechanical properties of acid...amid based co-crystals was studied by Saha and Desiraju [202]. The potential of co-crystallization for tuning the properties of the energetic materials [175,203,204], optical materials [205,206], and for the food industry [207] have also been demonstrated in recent papers. In contrast with synthetic routes focused on the synthesis of covalent derivatives containing functional groups that affect the desired property, co-crystallization became of interest as a ‘greener’ process, frequently free of toxic ingredients and by-products. Pharmaceutical compounds in this context become of particular interest as these molecules contain functional groups that can be involved in molecular recognition of biomolecules [208]. The main task of crystal chemistry in this field is to estimate which co-formers, if any, will form a co-crystal with a given molecule prior to screening and investigation of properties of the solid obtained.
One group of methods available are quantum chemical calculations from simple energy minimization to full structure prediction [209]. Another group of methods is based on data knowledge about the likely homo and heterosynthons. Since the works of Etter [210], it is known that (i) all good donors and acceptors tend to take part in hydrogen bonding, (ii) the strongest donors tend to interact with the strongest acceptors. In this context, it becomes essential for a researcher to estimate which donors and acceptors are “good” and “the strongest”. The H-bonding propensities tool allows ranking of various donors and acceptors without any quantum-chemical calculations (Figure 9).
H-bonding propensities tool for uncharged and charged molecules of pyrimethamine and dicarboxylic acids, as well as halogenated aromatic compounds (Figure 10a), was successful in predicting formation and non-formation of adducts [211]. A similar approach was later applied to pyrimethamine with some other drug molecules also taking solvent molecules into account [212]. Note, that proton transfer accompanied with transformation of a co-crystal to a salt was rationalized in terms of their ΔpKa values [213,214], where ΔpKa = pKa[protonated base] − pKa[acid] >2 or 3 was found for salts, negative ΔpKa was characteristic of co-crystals, while an intermediate situation may result in both depending on stable supramolecular synthons [215]. Hydrogen-bond propensities gave similar trends as calculated bond energies for possible synthons in thiazole amides (Figure 10b), and co-crystallization of six thiazole amides with 20 different carboxylic acids demonstrated effectiveness of complimentary energy-based and data–knowledge predictions for the prediction of likely synthons [216]. The knowledge-based approach to co-crystal design was successfully applied to select pyrazinoids and pyridinoids as prospective co-formers for a diuretic drug hydrochlorothiazide [217] to tune its solubility. The amide-pseudoamide synthon was shown to be more stable than the formation of two dimers of the theophylline molecule, that allowed to explain the formation of theophylline: Amide co-crystals [218]. At the same time, theophylline co-crystals with fluorobenzoic acids demonstrate that it is still difficult to predict co-crystallization of small rigid molecules using this method [219]. Similarly, preferable co-crystallization of only one of three tautomers of 2-amino-6-methyl-1,4-dihydropyrimidin-4-one with carboxylic acids was explained using this approach [220]. Potential of co-crystal design for molecules with limited H-bonding functionalities was demonstrated on the example of propyphenazone [221]: NH2, OH, and CO2H functionalities were found to be the most likely groups to interact with O=C group, and eight novel co-crystals were obtained using co-formers containing OH or/and CO2H groups. Dicarboxylic acids [222] or anions [223,224] can be applied to co-crystallize two molecules, which do not form co-crystals.
Note, that the analysis of possible heterosynthons is not limited to co-crystals. It can be utilized to investigate inclusion compounds [225] and mixtures of polymers and solutes [171]. Besides, the ranking of heterosynthons on their relative strength can not only help in estimating the most likely binary co-crystals. This method has great potential in the synthesis of multicomponent co-crystals based on polytopic co-formers. Thus, H-bonded heterosynthons are able to interact with the third co-former through halogen bonds [226,227,228]. π-Stacking in conjunction with hydrogen bonding was used for synthesis of ternary co-crystals [229,230]. Combination of H-bonded synthons and stacking interactions between donor and acceptor planar molecules allowed Desiraju and co-workers to synthesize quaternary and even quintinary co-crystals [231,232]. Partial substitution of some co-formers in quaternary systems with their shape-size analogs even allowed for obtaining six component solids [233].
Although some examples of co-crystals design based on H-bonding propensities tool are given above, successful prediction of inclination of some molecules to form co-crystals obviously needs analysis of some other molecular descriptors. First, the possibility of a solvent to take part in H-bonding should be taken into account [211,212,234]. A systematic study of co-crystallization of paracetamol with H-bond acceptors [235] and donors [236] demonstrated that paracetamol molecules in these crystals are linked via either OH⋯O=C or NH⋯O=C interactions, depending on the presence or absence of substituent groups on the molecule of the second co-crystal former. A machine learning algorithm trained out on a set of paracetamol co-crystal experiments using more than 190 molecular descriptors for each co-former allowed to predict 9 of the 13 experimentally obtained co-crystals within the top 11 suggestions [237]. Unfortunately, this method requires a large amount of experimental work to train the model, and the model obtained should be applied only to molecules with similar molecular descriptors as those for an attested molecule. Fabian has proposed a methodology that involves using size and shape complementarity as the primary driver for co-crystal formation [238]. Analysis of 131 molecular descriptors for 710 co-crystal partners suggested that co-crystals were more likely to form between molecules of a similar size and shape and with a similar polarity of co-crystal formers, thus both these descriptors were recently included to the Co-Crystal Design tool.
Effectiveness of such modification can be demonstrated on the example of 1,2,4-thiadiazole derivative co-crystallization with gallic and vanillic acids [239]. While H-bonding propensities tool gives almost equal probability of occurrence of homo- and heterosynthons, molecular complementarity tool indicates that co-crystals in these systems should form; and these co-crystals were experimentally obtained. Karki, et al. [240] demonstrated that synthon analysis combined with Fabian’s methodology was effective in prediction of possible co-formers for artemisinin. A series of co-crystals of sulfamethoxazole [241] and leflunomide [242] were synthesized using this approach. Prediction of possible H-bonded motifs between tyraminium cations and violurate anions also successfully predicted many of the bimolecular synthons experimentally observed in tyraminium violurate polymorphs and hydrates, but also demonstrated that none of the trimolecular synthons were predicted [243]. Note, that tri- and tetra-molecular synthons are not something unusual in co-crystals (see, for example, tetramolecular associates found in thiazole-amides, Figure 10), thus any predictions should also include such polymolecular associates into account.
To sum up, H-bonding Propensities and Co-Crystal Design tools allow one to simplify co-crystal screening. CSD analysis helps to select complementary functional groups to form heterosynthons more likely to form than homosynthons. Presence of false positives in experimental screening then may be accounted for other factors. For example, although hydrogen bonds are thought to be stronger than halogen bonding, one can undergo a competitive co-crystallization between H-bonded and halogen bonded synthons based on the polarity of a solvent used [244]. It was shown that in polar solvents 1,2-bis(pyrid-4-yl)ethane forms co-crystals via halogen bonds, and in nonpolar - via H-bonds. Of more concern with regards to co-former screening is the possibility of false negatives. If the results of this type of analysis are used to narrow down the number of screening experiments performed, any possible co-crystals that are incorrectly marked as unlikely to form would be missed. Besides quantum calculations of molecular dimers or logistic models trained on large datasets of co-crystals, development of shape-size molecular complementarity tools for the prediction of possible co-formers seem to be very promising. Note, that the CSD-Crossminer package developed for sophisticated search of analogues of drugs is now available (Section 3.4). It utilizes the idea that molecules with similar disposition of functional groups and hydrophobic fragments may act similarly with proteins and replace each other. The idea of substitution of one of the co-crystal formers with its size-shape analog can be realized using this program.
5.3. Full Interaction Maps
The Full Interaction Maps (FIM) tool [245] implemented within the Mercury package visualizes the likelihood of a synthon appearance and corresponding geometry variation between functional groups of a molecule under consideration, and a probe functional group. For each functional group of a molecule, 3D scatterplots of CSD contact searches with a chosen probe, and the functional groups are generated and converted into scaled density maps. These density maps are then combined for the whole molecule, taking the environmental effects of combinative factors and steric exclusion to account. The 2019 version of the CSD-Enterprise contains (i) RNH3, uncharged and charged NH nitrogen atoms as probe functional groups of donors of hydrogen bonds, (ii) various oxygen atoms (of a carbonyl and alcohol group or a water molecules) as a probe of hydrogen bond acceptors, (iii) methyl and aromatic carbon atoms as a probe of stacking interactions. The CSD-Enterprise version of 2018 allowed the use of C-I and C-Br interactions as a probe of (iv) halogen interactions; and the next year these interactions were extended with C-F and C-Cl probes, to evaluate differences between various halogen atoms. For each functional group, the FIM distribution is similar with IsoStar scatterplots of corresponding functional groups, but the FIM for a whole molecule reflects the fact that some of functional groups are better donors and acceptors of, for example, H-bonding, than the others. On these maps, this difference is expressed in the color (and the coordinates of the most likely positions of the donor and acceptor sites can be additionally calculated and depicted), while corresponding quantitative values can be estimated using the H-bonding Propensities tool (Section 5.1). Besides, these maps are very sensitive to the steric exclusion from other molecular species and would differ for a molecule in optimized and experimental geometries, and those obtained with CSD-Conformer Generator.
The FIMs probed with H-bond donors (blue) and acceptors (red) and supramolecular synthons of the Pbcn polymorph of chalcone {BZYACO01}and 5-(4-bromophenyl)-1-phenylpent-2-en-4-yn-1-one {YILNAK} are depicted in Figure 11. For both molecules’ blue regions near the ketone oxygen atom reflect its inclination to form bifurcate bonding due to the presence of two lone pairs on the oxygen atoms. The most expected positions of acceptors of hydrogen bonding for the chalcone are situated on the opposite sites of the molecule, and correlate well with observed supramolecular synthons in its polymorphs—a dimer depicted in Figure 11, a {BZYACO01}and a head-to-tail chain {BZYACO04} similar with that of 5-(4-bromophenyl)-1-phenylpent-2-en-4-yn-1-one. Elongation of the π-conjugated chain with a triple bond makes the appearance of such chains more abundant than dimer occurrence (Figure 11b), that was experimentally confirmed for a series of 1,5-diarylpent-2-en-4-yn-1-ones [246,247].
This tool has been applied mainly for the analysis of strong hydrogen bonds [54,170,190,194,248,249,250,251] in accord with the first test functional groups suggested within this tool, and the first paper published to describe it. Mutual disposition of hot spots can be used to find functional groups of the same molecule [245], co-formers [243,249,250], active sites in a binding pocket of a macromolecule [252], or surface inhibitors [170], which match a given pattern of interaction preferences. Analysis of polymorphs of some drugs demonstrated that disposition of H-bond donors and acceptors close to the hotspots are indicative for the more stable polymorph even if a molecule realizes less likely conformation [190]. However, in the case of polymorphism induced by weak intermolecular interaction reorganization [253] or high-pressure [254], it has lower predictive ability. This can easily be understood if we keep in mind that data for the FIMs are plotted based on structural information derived at atmospheric pressure.
At the same time, the test carbonyl oxygen and uncharged NH groups can be successfully applied to reveal positions of acceptors and donor of C-H...O [170,255,256,257] and C-H...N [258] bonding. It was demonstrated in Refs. [255,257] that the FIMs can be applied to estimate likely C-H...O bonded motifs for chalcones, polyenones, pentenynones and cyclic ketones with vinylacetylene fragments. At the same time, it was demonstrated that C=O and C-Br groups compete with each other for the most acidic hydrogen atoms, thus that FIM predictions of synthons based on C-H...Br bonding became less reliable at the presence of C-H...O=C bonding [257]. At the absence of carbonyl groups the FIMs for the N-salicilidenanylines probed with C-I and C-F groups successfully predict all of the C-I...N and many of the C-F...H-C interactions in co-crystals of N-salicilidenanylines with perhalogenated co-formers [259]. Mugheirbi and Tajber analyzed the FIM hotspots around the itraconazole molecule to understand the molecular environment in the mesophase [260]. The itraconazole lacks any donors of strong H-donors but has a number of competing acceptors of H-bonding; thus, analysis of the FIMs and FTIR spectra allowed them to reveal the most ordered of mesophases, and to propose that the greatest mobility of this molecule is associated with movement of the triazoline ring. Analysis of FIMs in two polymorphs of the dinuclear Co(II)-Shiff base complex revealed the most acidic H(C) atoms (similar in both polymorphs), which take part either in C-H...O bonding in the triclinic polymorph, or in the C-H...π bonding in the monoclinic polymorph, or even to not take part in any prominent intermolecular bonding [256].
Note, that unsatisfied strong acceptors observed in a crystal structure solved from powder XRD data may be indicative of missed water/solvent [261,262]. In this case coordinates of the most expected position of a water molecule can be used in the refinement. Worth noting, that IsoStar and FIMs deal only with intermolecular interactions but a similar idea could be used for investigation of metal-ligand bonding in polytopic ligands. This could be helpful for understanding the factors that govern linkage isomerism, and to estimate the most likely coordination mode of the most widespread ligands.
5.4. CSD-Conformer Generator
Representation of a molecule in the three-dimensional space finds numerous applications in structure solution from powder diffraction data in real space, crystal structure prediction including the formation of co-crystals, protein–ligand docking, and others. Thus, the CSD-Conformer Generator tool included in the CSD-Enterprise as an approach to a fast generation of plausible molecular conformations by using geometric distributions derived from the CSD. First, for the input 3D molecular model with all hydrogen atoms present, all bond lengths and angles are minimized based on corresponding average values. Then, the molecule is partitioned into rotamers, and rotamer libraries and ring template libraries are used to generate a conformer tree with unforbidden and preferred rotatable bond geometries and ring geometries. A final set of conformers is clustered according to conformer similarity. Each conformer is locally optimized in torsion space. It was demonstrated that this tool reproduced well conformations of a number of molecules observed in the CSD and the Protein Data Bank [263,264]. Some discrepancies between the predicted and experimental geometries occurred for the ligand: Macromolecule complexes in unusual conformational space, rare rotamer examples, uncertain bond types, and some other cases, nevertheless, theoretical configuration space represents well experimental data [265,266] or configurations obtained by the Molecular Operating Environment’s Low Mode Molecular Dynamics module [267]. It was demonstrated that the CSD-Conformer Generator combined with ab initio [268] or DFTB3-D3 [269] calculations can be used for crystal structure prediction of some flexible pharmaceuticals. Such combination reproduced well the molecular geometry and crystal parameters, although did not provide sufficiently accurate energy ranking.
6. Hydrate/Solvate Analyser
As it was described above, a solvent can affect crystal morphology and sometimes determines the polymorph, but it also can be built into the crystal structure to form a hydrate or solvate. Since water is nature’s solvent, non-toxic and widespread, the scientific community is interested in understanding water assembles in liquids and solids. In particular, medium-sized and large water clusters are important for biology, since they act as surrounding and solvating solutes for biologically active molecules, fill discrete voids and channels in molecular and supramolecular assemblies including their reactive sites and interpenetrate into the interfacial region of hydrophobic surfaces [270,271,272,273]. Besides, water associates are involved in dynamic processes such as proton transport, material or molecular folding, de/resolvation, and others [274,275], and even affect mechanical properties of single crystals [276]. The behavior of liquid and solid water including ice, clathrates, and ice-like systems are determined by the disorder of the hydrogen atoms, and the binding energy of different configurations as well as some other properties can also differ [277,278,279]. At last, but not least different solvents may stabilize different forms of a molecule (the neutral or zwitterionic forms) [280,281]. In other words, both (i) connectivity of a solvent molecule in a crystal, inclination to form a given associate, and (ii) dimensionality and unit cell volume that goes to solvents are of interest for biology, crystallography, and material chemistry. All corresponding algorithms are realized within the Hydrate and Solvate Analyser tools of the CSD-Material module. The Hydrate Analyser provides information about the water H-bond geometry and motifs (one of any of 10 most common motifs described by Ref. [282]) detected, information on the volume occupied with water molecules, and display the water space and water interaction maps. The Solvate Analyser provides similar information about simple and mixed solvates, co-formers and ions.
6.1. Analysis of the Local Connectivity of a Solvent
Analysis of solvate connectivity and the most abundant motifs (if any) formed by solvent molecules is similar to that described above for the investigation of other synthons and supramolecular associates. The practical meaning of results obtained for material chemistry, biochemistry, and the pharmaceutical industry are based on the fact that similar hydrate architectures were found in crystals of both inorganic, organic, coordination, and even macromolecular compounds [40,41]. H-bonding between a complex and an outer sphere solvent can lead to additional quenching in luminescent materials that are unfavorable for the task-specific design of optical materials [283,284]. Comparison of clathrate, turbolato-clathrate, and non-clathrate hydrates revealed that water associates in the latter can be regarded as fragments of clathrate hydrates [285]. Analysis of hydrate architectures in crystal structures of known kosmotropic and chaotropic agents revealed that the kosmotropes tend to take part in H-bonding with hydrates, while chaotropes in crystals tend to from clathrate hydrate-like structures [286]. Kosmotropic agents are those able, by the ordering of the water structure in solution, to enhance intermolecular interactions within protein molecules, thus preventing denaturation, while chaotropes act oppositely. Hence, the property of a molecule to crystallize with water giving full enclathration was directly associated with the ability to act as chaotrope.
Most frequently water serves as a donor of two hydrogen bonds and an acceptor of one hydrogen bond, although three other motifs are also relatively common (Figure 12a [282,287,288]). The dual nature of this molecule in respect to H-bonding allowed proposing imbalance in the number of donor and acceptor groups of a polytopic molecule to be the reason for form hydrate appearance [289]. Although, it is in accordance with Etter’s rule, which states that “all good proton donors and acceptors are used in hydrogen bonding” [210], Infantes, Fábián, and Motherwelldemonstrated that the imbalance of donor and acceptor groups, in fact, does not affect inclination of a molecule to form hydrates [288]. Instead, it more readily interacts with unsatisfied acceptors of H-bonding, especially R2PO2-, Cl-, C-NH3+ groups, acts as a bridge between unsatisfied donors and acceptors, and occupies the free volume, especially for chiral molecules [290]. The strong imbalance between the number of donors and acceptor groups between tetrasulfonate-functionalized rigid anions and planar polyamino-containing cations indeed afforded their crystallization as hydrates, where water molecules act both as bridges between cations and anions, and clusters incorporated within cavities and channels of H-bonded networks [251]. Nevertheless, typically it is still hardly possible to predict if a compound will form a hydrate or not. Zaworotko and coworkers revealed that not only molecules with unsatisfied donors but also acceptors of H-bonding also readily form hydrates, but the reason of CSD statistics, in this case, might underestimate inclination of a molecule to form a hydrate, probably, because the most effective pathways to hydrate formation were slurrying in water and exposure to humidity [291]. The role of MeOH [292], DMSO [293,294], and CHCl3 and CH2Cl2[192] solvates in crystals was also investigated.
Comparison of motifs formed by water molecules in organic hydrates and by H2O2 in organic peroxosolvates gave a clue to the understanding of their isomorphysm [116,295]. It was found that H2O2 always forms two H-bonds as a proton donor, and up to four bonds as an acceptor of H-bonding (Figure 12b). Thus, only three of the peroxosolvate H-bonding patterns are similar to those observed in crystals of hydrates. As isomorphous substitution may occur only in crystals with similar H-bonding motifs, peroxosolvates with DD, DDA, and DDAA motifs connectivity require high H2O2 concentrations for synthesis, are sensitive to humidity, and as a result are of limited practical meaning. Instead, peroxosolvates involved in at least five hydrogen bonds and strong H-bonding with acceptors are not inclined to take part in isomorphous substitution with hydrates.
Not only water connectivity, but also the main motifs formed by water molecules, or supramolecular architectures including water molecules as well as suggested nomenclature were described [67]. Combined together, these data have practical meaning for prediction of coordinates of missing water molecules for solving and refinement of crystal structures from powder X-Ray diffraction data [261], as well as for molecular docking with GOLD (Genetic Optimization for Ligand Docking), part of the CSD-Discovery module.
6.2. Analysis of Solvent Associates
Thus, besides analysis and classification of H-bonded motifs, visualization of associates, and calculation of the unit cell volume occupied with water/solvent molecules (both coordinated and not) is available via Hydrate/Solvate Analyser tool. On Figure 13, a DMSO and water space in the structure of bosutinib DMSO solvate trihydrate {ABECES} are shown in red and blue, respectively. It is clearly seen that both solvents form isolated clusters, while in the structure of tetrabutylammonium fluoride clathrate hydrate {CIPRAV} H-bonded water molecules form a 3D architecture with tetrabutylammomium cations situated in pores of this framework (Figure 13b). Visualization is similar to void analysis, but there is no need to remove any molecules to carry out calculations. Moreover, the Solvate analyser tool can be used to identify not only solvents, but also co-formers and ions, disordered fragments, or even hydrophobic/hydrophilic fragmentset cetera, as well as to visualize their packing and check H-bonding involving the identified components.
Analysis of the voids within previously published copper-containing coordination polymers allowed Inokuma et al. to find a crystalline sponge with desired pore size and affinity to guest molecules and demonstrated that a huge number of previously published coordination polymers can be used to capture guest molecules [296]. Anisotropic motifs formed by solvent molecules affect mechanical properties of compounds, and dehydration mechanism. For example, presence of infinite H-bonded water chains is typically associated with the reversible dehydration process and similarity between crystal structures of the hydrated and anhydrous forms of a solid, like caffeine [297], carbamazepin [104], aspartame [298], and Shiff-base Ni(II) complex [299]. Such channels can be not only dehydrated but also substituted with other solvents [300,301]. The solid dehydration with prominent atomic movement traced with single-crystal or powder X-Ray diffraction can shed light on a more complex mechanism of water loss [302,303,304,305,306]. Various mobility of the water clusters incorporated within the pores and H-bonding waters that structure these pores was demonstrated in Refs. [251,307]. Liu et al. demonstrated that anhydrous uric acid and its’ dihydrate exhibited various mechanical properties of their single crystals despite the similarity of layered motifs formed by the acid [276]. 2D water associates situated between acid layers perpendicular with single crystal main faces break at indentation with dehydration at crystal surface and make this solid softer than the anhydrous form.
7. Additional Software Compatible with the CSD-Materials Analysis of Structure–Property Networks
7.1. CSD Python API
The CSD Python API is now part of the CSD-Enterprise automatically installed as part of the CSD installation. This allows one to run supplied or user-written Python scripts for the loaded structure or a set of structures. CCDC Python-Built-In scripts include at the moment:
Analysis (generation of conformers and calculation of their RMSD (conformes_similarity.py) or of all torsion angles (generate_conformers.py) for the loaded structure).
External (load_in_conquest.py– loading the current structure into the ConQuest sketch window).
Reports (generating a report in *.html format containing crystallographic details, molecular geometry, and intermolecular hydrogen and halogen bonding (crystal_structure_report.py), or molecular geometry, including Mogul geometry analysis (molecular_geometry_report.py), or crystal packing descriptors (quick_packing_check.py), or basic chemical, crystallographic and publication information about the loaded structure (structure-simple_report.py)).
Searches within the CSD for entries relevant to the specified chemical name or synonym (chemical_name_search.py) or having similar molecular geometry (molecular_similarity_search.py).
One can additionally download from the CSD-Python API Forum [308] the following scripts:
To generate molecular formula and weight (welcome-and-weight.py), Crystal14 input files (crystal_inputs.py) or Packing Similarity dendrogram (Packing_Similarity_Dendrogram.py);
To send a Mol2 or CIF files to an external application (send_mol2_to_notepad+.py);
To generate diagrams for all molecules in a structure file (diagram_to_file.py);
To find covalently bonded clusters within a structure (dimensionality.py);
To perform void calculation in a crystal (void_calc.py);
To merge GOLD docking results (gold_merge.py);
To filter molecular conformers with unusual torsion angles (conformer_filter_density.py);
To extract from the CSD unique rings of a specified size (ring_type_count.py);
To compare protein bound ligand with CSD-Generated conformers (find_binding_conformation.py).
Some other examples of Python scripts applied for investigation of crystal structures include removing solvent molecules from a MOF subset extracted from the CSD to perform analysis of geometry and properties of nondisordered porous compounds [309]; authors realized the possibility to remove by request both uncoordinated and coordinated solvent molecules from coordination polymers. Dolinar et al. wrote a Python script to perform the CSD search of chiral monomolecular compounds [310]. Bryant, Maloney, and Sykes investigated a number of polymorphs and co-crystals with previously reported tabletabilities to analyze how different factors govern tabletability of organic crystals, and to suggest a programmatic method to calculate corresponding crystal descriptors [311]. Their investigation of the most likely slip planes, their interpenetration on connection by weaker interactions, presence of other slip planes, and automatic assessment of H-bond dimensionalities revealed that the degree of separation, followed by the presence of H-bonds between the layers, and finally d-spacing between potential slip planesare the most important descriptors affecting the tabletability. The corresponding CSD Python API protocol that can be applied for the prediction of mechanical properties of organic crystals was also published [311]. Miklitz and Jelfs reported a Python script for structural analysis of porous materials (organic and coordination cages) that estimates the cavity diameter, number of windows, and their diameter [312]. It uses Cartesian coordinates and atom types for input, thus, can be in principle applied also to selected fragments of framework materials. Besides codes to extract and analyze data stored within the CSD, for text mining of selected properties within a large set of papers was also published [313]. The algorithm uses manuscript html files as input for extraction of surface areas and pore volumes of metal–organic frameworks with at least 73% accuracy and can be used for investigation of structure–property networks.
Thus, the Python API gives a convenient path to combine and modify the CSD requests with CSD-Materials, CSD-Discovery tools or external software for ones’ purposes. The next Section gives some examples of the combination of the CSD-Materials tools with external crystallographic software.
7.2. Combination of Various Tools and Algorythms
Some combinations of the CSD-Materials tools with external software were mentioned in previous Sections, for example, the combination of the CSD Conformer Generator with periodic calculations to predict crystal structures, or that of the CSD Co-Crystal Former and the H-bond propensities with DFT calculations to better understand competing for intermolecular interactions. Let us mention also some other examples, where the CSD-Materials tools were used in unusual way or as the tool to solve only part of a scientific problem.
Crystal packing of organic single-component compounds with Z’ = 1 were compared to evaluate the effect of chemical transformation on crystal packing and isostructurality using a combination of the CSD Python API scripts, external software, and the CSD Crystal Packing Similarity tool [85]. For this purpose, 15,5543 organic compounds were extracted from the CSD based on their composition and data quality. Then, a freely available algorythm published in 2010 [314] was used to cleave up to three acyclic single bonds between functional groups as demonstrated in Figure 14. Comparison of the Crystal Packing Similarity between molecular clusters formed by the “Value” molecular parts, allows us to reveal the most frequent terminal transformations of the “Key” groups and to evaluate their effect of the packing and isostructurality. For example, substitution of the methyl group to the chloro-, azide- or iodo- groups gives isostructural pairs in nearly 30% of cases.
Recently possibility of prediction of H-bonded motifs using a topological approach was demonstrated for some families of organic molecules with a large number of X-rayed representatives [315,316]. This method schematically represented in Figure 15 utilizes the idea that (i) only a limited number of architectures can be formed from a particular building block with a given connectivity, (ii) overall topology of a network depends on connectivity of building blocks, and (iii) propensities of various networks appearance are not equal. While relations between connectivity of an organic molecule and resulting network and corresponding propensities of their appearance have already been published [315,317], this approach does not allow evaluating the most abundant molecular connectivity of insufficiently studied families of organic compounds. Recently, maps of electrostatic potential, the Full Interaction Maps, and the H-bonding Propensities were compared as the tools to overcome this problem for heterocycle-1-carbohydrazoneamides [194]. Knowledge-based approaches demonstrated competition between various nitrogen atoms to act as acceptors of H-bonding with the donor -NH2 group and allowed evaluating local molecular connectivity. Experimental crystal structures realize one of highly abundant theoretically predicted topologies of an H-bonded network.
Tilbury, et al. suggested to combine the H-bond propensity based prediction with COSMO-RS theory [318,319] to predict drug substance hydrate formation [320]. Hydrate formation probability is estimated based on propensities calculated for each donor/acceptor groups in drug...drug or drug...water pairs. The most favorable hydrate formation has the maximal difference between these values. Although within this model relative strength of different functional groups to interact with water is displayed, this model does not take steric effects and intramolecular interactions into account. Additional quantum mechanics approach to eliminate donor and acceptor groups that are unable to form intermolecular interactions and to calculate the most favorable molecular and hydrate conformations allowed to improve predictive fidelity of calculations.
Chandra, et al. [225] applied CSD to enhance the solubility of telmisartan (TEL), a low soluble antihypertensive drug. First, the possibility of TEL to form likely intermolecular interactions with sulfobutylether beta-cyclodextrin was confirmed using H-bonding propensities tool. Then, the best docking pose of an inclusion complex of TEL in cyclodextrine was found with GOLD. The predicted binding mode was in line with the experimental spectra for the inclusion complex, which demonstrated enhanced solubility and dissolution rate of TEL.
Correlation between solubility and various parameters describing dimethyl sulfoxide (DMSO) role in crystals structures was carried out by Spitery, et al. [294]. They analyzed various parameters of hydrogen bonding of DMSO molecules as obtained from crystal structures reported in the CSD with solubility estimated using Chem3D [321], and found a negative correlation between the number of interactions the solvent is involved in and solubility of a solid.
8. Conclusions
The Cambridge Crystallographic Data Center holds a unique position of a curator of the Cambridge Crystallographic Database that now contains more than 10,000,00 crystal structures and a software developer for the analysis of collected data. This software allows the visualization of complex statistical data in a click-of-a-button manner (Full Interaction Maps, Conformer Generator, Mogul analysis, Solvate Analyzer, BFDH predictions), and investigation of local connectivity of functional groups and small molecules (H-Bond Propensities, Co-Crystal Former, Motif Search) that became possible due to the network between chemical diagrams and automatically calculating interatomic and intermolecular distances. Despite the complexity of the utilized algorithms, big data analysis, and their sophisticated processing, the software remains visible, user-friendly, and available for each CSD user. Thus, its’ visibility and flexibility combined with exhaustive free documentation (see user guides, how-to-video, tutorials, and other materials at [322]) provide the basis for successful application in crystal engineering, material chemistry, and chemoinformatics.The diversity of CSD-supported studies described in this review indicate the great potential of the knowledge-based software for future development in these and associated fields, thus, the development of links with other molecular and crystallographic databases and repositories of properties would be advantageous.
The combination of the CSD tools with external software similarly with UNI and MOPAC calculations incorporated within the CSD-Enterprise seems to be very prospective for future development. DFT calculations, especially periodic ones, could provide further insight into the field of coordination polymers with numerous applications in gas storage and separation, catalysis, and spintronics. Implementation of graph theory to polytopic molecules and ligands could expand data about the local (molecular) connectivity to knowledge about possible and most abundant coordination and H-bonded architectures. The future of this field still possesses many challenges, but there is no denying that the value of the Cambridge Structural Database and associated software for the knowledge-based analysis in the filed of composition–structure–properties studies can not be overestimated.
Funding
This work was supported by the Ministry of Science and Higher Education of the Russian Federation.
Acknowledgments
The author is grateful to A. A. Korlyukov for fruitful discussion of the text.
Conflicts of Interest
The author declares no conflict of interest.
Figures
Figure 1. Modules comprising the Cambridge Structural Database (CSD)-Enterprise (ver. 2019).
Figure 3. Examples of the (a) Packing Feature and (b) Packing Similarity search hits. A sandwich of a TPPM with two aromatic C6 rings was constructed as the Packing Feature from a reference (red) {MOXMIV}a molecule and was also found in (blue) {QATTAH}. The Packing Similarity comparison of an orthorhombic polymorph of TPPM (blue) {MOXMAN02} with its’ three monoclinic polymorphs gives the best similarity with (red) {MOXMAN03}. a Here and below a six-letter CSD-Refcode of a compound is given in Figure braces.
Figure 4. Carbamazepin molecules packed in (a) “translation stacks” in {CBMZPN11}, (b) “inversion cups” in {CBMZPN01} and (c) “co-former pairs” in {UNEZAO}.
Figure 5. Bravais, Friedel, Donnay, and Harker (BFDH) predicted the morphology (blue) and hydrophobic groups forming the main crystal faces of a long-chain alkyl (a) tris-pyrazoloximate {1497845}, and (b) hexahalogenoclathrochelate {YUFYUV04}.
Figure 6. Schematic representation of (a) concomitant polymorphism, (b) crystallization on self-assembled monolayers, (c) crystallization on additives.
Figure 7. Flowchart for the methodology to predict the likelihood of hydrogen bond formation.
Figure 8. Schematic representation of compounds analyzed using H-bonding propensities tool.
Figure 9. Flowchart for co-crystal design strategies based on synthon competition.
Figure 10. Examples of homo/heterosynthons analyzed for (a) pyrimethamine [211,212], and (b) thiazole-amides [216].
Figure 11. Interaction maps and supramolecular synthons in the crystal structures of (a) chalcone {BZYACO01}, (b) (E)- 5-(4-bromophenyl)-1-phenylpent-2-en-4-yn-1-one {YILNAK}.
Figure 12. H-bonding patterns in (a) organic hydrates [288] and (b) organic peroxosolvates [116] (X denotes any donor or acceptor atom).
Figure 13. The structure of (a) bosutinib dimethyl sulfoxide (DMSO) solvate trihydrate {ABECES} and (b) tetrabutylammonium fluoride clathrate hydrate {CIPRAV} with solvate analyser display of DMSO space shown in red and water space in light blue.
Figure 14. (a) Flowchart of the algorythm of bond cleavage reported at Refs. [85,314] and (b) some fragments formed from single, double, and triple cuts of lamotrigine.
Figure 15. Flowchart of the algorithm to estimate H-bonding architectures of organic molecules.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
© 2019 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Abstract
Intermolecular interactions of organic, inorganic, and organometallic compounds are the key to many composition–structure and structure–property networks. In this review, some of these relations and the tools developed by the Cambridge Crystallographic Data Center (CCDC) to analyze them and design solid forms with desired properties are described. The potential of studies supported by the Cambridge Structural Database (CSD)-Materials tools for investigation of dynamic processes in crystals, for analysis of biologically active, high energy, optical, (electro)conductive, and other functional crystalline materials, and for the prediction of novel solid forms (polymorphs, co-crystals, solvates) are discussed. Besides, some unusual applications, the potential for further development and limitations of the CCDC software are reported.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer