Content area
Every year, researchers create hundreds of thousands of new materials, each with unique structures and properties. For example, over 5000 new metal-organic frameworks (MOFs) were reported in the past year alone. While these materials are often synthesized for specific applications, they may have potential uses in entirely different domains. However, linking these new materials to their best applications remains a significant challenge. In this study, we demonstrate a multimodal approach that uses the information available as soon as a MOF is synthesized, specifically its powder X-ray diffraction pattern (PXRD) and the chemicals used in its synthesis, to predict its potential properties and uses. By self-supervised pretraining of this model on crystal structures accessible from MOF databases, our model achieves accurate predictions for various properties, across pore structure, chemistry-reliant, and quantum-chemical properties, even when small data is available. We further assess the robustness of this method in the presence of experimental measurement imperfections. Utilizing this approach, we create a synthesis-to-application map for MOFs, offering insights into optimal material classes for diverse applications. Finally, by augmenting this model with a recommendation system, we identify promising MOFs for applications that are different from the originally reported applications. We provide this tool as an open source code and a web app to accelerate the matching of new materials with their potential industrial applications.
Many MOFs are synthesized every day around the world but it is not easy to identify their best application right away. Here, authors present a multimodal machine learning model that connects these materials to their potential applications using only information available upon synthesis.
Introduction
By selecting metals and organic linkers, we can potentially synthesize millions of possible metal-organic frameworks (MOFs)1. Given this high chemical tunability, MOFs can be tailor-made for various applications. MOFs have been explored extensively over the past two decades for applications ranging from catalysis2 and gas separation3 to sensing and electronics4, and over 15,000 porous5,6 and 120,0007 nonporous MOFs have been synthesized. With the integration of automation and artificial intelligence (AI) in chemical synthesis8, 9–10, we can expect the discovery rate of new MOFs to grow even further.
Despite this significant progress, connecting newly synthesized materials to their best possible applications remains a challenge11. Determining the properties of a material requires extensive characterization and testing, often demanding expertise, resources, and infrastructure that may not be available to the researchers that synthesized the material. This hinders the realization of the full potential of new materials. There are several examples of materials that were found to be most effective for applications other than initially intended applications. Al-PMOF, initially synthesized for its photocatalytic properties12, was only found years later to be highly effective at separating CO2 from wet flue gases13. Similarly, SBMOF-1, originally created for CO2 capture14, turned out to be exceptional at separating Xenon from Krypton. These remarkable rediscoveries were enabled by high-throughput computational screening and machine learning studies years after the initial study were published11,13,15, 16–17. However, such methods require precise crystal structure information, often in a computation-ready format, which is complex to obtain and generally not available immediately after a new MOF is synthesized5,18,19. Therefore, developing methods that use only the data available upon synthesis can greatly accelerate materials matching to potential applications.
In this work, we present a multimodal model that utilizes data readily available at the point of MOF synthesis, specifically the powder X-ray diffraction (PXRD) pattern, represented as a spectrum, and the chemical precursors (metal and linker), encoded as text strings. To enhance the model’s performance (particularly on small datasets), we leverage existing MOF structures from databases, represented as crystal graphs, to pretrain the model using a self-supervised learning framework. This pretraining enables the model to achieve high accuracy across a range of chemical, geometric, and quantum-chemical property predictions on low data regimes. We further evaluate the model’s robustness under conditions of experimental structures or PXRD deviations from perfect crystals. Using this approach, we create a map, linking MOF synthesis to potential applications, offering a framework to recommend the best applications for the newly synthesized materials with minimal computational cost and complexity. Lastly, through a reverse time-travel study, we demonstrate that our method can identify materials suited for applications that are entirely different from those originally intended by their creators.
Results
Multimodal model development
To develop a predictive model for MOF properties, it is essential to encode both the material’s chemistry and pore geometry. While this information can be extracted from the crystal structure of the MOF, crystal structure is typically not available at the synthesis stage. The synthesis of MOFs involves selecting precursors, namely the metal and organic linker, and finding the optimal synthesis conditions for the material to form. The primary characterization technique used to confirm successful MOF synthesis is a powder X-ray diffraction (PXRD) pattern. The PXRD pattern provides an abstract representation of the arrangement of atoms in the crystal, containing information about the material’s global geometric structure. Previous studies have demonstrated the effectiveness of PXRD in capturing geometric information for machine learning predictions related to crystal system classification, lattice parameters, and atomic positions20, 21–22. Hence, in this work, we introduce a multimodal machine learning model that utilizes only the precursors and PXRD data as inputs–information readily available immediately following MOF synthesis (Fig. 1). The precursors provide valuable insights into the material’s chemistry, which is supplemented by the PXRD, providing information about the global structure.
[See PDF for image]
Fig. 1
Workflow of the self-supervised multimodal model.
The model takes a precursor string and powder x-ray diffraction (PXRD) spectrum as inputs, embedding them via a transformer and convolutional neural network (CNN), respectively. These embeddings are concatenated and passed through a regression head for fine-tuning. Prior to this, the model is pretrained using unlabeled crystal structures via a crystal graph convolutional neural network (CGCNN)25. The CGCNN and our model generate embeddings (ZA, ZB), which are used to construct a cross-correlation matrix. The Barlow-Twin loss (LossBT) minimizes the differences between the cross-correlation matrix and the identity matrix23,52,53. to align representations. The pretrained model is then fine-tuned on labeled data for property prediction and metal-organic framework (MOF) application recommendation. The crystal structure is visualized using iRASPA55.
The chemical precursors of the MOF can be represented as a string. In this study, we utilize the Simplified Molecular Input Line Entry System (SMILES) to represent the organic linker, appending it with the type of metal used. We employ a transformer to embed this string, as previous research has shown the effectiveness of transformer architectures in property prediction, specifically using a string representation of the MOF23,24. Additionally, we utilize a convolutional neural network (CNN) to embed the PXRD pattern, which can be represented as a one-dimensional spectrum (see details in the Methods section).
While the precursors provide information on the chemistry of the metal and organic linker, they do not capture the metal-organic coordination environment, which can be necessary for predicting quantum chemical and chemistry-reliant properties. To inform the model about these coordination environments, we implement a self-supervised pretraining approach that leverages existing MOF crystal structures without the need for any labeled data. Our self-supervised pretraining combines a crystal graph convolutional neural network (CGCNN)25 representation with our model. CGCNNs provide an expressive embedding of local chemical environments and have been successful in predicting quantum-mechanical properties of MOFs18,19. By pretraining our model with CGCNN, the embedding space learns meaningful representations related to properties such as electronegativity and covalent radius in local environments.
Finally, we utilize the pretrained model, which takes only PXRD and precursors, and fine-tune it with labeled data to predict desired properties. With the materials properties predicted, we use selection criteria for recommending them for various applications. Details of the method development can be found in SI and the method section.
Model performance evaluation
We first demonstrate that our multimodal model effectively predicts a diverse range of MOF properties, laying the foundation of connecting MOF synthesis to applications. For our evaluation, we categorize the properties into three groups: geometry-reliant, chemistry-reliant, and quantum-chemical properties. Geometry-reliant properties include pore geometric features such as accessible surface area (ASA) and properties including gas uptake at high pressures (HP) and methane deliverable capacity (CH4 DC). Chemistry-reliant properties encompass gas uptake of CO2 at low pressure (LP), while quantum-chemical properties focus on band gap. Data for these properties were extracted from prior studies and multiple databases, including QMOF, CoRE-2019, and the hMOF database (see SI for database details)5,18,19,26, 27, 28–29.
Our multimodal model exhibits strong performance across the three property categories. Figure 2 quantifies the model’s effectiveness in ranking materials using the Spearman rank correlation coefficient (SRCC) and accurate prediction of their properties using mean absolute error (MAE). Notably, its accuracy is comparable to machine learning models that utilize crystal structure data, as reported in prior studies23,25,26,30,31. Specifically, we benchmark our model against three state-of-the-art machine learning models: 1) Descriptor-based ML, which uses geometric and chemical descriptors (e.g., surface area, pore volume, and revised autocorrelation (RACs)26; see SI for details), 2) Crystal Graph Convolutional Neural Network (CGCNN)25, which represents materials using a crystal graph, and 3) MOFormer23, a model that takes a text-based representation of MOFs (MOFid24) as input. While MOFormer uses a string representation, this encoding requires crystal structure information (e.g., MOF topology and metal cluster details), making it dependent on structural processing. Remarkably, our model performs comparably to crystal structure-based models across all property types. Specifically for geometric properties, such as accessible surface area (ASA), our model outperforms both CGCNN and MOFormer. The descriptor-based ML model achieves perfect scores for surface area and density, as these are directly provided as inputs, and therefore we left them out of Fig. 2. For geometry-reliant properties, such as high-pressure adsorption of CH4 and Xe, our model achieves accuracy comparable to descriptor-based approaches. Finally, for chemistry-reliant and quantum-chemical properties, our model performs on par with crystal structure-based models (Fig. 2).
[See PDF for image]
Fig. 2
Model performance across various property prediction tasks.
a Regression results for our model (powder x-ray diffraction (PXRD) & precursors) in comparison to a descriptor-based machine learning (ML) model26, a transformer-based model accepting MOFids (MOFormer)23,24 and a crystal graph convolutional neural network (CGCNN, accepting 3D structures of crystals)25 across various geometric (CH4 deliverable capacity - DC, density, Xe uptake at HP - high pressure, CH4 uptake at HP, accessible surface area - ASA), chemistry-reliant (CO2 uptake at low pressure - LP) and quantum chemical (band gap) properties. b Ablation study showcasing the impact of multimodality across various properties. The results are reported in the mean Spearman’s Rank Correlation Coefficient (SRCC), with the error bars showing standard deviation across three random splits. The results shown here are on structures from CoRE-20195,26 (CO2 uptake at LP, CH4 uptake at HP, ASA, density, CH4 DC), QMOF18,19 (band gap) and hMOF26,29 (Xe uptake at HP) datasets. Source data are provided as a Source Data file.
In scenarios where we deal with small labeled datasets, a self-supervised pretraining workflow can improve the model performance, where the pretraining leverages large amounts of unlabeled data to learn generalizable representations that can be fine-tuned on the limited labeled samples. In our study, we pretrain our multimodal model taking PXRDs and precursors against crystal structures available from large MOF databases. We show in the SI that this approach improves the performance of the model on small datasets, such as ARABG, and make the method applicable for the cases when large amount of data is not available. However, it should be noted that the model performance has very negligible improvement when the pretrained model is fine-tuned on large MOF databases such as CoRE-MOF, BW20K, QMOF and hMOF.
It is insightful to compare the performance of our multimodal model with a model that relies solely on the chemical precursor and another model that relies solely on the PXRD. The results of this ablation study are shown on the right panel of Fig. 2. The results show that a model only accepting chemical precursors is inadequate for capturing the global structure of the MOF, scoring particularly low in geometric-reliant and pure geometric properties. On the other hand, a model only accepting PXRDs capture the global structure of the MOF well, but fails to capture the local environment, scoring low in chemistry-reliant and quantum chemical properties such as CO2 uptake at LP and band gap. This confirms that the PXRD is critical and provides sufficient information about the geometry of a MOF, whereas the precursor aids in capturing information about the local environment. Achieving high accuracy in all three categories using only the information available at the time of MOF synthesis underscores the importance of multimodality in property prediction tasks. No single modality can fully encapsulate the complexities of chemistry and geometry in MOFs; thus, integrating both PXRD and precursor strings facilitates accurate prediction of material properties.
To further assess the model’s generalizability, we assess its performance across various crystal systems. Analyzing the diversity within our dataset shows that model performance is not uniform across all crystal systems and depends on the database used for training and the specific crystal system of the MOF. Similar observations regarding the impact of crystal system diversity on model generalizability have been reported previously26. We found that training on more complex crystal systems, such as triclinic, leads to better generalization to less symmetric cases. Conversely, training on highly symmetric systems, such as cubic, results in poorer generalization. This suggests that crystal system diversity should be a consideration in database development and model evaluation, as well as the training of machine learning models. Further details regarding this analysis can be found in the Supplementary Information.
Given the high accuracy of our model in property prediction, we can extend its capabilities to build a recommendation system that connects MOF synthesis to applications. We note that while descriptor-based models offer slightly better performance in property prediction, they require preprocessed crystal structures in a computation-ready format (e.g., solvent removal and handling of partial occupancy), which can be challenging. In contrast, our approach leverages PXRD and precursor information, which are readily available upon synthesis, allowing us to provide accurate recommendations for both chemistry-reliant and geometry-reliant properties and applications without the need for extensive structural preprocessing.
Robustness assessment
A strong indicator of the model’s practicality is its evaluation against noisy or experimental data. The high performance of our multimodal model demonstrates that it is, in principle, feasible to use PXRD and precursors to accurately predict various properties of MOFs. Our model was trained using precursors and PXRD data derived from perfect, computation-ready crystal structures. While the precursors are devoid of inaccuracies due to the known composition prior to synthesis, experimentally measured PXRD patterns can be noisy and may differ from those simulated from ideal crystals. In the absence of large experimental PXRD databases, we evaluate the robustness of our method in two scenarios: first, using PXRDs computed from experimental crystal structures extracted from the CSD, and second, assessing the model’s performance on some experimentally measured PXRDs. Given that PXRD primarily influences geometric-reliant properties, we expect minimal impact on predictions of chemistry-reliant properties. Therefore, we focus our evaluation on a geometry-reliant property: predicting methane uptake at high pressure for methane storage applications26.
Experimental crystal structures for MOFs often lack hydrogen atoms and may contain solvent molecules, partially occupied sites, or disordered atoms. To prepare these structures for computational chemistry calculations and property evaluations, certain adaptations are necessary. For example, to create a computation-ready format for gas adsorption calculations in the CoRE-2019 dataset, all solvents were removed. In the QMOF database for quantum calculations, hydrogen atoms were added, and unbounded solvents were eliminated. Consequently, the PXRD patterns computed from these modified structures differ from those available in the CSD. Figure 3 presents a parity plot illustrating differences in model predictions when using PXRDs computed from the CoRE-2019 and CSD, highlighting specific MOFs with three scenarios: missing hydrogen atoms and the presence of bounded/unbounded solvents. We observe that missing hydrogen atoms do not influence the PXRDs, and the model is very robust to these changes. This is due to the scattering power of a PXRD being proportional to the number of electrons in an atom, and hydrogen (Z = 1) atoms play a minimal role in scattering and the structural factor (see Methodology for more details). Furthermore, while the model generally shows robustness regarding bounded solvents, discrepancies can arise between predictions from computation-ready and CSD data. For instance, the MOF LELROL significantly deviates from parity32. However, we observe an overall robust predictions as the rankings of the MOFs in terms of CH4 uptake is well captured (SRCC of 0.73) and a relative error below 13%. This demonstrates the model’s robust performance in the presence of variations in crystal structures.
[See PDF for image]
Fig. 3
Assessment of robustness of model on the crystal structure imperfections.
The parity plot in (a) shows a comparison between the model output when fed a powder x-ray diffraction (PXRD) pattern computed from computation-ready crystal structure (CoRE-2019)5 and the experimental crystal structure from the Cambridge Structural Database (CSD)7. The parity plot showcases a set of selected examples of metal-organic frameworks (MOFs) that either have hydrogen atoms (blue), MOFs with bounded solvents present (green) and MOFs with unbounded solvents (yellow), with the rest of the MOFs grayed out as they are not highlighted examples. On the right panels, we showcase examples of MOFs within these three classes, with the left column being CoRE-2019 and right column being CSD structure. b Showcases a MOF (CSD reference code: WAHMEY)56 (metal types: Fe, Au, linker: C6H4N2) with missing hydrogen atoms, c showcases a MOF (CSD reference code: NALYEG)57 (metal type: U, linker: C9H6O6) with unbounded solvent in the form of C3H7NO, d showcases a MOF (CSD reference code: VEFLUP)58 (metal type: Co, linker: C12H8O4) with bounded solvent in the form of C3H7NO. Crystal structures visualized using VESTA59 and the atom color coding is: N (blue), O (red), H (white), C (brown), Fe (amber), Au (light amber), U (gray), Co (dark blue). Source data are provided as a Source Data file.
Up until now, the model’s performance has been evaluated using simulated PXRDs. In practice, when assessing the crystallinity of MOF powders, experimental PXRDs may contain noise from instrumentation, temperature variations, and low-quality samples with poor crystallinity. To evaluate our model’s robustness against these imperfections, we compare the recommendations for methane storage when using simulated versus experimental PXRDs. This evaluation was conducted on five materials, for which both experimental and simulated PXRDs are shown in Fig. 4. In general, we observe consistent recommendations between the simulated and experimental PXRDs, except for one case. In this inconsistent case, the difference between the simulated and experimental PXRDs is pronounced; the experimental PXRD exhibits noticeable noise and multiple significant peaks compared to the simulated PXRD. Additional comparisons can be found in the Supplementary Information, alongside model assessment on different noise levels. Overall, our evaluation indicates that the model remains robust to a great extent in the presence of noise and imperfections in PXRDs. This enables experimentalists to use their measured PXRDs to predict their properties.
[See PDF for image]
Fig. 4
Comparison of recommendations made by model with simulated and experimental powder x-ray diffraction (PXRD) patterns.
This assessment was done on methane storage recommendations due to the dependence on pore volume and density. The blue and red PXRDs correspond to simulated and experimental PXRDs respectively. The table showcases results (with metal (M) and linker (L)) for: CAU-28 (M: Zr, L: C6H4O5), Yb-UiO-66 (M: Yb, L: C8H6O4), Zn-(Ade)(TBAPy) (M: Zn, L: C5H5N5, C44H26O8), Zn2-(TBAPy) (M: Zn, L: C44H26O8) with coordinated water molecules and with a distorted structural framework60, 61–62. Crystal structures visualized using VESTA59 and the atom color coding is: N (blue), O (red), H (white), C (brown), Zr (green), Yb (light blue), Zn (gray).
An application recommendation map
The high performance of our multimodal model in predicting various properties of MOFs enables a direct link between their synthesis and potential applications. We built a recommendation system that classifies MOFs as promising candidates for specific applications based on predicted material properties. The applications explored in this study include gas storage (for gases such as methane, xenon, and hydrogen) and carbon capture, including direct air capture (DAC). The metrics for these applications are derived from industrial cases reported in the literature (see SI for details)33, 34, 35, 36–37. Our results show accurate distinction of promising MOFs for these applications (see SI for details). To minimize false negatives—instances where promising candidates are overlooked, we configure our model to be more optimistic in its recommendations.
For large-scale application predictions, such as assessing all materials in the CoRE-2019, hMOF and QMOF databases, it is insightful to see how the model links synthesis information to applications. Figure 5 shows the projection of the model’s embedding space for materials in CoRE-2019, with promising MOFs color-coded for various applications. The visual trends highlight the model’s ability to identify application-relevant properties from the latent space, effectively clustering MOFs based on their suitability for specific applications. Notably, MOFs flagged as promising for gas storage are distinctly clustered on the left side of the map. Further analysis of the map, color-coded by material density and pore diameter, indicates that these materials are indeed more porous and have lower densities–attributes favorable for gas storage applications, as supported by previous studies26,38. Additionally, the map provides insights into the desired metal electronegativity when designing materials for carbon capture or DAC. These insights confirm the relation between the embedding space of the multimodal model to chemical and structural attributes of MOFs. The model’s ability to infer these structural attributes solely from PXRD and precursor data is a significant strength. In the previous section, we evaluated the model’s performance against experimental imperfections and found that model confidence correlates with agreement to experimentally determined structures. This capability enables effective materials-application matching for newly discovered MOFs.
[See PDF for image]
Fig. 5
Mapping synthesis to applications.
The projection of the latent space of the multimodal model is obtained using t-SNE (t-distributed Stochastic Neighbor Embedding). The different colors indicate the model’s recommended applications for various metal-organic frameworks (MOFs) in the combined CoRE-2019, QMOF and hMOF databases5,18,19,26,29. The panels on the right display the same embedding space color coded with specific physiochemical properties: crystal density (top right), pore diameter (middle right) and Pauling electronegativity (bottom right). Source data are provided as a Source Data file.
The map shown in Fig. 5 provides insights into the designability of MOFs for various applications. Overlaps between regions for specific applications suggest the potential for versatile MOFs that can serve multiple purposes, while shedding light on the properties most relevant for designing these MOFs. For instance, many MOFs flagged as promising for methane storage are also recommended for xenon and hydrogen storage, indicating relationships between materials and applications. Moreover, the map illustrates that for a particular application, there can be different families of MOFs that perform optimally. For instance, there are a few different clusters showing promising performance for carbon capture, balancing pore structure with specific chemical interactions.
Discussion
Our study offers a pathway for matching newly discovered materials to their potential applications. This was achieved by developing a multimodal model that requires only data available upon MOF synthesis — namely PXRD and chemical precursors, enabling accurate predictions of a wide range of geometry-reliant, chemistry-reliant, and quantum-chemical properties for MOFs. This remarkable accuracy on such a wide range of properties underscores the importance of leveraging multimodality in materials property prediction, as different modalities enhance the model’s ability to predict a broader range of properties, which a single modality cannot achieve.
A practical motivation for our method was to identify applications for newly discovered MOFs, which may differ significantly from the intended application at the time of design. To demonstrate this capability, we conducted a time-travel experiment, training our model on CoRE-2019 entries deposited in the CSD database before 2017, with a test set comprising entries deposited after 2017. This approach assesses whether our model can recommend applications for MOFs synthesized in the future. Our recommendation system identified 18 promising candidates for carbon capture, with an impressive accuracy of 16 out of 18 based on molecular simulation data. Remarkably, 15 of these materials were originally designed for applications other than CO2 capture. Figure 6 presents some of these MOFs along with their intended synthesis applications based on the corresponding articles. Please refer to the Supplementary Information for the results on the rest of the MOFs.
[See PDF for image]
Fig. 6
A “time-travel” experiment connecting MOFs and their potential applications.
Metal-organic frameworks (MOFs) identified to be promising for CO2 capture (from their predicted computational CO2 uptake at 0.15 bar) and the intended applications reported in their original articles. The MOFs, reported from top-to-bottom in their Cambridge Structural Database (CSD) reference codes, are EXUHUC (metal type: Mn, linker: C11H8O4N2)63, GARLUJ(metal type: Eu, linker: C15H13NO4)64, LAQZOV (metal type: Ce, linker: C21H14O6)65, NAMDUD (metal type: Mn, linker: C27H18O6)66 and NAWKII (metal type: Mg, linker: C12H8O4)67.
Given that our model can recommend applications based solely on a PXRD pattern and precursors, experimentalists can utilize our web app to receive recommendations immediately after conducting crystallinity assessments to obtain PXRD data39. By integrating advanced key performance indicators from chemical process modeling and life-cycle assessments40 as well as robotic platforms41, this approach will accelerate materials discovery for sustainable applications.
Methods
Generating input to the model
From leveraging the crystal structures from each database (CoRE-2019, BW20K, ARABG, QMOF, hMOF)5,18,19,28,29, pymatgen’s XRD module42 was used to calculate the simulated PXRD pattern for every MOF specified, ranging from 0 to 90 degrees. It should be noted that for pymatgen, the Debye-Waller factor (for temperature) and the multiplicity factor are not accounted for. Furthermore, CuKa radiation was used, with no refinement on the structures done42,43. Please refer to the SI for more details regarding the generation of simulated PXRD patterns.
When a PXRD is collected, there are usually a varying number of data points. When machine learning is done, the inputs need to all be of the same length. In order to construct a 1D array of intensities, interpolation can be done to fill in the missing data points. It should be noted that experimental PXRDs characteristically have broad, smooth peaks due to various reasons such as instrumentation. In an attempt to model this behavior, a Gaussian transformation was done on the 1D array. The total desired size of the 1D PXRD vector is 9000, with the desired peak width being controlled by σ = 0.1. For details regarding the configurations, parameters and algorithm of the preprocessing and transformation of the PXRD pattern, please refer to the SI.
The metal nodes and the SMILES of the organic linker were previously given for MOFs across various databases5,18,19,28,29. The chemical precursors were constructed in the format of: [Metal Type].[Organic Linker]. This was inputted and tokenized by the transformer channel of our model.
Construction of model
Taking inspiration from MOFormer, an encoder and tokenizer was built on top of the transformer23. The precursor is inputted into a SMILES tokenizer (as the only portion inputted from the precursor are the inorganic and organic building blocks), with a [CLS] and [SEP] tag added at the beginning and the end of the precursor string respectively. As the precursor strings are of varying lengths, there is padding added to each string with [PAD] such that they are at a fixed length of 512 characters. Positional encoding is done on the tokens to give information on the absolute and relative positions of each token, where the dimension of embedding is 512. The transformer is built on a self-attention mechanism44. The encoder layers consist of multihead attention layers, where each encoder has the attention layer, followed by the multilayer perceptron (MLP) layer. In each attention layer, the input sequence is broken down into three vectors: the query (Q), key (K) and value (V), with the corresponding learnable weights being WQ, WK, WV respectively. The attention output is defined as:
1
Based off this, the model is able to focus on different parts of the inputted precursor. The total number of tokens in the vocabulary is 4021, with the number of parallel heads being 8 and the number of encoder layers being 6. A dropout rate of 0.1 was used. An embedding is returned of shape (N, 512, 512), where N is the batch size (32 is chosen for fine-tuning). Chitturi et al. published work in which they built a 1D-CNN that accepts PXRD patterns as the input for the prediction of the classification of crystal symmetries45. Using this as inspiration, a 1D-CNN was constructed with ten convolutional blocks, with max-pooling layers and ReLU activation functions following each convolutional block. After the 10th convolutional block, the embedding of the CNN is returned of shape (N, 660). Taking the two embeddings from the transformer and CNN, tensor concatenation was performed to return a concatenated embedding size of (N, 1172). This concatenated tensor is fed into a projector, in which linear and softplus activation functions are used to project a tensor of shape (N, 512). This embedding is used in the self-supervised pretraining against the crystal graph convolutional neural network (CGCNN). When performing regression, the projected embedding is fed into a regression head consisting of linear and ReLU Dense layers to return M labels, with M being any number of desired properties the user wants to return. For the property prediction results, M was set to 1 (one label was predicted at a time). PyTorch 2.3 has been used in the construction of the model46,47.
For model evaluation, data visualization and analysis, we used Pandas48, Matplotlib49, NumPy50 and Scikit-learn51.
Self-supervised learning
As DFT and experimental labels for MOFs are difficult to obtain, it is difficult to construct large databases for machine learning applications. As a result, there is an incentive to construct models which perform well in lower data regimes. Furthermore, while the PXRD gives adequate information on the global environment of the MOF, capturing the metal/organic chemistry of MOFs by using a PXRD/precursors is difficult due to the model not having an intuitive understanding of the chemistry behind the tokens of a text representation of a MOF. Self-supervised pretraining aims to solve this problem, as a pretrained model’s weights are initialized such that it converges to a solution quickly, rather than using randomized weights in which large amounts of data is needed. To avoid the limitation of the model not understanding the local environment of a MOF from the inputs, a self-supervised learning (SSL) pipeline has been constructed where representation learning is done between a crystal graph convolutional neural network (CGCNN) and our model23,52. The CGCNN accepts a 3D structure of a MOF and returns an embedding containing information on the structure itself, chemical composition and local environment. While the geometric properties, which are reliant on structural information, are already easy to predict due to the PXRD input, chemistry-reliant properties are harder due to the inputs not giving the model an explicit understanding of the metal/organic chemistry of the MOF. This gives us an incentive to perform self-supervised learning against the CGCNN embeddings. The embeddings are taken out of the projector for both the CGCNN and our model, each with an embedding size of 512. From these two embeddings, it is possible to construct a cross-correlation matrix of shape (512, 512). This can be defined as:
2
Where b is the batch sample, i, j are the indices of the vector dimension, zA, zB are the embeddings for our model and CGCNN respectively. For representation learning to happen, the cross-correlation matrix should approach an identity matrix. As a result, the loss function (Barlow-Twin loss) used to quantify this can be defined as:3
Where λ is the regularization term which describes the trade-off between the first and second parts of the loss function53. A batch size of 64 was used, with the regularization term being 0.0051. Furthermore, the learning rates of CGCNN and our model is 0.00005 for 100 epoch.Supplementary information availability
Information about the MOF databases used, model architecture/hyperparameters, relevant property distributions, CoRE-2019 filtration process, statistics of model performance, recommendation system details (thresholds, masked learning), the “time-travel” model discussed in the MOF Application Discovery section, evaluation on CoRE-2019’s diversity, model robustness on experimental PXRDs and its robustness on covalent-organic frameworks (COFs) can be found in the supplementary information section. Data efficiency information can also be found in the supplementary information.
Reporting summary
Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.
Acknowledgements
The authors gratefully acknowledge financial support from Natural Sciences and Engineering Research Council of Canada, the University of Toronto’s Acceleration Consortium through the Canada First Research Excellence Fund under Grant number CFREF-2022-00042, and National Research Council of Canada under the Materials for Clean Fuels Challenge Program grant number MCF-146. We would like to thank Hudson Bicalho and Dr. Micaela Richezzi from Dr. Ashlee Howarth’s group (Concordia University) for providing us experimental PXRDs for materials in Fig. 4.
Author contributions
Conceptualization: S.T.K. & S.M.M.; Data curation: S.T.K.; Formal analysis: S.T.K.; Funding acquisition: S.M.M.; Investigation: S.T.K. & S.M.M.; Methodology: S.T.K. & S.M.M.; Project administration: S.M.M.; Software: S.T.K.; Supervision: S.M.M.; Validation: S.T.K.; Visualization: S.T.K.; Writing - review & editing: S.T.K. & S.M.M.
Peer review
Peer review information
Nature Communications thanks Jennifer Chayes, Theo Pierre Jaffrelot Inizan and the other anonymous reviewer(s) for their contribution to the peer review of this work. A peer review file is available.
Data availability
The powder x-ray diffraction, precursor, and the labels used from the databases showcased in this study have been deposited in the Zenodo repository with this link: https://zenodo.org/records/14908210. Source data for all figures and tables are provided as a Source Data file.
Code availability
All code related to this study was developed in Python and is publicly available under the MIT License, an open-source license approved by the Open Source Initiative. The full source code (including the featurization, prediction and trained models) are available from https://github.com/AI4ChemS/XRayPro54.
Competing interests
The authors declare no competing interests.
Supplementary information
The online version contains supplementary material available at https://doi.org/10.1038/s41467-025-60796-0.
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
References
1. Zhou, H.-C., Long, J. R. & Yaghi, O. M. Introduction to metal–organic frameworks. Chemical reviews112, 673–674 (2012).
2. Pascanu, V; González Miera, G; Inge, AK; Martín-Matute, B. Metal–organic frameworks as catalysts for organic synthesis: a critical perspective. J. Am. Chem. Soc.; 2019; 141, pp. 7223-7234.1:CAS:528:DC%2BC1MXntleht7g%3D [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/30974060][DOI: https://dx.doi.org/10.1021/jacs.9b00733]
3. Zhao, X; Wang, Y; Li, D-S; Bu, X; Feng, P. Metal–organic frameworks for separation. Adv. Mater.; 2018; 30, 1705189. [DOI: https://dx.doi.org/10.1002/adma.201705189]
4. Zhang, L-T; Zhou, Y; Han, S-T. The role of metal–organic frameworks in electronic sensors. Angew. Chem.; 2021; 133, pp. 15320-15340. [DOI: https://dx.doi.org/10.1002/ange.202006402]
5. Chung, YG et al. Advances, updates, and analytics for the computation-ready, experimental metal–organic framework database: CoRE MOF 2019. J. Chem. Eng. Data; 2019; 64, pp. 5985-5998.1:CAS:528:DC%2BC1MXitVOmtrrL [DOI: https://dx.doi.org/10.1021/acs.jced.9b00835]
6. Chung, YG et al. Computation-ready, experimental metal–organic frameworks: a tool to enable high-throughput screening of nanoporous crystals. Chem. Mater.; 2014; 26, pp. 6185-6192.1:CAS:528:DC%2BC2cXhs1Oiur7E [DOI: https://dx.doi.org/10.1021/cm502594j]
7. Moghadam, PZ et al. Development of a Cambridge Structural Database subset: a collection of metal–organic frameworks for past, present, and future. Chem. Mater.; 2017; 29, pp. 2618-2625.1:CAS:528:DC%2BC2sXkt12rurc%3D [DOI: https://dx.doi.org/10.1021/acs.chemmater.7b00441]
8. Moosavi, SM et al. Capturing chemical intuition in synthesis of metal-organic frameworks. Nat. Commun.; 2019; 10, 40161781:CAS:528:DC%2BC1MXntlKltbc%3D [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/30710082][PubMedCentral: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6358622][DOI: https://dx.doi.org/10.1038/s41467-019-08483-9] 539.
9. Tom, G et al. Self-driving laboratories for chemistry and materials science. Chem. Rev.; 2024; 124, pp. 9633-9732.1:CAS:528:DC%2BB2cXhslelsLjO [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/39137296][PubMedCentral: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11363023][DOI: https://dx.doi.org/10.1021/acs.chemrev.4c00055]
10. Back, S et al. Accelerated chemical science with AI. Digital Discov.; 2024; 3, pp. 23-33.1:CAS:528:DC%2BB3sXis1ensL7I [DOI: https://dx.doi.org/10.1039/D3DD00213F]
11. Ongari, D; Talirz, L; Smit, B. Too many materials and too many applications: an experimental problem waiting for a computational solution. ACS Cent. Sci.; 2020; 6, pp. 1890-1900.1:CAS:528:DC%2BB3cXhvFOgtL3N [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/33274268][PubMedCentral: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7706098][DOI: https://dx.doi.org/10.1021/acscentsci.0c00988]
12. Fateeva, A et al. A water-stable porphyrin-based metal–organic framework active for visible-light photocatalysis. Angew. Chem. Ger. Ed.; 2012; 124, 7558. [DOI: https://dx.doi.org/10.1002/ange.201202471]
13. Boyd, PG et al. Data-driven design of metal–organic frameworks for wet flue gas CO2 capture. Nature; 2019; 576, pp. 253-256.1:CAS:528:DC%2BC1MXitlOmu7zN [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/31827290][DOI: https://dx.doi.org/10.1038/s41586-019-1798-7]
14. Qian, J-J et al. Switching Xe/Kr adsorption selectivity in modified SBMOF-1: a theoretical study. RSC Adv.; 2020; 10, pp. 17195-17204.1:CAS:528:DC%2BB3cXovVGisbg%3D [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/35521447][PubMedCentral: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9053438][DOI: https://dx.doi.org/10.1039/D0RA02212H]
15. Moosavi, SM; Jablonka, KM; Smit, B. The role of machine learning in the understanding and design of materials. J. Am. Chem. Soc.; 2020; 142, pp. 20273-20287.1:CAS:528:DC%2BB3cXit1yntr3O [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/33170678][PubMedCentral: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7716341][DOI: https://dx.doi.org/10.1021/jacs.0c09105]
16. Banerjee, D et al. Metal–organic framework with optimally selective xenon adsorption and separation. Nat. Commun.; 2016; 7, pp. 1-7. [DOI: https://dx.doi.org/10.1038/ncomms11831]
17. Moghadam, PZ et al. Computer-aided discovery of a metal–organic framework with superior oxygen uptake. Nat. Commun.; 2018; 9, [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/29643387][PubMedCentral: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5895810][DOI: https://dx.doi.org/10.1038/s41467-018-03892-8] 1378.
18. Rosen, AS et al. Machine learning the quantum-chemical properties of metal–organic frameworks for accelerated materials discovery. Matter; 2021; 4, pp. 1578-1597.1:CAS:528:DC%2BB3MXhsl2qurrI [DOI: https://dx.doi.org/10.1016/j.matt.2021.02.015]
19. Rosen, AS et al. High-throughput predictions of metal–organic framework electronic properties: theoretical challenges, graph neural networks, and data exploration. npj Computational Mater.; 2022; 8, pp. 1-10. [DOI: https://dx.doi.org/10.1038/s41524-022-00796-6]
20. Oviedo, F et al. Fast and interpretable classification of small X-ray diffraction datasets using data augmentation and deep neural networks. npj Computational Mater.; 2019; 5, [DOI: https://dx.doi.org/10.1038/s41524-019-0196-x] 60.
21. Utimula, K et al. Machine-learning clustering technique applied to powder X-ray diffraction patterns to distinguish compositions of ThMn12-type alloys. Adv. Theory Simul.; 2020; 3, 2000039.1:CAS:528:DC%2BB3cXhtl2jtLvE [DOI: https://dx.doi.org/10.1002/adts.202000039]
22. Suzuki, Y et al. Symmetry prediction and knowledge discovery from X-ray diffraction patterns using an interpretable machine learning approach. Sci. Rep.; 2020; 10, 1:CAS:528:DC%2BB3cXis1WltLjP [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/33311555][PubMedCentral: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7732852][DOI: https://dx.doi.org/10.1038/s41598-020-77474-4] 21790.
23. Cao, Z; Magar, R; Wang, Y; Barati Farimani, A. Moformer: self-supervised transformer model for metal–organic framework property prediction. J. Am. Chem. Soc.; 2023; 145, pp. 2958-2967.1:CAS:528:DC%2BB3sXhvFWltbc%3D [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/36706365][PubMedCentral: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10041520][DOI: https://dx.doi.org/10.1021/jacs.2c11420]
24. Bucior, BJ et al. Identification schemes for metal–organic frameworks to enable rapid search and cheminformatics analysis. Cryst. Growth Des.; 2019; 19, pp. 6682-6697.1:CAS:528:DC%2BC1MXhslOqurrP [DOI: https://dx.doi.org/10.1021/acs.cgd.9b01050]
25. Xie, T; Grossman, JC. Crystal graph convolutional neural networks for an accurate and interpretable prediction of material properties. Phys. Rev. Lett.; 2018; 120, 145301.1:CAS:528:DC%2BC1MXltFSnu7c%3D [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/29694125][DOI: https://dx.doi.org/10.1103/PhysRevLett.120.145301]
26. Moosavi, SM et al. Understanding the diversity of the metal-organic framework ecosystem. Nat. Commun.; 2020; 11, 1:CAS:528:DC%2BB3cXhs1WrsbfE [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/32792486][PubMedCentral: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7426948][DOI: https://dx.doi.org/10.1038/s41467-020-17755-8] 4068.
27. Bobbitt, NS et al. MOFX-DB: an online database of computational adsorption data for nanoporous materials. J. Chem. Eng. Data; 2023; 68, pp. 483-498.1:CAS:528:DC%2BB3sXisVCltA%3D%3D [DOI: https://dx.doi.org/10.1021/acs.jced.2c00583]
28. Boyd, PG; Woo, TK. A generalized method for constructing hypothetical nanoporous materials of any net topology from graph theory. CrystEngComm; 2016; 18, pp. 3777-3792.1:CAS:528:DC%2BC28XksVylsrY%3D [DOI: https://dx.doi.org/10.1039/C6CE00407E]
29. Wilmer, CE et al. Large-scale screening of hypothetical metal–organic frameworks. Nat. Chem.; 2012; 4, pp. 83-89.1:CAS:528:DC%2BC3MXhsVagtL3K [DOI: https://dx.doi.org/10.1038/nchem.1192]
30. Kang, Y; Park, H; Smit, B; Kim, J. A multi-modal pre-training transformer for universal transfer learning in metal–organic frameworks. Nat. Mach. Intell.; 2023; 5, pp. 309-318. [DOI: https://dx.doi.org/10.1038/s42256-023-00628-2]
31. Burner, J et al. High-performing deep learning regression models for predicting low-pressure CO2 adsorption properties of metal–organic frameworks. J. Phys. Chem. C.; 2020; 124, pp. 27996-28005.1:CAS:528:DC%2BB3cXisFWqsb3M [DOI: https://dx.doi.org/10.1021/acs.jpcc.0c06334]
32. Boldog, I et al. A rare alb-4, 8-Cmce metal–coordination network based on tetrazolate and phosphonate functionalized 1, 3, 5, 7-tetraphenyladamantane. CrystEngComm; 2013; 15, pp. 1235-1243.1:CAS:528:DC%2BC3sXntlShtA%3D%3D [DOI: https://dx.doi.org/10.1039/c2ce26819a]
33. Mahajan, S; Lahtinen, M. Recent progress in metal-organic frameworks (MOFs) for CO2 capture at different pressures. J. Environ. Chem. Eng.; 2022; 10, 108930.1:CAS:528:DC%2BB38XivFSjtbfM [DOI: https://dx.doi.org/10.1016/j.jece.2022.108930]
34. Ursueguía, D; Díaz, E; Ordóñez, S. Metal-Organic Frameworks (MOFs) as methane adsorbents: from storage to diluted coal mining streams concentration. Sci. Total Environ.; 2021; 790, 148211. [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/34111784][DOI: https://dx.doi.org/10.1016/j.scitotenv.2021.148211]
35. Zhou, W. Methane storage in porous metal- organic frameworks: current records and future perspectives. Chem. Rec.; 2010; 10, pp. 200-204.1:CAS:528:DC%2BC3cXnsFyqs7c%3D [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/20509145][DOI: https://dx.doi.org/10.1002/tcr.201000004]
36. Ahmed, A et al. Exceptional hydrogen storage achieved by screening nearly half a million metal-organic frameworks. Nat. Commun.; 2019; 10, [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/30952862][PubMedCentral: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6450936][DOI: https://dx.doi.org/10.1038/s41467-019-09365-w] 1568.
37. Zhang, X., Jablonka, K. M. & Smit, B. Deep learning-based recommendation system for metal-organic frameworks (MOFs). Digital Discov.3, 1410–1420 (2024).
38. Li, B; Wen, H-M; Zhou, W; Chen, B. Porous metal–organic frameworks for gas storage and separation: what, how, and why?. J. Phys. Chem. Lett.; 2014; 5, pp. 3468-3479.1:CAS:528:DC%2BC2cXhs1WhsL7F [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/26278595][DOI: https://dx.doi.org/10.1021/jz501586e]
39. Khan, S. T. & Moosavi, S. M. XRayPro: Connecting metal-organic framework synthesis to applications with multimodal model machine learning. https://xraypro.streamlit.app/ (2024).
40. Charalambous, C. et al. A holistic platform for accelerating sorbent-based carbon capture. Nature632, 89–94 (2024)
41. Lunt, AM et al. Modular, multi-robot integration of laboratories: an autonomous workflow for solid-state chemistry. Chem. Sci.; 2024; 15, pp. 2456-2463.1:CAS:528:DC%2BB2cXhtlCks7g%3D [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/38362408][DOI: https://dx.doi.org/10.1039/D3SC06206F]
42. Ong, SP et al. Python materials genomics (pymatgen): a robust, open-source python library for materials analysis. Computational Mater. Sci.; 2013; 68, pp. 314-319.1:CAS:528:DC%2BC3sXhsVGjt7g%3D [DOI: https://dx.doi.org/10.1016/j.commatsci.2012.10.028]
43. Graef, MD; McHenry, ME. Structure of materials: an introduction to crystallography, diffraction, and symmetry; 2007; Cambridge University Press:
44. Vaswani, A. Attention is all you need. In Advances in neural information processing systems (NIPS, 2017).
45. Chitturi, SR et al. Automated prediction of lattice parameters from X-ray powder diffraction patterns. J. Appl. Crystallogr.; 2021; 54, pp. 1799-1810.1:CAS:528:DC%2BB3MXis1yhsrbM [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/34963768][PubMedCentral: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8662964][DOI: https://dx.doi.org/10.1107/S1600576721010840]
46. Paszke, A. et al. Automatic differentiation in PyTorch. NIPS 2017 Workshop on Autodiff (2017).
47. Paszke, A. et al. PyTorch: an imperative style, high-performance deep learning library. CoRR, https://arxiv.org/abs/1912.01703 (2019).
48. pandas development team, T. pandas-dev/pandas: Pandas, https://doi.org/10.5281/zenodo.3509134 (2020)
49. Hunter, JD. Matplotlib: a 2D graphics environment. Comput. Sci. Eng.; 2007; 9, pp. 90-95. [DOI: https://dx.doi.org/10.1109/MCSE.2007.55]
50. Harris, CR et al. Array programming with NumPy. Nature; 2020; 585, pp. 357-362.1:CAS:528:DC%2BB3cXitlWmsbbN [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/32939066][PubMedCentral: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7759461][DOI: https://dx.doi.org/10.1038/s41586-020-2649-2]
51. Pedregosa, F. et al. Scikit-learn: Machine Learning in Python. CoRR, https://arxiv.org/abs/1201.0490 (2012)
52. Magar, R; Wang, Y; Barati Farimani, A. Crystal twins: self-supervised learning for crystalline material property prediction. npj Computational Mater.; 2022; 8, [DOI: https://dx.doi.org/10.1038/s41524-022-00921-5] 231.
53. Zbontar, J., Jing, L., Misra, I., LeCun, Y. & Deny, S. Barlow twins: self-supervised learning via redundancy reduction. Int. Conf. Mach. Learn.139, 12310–12320 (2021).
54. Khan, S. T. & Moosavi, S. M. Connecting metal-organic framework synthesis to applications with multimodal machine learning. XRayPro, https://doi.org/10.5281/zenodo.15330841 (2025).
55. Dubbeldam, D; Calero, S; Vlugt, TJ. iRASPA: GPU-accelerated visualization software for materials scientists. Mol. Simul.; 2018; 44, pp. 653-676.1:CAS:528:DC%2BC1cXmsValtrg%3D [DOI: https://dx.doi.org/10.1080/08927022.2018.1426855]
56. Galet, A., Muñoz, M. C., Martínez, V. & Real, J. A. Supramolecular isomerism in spin crossover networks with aurophilic interactions. Chem. Commun.20, 2268–2269 (2004)
57. Volkringer, C et al. Metal–organic-framework-type 1D-channel open network of a tetravalent uranium trimesate. Inorg. Chem.; 2011; 50, pp. 11865-11867.1:CAS:528:DC%2BC3MXhtlyhtLvI [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/22034984][DOI: https://dx.doi.org/10.1021/ic2021963]
58. Xu, H; Xu, Z. A microporous coordination polymer of 2, 6-Naphthalenedicarboxylate and cobalt (II) showing reversible structural and functional transformation. Microporous Mesoporous Mater.; 2012; 157, pp. 33-36.1:CAS:528:DC%2BC38Xnt1yrt70%3D [DOI: https://dx.doi.org/10.1016/j.micromeso.2011.10.036]
59. Momma, K; Izumi, F. VESTA: a three-dimensional visualization system for electronic and structural analysis. Appl. Crystallogr.; 2008; 41, pp. 653-658.1:CAS:528:DC%2BD1cXlvFKgu78%3D [DOI: https://dx.doi.org/10.1107/S0021889808012016]
60. Dreischarf, AC; Lammert, M; Stock, N; Reinsch, H. Green synthesis of Zr-CAU-28: structure and properties of the first Zr-MOF based on 2, 5-furandicarboxylic acid. Inorg. Chem.; 2017; 56, pp. 2270-2277.1:CAS:528:DC%2BC2sXitVyqtrk%3D [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/28165722][DOI: https://dx.doi.org/10.1021/acs.inorgchem.6b02969]
61. Donnarumma, PR et al. Synthetic approaches for accessing rare-earth analogues of UiO-66. Chem. Commun.; 2021; 57, pp. 6121-6124.1:CAS:528:DC%2BB3MXhtFaqt73O [DOI: https://dx.doi.org/10.1039/D1CC01552D]
62. Pougin, MJ et al. Adsorption in pyrene-based metal–organic frameworks: the role of pore structure and topology. ACS Appl. Mater. Interfaces; 2024; 16, pp. 36586-36598.1:CAS:528:DC%2BB2cXhsVymtrjI [PubMed: https://www.ncbi.nlm.nih.gov/pubmed/38978297][PubMedCentral: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11261566][DOI: https://dx.doi.org/10.1021/acsami.4c05527]
63. Wang, C; Jiao, Y; Li, G. One three-dimensional manganese (II)-organic framework bearing hydroxylphenyl imidazole dicarboxylate ligand. Inorg. Nano Met. Chem.; 2017; 47, pp. 298-301.1:CAS:528:DC%2BC2sXmvFWjur4%3D [DOI: https://dx.doi.org/10.1080/15533174.2016.1186043]
64. Wang, X et al. Highly efficient white-light emission and UV–visible/NIR luminescence sensing of lanthanide metal–organic frameworks. Cryst. Growth Des.; 2017; 17, pp. 2178-2185.1:CAS:528:DC%2BC2sXjsFCkurw%3D [DOI: https://dx.doi.org/10.1021/acs.cgd.7b00112]
65. Ma, A et al. Two lanthanide-based metal–organic frameworks for highly efficient adsorption and removal of fluoride ions from water. CrystEngComm; 2017; 19, pp. 2172-2177.1:CAS:528:DC%2BC2sXks1Kntro%3D [DOI: https://dx.doi.org/10.1039/C7CE00291B]
66. Meng, Q-H; Liu, J-L; Long, X; Zhang, S; Quan, Y. A porous the-type metal-organic framework based on [Mn4Cl] 7+ clusters for selective gas sorption. Inorg. Chem. Commun.; 2017; 79, pp. 46-49.1:CAS:528:DC%2BC2sXkslyisbg%3D [DOI: https://dx.doi.org/10.1016/j.inoche.2017.03.021]
67. Gangu, KK; Maddila, S; Mukkamala, SB; Jonnalagadda, SB. Synthesis, structure, and properties of new Mg (II)-metal–organic framework and its prowess as catalyst in the production of 4 H-Pyrans. Ind. Eng. Chem. Res.; 2017; 56, pp. 2917-2924.1:CAS:528:DC%2BC2sXjs1Kns7Y%3D [DOI: https://dx.doi.org/10.1021/acs.iecr.6b04795]
Copyright Nature Publishing Group 2025