# megnet 1.3.2

MatErials Graph Networks for machine learning of molecules and crystals.

Author: Chi Chen | License: BSD | Released: 2022-11-16
Keywords: materials science, machine learning, deep graph networks, neural
[![License](https://img.shields.io/github/license/materialsvirtuallab/megnet)]()
[![Build Status](https://travis-ci.org/materialsvirtuallab/megnet.svg?branch=master)](https://travis-ci.org/materialsvirtuallab/megnet)
[![Coverage Status](https://coveralls.io/repos/github/materialsvirtuallab/megnet/badge.svg?branch=master)](https://coveralls.io/github/materialsvirtuallab/megnet?branch=master&service=github)
[![Downloads](https://pepy.tech/badge/megnet)](https://pepy.tech/project/megnet)
[![Linting](https://github.com/materialsvirtuallab/megnet/workflows/Linting/badge.svg)](https://github.com/materialsvirtuallab/megnet/workflows/Linting/badge.svg)
[![Testing](https://github.com/materialsvirtuallab/megnet/workflows/Testing%20-%20main/badge.svg)](https://github.com/materialsvirtuallab/megnet/workflows/Testing%20-%20main/badge.svg)


[![Binder](https://mybinder.org/badge_logo.svg)](https://mybinder.org/v2/gh/materialsvirtuallab/megnet/master)

# Table of Contents
* [Introduction](#introduction)
* [MEGNet Framework](#megnet-framework)
* [Installation](#installation)
* [Usage](#usage)
* [Datasets](#datasets)
* [Implementation details](#implementation-details)
* [Computing requirements](#computing-requirements)
* [Known limitations](#limitations)
* [Contributors](#contributors)
* [References](#references)

<a name="introduction"></a>
# Introduction

This repository represents the efforts of the [Materials Virtual Lab](http://www.materialsvirtuallab.org)
in developing graph networks for machine learning in materials science. It is a
work in progress, and the models we have developed thus far represent our best
efforts to date. We welcome efforts by anyone to build and test models using
our code and data, all of which are publicly available. Any comments or
suggestions are also welcome (please post on the GitHub Issues page).

A web app using our pre-trained MEGNet models for property prediction in
crystals is available at [http://megnet.crystals.ai](http://megnet.crystals.ai). For tutorials, please visit `notebooks` in this repo. We have also established an online simulation tool and a tutorial lecture at nanoHUB ([https://nanohub.org/resources/megnet](https://nanohub.org/resources/megnet)).

Note: A [DGL implementation of MEGNet](https://github.com/materialsvirtuallab/m3gnet-dgl) is now available. For users
trying to build their own MEGNet models, it is highly recommended that you check out this version, which may be
easier to work with and extend in the future.

<a name="megnet-framework"></a>
# MEGNet framework

The MatErials Graph Network (MEGNet) is an implementation of DeepMind's graph
networks[1] for universal machine learning in materials science. We have
demonstrated its success in achieving very low prediction errors in a broad
array of properties in both molecules and crystals (see
["Graph Networks as a Universal Machine Learning Framework for Molecules and Crystals"](https://doi.org/10.1021/acs.chemmater.9b01294)[2]). New releases have included our recent work on multi-fidelity materials property modeling (See ["Learning properties of ordered and disordered materials from multi-fidelity data"](https://www.nature.com/articles/s43588-020-00002-x)[3]).

Briefly, Figure 1 shows the sequential update steps of the graph network,
whereby bonds, atoms, and global state attributes are updated using information
from each other, generating an output graph.

![GraphModel diagram](resources/model_diagram_small.jpg)
<div align='center'><strong>Figure 1. The graph network update function.</strong></div>

Figure 2 shows the overall schematic of MEGNet. Each graph network module
is preceded by two multi-layer perceptrons (known as Dense layers in Keras
terminology), constituting a MEGNet block. Multiple MEGNet blocks can be
stacked, allowing for information flow across greater spatial distances. The
number of blocks required depends on the range of interactions necessary to
predict a target property. In the final step, a `set2set` layer is used to map
the output to a scalar/vector property.

![GraphModel architecture](resources/model_arch_small.jpg)
<div align='center'><strong>Figure 2. Schematic of MatErials Graph Network.</strong></div>
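
For instance, the number of stacked blocks is a hyperparameter of the
`MEGNetModel` class described under Usage below. A minimal sketch, assuming the
constructor exposes an `nblocks` argument (the other arguments mirror the
training example later in this README):

```python
import numpy as np
from megnet.data.crystal import CrystalGraph
from megnet.models import MEGNetModel

# Sketch: stack more MEGNet blocks to capture longer-range interactions.
# `nblocks` is assumed to be a constructor argument (default 3).
model = MEGNetModel(graph_converter=CrystalGraph(cutoff=5),
                    centers=np.linspace(0, 6, 100), width=0.5,
                    nblocks=3)
```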

<a name="installation"></a>
# Installation

The latest stable version of MEGNet can be installed via pip:

```bash
pip install megnet
```

For the latest dev version, please clone this repo and install using:

```bash
python setup.py develop
```

<a name="usage"></a>
# Usage

Our current implementation supports a variety of use cases for users with
different requirements and experience with deep learning. Please also visit
the [notebooks directory](notebooks) for Jupyter notebooks with more detailed code examples.

## Using pre-built models

In our work, we have already built MEGNet models for the QM9 and Materials
Project data sets. These models are provided as serialized HDF5+JSON files.
Users who are purely interested in using these models for prediction can
quickly load and use them via the convenient `MEGNetModel.from_file` method.
The models are stored in the `mvl_models` folder of this repo. The following
models are available:

* QM9 molecule data:
    - HOMO: Highest occupied molecular orbital energy
    - LUMO: Lowest unoccupied molecular orbital energy
    - Gap: energy gap
    - ZPVE: zero point vibrational energy
    - µ: dipole moment
    - α: isotropic polarizability
    - \<R2\>: electronic spatial extent
    - U0: internal energy at 0 K
    - U: internal energy at 298 K
    - H: enthalpy at 298 K
    - G: Gibbs free energy at 298 K
    - Cv: heat capacity at 298 K
    - ω1: highest vibrational frequency.
* Materials Project data:
    - Formation energy from the elements
    - Band gap
    - Log 10 of Bulk Modulus (K)
    - Log 10 of Shear Modulus (G)

The MAEs on the various models are given below:

### Performance of QM9 MEGNet-Simple models

| Property | Units      | MAE   |
|----------|------------|-------|
| HOMO     | eV         | 0.043 |
| LUMO     | eV         | 0.044 |
| Gap      | eV         | 0.066 |
| ZPVE     | meV        | 1.43  |
| µ        | Debye      | 0.05  |
| α        | Bohr^3     | 0.081 |
| \<R2\>   | Bohr^2     | 0.302 |
| U0       | eV         | 0.012 |
| U        | eV         | 0.013 |
| H        | eV         | 0.012 |
| G        | eV         | 0.012 |
| Cv       | cal/(mol·K) | 0.029 |
| ω1       | cm^-1      | 1.18  |

### Performance of MP-2018.6.1

| Property | Units      | MAE   |
|----------|------------|-------|
| Ef       | eV/atom    | 0.028 |
| Eg       | eV         | 0.33  |
| K_VRH    | log10(GPa) | 0.050 |
| G_VRH    | log10(GPa) | 0.079 |

### Performance of MP-2019.4.1

| Property | Units      | MAE   |
|----------|------------|-------|
| Ef       | eV/atom    | 0.026 |
| Efermi   | eV         | 0.288 |

New models will be added to the [mvl_models](mvl_models) folder as they are
developed. Each folder contains a summary of model details and benchmarks. For
the initial models and benchmark comparisons to previous models,
please refer to ["Graph Networks as a Universal Machine Learning Framework for Molecules and Crystals"](https://doi.org/10.1021/acs.chemmater.9b01294)[2].

Below is an example of crystal model usage:

```python
from megnet.utils.models import load_model
from pymatgen.core import Structure, Lattice

# load a model in megnet.utils.models.AVAILABLE_MODELS
model = load_model("logK_MP_2018")

# We can construct a structure using pymatgen
structure = Structure(Lattice.cubic(3.167), ['Mo', 'Mo'],
                      [[0, 0, 0], [0.5, 0.5, 0.5]])

# Use the model to predict bulk modulus K. Note that the model is trained on
# log10 K. So a conversion is necessary.
predicted_K = 10 ** model.predict_structure(structure).ravel()[0]
print(f'The predicted K for {structure.composition.reduced_formula} is {predicted_K:.0f} GPa.')
```
A full example is in [notebooks/crystal_example.ipynb](notebooks/crystal_example.ipynb).

For molecular models, we have an example in
[notebooks/qm9_pretrained.ipynb](notebooks/qm9_pretrained.ipynb).
Prediction directly from a pymatgen Molecule object is supported. With a few
more lines of code, the model can predict from the SMILES representation of a
molecule, as shown in the example. It is also straightforward to load an
`.xyz` molecule file with pymatgen and predict its properties using the models
(see the sketch after the next code block). However, users are generally
advised not to use the QM9 molecule models on molecules outside the QM9
dataset, since the training data coverage is limited.

Below is an example of predicting the HOMO from a SMILES representation:

```python
from megnet.utils.molecule import get_pmg_mol_from_smiles
from megnet.models import MEGNetModel

# The model API is the same for molecules and crystals; you can also use the
# load_model method as in the previous example
model = MEGNetModel.from_file('mvl_models/qm9-2018.6.1/HOMO.hdf5')
# Need to convert SMILES into pymatgen Molecule
mol = get_pmg_mol_from_smiles("C")
model.predict_structure(mol)
```
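
Loading from an `.xyz` file works the same way. A sketch (the file path is
illustrative):

```python
from pymatgen.core import Molecule

# Load a molecule from an XYZ file and reuse the model loaded above.
mol = Molecule.from_file("methane.xyz")  # hypothetical path
print(model.predict_structure(mol))
```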

## Training a new MEGNetModel from structures

For users who wish to build a new model from a set of crystal structures with
corresponding properties, there is a convenient `MEGNetModel` class for setting
up and training the model. By default, the number of MEGNet blocks is 3 and the
atomic number Z is used as the only node feature (with embedding).

```python
from megnet.models import MEGNetModel
from megnet.data.crystal import CrystalGraph
import numpy as np

nfeat_bond = 10
r_cutoff = 5
gaussian_centers = np.linspace(0, r_cutoff + 1, nfeat_bond)
gaussian_width = 0.5
graph_converter = CrystalGraph(cutoff=r_cutoff)
model = MEGNetModel(graph_converter=graph_converter, centers=gaussian_centers, width=gaussian_width)

# Model training
# Here, `structures` is a list of pymatgen Structure objects.
# `targets` is a corresponding list of properties.
model.train(structures, targets, epochs=10)

# Predict the property of a new structure
pred_target = model.predict_structure(new_structure)
```
Note that for realistic models, `nfeat_bond` can be set to 100 and `epochs` to
1000. Some structures in the training pool may be invalid (i.e., they contain
isolated atoms); in that case, use the `train_from_graphs` method to train on
the valid graphs only, as shown below.

Following the previous example,
```python
model = MEGNetModel(graph_converter=graph_converter, centers=gaussian_centers, width=gaussian_width)
graphs_valid = []
targets_valid = []
structures_invalid = []
for s, p in zip(structures, targets):
    try:
        graph = model.graph_converter.convert(s)
        graphs_valid.append(graph)
        targets_valid.append(p)
    except Exception:  # conversion failed, e.g., isolated atoms beyond the cutoff
        structures_invalid.append(s)

# train the model using valid graphs and targets
model.train_from_graphs(graphs_valid, targets_valid)
```
For model details and benchmarks, please refer to ["Graph Networks as a Universal Machine Learning Framework for Molecules and Crystals"](https://doi.org/10.1021/acs.chemmater.9b01294)[2].


### Training multi-fidelity graph networks

Please see the folder `multifidelity` for specific examples.

### Pre-trained elemental embeddings

A key finding of our work is that element embeddings from trained formation
energy models encode useful chemical information that can be transfer-learned
to develop models on smaller datasets (e.g., elastic constants, band gaps),
with better convergence and lower errors. These embeddings are also
potentially useful in developing other ML models and applications. These
embeddings have been made available via the following code:

```python
from megnet.data.crystal import get_elemental_embeddings

el_embeddings = get_elemental_embeddings()
```
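
For instance, to inspect the embedding of a single element (a sketch, assuming
the return value is a dict keyed by element symbol):

```python
# Each element maps to a fixed-length learned feature vector.
print(len(el_embeddings['Fe']))  # embedding dimensionality
```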

For an example of transfer learning from the formation-energy elemental
embeddings to other models, please see [notebooks/transfer_learning.ipynb](notebooks/transfer_learning.ipynb).

## Customized Graph Network Models

For users who are familiar with deep learning and Keras and wish to build
customized graph network based models, the following example outlines how a
custom model can be constructed from `MEGNetLayer`, which is essentially our
implementation of the graph network using neural networks:

```python
from tensorflow.keras.layers import Input, Dense
from tensorflow.keras.models import Model
from megnet.layers import MEGNetLayer, Set2Set

n_atom_feature = 20
n_bond_feature = 10
n_global_feature = 2

# Define model inputs
int32 = 'int32'
x1 = Input(shape=(None, n_atom_feature)) # atom feature placeholder
x2 = Input(shape=(None, n_bond_feature)) # bond feature placeholder
x3 = Input(shape=(None, n_global_feature)) # global feature placeholder
x4 = Input(shape=(None,), dtype=int32) # bond index1 placeholder
x5 = Input(shape=(None,), dtype=int32) # bond index2 placeholder
x6 = Input(shape=(None,), dtype=int32) # atom_ind placeholder
x7 = Input(shape=(None,), dtype=int32) # bond_ind placeholder
xs = [x1, x2, x3, x4, x5, x6, x7]

# Pass the inputs to the MEGNetLayer layer
# Here each list gives the hidden-layer sizes plus the output size;
# other choices such as [n1] or [n1, n2, n3, ...] also work.
out = MEGNetLayer([32, 16], [32, 16], [32, 16], pool_method='mean', activation='relu')(xs)

# The output is a tuple of updated graphs V, E and u.
# Since u is a per-structure quantity,
# we can use it directly to predict a per-structure property.
out = Dense(1)(out[2])

# Set up the model and compile it!
model = Model(inputs=xs, outputs=out)
model.compile(loss='mse', optimizer='adam')
```

With less than 20 lines of code, you have built a graph network model that is
ready for materials property prediction!
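
As a quick smoke test, you can feed the model a tiny hand-assembled graph. The
shapes below follow the 1-batch "giant graph" convention explained under
Implementation details; the values themselves are hypothetical:

```python
import numpy as np

# One structure with 2 atoms and 2 directed bonds; the batch dimension is 1.
V = np.random.rand(1, 2, n_atom_feature)
E = np.random.rand(1, 2, n_bond_feature)
u = np.random.rand(1, 1, n_global_feature)
index1 = np.array([[0, 1]], dtype='int32')    # bond start atoms
index2 = np.array([[1, 0]], dtype='int32')    # bond end atoms
atom_ind = np.array([[0, 0]], dtype='int32')  # atom-to-structure membership
bond_ind = np.array([[0, 0]], dtype='int32')  # bond-to-structure membership
print(model.predict([V, E, u, index1, index2, atom_ind, bond_ind]).shape)
```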

<a name="implementation-details"></a>
# Implementation details

Graph networks[1] are a superclass of graph-based neural networks. They
include a few innovations compared to conventional graph-based neural networks.

* Global state attributes are added to the node/edge graph representation.
  These features serve as a portal for structure-independent features such as
  temperature and pressure, and as an information-exchange placeholder that
  facilitates information passing across longer spatial domains.
* The update function involves the message interchange among all three levels
  of information, i.e., the node, bond and state information. It is therefore a
  highly general model.

The `MEGNet` model implements two major components: (a) the `graph network`
layer and (b) the `set2set` layer.[4] The layers are built on the
[Keras](https://keras.io/) API and are thus compatible with other Keras modules.

Different crystals/molecules have different numbers of atoms, so data cannot
be batched without padding the structures to a uniform atom count. `MEGNet`
takes a different approach: instead of batching structures, we assemble many
structures into one giant structure, which has a vector output with each entry
being the target value for the corresponding structure. The batch size is
therefore always 1.

Assuming a structure has N atoms and M bonds, a structure graph is represented
as **V** (nodes/vertices, representing atoms), **E** (edges, representing bonds)
and **u** (the global state vector). **V** is an N\*Nv matrix. **E** comprises
an M\*Nm matrix for the bond attributes together with index pairs (rk, sk) for
the atoms connected by each bond. **u** is a vector of length Nu. We vectorize
rk and sk to form `index1` and `index2`, both vectors of length M. In summary,
the graph is a data structure with **V** (N\*Nv), **E** (M\*Nm), **u**
(Nu, ), `index1` (M, ) and `index2` (M, ).
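
In code, this data structure can be inspected by converting a pymatgen
structure. A sketch (the dict keys below follow the description above and
should be treated as assumptions):

```python
from pymatgen.core import Lattice, Structure
from megnet.data.crystal import CrystalGraph

structure = Structure(Lattice.cubic(3.167), ['Mo', 'Mo'],
                      [[0, 0, 0], [0.5, 0.5, 0.5]])
graph = CrystalGraph(cutoff=4).convert(structure)
print(graph['index1'], graph['index2'])  # the (rk, sk) pairs as two vectors
```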

We then assemble several structures together. For **V**, we directly append
the atomic attributes from all structures, forming a matrix (1\*N'\*Nv), where
N' > N. To indicate which structure each atom attribute vector belongs to, we
use an `atom_ind` vector. For example, if `N'=5` and the first 3 atoms belong
to the first structure and the remaining 2 to the second, the `atom_ind`
vector is `[0, 0, 0, 1, 1]`. Bond attributes are appended in the same way,
with a `bond_ind` vector indicating which structure each bond belongs to. For
`index1` and `index2`, the integer values must be shifted. For example, if
`index1` and `index2` are `[0, 0, 1, 1]` and `[1, 1, 0, 0]` for both structure
1 and structure 2, the assembled indices are `[0, 0, 1, 1, 2, 2, 3, 3]` and
`[1, 1, 0, 0, 3, 3, 2, 2]`. Finally, **u** gains a new dimension to account
for the number of structures, becoming a 1\*Ng\*Nu tensor, where Ng is the
number of structures. A leading `1` is added to all inputs because the batch
size is fixed at 1 (one giant graph) to comply with Keras input requirements.
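
The index shifting above is plain bookkeeping. A minimal sketch (not the
library's internal code):

```python
# Assemble two 2-atom structures into one giant graph by offsetting the bond
# indices of the second structure by the atom count of the first.
index1_s1, index2_s1 = [0, 0, 1, 1], [1, 1, 0, 0]  # structure 1
index1_s2, index2_s2 = [0, 0, 1, 1], [1, 1, 0, 0]  # structure 2
offset = 2  # number of atoms in structure 1
index1 = index1_s1 + [i + offset for i in index1_s2]
index2 = index2_s1 + [i + offset for i in index2_s2]
atom_ind = [0, 0, 1, 1]  # atom-to-structure membership
print(index1)  # [0, 0, 1, 1, 2, 2, 3, 3]
print(index2)  # [1, 1, 0, 0, 3, 3, 2, 2]
```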

In summary, the inputs to the model are **V** (1\*N'\*Nv), **E** (1\*M'\*Nm),
**u** (1\*Ng\*Nu), `index1` (1\*M'), `index2` (1\*M'), `atom_ind` (1\*N'), and
`bond_ind` (1\*M'). For Z-only atomic features, **V** is a 1\*N' vector.

<a name="datasets"></a>
# Data sets

To aid others in reproducing (and improving on) our results, we have provided
our MP-crystals-2018.6.1 crystal data set via [figshare](https://figshare.com/articles/Graphs_of_materials_project/7451351)[5].
The MP-crystals-2018.6.1 data set comprises the DFT-computed energies and
band gaps of 69,640 crystals from the [Materials Project](http://www.materialsproject.org)
obtained via the [Python Materials Genomics (pymatgen)](http://pymatgen.org)
interface to the Materials Application Programming Interface (API)[6] on
June 1, 2018. The crystal graphs were constructed using a radius cut-off of 4
angstroms. Using this cut-off, 69,239 crystals do not contain isolated atoms
and are used in the models. A subset of 5,830 structures has elasticity data
without calculation warnings and is used for the elasticity models.

The molecule data set used in this work is the QM9 data set processed by
Faber et al.[7] It contains the B3LYP/6-31G(2df,p)-level DFT calculation
results for 130,462 small organic molecules containing up to 9 heavy atoms.

<a name="computing-requirements"></a>
# Computing requirements

Training: It should be noted that training MEGNet models, like other deep
learning models, is fairly computationally intensive with large datasets. In
our work, we use dedicated GPU resources to train MEGNet models on 100,000
crystals/molecules. It is recommended that you do the same.

Prediction: Once trained, prediction with MEGNet models is fairly cheap. For
example, the http://megnet.crystals.ai web app runs on a single hobby dyno on
Heroku and provides predictions for any crystal within seconds.

<a name="limitations"></a>
# Known limitations

- `Isolated atoms` error. This error occurs when, at the model's cutoff radius
(4 Å for 2018 models and 5 Å for 2019 models), the crystal structure contains
isolated atoms, i.e., atoms with no neighbors within the `cutoff` distance.
Most of the time, such structures can simply be discarded, since we found that
they tend to have a high energy above hull (i.e., they are less stable). If
this error is an essential issue for your particular problem, please feel free
to email us and we will consider releasing a new model with an increased
cutoff. A simple pre-screening helper is sketched below.
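
The helper below is illustrative (not part of the library) and uses only
pymatgen's `get_neighbors`:

```python
from pymatgen.core import Structure

def has_isolated_atoms(structure: Structure, cutoff: float = 4.0) -> bool:
    """Return True if any site has no neighbors within `cutoff` (in Å)."""
    return any(len(structure.get_neighbors(site, cutoff)) == 0
               for site in structure)
```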

<a name="contributors"></a>
# Contributors
1. Chi Chen from the Materials Virtual Lab is the lead developer of MEGNet.
2. Shyue Ping Ong and other members of the Materials Virtual Lab contribute to
   general improvements of MEGNet and its applications.
3. Logan Ward has made extensive contributions, especially to the development of molecular graph
   portions of MEGNet.

<a name="references"></a>
# References

1. Battaglia, P. W.; Hamrick, J. B.; Bapst, V.; Sanchez-Gonzalez, A.;
   Zambaldi, V.; Malinowski, M.; Tacchetti, A.; Raposo, D.; Santoro, A.;
   Faulkner, R.; et al. Relational inductive biases, deep learning, and graph
   networks. 2018, 1–38. [arXiv:1806.01261](https://arxiv.org/abs/1806.01261)
2. Chen, C.; Ye, W.; Zuo, Y.; Zheng, C.; Ong, S. P. Graph Networks as a
   Universal Machine Learning Framework for Molecules and Crystals. Chemistry
   of Materials 2019, 31(9), 3564-3572.
   [doi:10.1021/acs.chemmater.9b01294](https://doi.org/10.1021/acs.chemmater.9b01294)
3. Chen, C.; Zuo, Y.; Ye, W.; Li, X.G.; Ong, S. P. Learning properties of ordered and
   disordered materials from multi-fidelity data. Nature Computational Science 2021,
   1, 46–53 [doi:10.1038/s43588-020-00002-x](https://www.nature.com/articles/s43588-020-00002-x).
4. Vinyals, O.; Bengio, S.; Kudlur, M. Order Matters: Sequence to sequence for
   sets. 2015, arXiv preprint. [arXiv:1511.06391](https://arxiv.org/abs/1511.06391)
5. Graphs of Materials Project. figshare. https://figshare.com/articles/Graphs_of_materials_project/7451351
6. Ong, S. P.; Cholia, S.; Jain, A.; Brafman, M.; Gunter, D.; Ceder, G.;
   Persson, K. A. The Materials Application Programming Interface (API): A
   simple, flexible and efficient API for materials data based on
   REpresentational State Transfer (REST) principles. Comput. Mater. Sci. 2015,
   97, 209–215 DOI: [10.1016/j.commatsci.2014.10.037](http://dx.doi.org/10.1016/j.commatsci.2014.10.037).
7. Faber, F. A.; Hutchison, L.; Huang, B.; Gilmer, J.; Schoenholz, S. S.;
   Dahl, G. E.; Vinyals, O.; Kearnes, S.; Riley, P. F.; von Lilienfeld, O. A.
   Prediction errors of molecular machine learning models lower than hybrid DFT
   error. Journal of Chemical Theory and Computation 2017, 13, 5255–5264.
   DOI: [10.1021/acs.jctc.7b00577](http://dx.doi.org/10.1021/acs.jctc.7b00577)

            
