-
Notifications
You must be signed in to change notification settings - Fork 29
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Qm7x dataset #216
Qm7x dataset #216
Conversation
@allaffa Comparing energies is complicated because QM methods have many settings which modify their energy function:
And there are 2 different ways to reference energies between different QM methods (to make comparison of the "energy" across methods valid). Both subtract off the energy of all isolated atoms computed with the same QM method. One (simple) way treats isolated atoms as single, uncharged atoms in gas phase. The other (physical) way is to treat isolated atoms as part of their equilibrium material (e.g. N2 gas for nitrogen, solid graphite for carbon, liquid for bromine, etc). Among the QM7-x properties, the most useful energies for comparison are:
The first one is the total energy output by DFT with the PBE0 functional. The second one is the PBE0 energy after subtracting the energies of the isolated atoms using the first (simple) method. Ideally we should fit that second one. OpenCatalyst provides "raw DFT" energies output from VASP with DFT+U as well as (since OC22) reference energies using a non-standard version of the second (physical) method, - see also Table 17 on p. 42 of the OC22 paper. However, they don't seem to have included similar references for the slab atoms. The VASP forum notes that the "raw DFT" energies are usually close to atomization energies, but this isn't guaranteed. Calculations on single atoms are difficult with VASP. So we need to adjust the referenced energies of OC or QM7x to compare. Despite being non-physical, I favor the simple atomization energies because their reference QM calculations are more exact (single atom vs. liquid, for example). |
@pzhanggit @jychoi-hpc |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
It looks good to me.
* qm7example rebased with updated capabiltiies from the main * qm7x example updated * qm7x example updated * coordinates added in data.x * black formatting of train.py * option to normalize energy by number of atoms added * qm7x example updated with correct variable name * energy_per_atoms boolean variable added * write only ADIOS format as default * qm7 dataset fixed * black formatting fixed --------- Co-authored-by: Massimiliano Lupo Pasini <[email protected]> Co-authored-by: Massimiliano Lupo Pasini <[email protected]> Co-authored-by: Massimiliano Lupo Pasini <[email protected]>
@frobnitzem
among all the energies that can be retrieved from the QM7x dataset, may you help me identify the one that corresponds to the energy returned by OC2020 and OC2022?