One-step conversion from reservoir Earth models to Exodus II format

em2ex is a python program that converts a reservoir model to an Exodus II file that can then be used in a simulation tool (such as MOOSE) or viewed in a visualisation tool (such as Paraview).

Johansen formation converted to Exodus format from Eclipse dataset

Currently, em2ex supports two reservoir modelling formats:

Eclipse (ASCII files)
Leapfrog Geothermal (CSV files)

Setup

em2ex is a pure python program that does not depend on any external libraries (but does require a few common python packages), so can run on any system with a working python installation.

Clone repository

em2ex can be installed by cloning this repository from GitHub using

git clone git@github.com:cpgr/em2ex.git

or

git clone https://github.com/cpgr/em2ex.git

This will add a folder em2ex containing the code.

Required python packages

The following python packages are required to run em2ex

numpy
pandas
netCDF4

The first two are typically already installed, but if not, can be installed using pip. The netCDF4 package can be installed using pip as well:

pip install netcdf4

Two additional python package, pytest and pyYAML are required to run the test script. Again, these can be installed using pip, e.g.

pip install pytest

Optional Exodus API

em2ex can optionally use the Exodus II API instead of the simplified pyexodus API included in the code, which is available through the SEACAS package.

For MOOSE users, this package is installed as part of the default environment. To use the Exodus python API, the path to the python API in the SEACAS package (/opt/moose/seacas/lib) should be added to the PYTHONPATH environment variable

export PYTHONPATH=$PYTHONPATH:/opt/moose/seacas/lib

For non-MOOSE users, SEACAS can be installed manually and the location of exodus.py added to PYTHONPATH.

Usage

To convert a reservoir model to an Exodus II file, run

./em2ex.py filename

which produces an Exodus II file filenanem.e with the cell-centred reservoir properties saved as elemental variables, and nodal properties saved as nodal variables.

For example, the test/eclipse directory contains several ASCII Eclipse reservoir model (.grdecl file extension). These can be converted to an Exodus II file using

./em2ex.py simple_cube.grdecl

Similarly, the test/leapfrog directory contains a set of example Leapfrog reservoir model files that can be converted to Exodus II files using

./em2ex.py test

for example.

Configuration files

Once a workflow uses more than a handful of options, putting them in a config file is easier than re-typing the same command line. em2ex accepts a YAML config via --config:

./em2ex.py --config my_workflow.yaml model.grdecl

The config is a YAML mapping. Each key is the CLI flag name with the leading -- stripped and hyphens converted to underscores — exactly what you'd type after -- in the shell, normalised for YAML. So --refine-xy becomes refine_xy (or, if you prefer, refine-xy — both work). Values follow the option's natural type: a single string or number for single-value flags, a YAML list for multi-value flags, a boolean for switches:

# Geometry transforms
extract_i: [10, 50]
extract_j: [10, 50]
refine_xy: [2, 2]

# Property handling
extra_keywords:
  - PVTNUM
  - EQLNUM

# Output controls
output: model.e             # corresponds to --output
force: true                 # corresponds to --force
fault_sidesets: true
convert_to_m: true

# Mesh quality
strict_jacobians: true

The filename can be specified in the config too (filename: path/to/model.grdecl), in which case the positional argument can be omitted on the CLI.

Precedence: command-line flags always win over config values, and config values win over the parser's own defaults. So a config that defaults force: false can still be overridden by passing -f at the prompt.

Unknown keys are rejected with a list of valid keys, so typos like refine_xz (when you meant refine_xy) surface immediately rather than being silently ignored.

Commandline options

A number of optional commandline options are available, and can be seen by passing the --help flag:

$ ./em2ex.py --help

usage: em2ex.py [-h] [--config FILE] [-o OUTPUT_FILE]
                [--filetype {eclipse,leapfrog}] [--no-nodesets]
                [--no-sidesets] [-f] [-u] [--flip]
                [--translate TRANSLATE TRANSLATE] [--mapaxes] [--pinch]
                [--pinch-tol PINCH_TOL] [--refine-xy RX RY]
                [--extract-i I_LO I_HI] [--extract-j J_LO J_HI]
                [--extract-k K_LO K_HI] [--extra-keywords KEY [KEY ...]]
                [--fault-sidesets] [--convert-to-m]
                [--no-check-jacobians] [--strict-jacobians]
                [filename]

Converts earth model to Exodus II format

positional arguments:
  filename

options:
  -h, --help            show this help message and exit
  --config FILE         YAML config file specifying default values for any of
                        this script's options. Values from the config are
                        overridden by command-line flags. Use the option's
                        `dest` name as the key (e.g. refine_xy, extract_i,
                        extra_keywords).
  -o OUTPUT_FILE, --output OUTPUT_FILE
                        File name for output
  --filetype {eclipse,leapfrog}
                        Explicitly state the filetype for unknown extensions
  --no-nodesets         Disable addition of nodesets
  --no-sidesets         Disable addition of sidesets
  -f, --force           Overwrite filename.e if it exists
  -u, --use-official-api
                        Use exodus.py to write files
  --flip                Flip the sign of the Z coordinates
  --translate TRANSLATE TRANSLATE
                        Translate the (x, y) coordinates by this amount
  --mapaxes             Use the MAPAXES coordinates for an Eclipse file
  --pinch               Remove pinched elements (coincident corners within
                        --pinch-tol). Off by default; faithful conversion is
                        produced without this flag.
  --pinch-tol PINCH_TOL
                        Tolerance for coincident corners when removing pinched
                        elements (default: 1e-3)
  --refine-xy RX RY     Refine the grid laterally by integer factors RX in x
                        and RY in y (vertical resolution unchanged). Each
                        child cell inherits its parent's element properties.
  --extract-i I_LO I_HI
                        Extract cells I_LO..I_HI along the x-axis (1-based
                        inclusive, Eclipse-style). Cells are taken in file
                        order, before any coordinate-system normalisation;
                        runs before --refine-xy if both are given.
  --extract-j J_LO J_HI
                        Extract cells J_LO..J_HI along the y-axis (1-based
                        inclusive).
  --extract-k K_LO K_HI
                        Extract cells K_LO..K_HI along the z-axis (1-based
                        inclusive).
  --extra-keywords KEY [KEY ...]
                        Additional per-cell property keywords to read from the
                        grdecl file (e.g. PVTNUM EQLNUM FIPNUM). Each must be
                        a per-cell scalar of length NX*NY*NZ. Normalised to
                        uppercase. The reader recognises ACTNUM, SATNUM, PORO,
                        PERMX, PERMY, PERMZ, NTG, HEATCR and THCONR by
                        default.
  --fault-sidesets      Emit paired sidesets named "fault_primary" and
                        "fault_secondary" containing the faces on either side
                        of every fault (any internal face where adjacent cells
                        do not share their corner nodes).
  --convert-to-m        Convert grid coordinates to metres on output, using
                        the input file's GRIDUNIT keyword as the source unit.
                        Supported values are METRES (no-op), FEET and CM.
                        Files without GRIDUNIT are assumed to be in metres.
  --no-check-jacobians  Skip the per-element Jacobian sanity check. By default
                        em2ex computes the Jacobian at all 8 corners of every
                        HEX8 element and warns if any are non-positive
                        (degenerate or inverted), since those elements would
                        be rejected by most FEM solvers.
  --strict-jacobians    Treat any non-positive element Jacobian as a fatal
                        error and exit non-zero. By default such elements only
                        produce a warning. Useful for CI / scripted workflows.
  --remove-distorted    Remove elements with non-positive Jacobians (degenerate
                        or inverted) from the output mesh, reporting a count of
                        those removed. By default such elements are kept and
                        only a warning is printed.

Lateral refinement (Eclipse only)

The --refine-xy RX RY option refines an Eclipse grid in the (x, y) plane by integer factors RX and RY, leaving vertical resolution unchanged. This is useful for grdecl models whose cells are long and wide but thin: each parent cell is split into RX * RY children (so --refine-xy 2 2 turns one cell into four, not eight). Pillars are linearly interpolated to create new (x, y) coordinates, per-cell top and bottom faces are bilinearly interpolated within each parent (which preserves faults), and each child cell inherits all of its parent's element properties (PORO, PERMX, SATNUM, ACTNUM, etc.).

./em2ex.py --refine-xy 2 2 simple_cube.grdecl

RX and RY must be strictly positive integers; anything else is rejected up front with an informative error.

Extracting a subset (Eclipse only)

The --extract-i, --extract-j and --extract-k options pull a rectangular subset of cells out of a grdecl model along the x-, y- and z-axes respectively. Each takes two 1-based inclusive cell indices (Eclipse-style, matching the BOX keyword), and each is independently optional — any axis you don't restrict is kept in full. For example, to keep only cells i=10..30, j=5..40 across every layer:

./em2ex.py --extract-i 10 30 --extract-j 5 40 large.grdecl

Or to take only the top 20 layers:

./em2ex.py --extract-k 1 20 large.grdecl

Indices refer to cell positions as they appear in the file — same numbering you see in the SPECGRID keyword and in the order properties like PORO are listed. The subset is taken before any further coordinate processing.

Composition with --refine-xy. If both are given, extract runs first and refinement applies to the subset (so --extract-i 10 30 --refine-xy 2 2 extracts 21 cells along the x-axis and then refines to 42; it does not refine the full grid and then extract from it). This is almost always what you want — refining the whole grid just to throw most of it away would be wasteful.

Composition with --flip and with left-hand coordinate files. Extract operates on the file's i/j/k indexing before em2ex flips the z values (--flip) or normalises a left-handed coordinate system to right-handed (which happens automatically when the file's x or y coordinates decrease). The practical consequence:

--flip doesn't affect which cells are extracted — only their z sign in the output. The k range you give is the same range you'd give without --flip.
For a left-handed coordinate file, the extracted region is still the cells at file indices i=I_LO..I_HI (etc.), exactly as they're listed in PORO. In the output mesh, those cells then get re-numbered through the auto-flip to the canonical right-handed system, so their i (and/or j) indices in the produced Exodus mesh may run in the opposite direction from the file's. The geometry and properties are preserved; only the index sense changes.

If any range is out of bounds for the file's SPECGRID size, or if LO > HI, the conversion is rejected up front with the actual dimensions cited.

Per-cell properties (Eclipse only)

em2ex recognises the following per-cell scalar property keywords out of the box and emits each as an elemental variable on the resulting Exodus mesh:

ACTNUM, SATNUM
PORO, PERMX, PERMY, PERMZ
NTG (net-to-gross)
HEATCR (volumetric heat capacity), THCONR (rock thermal conductivity)

The Eclipse keyword catalogue is much larger than this. If the model uses keywords the reader doesn't know about (e.g. PVTNUM, EQLNUM, FIPNUM, custom in-house names), pass them with --extra-keywords:

./em2ex.py --extra-keywords PVTNUM EQLNUM FIPNUM model.grdecl

A few practical notes:

Keywords are normalised to uppercase, so --extra-keywords ntg fipnum and --extra-keywords NTG FIPNUM are equivalent.
Each named keyword must be a per-cell scalar block of NX*NY*NZ entries terminated by /. The existing array-size check applies to extras the same as to defaults — a mismatched block size is rejected with the offending keyword named.
If you ask for a keyword that doesn't appear in the file (or any of its INCLUDEd files), the conversion fails up front with the offending keyword named — typos in --extra-keywords are not silently ignored.
Because --extra-keywords takes a variable number of values (nargs='+'), put the input filename before it, or separate them with --. Either of these works:
```
./em2ex.py model.grdecl --extra-keywords PVTNUM EQLNUM
./em2ex.py --extra-keywords PVTNUM EQLNUM -- model.grdecl
```

Fault sidesets (Eclipse only)

When the input grdecl describes a faulted reservoir (cells on either side of a fault have different z values at the shared pillar), the resulting Exodus mesh is topologically disconnected across the fault — cells on each side own their own nodes, with no element neighbourship bridging the gap. The --fault-sidesets flag emits two paired sidesets so a downstream solver has named boundaries to attach cross-fault physics to:

./em2ex.py --fault-sidesets faulted.grdecl

fault_primary — the face on the "left/back/lower" side of every fault face (i.e. the lower-(i, j, k)-index cell's face that abuts the discontinuity).
fault_secondary — the matching face on the "right/front/upper" side (the higher-index cell's face).

The two sidesets always have the same length and the entries are ordered consistently — fault_primary[n] and fault_secondary[n] describe the same physical fault face, viewed from each side.

Detection is purely topological: any internal face where the two adjacent cells disagree on their four shared face-corner node IDs becomes a fault face. This catches i-, j- and k-direction discontinuities equally; pinched cells (removed by --pinch) don't appear since they're already inactive by the time fault detection runs.

Without --fault-sidesets, the output is unchanged and only the six standard boundary sidesets are written.

Coordinate units (Eclipse only)

The Eclipse GRIDUNIT keyword declares the length unit of the grid. em2ex recognises three values:

`GRIDUNIT` value	Length unit	Factor to metres
`METRES` (or absent — the Eclipse default)	metres	1.0
`FEET`	US survey feet	0.3048
`CM`	centimetres	0.01

By default, em2ex preserves the input file's units — the numbers in COORD and ZCORN are passed through to the Exodus mesh unchanged. The Exodus format itself has no concept of length units, so it's the modeller's responsibility to remember (or document downstream) what units the mesh is in.

To convert to metres on output, pass --convert-to-m:

./em2ex.py --convert-to-m model.grdecl

This multiplies every coordinate (COORD x/y/z and ZCORN z) by the appropriate factor and prints a one-line confirmation. Per-cell property values are never converted — --convert-to-m only affects geometry.

A few practical notes:

Non-metres files trigger an info note at the start of the run telling you what unit the file is in and reminding you about --convert-to-m. The conversion is opt-in — em2ex never silently rescales your data.
Files without GRIDUNIT are treated as metres (Eclipse's documented default). No info note, no conversion needed.
Unrecognised GRIDUNIT values print an info note saying conversion is not available; the numbers pass through. Asking for --convert-to-m on an unrecognised unit is rejected with a clear error.
Property units are entirely the modeller's responsibility. The GRIDUNIT keyword only describes the unit of the grid's coordinates. Per-cell properties like PERMX, HEATCR, THCONR, etc. carry their own unit conventions (Eclipse's METRIC, FIELD, LAB, PVT-M unit systems each define their own choices for pressure, flow rate, permeability, density, thermal conductivity, etc.). em2ex does not track those conventions and applies no conversion to property values, even when --convert-to-m is rescaling the geometry. If your input file is in FIELD units (psi, bbl/day, mD, BTU-based thermal quantities, etc.) and you convert the geometry to metres, the property values stay in FIELD units; the resulting mesh is internally inconsistent and will need property conversion downstream before it's physically meaningful.

Element Jacobian check

After conversion, em2ex evaluates the Jacobian at all 8 corners of every HEX8 element and prints a one-line summary:

Element Jacobian check: 1000000 / 1000000 elements OK

A non-positive Jacobian (zero = degenerate, negative = inverted) almost always means one of two things:

The input file uses a z increases upward convention rather than Eclipse's default z increases downward. The cells come out "upside down" and need --flip to invert the z values.
There are a few genuinely distorted cells — for example near faults that are so skewed the element is inverted, or near-pinched cells that slipped through the tolerance filter.

When all elements are non-positive the output is expanded with a hint:

Element Jacobian check: 27 negative, 0 zero, 0 OK (out of 27)
  All elements have non-positive Jacobians. Possible causes:
    - z-up coordinate convention: try --flip to invert the z-axis
    - Orientation-reversing coordinate system (e.g. MAPAXES handedness)
  Use --remove-distorted to remove these elements and proceed anyway.

When only some elements are bad, the report includes element IDs and centroid locations:

Element Jacobian check: 3 negative, 0 zero, 24 OK (out of 27)
  Examples of negative-Jacobian elements (showing up to 5 of 3):
    element 7: centroid (0.5, 0.5, 0.5), min Jacobian = -1.000e-03

By default the output file is still written regardless of warnings. Three flags adjust this:

--remove-distorted removes elements with non-positive Jacobians before writing, printing a count of those dropped. Useful when a small number of distorted cells near faults would otherwise cause solver failures.
--strict-jacobians upgrades any non-positive Jacobian to a fatal error (exit code 1). Useful in CI / scripted workflows where a bad mesh should stop the pipeline.
--no-check-jacobians skips the check entirely (a small time saving on very large grids, but disables a useful safety net).

Relationship to --pinch. --remove-distorted and --pinch are complementary rather than interchangeable. --pinch detects cells where any two corners are within --pinch-tol of each other — it catches near-coincident corners (e.g. a 0.5 m thick cell in a grid measured in metres) even when the Jacobian is still technically positive. --remove-distorted catches cells whose Jacobian has already reached zero or gone negative, which only happens once corners are exactly coincident or the cell has become inverted. In practice, running both is the safest option for grids with thin reservoir layers near faults: --pinch handles the near-zero cells that --remove-distorted would miss, and --remove-distorted catches any remaining inverted cells.

em2ex attempts to guess the reservoir model format from the file extension (see supported formats below). If the reservoir model has a non-standard file extension, the user can force em2ex to read the correct format using the --filetype commandline option.

For example, if the reservoir model is named model.dat but is actually an Eclipse ASCII file, then em2ex can still be used in the following manner

./em2ex.py --filetype eclipse model.dat

to produce an Exodus II model test.e.

If the SEACAS package is installed, then the python API from that package can be used instead of the provided pyexodus API using

./em2ex.py --use-official-api test.grdecl

Supported formats

em2ex currently supports:

File format	File extension
Eclipse ASCII	`.grdecl`
Leapfrog Geothermal	-

Note for Leapfrog Geothermal users

To prepare for usage, several steps must be taken in leapfrog.

First, the user must export a "block model" -- as a CSV with full header data. Leapfrog gives three options for export of block models,

CSV Block Model - this option includes the model definition info on the top of the CSV file. This option is required for use of this tool.
CSV Block Model + Text File - this option gives the same info as above, but in two files/
CSV Points - a raw dump of the point data

The CSV Block Model file must contain all of the elemental (material property) data--anything that is cell entered. You will need the rename to file to filename_cell.csv

Second, the user will need to create a second block model in Leapfrog that is n+1 bigger and with the base point being nx/2, ny/2, and nz/2 offset--this will make the second mesh centers align with the corners of the first mesh...giving the locations of the nodes. In Leapfrog, you can interpolate the field estimated pressure and temperature onto this block model. This second block model must be exported exactly the same as the first one. You will need to rename the file to filename_node.csv

Test suite

em2ex includes a python script run_tests.py which uses the pytest framework to run the included tests.

Note: The test suite generates and Exodus file from each reservoir model, and compares it with an existing Exodus file (the gold file). To compare these files, the test harness uses the exodiff utility (part of the SEACAS package) to compare Exodus files. If this package is already installed (for example, as part of MOOSE or to utilise the Exodus API), then the test suite can be run using

./run_tests.py

Alternatively, to avoid installing the entire SEACAS package just to run the test suite, the python pyexodiff package can be installed, and used in the test suite using

python -m pytest -v --exodiff=pyexodiff.py run_tests.py

New tests can be added anywhere within the test directory. The test harness recurses through this directory and all subdirectories looking for all instances of a tests file. This YAML file contains the details of each test in that directory.

The tests file syntax is basic YAML, and looks like:

simple_cube:
  filename: simple_cube.grdecl
  type: exodiff
  gold: simple_cube.e

In this example, the test harness will run

em2ex.py -f simple_cube.grdecl

and then compare the resulting Exodus II file with the file gold\simple_cube.e

exodiff simple_cube.e gold\simple_cube.e

The test harness can also test for expected error messages. For example, the follwing block in a tests file

missing_specgrid:
  filename: missing_specgrid.grdecl
  type: exception
  expected_error: No SPECGRID data found

will run

em2ex.py -f missing_specgrid.grdecl

and then check that the error message contains the string No SPECGRID data found.

Each tests files can contain multiple individual tests. When pytest runs the test suite, the top-level label for each individual test in the tests file (for example, the labels simple_cube and missing_specgrid in the above examples) will be printed to the commandline, along with the status of each test run.

The test suite is run automatically on all pull requests to ensure that em2ex continues to work as expected. To reduce the time for automated testing, these tests are run using the provided pyexodus API, as well as pyexodiff to compare the results.

Contributors

em2ex has been developed by

Chris Green, CSIRO (cpgr)
Rob Podgorney, INL (rpodgorney)
Michael Volkov, John Monash Science School (mickydroid)

New contributions are welcome, using the pull request feature of GitHub.

Feature requests/ bug reports

Any feature requests or bug reports should be made using the issues feature of GitHub. Pull requests are always welcome!

Name		Name	Last commit message	Last commit date
Latest commit History 155 Commits
.github/workflows		.github/workflows
exodus_model		exodus_model
images		images
pyexodiff @ 23c2a48		pyexodiff @ 23c2a48
pyexodus		pyexodus
readers		readers
test		test
.gitignore		.gitignore
.gitmodules		.gitmodules
.travis.yml		.travis.yml
LICENSE		LICENSE
README.md		README.md
conftest.py		conftest.py
em2ex.py		em2ex.py
environment.yml		environment.yml
run_tests.py		run_tests.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

One-step conversion from reservoir Earth models to Exodus II format

Setup

Clone repository

Required python packages

Optional Exodus API

Usage

Configuration files

Commandline options

Lateral refinement (Eclipse only)

Extracting a subset (Eclipse only)

Per-cell properties (Eclipse only)

Fault sidesets (Eclipse only)

Coordinate units (Eclipse only)

Element Jacobian check

Supported formats

Note for Leapfrog Geothermal users

Test suite

Contributors

Feature requests/ bug reports

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

One-step conversion from reservoir Earth models to Exodus II format

Setup

Clone repository

Required python packages

Optional Exodus API

Usage

Configuration files

Commandline options

Lateral refinement (Eclipse only)

Extracting a subset (Eclipse only)

Per-cell properties (Eclipse only)

Fault sidesets (Eclipse only)

Coordinate units (Eclipse only)

Element Jacobian check

Supported formats

Note for Leapfrog Geothermal users

Test suite

Contributors

Feature requests/ bug reports

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages