`info` method - expose information about the model #1178

martinmodrak · 2023-08-10T08:24:46Z

Summary:

Add a new info method that lets tools (or users) get info about the model to help e.g. deciding if the model needs recompilation.

Description:

Currently, a lot of the metadata about a compiled model can only be retrieved by running inference (Stan version, stanc flags) or not at all (cpp compile options, compiler, ...). This limits the ability of wrappers like cmdstanr to make good decisions on when to recompile a model as a change in compiler options or Stan version will not be recognized by checking modification times.

Beyond choosing to recompile or not, other tools may benefit from this metadata in my own work, the SBC package tries to do caching of its results and needs to check if the model it is given is substantially equivalent to a model the cached results were compiled with. It could rely on modification time, but this could result in some unnecessary recomputations (e.g. when the model is modified, found to be problematic and then the change is reverted). For those more extended usages by tools, it might make sense to also include the Stan code and the contents of user-provided extra .hpp in the information stored.

Since all of the information is known at a compile time, a simple implementation would have a .json (or other format) created at compile time and embedded as a resource. The output format could be configurable, but since the primary consumers are likely to be tools, it should probably default to JSON or similar.

Additional information:

If there is an agreement on implementing this, I'd be happy to write a PR.

Current Version:

v2.32.2

The text was updated successfully, but these errors were encountered:

WardBrian · 2023-08-10T12:32:20Z

We already have something like this (it’s even called info, #1010), but it’s definitely not the most informative.

martinmodrak · 2023-08-10T12:43:01Z

Oh, I didn't see it in help or manual, so I thought it doesn't exist :-D . So then the proposal would be to add stanc options, cpp options and possible also compiler info and model and header code (and update the documentation)...

WardBrian · 2023-08-10T15:35:24Z

Model code might not always be desired (I'm imagining a situation where someone wants to provide a proprietary package without the source), but we could have it as an option, or we could do something like include a hash of the source or something.

martinmodrak · 2023-08-22T14:06:41Z

So, to be a bit more specific, I propose to:

Cmdstan would create a .json file containing metadata about the model. I believe this should be possible completely within make targets, but in the worst case scenario, it might need a small compiled utility.
The metadata would by default contain all options passed to stanc and the c compiler, info about compiler (CXX_YYY variables), hash of the source code (or potentially hash of normalized source code) and hash of the user header. There would be an option to also add the full source code. (I am not sure I could get hashing to be reliable across platforms within make, so that might require the special utility program)
The .json file will be included as a binary resource in the executable (there appears to be a bewildering array of options to actually do this, so I'll need to investigate a bit what would be the most reliable approach)
The info method would load the metadata from the .json resource, and combine it with the currently stored medata
The info method would get an argument for output format - it could be the key:value format currently used or json for something more readable by tools (once again, especially for including the whole source code)
The info method would be documented in cmdstan User's guide.

Would that make sense? Are there other people I should consult this with? Should I make a post on Discourse?

I find a file resources to be potentially more flexible (especially for adding the whole source code), but I am open to be convinced that just passing everything by macros and -D is better.

martinmodrak mentioned this issue Aug 10, 2023

Store and expose stanc_options stan-dev/cmdstanr#814

Open

martinmodrak mentioned this issue Aug 18, 2023

Recompile only on changes to output of stanc3 auto-formatter (feature request & design discussion) stan-dev/cmdstanr#423

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

`info` method - expose information about the model #1178

`info` method - expose information about the model #1178

martinmodrak commented Aug 10, 2023

WardBrian commented Aug 10, 2023

martinmodrak commented Aug 10, 2023

WardBrian commented Aug 10, 2023

martinmodrak commented Aug 22, 2023

info method - expose information about the model #1178

info method - expose information about the model #1178

Comments

martinmodrak commented Aug 10, 2023

Summary:

Description:

Additional information:

Current Version:

WardBrian commented Aug 10, 2023

martinmodrak commented Aug 10, 2023

WardBrian commented Aug 10, 2023

martinmodrak commented Aug 22, 2023

`info` method - expose information about the model #1178

`info` method - expose information about the model #1178