All persistent data lives under a single data root. By default that is `data/` in the current working directory. You can override it with the `MASS_DATA_DIR` environment variable or the global CLI option `-d, --data-dir <path>`.
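
The lookup described above can be sketched in a few lines of Python. This is a minimal illustration, not the CLI's actual code: the function name `resolve_data_root` is hypothetical, and the precedence (explicit `--data-dir` value first, then `MASS_DATA_DIR`, then `data/` in the working directory) is an assumption, since the text does not say which override wins when both are set.

```python
import os
from pathlib import Path
from typing import Optional


def resolve_data_root(cli_data_dir: Optional[str] = None) -> Path:
    """Return the data root as an absolute path.

    Assumed precedence (not confirmed by the docs):
    explicit CLI value, then MASS_DATA_DIR, then ./data.
    """
    if cli_data_dir:
        # Value that would come from -d / --data-dir.
        return Path(cli_data_dir).resolve()
    env_value = os.environ.get("MASS_DATA_DIR")
    if env_value:
        return Path(env_value).resolve()
    # Default: data/ in the current working directory.
    return (Path.cwd() / "data").resolve()
```

Calling `resolve_data_root()` with no argument and no environment variable set yields `data/` under the current working directory, which matches the documented default.
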

| Path | Description |
|---|---|
| `personas/<id>/<id>.json` | One JSON file per persona (metadata, details, and optional persona narrative in `details.persona`). |
| `personas/<id>/<id>_memories.json` | Persona memories (extracted from conversations). Optional; created when memory extraction runs. |
| `personas/<id>/<id>_state.json` | Persona runtime state. Optional; used for simulation and stateful chat. |
| `cohorts/<id>.json` | One JSON file per cohort. The CLI discovers cohorts by listing this directory. |
| `workspaces/<id>.json` | One JSON file per workspace (conversation state). |
| `reports/<id>.json` | One JSON file per report. |
| `report_responses/` | Report response data (e.g. per workspace). |
| `tasks/` | Background task state. |
| `executions/` | Execution state. |

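
If you script against the data root, the layout above maps directly onto path helpers. The sketch below is illustrative only: `persona_paths` and `list_cohort_ids` are hypothetical names, not part of the Mass CLI, and the cohort listing simply mirrors the documented behavior of discovering cohorts by listing `cohorts/`.

```python
from pathlib import Path
from typing import Dict, List


def persona_paths(data_root: Path, persona_id: str) -> Dict[str, Path]:
    """Build the persona file path and its two optional sidecar paths."""
    base = data_root / "personas" / persona_id
    return {
        "persona": base / f"{persona_id}.json",
        "memories": base / f"{persona_id}_memories.json",  # optional
        "state": base / f"{persona_id}_state.json",        # optional
    }


def list_cohort_ids(data_root: Path) -> List[str]:
    """Return cohort ids, taken from the *.json file names in cohorts/."""
    cohorts_dir = data_root / "cohorts"
    if not cohorts_dir.is_dir():
        return []
    return sorted(p.stem for p in cohorts_dir.glob("*.json"))
```

Because the memories and state files are optional, check `path.exists()` before reading them.
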
At the data root you may also see:

- `index.json` – Persona index (optional; used for name deduplication and lookups in some flows).
- `index_extended.json` – Extended persona index (optional).
- `connections.json` – Persona connections (optional).

These are not required for basic CLI use (cohort create, persona create/list/delete, chat, report generate/list/show/delete). The app will create and use them as needed for certain operations.
You may see files such as `stats.json`, `stats_extended.json`, `stats.australia.json`, `stats_coherence.australia.json`, and similar. These are not used by the Mass CLI; they are aggregate summaries (e.g. demographic counts, validation averages) produced by external or one-off tooling for human review.
- You do not need to keep all of them. Keep only what you use for analysis.
- The same information can be shown in Markdown for easier reading in the repo. See Data stats example for a Markdown format; if your stats are generated by a script, consider outputting `.md` as well as, or instead of, `.json`.
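
For a script that already writes a stats JSON file, emitting a Markdown table alongside it can be quite small. The sketch below is only one possible approach: the function names are hypothetical, the Metric/Value column headers are an arbitrary choice, and it assumes the stats are a (possibly nested) JSON object, since the actual shape of these files is not specified here.

```python
import json
from typing import Any, Dict, Iterator, Tuple


def flatten(obj: Any, prefix: str = "") -> Iterator[Tuple[str, Any]]:
    """Yield (dotted_key, value) pairs for every leaf of a nested dict."""
    if isinstance(obj, dict):
        for key, value in obj.items():
            yield from flatten(value, f"{prefix}.{key}" if prefix else str(key))
    else:
        yield prefix, obj


def stats_to_markdown(stats: Dict[str, Any]) -> str:
    """Render a stats object as a two-column Markdown table."""
    lines = ["| Metric | Value |", "|---|---|"]
    for name, value in flatten(stats):
        # json.dumps keeps numbers, strings, and lists unambiguous.
        lines.append(f"| {name} | {json.dumps(value)} |")
    return "\n".join(lines) + "\n"
```

For example, `stats_to_markdown({"count": 3, "age": {"mean": 41.5}})` produces rows for `count` and `age.mean`. Values containing a `|` character would need escaping before they are safe inside a table cell.
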

In summary:

- Persona and cohort data: `personas/`, `cohorts/`.
- Conversation and reports: `workspaces/`, `reports/`, `report_responses/`.
- Internal state: `tasks/`, `executions/`.

See Creating cohorts and personas for the structure of persona files and optional sidecars.