Multivariate Structural Statespace Components #529

jessegrabowski · 2025-06-25T14:23:55Z

This PR lifts the requirement that models built with the structural sub-module of PyMC be univariate. It's a chonky PR, so I split it into commits. Most of the files changes are changed by the first commit, which is just reorganization of files. It is safe to ignore that one.

Here are the steps I followed:

The structural module was getting pretty unweildly, so I broke it into a bunch of sub-files. This makes the code easier to find and extend. This is handled in the Reorganize structural model modlue commit
We need tools that can merge different components with potentially different (or overlapping) observed time series. This is handled by the Allow combination of component with different numbers of observed states PR. I am confident this code can be improved.
Each component needs to have new logic implemented to handle the case where there are multiple observed series. Users can optionally pass a list of names to each component as observed_state_names. Every time you add two components together, all the relevant matrices are padded and expanded, and the total observed states are created as a union between the components.

For now, we assume all states in a component follow the same parameterization. It's now also valid to add together the same component twice with different states to work around this (e.g. AutoRegressive(order=1, observed_state_names=['data_1']) + Autoregressive(order=5, observed_state_names=['data_2'])) would be a valid model with 2 observed states, but each has it's own autoregressive dynamics.

When you pass a batch of observed_state_names, e.g. LevelTrend(order=2, observed_state_names=['data_1', 'data_2']), the parameters will all be given a batch dimension, but will otherwise be the same as the base case.

More docs coming, but I tried obsessively document what in there so far.

The logic for extending the components is pretty straight-forward -- mostly copying + block_diag or concat, but there are some corner cases that need attention.

This PR should be seen as a companion to #450. Instead of vectorizing across the computation of a model, we're concatenating models. There will be cases where this is superior -- for example when you want to explicitly model latent interactions between components. But in other cases, this approach will be worse. I am interested in having both.

…ates

AlexAndorra · 2025-06-26T20:42:30Z

AutoRegressive(order=1, observed_state_names=['data_1']) + Autoregressive(order=5, observed_state_names=['data_2'])) would be a valid model with 2 observed states, but each has it's own autoregressive dynamics.

This is cool! I will review ASAP.

Note that #450 is currently blocked by what I think is a pytensor bug

pymc_extras/statespace/models/utilities.py

AlexAndorra

This is 🔥 @jessegrabowski 🤯
I just left a suggestion for what I think was a typo in the docstring. I'll merge once this is resolved, and then test all of this for our PyData tutorial -- probably this weekend.

Just a quick question: IIUC, now users can also have batched RegressionComponents, correct?

AlexAndorra

This is 🔥 @jessegrabowski 🤯
I just left a suggestion for what I think was a typo in the docstring.

Still missing this feature are:

Cycle (currently worked on by @AlexAndorra)
Seasonal
Regression (currently worked on by @Dekermanjian)

We also need to:

Make sure that there are tests that combined LevelTrend + AR + error for two observed variables with no interaction model matches two separate models for each, given the same parameters.
Make sure that pytensor ops are used everywhere for building the SS matrices (no numpy/scipy)

AlexAndorra · 2025-07-02T22:12:51Z

I think I'm done for a first review from you on the Cycle component @jessegrabowski 🍾

2. Adjusted the regression component to allow multivariate regression component specification 3. Added a notebook for quick evaluation of the adjustments and additions made

2. replaced scipy block diag with pytensor block diag 3. Added forecast to test model in multivariate ssm notebook

Added multivariate regression-component

review-notebook-app · 2025-07-05T14:51:46Z

Check out this pull request on

See visual diffs & provide feedback on Jupyter Notebooks.

Powered by ReviewNB

jessegrabowski

@AlexAndorra I left comments for you

Since it's my own PR I can't request changes. It's better in future if you fork the PR branch and open a new PR into this PR, then we can do the usual review workflow on your PR and merge it into this PR when we're ready

pymc_extras/statespace/models/structural/components/cycle.py

tests/statespace/models/structural/components/test_cycle.py

jessegrabowski · 2025-07-06T04:44:41Z

@AlexAndorra @Dekermanjian I want the names of parameters in the components to be really consistent and unsurprising. So please vote on:

For the sigma parameters: name_sigma vs sigma_name
For the initial state parameters :name_initial vs initial_name vs name
For assorted greek things, like rho in Cycle: name_greek vs greek_name vs descriptive_name vs name_descriptive
For shock state names: name vs name_shock vs name_innovation

Concrete examples for (3):
a. business_cycle_rho
b. rho_business_cycle
c. dampening_business_cycle
d. business_cycle_dampening

For (4), I'm talking about the internal state names that will end up as labels for the R and Q matrices, nothing else.

Also for default names, since all the are going to depend on the names in the multivariate case, should we:

Make all the default names simpler. For example LevelTrend -> level_trend, or Cycle[s={cycle}, dampen={dampen}, innovations={innovations}] -> cycle
Keep the complex names for univariate case, but use a simple default name when its multivariate
Do away with default names, and force the user to always pass a name
As 3, but only in the multivariate case

jessegrabowski · 2025-07-10T15:18:11Z

This is really close!

I think basically we need to go through and make sure all the names in all the components follow the agreed format. Let's leave {name}_shock for now, but I will open another PR to change that one since it breaks the pattern.

We just need a test for hidden state decomposition with multiple observed, and I think we're pretty much there.

jessegrabowski · 2025-07-11T11:32:15Z

I added the test for multivariate decomposition. Basically it works, but it leaves a lot to be desired. I will open a separate issue to improve it.

I think this is more or less done. @AlexAndorra and @Dekermanjian , if you could provide a final review then we can merge it.

Also @Dekermanjian what do you want to do with your notebook that got merged in? We can either clean it up a bit to be a simple example, or we can remove it for now and you can do something more elaborate with missingness in the future.

Dekermanjian · 2025-07-11T13:04:54Z

I think this is more or less done. @AlexAndorra and @Dekermanjian , if you could provide a final review then we can merge it.

That is great! I will go through and review it today as soon as I am off from work.

Also @Dekermanjian what do you want to do with your notebook that got merged in? We can either clean it up a bit to be a simple example, or we can remove it for now and you can do something more elaborate with missingness in the future.

I am not sure. I think that if the simple example doesn’t add anything on top of what Alex is working on then it would be best to remove it and do something more elaborate as you say with missingness. I am interested in figuring out how that will all work in a Bayesian State Space framework.

Dekermanjian

I think it is looking great! The main issues I found in my review were related to the doctstring and certain components that still haven't adopted the schema of something_{self.name}. These should be pretty quick to fix. We are almost there!

pymc_extras/statespace/models/structural/components/autoregressive.py

pymc_extras/statespace/models/structural/components/cycle.py

pymc_extras/statespace/models/structural/components/level_trend.py

pymc_extras/statespace/models/structural/components/measurement_error.py

tests/statespace/models/structural/components/test_cycle.py

tests/statespace/models/structural/components/test_seasonality.py

tests/statespace/models/structural/test_core.py

…hema and updated tests in accordance to naming changes

naming schema adherence

AlexAndorra

I believe... this is it guys 🍾

Dekermanjian · 2025-07-14T21:04:36Z

This is looking fantastic! 👏

codecov-commenter · 2025-07-16T00:09:48Z

Welcome to Codecov 🎉

Once you merge this PR into your default branch, you're all set! Codecov will compare coverage reports and display results in all future pull requests.

Thanks for integrating Codecov - We've got you covered ☂️

jessegrabowski · 2025-07-16T15:53:07Z

I'm happy enough with how this looks. There are several immediate follow-ups, including:

Check that VAR, SARIMA, and ETS notebooks were not broken by this
Cut down on the number of tests we're running (especially for forecasting)
Enforce the new naming convention in VAR/SARIMA/ETS

I'm sure I'm forgetting things. This also 100% has bugs remaining, but we'll address them as they arise. Great work @AlexAndorra and @Dekermanjian :D

jessegrabowski added 5 commits June 25, 2025 22:17

Reorganize structural model module

a70b733

Allow combination of components with different numbers of observed st…

b970a6c

…ates

Allow multiple observed in LevelTrend component

7cae487

Allow multiple observed states in measurement error component

bba8431

Allow multiple observed in AutoRegressive component

0a84576

jessegrabowski assigned zaxtax, ricardoV94 and AlexAndorra Jun 25, 2025

jessegrabowski added enhancements New feature or request major statespace labels Jun 25, 2025

AlexAndorra reviewed Jun 27, 2025

View reviewed changes

pymc_extras/statespace/models/utilities.py Show resolved Hide resolved

AlexAndorra approved these changes Jun 27, 2025

View reviewed changes

AlexAndorra self-requested a review June 28, 2025 22:11

AlexAndorra requested changes Jun 28, 2025

View reviewed changes

AlexAndorra and others added 4 commits July 1, 2025 09:37

Fix typo in docstrings

480f4fb

Allow multiple observed in Cycle component

a898eb6

Fix Cycle docstring examples

62d0750

Use pytensor block_diag for Cycle

152e962

Dekermanjian and others added 4 commits July 5, 2025 08:23

1. updated level_trend component coord/param labels

7e9bb07

2. Adjusted the regression component to allow multivariate regression component specification 3. Added a notebook for quick evaluation of the adjustments and additions made

1. removed incorrectly comitted file test_structural.py

c0a4a47

2. replaced scipy block diag with pytensor block diag 3. Added forecast to test model in multivariate ssm notebook

removed incorrectly committed file structural.py

1f3dc3a

Merge pull request #3 from Dekermanjian/multivariate-structural

530f530

Added multivariate regression-component

jessegrabowski added 2 commits July 6, 2025 11:59

Always count names to determine k_endog

0c4590e

LevelTrend state/shock names depend on component name

3c5124d

jessegrabowski commented Jul 6, 2025

View reviewed changes

jessegrabowski added 5 commits July 11, 2025 17:37

Save static shape of last data dim

c15f965

More static shapes

f8e7729

Broken test of decomposition with multiple observed

d38c71b

Don't use pad

ce14343

fix decompose test

503eec5

Dekermanjian reviewed Jul 12, 2025

View reviewed changes

Dekermanjian and others added 8 commits July 12, 2025 10:38

updated leveltrend, seasonal, cycle components to adhere to naming sc…

77c27f4

…hema and updated tests in accordance to naming changes

Merge pull request #5 from Dekermanjian/multivariate-structural

08085c7

naming schema adherence

Delete notebook

083e786

Use nwe name order in autoregressive component

66e0252

Improve docstrings

21c64c6

Refactor test_regression to cover innotations = True

b94c9a3

Fix comment in test_cycle

ec98064

Add docstrings to core and measurement_error

124e1c3

AlexAndorra approved these changes Jul 14, 2025

View reviewed changes

AlexAndorra added 3 commits July 15, 2025 11:48

Improve cycle and seasonal docstrings

9c14472

Fix shape of AR params

16761c7

Some other AR dims fixes

b71354a

Cast all component parameters to lists in Component.__init__

22072ac

jessegrabowski force-pushed the multivariate-structural branch from 2c4d075 to 22072ac Compare July 16, 2025 03:37

jessegrabowski added 2 commits July 16, 2025 11:42

Add type checking and errors for property combination

f1f3d38

re-run structural notebook

13e9174

jessegrabowski merged commit 9c50e94 into pymc-devs:main Jul 16, 2025
17 checks passed

jessegrabowski mentioned this pull request Aug 30, 2025

Add k_endog argument to structrual Components to enable multivariate structural models #485

Closed

Multivariate Structural Statespace Components #529

Multivariate Structural Statespace Components #529

Uh oh!

Conversation

jessegrabowski commented Jun 25, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

AlexAndorra commented Jun 26, 2025

Uh oh!

Uh oh!

AlexAndorra left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

AlexAndorra left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

AlexAndorra commented Jul 2, 2025

Uh oh!

review-notebook-app bot commented Jul 5, 2025

Uh oh!

jessegrabowski left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

jessegrabowski commented Jul 6, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

jessegrabowski commented Jul 10, 2025

Uh oh!

jessegrabowski commented Jul 11, 2025

Uh oh!

Dekermanjian commented Jul 11, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Dekermanjian left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

AlexAndorra left a comment

Choose a reason for hiding this comment

Uh oh!

Dekermanjian commented Jul 14, 2025

Uh oh!

codecov-commenter commented Jul 16, 2025

Welcome to Codecov 🎉

Uh oh!

jessegrabowski commented Jul 16, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

7 participants

jessegrabowski commented Jun 25, 2025 •

edited

Loading

AlexAndorra left a comment •

edited

Loading

AlexAndorra left a comment •

edited

Loading

jessegrabowski commented Jul 6, 2025 •

edited

Loading

Dekermanjian commented Jul 11, 2025 •

edited

Loading