@sfc-gh-tteixeira commented Sep 24, 2025

Summary

Make it possible for st.fragment functions to run in parallel threads.

Problem statement

Dashboards are one of the most common classes of apps in Streamlit. In a dashboard, data is typically loaded, then transformed (sometimes after some user input), then finally displayed as charts and other widgets.

It's very common for the load-transform code path of any given chart to be completely distinct from the code paths of other charts. However, these code paths typically execute sequentially, which leads to a slow loading pattern for the app, where each section only loads once the previous one is done.

Toy example:

import time

import numpy as np
import streamlit as st

def load_user_growth():
    time.sleep(1)
    return np.random.randn(100, 2)

def load_revenue_growth():
    time.sleep(1)
    return np.random.randn(100, 2)

def load_expenses_growth():
    time.sleep(1)
    return np.random.randn(100, 2)

def transform_user_growth(arr, x):
    time.sleep(1)
    return arr + x

def transform_revenue_growth(arr, x):
    time.sleep(1)
    return arr - x

def transform_expenses_growth(arr, x):
    time.sleep(1)
    return arr * x

slider1 = st.slider("Pick a number", 0, 1000, 123)
slider2 = st.slider("Pick a second number", 0, 1000, 456)

arr1 = load_user_growth()
arr1 = transform_user_growth(arr1, slider1)
st.line_chart(arr1)

arr2 = load_revenue_growth()
arr2 = transform_revenue_growth(arr2, slider2)
st.line_chart(arr2)

arr3 = load_expenses_growth()
arr3 = transform_expenses_growth(arr3, slider2)
st.line_chart(arr3)

In this app, each step runs sequentially after the previous one is done, so the whole thing takes 6s to draw:

flowchart
	s1@{ label: "slider1" }
	s2@{ label: "slider2" }
	l1@{ label: "load_user_growth (1s)" }
	l2@{ label: "load_revenue_growth (1s)" }
	l3@{ label: "load_expenses_growth (1s)" }
	t4@{ label: "transform_user_growth (1s)" }
	t5@{ label: "transform_revenue_growth (1s)" }
	t6@{ label: "transform_expenses_growth (1s)" }
	d7@{ label: "st.line_chart" }
	d8@{ label: "st.line_chart" }
	d9@{ label: "st.line_chart" }
	startCircle@{ shape: "circle", label: "Start" }
	endCircle@{ shape: "circle", label: "End" }
	startCircle --> s1
	s1 --> s2
	s2 --> l1
	l1 --> t4
	l2 --> t5
	l3 --> t6
	t4 --> d7
	t5 --> d8
	t6 --> d9
	d7 --> l2
	d8 --> l3
	d9 --> endCircle
	style s1 fill:#e0f2fe
	style s2 fill:#e0f2fe
	style l1 fill:#fce7f3
	style l2 fill:#fce7f3
	style l3 fill:#fce7f3
	style t4 fill:#ecfccb
	style t5 fill:#ecfccb
	style t6 fill:#ecfccb
	style d7 fill:#fef9c3
	style d8 fill:#fef9c3
	style d9 fill:#fef9c3
	style startCircle fill:#eee
	style endCircle fill:#eee
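The 6-second figure above can be sanity-checked with a scaled-down, Streamlit-free sketch (sleeps shortened to 0.05s so it runs quickly; the thread-per-chart variant previews the parallel flow proposed below):

```python
import threading
import time

SLEEP = 0.05  # scaled-down stand-in for each 1s load/transform step

def build_chart():
    time.sleep(SLEEP)  # load
    time.sleep(SLEEP)  # transform

# Sequential: 3 charts x 2 steps = 6 sleeps of wall time.
start = time.perf_counter()
for _ in range(3):
    build_chart()
sequential = time.perf_counter() - start

# Parallel: each chart gets its own thread, so wall time is ~2 sleeps.
start = time.perf_counter()
threads = [threading.Thread(target=build_chart) for _ in range(3)]
for t in threads:
    t.start()
for t in threads:
    t.join()
parallel = time.perf_counter() - start

print(f"sequential: {sequential:.2f}s, parallel: {parallel:.2f}s")
```

With the real 1-second sleeps, the same arithmetic gives 6s sequential vs. ~2s parallel.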

Desired flow

Given that these code paths are so distinct, it would make much more sense to run them in parallel. What would be a simple, Streamlit-y API that is powerful enough to cover the most common patterns?

That is, we want the app to load like this:

flowchart
	s1@{ label: "slider1" }
	s2@{ label: "slider2" }
	l1@{ label: "load_user_growth (1s)" }
	l2@{ label: "load_revenue_growth (1s)" }
	l3@{ label: "load_expenses_growth (1s)" }
	t4@{ label: "transform_user_growth (1s)" }
	t5@{ label: "transform_revenue_growth (1s)" }
	t6@{ label: "transform_expenses_growth (1s)" }
	d7@{ label: "st.line_chart" }
	d8@{ label: "st.line_chart" }
	d9@{ label: "st.line_chart" }
	startCircle@{ shape: "circle", label: "Start" }
	endCircle@{ shape: "circle", label: "End" }
	startCircle --> s1
	s1 --> s2
	s2 --> l1
	s2 --> l2
	s2 --> l3
	l1 --> t4
	l2 --> t5
	l3 --> t6
	t4 --> d7
	t5 --> d8
	t6 --> d9
	d7 --> endCircle
	d8 --> endCircle
	d9 --> endCircle
	style s1 fill:#e0f2fe
	style s2 fill:#e0f2fe
	style l1 fill:#fce7f3
	style l2 fill:#fce7f3
	style l3 fill:#fce7f3
	style t4 fill:#ecfccb
	style t5 fill:#ecfccb
	style t6 fill:#ecfccb
	style d7 fill:#fef9c3
	style d8 fill:#fef9c3
	style d9 fill:#fef9c3
	style startCircle fill:#eee
	style endCircle fill:#eee

Goals

  1. Make it possible to load parts of the app in separate threads.

  2. Very easy to use.

  3. Covers major use-cases.

  4. Does not break existing apps.

  5. The solution should not preclude a great solution to a separate problem: updating disconnected parts of the app.

    In the example code above, when a user moves one of the sliders, the entire app reruns. How can we make sure only the parts that depend on that slider rerun instead? Today we tell users to use st.fragment, but since the sliders and the charts are not in contiguous parts of the app, fragments will not help here. This is a problem we'd like to solve in a future STEP, but in true Streamlit form we'd like these solutions (and all our other primitives) to feel like part of the same system.

Non-goals

  1. Covering every possible use-case.

Proposed solution

To address Goal 1, let's extend the fragment primitive to support parallel execution, so the example above looks more like this:

(NOTE: Ignore the exact API right now)

import time

import numpy as np
import streamlit as st

def load_user_growth():
    time.sleep(1)
    return np.random.randn(100, 2)

def load_revenue_growth():
    time.sleep(1)
    return np.random.randn(100, 2)

def load_expenses_growth():
    time.sleep(1)
    return np.random.randn(100, 2)

def transform_user_growth(arr, x):
    time.sleep(1)
    return arr + x

def transform_revenue_growth(arr, x):
    time.sleep(1)
    return arr - x

def transform_expenses_growth(arr, x):
    time.sleep(1)
    return arr * x

slider1 = st.slider("Pick a number", 0, 1000, 123)
slider2 = st.slider("Pick a second number", 0, 1000, 456)

@st.fragment(parallelize=True)
def chart1():
    arr1 = load_user_growth()
    arr1 = transform_user_growth(arr1, slider1)
    st.line_chart(arr1)

@st.fragment(parallelize=True)
def chart2():
    arr2 = load_revenue_growth()
    arr2 = transform_revenue_growth(arr2, slider2)
    st.line_chart(arr2)

@st.fragment(parallelize=True)
def chart3():
    arr3 = load_expenses_growth()
    arr3 = transform_expenses_growth(arr3, slider2)
    st.line_chart(arr3)

chart1()
chart2()
chart3()

API

How should we declare that a given fragment can be executed in a parallel thread?

Option 1: New keyword argument

Signature

st.fragment(func=None, *, run_every=None, parallelize=False)

(The default is False so that existing apps keep their current behavior, per Goal 4.)

Usage

@st.fragment(parallelize=True, ...)
def my_fragment():
  ...

Pros

  • Doesn't introduce a new primitive in Streamlit
  • Very discoverable
  • ?

Cons

  • A bit wordy
  • ?

Naming

  1. parallelize
  2. thread
  3. background
  4. bg
  5. async (note: async is a reserved word in Python, so it can't be a keyword argument)
  6. task
  7. daemon
  8. background_task
  9. run_in_thread
  10. run_in_parallel
  11. run_in_background
  12. run_in_bg
  13. run_async
Option 2: New decorator

Signature

st.parallel_fragment(func=None, *, run_every=None)

Usage

@st.parallel_fragment
def my_fragment():
  ...

Pros

  • Very discoverable
  • ?

Cons

  • Introduces a new flow control primitive in Streamlit.

    People tend to be confused by the primitives we already support (cache_resource, cache_data, fragment, form), so I'd rather not make things more complicated for them.

  • ?

Naming

  1. @st.parallel_fragment
  2. @st.threaded_fragment
  3. @st.async_fragment
  4. @st.thread
  5. @st.fragment_thread
  6. @st.daemon
  7. @st.task
  8. @st.async (same reserved-word problem: st.async is a syntax error)
Option 3: Async def ✅ CURRENT FAVORITE

The idea of Option 3 is that you declare a parallel fragment using async def instead of def.

Signature

With this option, there would be no change to the @st.fragment signature:

st.fragment(func=None, *, run_every=None)

Usage

@st.fragment
async def my_fragment():
   ...

Pros

  • Doesn't introduce a new primitive in Streamlit
  • [Opinion] Feels really natural
  • ?

Cons

  • Harder to discover
  • This somewhat stretches the definition of async in Python
  • ?
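For what it's worth, detecting the async marker is cheap. A minimal sketch of how a decorator could branch on it (the `fragment` decorator below is a toy stand-in, not Streamlit's actual implementation, and a real version would schedule the coroutine on a worker thread or event loop rather than just running it):

```python
import asyncio
import inspect
from functools import wraps

def fragment(func):
    """Toy stand-in for @st.fragment that branches on `async def`."""
    if inspect.iscoroutinefunction(func):
        @wraps(func)
        def wrapper(*args, **kwargs):
            # Here we just run the coroutine to completion; a real
            # implementation would hand it to a background executor.
            return asyncio.run(func(*args, **kwargs))
        wrapper.parallel = True
        return wrapper
    func.parallel = False
    return func

@fragment
def sync_frag():
    return "sync"

@fragment
async def async_frag():
    return "async"

print(sync_frag.parallel, async_frag.parallel)  # False True
print(sync_frag(), async_frag())                # sync async
```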

Design

This is a Python-only feature. No impact on design.

Behavior

The return value of an async fragment is ignored.

Another option would be to return a Future or to somehow stuff the return value into Session State, but it's unclear that any of this is needed. So let's leave this feature out for now and see if there's a need. We can always add this later.
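If return values ever turn out to matter, the Future option could look roughly like this pure-Python sketch using concurrent.futures (illustrative only, not a proposed Streamlit API):

```python
from concurrent.futures import ThreadPoolExecutor

def load_and_transform(x):
    # Stand-in for a fragment body that produces a value.
    return [v + x for v in (1, 2, 3)]

executor = ThreadPoolExecutor(max_workers=3)

# Calling a "parallel fragment" would return a Future immediately...
future = executor.submit(load_and_transform, 10)

# ...and the caller would block on .result() only when the value is needed.
print(future.result())  # [11, 12, 13]
executor.shutdown()
```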

Other solutions considered

Just use threads

Today, if you use a Thread in Streamlit you need to do some magic with the script run context. We plan on fixing that soon. Once we do, you'll be able to address Goal 1 with pure Python, as shown below. So why add another Streamlit primitive?

import threading

def chart1():
    arr1 = load_user_growth()
    arr1 = transform_user_growth(arr1, slider1)
    st.line_chart(arr1)

def chart2():
    arr2 = load_revenue_growth()
    arr2 = transform_revenue_growth(arr2, slider2)
    st.line_chart(arr2)

def chart3():
    arr3 = load_expenses_growth()
    arr3 = transform_expenses_growth(arr3, slider2)
    st.line_chart(arr3)

threads = [
    threading.Thread(target=chart1),
    threading.Thread(target=chart2),
    threading.Thread(target=chart3),
]
for t in threads:
    t.start()

# Join so the script run doesn't finish before the charts are drawn.
for t in threads:
    t.join()
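As background, the "magic" mentioned above exists because Streamlit keeps the script-run context in thread-local storage, which a newly spawned Thread does not inherit (in Streamlit today, attaching it is what add_script_run_ctx handles). A Streamlit-free analogue of the problem, and of the usual fix of explicitly handing the context to the new thread:

```python
import threading

# Stand-in for Streamlit's per-thread script-run context.
_ctx = threading.local()
_ctx.value = "main-script-ctx"

results = {}

def worker_without_ctx():
    # Thread-locals are not inherited: the attribute is missing here.
    results["bare"] = getattr(_ctx, "value", None)

def worker_with_ctx(ctx_value):
    # The fix: explicitly re-attach the context in the new thread.
    _ctx.value = ctx_value
    results["attached"] = _ctx.value

t1 = threading.Thread(target=worker_without_ctx)
t2 = threading.Thread(target=worker_with_ctx, args=(_ctx.value,))
for t in (t1, t2):
    t.start()
    t.join()

print(results)  # {'bare': None, 'attached': 'main-script-ctx'}
```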

Pros

  1. It's just Python!
  2. ?

Cons

  1. The syntax is a little contrived
  2. Arguably, solutions packaged as an st.command are better at nudging developers to actually
    use them. But it's possible that this is just a matter of documentation.
  3. ?

Major difference

In the end, the thing that's inserted in the app is not a fragment, which means that when the
user interacts with widgets inside that block, they cause a full rerun of the script. This may
be desired in some situations, but my hypothesis is that in most cases it would be better to
rerun just that "block" of the app.

In this scenario, you could turn on fragment behavior by using @st.fragment:

import threading

@st.fragment
def chart1():
    arr1 = load_user_growth()
    arr1 = transform_user_growth(arr1, slider1)
    st.line_chart(arr1)

@st.fragment
def chart2():
    arr2 = load_revenue_growth()
    arr2 = transform_revenue_growth(arr2, slider2)
    st.line_chart(arr2)

@st.fragment
def chart3():
    arr3 = load_expenses_growth()
    arr3 = transform_expenses_growth(arr3, slider2)
    st.line_chart(arr3)

threads = [
    threading.Thread(target=chart1),
    threading.Thread(target=chart2),
    threading.Thread(target=chart3),
]
for t in threads:
    t.start()

# Join so the script run doesn't finish before the charts are drawn.
for t in threads:
    t.join()

Note: I don't know if this would actually work! Needs to be verified.

Metrics

Impact on metrics:

The hope is that this would make a certain class of apps faster. However, this may be hard to measure, since we'd need to compare performance metrics from before and after the change.

Requires new metrics:

If going with Option 3, we'll need to add some telemetry logic to be able to tell how much usage this feature is getting.

Options 1 and 2, on the other hand, should be tracked automatically by the current telemetry logic.

Implementation

Once there's a prototype implementation, we'll link its GitHub branch here.

Appendix

@sfc-gh-tteixeira

To people commenting: if you have a real-world use-case, please comment below!
