Add missing cases when the pipeline should be stopped #3564

leszko · 2025-05-13T12:48:02Z

No description provided.

j0sh · 2025-05-13T15:54:28Z

server/ai_live_video.go

@@ -63,6 +63,7 @@ func startTricklePublish(ctx context.Context, url *url.URL, params aiRequestPara
 			if err := publisher.Close(); err != nil {
 				clog.Infof(ctx, "Error closing trickle publisher. err=%v", err)
 			}
+			params.liveParams.stopPipeline(fmt.Errorf("publisher is closed"))


Technically it is the segment reader that is closing; the trickle publisher is being closed as a side effect. I also wonder how informative this will actually be; typically the segment reader is being closed for another reason that is already reported upstream, eg WHIP disconnect.

Yeah, but shouldn't we just stop the pipeline in any failure? How would the pipeline function if the publisher is closed?

I'd try to avoid this leaking, because in this case, the publisher is stopped, but we still have the whole pipeline running.

We are closing the publisher here as a result of the segment reader closing, which happens because the ingest connection has terminated. The publisher is not closing spontaneously on its own. That isn't a failure, it's just a normal teardown. That only happens in one place per ingest method (mediamtx) (whip).

To be clear, this should be harmless, I just don't know how much value it will add vis-a-vis being noise in the metabase event stream.

Even if no-op right now, I think it's better to make a clear code flow. If we stop publishing or subscribing any data, just stop everything.

Otherwise, it's confusing, I've started to detach the ingest closing from the trickle publish (for O Swapping) and suddenly realized, "hey the old publisher is still working, why?". So, I'd like to avoid such pitfalls. We may not print an error message, we may not send an error event, but I think we should have a clear condition for stopping everything if any of the trickle parts is not working.

I think we should have a clear condition for stopping everything if any of the trickle parts is not working.

Sure, but as far as I can tell, the only reason trickle is "not working" at this point is because the input has already stopped. Do we need to stop again? Do you see a case where it might not be stopped?

Generally I would prefer not to sprinkle cleanup functions around without fully understanding the code paths that could lead to those. For example, it is very helpful to have the first error be closest to the root cause. If a teardown "error" happens to be sent out first, then that masks an important piece of information. So we should be careful when adding stopPipeline calls (and preferably propagate errors as far back upstream as possible, because calls like stopPipeline kick off an inherently async teardown process which can make the overall flow hard to trace out.)

Anyway, that being said, if we can be sure that "publisher is closed" won't be the first error sent out under normal conditions then this is probably fine. Also, do you see cases where we'd see this as the first error?

j0sh · 2025-05-13T17:17:21Z

server/ai_live_video.go

@@ -368,6 +370,7 @@ func startControlPublish(ctx context.Context, control *url.URL, params aiRequest
 				}
 				// if there was another type of error, we'll just retry anyway
 			case <-done:
+				params.liveParams.stopPipeline(fmt.Errorf("control publish stopped %w", err))


Normally the input is already disconnected by this point, too, since StopControl (which closes the done channel) is only called once per ingest method, after the input has disconnected (mediamtx)(whip).

The same what I wrote above, maybe it's already handled, but in the different part of the code, I think we should make it clear that if any Trickle part is closed, we stop the pipeline.

Add missing cases when the pipeline should be stopped

1196c48

github-actions bot added go Pull requests that update Go code AI Issues and PR related to the AI-video branch. labels May 13, 2025

leszko requested review from victorges, j0sh and mjh1 May 13, 2025 12:48

mjh1 approved these changes May 13, 2025

View reviewed changes

victorges approved these changes May 13, 2025

View reviewed changes

j0sh reviewed May 13, 2025

View reviewed changes

leszko requested a review from j0sh May 14, 2025 06:35

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add missing cases when the pipeline should be stopped #3564

Add missing cases when the pipeline should be stopped #3564

Uh oh!

leszko commented May 13, 2025 •

edited

Loading

Uh oh!

j0sh May 13, 2025

Uh oh!

leszko May 13, 2025

Uh oh!

j0sh May 13, 2025

Uh oh!

leszko May 14, 2025

Uh oh!

j0sh May 15, 2025

Uh oh!

j0sh May 13, 2025

Uh oh!

leszko May 14, 2025

Uh oh!

Uh oh!

Add missing cases when the pipeline should be stopped #3564

Are you sure you want to change the base?

Add missing cases when the pipeline should be stopped #3564

Uh oh!

Conversation

leszko commented May 13, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

j0sh May 13, 2025

Choose a reason for hiding this comment

Uh oh!

leszko May 13, 2025

Choose a reason for hiding this comment

Uh oh!

j0sh May 13, 2025

Choose a reason for hiding this comment

Uh oh!

leszko May 14, 2025

Choose a reason for hiding this comment

Uh oh!

j0sh May 15, 2025

Choose a reason for hiding this comment

Uh oh!

j0sh May 13, 2025

Choose a reason for hiding this comment

Uh oh!

leszko May 14, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

leszko commented May 13, 2025 •

edited

Loading