-
Notifications
You must be signed in to change notification settings - Fork 9
Closed
Labels
bugSomething isn't workingSomething isn't workinghelp wantedExtra attention is neededExtra attention is needed
Milestone
Description
Summary
YouTube timedtext (subtitles) requests during transcript ingestion sometimes return 429 Too Many Requests because the ingest plugin performs rapid sequential fetches with no retry/backoff and no configurable inter-request delay.
Steps to reproduce
- Submit multiple videos for transcript ingestion in rapid succession (e.g., 10–20 videos with ~5–15 seconds between submissions).
- Observe ingestion jobs failing with
429 Client Error: Too Many Requests for url: https://www.youtube.com/api/timedtext?.... - Confirm subtitle download path uses
yt-dlpto findsubtitles/automatic_captionsand thenrequests.get(subtitle_url)to fetch the VTT.
I think that we just need to ingest videos using some kind of queue, one by one, with a backoff mechanism (otherwise we will trigger the 429 error)

Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
bugSomething isn't workingSomething isn't workinghelp wantedExtra attention is neededExtra attention is needed