Adds batch processing support #590
Conversation
Hey @mhenrixon, thanks for this! I haven't looked at it yet, but there's a previous PR for batch support, #142, with a discussion there. It's a work in progress as well, but maybe that discussion is relevant for this, too.
This builds on some work I did for Sidekiq, actually. I had to put a database in front of Sidekiq to make it work the way I wanted and to ensure that no duplicates are processed while some work still needs to happen at the end.
Hey @rosa, so I had a look at the other PR! Admittedly, I am biased, but I feel that my suggestion is more in line with the code I have already seen in solid_queue. I don't care either way as long as the functionality ends up in a release in the not-so-distant future, though. Would @jpcamara be open to sharing notes and getting the feature across the finish line? I am more than happy to support you in finishing up the work in your branch.
@mhenrixon it would be good to change the API to support batch enqueue using a block as well, so that:
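For example, something along these lines (a hypothetical sketch: `SolidQueue::Batch.enqueue` with an `on_finish:` option is modeled on GoodJob's batch API, not necessarily what this PR or #142 actually expose):

```ruby
# Enqueue a batch with a block; every job enqueued inside the block
# becomes part of the batch (hypothetical API shape).
batch = SolidQueue::Batch.enqueue(on_finish: BatchFinishedJob) do
  10.times { |i| ProcessItemJob.perform_later(i) }
end
```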
Which would support a use case we have.

Job enqueue using a block is possible in #142.
I'll continue the work this weekend; things have been super busy at work developing a Hotwire Native app. Thanks for letting me know about this.
I have some free time this coming week to put together some notes and figure out next steps. I think neither of our PRs fully matches the Solid Queue model: it has core tables and then execution models, while both of our PRs kind of dump everything into one hot-path table. Maybe that's OK, I'm not sure yet. I want to re-evaluate my approach and line it up better with the overall architecture of SQ. I'd be happy to try to work together on this.
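For context, a simplified sketch of the core-table-plus-execution-models split the comment refers to (the table names follow Solid Queue's actual schema, but the code is illustrative only):

```ruby
# One wide core table holds the full job payload...
class Job < ActiveRecord::Base
  self.table_name = "solid_queue_jobs"
  has_one :ready_execution
  has_one :failed_execution
end

# ...and narrow per-state execution tables act as the hot-path work queues
# that workers poll, so polling never scans the full jobs table. A batch
# table could follow the same split instead of living in one hot table.
class ReadyExecution < ActiveRecord::Base
  self.table_name = "solid_queue_ready_executions"
  belongs_to :job
end
```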
Gonna dig into this PR and your approach as well; I've only reviewed it a bit so far.
Let me know if you want to team up on this. It is a much-needed feature.
@jpcamara @mhenrixon did you connect on this, or find a new direction? I'm keen to support in any way.
* Thanks to Mikael Henriksson for his work in rails#590. His work decentralizes management of batch status by moving it to the BatchUpdateJob, and tracks status using counts rather than querying specific job statuses after the fact. This is a much simpler approach to tracking the jobs, and allows us to avoid a constantly polling set of queries in the dispatcher. It also adds arbitrary metadata to allow tracking data from start to end of execution. This means enqueueing a BatchUpdateJob based on callbacks in two different kinds of Batchable, which are included when a job is updated and finished, or when a FailedExecution is created (since failed jobs never "finish").
* This batch feature already took some inspiration from the GoodJob batch implementation (https://github.com/bensheldon/good_job). Now we increase that by adopting some of the buffering and abstractions in a similar form to GoodJob. To discourage heavy reliance on the JobBatch model, it has been renamed to BatchRecord, and a separate Batch interface is how you interact with batches, with some delegation to the core model.
* A new Buffer class (also modeled after GoodJob) was added specifically for batches, primarily to support enqueue_after_transaction_commit. We now override the ActiveJob #enqueue method so we can keep track of which jobs are attempting to enqueue. When enqueue_after_transaction_commit is on, those jobs do not enqueue until all transactions commit. By tracking them at the high-level enqueue and keeping a buffer of jobs, we can ensure that the jobs get tracked even when their creation is deferred until the transaction is committed. A side benefit is that we get to enqueue all the jobs together, probably offering some performance advantage. This buffer also keeps track of child batches for the same reason.
* To support triggering a callback/BatchUpdateJob when a job finishes, the update to finished_at needed to become an update! call.
* As a simplification, on_failure is now only fired after all jobs finish, rather than the first time a job fails.
* The adapter logic also needed to be updated to support the buffer and enqueue_after_transaction_commit. If a job is coming from a batch enqueue, we ignore it here and allow the batching process to enqueue_all at the end of the enqueue block. If the job is originally from a batch but is retrying, we make sure the job counts in the batch stay updated. I don't love this addition, since it adds a lot of complication to the adapter code, all solely oriented around batches.
* Batches benefit from keeping jobs until the batch has finished. As such, we ignore the preserve-jobs setting; but if it is set to false, we enqueue a cleanup job once the batch has finished and clear out finished jobs.

Co-authored-by: Mikael Henriksson <[email protected]>
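A minimal sketch of the count-based tracking described in the first bullet, assuming a `pending_jobs` counter column on `BatchRecord` (the names and callback wiring here are illustrative; the real code in #142 may differ):

```ruby
module Batchable
  extend ActiveSupport::Concern

  included do
    # Fires when a job transitions to finished; creating a FailedExecution
    # would trigger a similar hook, since failed jobs never "finish".
    after_update :enqueue_batch_update,
                 if: -> { batch_id? && finished_at_previously_changed? }
  end

  private

  def enqueue_batch_update
    # Atomic SQL decrement, so concurrent workers can't lose updates.
    BatchRecord.update_counters(batch_id, pending_jobs: -1)
    BatchUpdateJob.perform_later(batch_id)
  end
end

class BatchUpdateJob < ActiveJob::Base
  def perform(batch_id)
    batch = BatchRecord.find(batch_id)
    # No polling in the dispatcher: the batch finishes itself
    # when the last job reports in.
    batch.finish! if batch.pending_jobs.zero?
  end
end
```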
I'm closing this now that my ideas have been incorporated into #142.
This builds on some work that I did for Sidekiq.
I had to put a database in front of Sidekiq to make it work as I wanted and to ensure that no duplicates are processed, while some tasks need to occur at the end.
There are several factors to consider, including how to track pending jobs. A counter might be better replaced by a simple query, depending on how fast the jobs are processed, since concurrent increments could cause side effects.
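To illustrate the trade-off (hypothetical names; this assumes a `pending_jobs` counter column on the batch and a `finished_at` timestamp on jobs):

```ruby
# Option 1: counter column. Cheap to check, but every finishing job must
# decrement it atomically in SQL, or concurrent updates will lose writes.
BatchRecord.update_counters(batch.id, pending_jobs: -1)
finished = batch.reload.pending_jobs.zero?

# Option 2: plain query. No shared counter to get out of sync, but it runs
# a COUNT on every check, which adds up when jobs finish quickly.
finished = batch.jobs.where(finished_at: nil).none?
```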
This is intended to initiate a discussion. There are other ways of handling this, but none that I like. This is the only approach that I can think of that doesn't have too many negatives.
The only negative is the database changes.