Process large sets of metadata records in chunks to avoid long-running transactions and memory issues#9319
Open
josegar74 wants to merge 2 commits into
Open
Conversation
…mponent that processes large sets of metadata records in chunks to avoid long-running transactions and memory issues.
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
This change request introduces a generic infrastructure to handle batch processing of items within database transactions, addressing code duplication found in several parts of the application where large sets of metadata records are being processed in chunks to avoid long-running transactions and memory issues.
Two new components were added to the
jeeves.transactionpackage:BatchItemProcessor<T>: A functional interface that defines the processing logic for a single item of typeT. It allows for clean, lambda-based implementations of business logic.BatchTransactionalProcessor<T>: The engine that orchestrates the batching. It partitions a collection of items (using Guava'sIterables.partition) and executes each batch within a new transaction managed byTransactionManager.setBatchSize(int).TransactionManager.runInTransactionwithCREATE_NEWpropagation andALWAYS_COMMITbehavior for each batch.XSLT processing and batch editing API's have been updated to use these new components
Checklist
mainbranch, backports managed with labelREADME.mdfilespom.xmldependency management. Update build documentation with intended library use and library tutorials or documentation