Process large sets of metadata records in chunks to avoid long-running transactions and memory issues by josegar74 · Pull Request #9319 · geonetwork/core-geonetwork

josegar74 · 2026-06-08T16:27:30Z

This pull request introduces generic infrastructure for batch processing of items within database transactions. It removes code duplicated across several parts of the application, where large sets of metadata records are processed in chunks to avoid long-running transactions and memory issues.

Two new components were added to the jeeves.transaction package:

BatchItemProcessor<T>: A functional interface that defines the processing logic for a single item of type T. It allows for clean, lambda-based implementations of business logic.
```
@FunctionalInterface
public interface BatchItemProcessor<T> {
    void process(T item) throws Exception;
}
```
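As a rough illustration of the lambda-based style this interface allows, the sketch below builds a processor from a lambda. The interface is redeclared so the snippet compiles on its own; `ProcessorExamples`, `recordingProcessor` and the use of `Integer` ids are hypothetical names chosen for this example and are not taken from the PR.

```java
import java.util.ArrayList;
import java.util.List;

// Same shape as the interface in the PR, redeclared so the sketch is self-contained.
@FunctionalInterface
interface BatchItemProcessor<T> {
    void process(T item) throws Exception;
}

class ProcessorExamples {
    // Hypothetical helper: returns a processor that records every id it receives.
    // Real business logic (e.g. updating a metadata record) would go in the lambda body.
    static BatchItemProcessor<Integer> recordingProcessor(List<Integer> sink) {
        return id -> {
            if (id == null) {
                throw new IllegalArgumentException("metadata id must not be null");
            }
            sink.add(id);
        };
    }
}
```

Because `process` declares `throws Exception`, the lambda body may call code that throws checked exceptions without wrapping them; the caller decides how to handle them.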
BatchTransactionalProcessor<T>: The engine that orchestrates the batching. It partitions a collection of items (using Guava's Iterables.partition) and executes each batch within a new transaction managed by TransactionManager.
- Configurable Batch Size: Defaults to 100, but can be adjusted via setBatchSize(int).
- Transaction Management: Uses TransactionManager.runInTransaction with CREATE_NEW propagation and ALWAYS_COMMIT behavior for each batch.
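The batching behaviour described above could look roughly like the following. This is a minimal sketch under stated assumptions, not the code from the PR: `TransactionRunner` and `BatchingSketch` are hypothetical stand-ins (the PR uses `TransactionManager.runInTransaction` and Guava's `Iterables.partition`), partitioning is done with `List.subList` to keep the example dependency-free, and wrapping item failures in a `RuntimeException` is an assumed policy.

```java
import java.util.ArrayList;
import java.util.List;

// Same shape as the interface in the PR, redeclared so the sketch is self-contained.
@FunctionalInterface
interface BatchItemProcessor<T> {
    void process(T item) throws Exception;
}

// Hypothetical stand-in for a transaction boundary. In the PR this role is played by
// TransactionManager.runInTransaction with CREATE_NEW propagation and ALWAYS_COMMIT.
@FunctionalInterface
interface TransactionRunner {
    void runInNewTransaction(Runnable work);
}

class BatchingSketch<T> {
    private int batchSize = 100; // default stated in the PR description
    private final TransactionRunner runner;

    BatchingSketch(TransactionRunner runner) {
        this.runner = runner;
    }

    void setBatchSize(int batchSize) {
        if (batchSize <= 0) {
            throw new IllegalArgumentException("batchSize must be positive");
        }
        this.batchSize = batchSize;
    }

    // Splits the items into consecutive chunks of at most batchSize elements and
    // runs each chunk inside its own call to the transaction runner.
    void processAll(List<T> items, BatchItemProcessor<T> processor) {
        for (int start = 0; start < items.size(); start += batchSize) {
            int end = Math.min(start + batchSize, items.size());
            List<T> batch = items.subList(start, end);
            runner.runInNewTransaction(() -> {
                for (T item : batch) {
                    try {
                        processor.process(item);
                    } catch (Exception e) {
                        // Assumed policy for this sketch: abort the batch on first failure.
                        throw new RuntimeException(e);
                    }
                }
            });
        }
    }
}
```

With a batch size of 2 and five items, this sketch opens three transactions (2 + 2 + 1 items), so a failure or long-running item only affects the batch it belongs to rather than the whole run.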

The XSLT processing and batch editing APIs have been updated to use these new components.

Extract common batch transactional processing logic into a generic co…

00437a1

…mponent that processes large sets of metadata records in chunks to avoid long-running transactions and memory issues.

josegar74 added this to the 4.4.12 milestone Jun 8, 2026

josegar74 requested review from GeoSander, fxprunayre and juanluisrp June 8, 2026 16:27

josegar74 changed the title ~~Processes large sets of metadata records in chunks to avoid long-running transactions and memory issues.~~ Processes large sets of metadata records in chunks to avoid long-running transactions and memory issues Jun 8, 2026

josegar74 changed the title ~~Processes large sets of metadata records in chunks to avoid long-running transactions and memory issues~~ Process large sets of metadata records in chunks to avoid long-running transactions and memory issues Jun 8, 2026

josegar74 marked this pull request as draft June 8, 2026 17:17

Fix unit tests

48f914f

josegar74 marked this pull request as ready for review June 9, 2026 15:14

josegar74 wants to merge 2 commits into geonetwork:main from GeoCat:44-xslprocessing-batchedit-transactions
