perf(objstore): S3/object-store scan is far too slow on large buckets

## Problem Statement

Scanning large S3 / object-store buckets is extremely slow. A real scan of **291,422 objects took ~3 hours 14 minutes**. The bottlenecks are in `packages/server/engine-lib/engLib/store/endpoints/objstore/base/scan.cpp`:

- Listing relied on `StartAfter` plus page **result-equality heuristics** to detect the end of a listing instead of proper `ListObjectsV2` continuation-token pagination — fragile and wasteful.
- A **fresh S3 client was created per scan call** (including each recursive sub-prefix scan), so the HTTP connection pool was not reused across scanner threads.
- The default `ClientConfiguration.maxConnections` **serialized** the scanner threads.
- Every object incurred a separate **`HeadObject` (Content-Type) round-trip**.

## Proposed Solution

- Walk listings with `ListObjectsV2` **continuation-token pagination**.
- **Cache and share one S3 client** across scanner threads (mutex-guarded, reset on list errors so the next scan re-connects).
- Raise `ClientConfiguration.maxConnections` to **64** so the shared pool does not serialize threads.
- **Drop the per-object `Content-Type` HEAD**; fetch owner metadata inline via `ListObjectsV2 FetchOwner`.

This also benefits the generic `objstore` connector, which inherits the same base scan.

## Affected Modules

- [x] server (C++ engine)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

perf(objstore): S3/object-store scan is far too slow on large buckets #1208

Problem Statement

Proposed Solution

Affected Modules

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

perf(objstore): S3/object-store scan is far too slow on large buckets #1208

Description

Problem Statement

Proposed Solution

Affected Modules

Metadata

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Issue actions