@AlexPeshkoff (Member)
Tasks to be solved:

  • Have a single metadata cache that is used by all attachments in both jrd and dsql.
  • Provide the ability to mix DDL/DML in a single transaction, including working with new/modified objects.

The old metadata cache is essentially single-threaded. When I tried to introduce synchronization into this structure and get away with relatively little effort, the problem I faced was how procedure and function usage is accounted for through a second reference counter: to decide whether an object may be deleted, a global recount of these references across the whole cache is required. The dsql cache is also far from pleasant: it exposes objects that were created in one transaction and are not yet committed to all transactions of the same attachment. An additional consideration is that objects stored in the cache are used in a huge number of places in the code and can be modified from other places (and threads) during MET_scan_xxx. The latter could still be handled with copy-on-write, but how to approach the first two points was unclear.

The new metadata cache is versioned. Each object is divided into two parts, a permanent one and a versioned one. For a relation, for example, the type (table, view, external table, etc.), the name, and all locks (GCLock, partners lock, rescan lock) are permanent, while the format, the list of fields, and the triggers are versioned. The list of versions is linked to the permanent part of the object. As versions become obsolete, the list is cleaned up (though not very actively: cleanup happens only as the cache is accessed, there is no dedicated garbage collector).
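To illustrate the split, here is a minimal sketch of the idea for a relation-like object; the names (RelationPermanent, RelationVersion) and members are illustrative assumptions, not the actual classes in the branch.

```cpp
#include <atomic>
#include <string>
#include <vector>

struct RelationVersion                 // versioned (mutable) part of the object
{
	// stand-ins for the format, field list and triggers mentioned above
	int formatNumber = 0;
	std::vector<std::string> fieldNames;

	RelationVersion* older = nullptr;  // link to the previous, obsolete version
};

struct RelationPermanent               // constant part, lives as long as the cache
{
	int id = 0;                        // e.g. relation id and name never change
	std::string name;                  // the locks (GCLock etc.) would also live here

	// head of the version list; readers load it atomically, writers publish here
	std::atomic<RelationVersion*> current{nullptr};

	// publish a new version: the previous head becomes the first obsolete entry;
	// actually freeing obsolete versions must wait until no reader holds them
	// (see the hazard pointers below)
	void publish(RelationVersion* v)
	{
		v->older = current.load(std::memory_order_acquire);
		current.store(v, std::memory_order_release);
	}
};
```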

To be able to reach the desired object for reading without any locks, one option was to allocate memory for the entire cache at once; in principle that is not that much, 64K pointers per type of cached object. But I kept in mind a possible growth in the number of objects in a database and therefore made a two-level array: each lower-level block of N objects is allocated once and for all (until the database is closed), while the upper-level block is allowed to grow when the required number of lower-level blocks no longer fits into it. The upper level is therefore implemented as a read-safe array: a reader takes a snapshot of the current state of the array, and that snapshot remains unchanged while the reader works with it; an external mutex is used to modify the array. This array (SharedReadVector) turned out to be very convenient, and I also use it for the requests in a statement and for the list of formats in a relation. The two-level arrays (CacheVector) are the same template for all object types, and all of this ultimately lives in the DBB.
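A rough sketch of the snapshot idea follows; it is not the Firebird SharedReadVector template itself, and std::shared_ptr stands in for the hazard-pointer based reclamation that the real code uses (requires C++20 for std::atomic<std::shared_ptr>).

```cpp
#include <atomic>
#include <memory>
#include <mutex>
#include <vector>

template <typename T>
class SnapshotVector
{
	using Buffer = std::vector<T>;

	std::atomic<std::shared_ptr<const Buffer>> current;
	std::mutex writeMutex;               // serializes all modifications

public:
	SnapshotVector() : current(std::make_shared<const Buffer>()) {}

	// Readers: no locks; the returned snapshot never changes under us.
	std::shared_ptr<const Buffer> readAccessor() const
	{
		return current.load(std::memory_order_acquire);
	}

	// Writers: copy the current buffer, modify the copy, publish it.
	void add(const T& value)
	{
		std::lock_guard<std::mutex> g(writeMutex);
		auto copy = std::make_shared<Buffer>(*current.load());
		copy->push_back(value);
		current.store(std::move(copy), std::memory_order_release);
	}
};
```

In the two-level CacheVector arrangement described above, only the upper-level array of block pointers would need such a snapshot mechanism; the lower-level blocks of N object pointers are allocated once and never move, so pointers into them stay valid for the lifetime of the database.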

Ideally, I would like to remove the dsql cache altogether. So far it has been removed for relations, procedures, and functions. At the same time, I did not abandon dsql_rel and similar formats: they are needed to be able to work with an object while it is being created or altered (dsql_rel), when there is no such object in the metadata cache yet or its format there differs. Temporary objects of this kind are created in dsqlScratch and die together with it (by pool).

libcds is used to implement safe pointers (HazardPtr); they are needed for the version headers in the list of object versions and for the snapshot in SharedReadVector.
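For readers unfamiliar with the technique, below is a heavily reduced illustration of the hazard-pointer idea; it is not the libcds API (the branch relies on libcds for the real implementation), and slot management is simplified to a fixed per-thread array.

```cpp
#include <algorithm>
#include <array>
#include <atomic>

constexpr int MAX_THREADS = 64;
std::array<std::atomic<void*>, MAX_THREADS> hazardSlots{};   // one slot per thread

// Reader side: pin the pointer we are about to dereference so it cannot be freed.
template <typename T>
T* protect(const std::atomic<T*>& src, int threadSlot)
{
	T* p;
	do
	{
		p = src.load(std::memory_order_acquire);
		hazardSlots[threadSlot].store(p, std::memory_order_seq_cst);
		// re-check: the pointer may have been replaced before we published it
	} while (p != src.load(std::memory_order_acquire));
	return p;
}

void release(int threadSlot)
{
	hazardSlots[threadSlot].store(nullptr, std::memory_order_release);
}

// Writer side: a retired (obsolete) version may be deleted only if no reader
// currently has it published in a hazard slot.
bool canReclaim(void* retired)
{
	return std::none_of(hazardSlots.begin(), hazardSlots.end(),
		[retired](const std::atomic<void*>& slot)
		{ return slot.load(std::memory_order_seq_cst) == retired; });
}
```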

In all requests, instead of a pointer directly to an object, what is stored for a cached resource is a structure of two elements: a pointer to the permanent part of the object and an index into an array of pointers to object versions. These arrays may differ between different clones of a request. To reduce duplication, a RefCounted array is used. rpb_relation's are registered in the rpb when a clone is created. The decision on whether to update the set of versions in a request clone is made in GetRequest, that is, when a clone of the request is obtained.
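A hedged sketch of that two-element reference is shown below; the real structure and names in the branch differ, this only demonstrates the indirection through the per-clone version array.

```cpp
#include <cstddef>
#include <vector>

struct RelationPermanent;                 // constant part, lives in the cache
struct RelationVersion;                   // one concrete version of the object

// What a compiled request stores instead of a raw object pointer.
struct CachedResource
{
	RelationPermanent* permanent;         // never changes, safe to keep
	std::size_t versionIndex;             // slot in the clone's version array
};

// Per-clone array of resolved versions; shared between clones through
// reference counting (RefCounted) in the real code to avoid duplication.
using VersionVector = std::vector<const RelationVersion*>;

// Resolving the actual version at execution time: the clone's vector is
// consulted by index, so different clones of the same request may see
// different metadata versions.
inline const RelationVersion* resolve(const CachedResource& res,
                                      const VersionVector& cloneVersions)
{
	return cloneVersions[res.versionIndex];
}
```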

At the moment, TCS passes fully. QA still shows failures, but I think it is time to merge metacache into master so as not to block the merging of other branches. It is probably also time to decide which indexes on the system tables are needed and which are not really necessary; for example, I need to add indexes by ID for relations and the other cached tables.

