neo4j
diff --git a/‎modules/ROOT/content-nav.adoc‎
Lines changed: 1 addition & 1 deletion b/‎modules/ROOT/content-nav.adoc‎
Lines changed: 1 addition & 1 deletion
diff --git a/‎modules/ROOT/images/scalability/property-shard-architecture.jpg‎
55.7 KB b/‎modules/ROOT/images/scalability/property-shard-architecture.jpg‎
55.7 KB
diff --git a/‎modules/ROOT/images/scalability/property-shard-architecture.png‎
53.4 KB b/‎modules/ROOT/images/scalability/property-shard-architecture.png‎
53.4 KB
diff --git a/‎modules/ROOT/pages/scalability/composite-databases/scaling-with-composite-databases.adoc‎
Lines changed: 0 additions & 3 deletions b/‎modules/ROOT/pages/scalability/composite-databases/scaling-with-composite-databases.adoc‎
Lines changed: 0 additions & 3 deletions
diff --git a/‎modules/ROOT/pages/scalability/scaling-with-neo4j.adoc‎
Lines changed: 24 additions & 33 deletions b/‎modules/ROOT/pages/scalability/scaling-with-neo4j.adoc‎
Lines changed: 24 additions & 33 deletions
diff --git a/‎modules/ROOT/pages/scalability/sharded-property-databases/admin-operations.adoc‎
Lines changed: 12 additions & 9 deletions b/‎modules/ROOT/pages/scalability/sharded-property-databases/admin-operations.adoc‎
Lines changed: 12 additions & 9 deletions
diff --git a/‎modules/ROOT/pages/scalability/sharded-property-databases/altering-sharded-databases.adoc‎
Lines changed: 1 addition & 7 deletions b/‎modules/ROOT/pages/scalability/sharded-property-databases/altering-sharded-databases.adoc‎
Lines changed: 1 addition & 7 deletions
diff --git a/‎modules/ROOT/pages/scalability/sharded-property-databases/configuration.adoc‎
Lines changed: 6 additions & 2 deletions b/‎modules/ROOT/pages/scalability/sharded-property-databases/configuration.adoc‎
Lines changed: 6 additions & 2 deletions
diff --git a/‎modules/ROOT/pages/scalability/sharded-property-databases/create-spd-database.adoc‎
Lines changed: 103 additions & 0 deletions b/‎modules/ROOT/pages/scalability/sharded-property-databases/create-spd-database.adoc‎
Lines changed: 103 additions & 0 deletions
@@ -171,7 +171,7 @@
 *** xref:scalability/composite-databases/delete-composite-databases.adoc[]
 *** xref:scalability/composite-databases/querying-composite-databases.adoc[]
 *** xref:scalability/composite-databases/sharding-with-copy.adoc[]
-*** xref:scalability/composite-databases/scaling-with-composite-databases.adoc[]
+//*** xref:scalability/composite-databases/scaling-with-composite-databases.adoc[]
 ** Property sharding (Preview feature)
 *** xref:scalability/sharded-property-databases/overview.adoc[]
 *** xref:scalability/sharded-property-databases/planning-and-sizing.adoc[]
 
@@ -4,17 +4,19 @@
 
 Neo4j offers various options for scaling, tailored to specific use cases and requirements. Here are some of the supported scaling strategies:
 
-* *Data replication via Neo4j clustering (read scalability)* -- via Neo4j clustering.
-A Neo4j cluster is a collection of servers running Neo4j that are configured to communicate with each other.
-These may be used to host databases, and the databases may be configured to replicate across servers in the cluster, thus achieving read scalability or high availability.
-A minimum of three servers is required for the cluster to be fault-tolerant.
-Neo4j cluster is good for:
-
-** Horizontal, READ scalability
-** Always on, highly available with disaster recovery and rolling upgrades (Neo4j 5.0+)
-** Flexible infrastructure from 1 to many copies of the same database
+* *Data replication via Neo4j analytics clustering (read scalability)* -- A Neo4j cluster is a high-availability cluster with multi-DB support.
+It is a collection of servers running Neo4j that are configured to communicate with each other.
+This means that servers and databases are decoupled: servers provide computation and storage power for databases to use.
+Each database relies on its own cluster architecture, organized into primaries (with a minimum of 3 for high availability) and secondaries (for read scaling).
+Scalability, allocation/reallocation, service elasticity, load balancing, and automatic routing are automatically provided (or they can be finely controlled).
++
+xref:clustering/setup/analytics-cluster.adoc[Neo4j analytics cluster] is good for:
+
+** Horizontal, read scalability
+** Always on, highly available with disaster recovery and rolling upgrades (Neo4j 5.0+).
+** Flexible infrastructure from 1 to many copies of the same database.
 ** Servers may be service-specific (analytical/transactional workloads, data science, reporting, etc.).
-Multi-region, multi-tenant, SaaS-style scalability
+Multi-region, multi-tenant, SaaS-style scalability.
 
 * *Data federation and sharding via composite database* -- using federated queries, Neo4j allows you to query multiple Neo4j databases with a single query.
 The data is partitioned into smaller, more manageable pieces, called shards.
@@ -35,24 +37,27 @@ This allows, in theory, the unlimited growth of a graph.
 label:preview[Preview feature] xref:scalability/sharded-property-databases/overview.adoc[Property sharding] (part of Infinigraph) allows you to decouple the properties attached to nodes and relationships and store them in separate graphs.
 This architecture enables the independent scaling of property data, allowing for the handling of high volumes, heavy queries, and high read concurrency.
 
-The following table summarizes the similarities and differences between composite databases and sharded property databases:
+The following table summarizes the similarities and differences between analytics clustering, composite databases and sharded property databases:
 
-.Similarities and differences between composite databases and sharded property databases
-[cols="2,4a,4a",frame="topbot",options="header"]
+.Similarities and differences between analytics clustering, composite databases and sharded property databases
+[cols="2,4a,4a,4a",frame="topbot",options="header"]
 |===
 |
+| Analytics cluster
 | Composite database
 | Sharded property database
 
 
 | Typical use cases
+|
 | *Federated data* +
 Time-based sharding +
 *Application-based access*
 | *Graphs with a large volume of properties* +
 Ideal for vector and full-text search
 
 | Scalability
+|
 | *Data volume: unlimited* +
 Read concurrency: horizontal scale on multiple instances +
 *Write concurrency: horizontal scale depending on the graph model*
@@ -61,18 +66,21 @@ Read concurrency: horizontal scale on multiple instances +
 *Write concurrency: single instance*
 
 | Transactions
+|
 | Parallel read transactions +
 Single-shard write transactions +
 `CALL {} IN TRANSACTION` for multiple, isolated read/write transactions with manual error handling
 | Parallel read & write transactions on all shards +
 Standard transaction management
 
 | Data load
+|
 | Manually orchestrated import +
 Ad-hoc, project-based, sharded import
 | Initial and incremental data import via neo4j-admin and Aura importer
 
 | Cypher queries
+|
 | Parallel execution on shards. +
 Single database queries must be modified according to the *sharding rules*. +
 Automated shard pruning using sharding functions.
@@ -81,39 +89,22 @@ Single database *queries run as is*. +
 Automated shard pruning based on node selection.
 
 | User tools
+|
 | Work with Browser and Cypher Shell. +
 Tools used on individual shards and Bloom are not supported on composite databases.
 | All tools supported.
 
 | Admin tools
+|
 | Tools used on individual shards are not supported on composite databases.
 | All tools supported.
 
 | Libraries
+|
 | Supported on individual shards.
 | All libraries supported.
 |===
 
-== Neo4j clustering
-
-xref:clustering/index.adoc[Neo4j cluster] is a high-availability cluster with multi-DB support.
-This means that servers and databases are decoupled: servers provide computation and storage power for databases to use.
-Each database relies on its own cluster architecture, organized into primaries (with a minimum of 3) and secondaries (for read scaling).
-Scalability, allocation/reallocation, service elasticity, load balancing, and automatic routing are automatically provided (or they can be finely controlled).
-
-image::scalability/cluster.png[title="some title.", role="middle"]
-
-
-== Composite databases
-
-Composite databases enable queries to access multiple graphs simultaneously.
-They provide:
-
-* *Data Federation:* the ability to access data available in distributed sources in the form of disjoint graphs.
-* *Data Sharding:* the ability to access data available in distributed sources in the form of a common graph partitioned on multiple databases.
-
-For more information, see xref:scalability/composite-databases/concepts.adoc[Composite databases].
-
 //TODO
 //Admin considerations
 
 
@@ -29,18 +29,21 @@ ENABLE SERVER 'serverId'  OPTIONS { allowedDatabases: ['foo-sharded-p000'] }
 
 == Resizing and resharding
 
-Online resharding (adding new shards, removing old ones, relocating data to accommodate the new topology) is currently not supported.
-You can reshard your data via the `neo4j-admin database copy` command.
-See xref:scalability/sharded-property-databases/data-ingestion.adoc#splitting-existing-db-into-shards[Splitting an existing database into shards] for more information.
-
-Alternatively, you can select more shards than needed to start with and allow space for their data to grow, as the Neo4j cluster allows databases to be moved based on server availability.
+=== Resizing
+You can resize a sharded property database by adding or removing property shards.
+You can select more shards than needed to start with and allow space for their data to grow, as the Neo4j cluster allows databases to be moved based on server availability.
 For example, ten property shards can be initially hosted on five servers (two shards per server), and additional servers can be added as needed.
 For details on managing databases and servers in a cluster, see xref:clustering/databases.adoc[Managing databases in a cluster] and xref:clustering/servers.adoc[Managing servers in a cluster].
 
+=== Resharding
+
+You can reshard your data via the `neo4j-admin database copy` command.
+See xref:scalability/sharded-property-databases/data-ingestion.adoc#splitting-existing-db-into-shards[Splitting an existing database into shards] for more information.
+
 //TODO: We should talk about co-location, adding/removing servers in a cluster and say what is supported and what is not.
 
-[[backup-and-recovery]]
-== Backup and recovery
+[[backup-and-restore]]
+== Backup and restore
 
 A sharded property database is a database made up of multiple databases.
 This means that when you want to back up a database, you must back up all the shards individually, resulting in a sharded property database backup that is composed of multiple smaller backup chains.
@@ -126,7 +129,7 @@ Failure to meet this requirement will make a given replica of a property shard u
 
 If a property shard replica does fall behind the transaction log range available on the graph shard, you can recover it by:
 
-. Connecting to the server hosting the affected replica using the _bolt://_ scheme.
+. Connecting to the `system` database on the server hosting the affected replica using the _bolt://_ scheme.
 . Quarantining the replica using xref:procedures.adoc#procedure_dbms_quarantineDatabase[`dbms.quarantineDatabase()`].
 . Unquarantining the replica using xref:procedures.adoc#procedure_dbms_unquarantineDatabase[`dbms.unquarantineDatabase()`] with the `replaceStateReplaceStore` option.
 This will force the replica to copy the database store files from another replica of the property shard.
@@ -135,7 +138,7 @@ If all replicas of a given property shard are behind, then the sharded property
 This is an irrecoverable state.
 Up until this point, losing replicas reduces fault tolerance, but the database remains available.
 When a sharded property database becomes irrecoverable, it needs to be dropped and recreated from a backup.
-See <<backup-and-recovery, Backup and recovery>>.
+See <<backup-and-restore, Backup and restore>>.
 
 One mechanism to avoid property shards falling out of range of the graph shard’s transaction log is to set a sufficiently large transaction log prune time on the graph shard.
 See xref:scalability/sharded-property-databases/limitations-and-considerations.adoc#setting-suitable-tx-log-retention-policy[Setting a suitable transaction log retention policy].
@@ -81,10 +81,4 @@ SET TOPOLOGY 1 PRIMARY 2 SECONDARIES;
 ----
 ALTER DATABASE `foo-sharded-p000`
 SET TOPOLOGY 2 REPLICAS;
-----
-
-[NOTE]
-====
-Resharding is currently not supported.
-When the database is operational, altering a property shard can only be done by altering the number of replicas per graph shard.
-====
+----
@@ -15,14 +15,18 @@ To enable the property sharding in your cluster, you must configure the followin
 | Configuration setting | Description
 
 | internal.dbms.sharded_property_database.enabled=true
-| By default, the sharded property database is disabled.footnote:[This setting is a feature toggle behind which the sharded property database is developed. See xref:scalability/sharded-property-databases/overview.adoc[Property sharding overview].]
+| By default, the sharded property database is disabled.footnote:[Property sharding is a preview feature. For details, see xref:scalability/sharded-property-databases/overview.adoc[Property sharding overview].]
 
 | db.query.default_language=CYPHER_25
 | Ensures that any database created will use Cypher 25 (unless users specifically override the default version in the `CREATE DATABASE` command).
-See xref:configuration/cypher-version-configuration.adoc[Configure the Cypher default version] and link:   shttps://neo4j.com/docs/cypher-manual/25/queries/select-version/[Cypher Manual -> Select Cypher version].
+See xref:configuration/cypher-version-configuration.adoc[Configure the Cypher default version] and link:https://neo4j.com/docs/cypher-manual/25/queries/select-version/[Cypher Manual -> Select Cypher version].
 
 | internal.dbms.cluster.experimental_protocol_version.dbms_enabled=true
 | Allows users to take valid backups of a sharded database.
+
+|internal.dbms.single_raft_enabled=true
+| Allows a sharded property database to start with 1 primary for the graph shard and scale up to 3 at a later date.
+It is not needed if you will always run 3 primary graph shard.
 |===
 
 
 
@@ -0,0 +1,103 @@
+:description: This page describes how to create a sharded property database using the `CREATE DATABASE` command.
+:page-role: new-2025.10 enterprise-edition not-on-aura
+:keywords: sharded property database, CREATE DATABASE, Cypher 25
+= `CREATE DATABASE` command with sharded databases
+
+You can create a sharded database using the Cypher command `CREATE DATABASE` (requires Cypher 25, introduced alongside Neo4j 2025.06.0).
+For details on configuring the Cypher version, see xref:configuration/cypher-version-configuration.adoc[Configure the Cypher default version].
+
+
+== Syntax
+
+[options="header", width="100%", cols="1m,5a"]
+|===
+| Command | Syntax
+
+| CREATE DATABASE
+|
+[source, syntax, role="noheader"]
+----
+CREATE DATABASE name [IF NOT EXISTS]
+[[SET] GRAPH SHARD {
+  [TOPOLOGY n PRIMAR{Y\|IES} [m SECONDAR{Y\|IES}]]
+}]
+[SET] PROPER{TY\|IES} {
+  COUNT n [TOPOLOGY m REPLICA[S]]
+}
+[OPTIONS "{" option: value[, ...] "}"]
+[WAIT [n [SEC[OND[S]]]]\|NOWAIT]
+----
+|===
+
+When creating a sharded database, the following are created:
+
+* A virtual sharded database `<name>`.
+* A single graph shard with the name `<name>-g000`.
+* A number of property shards with the name `<name>-p000<index>`.
+The count property in `SET PROPERTY SHARDS` specifies the number of property shards.
+
+[NOTE]
+====
+`CREATE OR REPLACE` does not replace an existing sharded database.
+====
+
+== Options
+
+The `CREATE DATABASE` command can have a map of options, e.g., `OPTIONS {key: 'value'}`.
+For sharded databases, only the seeding option is supported.
+
+The following table describes the `seedUri` option:
+
+[frame="topbot", grid="cols", cols="<1s,<4"]
+|===
+| *Key*
+m| seedURI
+| *Value*
+a| URI to a folder containing all the backups or a list of dumps/backups.
+
+[NOTE]
+The folder notation only works for backups, not dumps.
+
+When specifying each artifact manually the key of the map is the name of the shard.
+Where shard name = `databaseName-g000` or `databaseName-p000` for property shards where the last shard name would be `databaseName-px` where `x = numShards -1`.
+| *Description*
+a| Defines an identical seed from an external source, which will be used to seed all servers. For more information, see xref::database-administration/standard-databases/seed-from-uri.adoc[Seed from a URI].
+| *Example*
+|
+[source, syntax, role="noheader"]
+----
+seedUri: {
+  `foo-sharded-g000`: "s3://bucket/folder/foo-g000.backup",
+  `foo-sharded-p000`: "s3://bucket/folder/foo-p001.backup",
+  `foo-sharded-p001`: "s3://bucket/folder/foo-p002.backup"
+ }
+----
+Or
+[source, syntax, role="noheader"]
+----
+seedUri: "s3://bucket/folder/"
+----
+|===
+
+== Default numbers for topology
+
+The sharded property databases use the Neo4j cluster topology.
+Therefore, you need to consider how the following settings will affect the creation of your sharded property database.
+
+[options="header", width="100%", cols="4m,1m,1m,3a"]
+|===
+| Configuration settings with their default value
+| Default value
+| Valid values
+| Description
+
+|initial.dbms.default_primaries_count
+| 1
+| [1-10]
+| The default number of primaries for the graph shard when the database is created.
+
+|initial.dbms.default_secondaries_count
+| 0
+| [0-19]
+| The default number of secondaries for the graph shard when the database is created.
+|===