This repository has been archived by the owner on Sep 21, 2024. It is now read-only.

Shard #74

Open · wants to merge 22 commits into master
Conversation

saj9191 (Contributor) commented Jun 24, 2019

No description provided.

Shannon Joyner and others added 17 commits July 25, 2019 16:16
For both the sharded and non-sharded cases, we want to correctly set up
the primary logs. However, in the sharded case, we call
RecoveryAsync for every shard we recover from. We don't want to
move the upgraded directory and log file for every shard,
so we move this final logic into a function we can call once
in both the sharded and non-sharded cases.
If we want to perform shard recovery in parallel, we need to ensure
each shard has its own set of variables (e.g. it must not share the
same log file variables). To make this easier to share with the
non-sharded case, we move the variables into a class.
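
A minimal sketch of what that per-shard state class might look like (the class and member names here are illustrative, not the actual Ambrosia identifiers):

```csharp
// Hypothetical sketch: one state object per shard so parallel recovery
// does not share log-file variables across shards.
internal sealed class ShardRecoveryState
{
    public long ShardID { get; }

    // Per-shard copies of what used to be coordinator-wide log variables.
    public string LogFileName { get; set; }
    public long CheckpointVersion { get; set; }
    public long LastProcessedID { get; set; }

    public ShardRecoveryState(long shardID)
    {
        ShardID = shardID;
    }
}
```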
The service name folder will change depending on the shardID.
This will affect where we look for log and checkpoint files.
We update all directory and file lookups to use functions that
account for the shardID.

Note: For the initial self-connections, I use _serviceName and
don't consider if it's a shard. This is because the Sample
applications are still linked to the old version of Ambrosia.
We will change this when we are able to link Samples to the
new version.
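
For illustration, lookups along these lines could route every log and checkpoint path through the shardID; `_serviceName` appears in the commits above, while `_logRoot` and the folder-naming scheme are assumptions:

```csharp
using System.IO;

// Hypothetical helpers: every log/checkpoint lookup goes through the shardID
// so the service folder name can vary per shard.
internal sealed class ShardPaths
{
    private readonly string _logRoot;
    private readonly string _serviceName;

    public ShardPaths(string logRoot, string serviceName)
    {
        _logRoot = logRoot;
        _serviceName = serviceName;
    }

    // Assumed naming scheme: non-sharded services keep the plain service name.
    public string ServiceFolder(long shardID) =>
        shardID <= 0 ? _serviceName : $"{_serviceName}_{shardID}";

    public string LogFile(long shardID, long version) =>
        Path.Combine(_logRoot, ServiceFolder(shardID), $"serverlog{version}");

    public string CheckpointFile(long shardID, long version) =>
        Path.Combine(_logRoot, ServiceFolder(shardID), $"serverchkpt{version}");
}
```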
This makes it easier to switch between the sharded and non-sharded
scenarios. We still need to figure out the best way to pass in the
shards to recover from.
For the sharded case, what is checked depends on the shard.
Add functionality so the Immortal Coordinator determines which shard
to send a message to. The shard is currently determined by a hash of
the destination; the hash only returns shard 1 for now.
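
A hedged sketch of that routing step (the class and method names are hypothetical):

```csharp
// Hypothetical sketch of the routing step the commit describes: pick a
// shard by hashing the destination name. As noted above, the placeholder
// always returns shard 1 for now.
internal static class ShardRouter
{
    public static long ShardForDestination(string destination, int shardCount)
    {
        // A real implementation would map the hash into the shard space, e.g.:
        // return (uint)destination.GetHashCode() % shardCount + 1;
        return 1;
    }
}
```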
For shard recovery, we need InputConnectionRecords to track the
last processed ID and last processed replayable ID of ancestor
shards.

This map needs to be sent between peers. We only want to send
longs instead of strings containing the name of the peer. To
make this easier, we add a ShardID field so that we only have
to parse the peer name when we make the initial connection.

To make sure we don't break serialization when sharding is added,
we move the replay serialization logic into a separate class so
we can add unit tests. Any other serialization changes affected
by shards will be moved into this class as well.
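
A rough sketch of the extra state this implies for an input connection record (field names are illustrative; only the ShardID field and the two progress notions come from the commit message):

```csharp
using System.Collections.Generic;

// Hypothetical shape of the per-ancestor progress tracked alongside
// InputConnectionRecords, keyed by numeric ShardID so only longs (not
// peer-name strings) need to be serialized between peers.
internal sealed class ShardInputProgress
{
    // Parsed once from the peer name when the connection is first made.
    public long ShardID;

    // Last processed and last processed replayable IDs per ancestor shard.
    public Dictionary<long, long> AncestorLastProcessedID = new Dictionary<long, long>();
    public Dictionary<long, long> AncestorLastProcessedReplayableID = new Dictionary<long, long>();
}
```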
In the sharded case, we have to recover from multiple machines.
We initialize the machine with a list of shards to recover
from as well as a map indicating which machines each key
belongs to.

This also starts the merge process for the parent input records.

This commit may be easier to view with -w.
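
As a sketch, the entry point for sharded recovery might take something like the following shape (the interface and parameter names are assumptions, not the actual API):

```csharp
using System.Collections.Generic;

// Hypothetical signature for the sharded recovery entry point described
// above: the shards to replay from plus a map of which machine owns each key.
internal interface IShardedRecovery
{
    void InitializeRecovery(
        IReadOnlyList<long> shardsToRecoverFrom,
        IReadOnlyDictionary<long, string> keyToMachine);
}
```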
Create a dictionary to keep track of the ancestors associated with
each shard ID. This information should not change, so we store it
with service information. The ancestor information will help
determine which input / output information needs to be shared with
peers.
When a connection is established, before the peers send a replay
message, we send our ancestor list, so the peer knows which shard
input / output data to add to the replay message.
We clear the ancestor data once we know the shard has received it,
since we no longer need to track it.
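
A small sketch of that ancestor bookkeeping, under the assumption that it is a per-shard dictionary cleared on acknowledgment (names are hypothetical):

```csharp
using System.Collections.Generic;

// Hypothetical sketch: ancestors per shard ID, kept with the service
// information and cleared once a peer has confirmed it received them.
internal sealed class AncestorTable
{
    private readonly Dictionary<long, List<long>> _ancestors = new Dictionary<long, List<long>>();

    public void SetAncestors(long shardID, List<long> ancestors) => _ancestors[shardID] = ancestors;

    public IReadOnlyList<long> GetAncestors(long shardID) =>
        _ancestors.TryGetValue(shardID, out var list) ? list : new List<long>();

    // Called after the peer acknowledges the ancestor list sent before replay.
    public void Clear(long shardID) => _ancestors.Remove(shardID);
}
```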
We need to preserve a global ordering of output records during
shard recovery. If we fail before recovery completes, it's possible
that the order of execution will change. To handle this, we record
a global ordering of outputs.

We also handle merging parent output state in this commit.
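
One way to picture the global ordering, sketched with hypothetical names: tag each output with a monotonically increasing sequence number as it is produced.

```csharp
using System.Collections.Generic;

// Hypothetical sketch of the global ordering record: each output is tagged
// with an increasing sequence number so a replay after a failure mid-recovery
// preserves the original order across parent shards.
internal sealed class OutputOrderLog
{
    private long _nextSeqNo;
    private readonly List<(long SeqNo, long ShardID, long OutputID)> _order =
        new List<(long SeqNo, long ShardID, long OutputID)>();

    public void Record(long shardID, long outputID) =>
        _order.Add((_nextSeqNo++, shardID, outputID));
}
```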
This shard test covers essentially the same behavior as the normal
basic end-to-end test, except the coordinators use the sharded logic.

This is to ensure that the basic behavior for the sharded case is
the same as for the non-sharded case.
For sharding, we need to be able to concat output buffers to different
records in case a peer splits or merges.
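
A minimal sketch of such a concat, assuming the buffers are plain byte arrays (the helper name is hypothetical):

```csharp
// Hypothetical sketch of the concat operation: append one output buffer onto
// another record's buffer when a peer splits or merges.
internal static class OutputBufferOps
{
    public static byte[] Concat(byte[] existing, byte[] incoming)
    {
        var merged = new byte[existing.Length + incoming.Length];
        System.Buffer.BlockCopy(existing, 0, merged, 0, existing.Length);
        System.Buffer.BlockCopy(incoming, 0, merged, existing.Length, incoming.Length);
        return merged;
    }
}
```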
It's possible for _lastShuffleDest to be null, which would result in
a null dereference when we try to read its length.
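
The fix amounts to a null guard before the length access; a sketch, assuming _lastShuffleDest is used roughly like this:

```csharp
// Guard sketch: only read the length once we know _lastShuffleDest is
// non-null. (_lastShuffleDest is named in the commit; the surrounding
// method is assumed.)
if (_lastShuffleDest != null && _lastShuffleDest.Length > 0)
{
    // ... safe to use _lastShuffleDest here ...
}
```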
Shannon Joyner added 5 commits July 25, 2019 16:17
We don't want to update _outputs when a shard is launching.
There will be an output variable for each parent that will
be accessed in parallel. We need to update this variable
to ensure recovery state is correct and to prevent race
conditions.
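
A hedged sketch of both points together, with illustrative names (the tracker class, _perParentOutputs, and Launching are not the actual identifiers):

```csharp
using System.Collections.Generic;

// Hypothetical sketch: per-parent output records are updated under a lock,
// and skipped entirely while the shard is still launching.
internal sealed class ParentOutputTracker
{
    private readonly object _lock = new object();
    private readonly Dictionary<long, byte[]> _perParentOutputs = new Dictionary<long, byte[]>();

    // Set while the shard is launching; updates are suppressed in that window.
    public volatile bool Launching;

    public void Update(long parentShardID, byte[] outputRecord)
    {
        if (Launching) return;          // don't touch outputs while launching
        lock (_lock)                    // parents are recovered in parallel
        {
            _perParentOutputs[parentShardID] = outputRecord;
        }
    }
}
```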