Implement `start_batch` message batching #3793

jkczyz · 2025-05-22T17:11:21Z

Instead of batching commitment_signed messages using a batch TLV, the splicing spec has been updated to introduce a start_batch messages. It used to indicate that the next batch_size messages for the channel_id should be treated as one logical message. Update commitment_signed message to contain the funding_txid instead of both that and a batch_size. Also, update PeerManager to batch accordingly instead of the previous custom approach.

Instead of batching commitment_signed messages using a batch TLV, the splicing spec has been updated to introduce a start_batch messages. It used to indicate that the next batch_size messages for the channel_id should be treated as one logical message. This commit simply adds the message while the following commits will implement the handling logic.

ldk-reviews-bot · 2025-05-22T17:11:25Z

👋 Thanks for assigning @wpaulino as a reviewer!
I'll wait for their review and will help manage the review process.
Once they submit their review, I'll check if a second reviewer would be helpful.

lightning/src/ln/peer_handler.rs

wpaulino · 2025-05-22T18:08:24Z

lightning/src/ln/peer_handler.rs

-					.commitment_signed_batch
-					.get_or_insert_with(|| (msg.channel_id, BTreeMap::new()));
+		if let wire::Message::StartBatch(msg) = message {
+			if peer_lock.commitment_signed_batch.is_some() {


If a new batched message gets introduced, we'll have to make this a bit more generic, but I guess it's fine for now

Went ahead and added support for other types in the future. It requires having an Unknown enum variant since we don't know the message type until we receive the first message in the batch. Wonder if it would be worth adding the message type id to start_batch? The code would be marginally better as we'd remove one of the two needed unreachables. The second could be removed by using swap, which we decided against using when writing the original code in favor of a move, IIRC.

Wonder if it would be worth adding the message type id to start_batch?

Sounds like a good idea. The message in its current format seems very tailored to the commitment_signed use case, which is fine, but we may regret in the future. It'd be nice to have the channel_id and message_type_id be optional TLVs (requiring one or the other to be present of course) that you can specify for your specific use case.

ldk-reviews-bot · 2025-05-22T18:09:46Z

👋 The first review has been submitted!

Do you think this PR is ready for a second reviewer? If so, click here to assign a second reviewer.

Update commitment_signed message to contain the funding_txid instead of both that and a batch_size. The spec was updated to batch messages using start_batch, which contains the batch_size. This commit also updates PeerManager to batch commitment_signed messages in this manner instead of the previous custom approach.

jkczyz · 2025-05-23T17:21:26Z

lightning/src/ln/peer_handler.rs

+				let funding_txid = match msg.funding_txid {
+					Some(funding_txid) => funding_txid,
+					None => {
+						log_debug!(logger, "Peer {} sent batched commitment_signed without a funding_txid for channel {}", log_pubkey!(their_node_id), message_batch.channel_id);
+						return Err(PeerHandleError { }.into());
+					},
+				};


Looks like the spec wants us to send an error and fail the channel in this case.

- If the sending node sent `start_batch` and we are processing a batch of `commitment_signed` messages: - If `funding_txid` is missing in one of the `commitment_signed` messages: - MUST send an `error` and fail the channel.

Does this imply we need to move this handling -- and thus the BTreeMap use -- into the channel layer?

Seems like it, at least we can still keep the message type and channel_id checks at the peer layer. Disconnecting does not help since they'll just resend the same message upon reconnection (I guess only until the splice becomes locked, but it's a buggy peer regardless).

Updated to fail the channel.

While the spec only includes commitment_signed messages in batches, there may be other types of batches in the future. Generalize the message batching code to allow for other types in the future.

If a message in a commitment_signed batch does not contain a funding_txid or has duplicates, the channel should be failed. Move this check from PeerManager to FundedChannel such that this can be done.

jkczyz · 2025-05-23T21:40:42Z

Fixed some places where we disconnected but didn't send a warning (or disconnected when we should ignore and send a warning) according too the spec. Feels a little verbose, so let me know if there is some utility for this that I'm missing.

jkczyz requested a review from TheBlueMatt May 22, 2025 17:11

wpaulino requested review from wpaulino and removed request for TheBlueMatt May 22, 2025 17:40

wpaulino reviewed May 22, 2025

View reviewed changes

jkczyz added 2 commits May 22, 2025 15:09

f - re-order start_batch handling

cc2d059

jkczyz force-pushed the 2025-05-start-batch branch from 97118cf to b2d073f Compare May 23, 2025 16:57

jkczyz mentioned this pull request May 23, 2025

Channel Splicing (feature 62/63) lightning/bolts#1160

Open

jkczyz commented May 23, 2025

View reviewed changes

jkczyz added 3 commits May 23, 2025 17:36

f - send warning message when necessary

d8db355

Add support for batches of other types

96a6090

While the spec only includes commitment_signed messages in batches, there may be other types of batches in the future. Generalize the message batching code to allow for other types in the future.

Fail channel for batched commitment_signed appropriately

5110534

If a message in a commitment_signed batch does not contain a funding_txid or has duplicates, the channel should be failed. Move this check from PeerManager to FundedChannel such that this can be done.

jkczyz force-pushed the 2025-05-start-batch branch from b2d073f to 5110534 Compare May 23, 2025 21:39

jkczyz requested a review from wpaulino May 23, 2025 21:41

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Implement `start_batch` message batching #3793

Implement `start_batch` message batching #3793

Uh oh!

jkczyz commented May 22, 2025

Uh oh!

ldk-reviews-bot commented May 22, 2025 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

wpaulino May 22, 2025

Uh oh!

jkczyz May 23, 2025

Uh oh!

wpaulino May 23, 2025

Uh oh!

ldk-reviews-bot commented May 22, 2025

Uh oh!

jkczyz May 23, 2025

Uh oh!

wpaulino May 23, 2025

Uh oh!

jkczyz May 23, 2025

Uh oh!

jkczyz commented May 23, 2025

Uh oh!

Uh oh!

Implement start_batch message batching #3793

Are you sure you want to change the base?

Implement start_batch message batching #3793

Uh oh!

Conversation

jkczyz commented May 22, 2025

Uh oh!

ldk-reviews-bot commented May 22, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

wpaulino May 22, 2025

Choose a reason for hiding this comment

Uh oh!

jkczyz May 23, 2025

Choose a reason for hiding this comment

Uh oh!

wpaulino May 23, 2025

Choose a reason for hiding this comment

Uh oh!

ldk-reviews-bot commented May 22, 2025

Uh oh!

jkczyz May 23, 2025

Choose a reason for hiding this comment

Uh oh!

wpaulino May 23, 2025

Choose a reason for hiding this comment

Uh oh!

jkczyz May 23, 2025

Choose a reason for hiding this comment

Uh oh!

jkczyz commented May 23, 2025

Uh oh!

Uh oh!

Implement `start_batch` message batching #3793

Implement `start_batch` message batching #3793

ldk-reviews-bot commented May 22, 2025 •

edited

Loading