MMR Leaf Data: off-by-one error in beefy_next_authority_set #11797

acatangiu · 2022-07-07T11:37:57Z

A Leaf with 0-based MMR leaf index N-1 is constructed and added to the MMR during construction of block N here.

The Leaf data/contents can be seen populated here and it consists of:

// contents for leaf index <N-1> added by block <N>
MmrLeaf {
    version: <leaf-data-format-version>,
    (
        parent_num: <N-1>,
        parent_hash: <hash_of_<N-1>>,
    ),
    extra_data: <para_heads_of_<N-1>>,
    next_auth_set: <next_auth_set_of<N>>,
}

Note that next_auth_set is taken from block N, while every other field describes block N-1. While this is not a show-stopper ((light) clients can simply handle this in their code), it introduces cognitive complexity and corner cases in code logic.

We could avoid this and make it easier for users by changing Leaf<N-1> contents to:

// contents for leaf index <N-1> added by block <N>
MmrLeaf {
    version: <leaf-data-format-version>,
    (
        parent_num: <N-1>,
        parent_hash: <hash_of_<N-1>>,
    ),
    extra_data: <para_heads_of_<N-1>>,
-   next_auth_set: <next_auth_set_of<N>>,
+   next_auth_set: <next_auth_set_of<N-1>>,
}


acatangiu · 2022-07-07T11:50:57Z

AFAICT an easy fix is to move Mmr: pallet_mmr before pallet_session so that it builds its leaf before session is rotated.

But I would prefer a fix contained in pallet_mmr itself since above is prone to developer error when adding pallet_mmr to their runtime.
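
To make the ordering argument concrete, here is a toy Rust model of two per-block hooks running in runtime declaration order. It is not the real pallet code: `State`, `Leaf`, `Hook`, and `build_leaf` are hypothetical names invented for this sketch. The only assumption taken from the discussion is that the session hook rotates the authority set and the MMR hook snapshots whatever "next" set is in state when it runs.

```rust
// Toy model of runtime hook ordering; none of these types are real pallet APIs.

/// Minimal chain state: only the id of the "next" BEEFY authority set matters here.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub struct State {
    pub next_auth_set_id: u64,
}

/// Simplified leaf: the parent block number and the next-authority-set id
/// that was in state at the moment the leaf was built.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub struct Leaf {
    pub parent_num: u64,
    pub next_auth_set_id: u64,
}

/// The two hooks whose relative order is being discussed.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub enum Hook {
    Session,
    Mmr,
}

/// Runs the hooks of block `n` in the given order and returns the leaf built
/// by the MMR hook (`None` if `n == 0` or if `order` has no `Hook::Mmr`).
/// `session_rotates` says whether block `n` is a session boundary.
pub fn build_leaf(
    order: &[Hook],
    state: &mut State,
    n: u64,
    session_rotates: bool,
) -> Option<Leaf> {
    let mut leaf = None;
    for hook in order {
        match hook {
            // Session hook: rotating the session bumps the "next" set.
            Hook::Session => {
                if session_rotates {
                    state.next_auth_set_id += 1;
                }
            }
            // MMR hook: snapshot whatever is in state right now.
            Hook::Mmr => {
                leaf = Some(Leaf {
                    parent_num: n.checked_sub(1)?,
                    next_auth_set_id: state.next_auth_set_id,
                });
            }
        }
    }
    leaf
}
```

With `[Hook::Session, Hook::Mmr]` the leaf built at a session-boundary block captures the already-rotated set, whereas `[Hook::Mmr, Hook::Session]` captures the pre-rotation one, which is the set that belongs to block N-1.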

Lederstrumpf · 2022-07-07T12:18:39Z

> AFAICT an easy fix is to move Mmr: pallet_mmr before pallet_session so that it builds its leaf before session is rotated.

So since session is rotated prior to leaf construction currently, next_auth_set: <next_auth_set_of<N>> (i.e. Pallet::<T>::beefy_next_authorities()) can be distinct on forks of depth 1?

vgeddes · 2022-07-07T12:56:02Z

We haven't really been affected by this off-by-one issue while developing our light client (or we just worked around it).

The proposed change may affect how our light client detects authority handovers. But I have a feeling there shouldn't be any impact. If there is we can update our code.

Some background on our light client: It receives updates like the following at the start of every beefy session (Assume a session length of 10 blocks). Each update contains the beefy commitment for block N (first block of session) and the MMR leaf for block N-1 (last block of the previous session).

Update:
  BeefyCommitment(block=11) { validatorSetId: 1, ... }
  MmrLeaf(block=10) { nextValidatorSetId: 2, ...}

Update:
  BeefyCommitment(block=21) { validatorSetId: 2, ... }
  MmrLeaf(block=20) { nextValidatorSetId: 3, ... }

Update:
  BeefyCommitment(block=31) { validatorSetId: 3, ... }
  MmrLeaf(block=30) { nextValidatorSetId: 4, ... }

The light client knows it can perform an authority handover when the latest update contains a leaf.nextValidatorSetId that is greater than the previously observed value (See code here).
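
The handover rule described above can be sketched in Rust as follows. This is a simplified illustration rather than the actual light-client code: `LightClient`, `on_leaf`, and the field names are hypothetical, and all signature and MMR proof verification is omitted.

```rust
/// Hypothetical light-client state; names are illustrative only.
#[derive(Debug, Default)]
pub struct LightClient {
    /// Highest `nextValidatorSetId` observed in any accepted leaf so far.
    pub last_seen_next_set_id: u64,
    /// Number of authority handovers performed.
    pub handovers: u32,
}

impl LightClient {
    /// Processes the leaf part of an update and returns `true` if it
    /// triggers an authority handover.
    pub fn on_leaf(&mut self, next_validator_set_id: u64) -> bool {
        if next_validator_set_id > self.last_seen_next_set_id {
            // A strictly greater id means a new "next" set was announced.
            self.last_seen_next_set_id = next_validator_set_id;
            self.handovers += 1;
            true
        } else {
            // Same (or older) id: nothing to hand over.
            false
        }
    }
}
```

Because the rule only compares against the previously observed id, it does not depend on which block within a session the leaf comes from, as long as the id eventually increases.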

acatangiu · 2022-07-07T13:17:31Z

> > AFAICT an easy fix is to move Mmr: pallet_mmr before pallet_session so that it builds its leaf before session is rotated.
>
> So since session is rotated prior to leaf construction currently, next_auth_set: <next_auth_set_of<N>> (i.e. Pallet::<T>::beefy_next_authorities()) can be distinct on forks of depth 1?

There is no issue for forks in either option (current behavior & suggested change) because the validator set is identical for blocks at height N (or N+1, ..., N+m) across all forks.

The issue is more about usability, in the sense that the BEEFY payload for block N is an MMR root where the latest leaf is a set of primitives coming from a mix of blocks N-1 and N; it could be simplified to:

(block numbers are 1-indexed whereas leaf indexes are 0-indexed)
BEEFY payload for block N contains MMR root built from leaves indexed L in 0..=N-1, with each Leaf index L containing primitives coming exclusively from block number L.
At block N, the latest leaf L=N-1 contains primitive values pertaining to block N-1 (instead of some from N-1 and some from N).
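
The index arithmetic in the simplified scheme can be written down as a few small Rust helpers. The function names are hypothetical and exist only for this sketch; they encode the 1-indexed block / 0-indexed leaf convention stated above.

```rust
use std::ops::RangeInclusive;

/// Index of the leaf appended while building block `block_number`.
/// Returns `None` for block 0, which appends no leaf in this model.
pub fn leaf_index_added_by(block_number: u64) -> Option<u64> {
    block_number.checked_sub(1)
}

/// Block number whose construction appends the leaf at `leaf_index`.
/// Returns `None` only on `u64` overflow.
pub fn block_adding_leaf(leaf_index: u64) -> Option<u64> {
    leaf_index.checked_add(1)
}

/// Leaf indices covered by the MMR root in the BEEFY payload of block
/// `block_number`, i.e. `0..=N-1`.
pub fn leaves_in_payload(block_number: u64) -> Option<RangeInclusive<u64>> {
    block_number.checked_sub(1).map(|last| 0..=last)
}
```

Under this convention, a client interested in data of block B looks up leaf index B, which first becomes provable against the payload of block B+1.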

Marking this as "Some day maybe" since existing clients already successfully handle current behavior.

seunlanlege · 2022-08-11T02:02:11Z

I actually have encountered this issue and I created this PR as a fix: #10907

acatangiu · 2022-09-22T12:12:20Z

> I actually have encountered this issue and I created this PR as a fix: #10907

This is actually something different: by building the leaf in on_finalize() you get:

// contents for leaf index <N-1> added by block <N>
MmrLeaf {
    version: <leaf-data-format-version>,
    (
        parent_num: <N-1>,
        parent_hash: <hash_of_<N-1>>,
    ),
-   extra_data: <para_heads_of_<N-1>>,
+   extra_data: <para_heads_of_<N>>,
    next_auth_set: <next_auth_set_of<N>>,
}

At this point you also run into trouble with 1-block-deep forks writing different leaf contents to the offchain db for the same (parent, pos) key.

The offchain-db thing can be fixed by also doing #11799.
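
The fork problem can be illustrated with a toy key-value store in Rust. This is only a sketch: `OffchainDb` and `write_leaf` are hypothetical, hashes are plain integers, and the only thing taken from the comment above is the `(parent, pos)` key shape.

```rust
use std::collections::HashMap;

/// Toy offchain db keyed by (parent_hash, leaf_position).
pub type OffchainDb = HashMap<(u64, u64), String>;

/// Writes leaf contents under the (parent, pos) key and returns `true` if
/// this write replaced a *different* value stored under the same key.
pub fn write_leaf(db: &mut OffchainDb, parent_hash: u64, pos: u64, contents: &str) -> bool {
    match db.insert((parent_hash, pos), contents.to_string()) {
        // Key already existed: it is a conflict only if contents differ.
        Some(previous) => previous != contents,
        // First write for this key.
        None => false,
    }
}
```

If a leaf contains only data derived from the parent, two sibling blocks write identical contents and nothing is lost. If the leaf contains data from the block itself (as with on_finalize()), the siblings share the key but not the contents, so one fork's leaf overwrites the other's.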

@seunlanlege Is the mixed layout of MmrLeaf above really better?

Isn't light-client code simpler if it simply references leaf index N when interested in extra_data of block N?
(i.e. for N=5: leaf index 5 is the leaf added by block 6 containing extra_data of block 5)

seunlanlege · 2022-09-22T12:22:39Z

> Isn't light-client code simpler if it simply references leaf index N when interested in extra_data of block N?

So you're correct that this makes sense, but only from a leaf index perspective.

But light clients won't be tracking leaf indexes, they'll be tracking block heights.

And it's a bit wonky from that perspective that a leaf emitted at block N contains data about N-1, especially when there's no clear advantage to having it this way.

Whereas it's easier to reason that a leaf emitted at block N contains information about block N.

Also, the leaves in this N-1 scheme will miss the authority set change. So you'll need leaf N+1, where N is the authority set change height, to know exactly where a new epoch begins, which is just cognitive overhead.

acatangiu · 2022-09-22T12:37:38Z

Aha, that makes sense. So to summarize, you're saying that for a light client, the best experience interacting with pallet-mmr would be if it exposed leaves with the following format:

// contents for leaf added by block <N>
MmrLeaf {
    version: <leaf-data-format-version>,
    (
        parent_num: <N-1>,
        parent_hash: <hash_of_block<N-1>>,
    ),
    extra_data: <para_heads_at_block<N>>,
    next_auth_set: <next_auth_set_at_block<N>>,
}

right?

seunlanlege · 2022-09-22T12:58:09Z

> Aha, that makes sense. So to summarize, you're saying that for a light client, the best experience interacting with pallet-mmr would be if it exposed leaves with the following format:

yes correct.

vgeddes · 2022-09-22T17:00:40Z

I'm happy with this change too. It makes sense to me and would allow us to improve some of the code in our off-chain relayers, as the relationship between block-height, leaf-index, and the desired leaf.paras-root was previously a bit confusing.

+1

acatangiu · 2022-10-14T13:33:00Z

> Aha, that makes sense. So to summarize, you're saying that for a light client, the best experience interacting with pallet-mmr would be if it exposed leaves with the following format:
>
> // contents for leaf added by block <N>
> MmrLeaf {
>     version: <leaf-data-format-version>,
>     (
>         parent_num: <N-1>,
>         parent_hash: <hash_of_block<N-1>>,
>     ),
>     extra_data: <para_heads_at_block<N>>,
>     next_auth_set: <next_auth_set_at_block<N>>,
> }

I'm sorry @vgeddes @seunlanlege but this doesn't seem to be possible 😢 see #12446 (comment)

Closing for now, please reopen if you have any other ideas.

acatangiu · 2022-12-06T15:51:08Z

We decided we want to do at least this (since #12446 isn't possible):

// contents for leaf index <N-1> added by block <N>
MmrLeaf {
    version: <leaf-data-format-version>,
    (
        parent_num: <N-1>,
        parent_hash: <hash_of_<N-1>>,
    ),
    extra_data: <para_heads_of_<N-1>>,
-   next_auth_set: <next_auth_set_of<N>>,
+   next_auth_set: <next_auth_set_of<N-1>>,
}

which could be as easy as:
#11797 (comment)

Lederstrumpf · 2023-01-12T10:10:07Z

> We decided we want to do at least this (since #12446 isn't possible):
>
> // contents for leaf index <N-1> added by block <N>
> MmrLeaf {
>     version: <leaf-data-format-version>,
>     (
>         parent_num: <N-1>,
>         parent_hash: <hash_of_<N-1>>,
>     ),
>     extra_data: <para_heads_of_<N-1>>,
> -   next_auth_set: <next_auth_set_of<N>>,
> +   next_auth_set: <next_auth_set_of<N-1>>,
> }
>
> which could be as easy as: #11797 (comment)

I can confirm this works:

- change (only moving pallet_mmr ahead of pallet_session in runtime enum) in https://github.com/Lederstrumpf/polkadot/tree/beefy-mmr-leaf-off-by-one
- log bumps for debugging in https://github.com/Lederstrumpf/substrate/tree/beefy-mmr-leaf-off-by-one
- logs showing that it's taking next_auth_set_of<N-1>: https://hackmd.io/Ve-FPPAJSdeGNc3rc-FmrA

So this would make the leaf content consistent wrt. block numbering.

However, looking at Adrian's PR #12446 again, the reason for closing it was that offchain worker is not guaranteed to run on every block, but since #12753 moved the offchain storage handling to client-side gadget, there's nothing blocking doing next_auth_set_of<N> as implemented in #12446, if I'm not mistaken?

Otherwise, I'll open up PR for next_auth_set_of<N-1>.

acatangiu · 2023-01-12T10:27:37Z

> but since #12753 moved the offchain storage handling to client-side gadget, there's nothing blocking doing next_auth_set_of<N> as implemented in #12446, if I'm not mistaken?

it could be done at the expense of extra runtime storage - keep non-canon full data in runtime storage instead of offchain storage and "back it up" to offchain on client-side finality, but then you still have problems when finality lags and aggressive pruning is configured; and obviously increased storage costs for the chain.

I think the wise path is to take the slightly increased code complexity on light-client side and use <N-1>, and thus avoid all the extra runtime storage required to work directly on <N>.

> Otherwise, I'll open up PR for next_auth_set_of<N-1>.

I think this is the right call.

@vgeddes @seunlanlege just to confirm there is no difference in light-client "runtime costs" between:

MmrLeaf @ block N {
    version: <leaf-data-format-version>,
    (
        parent_num: <N-1>,
        parent_hash: <hash_of_<N-1>>,
    ),
    extra_data: <para_heads_of_<N-1>>,
    next_auth_set: <next_auth_set_of<N-1>>,
}

and

MmrLeaf @ block N {
    version: <leaf-data-format-version>,
    (
        parent_num: <N-1>,
        parent_hash: <hash_of_<N-1>>,
    ),
    extra_data: <para_heads_of_<N>>,        <---- diff here
    next_auth_set: <next_auth_set_of<N>>,   <---- diff here
}

seunlanlege · 2023-01-12T12:35:41Z

> @vgeddes @seunlanlege just to confirm there is no difference in light-client "runtime costs" between:

Yes correct. The light client only needs to be aware of a single header, rather than 2.

vgeddes · 2023-01-12T17:56:14Z

Should not be a problem for our light client, I think.

acatangiu · 2023-01-19T12:00:43Z

Fixed in paritytech/polkadot#6577

acatangiu added I3-bug The node fails to follow expected behavior. U2-some_time_soon Issue is worth doing soon. E5-breaksapi labels Jul 7, 2022

acatangiu added U4-some_day_maybe Issue might be worth doing eventually. and removed U2-some_time_soon Issue is worth doing soon. labels Jul 7, 2022

acatangiu added U2-some_time_soon Issue is worth doing soon. and removed U4-some_day_maybe Issue might be worth doing eventually. labels Sep 23, 2022

seunlanlege mentioned this issue Sep 23, 2022

use on_finalize for creating new mmr leaves #10907

Closed

acatangiu mentioned this issue Oct 7, 2022

pallet-mmr: improve offchain storage, relax LeafData requirements #12446

Closed

acatangiu removed the E5-breaksapi label Oct 11, 2022

acatangiu closed this as completed Oct 14, 2022

acatangiu reopened this Dec 6, 2022

acatangiu mentioned this issue Dec 6, 2022

[MMR] production ready Merkle Mountain Range pallet (and gadget) paritytech/parity-bridges-common#1636

Closed

13 tasks

acatangiu added U1-asap No need to stop dead in your tracks, however issue should be addressed as soon as possible. and removed U2-some_time_soon Issue is worth doing soon. labels Dec 6, 2022

acatangiu assigned serban300 and Lederstrumpf and unassigned serban300 Dec 6, 2022

Lederstrumpf mentioned this issue Jan 4, 2023

Reset mmr storage #12915

Closed

Lederstrumpf mentioned this issue Jan 18, 2023

construct mmr leaf prior to session pallet hook paritytech/polkadot#6577

Merged

acatangiu closed this as completed Jan 19, 2023

acatangiu mentioned this issue Jan 25, 2024

[MMR] fix MmrLeaf::next_auth_set for session-boundary blocks polkadot-fellows/runtimes#160

Closed

Lederstrumpf mentioned this issue Jan 26, 2024

BEEFY: Rococo⇄Sepolia deployment stalled paritytech/polkadot-sdk#3080

Closed
