go/runtime/history: Ensure WatchBlocks emits increasing round sequence (no gaps) #6079

martintomazic · 2025-02-23T20:00:43Z

What:
After the initial history reindex, a node without local storage will notify subscribers about every block (even during a subsequent reindex), ensuring sequential block rounds, just as is the case when the node has local storage.**

Finally, we simplify the roothash.BlockHistory interface as discussed here: #6050 (comment)

**For comparison, a node with local storage first waits for the initial reindex to complete, and then starts syncing rounds with worker.storage.committee.Node, fetching missing rounds one by one and notifying them with StorageSyncCheckpoint. A subscriber in this scenario always receives blocks with no gaps. Finally, if they subscribe right after the first reindex is done, they will still receive most blocks from the initial reindex (since storage sync is slower than history reindex).
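The notification rule described above can be sketched as follows. This is a minimal, simplified model and not the actual oasis-core code: the `history` struct, its fields, the `notified` slice standing in for the block notifier, and the `sequential` helper are all assumptions made for illustration.

```go
package main

import "fmt"

// history is a hypothetical, simplified stand-in for the runtime history keeper.
type history struct {
	hasLocalStorage    bool
	initialReindexDone bool
	notified           []uint64 // rounds delivered to subscribers, in order
}

// Commit stores a block for the given round. It notifies subscribers only
// when the node has no local storage and the initial reindex has finished.
func (h *history) Commit(round uint64) {
	if h.hasLocalStorage || !h.initialReindexDone {
		return
	}
	h.notified = append(h.notified, round)
}

// StorageSyncCheckpoint notifies subscribers on nodes with local storage,
// where rounds are synced one by one.
func (h *history) StorageSyncCheckpoint(round uint64) {
	if !h.hasLocalStorage {
		return
	}
	h.notified = append(h.notified, round)
}

// ReindexFinished marks the initial reindex as done.
func (h *history) ReindexFinished() {
	h.initialReindexDone = true
}

// sequential reports whether each round follows the previous one with no gaps.
func sequential(rounds []uint64) bool {
	for i := 1; i < len(rounds); i++ {
		if rounds[i] != rounds[i-1]+1 {
			return false
		}
	}
	return true
}

func main() {
	h := &history{} // node without local storage
	for r := uint64(1); r <= 3; r++ {
		h.Commit(r) // initial reindex: no notifications
	}
	h.ReindexFinished()
	for r := uint64(4); r <= 8; r++ {
		h.Commit(r) // later reindexes and live blocks: every round is notified
	}
	fmt.Println(h.notified, sequential(h.notified))
}
```

In this model a subscriber on a node without local storage sees every round committed after the initial reindex, which is the property the PR aims for.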

Why:
#6050 (comment) && #6050 (comment)

Possible considerations
We might as well notify during the first reindex (?), which would simplify things further. Not sure why the current code in master prevents notification during every reindex (on a node without local storage).

netlify · 2025-02-23T20:01:00Z

✅ Deploy Preview for oasisprotocol-oasis-core canceled.

Name	Link
🔨 Latest commit	`80cfc76`
🔍 Latest deploy log	https://app.netlify.com/sites/oasisprotocol-oasis-core/deploys/67bb939f7853ef0008477e9d

martintomazic · 2025-02-23T20:02:16Z

go/runtime/history/history.go

 type History interface {
 	roothash.BlockHistory



Technically this interface (History) is no longer needed, and clients could reference the struct directly?

martintomazic · 2025-02-23T20:04:51Z

Draft: If reviewers agree I would prefer to get this in before #6050, possibly in parallel. :)

This method was only used for testing, and is redundant since Commit correctly updates LastConsensusHeight.

Fixes an existing problem where clients of WatchBlocks may receive blocks with non-sequentially increasing rounds.

martintomazic · 2025-02-24T10:54:47Z

go/runtime/history/history.go

+	// If no local storage worker, and not during initial history reindex,
+	// notify the block watcher that new block is committed.
+	// Otherwise the storage-sync-checkpoint will do the notification.
+	if h.hasLocalStorage || !h.reindexDone {


Use the existing mutex for h.reindexDone, or don't protect it at all, since it is currently only written from one thread and can only be overwritten to true. Not sure what the best practice is in such a scenario; probably protect everything to prevent future bugs if someone extends this? :)
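For reference, the "protect everything" option could look roughly like the sketch below. The `reindexState` type and its method names are hypothetical and do not exist in the PR; the point is only that guarding a one-way flag with a mutex is cheap and stays correct if more writers or readers are added later.

```go
package main

import (
	"fmt"
	"sync"
)

// reindexState is a hypothetical holder for a flag that only ever flips
// from false to true.
type reindexState struct {
	mu   sync.RWMutex
	done bool
}

// markDone sets the flag. Calling it repeatedly or concurrently is safe.
func (s *reindexState) markDone() {
	s.mu.Lock()
	defer s.mu.Unlock()
	s.done = true
}

// isDone reads the flag under the read lock.
func (s *reindexState) isDone() bool {
	s.mu.RLock()
	defer s.mu.RUnlock()
	return s.done
}

func main() {
	var s reindexState
	var wg sync.WaitGroup
	for i := 0; i < 4; i++ {
		wg.Add(1)
		go func() {
			defer wg.Done()
			s.markDone() // concurrent writers are serialized by the mutex
		}()
	}
	wg.Wait()
	fmt.Println(s.isDone())
}
```

An `atomic.Bool` from `sync/atomic` (Go 1.19+) would be a lighter alternative if the flag never needs to be updated together with other fields.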

peternose

I think that this solution is not simple and we should find a better one. Unless someone convinces me otherwise.

peternose · 2025-02-25T10:56:20Z

go/roothash/api/history.go

-	// Passing the special value `RoundLatest` will return results for the latest round.
-	GetRoundResults(ctx context.Context, round uint64) (*RoundResults, error)
+	// Calling this methods more then once has no additional side effect.
+	ReindexFinished()


Reindexing is happening one level higher, so the BlockHistory should know nothing about it. Therefore, this doesn't belong here.

We agreed to skip this for now in private and try to rework this sometime in the future. Leaving comment for when we proceed.

I still believe only non-storage related methods can stay in roothash.api.BlockHistory since roothash service should not know anything about "nodes" and "(local) storage": fffd403 is probably a good step?

I agree ReindexFinished is probably off, although notify is probably off for the same reason, given that we only use it to modify the behaviour of WatchBlocks: 1. WatchBlocks should not be part of roothash.api.BlockHistory, since it depends on storage syncing if the node has local storage, and 2. it is misleading, since with notify=false blocks will still eventually be notified if the node has local storage.

peternose · 2025-02-25T10:57:03Z

go/runtime/history/history.go

 		return nil
 	}
 	h.blocksNotifier.Broadcast(blk)

 	return nil
 }

-func (h *runtimeHistory) ConsensusCheckpoint(height int64) error {
-	return h.db.consensusCheckpoint(height)
+func (h *runtimeHistory) ReindexFinished() {


This could be private.

peternose · 2025-02-25T10:59:47Z

go/consensus/cometbft/roothash/roothash.go

@@ -702,6 +703,10 @@ func (sc *serviceClient) processFinalizedEvent(
 				return fmt.Errorf("failed to reindex blocks: %w", err)
 			}
 			tr.reindexDone = true
+			if !tr.initialReindexDone {
+				tr.initialReindexDone = true
+				tr.blockHistory.ReindexFinished()


Not a simple solution. See how many ifs and flags we have in this method. We should refactor this code to make it more readable, not to make it more complex.

Agree this is not best, cross-referencing this issue/thread for the next steps.

Technically ReindexFinished is idempotent, so these 3 lines could be replaced by tr.blockHistory.ReindexFinished(). #6050 removed some of those flags. Finally, we could also factor out the reindexing part into a helper function, making the code even simpler.

An alternative is to use the existing notify=true for all reindexes after the initial one. Still not the cleanest, but likely the simplest.

By far the best option is to refactor as proposed in the issue (?).
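To illustrate the idempotency point from this thread: if ReindexFinished only flips a flag, the caller does not need its own initialReindexDone guard. The sketch below uses simplified stand-in types; `finishReindex` is a hypothetical helper name, and the structs only mirror the field names visible in the diff, not the real definitions.

```go
package main

import "fmt"

// blockHistory is a simplified stand-in for the history implementation.
type blockHistory struct {
	initialReindexDone bool
}

// ReindexFinished is idempotent: repeated calls leave the same state.
func (h *blockHistory) ReindexFinished() {
	h.initialReindexDone = true
}

// trackedRuntime is a simplified stand-in for the per-runtime tracking state.
type trackedRuntime struct {
	reindexDone  bool
	blockHistory *blockHistory
}

// finishReindex is a hypothetical helper that would replace the guarded
// three-line block with a single unconditional call.
func (tr *trackedRuntime) finishReindex() {
	tr.reindexDone = true
	tr.blockHistory.ReindexFinished()
}

func main() {
	tr := &trackedRuntime{blockHistory: &blockHistory{}}
	tr.finishReindex() // initial reindex
	tr.finishReindex() // a later reindex: no extra guard needed
	fmt.Println(tr.reindexDone, tr.blockHistory.initialReindexDone)
}
```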

martintomazic mentioned this pull request Feb 23, 2025

go/roothash: Batch history reindex writes #6050

Merged

martintomazic added 4 commits February 23, 2025 21:16

go/runtime/history: Remove unused ConsensusCheckpoint method

c9497ca

This method was only used for testing, and is redundant since Commit correctly updates LastConsensusHeight.

go/runtime/history: Remove unused errNopHistory

0091ec8

go/roothash: Remove storage related methods from BlockHistory

fffd403

go/runtime/history: Ensure WatchBlocks emits sequential rounds

80cfc76

Fixes an existing problem where clients of WatchBlocks may receive blocks with non-sequentially increasing rounds.

martintomazic force-pushed the martin/internal/rework-roothash-block-history-interface branch from 1044e1a to 80cfc76 Compare February 23, 2025 21:31

martintomazic marked this pull request as ready for review February 23, 2025 22:01

martintomazic requested review from kostko, peterjgilbert, pro-wh, ptrus and peternose as code owners February 23, 2025 22:01

martintomazic self-assigned this Feb 25, 2025

peternose requested changes Feb 25, 2025

martintomazic mentioned this pull request Feb 28, 2025

Fix runtime.history.History.Watchblocks to emit sequential blocks (no gaps) #6085

Open
