Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Feature]: Allow to automatical restart when data node is behind the core but local snapshot is present on the disk #10342

Open
daniel1302 opened this issue Jan 8, 2024 · 0 comments
Labels
feature new feature to be developed

Comments

@daniel1302
Copy link
Contributor

Feature Overview

It is very often that data-node is behind the core when the system is not very healthy for some reason (e.g high traffic, disk IOPS issues, etc...). We can see the followingerror:


Jan 08 14:45:29 api0.vega.community visor[3749951]: vega data node stopped with error: block height on begin block, 30391214, is too high, the height of the last processed block is 30391119

It says data-node last block is 30391119, the core block was 30391214.

You can see that the data node was only 5 blocks behind. However, the snapshot was created at block 30391120.

To restart it We have to specify the core start height it with the --snapshot.load-from-block-height flag or with the config param in the <vega_home>/config/node/config.toml file.

It is not very practical. We can add some config that allows us to find the last available snapshot if available when this error happens.

@daniel1302 daniel1302 added the feature new feature to be developed label Jan 8, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
feature new feature to be developed
Projects
Status: No status
Development

No branches or pull requests

1 participant