
Plumb unified scheduler for BP minimally #4533

Merged

Conversation


@ryoqun ryoqun commented Jan 19, 2025

Finally (if still minimally), the unified scheduler enters banking stage land.

Many essential functionalities for production use are still missing, but it can now handle the most common case: transaction inflow from the sigverify stage, working as a banking stage.

Extracted from #3946.

@ryoqun ryoqun requested a review from apfitzge January 19, 2025 04:42
@ryoqun ryoqun force-pushed the unified-scheduler-minimal-bp-plumbing branch 5 times, most recently from 1a637e6 to a5190b4 on January 19, 2025 11:38
@ryoqun ryoqun force-pushed the unified-scheduler-minimal-bp-plumbing branch 3 times, most recently from e6e6119 to cf20e54 on January 22, 2025 14:39
@ryoqun ryoqun requested a review from apfitzge January 22, 2025 15:10
@ryoqun (Collaborator, Author) commented on Jan 22, 2025:

@apfitzge thanks for the initial code review.

Not sure how far you've read into this PR, but I largely managed to simplify the code at
71cb2ef ... after marking this PR as ready for review... ;)

I also added docs at cf20e54.

Hopefully, your second code review should be easier with these changes....

Comment on lines +19 to +21
//! 1. Translate the raw packet bytes into some structs
//! 2. Do various sanitization on them
//! 3. Calculate priorities
@ryoqun (Collaborator, Author):

Not in this PR. ;)

//! block production) as much as possible.
//!
//! Lastly, what the callback closure in this module does is roughly as follows:
//! 1. Translate the raw packet bytes into some structs
@ryoqun (Collaborator, Author):

Support for the recently landed TransactionView is out of scope for this PR as well.
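
For illustration, a minimal sketch of the three steps the quoted doc comment outlines. All names below are made up; the real callback closure uses agave's packet and transaction types, and the real sanitization and priority logic is elided.

```rust
// Hypothetical stand-ins for the real packet/transaction types.
struct RawPacket(Vec<u8>);

struct PreparedTx {
    payload: Vec<u8>,
    priority: u64,
}

fn prepare_packets(packets: Vec<RawPacket>) -> Vec<PreparedTx> {
    let mut txs: Vec<PreparedTx> = packets
        .into_iter()
        // Step 1: translate the raw packet bytes into some struct
        // (real deserialization elided).
        .map(|RawPacket(bytes)| bytes)
        // Step 2: do various sanitization (placeholder check only).
        .filter(|bytes| !bytes.is_empty())
        // Step 3: calculate a priority (placeholder: first byte as a fee stub).
        .map(|bytes| PreparedTx { priority: bytes[0] as u64, payload: bytes })
        .collect();
    // Highest-paying first.
    txs.sort_unstable_by(|a, b| b.priority.cmp(&a.priority));
    txs
}
```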

@@ -185,3 +203,97 @@ fn test_scheduler_waited_by_drop_bank_service() {
// the scheduler used by the pruned_bank have been returned now.
assert_eq!(pool_raw.pooled_scheduler_count(), 1);
}

#[test]
fn test_scheduler_producing_blocks() {
@ryoqun (Collaborator, Author):

This test is the only code that calls some of the newly added fns, so there are #[allow(dead_code)] attributes here and there across this PR.

@ryoqun ryoqun force-pushed the unified-scheduler-minimal-bp-plumbing branch from a3e78c2 to 46ac6ab on January 31, 2025 13:29
) {
let mut root_bank_cache = RootBankCache::new(bank_forks.clone());
let unified_receiver = channels.unified_receiver().clone();
let decision_maker = DecisionMaker::new(cluster_info.id(), poh_recorder.clone());
A reviewer commented:

I rugged your PR with #4724.

The decision maker now caches the decision internally for a short period (5 ms), to avoid taking PoH locks at exactly the times worker threads need them most.

I think you should be able to just make this mut and it'll work.
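
A rough sketch of the caching idea (the type and method names below are assumptions for illustration, not the actual DecisionMaker API):

```rust
use std::time::{Duration, Instant};

const DECISION_CACHE_TTL: Duration = Duration::from_millis(5);

// Hypothetical stand-in for the real decision enum.
#[derive(Clone)]
enum Decision {
    Consume,
    Forward,
    Hold,
}

struct CachingDecisionMaker {
    cached: Option<(Instant, Decision)>,
}

impl CachingDecisionMaker {
    // Takes `&mut self` (hence the suggestion to make the binding `mut`):
    // a fresh decision is memoized in place and reused for up to 5 ms,
    // so the poh lock is only taken once the cache has expired.
    fn make_decision(&mut self) -> Decision {
        if let Some((at, decision)) = &self.cached {
            if at.elapsed() < DECISION_CACHE_TTL {
                return decision.clone();
            }
        }
        let fresh = self.poll_poh(); // the lock-taking path, elided
        self.cached = Some((Instant::now(), fresh.clone()));
        fresh
    }

    fn poll_poh(&self) -> Decision {
        Decision::Hold // placeholder
    }
}
```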

let index = task_id_base + packet_index;

let task = helper.create_new_task(transaction, index);
helper.send_new_task(task);
The reviewer commented:

Trying to make sure I understand the flow of transactions and their lifetime, since it is different from replay, where txs should be 1:1 with a bank.

When we receive new packet batches from sigverify, this handler is called; it pops and deserializes packets into tasks, which get sent over a channel here.

  • These are going to the Scheduler.
  • That scheduler does not yet have a SchedulingContext.
  • The scheduler will be given a SchedulingContext.

Does the scheduler start scheduling unconflicted work immediately, or wait for a context to be given?
Scheduled/executed items may be lost in slot transitions, but unscheduled txs will be retained for processing in the next slot?
What's the plan for limiting the number of tasks outstanding in the banking stage scheduler?

@ryoqun (Collaborator, Author) commented on Feb 5, 2025:

Nice question! Maybe I'll accompany the next PR with a doc comment on the transaction lifetime, complete with a fancy chart... lol

Note that the transaction lifetime as of this PR doesn't yet match the following answers.

That said..., this is the planned impl:

> it is different from replay, where txs should be 1:1 with a bank.

Yeah, 1:1 doesn't hold anymore for the banking stage. FYI, Task was designed with this in mind from the start, so that it can be consumed by any recent bank.

> Does the scheduler start scheduling unconflicted work immediately, or wait for a context to be given?

Yes, by locking transactions' addresses internally. Then, if a higher-paying conflicting tx arrives, the scheduler re-locks the addresses with the new one. As soon as it gets a context, it burst-sends all successfully locked, highest-paying-at-the-moment tasks back to the handler threads, literally just with channel.send()s for the lowest latency possible.
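
A greatly simplified sketch of that idea (all names here are hypothetical; the real SchedulingStateMachine also handles read locks, partial conflicts, and blocked-task queues):

```rust
use std::collections::{HashMap, HashSet};
use std::sync::mpsc::Sender;

type Address = [u8; 32];

#[derive(Clone)]
struct Task {
    id: u64,
    priority: u64,
    write_addresses: Vec<Address>,
}

struct ToyStateMachine {
    // The highest-paying task currently holding each address.
    lock_holders: HashMap<Address, Task>,
}

impl ToyStateMachine {
    // Lock addresses eagerly on arrival; a higher-paying conflicting task
    // displaces the current holder (the "re-lock" described above).
    fn buffer_task(&mut self, task: Task) {
        for addr in &task.write_addresses {
            match self.lock_holders.get(addr) {
                Some(holder) if holder.priority >= task.priority => {}
                _ => {
                    self.lock_holders.insert(*addr, task.clone());
                }
            }
        }
    }

    // On receiving a context, burst-send every locked,
    // highest-paying-at-the-moment task to the handler threads:
    // literally just channel sends.
    fn on_new_context(&mut self, to_handlers: &Sender<Task>) {
        let mut sent = HashSet::new();
        for (_, task) in self.lock_holders.drain() {
            if sent.insert(task.id) {
                let _ = to_handlers.send(task);
            }
        }
    }
}
```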

> Scheduled/executed items may be lost in slot transitions, but unscheduled txs will be retained for processing in the next slot?

The unified scheduler won't lose any items in slot transitions, considering these could be the highest-paying transactions. At those times, the unified scheduler behaves just as it does before a context is given: it continuously reorders its per-address priority queues (UsageQueue) until the very moment of the next context.

The trick here is that SchedulingStateMachine can trivially be carried over to subsequent contexts without emptying its tasks. This behavior is unlike the currently-existing block verification mode, so you might need to wrap your head around it a bit to grok the next PR...
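
To illustrate the contrast, a toy sketch (names made up) of the two session-end behaviors:

```rust
// In block verification, tasks are 1:1 with a bank, so the machine is
// effectively reset at each session end; in block production the same
// instance, with its still-buffered tasks, rolls into the next context.
struct StateMachine {
    buffered_tasks: Vec<u64>, // task ids, as a stand-in
}

enum Mode {
    BlockVerification,
    BlockProduction,
}

fn end_session(machine: StateMachine, mode: Mode) -> StateMachine {
    match mode {
        // Verification: nothing survives the session.
        Mode::BlockVerification => StateMachine { buffered_tasks: Vec::new() },
        // Production: unscheduled tasks are retained for the next context.
        Mode::BlockProduction => machine,
    }
}
```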

> What's the plan for limiting the number of tasks outstanding in the banking stage scheduler?

So, given that SchedulingStateMachine is now stateful across banks, it just maintains the number of outstanding tasks and pops off any excess.
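
A sketch of the planned cap (the limit constant and names are made-up placeholders; the thread doesn't specify an actual limit):

```rust
use std::cmp::Reverse;
use std::collections::BinaryHeap;

// Hypothetical cap on outstanding tasks.
const MAX_OUTSTANDING_TASKS: usize = 100_000;

struct BoundedTaskBuffer {
    // Min-heap on priority, so the lowest-paying task is popped first.
    tasks: BinaryHeap<Reverse<(u64 /* priority */, u64 /* task id */)>>,
}

impl BoundedTaskBuffer {
    fn insert(&mut self, priority: u64, task_id: u64) {
        self.tasks.push(Reverse((priority, task_id)));
        // Maintain the outstanding count and pop off any excess,
        // dropping the cheapest tasks first.
        while self.tasks.len() > MAX_OUTSTANDING_TASKS {
            let Reverse((_priority, _task_id)) = self.tasks.pop().unwrap();
            // e.g. record the drop in metrics here
        }
    }
}
```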

Lastly, as of this PR, transactions are buffered at banking_packet_receiver until a context is given, and SchedulingStateMachine is emptied at session end. So it's quite sub-optimal. ;)

The reviewer replied:

> The trick here is that SchedulingStateMachine can trivially be carried over to subsequent contexts without emptying its tasks. This behavior is unlike the currently-existing block verification mode, so you might need to wrap your head around it a bit to grok the next PR...

Yeah, I think I understand; it makes sense after reading the comments on the SchedulingContext. I mainly wanted to clarify this point.
AFAICT we'd still lose transactions in the forking events I described, since they'd be on banks we abandoned and (at least for now) not re-inserted (we probably should do this). <-- out of scope for this PR, and probably even for the scheduler behavior imo

> Lastly, as of this PR, transactions are buffered at banking_packet_receiver until a context is given, and SchedulingStateMachine is emptied at session end. So it's quite sub-optimal. ;)

Thanks for the detailed explanations. I think the general path is correct so far, but obviously this PR is minimized and not ready for prod, as you stated. Just trying to lay out here (for myself) what still needs to happen:

  1. Need to ingest packets earlier, before being given a context; otherwise it's at risk of OOM. If too early, just drop them.
  2. Is there any concept of retryable transactions in unified-scheduler? I think natively these cannot occur, but if jito is running, they will steal locks your scheduler expects AND reserve CUs the scheduler thinks you might have.

@ryoqun (Collaborator, Author):

> AFAICT we'd still lose transactions in the forking events I described, since they'd be on banks we abandoned and (at least for now) not re-inserted (we probably should do this). <-- out of scope for this PR, and probably even for the scheduler behavior imo

I agree. Forking events aren't supported by the unified scheduler.

>   1. Need to ingest packets earlier, before being given a context; otherwise it's at risk of OOM. If too early, just drop them.

Here it is... :) #4949

>   2. Is there any concept of retryable transactions in unified-scheduler? I think natively these cannot occur, but if jito is running, they will steal locks your scheduler expects AND reserve CUs the scheduler thinks you might have.

Nice question. This will be addressed in yet another upcoming PR...

@ryoqun ryoqun force-pushed the unified-scheduler-minimal-bp-plumbing branch from 46ac6ab to 31500c7 on February 5, 2025 04:08
@ryoqun ryoqun force-pushed the unified-scheduler-minimal-bp-plumbing branch from 7889256 to 9c48731 on February 5, 2025 06:23
@diman-io commented on Feb 5, 2025:

@ryoqun Hi, sorry for pinging you here, but it looks like the other channels aren’t working. Just wanted to make sure you saw #4211

@ryoqun ryoqun requested a review from apfitzge February 5, 2025 14:25
@ryoqun ryoqun merged commit 5a7852a into anza-xyz:master Feb 6, 2025
77 checks passed