
poll_* methods to support custom futures implementations #78

Open · wants to merge 27 commits into base: main

Conversation

@dgrr commented May 29, 2024

No description provided.

@dgrr (Author) commented May 29, 2024

I'll implement the poll_* methods for WebSocketRead & WebSocketWrite.
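A minimal, self-contained illustration of the poll_* pattern in question (a toy type, not fastwebsockets' actual API): the poll form can be driven from hand-written Future or Stream implementations, and an async counterpart falls out via std::future::poll_fn.

```rust
use std::task::{Context, Poll};

struct Counter {
    remaining: u32,
}

impl Counter {
    // Poll-style method: no async fn, so callers can drive it from their
    // own Future/Stream implementations.
    fn poll_tick(&mut self, cx: &mut Context<'_>) -> Poll<()> {
        if self.remaining == 0 {
            Poll::Ready(())
        } else {
            self.remaining -= 1;
            cx.waker().wake_by_ref(); // ask the executor to poll us again
            Poll::Pending
        }
    }

    // The async counterpart comes for free via std::future::poll_fn.
    async fn tick(&mut self) {
        std::future::poll_fn(|cx| self.poll_tick(cx)).await
    }
}
```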

@dgrr (Author) commented May 29, 2024

btw, if you implement Stream + Sink from futures you can call split. It's difficult for Stream to return a Frame<'_>, or maybe impossible
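For reference, the futures Stream trait fixes Item as a plain associated type, with nothing tying its lifetime to the exclusive borrow taken by poll_next, which is why yielding Frame<'_> is not expressible:

```rust
// The futures-core Stream trait (paraphrased): Item cannot borrow from
// the stream itself across poll_next calls.
pub trait Stream {
    type Item;

    fn poll_next(
        self: std::pin::Pin<&mut Self>,
        cx: &mut std::task::Context<'_>,
    ) -> std::task::Poll<Option<Self::Item>>;
}
```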

Cargo.toml Outdated

```toml
[features]
default = ["simd"]
simd = ["simdutf8/aarch64_neon"]
upgrade = ["hyper", "pin-project", "base64", "sha1", "hyper-util", "http-body-util"]
unstable-split = []
```
Member:

Is it intentional to remove `unstable-split`? I would like to keep it.

Author:

Sure, I can put it back in

Author:

well, the question is, why is it unstable?

@matheus23 (Contributor) commented Jun 20, 2024

Would love to see this land. This library would become a great alternative to tokio-tungstenite and tokio-websockets in that case, especially since it supports true, lock-free stream splitting.

@matheus23 (Contributor):

FWIW, it'd be nice to have the same poll_ treatment for FragmentCollectorRead 👀 😁

@matheus23 (Contributor) commented Jun 24, 2024

> btw, if you implement Stream + Sink from futures you can call split. It's difficult for Stream to return a Frame<'_>, or maybe impossible

This is true, but internally it simply creates two references to the same thing using a BiLock, so you end up locking every time you use either the Stream or the Sink.
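What that looks like with futures 0.3, sketched as a generic helper (only the Stream + Sink bounds assumed):

```rust
use futures::stream::{SplitSink, SplitStream};
use futures::{Sink, Stream, StreamExt};

// StreamExt::split shares the underlying `ws` between both halves through
// a BiLock, so every poll on either half acquires that lock first.
fn split_ws<T, W>(ws: W) -> (SplitSink<W, T>, SplitStream<W>)
where
    W: Stream + Sink<T>,
{
    ws.split()
}
```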

Having a "true" split implementation that is lock free would be a lot cooler! :)


Also, while it's impossible to write an implementation of Stream<Item = Frame<'???>>, it's possible to map the Frame<'f> into your own domain-specific application struct that doesn't have a lifetime, e.g. by parsing binary and text frames into your own types, before returning them in your stream implementation as Items.

Just spelling this out for anyone reading this ✌️
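A self-contained sketch of that mapping, with simplified stand-ins for the crate's Frame and OpCode types (not their actual definitions):

```rust
// Simplified stand-ins; fastwebsockets' real types differ.
enum OpCode {
    Text,
    Binary,
    Other,
}

struct Frame<'f> {
    opcode: OpCode,
    payload: &'f [u8],
}

// An owned message with no lifetime, safe to yield as a Stream Item.
enum Message {
    Text(String),
    Binary(Vec<u8>),
}

fn to_owned_message(frame: Frame<'_>) -> Option<Message> {
    match frame.opcode {
        OpCode::Text => std::str::from_utf8(frame.payload)
            .ok()
            .map(|s| Message::Text(s.to_owned())),
        OpCode::Binary => Some(Message::Binary(frame.payload.to_vec())),
        OpCode::Other => None, // control frames handled separately
    }
}
```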

@dgrr (Author) commented Jun 24, 2024

> Having a "true" split implementation that is lock free would be a lot cooler! :)

How is this not lock free? If you use tokio::io::split it will use a Mutex internally.

> Also, while it's impossible to write an implementation of Stream<Item = Frame<'???>>, it's possible to map the Frame<'f> into your own domain-specific application struct that doesn't have a lifetime, e.g. by parsing binary and text frames into your own types, before returning them in your stream implementation as Items.

Yes, that's mostly what I do, because I want my code to be compatible with futures::Stream and Sink. I shouldn't, but I had to code it in a bit of a rush 👀

@matheus23 (Contributor):

> Having a "true" split implementation that is lock free would be a lot cooler! :)

> How is this not lock free? If you use tokio::io::split it will use a Mutex internally.

Yeah, no, I'm agreeing with you; that's exactly what I'm trying to say. You wrote "if you implement Stream + Sink from futures you can call split", but that gives you worse performance than having the stream and sink already split from the start ✌️ (which this PR enables/makes easier!)

@dgrr (Author) commented Jun 24, 2024

> Yeah, no, I'm agreeing with you; that's exactly what I'm trying to say. You wrote "if you implement Stream + Sink from futures you can call split", but that gives you worse performance than having the stream and sink already split from the start ✌️ (which this PR enables/makes easier!)

Ah yes, but it is mainly for compatibility, because sometimes it might be useful to use futures::StreamExt. Anyway, users can implement it on their own, given that Frame<'_> cannot be returned as the Item in futures::Stream.

@dgrr (Author) commented Jun 24, 2024

@matheus23 about poll in FragmentCollectorRead, what would you expect? Do you also expect a callback, or should it return early with the mandatory reply frame?

@matheus23 (Contributor):

Personally, I'd highly prefer returning the mandatory frame :)

@dgrr (Author) commented Jun 24, 2024

> Personally, I'd highly prefer returning the mandatory frame :)

Indeed. Me too.
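One hypothetical shape for such a callback-free API (names invented here, not from the PR): the poll-based read hands the mandatory reply back to the caller instead of invoking a send callback.

```rust
// Hypothetical sketch, generic over the crate's frame type.
enum PollRead<F> {
    /// A complete application frame (text/binary), ready for the caller.
    Frame(F),
    /// A mandatory reply (e.g. a Pong answering a Ping) that the caller
    /// must write to the send half before polling for the next frame.
    ObligatedSend(F),
}
```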

@littledivy (Member) left a comment

Thanks for the PR, and sorry for the slow replies. I hope you don't mind me taking some time here, as this is a relatively big change :)

It seems there is a regression in the echo_server benchmark with larger payloads:

```
# this pr
$ ./load_test 10 0.0.0.0 8080 0 0 102400
Msg/sec: 35997.750000
Msg/sec: 35021.000000

# main
$ ./load_test 10 0.0.0.0 8080 0 0 102400
Msg/sec: 42045.750000
Msg/sec: 42146.500000
```

(measured on an M1 MacBook; similar numbers on an x64 Linux server)

@matheus23 (Contributor):

Just as an FYI, I've been trying to get the benchmarks to compile & run here on my NixOS to help diagnose the regression, but I'm still fighting my way through with the linker. Will of course post once I've got something.

@dgrr (Author) commented Jun 30, 2024

> Just as an FYI, I've been trying to get the benchmarks to compile & run here on my NixOS to help diagnose the regression, but I'm still fighting my way through with the linker. Will of course post once I've got something.

I managed to reproduce the benchmarks (on Linux rather than macOS), but I cannot find where the regression is.

Two resolved review threads on src/lib.rs (outdated).
Co-authored-by: Conrad Ludgate <[email protected]>
src/lib.rs Outdated
```diff
@@ -197,7 +259,7 @@ pub(crate) struct WriteHalf {
   vectored: bool,
   auto_apply_mask: bool,
   writev_threshold: usize,
-  write_buffer: Vec<u8>,
+  buffer: BytesMut,
```


Using Vec is faster in my testing. To replicate `buffer.advance(written)` you can do `buffer.splice(..written, [0u8; 0])`. Because this is expensive, what I found works better is to instead keep a `buf_pos: usize` that I increment, and only in `start_send` run

```rust
self.buffer.splice(..self.buf_pos, [0u8; 0]);
self.buf_pos = 0;
```

before the `fmt_head` call that writes the frame into the buffer. This is because `BytesMut::advance` seems to be kinda expensive.
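A self-contained sketch of that deferred-compaction idea (hypothetical field and method names following the comment above, not necessarily the PR's actual code):

```rust
// Hypothetical sketch: track how much of the buffer has been flushed and
// compact lazily, once per frame, instead of after every partial write.
struct WriteHalf {
    buffer: Vec<u8>, // pending bytes to write
    buf_pos: usize,  // leading bytes already written to the socket
}

impl WriteHalf {
    /// Record that `written` bytes were flushed, without moving memory.
    fn consume(&mut self, written: usize) {
        self.buf_pos += written;
    }

    /// Called once per outgoing frame (start_send in the Sink sense):
    /// drop the already-written prefix in a single memmove, then append
    /// the new frame to the buffer.
    fn start_send(&mut self) {
        if self.buf_pos > 0 {
            self.buffer.splice(..self.buf_pos, [0u8; 0]);
            self.buf_pos = 0;
        }
        // ...frame header + payload encoding appends to self.buffer here
    }
}
```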

@conradludgate commented Jul 1, 2024

Further testing shows that, for large buffers, removing vectored write support was a significant source of the regression. Now the difference is just 43500 Msg/sec compared to 44000 Msg/sec.

Opened a PR against this PR 😅 dgrr#1
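For context on the vectored-write point: with vectored I/O the frame header and payload can go to the kernel as separate IoSlices in one call, instead of first copying the payload into one contiguous buffer. A minimal std sketch (tokio's AsyncWrite exposes the analogous poll_write_vectored):

```rust
use std::io::{IoSlice, Result, Write};

// Write header and payload in one vectored call; real code must loop,
// since write_vectored may accept only part of the slices.
fn write_frame(out: &mut impl Write, head: &[u8], payload: &[u8]) -> Result<usize> {
    out.write_vectored(&[IoSlice::new(head), IoSlice::new(payload)])
}
```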

@dgrr (Author) commented Jul 6, 2024

@littledivy it seems that this PR is OK now, thanks to the contributions of @conradludgate.

@ShabbirHasan1:

@littledivy All checks have passed successfully 🚀. Could you please review and merge this PR at your convenience 🙏.

Thank you @dgrr and @conradludgate for your kind contributions in making this library even better. 🫡

@dgrr (Author) commented Sep 1, 2024

@littledivy

github-merge-queue bot pushed a commit to n0-computer/iroh that referenced this pull request Dec 4, 2024

## Description

Before this change we depended on both tungstenite version 0.21 and 0.24, because:
```
tungstenite v0.21.0
└── tokio-tungstenite v0.21.0
    └── tokio-tungstenite-wasm v0.3.1
        ├── iroh v0.29.0 (/home/philipp/program/work/iroh/iroh)
        └── iroh-relay v0.29.0 (/home/philipp/program/work/iroh/iroh-relay)
            ├── iroh v0.29.0 (/home/philipp/program/work/iroh/iroh)
            └── iroh-net-report v0.29.0 (/home/philipp/program/work/iroh/iroh-net-report)
                └── iroh v0.29.0 (/home/philipp/program/work/iroh/iroh)
tungstenite v0.24.0
└── tokio-tungstenite v0.24.0
    ├── iroh v0.29.0 (/home/philipp/program/work/iroh/iroh)
    └── iroh-relay v0.29.0 (/home/philipp/program/work/iroh/iroh-relay)
        ├── iroh v0.29.0 (/home/philipp/program/work/iroh/iroh)
        └── iroh-net-report v0.29.0 (/home/philipp/program/work/iroh/iroh-net-report)
            └── iroh v0.29.0 (/home/philipp/program/work/iroh/iroh)
```

Basically, `tokio-tungstenite-wasm` pulls in `0.21` and there's no newer version of it yet, but we had updated all our dependencies, including `tungstenite`, which duplicated it.

## Notes & open questions


I want this to be temporary until we can finally switch to `fastwebsockets` entirely once it implements [`poll`-based methods](denoland/fastwebsockets#78) (but I worry the project's maintenance is ... unclear).

I checked the [tungstenite
changelog](https://github.com/snapview/tungstenite-rs/blob/master/CHANGELOG.md),
and it doesn't look like there's anything critical in there. The
`rustls` update doesn't affect us - we don't duplicate rustls versions
after this rollback.

## Change checklist

- [x] Self-review.
- [x] Documentation updates following the [style
guide](https://rust-lang.github.io/rfcs/1574-more-api-documentation-conventions.html#appendix-a-full-conventions-text),
if relevant.
- ~~[ ] Tests if relevant.~~
- [x] All breaking changes documented.