Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

POC to show performance improvements of not copying token #1561

Draft
wants to merge 2 commits into
base: main
Choose a base branch
from

Conversation

alamb
Copy link
Contributor

@alamb alamb commented Nov 26, 2024

This PR was hoping to show that the approach described in this ticket would be effective

While I had this all loaded into my head, I wanted to bash out a PR / POC, but ran out of time.

This PR shows how we could avoid copying tokens

Goals;

  • Sketch out peek_token_ref() and similar functions
  • Use those functions to rewrite the hot paths shown in the flamegraph (parse_prefix, etc. as shown below)
  • Demonstrate improved performance with the benchmark

fixed-flamegraph

fixed-flamegraph

It will take some restructring of parse_prefix to avoid the use of the copying APIs, but I absolutely think it is possible

Sorry, something went wrong.

@davisp
Copy link
Member

davisp commented Dec 10, 2024

main...davisp:datafusion-sqlparser-rs:pd/experiment/less-token-cloning

tl;dr - Roughly 28% speedup on the microbenchmarks compared to main. 18% of that by tweaking a few of the main token methods in the last commit.

I ended up poking at this today and have managed to get about a 28% improvement on the benchmarks over main. Roughly 10% of that is checking for hot paths in the flame graphs and slowly converting those functions over to avoid cloning tokens.

On a whim, I also poked at optimizing the Token::make_word method that does a binary search over all keywords. That appears to have saved about 400ms (of 1.2s total originally attributed to it). I skimmed the docs for phf but I'm not certain what this project's appetite is for new dependencies. It currently seems quite lean so I didn't pursue that just yet.

Lastly, the biggest single commit was the latest that updates a few of the token handling methods themselves to avoid clones. This was the biggest jump at 18% improvement for a total just around 28% (based on my very non-scientific measurements).

Also, one thing I did different than you @alamb, I noted CI flagged your EOF_TOKEN, so I just made mine a static since it doesn't have any initialization.

All in all, this approach certainly seems feasible as well as actually productive. It turns out the hardest part is accounting for error reporting with the expected("message", token) API. I ended up adding an expected_current which doesn't take a token parameter and uses the current index for reporting. That works for now, but going forward I think we're gonna want to figure out a much different error reporting API as its quite fragile (you have to pay attention to which token is being reported and whether or not the token index has been modified). There are also a number of places where the token being reported in errors is incorrect.

@alamb
Copy link
Contributor Author

alamb commented Dec 11, 2024

On a whim, I also poked at optimizing the Token::make_word method that does a binary search over all keywords. That appears to have saved about 400ms (of 1.2s total originally attributed to it). I skimmed the docs for phf but I'm not certain what this project's appetite is for new dependencies. It currently seems quite lean so I didn't pursue that just yet.

I agree we try to keep the dependency set down to a minimum. \

I wonder if we could automatically generate a jumptable with some trickery (maybe we could make a custom match block keyed on individual letters)L

Something like

match word[0] {
  'a' | 'A' => match word[1] {
    'b'| 'B' = > //match the third letter here

We could then make a script, etc to generate this jumptable from the list of keywords 🤔

@alamb
Copy link
Contributor Author

alamb commented Dec 11, 2024

This sounds really neat @davisp

Would you be willing to start making some smaller PRs with parts of your findings?

@davisp
Copy link
Member

davisp commented Dec 11, 2024

Would you be willing to start making some smaller PRs with parts of your findings?

I've opened #1587 for the token cloning work and #1588 for investigating ways to speed up Token::make_word. Looks like I broke an error message though so time to fix that.

@alamb
Copy link
Contributor Author

alamb commented Dec 12, 2024

Would you be willing to start making some smaller PRs with parts of your findings?

I've opened #1587 for the token cloning work and #1588 for investigating ways to speed up Token::make_word. Looks like I broke an error message though so time to fix that.

AMAZING!!! Thank you

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

2 participants