Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Stuck with the creation of tidy_pmi #184

Open
marcelbaumgartner opened this issue Nov 1, 2022 · 2 comments
Open

Stuck with the creation of tidy_pmi #184

marcelbaumgartner opened this issue Nov 1, 2022 · 2 comments

Comments

@marcelbaumgartner
Copy link

Just discovered this wonderful book. Amazing job, congratulations! In Chapter 5, I try to create the tidy_pmi tibble, using the slide_windows function you created. Using a Windows machine with 16GB memory. But I never managed to end the calculation. How long does this normally take? Any suggestions to adapt the parallel processing.

sessionInfo()
R version 4.2.1 (2022-06-23 ucrt)
Platform: x86_64-w64-mingw32/x64 (64-bit)
Running under: Windows 10 x64 (build 19042)

@EmilHvitfeldt
Copy link
Owner

EmilHvitfeldt commented Nov 1, 2022

Hello @marcelbaumgartner!! This part of the code does indeed take a long time to run. We extracted out that bit of code to be cached separately than everything else as you can see here https://github.com/EmilHvitfeldt/smltar/blob/master/inst/tidy_pmi.R.

This is definitely a task that takes hours rather than minutes. One thing that might help you is to downsample the data a bit. If you downsample to 1% data you could extrapolate how long the full runtime would be.

@marcelbaumgartner
Copy link
Author

Thanks, appreciate your quick response. Clear, will try it on a smaller sample, and will then run it for an entire night :). Best regards from Switzerland, Marcel

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants