Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Cool! #1

Open
rw86347 opened this issue Jul 31, 2023 · 1 comment
Open

Cool! #1

rw86347 opened this issue Jul 31, 2023 · 1 comment

Comments

@rw86347
Copy link

rw86347 commented Jul 31, 2023

did you get anywhere with this?

Did you have spaces and punctuation in the text?

@alilland
Copy link
Owner

alilland commented Jul 31, 2023

Haven’t stepped back into it in a few weeks, I spoke with a math professor based in Israel who published an article in the 1980’s while he worked as a math professor for the University of Southern California. He got back to me very quickly and included the dataset he used in the email, but what he sent me was a pre parsed copy of the Torah converting Hebrew words to English letters. Not something I could use as an unbiased starting point.

I haven’t spent more time hunting for others who have done it yet, but eventually I’ll give it another try

I did give it an effort to strip out spaces, non Hebrew characters and stripped out phonetic additions, I just want to make sure I have the right starting text, which you can take a look under src/index.ts

the one I’m starting with is the earliest and most complete Old Testament, but not necessarily the oldest copy of the Torah, not that it should make a difference theoretically, but I want to spend more time on it.

this was all to write an article for one of my websites and I’ve just been distracted on other projects

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants