iirc, PyShEx failed tests where the schema (or data?) had codepoints > U+FFFD. I stumbled across a repo that I created for dealing with this in Java and JavaScript, both of which use UTF-16 internally and thus require the grammar to be written not in terms of codepoints from U+10000 upward but in terms of surrogate pairs. I don't remember the state of this repo, but it could be handy to clone it and play with the Python rather than experimenting in the larger ShEx g4.
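
For reference, here is a rough Python sketch of the codepoint-to-surrogate-pair mapping that a UTF-16 based grammar has to encode. The function name `to_surrogate_pair` is made up for illustration and is not taken from the repo or from PyShEx; the arithmetic is the standard UTF-16 encoding rule, and the sketch cross-checks it against Python's own `utf-16-be` codec.

```python
import struct


def to_surrogate_pair(codepoint: int) -> tuple[int, int]:
    """Return the (high, low) UTF-16 surrogates for a supplementary codepoint.

    Hypothetical helper for illustration only.
    """
    if not 0x10000 <= codepoint <= 0x10FFFF:
        raise ValueError("only codepoints U+10000..U+10FFFF need a surrogate pair")
    offset = codepoint - 0x10000          # 20-bit value
    high = 0xD800 + (offset >> 10)        # top 10 bits
    low = 0xDC00 + (offset & 0x3FF)       # bottom 10 bits
    return high, low


def check_against_codec(codepoint: int) -> bool:
    """Cross-check the arithmetic against Python's UTF-16 encoder."""
    encoded = chr(codepoint).encode("utf-16-be")  # 4 bytes: two 16-bit units
    return struct.unpack(">HH", encoded) == to_surrogate_pair(codepoint)


if __name__ == "__main__":
    for cp in (0x10000, 0x1F600, 0xEFFFF):
        high, low = to_surrogate_pair(cp)
        print(f"U+{cp:X} -> \\u{high:04X}\\u{low:04X}", check_against_codec(cp))
```

Under this mapping, a codepoint range such as U+10000 to U+EFFFF corresponds to a high surrogate in D800-DB7F followed by a low surrogate in DC00-DFFF, which is the form a Java or JavaScript lexer would need. Python 3 strings are indexed by codepoint, so the Python target should be able to match the codepoint range directly.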