-
Notifications
You must be signed in to change notification settings - Fork 4
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[foliatextcontent] Implement adding markup information in the text that points to the substrings #23
Comments
|
Awesome, thanks so much for making this enhancement! Just wondering (not requesting), would this enable manual correction operations, at least partly, such as: -->
(Once the PAGE-XML to FoLiA converter is there, I could use ucto and generate a test file --please let me know I could do st more.) |
(I don't think this relates directly to this issue, which is about substrings (arbitrary references on untokenised text)) I assume you refer to manual annotation in FLAT, and editing corrections in FLAT indeed only works on the token-level. If a tokenised document is available with all the markup information present then the procedure you described would work for the first three steps yes, but the fourth is still an issue as FLAT doesn't support annotating markup (e.g. style) yet, the markup support in FLAT is limited to viewing currently. The other caveat is preserving all the markup information after tokenisation, ucto currently doesn't do that. You're currently stuck with the markup information on mostly the paragraph level. Neither TICCL nor ucto propagate it to deeper levels, which is what you need if you want to correct it in FLAT. I had already opened a related issue to implement this specific functionality in foliatextcontent: #19 .. The good news is that this should all be automatically resolvable. |
Awesome, thanks! Sorry about commenting at the wrong issue, I meant indeed the functionality of FLAT. |
This is needed for proycon/flat#92 . There is already an option for this in foliatextcontent but it doesn't seem to work yet in all cases , most specifically, the case where the text content is already present rather than generated by foliatextcontent.
The text was updated successfully, but these errors were encountered: