You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Discrepancies between interpretations of the 1997 and the 2012 Lao romanisations tables may need to be resolved. There is divergent practice that would affect mappings. See a draft note.
The Lao -> Latin mapping is a sieve, lots of data is lost, making a Latin -> Lao mapping problematic at the least. Lao, Thai and various other mappings would be better served by developing machine learning models for them. In the absence of an ML model, next best approach would be to base the assigned Latin -> Lao mapping on character frequencies based on analysis of a Lao corpus. It will also require Lao syllable boundary identification to distinguish syllable initial and final consonants.
Alternatively, it may be easier to map complete syllables form Lao -> Latin. Romanised syllables to Lao are a one-to-many mapping.
An example of many-to-one syllable mappings. But the situation becomes more complex with syllable final consonants.
We are exploring the use of Aksharamukha embedded in Scriptshifter for some East Asian and Southeast Asian languages. Currently there is experimental support for Bengali, Burmese, Devanagari, Gurmukhi, Japanese (Katakana + Hiragana - but slated for removal because not accurate enough), Tamil (+ Brahmi + extended), Thai, Tibetan via Aksharamukha.
Lao (in two versions) seems to be supported in Aksharamukha but we haven't tested it yet. If you were able to confirm the accuracy of Lao transliteration I could very easily add support for that in Scriptshifter.
Roman to Script transliteration support has a lower priority than Script to Roman at the moment. So if S2R transliteration is not reliable we can disable it on a script-by-script basis.
Add support for Lao: https://www.loc.gov/catdir/cpso/romanization/lao.pdf
The text was updated successfully, but these errors were encountered: