-
-
Notifications
You must be signed in to change notification settings - Fork 955
Filename‐Trouble with Unicode
toraxmalu edited this page Apr 16, 2024
·
4 revisions
The Latin alphabet also includes diacritics, i.e. á, ç, ä, ñ. Unicode offers two variants to represent these characters:
- NFC (Normalization Form Composed): character with diacritics is encoded as a single character.
- NFD (Normalization Form Decomposed): base character and diacritics are stored separately.
Examples for 1st and 2nd form:
- á => a + ´
- ç => c + ¸
- ä => a + ̈
- ñ => n + ~
iOS uses APFS, which stores such characters always in the 2nd, decomposed, form.
In normal operation file names are read as simple byte sequences. Trouble starts, where they where interpreted…
My situation:
- A folder was mounted via
mount -f ios . workCopy
- check-out via
svn
into the iOS-folder - A directory name contained an "ü"
- SVN worked without any hick-up in checkout
The trouble began with use of svn status
or svn commit
: They recognized the directory in two variants:
- "original no longer exists"
- "not versioned" SVN searched for the directory name with "ü" but only found the variant with "u¨" on the iOS directory.
And before i forget: Neither the file-manager nor cp
converting the filename while copying or renaming. Don't use diacritics in names for folders or files! And there is another trouble lurking in the shadows: File content, if the app / command is not converting…
- Contributing to iSH Development
- How to add a new Character Device to iSH
- How to add a new entry to /proc/ish
- Fixing hostname localhost
- Running nmap
- Running Ruby Programs
- Installing PHP with a TLS certificate and a PHP filemanager
- Installing R and any package from the CRAN
- iSH Alpine Release Issues
- Using Alpine Linux repositories
- Upgrading to a new release
- Install & Activate Alternate Filesystems