aspell: Unicode Normalization
1
1 C.3 Unicode Normalization
1 =========================
1
1 Because Unicode contains a large number of precomposed characters there
1 are multiple ways a character can be represented. For example letter รถ
1 can either be represented as
1
1 U+00F6 LATIN SMALL LETTER O WITH DIAERESIS
1 or
1 U+0061 LATIN SMALL LETTER O + U+0308 COMBINING DIAERESIS
1
1 By performing normalization first, Aspell will only see one of these
1 representations. The exact form of normalization depends on the
1 language. Give the choice of:
1
1 1. Precomposed character
1
1 2. Base letter + combining character(s)
1
1 3. Base letter only
1
1 if the precomposed character is in the target character set, then (1),
1 if both base and combining character is present, then (2), otherwise
1 (3).
1
1 Unicode Normalization is now implemented in Aspell 0.60.
1