11 Facts About Unicode equivalence

1.

Unicode equivalence is the specification by the Unicode character encoding standard that some sequences of code points represent essentially the same character.

FactSnippet No. 1,634,454
2.

The code point U+006E followed by U+0303 is defined by Unicode to be canonically equivalent to the single code point U+00F1.

FactSnippet No. 1,634,455
3.

For each of the two equivalence notions, Unicode defines two normal forms, one fully composed, and one fully decomposed.

FactSnippet No. 1,634,456
4.

Compatibility or other reasons, Unicode equivalence sometimes assigns two different code points to entities that are essentially the same character.

FactSnippet No. 1,634,457
5.

Consistency with some older standards, Unicode equivalence provides single code points for many characters that could be viewed as modified forms of other characters or as combinations of two or more characters.

FactSnippet No. 1,634,458
6.

Consistency with other standards, and for greater flexibility, Unicode equivalence provides codes for many elements that are not used on their own, but are meant instead to modify or combine with a preceding base character.

FactSnippet No. 1,634,459
7.

Unicode equivalence provides code points for some characters or groups of characters which are modified only for aesthetic reasons, or to add new semantics without losing the original one.

FactSnippet No. 1,634,460
8.

Unicode equivalence provides two normal forms that are semantically meaningful for each of the two compatibility criteria: the composed forms NFC and NFKC, and the decomposed forms NFD and NFKD.

FactSnippet No. 1,634,461
9.

For defective Unicode equivalence strings starting with a Hangul vowel or trailing conjoining jamo, concatenation can break Composition.

FactSnippet No. 1,634,462
10.

Unicode equivalence assigns each character a combining class, which is identified by a numerical value.

FactSnippet No. 1,634,463
11.

In one specific instance, OS X normalized Unicode equivalence filenames sent from the Samba file- and printer-sharing software.

FactSnippet No. 1,634,464