Text Normalization: Unicode Forms and Composed Chars