Quote |
Your use of "lingua franca" illustrates the issue |
Quote |
Just from my experience I don't think adding a few extra characters will work. |
Quote |
[newby:] The dravidian languages (Bengali, Hindi, et cetera) present the greatest difficulty due to a great number of diacritical and other marks. |
Quote (stupid_idiot @ Mar. 13 2008,13:06) | ||
Hi newby: I don't think the slight added convenience of tone marks (Chinese pinyin) and diacritics (Indic languages) justifies the major trouble of a new charset (research, testing, everyday use). If 2 people want to communicate using phonetic representation, I think there is a much simpler solution: In the case of Chinese, the user can just use numbers (as in "1, 2, 3, 4") to represent tone: http://en.wikipedia.org/wiki/Pinyin#Numbers_in_place_of_tone_marks This way, 2 people can communicate in Chinese using pinyin, without the aid of a Chinese font. This is easier compared to using tone marks. When using tone marks, the user must know where (which vowel) to place the tone mark. This is not required when using numbers: e.g. Wo3 shi4 zhong1 wen2 xue2 sheng1. Similarly, 2 people can communicate in Japanese using Romaji, without the aid of a Japanese font. Indic languages might be harder to transliterate/romanize:
If I understand correctly, your goal is: To input [any-language] characters using phonetic representation -- i.e. actual input, rather than transliteration/romanization. If so, I think the existing combination of pinyin input methods (e.g. SCIM, and others) + Unicode/locale-specific font(s) is the only solution. (Your proposed new charset/font can be used to input other languages, but it cannot display them.) For romanization: I think we should use ascii-only methods, rather than create a new charset. In the case of Indic languages (Bengali, Hindu, Tamil, etc), I don't know those languages at all; but in my uninformed opinion, I think the proposed new charset will be of very little use. e.g. If 2 people want to communicate in Tamil: -- If no Tamil font is installed, then they would use the most compatible way of transliterating Tamil (ascii??). If so, they would not want to use an obscure charset...? -- If a Tamil font is installed, then the proposed new font would be redundant? |