Save examples of various kinds of international character here. Be careful when editing this page - older browsers may eat some of these valuable characters.
This is becoming a good resource. If you are able, please add any details that would help the international developers support these better, such as:
- the name of the scripts/alphabets/character sets/encodings used
- others that we should know about
- quick note on their importance/use
- names of necessary fonts
- translations, for interest
- small reference screenshots of how these should look with proper browser support and fonts installed
..and/or useful links to any of these.
See also InternationalCharactersInPageContent, InternationalCharactersInPageNames.
These have all been converted to UTF-8 encoding (I think). I've removed a lot of old characters that were doubtful or unknown.
Examples
European Latin-1
- A-ZÀÁÂÃÄÅÆÈÉÊËÌÍÎÏÒÓÔÕÖØÙÚÛÜÝÇÐÑÞ 0-9a-zàáâãäåæèéêëìíîïòóôõöøùúûüýÿµßçðñþ
TschörmänPeydschNeyms
Chinese
- 中文中文 <- able to display Chinese characters
中文中文 <- Chinese bracketed wikilink works fine (using a new browser), though the URL may only consists out of strange Latin characters
[newlink]?中文中文中文中文 <- English wikiname and Chinese text works flawlessly (as expected)
Korean
- 한글
Latvian
- çûîâðìíïþèò, ÞurkasÂda
Polish
- Na przykład "Łabędzie śpią u źródełka" * Adding all Polish non-ASCII characters: ą ć ę ł ń ó ś ż ź Ą Ć Ę Ł Ń Ó Ś Ż Ź
Hungarian
- basically lots of accented letters are used (éáűőúöüóí), they almost always change the meaning, and therefore it's hard to read and understand a text without accents.
- there are only two "nasty" characters: ő Ő and ű Ű which aren't in the Latin-1 encoding, but these are responsible for many troubles I came across using Zope, Plone, Plone Archetypes and finally Zwiki.
- many times (for instance if a latin-2 charset is not present, only latin-1) they are substituted with õ Õ and û Û from Latin-1 character set, but this is incorrect, and cannot be reached from hungarian keyboard layout.
- In the Hungarian alphabet, the accented variants are situated right after the plain ones, unlike in Finnish alphabet, for instance, where äåö are at the end:
a á b c cs d dz dzs e é f g gy h i í j k l ly m n ny o ó ö ő p q r s sz t ty u ú ü ű v x y z zs
-So when a list is ordered using english alphabet, the accented letters are at the end, wich makes a bit harder for us to find something in the list.
Japanese
- 作为面向数码家电的操作系统(OS),Linux最近正迅速成为人们关注的热点。不仅新力和松下电器产业已就联合开发面向数码家电、基于Linux的OS达成了协议,而且在手机领域NEC也准备在3G手机中采用Linux。
... -- Fri, 18 Jun 2004 07:49:30 -0700 reply
Вот ведь ,а русский ?
Cyrillic -- Fri, 27 Aug 2004 02:05:20 -0700 reply
Cyrillic
Plain: абвгд
Italic: *абвгд*
Bold: **абвгд**
STX-style links: "абвгд":someurl
test -- Thu, 02 Sep 2004 00:40:17 -0700 reply
한글링크
संस्क्रितं -- Tue, 21 Dec 2004 04:26:12 -0800 reply
संस्क्रितं
संस्कृतम् -- Tue, 21 Dec 2004 04:28:23 -0800 reply
संस्कृतम्
umlaute -- Tue, 03 May 2005 14:37:03 -0700 reply
äöüß ÄÖÜ
Greek - Ελληνικά --betabug, Mon, 12 Feb 2007 22:44:44 -0800 reply
As expected, Greek should work too, we are set to UTF-8 after all... τα Ελληνικά δουλεύουνε μια χαρά!