This page is about supporting international characters in the text of zwiki pages. Part of the i18n project.

The goal is to safely store and accurately render the characters used by any language, using the proper character sets etc. As far as we know, Zwiki does this now, as long as you use a modern browser and have the necessary fonts installed. But we may be confused.

See also InternationalCharacterExamples, InternationalCharactersInPageNames.


Discussion

However, I think it would be good if structure text would handle DBCs properly. I have played with phpwiki and ZWiki. Sadly both doesn't handle DBCs properly as the certainly combinations of Chinese words come out as garbage. --FrankieChu

Simon Michael <simon@joyful.com>, 2002/02/14 23:28 GMT (via mail):
Alastair Burt <burt@dfki.de> writes:
> Hi,
>
> why does ZWiki make an attempt to change intl_char_entities in the raw
> page string into the &xxx; form? This to me seems to be bit contrary to
> Wikiness. The characters may have to turned into &xxx; to be rendered in
> a foreign browser, but the user presumably wants to edit them in the
> "native" form. In any event, only a subset of the latin-1 character set
> is converted.

Hi Alastair - it was a workaround for a problem we were having where these characters would get stripped out during subsequent edits by certain browsers. This is the highest of the i18n issues in my work queue; if someone else gets it done first I will be glad.

I think we should drop the workaround; if it turns out that the dumb browsers are no longer a problem then we are done; if not we add some smarts to _set_text (?) so that it can recognize when someone's browser has stripped out these characters and pops up a warning.

Forwarding this to the wiki.

Simon Michael <simon@joyful.com>, 2002/02/22 06:19 GMT (via mail):
Disabled the old safety-escaping of international characters.. these and others should now be stored as-is..

=C0 =C2 =C9 =CA =CE =D4 =DB =E0 =E2 =E7 =E8 =E9 =EA =EE =F4 =F9 =FB

SimonMichael, 2002/02/22 06:23 GMT (via web):
Except mailin interestingly does the above. Try a direct emacs edit.

À Â É Ê Î Ô Û à â ç è é ê î ô ù û

SimonMichael, 2002/02/22 06:28 GMT (via web):
And these seem to be preserved during edits by current browsers.. I'll do nothing further unless I see additional bug reports.

ÔakTree

Simon, 2002/09/13 18:49 GMT (via web):
Tip: when using emacs to edit a page containing international characters, save with the proper MULE coding system (C-x RET f). For this page it would be iso-latin-1.

SimonMichael, 2002/10/24 05:39 GMT (via web):
See also [中文]?


comments:

How to enable UTF-8 --SimonMichael, Tue, 06 Jan 2004 02:55:51 -0800 reply
Novica Nakov writes:

I'm running Zope 2.6.2 and ZWiki 0.25.0. I've set up the ZMI and wikipage charset to UTF-8. (The system locale is also UTF-8.)
Remind me - how do you set up these things ?

For the ZMI
a property in the root folder. name: management_page_charset, value: utf-8, type: string. For the wikipage content="text/html;charset=utf-8" in the head of the document.

cf --simon, Fri, 09 Jan 2004 03:05:06 -0800 reply
#681 polish characters in page content converted to html entities ?

Enable UTF-8 Support on ReST in Zope 2.7 --Roy Chan, Wed, 10 Mar 2004 05:26:07 -0800 reply
If you are using Zope 2.7 and want to type UTF-8 in ReST of ZWiki, Edit the zope.conf and add the following lines:

rest-input-encoding utf-8

rest-output-encoding utf-8

Of course, remember to restart you zope

а по русски??? -- Sun, 19 Sep 2004 03:36:32 -0700 reply
*Проба*

አማረኛ? -- Tue, 14 Dec 2004 16:01:22 -0800 reply
ይሰራል!

а по русски??? -- Mon, 08 Aug 2005 03:54:01 -0700 reply
I've tried this, too. Formatting rules don't work for non-ascii characters at all. :(

русский --paha, Wed, 09 Apr 2008 03:32:33 -0700 reply
*починилось?*

русский --paha, Wed, 09 Apr 2008 03:33:55 -0700 reply
russian didn't work

русский --betabug, Wed, 09 Apr 2008 04:20:08 -0700 reply
it doesn't work because this page is in "structured text" format, it's a known problem. With RST ("restructured text") everything works.