Submitted by : simon at: 2003-10-26T21:31:51+00:00 (14 years ago)

Formatted Text

When a special character such as ä, ü or ö is the first character of the text, you cannot use structured formatting like **My Bold Text**.

So it is not possible, for instance in German, to make the word "Ändern" bold just by adding two asterisks at the beginning and end of the word (or sentence).

This only results in: **Ändern**

This also doesn't work with a single asterisk or an underscore, which are used for italic and underline.
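A minimal sketch of how this kind of failure can arise, assuming the emphasis rule only accepts ASCII letters between the delimiters. Both patterns and the `render_strong` helper are hypothetical illustrations written for Python 3, not the actual StructuredText expressions:

```python
import re

# Hypothetical patterns for illustration; the real StructuredText rules may differ.
ASCII_STRONG = re.compile(r"\*\*([A-Za-z0-9 ]+)\*\*")
# For str patterns in Python 3, \w is Unicode-aware by default.
UNICODE_STRONG = re.compile(r"\*\*([\w ]+)\*\*")


def render_strong(text, pattern):
    """Replace **text** with <strong>text</strong> where the pattern matches."""
    return pattern.sub(r"<strong>\1</strong>", text)


word = "**\u00c4ndern**"  # "**Ändern**" with a precomposed A-umlaut

print(render_strong(word, ASCII_STRONG))    # markup is left untouched
print(render_strong(word, UNICODE_STRONG))  # markup is converted
```

With the ASCII-only class the umlaut stops the match, so the asterisks are shown literally, which matches the symptom described above.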

so I hope now it's readable ...

-- ArnoPucher


comments:

property change --simon, Sat, 27 Sep 2003 21:37:17 -0700 reply
Category: general => formatting and rendering

Another problem with non-ASCII characters -- Wed, 17 Mar 2004 17:58:32 -0800 reply
When I put an accented character in the text of a link, the link is not recognized. Example:

Voyez "ma simplicité volontaire":http://simple.qc.ca/
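The same effect can be reproduced for links with a small sketch. It assumes a simplified `"text":url` rule; `ASCII_LINK`, `UNICODE_LINK` and `find_link` are hypothetical names for illustration (Python 3), not the expressions ZWiki or StructuredText actually use:

```python
import re

# Hypothetical link patterns; the real expressions may differ.
ASCII_LINK = re.compile(r'"([A-Za-z0-9 ]+)":(https?://\S+)')
UNICODE_LINK = re.compile(r'"([\w ]+)":(https?://\S+)')


def find_link(text, pattern):
    """Return (link text, URL) for the first link found, or None."""
    match = pattern.search(text)
    return match.groups() if match else None


sample = 'Voyez "ma simplicit\u00e9 volontaire":http://simple.qc.ca/'

print(find_link(sample, ASCII_LINK))    # None: the accented letter breaks the match
print(find_link(sample, UNICODE_LINK))  # link text and URL are both found
```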

Workaround -- Fri, 26 Mar 2004 00:50:57 -0800 reply
I had the same problem on my site and spent some time hunting this down. Somehow the StructuredText formatting does not work with UTF-8, so my workaround is to switch back from UTF-8 to ISO-Latin-1.

The following worked for me with Python 2.3.3, Zope 2.7.0 and ZWiki 0.28 on a Linux machine:

These changes can also be applied to a Windows installation, although the locale setting is somewhat more difficult. Under Windows, Python rejects the locale de_DE and is really picky about acceptable character encodings. The following worked for me with Python 2.1.3, Zope 2.6.2 and ZWiki 0.28 on a Windows machine:

Maybe someone will pick up this workaround and implement a real solution for this problem--she or he may find this useful:

I first thought that, by changing the character encoding for the page contents, I also had to change the regular expressions in Regexps.py. The rules for uppercase (U) and lowercase (L) letters are UTF-8-aware, and I replaced them with the old single-byte regular expressions. This fixed the rendering of international characters but broke the markup of wiki names. I traced this back to the method ZWikiPage.markLinksIn, where anywikilinkexpr matched only URLs and nothing else.

I am a novice with Python's regular expression matcher, but a quick search showed that the matcher is locale-aware and has special support for UTF-8. I think this may be the key to fixing the text formatting for UTF-8 encodings, too.
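As a rough illustration of why the encoding matters to the matcher, here is a sketch in Python 3 (the thread itself concerns Python 2.x, where the `re.LOCALE` and `re.UNICODE` flags play a similar role). The `first_word` helper is hypothetical, and whether this is the actual cause inside ZWiki is not confirmed here:

```python
import re

WORD_BYTES = re.compile(rb"\w+")  # bytes pattern: \w is ASCII-only unless re.LOCALE is set
WORD_TEXT = re.compile(r"\w+")    # str pattern: \w is Unicode-aware by default


def first_word(data):
    """Return the first run of word characters in bytes or str input."""
    pattern = WORD_BYTES if isinstance(data, bytes) else WORD_TEXT
    match = pattern.search(data)
    return match.group(0) if match else None


raw = "\u00c4ndern".encode("utf-8")  # the umlaut becomes the two bytes 0xC3 0x84

print(first_word(raw))                  # the two non-ASCII bytes are skipped
print(first_word(raw.decode("utf-8")))  # the whole word matches after decoding
```

Matching on undecoded UTF-8 bytes treats each byte of the umlaut as a non-word character, while matching on decoded text sees one letter.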

--Andreas

property change --simon, Mon, 25 Oct 2004 21:28:44 -0700 reply
Name: #463 Text formatting with special chars like @ or german "Umlauts" => #463 STX formatting with special chars like @ or german "Umlauts" Category: user-pagetypes => general-rendering

property change --Bob McElrath?, Tue, 26 Oct 2004 05:15:06 -0700 reply
Fixed in newstx: http://mcelrath.org:9675/newstx, see #9 and #10.

property change --Bob McElrath?, Tue, 26 Oct 2004 05:22:52 -0700 reply
Bob McElrath? [bob+zwiki@mcelrath.org]? wrote:

Fixed in newstx: http://mcelrath.org:9675/newstx, see #9 and #10.

Where #9 and #10 refer to the list on the page mentioned. Hmm I don't like the new #nnn issue numbers. But then, I don't use the tracker.

Who wants @ in emphasize/strong/underline --Bob McElrath?, Tue, 26 Oct 2004 05:38:51 -0700 reply
In fixing this I took the approach of disallowing punctuation between delimiters.

The other option is to enumerate all allowed things between delimiters. I think this is nearly impossible considering all the languages and character sets in UTF-8.

So, emphasize, strong, and underline should generally be used to mark up a few words in a sentence, and should not span sentences. Currently the set of characters not allowed in these kinds of markup is:

  !"#$%&\'\(\)\+-./<=>\?@\[\\\]\^`{|}~

I am open to removing some characters from the excluded set, if people have arguments about some.
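The exclusion approach can be sketched as follows, assuming the set above is read as a literal list of characters. The `EXCLUDED` string, the `STRONG` pattern and `render_strong` are hypothetical stand-ins written for Python 3, not the newstx implementation:

```python
import re

# The excluded characters from the list above, taken literally (an assumption).
EXCLUDED = '!"#$%&\'()+-./<=>?@[\\]^`{|}~'

# Anything except the delimiter and the excluded punctuation may appear
# between the two pairs of asterisks, so no alphabet has to be enumerated.
STRONG = re.compile(r"\*\*([^*" + re.escape(EXCLUDED) + r"]+)\*\*")


def render_strong(text):
    """Replace **text** with <strong>text</strong> where the rule allows it."""
    return STRONG.sub(r"<strong>\1</strong>", text)


print(render_strong("**\u00c4ndern**"))           # umlaut is allowed
print(render_strong("**My Bold Text**"))          # spaces are allowed
print(render_strong("**user@example.org**"))      # @ and . block the markup
```

Because the rule is a blacklist, letters from any script pass through without the pattern having to list them.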

Who wants @ in emphasize/strong/underline -- Tue, 26 Oct 2004 06:43:32 -0700 reply
Speculation: I'm concerned about the characters used to highlight numbers. $.=- , and remotely #() Test: $5.50 ($5.50) $5.50/#
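To see which characters in the test string above would collide with the excluded set, a small check can help. It assumes the excluded set is the literal character list posted earlier in this thread; `EXCLUDED` and `blocked_chars` are hypothetical names (Python 3):

```python
# The excluded characters as posted earlier, taken literally (an assumption).
EXCLUDED = set('!"#$%&\'()+-./<=>?@[\\]^`{|}~')


def blocked_chars(text):
    """Return the excluded characters that occur in text, sorted by code point."""
    return sorted(set(text) & EXCLUDED)


print(blocked_chars("$5.50 ($5.50) $5.50/#"))
# ['#', '$', '(', ')', '.', '/']
```

Any of these characters appearing between the delimiters would prevent the markup from being applied under the exclusion rule.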

ändern -- Mon, 13 Dec 2004 00:22:17 -0800 reply

ändern -- Mon, 13 Dec 2004 00:23:02 -0800 reply

property change -- Sun, 30 Jan 2005 08:47:21 -0800 reply
Category: user-pagetypes => user-editing-stx

... --betabug, Wed, 21 Feb 2007 08:13:06 +0000 reply
Name: #463 treva => #463 STX formatting with special chars like @ or german "Umlauts" Category: admin-installing => user-editing-stx Severity: minor => normal