Re: "Difficult Characters" draft

"Martin J. Duerst" <mduerst@ifi.unizh.ch> Sun, 04 May 1997 15:59 UTC

Received: from cnri by ietf.org id aa22653; 4 May 97 11:59 EDT
Received: from services.Bunyip.Com by CNRI.Reston.VA.US id aa11088; 4 May 97 11:59 EDT
Received: (from daemon@localhost) by services.bunyip.com (8.8.5/8.8.5) id LAA10448 for uri-out; Sun, 4 May 1997 11:36:31 -0400 (EDT)
Received: from mocha.bunyip.com (mocha.Bunyip.Com [192.197.208.1]) by services.bunyip.com (8.8.5/8.8.5) with ESMTP id LAA10436 for <uri@services.bunyip.com>; Sun, 4 May 1997 11:36:27 -0400 (EDT)
Received: from josef.ifi.unizh.ch (josef.ifi.unizh.ch [130.60.48.10]) by mocha.bunyip.com (8.8.5/8.8.5) with SMTP id LAA07545 for <uri@bunyip.com>; Sun, 4 May 1997 11:36:24 -0400 (EDT)
Received: from enoshima.ifi.unizh.ch by josef.ifi.unizh.ch with SMTP (PP) id <05503-0@josef.ifi.unizh.ch>; Sun, 4 May 1997 17:36:21 +0200
Date: Sun, 04 May 1997 17:36:18 +0200
From: "Martin J. Duerst" <mduerst@ifi.unizh.ch>
To: Larry Masinter <masinter@parc.xerox.com>
cc: URI mailing list <uri@bunyip.com>
Subject: Re: "Difficult Characters" draft
In-Reply-To: <336A3609.668A@parc.xerox.com>
Message-ID: <Pine.SUN.3.96.970504172656.245p-100000@enoshima>
MIME-Version: 1.0
Content-Type: TEXT/PLAIN; charset="US-ASCII"
Sender: owner-uri@bunyip.com
Precedence: bulk

On Fri, 2 May 1997, Larry Masinter wrote:

> > Other issues:
> > The bidi issues for RLT languages in conjunction with
> > normal punctuation used in and around identifiers. (Will
> > the identifiers present themselves 'correctly' without
> > these characters in all cases?)
> 
> When you type Hebrew and enter:
> 
>    "http://host.il/WERBEH/DROW"
> 
> (HEBREW WORD), might some typing software add direction markers
> and some other typing software leave it out?

When the above sequence is typed (in logical/phonetical order
as HEBREW/WORD) in a *plain text context* and with a display
engine using the Unicode BIDI algorithm, it will not appear
as desired, i.e. as WERBEH/DROW. The same applies to a lot of
other kinds of formal syntax, in particular also to HTML/SGML
syntax.

Sadly enough, there doesn't seem anything much that can be done
to change this.

The aim of the upcomming draft for BIDI for URLs (or identifiers)
is to define their display (and input) behaviour in places where
these identifiers are handled as such, i.e. the input field at
the top of a browser page, the file list display in finder-like
places, and URL input/display in structured HTML editors,...

Regards,	Martin.