Re: RFC 2047 and gatewaying

Bruce Lilly <blilly@erols.com> Thu, 09 January 2003 17:29 UTC

Message-ID: <3E1DB169.6080203@alex.blilly.com>
Date: Thu, 09 Jan 2003 12:29:13 -0500
From: Bruce Lilly <blilly@erols.com>
Reply-To: ietf-822@imc.org
Organization: Bruce Lilly
User-Agent: Mozilla/5.0 (X11; U; Linux i686; en-US; rv:1.2.1) Gecko/20021130
MIME-Version: 1.0
To: Charles Lindsey <chl@clw.cs.man.ac.uk>
CC: ietf-822@imc.org
Subject: Re: RFC 2047 and gatewaying
References: <ylk7hnwi2d.fsf@windlord.stanford.edu> <8d9he9cmw-B@khms.westfalen.de> <20030104033518.GA16177@ussenterprise.ufp.org> <yln0mho6dl.fsf@windlord.stanford.edu> <3E1731A6.5030604@alex.blilly.com> <H8As9G.5C2@clw.cs.man.ac.uk>
In-Reply-To: <H8As9G.5C2@clw.cs.man.ac.uk>
Content-Type: text/plain; charset="us-ascii"; format="flowed"
Content-Transfer-Encoding: 7bit
Sender: owner-ietf-822@mail.imc.org
Precedence: bulk

Charles Lindsey wrote:
> In <3E1731A6.5030604@alex.blilly.com> Bruce Lilly <blilly@erols.com> writes:
> 
> 
>>It's worse than that; there are at least 3 different versions of UTF-8.
>>They differ in the longer multi-byte sequences.
> 
> 
> No, there is precisely one, as defined by the relevant Unicode documents.
> See RFC 2044 and draft-yergeau-rfc2279bis-02.txt, with all of which Usefor
> is fully compatible.

Charles, in spite of having been shown the differences in the past, you persist
in claiming that there are none.  RFC 2044 is not a Unicode document, and has
long been obsoleted.  One clue that you are wrong is the following quotation
from Unicode Technical report 28:

"Most notable among the corrigenda to the Standard is a further tightening of the definition of UTF-8, to eliminate irregular UTF-8 and to bring the Unicode specification of UTF-8 more completely into line with other specifications of UTF-8. "

Obviously if the Unicode consortium states unequivocally that there are
multiple utf-8 specifications which differ, there cannot be "precicely
one" utf-8 specification.

Here, once again:

Unicode 2.0, table A-3 (applies through Unicode 3.0):

  Unicode Value     1st Byte  2nd Byte  3rd Byte  4th Byte
000000000xxxxxxx   0xxxxxxx
00000yyyyyxxxxxx   110yyyyy  10xxxxxx
zzzzyyyyyyxxxxxx   1110zzzz  10yyyyyy  10xxxxxx
110110wwwwzzzzyy +
110111yyyyxxxxxx   11110uuu  10uuzzzz  10yyyyyy  10xxxxxx

RFC 2044 is obsolete; here's the table from RFC 2279:

    UCS-4 range (hex.)           UTF-8 octet sequence (binary)
    0000 0000-0000 007F   0xxxxxxx
    0000 0080-0000 07FF   110xxxxx 10xxxxxx
    0000 0800-0000 FFFF   1110xxxx 10xxxxxx 10xxxxxx

    0001 0000-001F FFFF   11110xxx 10xxxxxx 10xxxxxx 10xxxxxx
    0020 0000-03FF FFFF   111110xx 10xxxxxx 10xxxxxx 10xxxxxx 10xxxxxx
    0400 0000-7FFF FFFF   1111110x 10xxxxxx ... 10xxxxxx

Unicode 3.2, Unicode Technical Report #28:

  Code Points   1st Byte   2nd Byte   3rd Byte   4th Byte
U+0000..U+007F 00..7F
U+0080..U+07FF C2..DF 80..BF
U+0800..U+0FFF E0 A0..BF 80..BF
U+1000..U+CFFF E1..EC 80..BF 80..BF
U+D000..U+D7FF ED 80..9F 80..BF
U+D800..U+DFFF ill-formed
U+E000..U+FFFF EE..EF 80..BF 80..BF
U+10000..U+3FFFF F0 90..BF 80..BF 80..BF
U+40000..U+FFFFF F1..F3 80..BF 80..BF 80..BF
U+100000..U+10FFFF F4 80..8F 80..BF  80..BF

Clearly RFC 2279 provides for 5- and 6-byte utf-8 sequences, which are
not provided for by Unicode through 3.2.  And some 4-byte sequences
differ in different Unicode versions (particulary those corresponding
to surrogate pairs).  Whether or not Unicode 4.0 and/or the draft
mentioned above will introduce additional variants is another matter.

Re: RFC 2047 and gatewaying Charles Lindsey
Re: RFC 2047 and gatewaying Lawrence Greenfield
Re: RFC 2047 and gatewaying Charles Lindsey
Re: RFC 2047 and gatewaying Keith Moore
Re: RFC 2047 and gatewaying Lawrence Greenfield
Re: RFC 2047 and gatewaying Charles Lindsey
Re: RFC 2047 and gatewaying Charles Lindsey
Re: RFC 2047 and gatewaying Arnt Gulbrandsen
Re: RFC 2047 and gatewaying Russ Allbery
Re: RFC 2047 and gatewaying Arnt Gulbrandsen
Re: RFC 2047 and gatewaying Russ Allbery
Re: RFC 2047 and gatewaying Arnt Gulbrandsen
Re: RFC 2047 and gatewaying Kai Henningsen
Re: RFC 2047 and gatewaying Russ Allbery
Re: RFC 2047 and gatewaying Lawrence Greenfield
Re: RFC 2047 and gatewaying Russ Allbery
Re: RFC 2047 and gatewaying Russ Allbery
Re: RFC 2047 and gatewaying Bruce Lilly
Re: RFC 2047 and gatewaying Bruce Lilly
Re: RFC 2047 and gatewaying Russ Allbery
Re: RFC 2047 and gatewaying Charles Lindsey
Re: RFC 2047 and gatewaying Bruce Lilly
RFC 2047 and gatewaying Charles Lindsey
Re: For shame (Was: Re: RFC 2047 and gatewaying) Shmuel (Seymour J.) Metz
Re: RFC 2047 and gatewaying Charles Lindsey
Re: prevervation of installed base Charles Lindsey
Re: RFC 2047 and gatewaying Bruce Lilly
Re: For shame (Was: Re: RFC 2047 and gatewaying) D. J. Bernstein
Re: RFC 2047 and gatewaying Charles Lindsey
Re: For shame (Was: Re: RFC 2047 and gatewaying) Andrew Gierth
Re: RFC 2047 and gatewaying Bruce Lilly
For shame (Was: Re: RFC 2047 and gatewaying) Bruce Lilly
Re: RFC 2047 and gatewaying Bruce Lilly
Re: RFC 2047 and gatewaying Bruce Lilly
Re: RFC 2047 and gatewaying Henry Spencer
Re: RFC 2047 and gatewaying Bruce Lilly
Re: RFC 2047 and gatewaying Charles Lindsey
Re: RFC 2047 and gatewaying Charles Lindsey
Re: RFC 2047 and gatewaying Charles Lindsey
Re: RFC 2047 and gatewaying Bruce Lilly
Re: RFC 2047 and gatewaying Henry Spencer
Re: RFC 2047 and gatewaying Bruce Lilly
Re: RFC 2047 and gatewaying Bruce Lilly
Re: prevervation of installed base Bruce Lilly
Re: RFC 2047 and gatewaying Bruce Lilly
Re: RFC 2047 and gatewaying Charles Lindsey
Re: RFC 2047 and gatewaying Keith Moore
Re: RFC 2047 and gatewaying Charles Lindsey
Re: RFC 2047 and gatewaying Charles Lindsey
Re: UTF-8 versions (was: Re: RFC 2047 and gateway… Charles Lindsey
Re: prevervation of installed base Charles Lindsey
Re: RFC 2047 and gatewaying Charles Lindsey
Re: RFC 2047 and gatewaying D. J. Bernstein
Re: RFC 2047 and gatewaying Dave Crocker
Re: RFC 2047 and gatewaying D. J. Bernstein
Re: RFC 2047 and gatewaying Keith Moore
Re: RFC 2047 and gatewaying D. J. Bernstein
Re: RFC 2047 and gatewaying Keith Moore
Re: RFC 2047 and gatewaying D. J. Bernstein
Re: RFC 2047 and gatewaying Bruce Lilly
Re: RFC 2047 and gatewaying Dave Crocker
Re: RFC 2047 and gatewaying Charles Lindsey
Re: RFC 2047 and gatewaying Dave Crocker
Re: RFC 2047 and gatewaying Dave Crocker
Re: RFC 2047 and gatewaying Bruce Lilly
Re: RFC 2047 and gatewaying Bruce Lilly
Re: RFC 2047 and gatewaying Keith Moore
Re: RFC 2047 and gatewaying Charles Lindsey
Re: RFC 2047 and gatewaying Charles Lindsey
Re: RFC 2047 and gatewaying Keith Moore
Re: prevervation of installed base Bruce Lilly
Re: RFC 2047 and gatewaying Bruce Lilly
Re: RFC 2047 and gatewaying Bruce Lilly
Re: RFC 2047 and gatewaying Bruce Lilly
Re: RFC 2047 and gatewaying Bruce Lilly
Re: RFC 2047 and gatewaying Bruce Lilly
Re: RFC 2047 and gatewaying Charles Lindsey
Re: RFC 2047 and gatewaying Charles Lindsey
Re: prevervation of installed base Charles Lindsey
Re: RFC 2047 and gatewaying Charles Lindsey
Re: prevervation of installed base Charles Lindsey
Re: RFC 2047 and gatewaying Charles Lindsey
Re: RFC 2047 and gatewaying Charles Lindsey
Re: RFC 2047 and gatewaying Bruce Lilly
Re: Unicode principles (Was Re: UTF-8 versions (w… Bruce Lilly
Re: Unicode principles (Was Re: UTF-8 versions (w… Philip Hazel
Re: Unicode principles (Was Re: UTF-8 versions (w… Andrew Gierth
Re: RFC 2047 and gatewaying Charles Lindsey
Re: RFC 2047 and gatewaying Charles Lindsey
Re: RFC 2047 and gatewaying Charles Lindsey
Re: RFC 2047 and gatewaying Charles Lindsey
Re: RFC 2047 and gatewaying Charles Lindsey
Unicode principles (Was Re: UTF-8 versions (was: … Bruce Lilly
Re: UTF-8 versions (was: Re: RFC 2047 and gateway… Arnt Gulbrandsen
Re: UTF-8 versions (was: Re: RFC 2047 and gateway… Bruce Lilly
Re: UTF-8 versions (was: Re: RFC 2047 and gateway… Bruce Lilly
Re: UTF-8 versions (was: Re: RFC 2047 and gateway… Arnt Gulbrandsen
Re: RFC 2047 and gatewaying Bruce Lilly
Re: RFC 2047 and gatewaying Andrew Gierth
Re: RFC 2047 and gatewaying Bruce Lilly
Re: RFC 2047 and gatewaying Shmuel (Seymour J.) Metz
Re: RFC 2047 and gatewaying Claus Färber
Re: RFC 2047 and gatewaying D. J. Bernstein
Re: RFC 2047 and gatewaying Claus Färber
Re: RFC 2047 and gatewaying Claus Färber
Re: prevervation of installed base Bruce Lilly
Re: RFC 2047 and gatewaying Bruce Lilly
Re: RFC 2047 and gatewaying Bruce Lilly
Re: RFC 2047 and gatewaying Bruce Lilly
Re: RFC 2047 and gatewaying Russ Allbery
Re: prevervation of installed base Charles Lindsey
Re: RFC 2047 and gatewaying Charles Lindsey
Re: RFC 2047 and gatewaying Charles Lindsey
Re: RFC 2047 and gatewaying Charles Lindsey
Re: UTF-8 versions (was: Re: RFC 2047 and gateway… Andrew Gierth
Re: RFC 2047 and gatewaying Keith Moore
Re: old format date fields (was Re: RFC 2047 and … Keith Moore
Re: UTF-8 versions (was: Re: RFC 2047 and gateway… Bruce Lilly
Re: UTF-8 versions (was: Re: RFC 2047 and gateway… Bruce Lilly
Re: prevervation of installed base Andrew Gierth
Re: RFC 2047 and gatewaying Andrew Gierth
Re: RFC 2047 and gatewaying Bruce Lilly
Re: RFC 2047 and gatewaying Bruce Lilly
Re: RFC 2047 and gatewaying Bruce Lilly
Re: RFC 2047 and gatewaying Russ Allbery
Re: RFC 2047 and gatewaying D. J. Bernstein
Re: RFC 2047 and gatewaying D. J. Bernstein
Re: RFC 2047 and gatewaying Andrew Gierth
Re: RFC 2047 and gatewaying Bruce Lilly
Re: UTF-8 versions (was: Re: RFC 2047 and gateway… Andrew Gierth
Re: RFC 2047 and gatewaying Bruce Lilly
Re: RFC 2047 and gatewaying Dave Crocker
Re: RFC 2047 and gatewaying D. J. Bernstein
Re: RFC 2047 and gatewaying D. J. Bernstein
Re: RFC 2047 and gatewaying Bruce Lilly
Re: UTF-8 versions (was: Re: RFC 2047 and gateway… Bruce Lilly
Re: UTF-8 versions (was: Re: RFC 2047 and gateway… Bruce Lilly
Re: RFC 2047 and gatewaying Bruce Lilly
Re: RFC 2047 and gatewaying Charles Lindsey
Re: prevervation of installed base Keld Jørn Simonsen
Re: prevervation of installed base Russ Allbery
Re: UTF-8 versions (was: Re: RFC 2047 and gateway… Andrew Gierth
Re: RFC 2047 and gatewaying D. J. Bernstein
Re: UTF-8 versions (was: Re: RFC 2047 and gateway… Bruce Lilly
Re: prevervation of installed base Dave Crocker
Re: RFC 2047 and gatewaying Dave Crocker
UTF-8 versions (was: Re: RFC 2047 and gatewaying) Andrew Gierth
Re: RFC 2047 and gatewaying Leo Bicknell
Re: RFC 2047 and gatewaying D. J. Bernstein
Re: RFC 2047 and gatewaying D. J. Bernstein
Re: RFC 2047 and gatewaying Bruce Lilly
Re: old format date fields (was Re: RFC 2047 and … Bruce Lilly
Re: prevervation of installed base Bruce Lilly
Re: RFC 2047 and gatewaying Bruce Lilly
Re: RFC 2047 and gatewaying Nick Shelness
Re: RFC 2047 and gatewaying Keith Moore
Re: RFC 2047 and gatewaying Charles Lindsey
Re: prevervation of installed base Charles Lindsey
Re: RFC 2047 and gatewaying Charles Lindsey
old format date fields (was Re: RFC 2047 and gate… Keith Moore
RE: RFC 2047 and gatewaying Dan Kohn
Re: RFC 2047 and gatewaying Bruce Lilly
Re: RFC 2047 and gatewaying Bruce Lilly
Re: RFC 2047 and gatewaying Russ Allbery
RE: RFC 2047 and gatewaying Dan Kohn
Re: RFC 2047 and gatewaying Keith Moore
Re: RFC 2047 and gatewaying Leo Bicknell
Re: RFC 2047 and gatewaying Russ Allbery
Re: RFC 2047 and gatewaying Leo Bicknell
Re: RFC 2047 and gatewaying Dave Crocker
Re: RFC 2047 and gatewaying Keith Moore
Re: RFC 2047 and gatewaying Russ Allbery
Re: RFC 2047 and gatewaying Leo Bicknell
Re: RFC 2047 and gatewaying Jean-Marc Desperrier
Re: RFC 2047 and gatewaying Keith Moore
Re: Running code (Was: RFC 2047 and gatewaying) Dave Crocker
prevervation of installed base Dave Crocker
Re: RFC 2047 and gatewaying Jean-Marc Desperrier
Running code (Was: RFC 2047 and gatewaying) Pete Resnick
Re: RFC 2047 and gatewaying Keith Moore
Re: RFC 2047 and gatewaying ned+ietf-822
Re: RFC 2047 and gatewaying Pete Resnick
Re: RFC 2047 and gatewaying ned+ietf-822
Re: RFC 2047 and gatewaying Keith Moore
Re: RFC 2047 and gatewaying Charles Lindsey
Re: RFC 2047 and gatewaying Charles Lindsey
Re: RFC 2047 and gatewaying Charles Lindsey
Re: RFC 2047 and gatewaying Charles Lindsey
Re: RFC 2047 and gatewaying Charles Lindsey
Re: RFC 2047 and gatewaying Charles Lindsey
[OT] mUTF-7 in IMAP Marc Mutz
Re: RFC 2047 and gatewaying Charles Lindsey
Re: RFC 2047 and gatewaying Charles Lindsey
Re: RFC 2047 and gatewaying Charles Lindsey
RE: RFC 2047 and gatewaying Dan Kohn
Re: RFC 2047 and gatewaying Russ Allbery
Re: RFC 2047 and gatewaying D. J. Bernstein
Re: RFC 2047 and gatewaying Russ Allbery
Re: RFC 2047 and gatewaying Bruce Lilly
Re: RFC 2047 and gatewaying Bruce Lilly
Re: RFC 2047 and gatewaying Bruce Lilly
Re: RFC 2047 and gatewaying Bruce Lilly
Re: RFC 2047 and gatewaying Bruce Lilly
Re: RFC 2047 and gatewaying Bruce Lilly
Re: RFC 2047 and gatewaying Bruce Lilly
Re: RFC 2047 and gatewaying Bruce Lilly
Re: RFC 2047 and gatewaying ned+ietf-822
Re: RFC 2047 and gatewaying Kai Henningsen
Re: RFC 2047 and gatewaying Kai Henningsen
Re: RFC 2047 and gatewaying Kai Henningsen
Re: RFC 2047 and gatewaying Kai Henningsen
Re: RFC 2047 and gatewaying Kai Henningsen
Re: RFC 2047 and gatewaying Leo Bicknell
Re: RFC 2047 and gatewaying Russ Allbery
RE: RFC 2047 and gatewaying Dan Kohn
Re: RFC 2047 and gatewaying Bruce Lilly
Re: RFC 2047 and gatewaying Russ Allbery
Re: RFC 2047 and gatewaying Sam Roberts
Re: RFC 2047 and gatewaying ned+ietf-822
Re: RFC 2047 and gatewaying Leo Bicknell
Re: RFC 2047 and gatewaying Lawrence Greenfield
Re: RFC 2047 and gatewaying Charles Lindsey
Re: RFC 2047 and gatewaying Russ Allbery
Re: RFC 2047 and gatewaying Kai Henningsen
Re: RFC 2047 and gatewaying Kai Henningsen
Re: RFC 2047 and gatewaying Tony Hansen
Re: RFC 2047 and gatewaying Lawrence Greenfield
Re: RFC 2047 and gatewaying ned+ietf-822
Re: RFC 2047 and gatewaying Russ Allbery
Re: RFC 2047 and gatewaying ned+ietf-822
Re: RFC 2047 and gatewaying Keith Moore
Re: RFC 2047 and gatewaying Charles Lindsey
Re: RFC 2047 and gatewaying Russ Allbery
Re: RFC 2047 and gatewaying ned+ietf-822
Re: RFC 2047 and gatewaying Keith Moore
Re: RFC 2047 and gatewaying Lawrence Greenfield
Re: RFC 2047 and gatewaying Charles Lindsey
Re: RFC 2047 and gatewaying ned+ietf-822
Re: RFC 2047 and gatewaying Charles Lindsey
Re: RFC 2047 and gatewaying D. J. Bernstein
Re: RFC 2047 and gatewaying Bruce Lilly
Re: RFC 2047 and gatewaying Russ Allbery
Re: RFC 2047 and gatewaying Bruce Lilly
Re: RFC 2047 and gatewaying Russ Allbery
Re: RFC 2047 and gatewaying Jacob Palme