Re: revised "generic syntax" internet draft

"Martin J. Duerst" <mduerst@ifi.unizh.ch> Mon, 21 April 1997 13:36 UTC

Received: from cnri by ietf.org id aa04277; 21 Apr 97 9:36 EDT
Received: from services.Bunyip.Com by CNRI.Reston.VA.US id aa07700; 21 Apr 97 9:36 EDT
Received: (from daemon@localhost) by services.bunyip.com (8.8.5/8.8.5) id IAA24495 for uri-out; Mon, 21 Apr 1997 08:54:49 -0400 (EDT)
Received: from mocha.bunyip.com (mocha.Bunyip.Com [192.197.208.1]) by services.bunyip.com (8.8.5/8.8.5) with SMTP id IAA24490 for <uri@services.bunyip.com>; Mon, 21 Apr 1997 08:54:47 -0400 (EDT)
Received: from josef.ifi.unizh.ch by mocha.bunyip.com with SMTP (5.65a/IDA-1.4.2b/CC-Guru-2b) id AA20979 (mail destined for uri@services.bunyip.com); Mon, 21 Apr 97 08:54:45 -0400
Received: from enoshima.ifi.unizh.ch by josef.ifi.unizh.ch with SMTP (PP) id <24664-0@josef.ifi.unizh.ch>; Mon, 21 Apr 1997 14:53:43 +0200
Date: Mon, 21 Apr 1997 14:53:41 +0200
From: "Martin J. Duerst" <mduerst@ifi.unizh.ch>
To: Chris Newman <Chris.Newman@innosoft.com>
Cc: John C Klensin <klensin@mci.net>, IETF URI list <uri@bunyip.com>
Subject: Re: revised "generic syntax" internet draft
In-Reply-To: <Pine.SOL.3.95.970415130735.22015K-100000@eleanor.innosoft.com>
Message-Id: <Pine.SUN.3.96.970421145201.245I-100000@enoshima>
Mime-Version: 1.0
Content-Type: TEXT/PLAIN; charset="US-ASCII"
Sender: owner-uri@bunyip.com
Precedence: bulk

On Tue, 15 Apr 1997, Chris Newman wrote:

> On Tue, 15 Apr 1997, John C Klensin wrote:

[About length problems with UTF-8.]

> UTF-8 requires 2 octets to encode characters from the 8859-1 set which
> normally take 1 octet.  UTF-8 requires 3 octets to encode ideographic
> characters from UCS-2 which normally require 2 octets.  So
> western Europeans take a worse storage hit from UTF-8 than ideographic
> languages do.

This is not exactly true. Western European languages contain many
characters from ASCII, and only occasionally a character that needs
two bytes in UTF-8. But anyway, I think we agree that the size
of UTF-8 is not really an issue.

Regards,	Martin.