Re: [Ltru] UTF-8

"Reshat Sabiq (Reşat)" <tatar.iqtelif.i18n@gmail.com> Fri, 22 June 2007 17:12 UTC

Return-path: <ltru-bounces@ietf.org>
Received: from [127.0.0.1] (helo=stiedprmman1.va.neustar.com) by megatron.ietf.org with esmtp (Exim 4.43) id 1I1mgf-00059J-IM; Fri, 22 Jun 2007 13:12:45 -0400
Received: from ltru by megatron.ietf.org with local (Exim 4.43) id 1I1mgd-000533-SN for ltru-confirm+ok@megatron.ietf.org; Fri, 22 Jun 2007 13:12:43 -0400
Received: from [10.91.34.44] (helo=ietf-mx.ietf.org) by megatron.ietf.org with esmtp (Exim 4.43) id 1I1mgd-00052m-IV for ltru@ietf.org; Fri, 22 Jun 2007 13:12:43 -0400
Received: from ms-smtp-01.rdc-kc.rr.com ([24.94.166.115]) by ietf-mx.ietf.org with esmtp (Exim 4.43) id 1I1mgc-0001rI-7x for ltru@ietf.org; Fri, 22 Jun 2007 13:12:43 -0400
Received: from [192.168.2.2] (CPE-65-30-31-71.kc.res.rr.com [65.30.31.71]) by ms-smtp-01.rdc-kc.rr.com (8.13.6/8.13.6) with ESMTP id l5MHBQij024038; Fri, 22 Jun 2007 12:11:26 -0500 (CDT)
Message-ID: <467C031F.4070509@gmail.com>
Date: Fri, 22 Jun 2007 12:13:03 -0500
From: "\"Reshat Sabiq (Reşat)\"" <tatar.iqtelif.i18n@gmail.com>
User-Agent: Thunderbird 2.0.0.4 (Windows/20070604)
MIME-Version: 1.0
To: John Cowan <cowan@ccil.org>
Subject: Re: [Ltru] UTF-8
References: <467ABA07.5010306@gmail.com> <20070621180158.GC9078@mercury.ccil.org> <41a006820706211153r45ef3094p169901d87cb910d4@mail.gmail.com> <20070621185738.GE9078@mercury.ccil.org>
In-Reply-To: <20070621185738.GE9078@mercury.ccil.org>
X-Enigmail-Version: 0.95.1
OpenPGP: id=262839AF; url=http://keyserver.veridis.com:11371
Content-Type: text/plain; charset="UTF-8"
X-Virus-Scanned: Symantec AntiVirus Scan Engine
Content-Transfer-Encoding: quoted-printable
X-MIME-Autoconverted: from 8bit to quoted-printable by ms-smtp-01.rdc-kc.rr.com id l5MHBQij024038
X-Spam-Score: 0.5 (/)
X-Scan-Signature: 244a2fd369eaf00ce6820a760a3de2e8
Cc: ltru@ietf.org, GerardM <gerard.meijssen@gmail.com>
X-BeenThere: ltru@ietf.org
X-Mailman-Version: 2.1.5
Precedence: list
List-Id: Language Tag Registry Update working group discussion list <ltru.ietf.org>
List-Unsubscribe: <https://www1.ietf.org/mailman/listinfo/ltru>, <mailto:ltru-request@ietf.org?subject=unsubscribe>
List-Archive: <http://www1.ietf.org/pipermail/ltru>
List-Post: <mailto:ltru@ietf.org>
List-Help: <mailto:ltru-request@ietf.org?subject=help>
List-Subscribe: <https://www1.ietf.org/mailman/listinfo/ltru>, <mailto:ltru-request@ietf.org?subject=subscribe>
Errors-To: ltru-bounces@ietf.org

-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

John Cowan yazmış:
> GerardM scripsit:
> 
>> It happens often that exactly because of the content is NOT UTF-8 that
>> I get the information mangled. With UTF-8 you either get the message
>> or it will be indicated that you have insufficient fonts installed. The
>> notion that the current encoding always works is wrong.
> 
> It always works for ASCII text, and the Registry is restricted to
> ASCII text.
Well, i think for me the interest is whether the next release could
mandate UTF-8, so that we don't have to deal w/ escapes.
I guess there are 2 issues:
1. Email clients
2. Browser clients

On 1:
If you can see s w/ cedilla on the first line above, then your email
client handled UTF-8; if not maybe you could try choosing UTF-8 encoding
manually. I know that Thunderbird and Outlook don't have a problem w/
UTF-8. What other email clients are we talking about? If your pine
client thru your shell can't handle it, then really you will only have
few characters mangled, because charactes overlapping w/ ASCII are
rendered the same whether it's UTF-8 or ISO-8859-1. Then you still have
an option of using another email client to UTF-8 appropriately when you
really care.

On 2:
I don't think any up-to-date browsers have any problem w/ UTF-8 (IE,
firefox, seamonkey, opera). The only ones that would have problem could
be cell-phone based ones that don't belong to one of the brands above
and some shell-based ones, which probably depends on what shell is being
used. And then again i'd say it's not a big deal, because you can use
another client to read stuff if you care. Besides the pros of not having
to deal w/ escapes outweigh the cons of not having UTF-8 support on some
cell-phone devices and some shells.

So what exactly email and browser clients are we talking about that
don't support UTF-8? If it's only a matter of email, we could say: "when
using email, you can send requests using escapes, but these escapes will
be posted in UTF-8 in the registry"? If we adopt UTF-8 for new
submissions, we could also say "starting on <someDate/> all character
escapes in the registry will be converted to UTF-8."

Most importantly, if UTF-8 is going to be easier for 95% of mainstream
users, couldn't we tell 5% of peculiar users to deal w/ it? The registry
for the internet should probably be based on up-to-date technologies,
rather than trying to accommodate obsolete ones as well, which i think
are used to rarely to be cared about anyway.

- --
My public GPG key (ID 0x262839AF) is at: http://keyserver.veridis.com:11371
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.2.1 (Cygwin)

iD8DBQFGfAMfO75ytyYoOa8RAmvdAKCFyK9AXwVpB0t1zFleyELBVrtHlACfcjp7
WxouWLWqXt+rq6xf4I9iwgQ=
=4ilM
-----END PGP SIGNATURE-----


_______________________________________________
Ltru mailing list
Ltru@ietf.org
https://www1.ietf.org/mailman/listinfo/ltru