Re: [iucg] Unicode 7.0.0, (combining) Hamza Above, and normalization for comparison

Jefsey <jefsey@jefsey.com> Thu, 07 August 2014 18:07 UTC

Return-Path: <jefsey@jefsey.com>
X-Original-To: iucg@ietfa.amsl.com
Delivered-To: iucg@ietfa.amsl.com
Received: from localhost (ietfa.amsl.com [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 02F3F1A033E for <iucg@ietfa.amsl.com>; Thu, 7 Aug 2014 11:07:45 -0700 (PDT)
X-Quarantine-ID: <O1xrJi10KmnH>
X-Virus-Scanned: amavisd-new at amsl.com
X-Amavis-Alert: BAD HEADER SECTION, Non-encoded 8-bit data (char E2 hex): To: ...nsin@jck.com>, Mark Davis \342\230\225\357\270\217 <mark@mac[...]
X-Spam-Flag: NO
X-Spam-Score: 1.931
X-Spam-Level: *
X-Spam-Status: No, score=1.931 tagged_above=-999 required=5 tests=[BAYES_50=0.8, IP_NOT_FRIENDLY=0.334, MIME_8BIT_HEADER=0.3, MISSING_MID=0.497] autolearn=no
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id O1xrJi10KmnH for <iucg@ietfa.amsl.com>; Thu, 7 Aug 2014 11:07:43 -0700 (PDT)
Received: from host.presenceweb.org (host.presenceweb.org [67.222.106.46]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 4C7C91A02EC for <iucg@ietf.org>; Thu, 7 Aug 2014 11:07:43 -0700 (PDT)
Received: from 21.104.14.81.rev.sfr.net ([81.14.104.21]:22447 helo=MORFIN-PC.mail.jefsey.com) by host.presenceweb.org with esmtpa (Exim 4.82) (envelope-from <jefsey@jefsey.com>) id 1XFS68-0008TI-6f; Thu, 07 Aug 2014 11:07:32 -0700
X-Mailer: QUALCOMM Windows Eudora Version 7.1.0.9
Date: Thu, 07 Aug 2014 20:07:21 +0200
To: John C Klensin <klensin@jck.com>,Mark Davis ������ <mark@macchiato.com>
From: Jefsey <jefsey@jefsey.com>
In-Reply-To: <AA3E83F7AA61539EC4FFC502@JcK-HP8200.jck.com>
References: <C0D401D76B8D1BA472604BB4@JCK-EEE10> <CAJ2xs_F9+6_+Fz-xFdSGBUV82qmMa33Y8+F9mjinMKx9=YoKcA@mail.gmail.com> <CAJ2xs_H_Gy9b_A5LZj0o9rFffbvbnVGLv+22CD7NhmZhLXE6Rg@mail.gmail.c om> <219A83FB-B0C4-4B58-93A9-84A976B9147E@frobbit.se> <20140806124932.E9AA77C3B37@mork.alvestrand.no> <AA3E83F7AA61539EC4FFC502@JcK-HP8200.jck.com>
Mime-Version: 1.0
Content-Type: text/plain; charset="us-ascii"; format=flowed
X-AntiAbuse: This header was added to track abuse, please include it with any abuse report
X-AntiAbuse: Primary Hostname - host.presenceweb.org
X-AntiAbuse: Original Domain - ietf.org
X-AntiAbuse: Originator/Caller UID/GID - [47 12] / [47 12]
X-AntiAbuse: Sender Address Domain - jefsey.com
X-Get-Message-Sender-Via: host.presenceweb.org: authenticated_id: jefsey+jefsey.com/only user confirmed/virtual account not confirmed
X-Source:
X-Source-Args:
X-Source-Dir:
Archived-At: http://mailarchive.ietf.org/arch/msg/iucg/mefFPgIneZ_L7PVPMjP3NdmsAsg
Cc: Marc Blanchet <Marc.Blanchet@viagenie.ca>, IDNA update work <idna-update@alvestrand.no>, iucg@ietf.org, gerard lang <gerard_lang@orange.fr>
Subject: Re: [iucg] Unicode 7.0.0, (combining) Hamza Above, and normalization for comparison
X-BeenThere: iucg@ietf.org
X-Mailman-Version: 2.1.15
Precedence: list
Reply-To: internet users contributing group <iucg@ietf.org>
List-Id: internet users contributing group <iucg.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/iucg>, <mailto:iucg-request@ietf.org?subject=unsubscribe>
List-Archive: <http://www.ietf.org/mail-archive/web/iucg/>
List-Post: <mailto:iucg@ietf.org>
List-Help: <mailto:iucg-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/iucg>, <mailto:iucg-request@ietf.org?subject=subscribe>
X-List-Received-Date: Thu, 07 Aug 2014 18:07:45 -0000
X-Message-ID:
Message-ID: <20140807180750.5452.8639.ARCHIVE@ietfa.amsl.com>

At 18:09 07/08/2014, John C Klensin wrote:
>Jefsey,
>
>I am not sure I understand what you are talking about but, if I
>do, it is about an almost completely different topic.

John,

as you may remember I am not interested in Unicode per se. I am 
interested in the open pragmatic use of the digisphere through 
whatever is available and people want to use. This calls for external 
(fringe to fringe) innovations not to be constrained by any internal 
(end to end) MUST. As far as I undedrstand, the issue being raised 
concerns orthotypography in a  specific language?

This has been ruled out of the IDNA2008 scope. Because it was ruled 
out the Unicode scope.

Now, what may have to be clarified is what a user calls confusable. 
For users, "confusables" are different codepoint sequences that look 
the same (whatever the reason). If the Hamza added sequences are not 
creating internet use confusion, we are not concerned. If they are, 
we are concerned. However, we MUST not decide for others: we only are 
to give them the possibility to decide by themselves.

A digital name supports a semantic address through group of visual 
signs. Whatever the underlying code, version, etc. For the time being 
the IETF has chosen a single underlying typographic code to support 
the digital names' signs. That code does not consider orthotypography 
(i.e. semantic constraints that are language dependant). So the IETF 
has chosed that its end to end protocol are not concerned by 
orthotypographic issues. Tomorrow we can chose an additional code to 
Unicode: we need the use of these two codes to be transparent to our 
own use, independently from any orthotypographic issue. This is the 
metric of our choice.

1. TLD Managers must be able to use their own add-ons to support or 
not orthotypographic aspects in their zone. How do you know if you 
will not create conflicts today with Hamza, or in "equivalent" other cases?
2. If you consider Hamza you must consider French majucules. Or am I wrong?

Best.
jfc

>At 19:37 07/08/2014, John C Klensin wrote:
>p.s. RFC 5895 was never intended to be standards track.  Talking 
>about "denying" that status to it is misleading at be

Short memories. Anyway, the point is that without Paul's and Pete's 
solution I would never have given up the orthotypographic requirement 
and IDNA would have been technically and politically blocked. I 
consider that RFC 5895 is one of the most fundamental architectural 
RFCs of the internet as it illustrates how the internet architecture 
survives external diversity, by subsidiarity at the fringe. This is 
where French majuscules can be treated (layer six experimentation).