Re: [Idna-update] [Ext] FWD: Expiration impending: <draft-klensin-idna-rfc5891bis-01.txt>

Asmus Freytag <asmusf@ix.netcom.com> Tue, 06 March 2018 07:45 UTC

Return-Path: <asmusf@ix.netcom.com>
X-Original-To: idna-update@ietfa.amsl.com
Delivered-To: idna-update@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id BE668124217 for <idna-update@ietfa.amsl.com>; Mon, 5 Mar 2018 23:45:46 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -2.72
X-Spam-Level:
X-Spam-Status: No, score=-2.72 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, HTML_MESSAGE=0.001, RCVD_IN_DNSWL_LOW=-0.7, RCVD_IN_MSPIKE_H3=-0.01, RCVD_IN_MSPIKE_WL=-0.01, SPF_PASS=-0.001] autolearn=ham autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (2048-bit key) header.d=ix.netcom.com; domainkeys=pass (2048-bit key) header.from=asmusf@ix.netcom.com header.d=ix.netcom.com
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id MRbAoB3AaEvJ for <idna-update@ietfa.amsl.com>; Mon, 5 Mar 2018 23:45:43 -0800 (PST)
Received: from elasmtp-mealy.atl.sa.earthlink.net (elasmtp-mealy.atl.sa.earthlink.net [209.86.89.69]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 7B998126C89 for <idna-update@ietf.org>; Mon, 5 Mar 2018 23:45:42 -0800 (PST)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=ix.netcom.com; s=dk12062016; t=1520322343; bh=oQUaADcVMfgf5Ps26ibe6tEHjem6yMMJ8SBn uvke6UA=; h=Received:Subject:To:References:From:Message-ID:Date: User-Agent:MIME-Version:In-Reply-To:Content-Type:Content-Language: X-ELNK-Trace:X-Originating-IP; b=eSrJy3hL8T3nWheDy2xvA7pi9y7tL8P3k ZRIVeWHSkUhu4hXYQwMbP0R40s7Urt0UdGzTOF6ADFcqNJhblUa61dSIcUGb5A3cZkQ K+uxemFgjI51f9kJVXJ0ev2CL/IbmVVnsoum3ZMc0vxkNA8paVBYyIUP6+uWe3crhFT wz1D8Y0DnG7eoVkvU4uELrQWEttX0xXjCkyIjhpPt7MjblGcS4QlVhZjfst2lqTMhez Q2C2xBm2rEahY8Z3wdiFXabv8bbAdLaVvcCfsZSCyhbta7i/qfZR4zFvMqNVQH7YpMr mnQME2B+psXfHE9Eb6SKmKhrRmu05rRZsm6WYpWFQ==
DomainKey-Signature: a=rsa-sha1; q=dns; c=nofws; s=dk12062016; d=ix.netcom.com; b=d1B78cCpcAOmy+j58V7qvskjqc5DqtabZ81nZeo7AWw7XljJp0VSN7eqTAFpD6x0x+ci4bhpO3wohs1v7BBuGGSStSByVI4ssttet4SlXTJgcBgoUKyrss2tWlzni49s/hKL4DlcZuMwEQnOCqWLN94b4607myrK/PSnzo2/kQz2P/Di9sKkJiGOC36Fsvf7okuGn2rwbh2K10yNb1onYUyspvtCtRPuTCgzZgys9T7JhMql+JLc1SwmEW1lz4gGmWzTGN3iSyQToeqP3X2N6v+3hEr4/OtoZvQ1w66qUmA1JJdqg80Xa+SoHB0FswiWGEUnNy3Jia0kcVk+29O2LQ==; h=Received:Subject:To:References:From:Message-ID:Date:User-Agent:MIME-Version:In-Reply-To:Content-Type:Content-Language:X-ELNK-Trace:X-Originating-IP;
Received: from [71.35.186.204] (helo=[192.168.0.5]) by elasmtp-mealy.atl.sa.earthlink.net with esmtpa (Exim 4) (envelope-from <asmusf@ix.netcom.com>) id 1et7I4-000Eua-To for idna-update@ietf.org; Tue, 06 Mar 2018 02:45:41 -0500
To: idna-update@ietf.org
References: <0AAE384126E73857E6EEC32C@PSB> <20180305191527.GA99731@KIDA-6861.local> <822FD6FA-4FA5-449D-9491-01315DB57A9E@frobbit.se> <161f7c23760.2772.55b9c0b96417b0a70c4dcaded0d2e1c6@anvilwalrusden.com> <9A04CF8C-DF86-4562-8AC0-21EF0FF539FF@frobbit.se> <7BE50D38-969D-422A-AF0F-C58B442472FE@gmail.com>
From: Asmus Freytag <asmusf@ix.netcom.com>
Message-ID: <36497509-9b33-7996-c322-efb1862d9695@ix.netcom.com>
Date: Mon, 05 Mar 2018 23:45:41 -0800
User-Agent: Mozilla/5.0 (Windows NT 6.1; WOW64; rv:52.0) Gecko/20100101 Thunderbird/52.6.0
MIME-Version: 1.0
In-Reply-To: <7BE50D38-969D-422A-AF0F-C58B442472FE@gmail.com>
Content-Type: multipart/alternative; boundary="------------FD139A9ED89CAC9CD63546C9"
Content-Language: en-US
X-ELNK-Trace: 464f085de979d7246f36dc87813833b2c1627926350bb93fae72648617988c31bbb110dbefe2e643350badd9bab72f9c350badd9bab72f9c350badd9bab72f9c
X-Originating-IP: 71.35.186.204
Archived-At: <https://mailarchive.ietf.org/arch/msg/idna-update/scM-wilAnaDcTyUg-2HGg_WhZwM>
Subject: Re: [Idna-update] [Ext] FWD: Expiration impending: <draft-klensin-idna-rfc5891bis-01.txt>
X-BeenThere: idna-update@ietf.org
X-Mailman-Version: 2.1.22
Precedence: list
List-Id: "Internationalized Domain Names in Applications \(IDNA\) implementation and update discussions" <idna-update.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/idna-update>, <mailto:idna-update-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/idna-update/>
List-Post: <mailto:idna-update@ietf.org>
List-Help: <mailto:idna-update-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/idna-update>, <mailto:idna-update-request@ietf.org?subject=subscribe>
X-List-Received-Date: Tue, 06 Mar 2018 07:45:47 -0000

On 3/5/2018 1:23 PM, Suzanne Woolf wrote:
> Patrik,
>
> That was before my time on the IAB, sorry.
>
> But as it happens, I’m looking at an update to the IAB statement, so 
> have been pondering this very question. Do you think that 
> https://www.ietf.org/id/draft-klensin-idna-5892upd-unicode70-05.txt covers 
> the options reasonably well?

There is one statement here (and it's the premise of the whole affair) 
that is actually not 100% correct:

    Unicode 7.0.0 introduces the new code point U+08A1, ARABIC LETTER BEH
    WITH HAMZA ABOVE.  As can be deduced from the name, it is/*visually identical*/  to the glyph that can be formed from a combining sequence
    consisting of the code point for ARABIC LETTER BEH (U+0628) and the
    code point for Combining Hamza Above (U+0654).

(/*emphasis */added)

In fact, in many (all?) fonts that do support this code point, there's a 
noticeable difference between the combining sequence and the precomposed 
character. Here is one example:



similar systematic differences, although slight, obtain in many cases of 
combining sequences that /appear at first to be identical/ to 
precomposed Latin code points without being normalized:


where the right side is 'o' + 0x0337   [this case has been in Unicode 
since IDNA2008 first started]

Obviously much depends on the font, but because the entire premise rests 
on the *absolute identical appearance* of the sequences to the code 
points in question, I think we need to take a step back here.

Perhaps, one difference since the time this was first raised is that the 
code point 08A1 is now supported in a number of fonts, which may not 
have been the case immediately after Unicode 7.0 was released and it may 
not have been apparent that the precomposed code point would be treated 
differently from a sequence that was so apparently made up from the same 
elements.

To put this into perspective: there are thousands of PVALID Chinese 
characters for which the distinctions are equally subtle, just saying. 
(And no, for the most part, they are not variants of each other). Yet 
this didn't stop IDNA from tracking many thousands of Han additions over 
the years.

I think the best thing for the IAB is to find a way to climb down from 
seeing this as an earth-shattering crisis and accepting such near 
similarities as unavoidable consequences of trying to implement mnemonic 
identifiers that are readable to the world population.

The right way forward is simply not to leap to the assumption that 
action on the protocol level is called for for these cases, but that 
they are in some ways the end points of a continuum that includes cases 
like .com/.corn and apple.com/app1e.com.

The right way forward also means that one cannot blithely assume that 
identifiers are always made from a small set of very distinct shapes 
(like most of the Latin alphabet) and to perhaps find ways to mitigate 
the worst cases.

A./


>
>
> Thanks,
> Suzanne
> ("member of, not speaking for, the IAB")
>
>> On Mar 5, 2018, at 3:22 PM, Patrik Fältström <paf@frobbit.se 
>> <mailto:paf@frobbit.se>> wrote:
>>
>> I did send the question, and gave choices, to IAB on July 9 2014.
>>
>> Message-Id: <93768164-79F7-443D-8A7F-0411661B6EF9@frobbit.se 
>> <mailto:93768164-79F7-443D-8A7F-0411661B6EF9@frobbit.se>>
>>
>> Since then the question has been stalled waiting for consensus on how 
>> to move forward.
>>
>>   Patrik
>>
>> On 5 Mar 2018, at 21:02, Andrew Sullivan wrote:
>>
>>> Does the IAB know that, and have an opinion?
>>>
>>> A
>>>
>>> -- 
>>> Please excuse my clumbsy thums
>>>
>>>
>>>
>>> ----------
>>> On March 5, 2018 14:38:48 Patrik Fältström <paf@frobbit.se 
>>> <mailto:paf@frobbit.se>> wrote:
>>>
>>>> On 5 Mar 2018, at 20:15, Kim Davies wrote:
>>>>
>>>>> Quoting John C Klensin on Monday March 05, 2018:
>>>>>
>>>>>> Given that there has been no discussion of this draft in the
>>>>>> last few months, that there seems to be no interest in it in the
>>>>>> IESG (although we have not pressed hard on it specifically), and
>>>>>> that the possibly-complementary
>>>>>> draft-freytag-troublesome-characters draft has already expired,
>>>>>> I'm just going to let this expire unless someone gives me a good
>>>>>> reason why not in the next few minutes (if a revision is not
>>>>>> posted today, the draft will expire because, without special
>>>>>> permission, nothing can be posted between tomorrow and the start
>>>>>> of IETF).
>>>>>>
>>>>>> I hope no one interprets this as complete lack of interest in
>>>>>> pursuing IDN issues, or even clarification of the IDNA
>>>>>> standards, in the IETF.
>>>>>
>>>>>
>>>>> With the expiry of these efforts, I am concerned more generally 
>>>>> that the standing IAB guidance in
>>>>> https://www.iab.org/documents/correspondence-reports-documents/2015-2/iab-statement-on-identifiers-and-unicode-7-0-0/
>>>>> has kept the pre-computed IDNA tables published at IANA downgraded 
>>>>> to Unicode 6.3.0 for a number of years. It seems appropriate to 
>>>>> revisit publishing tables against contemporary Unicode editions if 
>>>>> there is no active work on publishing clarifying RFCs in this regard.
>>>>
>>>> Speaking as the by IAB appointed expert that do approve new tables, 
>>>> I simply can not unless there are some guidance and agreement in 
>>>> the IETF on how to move forward.
>>>>
>>>> The few people involved that spend time on these issues makes it 
>>>> hard to draw conclusions on "the right path forward". This given 
>>>> the situation we are in, as described by John.
>>>>
>>>>   Patrik
>>>>
>>>>
>>>>
>>>> ----------
>>>> _______________________________________________
>>>> IDNA-UPDATE mailing list
>>>> IDNA-UPDATE@ietf.org <mailto:IDNA-UPDATE@ietf.org>
>>>> https://www.ietf.org/mailman/listinfo/idna-update
>>>>
>> _______________________________________________
>> IDNA-UPDATE mailing list
>> IDNA-UPDATE@ietf.org <mailto:IDNA-UPDATE@ietf.org>
>> https://www.ietf.org/mailman/listinfo/idna-update
>
>
>
> _______________________________________________
> IDNA-UPDATE mailing list
> IDNA-UPDATE@ietf.org
> https://www.ietf.org/mailman/listinfo/idna-update