Re: [Idna-update] emoji and security

Asmus Freytag <asmusf@ix.netcom.com> Wed, 14 March 2018 00:09 UTC

Return-Path: <asmusf@ix.netcom.com>
X-Original-To: idna-update@ietfa.amsl.com
Delivered-To: idna-update@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 3A8EE126DFB for <idna-update@ietfa.amsl.com>; Tue, 13 Mar 2018 17:09:34 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -2.72
X-Spam-Level:
X-Spam-Status: No, score=-2.72 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, RCVD_IN_DNSWL_LOW=-0.7, RCVD_IN_MSPIKE_H3=-0.01, RCVD_IN_MSPIKE_WL=-0.01, SPF_PASS=-0.001, URIBL_BLOCKED=0.001] autolearn=ham autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (2048-bit key) header.d=ix.netcom.com; domainkeys=pass (2048-bit key) header.from=asmusf@ix.netcom.com header.d=ix.netcom.com
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id d8wCTzGo2sog for <idna-update@ietfa.amsl.com>; Tue, 13 Mar 2018 17:09:32 -0700 (PDT)
Received: from elasmtp-masked.atl.sa.earthlink.net (elasmtp-masked.atl.sa.earthlink.net [209.86.89.68]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id E954312E897 for <idna-update@ietf.org>; Tue, 13 Mar 2018 17:09:28 -0700 (PDT)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=ix.netcom.com; s=dk12062016; t=1520986168; bh=9F6xIhs8cz2ComEpKBUISzu0m1vnUiNpi3gQ XGXGTgE=; h=Received:Subject:To:References:From:Message-ID:Date: User-Agent:MIME-Version:In-Reply-To:Content-Type: Content-Transfer-Encoding:Content-Language:X-ELNK-Trace: X-Originating-IP; b=C8Ff9jN+m7+tcWVuzAhCmbq8aw55Rubd4pWA73mslXMYq/ bBhETwawM+wNyiWL1KjsE4iGBzupNGwhk1JBO4b+sf3PbW+HdjY/VMYZHKjw43TSbm2 fsnYNoykOGO29V6kcbHK2JvgnSP/bL6IyY9pn1GAiznZ0ofGCMfLPxDvRtVKFNZMWhZ DJlT3JOPAXPfI9OmnNrBGLTxR5UNO/ZPgdg2VvegShr0RqURjoFQ4iAq3ipGIXBTIbU zpeo4dkBL+k+3r6auxNFv0Ao/hYRtKdF4ta87S4U/QFImp+oTK2IIH7uzPgsQNTRgwA L4d88cY6OINazaq69xaSnRLFsdOw==
DomainKey-Signature: a=rsa-sha1; q=dns; c=nofws; s=dk12062016; d=ix.netcom.com; b=hZt3y2iRDTOdP0DztFFApvoPExg9kFZkuqmCbW19BpJmvJuhLaX0hg+Id9ok9qdGqlS5r+EpaTW3NLQR35KrarpZLAh7cGIpWsXCTDvXZO7RJyrYS/SwdOn0v1s908W27Y1enBvr805nET9fz9qgeA1aJGahQGfhddKhnqMQ5A+QhxGXdnngCBTmnR7ooQRoTq5be7KFV3o6A5hQJqoylWbaBHWOzn370rAXzOomiaX+KUrYZJB5gWGriV5EQy+0azniTtIV/TBFmyJKMFmbGoMKBp8lOqQNrIE+QL7cacYS4lmPtuAPtF0EeoP36oSCGX+wclRVSg31jJCpSZbKlA==; h=Received:Subject:To:References:From:Message-ID:Date:User-Agent:MIME-Version:In-Reply-To:Content-Type:Content-Transfer-Encoding:Content-Language:X-ELNK-Trace:X-Originating-IP;
Received: from [46.21.151.107] (helo=[10.4.47.190]) by elasmtp-masked.atl.sa.earthlink.net with esmtpa (Exim 4) (envelope-from <asmusf@ix.netcom.com>) id 1evtyx-000Gp8-BG for idna-update@ietf.org; Tue, 13 Mar 2018 20:09:27 -0400
To: idna-update@ietf.org
References: <533bb471-da9b-64d0-76aa-a8a1251d256b@ix.netcom.com> <DM5PR1901MB219712F39A6297F9A147312DA2D30@DM5PR1901MB2197.namprd19.prod.outlook.com> <20180313202505.ztersmy2z5xuxlvp@mx4.yitter.info> <DM5PR1901MB2197A704B3233E5236EB703AA2D20@DM5PR1901MB2197.namprd19.prod.outlook.com>
From: Asmus Freytag <asmusf@ix.netcom.com>
Message-ID: <ac2e51de-a9ad-c8ee-96b0-5b50a0e225c4@ix.netcom.com>
Date: Tue, 13 Mar 2018 17:09:29 -0700
User-Agent: Mozilla/5.0 (Windows NT 6.1; WOW64; rv:52.0) Gecko/20100101 Thunderbird/52.6.0
MIME-Version: 1.0
In-Reply-To: <DM5PR1901MB2197A704B3233E5236EB703AA2D20@DM5PR1901MB2197.namprd19.prod.outlook.com>
Content-Type: text/plain; charset=utf-8; format=flowed
Content-Transfer-Encoding: 7bit
Content-Language: en-US
X-ELNK-Trace: 464f085de979d7246f36dc87813833b2c1627926350bb93f36669cd4014ef125d9c60059c04c87f9350badd9bab72f9c350badd9bab72f9c350badd9bab72f9c
X-Originating-IP: 46.21.151.107
Archived-At: <https://mailarchive.ietf.org/arch/msg/idna-update/WWoWeWCobA3AYQcAMzEDvxp3TNA>
Subject: Re: [Idna-update] emoji and security
X-BeenThere: idna-update@ietf.org
X-Mailman-Version: 2.1.22
Precedence: list
List-Id: "Internationalized Domain Names in Applications \(IDNA\) implementation and update discussions" <idna-update.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/idna-update>, <mailto:idna-update-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/idna-update/>
List-Post: <mailto:idna-update@ietf.org>
List-Help: <mailto:idna-update-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/idna-update>, <mailto:idna-update-request@ietf.org?subject=subscribe>
X-List-Received-Date: Wed, 14 Mar 2018 00:09:34 -0000

On 3/13/2018 1:44 PM, Michel Suignard wrote:
> <<
> If the general-category approach to IDNA2008 was unsuitable in 2008 because it yielded examples of things that should never in principle be identifiers, I wish someone would have made that argument.  I do not recall it having been made, and I believe I participated pretty avidly.  I do recall some people arguing that various archaic things were "not needed" or "almost always unusable by anyone", but that would have put us in the situation of going through everything one code point at a time.  There were some who wanted to try to re-do
> IDNA2003 only updated for then-current versions of Unicode, but I think experience shows that wouldn't have worked out too well either.
> No, I still think in context, general-category approach was the best approach possible, short of developing a new category which was unpalatable to most. As you correctly pointed out in your answer in my comment about Egyptian hieroglyphs, these may be used in limited-use identifier context (for example if you want to identify base Old Egyptian phonemes) but with some caveat.

The design of IDNA2008 reflects two or three conflicting thrusts:

1) it was intended to be updatable to newer versions by simply applying 
the properties, with using an exception list as a fail-safe.
2) it was intended to avoid clear-cut cases of dual encoding (via 
normalization)
3) it was intended to be usable on all levels of the tree - therefore it 
could not be maximally restrictive

It would have been easy to restrict IDNA 2008 to modern scripts only - 
the number of scripts that are in actual modern use of significance for 
network identifiers is not something that changes based on Unicode 
version (there really aren't any unencoded ones left in that category). 
They could have been enumerated and their script properties be used to 
filter the code points.

That was seen as too restrictive - and it's not worth arguing about 
whether that was the wrong or correct thing to do.

However, many non-modern scripts are really not suitable for general 
identifiers: there's not a body of working expertise on where the 
problems are with them. It's arguable that they should (have been) kept 
out of second level domains altogether, because nobody who includes them 
in a registry can be said to have done so understanding the 
consequences, therefore implicitly violating the prescription in RFC 5891.

Yet they are perfectly fine on some 3rd or 4th level, for example any 
level, where the same entity controls all labels and there therefore 
can't be a risk from spoofing, etc.

Now, given that the protocol as a whole is that permissive, and given 
that there are some dozens of outright homographs even in the modern 
scripts, and have been from 2008 (not to mention hundreds if not 
thousands of rather similar-looking code points), it's bordering on the 
absurd to stop all work on updating IDNA 2008 over the case of a single 
Arabic addition that isn't even an exact homoglyph (fonts that I have 
researched tend to bind the precomposed mark more tightly to the 
skeleton than an independent code point having the same shape as the mark).

So, Michel's question is relevant:

>
> Still, what needs to happen to unlock the IANA IDN table from being frozen at Unicode 6.3? Unicode 11.0 is months away. I don't think anyone wants to revise the protocol (IDNA 2008), can't we just agree that the protocol is not 100% failsafe and move on? Documenting the problematic cases outside the protocol seems to me the best way to proceed. And it should not be a gating factor in letting IANA revise their table (although it would be great if that documentation was progressing).
So, let's get back to that discussion.

A./
>
> Best regards
>
> Michel
> _______________________________________________
> IDNA-UPDATE mailing list
> IDNA-UPDATE@ietf.org
> https://www.ietf.org/mailman/listinfo/idna-update
>