Re: [EAI] Proposed resolution of List-* and downgrade- Issues

"Frank Ellermann" <hmdmhdfmhdjmzdtjmzdtzktdkztdjz@gmail.com> Mon, 04 August 2008 15:20 UTC

To: ima@ietf.org
From: Frank Ellermann <hmdmhdfmhdjmzdtjmzdtzktdkztdjz@gmail.com>
Date: Mon, 04 Aug 2008 17:21:28 +0200
Organization: <URL:http://purl.net/xyzzy>
Lines: 71
Message-ID: <g776nm$rr7$1@ger.gmane.org>
References: <p06240609c4b3aaea35c0@[130.129.20.99]><6.0.0.20.2.20080730151421.0847dc08@localhost><g6pb4f$mi3$1@ger.gmane.org> <op.ue5h8sha6hl8nm@clerew.man.ac.uk><g6smit$cmn$1@ger.gmane.org> <op.ufczzgfa6hl8nm@clerew.man.ac.uk>
Mime-Version: 1.0
Subject: Re: [EAI] Proposed resolution of List-* and downgrade- Issues

Charles Lindsey wrote:

>> JFTR, s/as always with IRIs/as always with EAI/
[...]
> 1. You convert the KOI8-R into Unicode
> 2. You convert the Unicode into UTF-8
> 3. You %-encode the UTF-8
> 4. Now you have the proper URI

> Of course if the context is headers within a UTF8SMTP
> message, then steps 1 and 2 are redundant.

Yes, that's what I meant.  And depending on the charset,
steps 1 and 2 can be combined; KOI8-R is an example
where that works.  It works for every SBCS that has
US-ASCII as a proper subset.
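
The four steps above can be sketched in Python as follows.  This
is only an illustration: the function names and the sample word
are made up for the example, and Python's "koi8_r" codec stands
in for whatever decoder a real agent would use.

```python
from urllib.parse import quote


def koi8r_to_percent_encoded(raw: bytes) -> str:
    """Steps 1-3 spelled out one at a time (hypothetical helper)."""
    text = raw.decode("koi8_r")    # step 1: KOI8-R octets -> Unicode
    utf8 = text.encode("utf-8")    # step 2: Unicode -> UTF-8 octets
    return quote(utf8, safe="")    # step 3: %-encode the UTF-8 octets


def koi8r_to_percent_encoded_combined(raw: bytes) -> str:
    """Same result; quote() encodes a str as UTF-8 by default,
    so step 2 disappears from the caller's view."""
    return quote(raw.decode("koi8_r"), safe="")


# Sample input (an assumption for the example): a Cyrillic word
# as it would arrive in a KOI8-R encoded message.
sample = "почта".encode("koi8_r")
print(koi8r_to_percent_encoded(sample))
# prints %D0%BF%D0%BE%D1%87%D1%82%D0%B0

# US-ASCII is a proper subset of KOI8-R, so ASCII octets pass
# through unchanged.
print(koi8r_to_percent_encoded(b"archive"))
# prints archive
```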
  
>> you would use IDNA(bis) rules to convert this into
>> a URL:
>> Archived-At: <http://archive.xn--maana-pta.com/>

> RFC 3987 says you MAY (possibly even SHOULD) convert
> to punycode in this case, but you also MAY just %-encode
> it and hope whatever comes next knows how to punycode it
> before doing any DNS lookup.

The goal is to get a <reg-name> for DNS in many relevant
contexts:

| A registered name intended for lookup in the DNS uses
| the syntax defined in Section 3.5 of [RFC1034] and
| Section 2.1 of [RFC1123].

 [Digression, this remark in RFC 3986 kills hypothetical
  IDN TLDs again, until RFC 1123 section 2.1 is updated.]

| Such a name consists of a sequence of domain labels
| separated by ".",  each domain label starting and ending
| with an alphanumeric character and possibly also
| containing "-" characters.

Clearly no "percent" characters in a <reg-name> for DNS.
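
As a rough check of the quoted syntax, here is a minimal Python
sketch.  The function name is hypothetical, and the pattern only
covers the label rule quoted above (alphanumeric at both ends,
"-" allowed inside); it deliberately ignores the length limits
that DNS also imposes.

```python
import re

# One label: starts and ends with an alphanumeric character,
# with optional alphanumerics or "-" in between.
_LABEL = r"[A-Za-z0-9](?:[A-Za-z0-9-]*[A-Za-z0-9])?"
_REG_NAME = re.compile(rf"{_LABEL}(?:\.{_LABEL})*")


def looks_like_dns_reg_name(name: str) -> bool:
    """True if every dot-separated label matches the quoted rule."""
    return _REG_NAME.fullmatch(name) is not None


print(looks_like_dns_reg_name("archive.xn--maana-pta.com"))  # prints True
print(looks_like_dns_reg_name("archive.ma%C3%B1ana.com"))    # prints False
```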

That is as it should be.  DNS won't do the punycode magic,
and legacy applications (i.e. almost all applications minus
some modern browsers, XMPP, or similar) won't do it either,
but RFC 3987 "does" it behind a barrage of MAY and SHOULD
weasel words.  Where it doesn't, it cannot work: there is
no archive.ma%C3%B1ana.com in DNS.
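
To make the two outcomes concrete, the following Python sketch
produces both forms of the host name.  One caveat: Python's
built-in "idna" codec implements the original IDNA (RFC 3490),
not IDNAbis; I'd expect both to agree for this particular name,
but that is an assumption.

```python
from urllib.parse import quote

host = "archive.mañana.com"

# What "just %-encode it" produces: raw UTF-8 octets, escaped.
percent_form = quote(host)

# What a DNS lookup actually needs: the punycode A-label.
a_label_form = host.encode("idna").decode("ascii")

print(percent_form)   # prints archive.ma%C3%B1ana.com
print(a_label_form)   # prints archive.xn--maana-pta.com
```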

Hopefully, nobody knows what this could be if it exists;
I'd bet on all possible malicious purposes.

But you are right - any serious 3987bis proposal has to
fix this strange "MAY or SHOULD" to a simple "MUST" wrt
DNS.  Nobody is going to register %-encoded raw UTF-8
CNAMEs for all IDN A-labels.  Only the opposition might
try that where it sees a potential profit.

> IRIs are not currently allowed in the 'http' scheme

Why do you think that http-IRIs are not allowed?  My
assumption is that all URI-schemes can be also used as
IRI-schemes, in theory, down to 'mailto' as worst case.

But some document formats, notably HTML 4, XHTML 1, and
message/rfc822 header fields, allow only URIs.  And no
oddities with "percent" characters in a <reg-name> if
this is about the registry known as DNS.  
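
For such URI-only contexts, a conversion along these lines would
yield a plain URI.  This is a simplified sketch, not the full
RFC 3987 mapping: iri_to_uri is a hypothetical helper, userinfo
is dropped, and the host always goes through Python's "idna"
codec (RFC 3490 rules) instead of being %-encoded.

```python
from urllib.parse import quote, urlsplit, urlunsplit


def iri_to_uri(iri: str) -> str:
    """Host -> A-labels; path, query, fragment -> %-encoded UTF-8."""
    parts = urlsplit(iri)
    host = parts.hostname or ""
    netloc = host.encode("idna").decode("ascii") if host else ""
    if parts.port is not None:
        netloc = f"{netloc}:{parts.port}"
    # "%" stays in the safe set so existing escapes are not
    # escaped a second time.
    path = quote(parts.path, safe="/%")
    query = quote(parts.query, safe="=&%")
    fragment = quote(parts.fragment, safe="%")
    return urlunsplit((parts.scheme, netloc, path, query, fragment))


print(iri_to_uri("http://archive.mañana.com/"))
# prints http://archive.xn--maana-pta.com/
print(iri_to_uri("http://archive.mañana.com/año/"))
# prints http://archive.xn--maana-pta.com/a%C3%B1o/
```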

 Frank
