Re: [storm] iSCSI and Unicode (newprep BOF)

Julian Satran <julian.satran@gmail.com> Wed, 24 March 2010 05:47 UTC

Return-Path: <julian.satran@gmail.com>
X-Original-To: storm@core3.amsl.com
Delivered-To: storm@core3.amsl.com
Received: from localhost (localhost [127.0.0.1]) by core3.amsl.com (Postfix) with ESMTP id D539F3A6CBE for <storm@core3.amsl.com>; Tue, 23 Mar 2010 22:47:39 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -1.61
X-Spam-Level:
X-Spam-Status: No, score=-1.61 tagged_above=-999 required=5 tests=[BAYES_20=-0.74, DNS_FROM_OPENWHOIS=1.13, GB_I_LETTER=-2]
Received: from mail.ietf.org ([64.170.98.32]) by localhost (core3.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id cv9edlb7NSnF for <storm@core3.amsl.com>; Tue, 23 Mar 2010 22:47:38 -0700 (PDT)
Received: from mail-bw0-f211.google.com (mail-bw0-f211.google.com [209.85.218.211]) by core3.amsl.com (Postfix) with ESMTP id CC5A53A6CD3 for <storm@ietf.org>; Tue, 23 Mar 2010 22:47:20 -0700 (PDT)
Received: by bwz3 with SMTP id 3so6083639bwz.29 for <storm@ietf.org>; Tue, 23 Mar 2010 22:47:37 -0700 (PDT)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:received:received:references:message-id:from:to :in-reply-to:content-type:content-transfer-encoding:mime-version :subject:date:cc:x-mailer; bh=fqJmmOgBDN/6aEmL+6JSWVgHuzvStn42Tyj84y/R9qA=; b=e5xKO0ZJV5Ki4+334RG5+o85r05efjz5HLnJYUBgscnOMGrkgOmNfQTJeqG/EBcznI /8sNJE1N/KQsKUGxZgKDYFQKIf6PODLA5qshzePq1i5f3KywcJUq9QuztlE+yuwm5s/o +KVRxjpIajeOuiwI5Kt31NRjKTNn0D8MrgndA=
DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=references:message-id:from:to:in-reply-to:content-type :content-transfer-encoding:mime-version:subject:date:cc:x-mailer; b=N6hX3Bajhd+4VOppOEfXFewhiJqXVqMF7SRCXH8rte+I3tgyIEi5kprXrfiEaUlnK2 3MlvOXR/nmiA2bXLFtYRzdOhmolFMiCn9ePRQMQUIxGJFIF/VvewwMYyfvDXaI7g8rVs P8Hnju08FiZKvtflwEvoOFnDm/oxUR54h/ABo=
Received: by 10.204.138.219 with SMTP id b27mr6576789bku.139.1269409657253; Tue, 23 Mar 2010 22:47:37 -0700 (PDT)
Received: from [10.180.203.137] ([192.118.11.118]) by mx.google.com with ESMTPS id l1sm32553110bkl.8.2010.03.23.22.47.33 (version=TLSv1/SSLv3 cipher=RC4-MD5); Tue, 23 Mar 2010 22:47:35 -0700 (PDT)
References: <C2D311A6F086424F99E385949ECFEBCB01FA6ADD@CORPUSMX80B.corp.emc.com> <D8CEBB6AE9D43848BD2220619A43F326538EF2@M31.equallogic.com> <C2D311A6F086424F99E385949ECFEBCB020DAA1B@CORPUSMX80B.corp.emc.com> <D8CEBB6AE9D43848BD2220619A43F326538F55@M31.equallogic.com> <C2D311A6F086424F99E385949ECFEBCB020DACD8@CORPUSMX80B.corp.emc.com>
Message-Id: <539DC7A9-D167-4606-A479-B229A0308AF4@gmail.com>
From: Julian Satran <julian.satran@gmail.com>
To: "<Black_David@emc.com>" <Black_David@emc.com>
In-Reply-To: <C2D311A6F086424F99E385949ECFEBCB020DACD8@CORPUSMX80B.corp.emc.com>
Content-Type: text/plain; charset="us-ascii"; format="flowed"; delsp="yes"
Content-Transfer-Encoding: 7bit
Mime-Version: 1.0 (iPhone Mail 7D11)
Date: Wed, 24 Mar 2010 07:47:12 +0200
X-Mailer: iPhone Mail (7D11)
Cc: "<Paul_Koning@Dell.com>" <Paul_Koning@Dell.com>, "<storm@ietf.org>" <storm@ietf.org>
Subject: Re: [storm] iSCSI and Unicode (newprep BOF)
X-BeenThere: storm@ietf.org
X-Mailman-Version: 2.1.9
Precedence: list
List-Id: Storage Maintenance WG <storm.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/listinfo/storm>, <mailto:storm-request@ietf.org?subject=unsubscribe>
List-Archive: <http://www.ietf.org/mail-archive/web/storm>
List-Post: <mailto:storm@ietf.org>
List-Help: <mailto:storm-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/storm>, <mailto:storm-request@ietf.org?subject=subscribe>
X-List-Received-Date: Wed, 24 Mar 2010 05:47:40 -0000

A way out that would enable things to coexist would be to negotiate  
stringprep or nameprep with stringprep mandatory and default and  
nameprep only when the two parties have it. It may not be a major  
issue for a while (or till we learn to live in Babylon :-)).

Regards,
Julo

On 24/03/2010, at 04:59, <Black_David@emc.com> wrote:

> Bad news - we are going to have unavoidable incompatibility problems,
> courtesy of domain name changes in how Unicode is handled.  There are
> four characters that cause these problems, of which the easiest to
> explain is the German eszett or sharp s (looks like a Greek beta
> character for those not familiar with German):
>
>    http://en.wikipedia.org/wiki/%C3%9F
>
> This is often changed to a pair of ordinary s characters (e.g. StraBe
> vs. Strasse), but that change is not always correct.  Stringprep  
> mapped
> sharp s to ss, but the new 2008 domain name specifications apparently
> allow the Unicode character to be used - LATIN SMALL LETTER SHARP S'
> (U+00DF).  Any IQN constructed from such a domain name would be
> corrupted by stringprep mapping sharp s to ss.
>
> While easiest to explain, this is not the worst problem - the worst
> problem involves some non-printing format characters that stringprep
> ignores in a fashion that causes serious problems (meaning changes) in
> languages such as Persian.
>
> More details on the newprep BOF coming in a later message.
>
> Thanks,
> --David
>
>> -----Original Message-----
>> From: Paul Koning [mailto:Paul_Koning@Dell.com]
>> Sent: Tuesday, March 23, 2010 3:34 PM
>> To: Black, David; storm@ietf.org
>> Subject: RE: [storm] iSCSI and Unicode (newprep BOF)
>>
>>>> Re "Any update SHOULD be backward compatible with RFC 3722" -- I
>>> don't understand the "SHOULD".  The
>>>> obvious word is "MUST".  More precisely -- and maybe this is where
>>> the "SHOULD" came in? -- an IQN
>>>> that was legal under RFC 3722 must still be legal, and two names
>> that
>>> under the old rules were
>>>> considered different must still be different.
>>>
>>> The "SHOULD" is there to a significant extent so that I can plead
> "I'm
>>> not a Unicode expert" ;-).  At the iSCSI protocol level, I expect
>>> everything to be backwards-compatible because iSCSI names aren't
>>> checked for stringprep compliance on login (the iSCSI name string
>>> comparison for equality is basically opaque wrt Unicode issues).
>>>
>>> Moving up to the iSCSI name input level (human involvement), it is
>>> possible that things have changed in Unicode so that the same
> Unicode
>>> input string will need to map to a different newprep output string
> vs.
>>> stringprep output string for reasons involving Unicode.  If that's
> the
>>> case (and especially if some previously valid inputs have to be
>>> prohibited), then everything will need to be carefully explained and
>>> documented with advice on how to cope ... but I hesitate to lead
> with
>> a
>>> "MUST", as the IDN folks have much deeper knowledge of what's going
> on
>>> here, what needed to be done with Unicode domain names, and why (a
>>> "MUST" risks freezing iSCSI @ an old version of Unicode).
>>
>> If so, then so be it.  I really do want MUST here.
>>
>> Consider a simple example.  To avoid getting deep into Unicode, let's
>> use an ASCII analogy.  Let's say the old rule is that names are
> prepped
>> by tolower() and the new rule says they are prepped by toupper().   
>> And
>> let's assume that this operation is done only when the target name is
>> mentioned in some management UI as you suggested.  (I think we might
> do
>> it also at login, but it doesn't actually affect the outcome of the
>> example in a way that matters.)
>>
>> Ok, so in the old version, the admin asked me to create target Foo,
>> which I prepped to "foo", so we now have that target.  Next the user
>> tells the initiator to log in to Foo and the initiator preps that to
>> "foo" which comes across the login protocol exchange and all is well.
>>
>> Now we go to the new rule.  The admin sets up a new initiator, and
> tell
>> it to connect to Foo.  That initiator UI preps the string, which
> results
>> in "FOO".  That comes across in the login.  We do binary compare at
>> login time, so the result of the login operation is "no such target".
>>
>> You might think you can cure this problem by re-prepping all
> previously
>> stored names with the new rules.  That works ONLY if (a) all
> previously
>> valid names are still valid, (b) all previously distinct names are
> still
>> distinct.
>>
>> We clearly can't have a case where an existing iSCSI target becomes
>> inaccessible because of new name rules; this is my reason for saying
>> that any newprep changes MUST not have such an effect.
>>
>>    paul
>>
>
> _______________________________________________
> storm mailing list
> storm@ietf.org
> https://www.ietf.org/mailman/listinfo/storm