Re: [EAI] Rechartering

John C Klensin <klensin@jck.com> Tue, 21 July 2009 18:25 UTC

Return-Path: <klensin@jck.com>
X-Original-To: ima@core3.amsl.com
Delivered-To: ima@core3.amsl.com
Received: from localhost (localhost [127.0.0.1]) by core3.amsl.com (Postfix) with ESMTP id EF9DB3A69B2 for <ima@core3.amsl.com>; Tue, 21 Jul 2009 11:25:13 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -1.801
X-Spam-Level:
X-Spam-Status: No, score=-1.801 tagged_above=-999 required=5 tests=[AWL=-0.332, BAYES_00=-2.599, DNS_FROM_OPENWHOIS=1.13]
Received: from mail.ietf.org ([64.170.98.32]) by localhost (core3.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id u6wGtLOld3Ot for <ima@core3.amsl.com>; Tue, 21 Jul 2009 11:25:12 -0700 (PDT)
Received: from bs.jck.com (ns.jck.com [209.187.148.211]) by core3.amsl.com (Postfix) with ESMTP id 508063A679C for <ima@ietf.org>; Tue, 21 Jul 2009 11:25:12 -0700 (PDT)
Received: from [127.0.0.1] (helo=p3.JCK.COM) by bs.jck.com with esmtp (Exim 4.34) id 1MTK1V-0003Jw-Fb; Tue, 21 Jul 2009 14:25:10 -0400
Date: Tue, 21 Jul 2009 14:25:08 -0400
From: John C Klensin <klensin@jck.com>
To: Shawn Steele <Shawn.Steele@microsoft.com>, Harald Alvestrand <harald@alvestrand.no>
Message-ID: <F2BC6EC973C4D97B22FB5FE1@p3.int.jck.com>
In-Reply-To: <CAD7705D4A93814F97D3EF00790AF0B315FCB1CA@TK5EX14MBXC104.redmond.corp.microsoft.com>
References: <mailman.13830.1247508102.4936.ima@ietf.org> <CAD7705D4A93814F97D3EF00790AF0B315FA6650@tk5ex14mbxc105.redmond.corp.microsoft.com> <4A5BABF8.4080900@isode.com> <CAD7705D4A93814F97D3EF00790AF0B315FA6AAF@tk5ex14mbxc105.redmond.corp.microsoft.com> <4A60AA0B.4000106@alvestrand.no> <CAD7705D4A93814F97D3EF00790AF0B315FCA179@TK5EX14MBXC104.redmond.corp.microsoft.com> , <EA9664FBEBEB7127550C3D30@[192.168.1.110]> <CAD7705D4A93814F97D3EF00790AF0B315FCB1CA@TK5EX14MBXC104.redmond.corp.microsoft.com>
X-Mailer: Mulberry/4.0.8 (Win32)
MIME-Version: 1.0
Content-Type: text/plain; charset="utf-8"
Content-Transfer-Encoding: quoted-printable
Content-Disposition: inline
Cc: ima@ietf.org
Subject: Re: [EAI] Rechartering
X-BeenThere: ima@ietf.org
X-Mailman-Version: 2.1.9
Precedence: list
List-Id: "EAI \(Email Address Internationalization\)" <ima.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/listinfo/ima>, <mailto:ima-request@ietf.org?subject=unsubscribe>
List-Archive: <http://www.ietf.org/mail-archive/web/ima>
List-Post: <mailto:ima@ietf.org>
List-Help: <mailto:ima-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/ima>, <mailto:ima-request@ietf.org?subject=subscribe>
X-List-Received-Date: Tue, 21 Jul 2009 18:25:14 -0000

--On Tuesday, 21 July, 2009 06:37 +0000 Shawn Steele
<Shawn.Steele@microsoft.com> wrote:

> I appreciate the concerns about moving too fast, and I don't
> disagree with your assessment of the risk.
> 
> My concern is the standardization of UTF-8 email in China.
> I'd rather that the IETF standards, including downgrade, were
> compatible with China's standards.  Otherwise we could end up
> with a de-facto standard of whatever China is (or isn't) doing
> with downgrade.

Well, I don't know how to go much further in speculation in that
direction.  There are people on the list, including the
co-Chair, who presumably know what is going on with Chinese
standardization efforts and they can speak up if they consider
that appropriate.

There are a number of areas in which we've discovered, in more
ways than I (at least) would have initially predicted, that
Chinese is different from other scripts.   Some of them, like
the effects of Han unification and SC/TC relationships, are
perhaps artifacts of Unicode coding decisions, others are more
basic.  It is more difficult in some ways, and easier in others.
Almost every time someone has extrapolated from CJK to other
scripts or in the other direction, they have gotten themselves
into trouble.  

That doesn't change your basic concern, but is an extra argument
why we should proceed with due caution and why an international
standard might be very different from one for use within China
for communication among those who are Chinese-literate.   I
think our Chinese colleagues have recognized that distinction
from the beginning and, again, can speak for themselves.

> I'm also concerned that the quirks of downgrade aren't going
> to be very discoverable in an laboratory setting :(.  

We share that concern.  It is why the "testing" I'm most anxious
to see involves turning EAI loose in the general Internet email
environment and seeing what happens.  That is exactly the kind
of testing Ernie has been trying to do.   While the specific
problems that have been identified can be fixed (some may
require revisiting documents to be sure we all really intend the
same thing), the trend is not encouraging even after only a
handful of tests.

> I don't
> foresee any problems that can't be corrected with the current
> approach to downgrade, however it's the unforeseen that's the
> problem.  At the leisurely pace it's been proceeding, I'm
> afraid that the industry won't wait for the working group,
> particularly if the Chinese standards proceed without the IETF
> WG standards.

While I'm actually anxious to move this along as quickly as we
reasonably can, at this point, I can only remind you and others
about the success rate of proprietary (and even national and
regional) email models and standards that do no interoperate
smoothly with the existing Internet base.  I'm sure you, or
those around you, remember both MSMail and the original,
incompatible with even X.400, Exchange Server model.  Harald
doesn't need to be reminded about X.400 itself, and that list
goes on and on.

> I also fear a Chinese EAI standard without downgrade more than
> a downgrade with quirks.  Also, hopefully, "downgrade" will
> eventually stop being used, so even if it's really bad, at
> least it should be limited :)

See above above "quirks".  And, whether "downgrade" is used or
not,  it makes  sufficient changes to the fundamental syntax of
Internet email addresses, and sufficient requirements on systems
processing headers, that it will be with us forever even if no
one is actually using it.  Conversely, it makes implementation
of i18n mail sufficiently more difficult that we just can't
estimate whether having it will speed deployment (by making it
slightly easier for EAI-compliant systems to communicate with
legacy ones) or slow it (by making implementation and deployment
of EAI-compliant systems much more difficult.   The issue with
implementation and deployment are pervasive and may extend into
things that the documents don't talk about.  For example, the
typical address book today maps a name onto one email address
(or maybe "home" and "work" email addresses).  EAI downgrading
essentially requires that address book support more than one
address with the same role.  Is that a big deal?  I don't know--
it will certainly depend on the address book design.  But it is
almost certainly going to require more work than simply making
provision for the existing fields to be in UTF-8 and more
instruction to the user about what is going on.   Part of that
discussion is closely related to one we are having in the
IDNABIS context, which is how much we really expect users to
know and adapt to in order to keep computer systems and networks
happy.  I don't know the answer, but, unlike a few years ago,  I
now suspect that alternate addresses in the protocol will put
more of a knowledge burden on users than just having messages
returned if non-ASCII address doesn't go through.  

As another example of a side effect of the downgrading
architecture, consider the huge number of web sites out there
that ask for email addresses.  We know that many of them,
probably a majority, can't even get a local part that contains a
"+"-delimited subaddress right.  Can they be converted to handle
UTF-8 local parts?  I think so, but, for some systems, it will
require more work and education than we probably anticipate.
But I personally think that the odds of getting a site that
can't manage "john+ietf@bogus.domain.name" to be able to handle 
  <иван@bogus.domain.name <john@bogus.domain.name>>
within my lifetime are pretty small -- the pointed brackets will
be rejected, the alternate address will confuse it beyond any
hope, etc.   On the other hand, I think a simple
   иван@bogus.domain.name
will work before I convince them to handle that "+".

> So IMO moving cautiously/slowly is also a risk, and it's
> rapidly becoming a larger risk than the proposed documents in
> their current state.  I'm aware of at least 2 other
> attempts/approaches to "international email" that were shot
> down/deferred to wait for a real EAI standard from the IETF.
> I doubt the users in those communities will wait forever.

Sure.  But see above.

I'm actually getting very sympathetic to the idea in Charles's
note of doing the rest and then letting downgrading sort itself
out later.  But, in practice, I think that dooms downgrading, at
least along the lines of the current specification.   Without
it, we get to simplify the syntax to make i18n address syntax
just traditional syntax with non-ASCII characters -- not only
does the alternate address syntax go away but so does the
requirement for brackets, etc.  Syntactically, SMTP is actually
pretty easy either way.  But, once you get the provisions for
downgrading out of the syntax of addresses in the mail headers
--and, in practical terms, the addresses that users see-- those
extensions are, in practice, probably gone for good: it won't be
easy to put them back in if systems don't even recognize the
syntax as valid.

But, to partially agree with Charles and say it differently, I
think one can have "quick" without Downgrade or somewhat more
slowly (a lot more slowly if the WG continues at its recent
pace) with it.   I don't have any idea how to choose but I am
determined that, if we have Downgrading, it will be "right" and
its interoperability and implementation characters well-tested
and understood.

    john


     john