Re: [Slim] draft-tomkinson comments

Nik Tomkinson <rfc.nik.tomkinson@gmail.com> Tue, 27 October 2015 09:52 UTC

Return-Path: <rfc.nik.tomkinson@gmail.com>
X-Original-To: slim@ietfa.amsl.com
Delivered-To: slim@ietfa.amsl.com
Received: from localhost (ietfa.amsl.com [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id BDCA81A6F6D for <slim@ietfa.amsl.com>; Tue, 27 Oct 2015 02:52:03 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -1.999
X-Spam-Level:
X-Spam-Status: No, score=-1.999 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, FREEMAIL_FROM=0.001, HTML_MESSAGE=0.001, SPF_PASS=-0.001] autolearn=ham
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id pCdicYTrFFKT for <slim@ietfa.amsl.com>; Tue, 27 Oct 2015 02:52:00 -0700 (PDT)
Received: from mail-lb0-x22a.google.com (mail-lb0-x22a.google.com [IPv6:2a00:1450:4010:c04::22a]) (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 0C8DF1A6F66 for <slim@ietf.org>; Tue, 27 Oct 2015 02:52:00 -0700 (PDT)
Received: by lbbwb3 with SMTP id wb3so59187132lbb.1 for <slim@ietf.org>; Tue, 27 Oct 2015 02:51:58 -0700 (PDT)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :cc:content-type; bh=f3W3mJx6pRyVBD7wrxrOUIHaVGHjeNlf+ezRSHCy300=; b=a5bs649k5H9YKrQOvpgx+Yk6A4s3Z1bJBmf1QURB0HVLxbSnEpxdUwcyL33yNqI7Jv aLJlECjjQpT7MfRZIFG2o817eCtvHKWh9NK4uc0H99K5CUztP3BTmQmwY6lxyf4QDAKq WsypBUJ+UqqeJbSrFF7Ym1Zb8CWMYK4t6iXs206MJinVDFYovFvgnq0FE5VQUDQHzWBG vXKPYJfpVjPNEPFd+aoaaF0mG4w9/hYrQF90gDul6th648O3ndyp8cOzXfXmoKXqZTQC yS6JXGB8MTUjFBG7fTaT99dBz4vefotwsVIzxjEEhnpo5bUQufrwQeXQozYiA7sys8VI wUUA==
MIME-Version: 1.0
X-Received: by 10.112.157.195 with SMTP id wo3mr19686547lbb.0.1445939518176; Tue, 27 Oct 2015 02:51:58 -0700 (PDT)
Received: by 10.112.13.72 with HTTP; Tue, 27 Oct 2015 02:51:58 -0700 (PDT)
In-Reply-To: <9d9f51af87d94c8caf26b1f3801c69d4@EX13D08UWC002.ant.amazon.com>
References: <9d9f51af87d94c8caf26b1f3801c69d4@EX13D08UWC002.ant.amazon.com>
Date: Tue, 27 Oct 2015 09:51:58 +0000
Message-ID: <CAK5rQdwFF9t2Wu5GacJ2N_KNi2eXzqCBqAZgvxQ8x4F6pB6c5A@mail.gmail.com>
From: Nik Tomkinson <rfc.nik.tomkinson@gmail.com>
To: "Phillips, Addison" <addison@lab126.com>
Content-Type: multipart/alternative; boundary="001a11c3274a5901510523130774"
Archived-At: <http://mailarchive.ietf.org/arch/msg/slim/AiHLqrjtpbk_JJoGZUSg07v5r08>
Cc: "slim@ietf.org" <slim@ietf.org>
Subject: Re: [Slim] draft-tomkinson comments
X-BeenThere: slim@ietf.org
X-Mailman-Version: 2.1.15
Precedence: list
List-Id: Selection of Language for Internet Media <slim.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/slim>, <mailto:slim-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/slim/>
List-Post: <mailto:slim@ietf.org>
List-Help: <mailto:slim-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/slim>, <mailto:slim-request@ietf.org?subject=subscribe>
X-List-Received-Date: Tue, 27 Oct 2015 09:52:03 -0000

Thanks very much for the extensive comments. I am working through them now
and should have responses for you in the next day or so.

Nik.

On 21 October 2015 at 03:10, Phillips, Addison <addison@lab126.com> wrote:

> Hi,
>
> As promised, a number of comments on the current draft-tomkinson follow my
> .sig. Comments are labeled by section in the document.
>
> Addison
>
> Addison Phillips
> Principal SDE, I18N Architect (Amazon)
> Chair (W3C I18N WG)
>
> Internationalization is not a feature.
> It is an architecture.
>
> =====
>
>
> Abstract: The term "language code" should be replaced with the term
> "language tag". It's helpful to be consistent to avoid confusing with ISO
> standard elements (which are generally referred to as 'codes') and to adopt
> the same terminology as BCP 47.
>
> 3.1 Health warning: the explanatory section may need to use different
> scripts to handle all of the languages. This suggests that the explanatory
> block should use the UTF-8 encoding. A health warning about character
> encodings and the recommendation for using UTF-8 would be useful if
> included here?
>
> 3.1 "language (or locale)": the term locale is unnecessary here, unless
> you want to explain that the language might be derived from the end-user's
> locale. Should there be a terminology section? W3C is working on a document
> that defines some of these, but it is not mature enough to serve as a
> reference (http://www.w3.org/TR/ltli/). Unicode TR 35 talks about the
> same issue: that languages and locales are fundamentally related and that
> you can, in many cases (such as this one) ignore the locale concept.
>
> 3.2 "language message parts are translations of the same message" This
> presumes that they are translations. They might not be exact translations
> and might not be exactly the same. While the *intention* of the document is
> that they are translations, there is no guarantee of such. Perhaps "...are
> usually translations..."
>
> 3.2 The recommendation to apply a 'Name'  parameter kind of presumes that
> the name is meaningful in ASCII, since this is an header field and QP or
> MIME encoding won't be anything but a bunch of gibberish. Rendering into
> ASCII or into English is not very international friendly and having the
> example be 'english.txt' is ignoring the problem?
>
> 3.3 The unmatched message part seems problematic to me. It requires the
> sender to provide a version of the message (increasing message size--some
> systems have size limits!) that will be shown when language negotiation
> fails, rather than allowing one of the already extant sections to be
> selected as the default. There is already a preamble section for cases that
> multipart/multilingual is not supported. And the unmatched message
> (probably falsely) affects a pose of having "no language". Why not allow
> one of the language bearing sections be tagged as the "default" message? If
> a sender wishes to make the "default" (unmatched) message be language
> neutral somehow, they can tag it using a tag provided for that purpose,
> such as zxx or und.
>
> Alternatively, why not use the Preface section? If the end user wants to
> read the message, presumably they will need to choose one of the languages
> to do it in.
>
> 4. This section is introduced:
>
> --
>    The logic for selecting the message part to render and present to the
>    recipient is quite straightforward and is summarised in the next few
>    paragraphs.
> --
>
> It seems hyperbolic to say "quite straightforward". Eliminate that phrase?
>
> 4. The section says:
>
> --
> ...select the best match for
>    the user's preferred language from the language message parts
>    available
> --
>
> But describes that as:
>
> --
> The selection of language part may be implemented in a
>    variety of ways and is a matter for the email client and its user
>    preferences.
> --
>
> Isn't this the nub of the problem? I know that, for example, Mark Davis
> suggests that some clients may wish to use more complex or helpful matching
> schemes than those defined in BCP 47 (which are pretty simplistic). We
> don't have to forbid something more elegant. But fundamental matching is
> improved if there are some base recommendations.
>
>
> 4. There is a note about the subject line:
>
> --
> Similarly, the subject to display (for example in a
>    message listing) should be chosen from the selected language message
>    part if it is available.
> --
>
> And slightly later:
>
> --
> The top-level Subject header field value should be used
>    whenever a suitable translation cannot be identified.
> --
>
> These requirements should be pulled together and made consistent. In
> particular, the message needs to have a Subject even if the "selected
> translations" does not.
>
> 5. There is this recommendation:
>
> --
> While RFC 5646 provides a mechanism accommodating
>    increasingly fine-grained distinctions, in the interest of maximum
>    interoperability, each Content-Language value SHOULD be restricted to
>    the largest granularity of language tags; in other words, it is
>    RECOMMENDED to specify only a Primary-subtag and NOT to include
>    subtags (e.g., for region or dialect) unless the languages might be
>    mutually incomprehensible without them.
> --
>
> I think this text is potentially harmful, as it might interfere with the
> needs of senders and clients to distinguish text based on a variety of
> factors and ignores the fact that some language variation is not mutually
> comprehensible at the primary language level.
>
> An implementation restriction to the primary language subtag would be a
> disadvantage to languages that have important script, region, or other
> variations such as extlang or variant. BCP 47 has a whole section (
> http://tools.ietf.org/html/bcp47#section-4.1) about the choice of
> language tag that basically recommends similar things and I understand why
> it's important for senders not to add unnecessary subtags. But I think
> you'd be better off not having quite so much normative language in this
> specification and a more focused recommendation that does not impinge on
> the needs of senders and clients to provide adequate language information.
>
> 7. ISO 8859-1? How about using the UTF-8 encoding instead?
>
> Should the document point out here that each example Subject field is in a
> separate multipart/multilingual block?
>
>
> _______________________________________________
> SLIM mailing list
> SLIM@ietf.org
> https://www.ietf.org/mailman/listinfo/slim
>



-- 

-----------------------------------------------------------------
Multiple Language Content Type Internet Draft:

<http://datatracker.ietf.org/doc/draft-tomkinson-multilangcontent/>
http://datatracker.ietf.org/doc/draft-tomkinson-slim-multilangcontent/

-----------------------------------------------------------------