Re: [Ltru] Review of 4646bis-10, macrolanguages in section 4.1

"Mark Davis" <mark.davis@icu-project.org> Fri, 07 December 2007 22:54 UTC

Return-path: <ltru-bounces@ietf.org>
Received: from [127.0.0.1] (helo=stiedprmman1.va.neustar.com) by megatron.ietf.org with esmtp (Exim 4.43) id 1J0m5P-0001yv-4C; Fri, 07 Dec 2007 17:54:23 -0500
Received: from ltru by megatron.ietf.org with local (Exim 4.43) id 1J0m5O-0001yo-8V for ltru-confirm+ok@megatron.ietf.org; Fri, 07 Dec 2007 17:54:22 -0500
Received: from [10.90.34.44] (helo=chiedprmail1.ietf.org) by megatron.ietf.org with esmtp (Exim 4.43) id 1J0m5N-0001ye-S6 for ltru@ietf.org; Fri, 07 Dec 2007 17:54:21 -0500
Received: from wa-out-1112.google.com ([209.85.146.179]) by chiedprmail1.ietf.org with esmtp (Exim 4.43) id 1J0m5M-0003qa-NQ for ltru@ietf.org; Fri, 07 Dec 2007 17:54:21 -0500
Received: by wa-out-1112.google.com with SMTP id k40so2427393wah for <ltru@ietf.org>; Fri, 07 Dec 2007 14:54:20 -0800 (PST)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:received:received:message-id:date:from:sender:to:subject:cc:in-reply-to:mime-version:content-type:references:x-google-sender-auth; bh=xROFqdT6+pt8KqztNPDHaVFvxbyZs7Z/9VHMYT0T0oA=; b=feaqlgfYUQdAvJWdDwj1wZ4BQnwE1VbBWo1OKjCmwMzY/xWqLI7LGcb+gj4KFhkXfURPfTmFM7yNxwQytKco9Jz/vXBrcoo2PjJKcOAIlR3wSP6T448wWaerk1gz7+dFX9MrKWvcr2xQlO8kKWH+w/6eXip+bTKGjzCKjVQr+gc=
DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=message-id:date:from:sender:to:subject:cc:in-reply-to:mime-version:content-type:references:x-google-sender-auth; b=Ev9DUwvLcC0dq7P5ixMG3eICqMj2ZwzPI1CXyKGPJKLD8GpeuW2ShNwigoSvhXzvQqAQIphgKZYG/KK7+UmWb7uBTO23idj+1LNevtopkvWFTUNxqZDEEJvV5wPXpLa77okKSMu19xLGdoEM93HBdGZjLdks/JME38VK3cHgOEQ=
Received: by 10.115.93.16 with SMTP id v16mr3731889wal.1197068059800; Fri, 07 Dec 2007 14:54:19 -0800 (PST)
Received: by 10.114.192.9 with HTTP; Fri, 7 Dec 2007 14:54:19 -0800 (PST)
Message-ID: <30b660a20712071454p26420ea9qda3f5600ae323dcc@mail.gmail.com>
Date: Fri, 07 Dec 2007 14:54:19 -0800
From: Mark Davis <mark.davis@icu-project.org>
To: John Cowan <cowan@ccil.org>
Subject: Re: [Ltru] Review of 4646bis-10, macrolanguages in section 4.1
In-Reply-To: <20071207215557.GD3346@mercury.ccil.org>
MIME-Version: 1.0
References: <20071207215557.GD3346@mercury.ccil.org>
X-Google-Sender-Auth: 71b6abc22048a3a1
X-Spam-Score: 0.0 (/)
X-Scan-Signature: 37af5f8fbf6f013c5b771388e24b09e7
Cc: ltru@ietf.org
X-BeenThere: ltru@ietf.org
X-Mailman-Version: 2.1.5
Precedence: list
List-Id: Language Tag Registry Update working group discussion list <ltru.ietf.org>
List-Unsubscribe: <https://www1.ietf.org/mailman/listinfo/ltru>, <mailto:ltru-request@ietf.org?subject=unsubscribe>
List-Archive: <http://www1.ietf.org/pipermail/ltru>
List-Post: <mailto:ltru@ietf.org>
List-Help: <mailto:ltru-request@ietf.org?subject=help>
List-Subscribe: <https://www1.ietf.org/mailman/listinfo/ltru>, <mailto:ltru-request@ietf.org?subject=subscribe>
Content-Type: multipart/mixed; boundary="===============0224816859=="
Errors-To: ltru-bounces@ietf.org

LGTM

On Dec 7, 2007 1:55 PM, John Cowan <cowan@ccil.org> wrote:

> Because of the thicket of rewordings in this part, I'm just presenting
> my suggested revised text here.  It's very important to make sure
> that we don't talk about "dialects" or "sub-languages" here.  Also,
> I've used "Macrolanguage" for the header only, but "macrolanguage"
> for the languages.
>
> The affected text begins "Languages with a Macrolanguage field" and ends
> "did not specify zh-Hans-CN in their request.)".
>
>        Some of the languages in the registry are labeled
>        "macrolanguages" by ISO 639-3, which defines the term as
>        "clusters of closely-related language varieties that [...] can
>        be considered distinct individual languages, yet in certain
>        usage contexts a single language identity for all is needed".
>        These correspond to codes registered in ISO 639-2 as single
>        languages that were found to correspond to more than one language
>        in ISO 639-3.  The languages encompassed by a macrolanguage
>        contain a Macrolanguage header in the registry; the macrolanguages
>        themselves are not specially marked.
>
>        It is always permitted, and sometimes useful, to tag an
>        encompassed language using the subtag for its macrolanguage.
>        However, the Macrolanguage field doesn't define what the
>        relationship is between the encompassed language and its
>        macrolanguage, nor does it define how languages encompassed by the
>        same macrolanguage are related to each other.  In some cases, In
>        some cases, one of the encompassed languages serves as a standard
>        form for the entire macrolanguage and is frequently identified
>        with it; in other cases there is no dominant language, and the
>        macrolanguage simply serves as a cover term for the entire group.
>
>        Applications MAY use macrolanguage information to improve matching
>        or language negotiation.  For example, the information that 'sr'
>        (Serbian) and 'hr' (Croatian) share a macrolanguage expresses
>        a closer relation between those languages than between, say,
>        'sr' (Serbian) and 'ma' (Macedonian).  It is valid to use the
>        subtag of the encompassed language or of the macrolanguage to
>        form language tags.  However, many matching applications will
>        not be aware of the relationship between the languages.  Care in
>        selecting which subtags are used is crucial to interoperability.
>
>        In general, use the most specific tag.  However, where the
>        macrolanguage tag has been historically used to denote a dominant
>        encompassed language, it SHOULD be used in place of the subtag
>        specific to that encompassed language unless it is necessary
>        to clearly distinguish the macrolanguage as a whole from the
>        dominant language variety.
>
>        In particular, the Chinese family of languages call for special
>        consideration.  Because the written form is very similar for most
>        languages having 'zh' as a macrolanguage (and because historically
>        subtags for the various encompassed languages were not available),
>        languages such as 'yue' (Cantonese) have historically used
>        either 'zh' or a tag (now grandfathered) beginning with 'zh'.
>        This means that macrolanguage information can be usefully
>        applied when searching for content or when providing fallbacks
>        in language negotiation.  For example, the information that 'yue'
>        has a macrolangauge of 'zh' could be used in the Lookup algorithm
>        to fallback from a request for "yue-Hans-CN" to "zh-Hans-CN"
>        without losing the script and region information (even though
>        the user did not specify "zh-Hans-CN" in their request).
>
> --
> John Cowan              cowan@ccil.org          http://www.ccil.org/~cowan<http://www.ccil.org/%7Ecowan>
> Any day you get all five woodpeckers is a good day.  --Elliotte Rusty
> Harold
>
>
> _______________________________________________
> Ltru mailing list
> Ltru@ietf.org
> https://www1.ietf.org/mailman/listinfo/ltru
>



-- 
Mark
_______________________________________________
Ltru mailing list
Ltru@ietf.org
https://www1.ietf.org/mailman/listinfo/ltru