Re: Subtag registration: Russian transliteration of Chinese

Mark Davis ☕️ <mark@macchiato.com> Tue, 13 October 2015 22:05 UTC

Return-Path: <mark.edward.davis@gmail.com>
X-Original-To: ietf-languages@alvestrand.no
Delivered-To: ietf-languages@alvestrand.no
Received: from localhost (localhost [127.0.0.1]) by mork.alvestrand.no (Postfix) with ESMTP id DFA397C0201 for <ietf-languages@alvestrand.no>; Wed, 14 Oct 2015 00:05:23 +0200 (CEST)
X-Virus-Scanned: Debian amavisd-new at alvestrand.no
Authentication-Results: mork.alvestrand.no (amavisd-new); dkim=pass (2048-bit key) header.d=gmail.com
Received: from mork.alvestrand.no ([127.0.0.1]) by localhost (mork.alvestrand.no [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id tqjEMJ_OsDtV for <ietf-languages@alvestrand.no>; Wed, 14 Oct 2015 00:05:22 +0200 (CEST)
X-Greylist: domain auto-whitelisted by SQLgrey-1.8.0
X-Greylist: domain auto-whitelisted by SQLgrey-1.8.0
Received: from pechora4.lax.icann.org (pechora4.icann.org [IPv6:2620:0:2d0:201::1:74]) by mork.alvestrand.no (Postfix) with ESMTPS id AD5047C00D4 for <ietf-languages@alvestrand.no>; Wed, 14 Oct 2015 00:05:21 +0200 (CEST)
Received: from mail-lf0-x229.google.com (mail-lf0-x229.google.com [IPv6:2a00:1450:4010:c07::229]) by pechora4.lax.icann.org (8.13.8/8.13.8) with ESMTP id t9DM4w10007000 (version=TLSv1/SSLv3 cipher=AES256-SHA bits=256 verify=FAIL) for <ietf-languages@iana.org>; Tue, 13 Oct 2015 22:05:19 GMT
Received: by lfeh64 with SMTP id h64so3028909lfe.3 for <ietf-languages@iana.org>; Tue, 13 Oct 2015 15:04:57 -0700 (PDT)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:sender:in-reply-to:references:from:date:message-id :subject:to:cc:content-type; bh=sWbVluMgK/VxwKJr9XYWNtenOFHqUgljdRIp7dgI0uA=; b=VEQ520izn/CeOg60ptXmsfVg6A6icz6ejgqG4mPOzId8FkumLgympXOGNE9606hAWS fdXRgw8fnp1HAz+PkXXzPVR5QVdetNHeRmx5LEYqqXRTWGQy0FwCo1PhGSl36HdCA2lg U33Q7plOwGgJ1UM5Cx+Qh3P4jyPuQ/rxnhCqMvruAtAH8h5SO1l1rgqfMJPkiGxZzHlg EQmBVg4S9Rj7iH38/MXY2rGV54/lAYx3d31NaRKbedBOv6LBG7fWdz3ha2w5wutp9+om 2xxaIv6NxQ9YtN6z+cvf4BRivkwhRwzZ/5DHC4Ze2seAQqpvv+0eTm+KhHFHOic+DBJI arPg==
X-Received: by 10.25.24.162 with SMTP id 34mr1725286lfy.84.1444760491891; Tue, 13 Oct 2015 11:21:31 -0700 (PDT)
MIME-Version: 1.0
Sender: mark.edward.davis@gmail.com
Received: by 10.25.196.13 with HTTP; Tue, 13 Oct 2015 11:21:12 -0700 (PDT)
In-Reply-To: <C5BA9550-B22D-4C2E-9587-56D80AE8ACA8@evertype.com>
References: <20151013094724.665a7a7059d7ee80bb4d670165c8327d.14960e68d0.wbe@email03.secureserver.net> <C5BA9550-B22D-4C2E-9587-56D80AE8ACA8@evertype.com>
From: Mark Davis ☕️ <mark@macchiato.com>
Date: Tue, 13 Oct 2015 20:21:12 +0200
X-Google-Sender-Auth: op-4oscO_qCZNJkl7ZL6tiXY4Tk
Message-ID: <CAJ2xs_GQfrLgLNYHFUk4PjfAbC88VsiXZbfqN5bEA2Yztj=fYA@mail.gmail.com>
Subject: Re: Subtag registration: Russian transliteration of Chinese
To: Michael Everson <everson@evertype.com>
Content-Type: multipart/alternative; boundary="001a11401988e7a2c305220083bc"
X-Greylist: Delayed for 00:43:44 by milter-greylist-4.0 (pechora4.lax.icann.org [IPv6:2620:0:2d0:201::1:74]); Tue, 13 Oct 2015 22:05:20 +0000 (UTC)
Cc: ietflang IETF Languages Discussion <ietf-languages@iana.org>
X-BeenThere: ietf-languages@alvestrand.no
X-Mailman-Version: 2.1.16
Precedence: list
List-Id: IETF Language tag discussions <ietf-languages.alvestrand.no>
List-Unsubscribe: <http://www.alvestrand.no/mailman/options/ietf-languages>, <mailto:ietf-languages-request@alvestrand.no?subject=unsubscribe>
List-Archive: <http://www.alvestrand.no/pipermail/ietf-languages/>
List-Post: <mailto:ietf-languages@alvestrand.no>
List-Help: <mailto:ietf-languages-request@alvestrand.no?subject=help>
List-Subscribe: <http://www.alvestrand.no/mailman/listinfo/ietf-languages>, <mailto:ietf-languages-request@alvestrand.no?subject=subscribe>
X-List-Received-Date: Tue, 13 Oct 2015 22:05:24 -0000

> “Eto obrezets prigovor” is in no way English.

I agree. It's a bit of a gray area, because "Gorbachev" is definitely an
English word, though originally Russian, just as "Athens" is an English
word, though originally Greek.

We choose to use the direction we did because the primary use case for
transformations is for place names and person names, and because it also
works better for script transforms: und-Cyrl-t-und-Latn is marking text
that is *in* Cyrillic, though originally from English.

Anyway, that's water under the bridge.

Mark

On Tue, Oct 13, 2015 at 6:56 PM, Michael Everson <everson@evertype.com>
wrote:

> Isn’t what he’s talking about just a Cyrillicization, like Pinyin is a
> Romanization?
>
> > On 13 Oct 2015, at 12:47, Doug Ewell <doug@ewellic.org> wrote:
> >
> > Yegor Grebnev wrote:
> >
> >> Sorry, it seems that I have misunderstood Doug's original
> >> recommendation. Yes, "Chinese transcribed as Russian" seems to be an
> >> appropriate option. Thank you!
> >
> > So it sounds like Yegor does expect the transformed content to continue
> > to be identified primarily as Chinese, not as Russian, where "ru-t-zh"
> > or "ru-anything" might imply the latter.
> >
> > So, I know RFC 6497 has been around for three and a half years, but I
> > guess I'm still a little puzzled by the idea that content in language A,
> > transliterated according to the orthographic conventions of language B,
> > should result in that content being labeled "language B" according to
> > the primary language subtag.
> >
> > Consider:
> >
> > (English)
> > This is a sample sentence.
> >
> > (Russian)
> > Это образец приговор.
> >
> > (Russian, transliterated according to some English-specific convention)
> > Eto obrazets prigovor.
> >
> > These snippets would be tagged, respectively, as "en", "ru", and
> > (according to 6497) "en-t-ru". Is this right?
>
> That can’t be right. “Eto obrezets prigovor” is in no way English.
>
> Michael
> _______________________________________________
> Ietf-languages mailing list
> Ietf-languages@alvestrand.no
> http://www.alvestrand.no/mailman/listinfo/ietf-languages
>