Re: [Ltru] Minor proofreading nits again

Mark
*— Il meglio è l’inimico del bene —*

On Mon, Jul 18, 2011 at 02:39, Jukka K. Korpela <jkorpela@cs.tut.fi> wrote:

> 18.07.2011 11:57, "Martin J. Dürst" wrote:
>
>> There are certainly cases where there's more than the source and target
>> language and script involved. But on the other hand, there are also
>> cases where there's not really a target language.
>>
>
> Yes; I was writing about what translation _may_ depend on. Now that I read
> the sentence “That is, for fully specifying such content, it is important to
> specify the source language and/or script,” I realize that it doesn’t say
> “may.” In fact, it’s somewhat odd—as the source language of transliterated
> or otherwise transformed text is supposed to be indicated using existing
> methods for identifying a language. When you use, say, the tag ru-Latn, you
> are saying that the text is in Russian, and there is no need for
> additionally specifying “source language.”
>
> I’d suggest that the sentence and the sentence after it in the Introduction
> be changed thusly:
>
> “In order to fully specify such content, the transformation needs to be
> specified in addition to the language. This may require the identification
> of the source script, the target script, and the specific transformation.”

I changed the working copy to the following. I reworded it a bit, because the
BCP 47 tags already supply the target language.

   In order to fully specify such content, the transformation needs to be
specified in addition to the language.

   This may require the identification of the source script or source
language, in addition to the main subtags in the language tag.

   It may also require the identification of the specific conventions used
by the transformation, such as the rules used by a UNGEGN transliteration.

How does that look?

>
>> An example would be what can currently be denoted by ja-Latn-hepburn. My
>> understanding is that such cases are also supposed to be covered by -t.
>> How would such cases look? How much more time and effort (than for a
>> variant subtag) would be required for registration?
>>
>
> (I assume that you mean “jp,” not “ja.”)
> As far as I can see, jp-Latin-hepburn as such is unambiguous, because the
> Hepburn system does not depend on “target” language (or language context, as
> I would say).

Agreed. For those mechanisms that are only used with a specific source
script, the -t- extension is not needed.

Note: The correct code would be "ja-Latn-hepburn", but that doesn't affect
your main point.

Type: variant
Subtag: hepburn
Description: Hepburn romanization
Added: 2009-10-01
Prefix: ja-Latn
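
As a rough illustration of why "ja-Latn-hepburn" fits this record and
"jp-Latin-hepburn" does not, here is a minimal Python sketch. The helper
names parse_record and variant_fits are hypothetical, and the prefix check
is a simplification of the registry rules, not a full BCP 47 validator.

```python
# The registry record quoted above, as "Field: value" lines.
RECORD = """\
Type: variant
Subtag: hepburn
Description: Hepburn romanization
Added: 2009-10-01
Prefix: ja-Latn
"""


def parse_record(text):
    """Parse 'Field: value' lines into a dict of lists (fields may repeat)."""
    fields = {}
    for line in text.splitlines():
        if not line.strip():
            continue
        name, _, value = line.partition(":")
        fields.setdefault(name.strip(), []).append(value.strip())
    return fields


def variant_fits(tag, record):
    """Return True if the record's variant appears in the tag after one of
    the record's Prefix values (simplified, case-insensitive comparison)."""
    subtags = tag.lower().split("-")
    variant = record["Subtag"][0].lower()
    if variant not in subtags:
        return False
    prefixes = record.get("Prefix", [])
    if not prefixes:
        # Assumption: a variant with no Prefix is not tied to any language.
        return True
    before = subtags[:subtags.index(variant)]
    for prefix in prefixes:
        parts = prefix.lower().split("-")
        if before[:len(parts)] == parts:
            return True
    return False


record = parse_record(RECORD)
print(variant_fits("ja-Latn-hepburn", record))
print(variant_fits("jp-Latin-hepburn", record))
```

The second call fails the check because neither "jp" nor "Latin" matches
the registered prefix "ja-Latn".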

> But in different countries, some modifications may be in use, or may have
> been in use.
>
> This raises an issue that doesn’t really fall under “minor proofreading
> nits” (sorry!). What does a subtag like “hepburn” really mean? A very
> specific system, or system with known variants, or loosely a set of systems
> that share some common properties? I think we need to be inclined toward a
> loose meaning, one that can be further clarified using additional subtags.
> This would imply that you cannot be absolutely sure that a particular
> character in a text labelled as jp-Latin-hepburn can be unambiguously
> interpreted—you may need to look at possible additional subtags or to assume
> that some default variant of Hepburn is used.
>

Agreed. That is the whole design philosophy of BCP 47: additional subtags
can be used to get a higher degree of specificity where the more specific
information is known or needed. That is why we allow the mechanism to have
multiple subtags, so that a greater or lesser degree of specificity can be
used.
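
To make the "greater or lesser degree of specificity" point concrete, here
is a small Python sketch that lists progressively less specific forms of a
tag by dropping trailing subtags, in the spirit of the lookup fallback in
RFC 4647. The function name fallback_chain is hypothetical, and the sketch
ignores private-use and grandfathered tags.

```python
def fallback_chain(tag):
    """Return the tag followed by progressively less specific forms."""
    subtags = tag.split("-")
    chain = []
    while subtags:
        chain.append("-".join(subtags))
        subtags.pop()
        # A dangling single-character subtag (such as the "t" that
        # introduces an extension) carries no meaning alone, so drop it.
        if subtags and len(subtags[-1]) == 1:
            subtags.pop()
    return chain


print(fallback_chain("ja-Latn-hepburn"))
```

A consumer that does not care which Hepburn variant was used can stop at
"ja-Latn", while one that needs more detail can look at the full tag.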

>
> I’m not aware of specifically language-dependent variants of Hepburn, for
> example, but I know that in Finnish, a national variant (e.g., with “š”
> instead of “sh”) has been recommended and used, though nowadays the global
> variant is more common. When the differences matter and need to be
> indicated, a particular named variant is needed, rather than destination
> language specifier.
>
> --
> Yucca, http://www.cs.tut.fi/~jkorpela/