Re: [Slim] Moving forward on draft-ietf-slim-negotiating-human-language

Bernard Aboba <bernard.aboba@gmail.com> Sun, 19 November 2017 05:25 UTC

Return-Path: <bernard.aboba@gmail.com>
X-Original-To: slim@ietfa.amsl.com
Delivered-To: slim@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 10FD0126BF3 for <slim@ietfa.amsl.com>; Sat, 18 Nov 2017 21:25:18 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -2.699
X-Spam-Level:
X-Spam-Status: No, score=-2.699 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, FREEMAIL_FROM=0.001, HTML_MESSAGE=0.001, RCVD_IN_DNSWL_LOW=-0.7, SPF_PASS=-0.001] autolearn=ham autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (2048-bit key) header.d=gmail.com
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id w85_2m5SlQM2 for <slim@ietfa.amsl.com>; Sat, 18 Nov 2017 21:25:15 -0800 (PST)
Received: from mail-vk0-x22c.google.com (mail-vk0-x22c.google.com [IPv6:2607:f8b0:400c:c05::22c]) (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 8F244124D37 for <slim@ietf.org>; Sat, 18 Nov 2017 21:25:15 -0800 (PST)
Received: by mail-vk0-x22c.google.com with SMTP id g11so3848392vkd.13 for <slim@ietf.org>; Sat, 18 Nov 2017 21:25:15 -0800 (PST)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=mime-version:in-reply-to:references:from:date:message-id:subject:to :cc; bh=HDS6FKDiXgohsKRFoRkLf7Wfc/MTxUABa4+nZE3KRRM=; b=Q7PUHqS7833Yu2nlSv7PMbrURamRlfbbAXAQPXjbnX6QLFbZ7geQUr/qhkiyiT3ygY 9uUzJsHa0AWohof86idWkyF5GY+lhA3tXh9aWtSyLkIro3rNt6galqJUzT7y6u254hA8 TGMET4VwHxMdItv0AJN0W9XLQugmjLXW2nw7NqbWbDEg4Bs8iVife8EkZIHfQr1eUMUQ klYyT3mSRhIy81dC3Ucrh6WOXO9PUHjU/GP6co0s13WXd6591fzwFTuRGDLGyoMh2EVQ gWNGlNNimiRUhFdjNAw3oL2vZCfd4jZlVCnGCUZRYKA0Ap5mIyTWsXZYVjh/BWcqpUAC sYtw==
X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:in-reply-to:references:from:date :message-id:subject:to:cc; bh=HDS6FKDiXgohsKRFoRkLf7Wfc/MTxUABa4+nZE3KRRM=; b=ME5Shm29Hm4GultqvMP5evKduNaWjQuBvVdclLBRN8ETJbWzTLvM0b2k7jfdv+7kD2 hthepEvmCnvOp6CqD6DaCFoE7SYgrK2mJ8glxrIZKyJ83TRAg/+e34bzZW4trsCFCtUS dT+5g0RgXlThPXkhDAceP+1yCaWwOJRv3KvWL1lRprfLPhbJkJdi9jP9+TakvFghGn45 LaUyL6bxFP0FI0ZrMfUgYJ8m7YUR7H0JhofkiYDcImeHGkJsoVUqf9ZbU7lYUdUeVyVX 3nIxsEr4tcTJHUjzPCJVh8tA7Jzt5o2c9QFm7aDSTn8KYemdH3faWJF3ZxNKBJ/nFre5 ASeg==
X-Gm-Message-State: AJaThX762xo9d48UvFzQ+4f6NjmiWcoV6TFC0Pb+LZnWEu3hxb+hA0n6 RTx8epDTTfJmV1XAwmfal6Y42Slte4QA3L86UGXXabDb
X-Google-Smtp-Source: AGs4zMZxZyMupASFl66XsKlo43IgrMciZAuhFEhB98YKT7stNQViskFidJ4FDqGdvRf/rpP0L/9LQJg9TGMazRT3n3A=
X-Received: by 10.31.5.66 with SMTP id 63mr7764814vkf.15.1511069114238; Sat, 18 Nov 2017 21:25:14 -0800 (PST)
MIME-Version: 1.0
Received: by 10.159.54.230 with HTTP; Sat, 18 Nov 2017 21:24:53 -0800 (PST)
In-Reply-To: <6ebf2b8a-8699-27c1-87af-41acab4cb940@omnitor.se>
References: <CAOW+2dsZtuciPiKMfif=ZmUqBcUd9TyYtL5gPYDp7ZfLOHHDBA@mail.gmail.com> <6ebf2b8a-8699-27c1-87af-41acab4cb940@omnitor.se>
From: Bernard Aboba <bernard.aboba@gmail.com>
Date: Sat, 18 Nov 2017 21:24:53 -0800
Message-ID: <CAOW+2duq9qkXBy8S+a_GSpmPwypMGLfYL3V9ZZfkrDraSA+S1w@mail.gmail.com>
To: Gunnar Hellström <gunnar.hellstrom@omnitor.se>
Cc: slim@ietf.org
Content-Type: multipart/alternative; boundary="001a1143dd5ac90e12055e4f31fa"
Archived-At: <https://mailarchive.ietf.org/arch/msg/slim/w2SMbne9cCrNKVLmlyu7bZQOSEI>
Subject: Re: [Slim] Moving forward on draft-ietf-slim-negotiating-human-language
X-BeenThere: slim@ietf.org
X-Mailman-Version: 2.1.22
Precedence: list
List-Id: Selection of Language for Internet Media <slim.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/slim>, <mailto:slim-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/slim/>
List-Post: <mailto:slim@ietf.org>
List-Help: <mailto:slim-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/slim>, <mailto:slim-request@ietf.org?subject=subscribe>
X-List-Received-Date: Sun, 19 Nov 2017 05:25:18 -0000

Gunnar said:

"I earlier thought that an application needed to look into the language
subtag Description to find the word "sign" there in the text string. That
is not a good solution."

[BA] Agreed. Also, as indicated in RFC 5646 Section 4.1.2 and the IANA
registry (
https://www.iana.org/assignments/language-subtag-registry/language-subtag-registry),
sign languages may not have always have a subtag of 'sgn':

   Sign languages share a mode of communication rather than a linguistic
   heritage.  There are many sign languages that have developed
   independently, and the subtag 'sgn' indicates only the presence of a
   sign language.  A number of sign languages also had grandfathered
   tags registered for them during the RFC 3066
<https://tools.ietf.org/html/rfc3066> era.  For example, the
   grandfathered tag "sgn-US" was registered to represent 'American Sign
   Language' specifically, without reference to the United States.  This
   is still valid, but deprecated: a document in American Sign Language
   can be labeled either "ase" or "sgn-ase" (the 'ase' subtag is for the
   language called 'American Sign Language').


Gunnar also said:

"A specific sign language can be identified by its existence in the IANA
registry of language subtags according to BCP 47 [RFC5646] , and finding
that the language subtag is found at least in two entries in the
registry, once with the Type field "language" and once with the Type
field "extlang" combined with the Prefix field value "sgn".


So that should be the response on Dales request to easily decide if a
language tag is for a sign language. "

[BA]  Looking at the IANA registry, having a Type field "extlang" combined
with the Prefix field value 'sgn' seems to be used as an indicator of a
sign language.  Do you think we can rely on this? Currently this is only a
SHOULD  in RFC 5646 Section 3.4:

            3.  Sign languages SHOULD have an 'extlang' record with
an'Prefix' of 'sgn'.



"My wording proposal starts with the obvious cases: a non-signed
language tag in audio media is spoken, and a non-signed language tag
in text media is written."


[BA] Assuming your suggested approach allows us to reliably determine
non-sign languages, this seems solid.


Gunnar further said:


"But for other cases, like in video or message or application or
multiplexed media, other indications must be used to

understand the intended modality... I wish for the ambiguous cases we
could use the script subtag -zxxx to indicate

spoken modality and a real script subtag..."


[BA] This is where I become uneasy, because without an explicit
mechanism such as script subtags, there is the potential for
ambiguity.

Trying to address reduce that ambiguity via heuristics could turn out
to be a bad idea, compared with proceeding more cautiously

by leaving behavior undefined for now and revisiting the situation
later when we understand the problem better. For example:


"Use for sending of a visual view of a speaking person may be
indicated by the value "speaker" in an SDP
Content attribute according to RFC 4796 [RFC4796] in a "video" media
stream or another media carrying video (e.g. "message" or
"application")."

[BA] There are quite a few potential corner cases here. For example,
if "en-US" language is included in an offer within a video m-line,
should the answerer assume this implies a willingness to lip read US
English if the value "speaker" is in the Content attribute?  What can
be assumed if a value other than "speaker" is in the Content
attribute? Might that represent something entirely different, such as
the desire to receive captioning in US English? If so, how could the
Offerer indicate both the capability of lipreading and the ability to
handle captions? And what happens if the Answerer doesn't mimic the
Content attribute in the Offer? Seems like there are some potential
"gotchas" here.

"Use of written modality in another media stream than

"text", may be discriminated by use of a script subtag in the language
tag, where that is appropriate."

[BA] What if the language in question has script subtags suppressed?
What if the Offerer includes a script subtag in the video m-line but
also "speaker" in the Content attribute? Again, there could be quite a
few corner cases lurking here.










On Sat, Nov 18, 2017 at 2:33 PM, Gunnar Hellström <
gunnar.hellstrom@omnitor.se> wrote:

> Thanks Bernard for pushing for closing the last open issue.
> Den 2017-11-18 kl. 19:46, skrev Bernard Aboba:
>
> At this point, only a single Issue (43) remains open on
> draft-ietf-slim-negotiating-human-language::
> https://trac.ietf.org/trac/slim/ticket/43
>
> This relates to the modality of a language indication.
>
> Currently, Gunnar has suggested a modification to the text of Section 5.4
> in order to address the issue:
> https://mailarchive.ietf.org/arch/msg/slim/A4b6Wpgh0Z0zpXKqpwF9bfdW35g
>
>
> Can WG participants review this suggested change, so that we can determine
> how to move forward?
>
> Currently, Section 5.4 states that:
>
>    The problem of knowing which language tags are signed and which are
>    not is out of scope of this document.
>
> I earlier thought that an application needed to look into the language
> subtag Description to find the word "sign" there in the text string. That
> is not a good solution. But when studying the topic again in RFC 5646 I
> found that there is a consistent machine-implementable way to assess if a
> language subtag is for a sign language.
> Therefore I included this text in the latest proposal for section 5.4 at
> the link that Bernard provided:
>
> "
>
> A specific sign language can be identified by its existence in the IANA
> registry of language subtags according to BCP 47 [RFC5646] , and finding
> that the language subtag is found at least in two entries in the
> registry, once with the Type field "language" and once with the Type
> field "extlang" combined with the Prefix field value "sgn".
>
> "
>
> So that should be the response on Dales request to easily decide if a
> language tag is for a sign language.
>
> Worse is next topic in issue 43: to assess if a language tag is for a
> spoken modality or written modality of a language.
> My wording proposal starts with the obvious cases: a non-signed languge
> tag in audio media is spoken, and a non-signed language tag in text media
> is written.  But for other cases, like in video or message or application
> or multiplexed media, other indications must be used to understand the
> intended modality. The proposed text mentions a few, and leaves to
> applications to decide which mechanisms to use for such cases. I wish we
> for the ambiguous cases could use the script subtag -zxxx to indicate
> spoken modality and a real script subtag even on language subtags where
> script subtags are suppressed, because that would satisfy issue 43 nicely
> and make section 5.4 much shorter and clearer. But we have had resistance
> against that solution.
>
> The proposed text might be a bit long and detailed. I am prepared to agree
> on a shortened version if there are any proposals. I think though that
> contents of 5.4 in the direction of my proposal is what satisfies  issue 43
> and also the comments lately that section 5.4 is too restrictive.
>
> /Gunnar
>
>
>
>
>
>
> _______________________________________________
> SLIM mailing listSLIM@ietf.orghttps://www.ietf.org/mailman/listinfo/slim
>
>
> --
> -----------------------------------------
> Gunnar Hellström
> Omnitorgunnar.hellstrom@omnitor.se
> +46 708 204 288
>
>