Re: [Slim] IETF last call for draft-ietf-slim-negotiating-human-language (Section 5.4)

Gunnar Hellström <gunnar.hellstrom@omnitor.se> Mon, 13 February 2017 22:23 UTC

Return-Path: <gunnar.hellstrom@omnitor.se>
X-Original-To: ietf@ietfa.amsl.com
Delivered-To: ietf@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 80FE11299A2 for <ietf@ietfa.amsl.com>; Mon, 13 Feb 2017 14:23:20 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -2.599
X-Spam-Level:
X-Spam-Status: No, score=-2.599 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, HTML_MESSAGE=0.001, RCVD_IN_DNSWL_LOW=-0.7, SPF_PASS=-0.001, URIBL_BLOCKED=0.001] autolearn=ham autolearn_force=no
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id bN0-iUhwG2Ml for <ietf@ietfa.amsl.com>; Mon, 13 Feb 2017 14:23:17 -0800 (PST)
Received: from bin-vsp-out-03.atm.binero.net (bin-mail-out-05.binero.net [195.74.38.228]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 6D8C5129994 for <ietf@ietf.org>; Mon, 13 Feb 2017 14:23:17 -0800 (PST)
X-Halon-ID: f7c137f6-f23a-11e6-9c99-0050569116f7
Authorized-sender: gunnar.hellstrom@omnitor.se
Received: from [192.168.2.136] (unknown [83.209.158.27]) by bin-vsp-out-03.atm.binero.net (Halon Mail Gateway) with ESMTPSA; Mon, 13 Feb 2017 23:22:55 +0100 (CET)
Subject: Re: [Slim] IETF last call for draft-ietf-slim-negotiating-human-language (Section 5.4)
To: Bernard Aboba <bernard.aboba@gmail.com>
References: <CAOW+2du3zqYfS9iu4XjrQ6Rr6B5C50OXk49=u7Wrg0-1TE7QzA@mail.gmail.com> <64c8d9d7-9df8-023e-6d47-4807cc0e30b7@omnitor.se> <CAOW+2dsncmCrGxcFBeBJ33eGuVT8gnF2CD8QvHbVFT5NzN5F8A@mail.gmail.com>
From: Gunnar Hellström <gunnar.hellstrom@omnitor.se>
Message-ID: <6dc77c0f-6d3d-5945-e936-6f7523003b9c@omnitor.se>
Date: Mon, 13 Feb 2017 23:23:12 +0100
User-Agent: Mozilla/5.0 (Windows NT 10.0; WOW64; rv:45.0) Gecko/20100101 Thunderbird/45.7.1
MIME-Version: 1.0
In-Reply-To: <CAOW+2dsncmCrGxcFBeBJ33eGuVT8gnF2CD8QvHbVFT5NzN5F8A@mail.gmail.com>
Content-Type: multipart/alternative; boundary="------------D4945B31890E75A6F10555B3"
Archived-At: <https://mailarchive.ietf.org/arch/msg/ietf/cOMhHrFyr4jSvif1-tg0rZlJv2c>
Cc: slim@ietf.org, ietf@ietf.org
X-BeenThere: ietf@ietf.org
X-Mailman-Version: 2.1.17
Precedence: list
List-Id: IETF-Discussion <ietf.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/ietf>, <mailto:ietf-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/ietf/>
List-Post: <mailto:ietf@ietf.org>
List-Help: <mailto:ietf-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/ietf>, <mailto:ietf-request@ietf.org?subject=subscribe>
X-List-Received-Date: Mon, 13 Feb 2017 22:23:20 -0000

Den 2017-02-13 kl. 22:58, skrev Bernard Aboba:
> Gunnar said:
>
> "With some hesitation I suggest to let it mean to see a speaking person."
>
> [BA] Is this for the purpose of enabling lip reading?
Yes
>
> Assuming that we go that way, how would captioning be negotiated?
It is best placed in text media.

But captions overlayed on video in the media stream is a used technology 
so it would be good to be able to specify it.
That we cannot do it is again a sad effect of the language tags not 
distinguishing between spoken and written modality.
I once had an ambition to try to specify a notation for that to be added 
to BCP 47, but did not succeed to get any real discussion going on the 
topic.

Eventually there may be a need to specify a Modality attribute. That may 
be needed for media specified e.g. as m=application where the protocol 
can carry all kinds of modality and it is not apparent from the m-line 
what it is. These are however not common for real-time conversational 
purposes, so I do not think it is urgent to solve the problem for 
m=application now.  But maybe for captioning in video media?

/Gunnar


>
> On Mon, Feb 13, 2017 at 1:23 PM, Gunnar Hellström 
> <gunnar.hellstrom@omnitor.se <mailto:gunnar.hellstrom@omnitor.se>> wrote:
>
>     Bernard,
>
>     I just issued comments where I also included the "silly states"
>     topic with similar views as yours.
>
>
>     Den 2017-02-13 kl. 20:06, skrev Bernard Aboba:
>>     Looking over Section 5.4, it seems to me that the title "Silly
>>     States" may not be appropriate, because it mixes discussion of
>>     combinations of media and language that have an "undefined"
>>     meaning with combinations for which normative guidance can be
>>     provided  So rather than having a single "Silly States" section,
>>     perhaps we can have a section on "Undefined States" (for those
>>     combinations which have an undefined meaning) provide normative
>>     guidance on defined combinations elsewhere.
>>
>>
>>           5.4
>>           <https://tools.ietf.org/html/draft-ietf-slim-negotiating-human-language-06#section-5.4>.
>>           Silly States
>>
>>
>>
>>         It is possible to specify a "silly state" where the language
>>         specified does not make sense for the media type, such as specifying
>>         a signed language for an audio media stream.
>>         An offer MUST NOT be created where the language does not make sense
>>         for the media type.  If such an offer is received, the receiver MAY
>>         reject the media, ignore the language specified, or attempt to
>>         interpret the intent (e.g., if American Sign Language is specified
>>         for an audio media stream, this might be interpreted as a desire to
>>         use spoken English).
>>
>>         A spoken language tag for a video stream in conjunction with an audio
>>         stream with the same language might indicate a request for
>>         supplemental video to see the speaker.
>>     [BA] Rather than using terms like "might" for combinations that could have a
>>     defined meaning, I would like to see the specification provide normative
>>     language on these use cases. In particular, I would like the specification to describe:
>>     a. What it means when a spoken language tag is included for a video stream.
>>     Is this to be interpreted as a request for captioning?
>>     b. What it means when a signed language tag is included for an audio stream.
>>     Is the meaning of this "undefined" and if so, should it be ignored?
>>     c. What it means when a signed language tag is included for a text stream.
>>     If some of these scenarios are not defined, the specification can say
>>     "this combination does not have a defined meaning" or something like that.
>     See my recent comments for more views. I support the idea to be
>     normative and specific when possible.
>     A complication is that there is no difference between language
>     tags for written and spoken language.
>
>     So we have the following possible combinations and interpretations
>     of "silly states"
>
>     1. Spoken/written tag in video media, can mean to see a speaking
>     person, or to provide captions overlayed on video.
>     With some hesitation I suggest to let it mean to see a speaking
>     person. The draft adds a requirement to have the same language in
>     the audio stream in the same direction to have that
>     interpretation.  Should that mean that if there is another
>     language in the audio stream, then the spoken/written tag in the
>     video stream should mean captions in the specified language? That
>     sounds useful for some cases, but complex to interpret and unfair
>     to the users who would benefit from captions in the same language
>     as in audio.
>     Summary: I think we had better to use the interpretation to see a
>     speaking person regardless of what language is indicated for audio.
>
>     2. Signed language tag in audio media, can mean audio from a
>     signing person. That could be anything between near silence and
>     spoken words corresponding to the signed signs as far as feasible.
>     This is usually seen as disturbing to sign language users but it
>     exists, e.g. when one erson needs to communicate with both hearing
>     and deaf persons simultaneously. There are also variants of
>     signing, called sign supported language, with signs expressed with
>     spoken language word order and grammar. That can more easily be
>     combined with spoken language, but would more likely be indicated
>     by spoken language tag in audio media.
>     Summary: I am inclined to let signed language tag in audio media
>     mean audio from the signing person and possibly used for the rare
>     cases when it has some relevance for language communication.
>
>     3. Sign language tag in text media. There are some ways to
>     represent sign language in various kinds of symbol or text
>     representation. Some are represented in Unicode. One is a system
>     called Sign Writing. Some fingerspelling methods also have fonts
>     corresponding to characters in code pages. There is also an
>     informal way to write manuscripts for signing in words with
>     capitals approximately corresponding to signs, often with some
>     notation added for unique sign language ways of expression that
>     has no direct correspondance to words. None of these systems above
>     are common in real-time conversation, but I have seen examples of
>     such use.
>     Summary: I think we can leave freedom here and just specify that a
>     sign language tag in text media means some representation of sign
>     language or a corresponding fingerspelling system in text media.
>
>     If these conclusions are accepted, we can formulate modified text.
>     Note that the case with spoken/written language tag in video media
>     is mentioned in two places in the draft.
>
>     Regards
>     Gunnar
>
>>
>>
>>     _______________________________________________
>>     SLIM mailing list
>>     SLIM@ietf.org <mailto:SLIM@ietf.org>
>>     https://www.ietf.org/mailman/listinfo/slim
>>     <https://www.ietf.org/mailman/listinfo/slim>
>
>     -- 
>     -----------------------------------------
>     Gunnar Hellström
>     Omnitor
>     gunnar.hellstrom@omnitor.se <mailto:gunnar.hellstrom@omnitor.se>
>     +46 708 204 288
>
> _______________________________________________
> SLIM mailing list
> SLIM@ietf.org
> https://www.ietf.org/mailman/listinfo/slim
-- 
-----------------------------------------
Gunnar Hellström
Omnitor
gunnar.hellstrom@omnitor.se
+46 708 204 288