Re: [Slim] Extended functionality for the real-time language negotiation

Brian Rosen <br@brianrosen.net> Fri, 10 March 2017 18:46 UTC

Return-Path: <br@brianrosen.net>
X-Original-To: slim@ietfa.amsl.com
Delivered-To: slim@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id B40511294C5 for <slim@ietfa.amsl.com>; Fri, 10 Mar 2017 10:46:19 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -1.899
X-Spam-Level:
X-Spam-Status: No, score=-1.899 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, HTML_MESSAGE=0.001, RCVD_IN_DNSWL_NONE=-0.0001] autolearn=ham autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (2048-bit key) header.d=brianrosen-net.20150623.gappssmtp.com
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id lRVxgOX8F6FI for <slim@ietfa.amsl.com>; Fri, 10 Mar 2017 10:46:16 -0800 (PST)
Received: from mail-qk0-x244.google.com (mail-qk0-x244.google.com [IPv6:2607:f8b0:400d:c09::244]) (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 1BE0A12942F for <slim@ietf.org>; Fri, 10 Mar 2017 10:46:16 -0800 (PST)
Received: by mail-qk0-x244.google.com with SMTP id v125so29203084qkh.1 for <slim@ietf.org>; Fri, 10 Mar 2017 10:46:16 -0800 (PST)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=brianrosen-net.20150623.gappssmtp.com; s=20150623; h=mime-version:subject:from:in-reply-to:date:cc:message-id:references :to; bh=9Qtsq04pWWg22eGsp8/IvFlUXuMs4s4HVPCvmgAS2K0=; b=mP31DN58P9R2FKRkQxUIlcITc/j6xhRK/BJ7XXdp3rpwRcW/PQkTSLMXq1ZasDjMXI GLr1/rlQ821zVdVK3ACmATZbIQLA1N371YJ/SWD1GM5GbAtRiwNVG5+MjPB2+96EL5Iv +K1+wBCikxrE0OL3lys0hM64Y1PUAOAxTJhdVJhhzVxMKVLiEGul825e2UKhktw3JfDi 9wte1E94Ne0OMoSr+SRyJj3m7DUZ/Qv9fQjLDbb2vYbj5UlxsVJL6msgJULDjCYjOvqC Z6iqU9uZHalB+Jb6bdlLw2JxST+n3xBuytollehuLMaaO71mclXOZvwGUUQuwPpgK3Ux UWbg==
X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:subject:from:in-reply-to:date:cc :message-id:references:to; bh=9Qtsq04pWWg22eGsp8/IvFlUXuMs4s4HVPCvmgAS2K0=; b=oJTaVnmfasCFCriN/1RQrg7xF7JjVTFzSOOYjClDV8V4dFcvR7KTj6kqUJSpQHYKbG RsZ/IThu/0/OP/VnF8tQdTyuvoR3eexOQO+/vcWAKrW4GXgZqQyE6cCKXcf4JxuYPi1a gZVm0HxSPCfI1fDAuq/+qolDuB+121ZLF+AmTTESLvVtvRLPFatj6kj7DkOUBK1iGAUk WpLqjowug8dkbhMtJOUZAHyh/l5PgIkl9VSe2AbCOw76T4Y2XsOMdenKyww+EjPvgnZC YW2vrZF5/iW7LI5Yg3pa5MNFur6SixjDwF6YEJK1/b5zvZ5k+fdEc6Wgd44qrQsCdbmB cc3Q==
X-Gm-Message-State: AMke39lfeXmNzbULBUxbghuk5koKFuq+u+yzaNg+PdsNweWB0Sl/bC/c+4Q1HyW0+XAjvQ==
X-Received: by 10.200.52.161 with SMTP id w30mr21103194qtb.69.1489171575025; Fri, 10 Mar 2017 10:46:15 -0800 (PST)
Received: from [10.96.43.71] ([156.154.81.54]) by smtp.gmail.com with ESMTPSA id r60sm6861690qtd.53.2017.03.10.10.46.13 (version=TLS1 cipher=ECDHE-RSA-AES128-SHA bits=128/128); Fri, 10 Mar 2017 10:46:14 -0800 (PST)
Content-Type: multipart/alternative; boundary="Apple-Mail=_2E787662-1C1A-4A14-AE5E-2B0B371D1EBF"
Mime-Version: 1.0 (Mac OS X Mail 9.3 \(3124\))
From: Brian Rosen <br@brianrosen.net>
In-Reply-To: <60797269-4dad-5f48-3184-b8fbca42c30c@realtimetext.org>
Date: Fri, 10 Mar 2017 13:46:12 -0500
Message-Id: <FFABE6D6-316E-40E1-B923-4C44A05F39B7@brianrosen.net>
References: <084a066e-ea68-d614-58e1-08c904f477ea@omnitor.se> <60797269-4dad-5f48-3184-b8fbca42c30c@realtimetext.org>
To: arnoud.vanwijk@realtimetext.org
X-Mailer: Apple Mail (2.3124)
Archived-At: <https://mailarchive.ietf.org/arch/msg/slim/vZfvfvZCkgv9p67gZD1jPiDmHZw>
Cc: "slim@ietf.org" <slim@ietf.org>, Gunnar Hellström <gunnar.hellstrom@omnitor.se>
Subject: Re: [Slim] Extended functionality for the real-time language negotiation
X-BeenThere: slim@ietf.org
X-Mailman-Version: 2.1.17
Precedence: list
List-Id: Selection of Language for Internet Media <slim.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/slim>, <mailto:slim-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/slim/>
List-Post: <mailto:slim@ietf.org>
List-Help: <mailto:slim-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/slim>, <mailto:slim-request@ietf.org?subject=subscribe>
X-List-Received-Date: Fri, 10 Mar 2017 18:46:20 -0000

We seem to be running in circles.

The work group has discussed the idea of indicating media & language combination preferences and DECLINED, at this time to do that.  We decided our first effort would be to label language preferences per media, with no cross media preferences.  That doesn’t mean we won’t ever do it.  It means our first document on SDP negotiation won’t cover that case.

Yet you keep trying to get it in the current document.

I don’t want to revisit our decision, and I don’t want to see any linkage between media in this document.  I accept there are requirements to have such linkages, but think there are many. many uses for a simpler mechanism such as what the current document discusses.

If you want to write a draft that proposes an extension to the current mechanism, please write it, and I’ll review it.  But I would like to see this document published with the simple per-media preference.

Brian


> On Mar 10, 2017, at 1:25 PM, Arnoud van Wijk <arnoud.vanwijk@realtimetext.org> wrote:
> 
> Hi Gunnar,
> 
> I also prefer solution 1.2 and 2.2
> 
> I like the use of q values here. I think that is better then adding a new attribute. 
> 
> Good proposal.
> 
> 
> /Arnoud
> 
> 
> On 07/03/2017 14:01, Gunnar Hellström wrote:
>> This is a slightly improved extract from the summary of the LC period for the real-time SLIM draft about two discussed functionality extensions. (Some of the examples had copy-paste errors.) 
>> The intention is to discuss this topic as far as needed to decide if any or both these functionality enhancements could go into the current draft before publication, and if not included then decide if any changes are required in the current draft in order to accommodate for the extensions. 
>> (see the summary sent on March 4 for e.g. references to the other summary points.
>> ----------------------------------------------------------------------------------------------------------
>> 
>> n). Indication of preference between media, and of simultanous versus alternative languages.
>> 
>> This is a summary of the remaining issues related to items e), f), j) and m) above. 
>> 
>> Issue e) proposes changes to enable indication of preference for language in different media.
>> Issue f) requests possibility to indicate request or offering of text captions of spoken language.
>> Issue j) proposes use of the Accept-Language syntax, that could both solve the functional needs
>> of e) and f) and sort out the syntax and semantics problems of the asterisk parameter.
>> Issue m) just indicates that issue e) is not yet resolved. 
>> 
>> Randall has proposed to resolve the functional needs in a new draft, and not accept the Accept-Language syntax.
>> I have proposed a simple way to resolve issue e) - the preference between media, at the same time improving the definition of the semantics of the asterisk parameter.
>> No real solution to issue f) - the simultaneity indication  has been discussed.
>> Issue j) - the proposal to use the Accept-Language syntax can potentially be used to resolve issues e) and f).
>> 
>>  Discussion:
>> Issue e) says that there is a need to be able indicate which of a set of language/media indications are more preferred alternatives than others.
>> Examples are:
>> 1. A want to get written English in text, A can as a less preferred alternative accept to get spoken English.  An answering party B who can use text will then respond with written text and get good satisfaction, while another answering user C without text capability will answer in spoken English and have a possibility for a reasonably successful call.
>> Without this indication, the first answering party B may have seen the spoken and written alternatives as true equal preference alternatives and answered with spoken English that will result in less satisfied users. 
>> 2. A Prefer to receive spoken language, and can accept to receive text.   When answering party B can use spoken language, that will be satisfied, otherwise written language will be used.
>> 3. A Prefer to use spoken language in both directions, and can accept to use sign language in video in both directions. Answering party B has a clear indication of why both signed and written is indicated and can answer according to its capabilities trying to satisfy the preference for spoken language.
>> 4. A Prefer to use  sign language in both directions and can accept to use written language in both directions. Sign language users will use sign language, others will use text. 
>> 5. Prefer to send sign language and receive text (deaf-blind user), can accept to send text.  In a call with a person with similar prefeences, text will be used both ways, otherwise sign one way and text the other.
>> 
>> etc. 
>> 
>> Issue f) requires a way to indicate use of captioning and other situations where use of simultaneous languages in different modalities are needed:
>> 
>> 1. Preference for hearing spoken language and simultaneously read written language in text. ( captioning) .   The time is here when this can be provided automatically in some settings, but also traditionally by a manned service.
>> 2. Preference for hearing spoken language and simultaneously seeing the speaker in video.   (lip-reading).  Easily and naturally provided once the need is known.
>> 3. Preference for seeing sign language and simultaneously hear spoken language in audio.  ( for multiple users at the terminal )    One of the streams is provided by an interpreter.
>> 4. Preference for hearing spoken language and simultaneously view written language in video. (captioning if we accept to specify text as overlay on video, otherwise it is same as number 1.)  
>> 
>> Some of these can be acceptable also if just one of the language/media combinations can be provided, but is much more preferred if both can be provided together. In other cases it is essential to get both simultaneously. There is a need to differentiate in the indication that this preference for getting the languages together is preferred.
>> 
>> Alternative coding proposals:
>> 1. Preference between language/modality
>> 1.1 Based on draft -08, add the coding of an asterisk last in an attribute to mean lower preference for a lanugage/media combination than the one(s) without an asterisk. 
>> example
>> m=audio
>> a=hlang-recv:en* 
>> m=text
>> a=hlang-recv:en
>> 
>> Audio and text are alternatives and text preferred 
>> 1.2. Change to the Accept-Language syntax and let the q-values have scope over the whole SDP.    
>>  m=video 51372 RTP/AVP 31 32
>>  a=hlang-recv:ase;q=0.9
>>  a=hlang-send:ase;q=0.9
>> 
>> 
>>  m=text 49250 RTP/AVP 98,99
>>  a=hlang-send:en;q=0.5,*;q=0.1
>>  a=hlang-recv:en;q=0.5,*;q=0.1
>> 
>> Sign language is higher preferred than text.
>>  
>> 
>> 
>> 1.3. Introduce a new a=modality attribute on media level, with parameters: <modality>, <direction>, <preference>     
>> example:
>> m=text
>> a=modality:written,recv,hi
>> a=hlang-recv:en*
>> m=audio
>> a=modality:spoken,recv,med
>> a=hlang-recv:en*
>> 
>> 
>> 2. Preference for simultaneous languages vs alternative languages:
>> 
>> 2.1. Based on draft -08, add another notation to the use of the asterisk, e.g. an optional character to be used together with the asterisk to mark media that are wanted together. (ugly)   example: 
>> m=audio
>> a=hlang-recv:en*$c 
>> m=text
>> a=hlang-recv:en$c
>> 
>> The $ is a simultaneity indication, the c is a groouping indicator telling that all modalities marked with the $c are wanted together. (we might be able to restrict the indication to just one set of languages that are wanted simultaneously. 
>> 
>> 2.2. Use the Accept-Language syntax and add the usage rules that q-values with less than .1 difference mean languages with a preference to be used together. Higher differences indicate that they are alternatives.  Thereby it is both possible to indicate simultaneity and preference if the simultaneity cannot be satisfied.
>> 
>>  m=audio 51372 RTP/AVP 0
>>  a=hlang-recv:ase;q=0.5
>>  a=hlang-send:ase;q=0.5
>> 
>> 
>>  m=text 49250 RTP/AVP 98,99
>>  a=hlang-send:en;q=0.51,*;q=0.1
>>  a=hlang-recv:en;q=0.51,*;q=0.1
>> 
>> The q-values differences are within 0.1 so it is a preference for getting both together, but if that is not possible, text is preferred. 
>> 
>> 2.3. Add to the new a=modality attribute a fourth, optional parameter [simultaneity]   with value any single letter, indicating a preference for having that modality simultaneously with another modality indicated with the same value in the [simultaneity] parameter.   Without this parameter, the modalities are alternatives.
>> m=text
>> a=modality:written,recv,hi,d
>> a=hlang-recv:en*
>> m=audio
>> a=modality:spoken,recv,hi,d
>> a=hlang-recv:en*
>> 
>> 
>> 
>> Status: Not solved. Conclusion needed on how to handle these issues e, f, and j,  both regarding which solution and what procedure to take to apply it.
>> 
>> I have a slight preference for solution 1.2 and 2.2 , 
>> Your views?
>> Regards
>> Gunnar
>> -- 
>> -----------------------------------------
>> Gunnar Hellström
>> Omnitor
>> gunnar.hellstrom@omnitor.se <mailto:gunnar.hellstrom@omnitor.se>
>> +46 708 204 288
>> 
>> 
>> _______________________________________________
>> SLIM mailing list
>> SLIM@ietf.org <mailto:SLIM@ietf.org>
>> https://www.ietf.org/mailman/listinfo/slim <https://www.ietf.org/mailman/listinfo/slim>
> 
> _______________________________________________
> SLIM mailing list
> SLIM@ietf.org
> https://www.ietf.org/mailman/listinfo/slim