Re: [Ltru] Fw: I-D Action:draft-burnett-pronunciation-alphabet-registry-00.txt

Leif Halvard Silli <lhs@malform.no> Wed, 16 December 2009 19:56 UTC

Return-Path: <lhs@malform.no>
X-Original-To: ltru@core3.amsl.com
Delivered-To: ltru@core3.amsl.com
Received: from localhost (localhost [127.0.0.1]) by core3.amsl.com (Postfix) with ESMTP id A4EC53A6834 for <ltru@core3.amsl.com>; Wed, 16 Dec 2009 11:56:53 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -2.555
X-Spam-Level:
X-Spam-Status: No, score=-2.555 tagged_above=-999 required=5 tests=[AWL=0.555, BAYES_05=-1.11, GB_I_LETTER=-2]
Received: from mail.ietf.org ([64.170.98.32]) by localhost (core3.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id OCU1laUm32AC for <ltru@core3.amsl.com>; Wed, 16 Dec 2009 11:56:52 -0800 (PST)
Received: from smtp.domeneshop.no (smtp.domeneshop.no [194.63.248.54]) by core3.amsl.com (Postfix) with ESMTP id 97F803A677C for <ltru@ietf.org>; Wed, 16 Dec 2009 11:56:49 -0800 (PST)
Received: from cm-84.208.110.159.getinternet.no ([84.208.110.159] helo=[10.0.1.2]) by smtp.domeneshop.no with esmtpa (Exim 4.69) (envelope-from <lhs@malform.no>) id 1NKzz7-0006XI-05; Wed, 16 Dec 2009 20:56:33 +0100
Date: Wed, 16 Dec 2009 20:56:31 +0100
From: Leif Halvard Silli <lhs@malform.no>
To: "Phillips, Addison" <addison@amazon.com>
Message-ID: <20091216205631957633.bb4405de@malform.no>
In-Reply-To: <C7A5719F1E562149BA9171F58BEE2CA412991F8EF6@EX-IAD6-B.ant.amazon.com>
References: <012a01ca7c75$07aa4e60$6801a8c0@oemcomputer> <20091215210229.GA28404@mercury.ccil.org> <001d01ca7e1b$4a6796c0$6801a8c0@oemcomputer> <C7A5719F1E562149BA9171F58BEE2CA412991F8EF6@EX-IAD6-B.ant.amazon.com>
MIME-Version: 1.0
Content-Type: text/plain; charset=iso-8859-1
Content-Transfer-Encoding: quoted-printable
Organization: =?utf-8?B?TcOlbGZvcm0ubm8=?=
X-Mailer: GyazMail version 1.5.9
Cc: LTRU Working Group <ltru@ietf.org>
Subject: Re: [Ltru] Fw: I-D Action:draft-burnett-pronunciation-alphabet-registry-00.txt
X-BeenThere: ltru@ietf.org
X-Mailman-Version: 2.1.9
Precedence: list
List-Id: Language Tag Registry Update working group discussion list <ltru.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/listinfo/ltru>, <mailto:ltru-request@ietf.org?subject=unsubscribe>
List-Archive: <http://www.ietf.org/mail-archive/web/ltru>
List-Post: <mailto:ltru@ietf.org>
List-Help: <mailto:ltru-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/ltru>, <mailto:ltru-request@ietf.org?subject=subscribe>
X-List-Received-Date: Wed, 16 Dec 2009 19:56:53 -0000

Phillips, Addison, Wed, 16 Dec 2009 11:08:07 -0500:

> [..] the authors (they represent the W3C SSML [..] had reasons to seek
>  a  separate registry which didn't seem unreasonable to me at the time.
> 
> The main thing is that pronunciation schemes tend to be vendor 
> specific and are not necessarily transcriptions or, in fact, even 
> useful outside the speech synthesis process. It seems unlikely to me 
> that the language subtag registry would take in a wide variety of 
> these things, especially the vendor-specific variations.

They cease to be vendor specific when registered in the Pronunciation 
Registry (PR), just as windows-1252 has long ago ceased to be Windows 
specific. PR obviously should exist on its own. But perhaps it would be 
useful if there were cooperation between the PR and LTR:

* PR could be shaped more like LTR is. What about recommending 
"pinyin-2001" instead of "pinyin2001"? Then it would be easy to reuse 
'2001' in LTR, if requested. And 'fonipa' could be registered in the PR 
as an alias for 'ipa', to reflect the LTRU registry. 
* Many pronunciations alphabets are unregistered, and regardless of 
where they would registered first, the tag they would get in the first 
registration, could be a candidate for reuse in the other registry.

Examples of unregistered (anywhere) pronunciation alphabets that I am 
aware of:

* the Russian phonetic/pronunciation alphabet for use in education and 
teaching of Russian. It only uses a subset of the usual Russian 
Cyrillic letters + diacritical marks.
* Norvegia, for the notation of spoken/dialect Norwegian (in use since 
19th century) [1][2]
* Danish has the Danica alphabet and Swedish has a similar thing.[2]

I can already say that we ourselves in our teaching material efforts 
would have found it practical if there existed a LTR variant subtag for 
the Russian pronunciation alphabet.

[1] http://no.wikipedia.org/wiki/Norvegia_(lydskrift)
[2] http://www.hum.uit.no/Forproven/Fagpresentasjoner/lydskrift.html
-- 
leif halvard silli