Return-Path: <dburnett@voxeo.com>
X-Original-To: speechsc@core3.amsl.com
Delivered-To: speechsc@core3.amsl.com
Received: from localhost (localhost [127.0.0.1]) by core3.amsl.com (Postfix)
 with ESMTP id C68B23A680B; Tue, 29 Dec 2009 03:01:25 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: 0.301
X-Spam-Level: 
X-Spam-Status: No, score=0.301 tagged_above=-999 required=5 tests=[AWL=-0.300,
 BAYES_50=0.001, HTML_MESSAGE=0.001, J_CHICKENPOX_16=0.6]
Received: from mail.ietf.org ([64.170.98.32]) by localhost (core3.amsl.com
 [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id zo95H47v-TAY;
 Tue, 29 Dec 2009 03:01:20 -0800 (PST)
Received: from voxeo.com (mmail.voxeo.com [66.193.54.208]) by core3.amsl.com
 (Postfix) with ESMTP id D3D5A3A6844; Tue, 29 Dec 2009 03:01:19 -0800 (PST)
Received: from [71.204.33.81] (account dburnett HELO [192.168.15.111]) by
 voxeo.com (CommuniGate Pro SMTP 5.2.3) with ESMTPSA id 55101526;
 Tue, 29 Dec 2009 11:00:52 +0000
Message-Id: <C46B7F31-9989-442C-B2F1-CA77E79F04F8@voxeo.com>
From: Dan Burnett <dburnett@voxeo.com>
To: Roni Even <Even.roni@huawei.com>
In-Reply-To: <027801ca1b1c$c2e8ee80$48bacb80$%roni@huawei.com>
Content-Type: multipart/alternative; boundary=Apple-Mail-40-309409471
Mime-Version: 1.0 (Apple Message framework v936)
Date: Tue, 29 Dec 2009 06:00:50 -0500
References: <033101c9ff3a$cbe33160$63a99420$%roni@huawei.com>
 <E2C626B8-8CA1-4A1D-A2CE-B6AB4B269DEE@voxeo.com>
 <027801ca1b1c$c2e8ee80$48bacb80$%roni@huawei.com>
X-Mailer: Apple Mail (2.936)
X-Mailman-Approved-At: Tue, 29 Dec 2009 03:37:58 -0800
Cc: speechsc@ietf.org, sarvi@cisco.com, oran@cisco.com, rai@ietf.org
Subject: Re: [Speechsc] RAI review of draft-ietf-speechsc-mrcpv2-19
X-BeenThere: speechsc@ietf.org
X-Mailman-Version: 2.1.9
Precedence: list
List-Id: Speech Services Control Working Group <speechsc.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/listinfo/speechsc>,
 <mailto:speechsc-request@ietf.org?subject=unsubscribe>
List-Archive: <http://www.ietf.org/mail-archive/web/speechsc>
List-Post: <mailto:speechsc@ietf.org>
List-Help: <mailto:speechsc-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/speechsc>,
 <mailto:speechsc-request@ietf.org?subject=subscribe>
X-List-Received-Date: Tue, 29 Dec 2009 11:01:25 -0000

--Apple-Mail-40-309409471
Content-Type: text/plain;
	charset=WINDOWS-1252;
	format=flowed;
	delsp=yes
Content-Transfer-Encoding: quoted-printable

Hi Roni,

Just to finish up on your last comments . . .

-- dan

On Aug 12, 2009, at 3:15 AM, Roni Even wrote:

> Hi Dan,
> I understand your explanation about all these "vendor specific" =20
> parameter. I think that since this a standard track document there =20
> should be some text explaining the usage of these parameters as well =20=

> as making a note that since these are vendor specific information =20
> you cannot compare the values coming from different vendors

Thank you.  I will note this in the next draft and suggest how these =20
parameters may be used in light of their vendor dependence.

>
>
> As for my comment number 5 on payload type 96. My comment was that =20
> if the m-line has a payload type number of 96 you must have a =20
> a=3Drtpmap line mapping 96 to a specific subtype name while for pcmu =20=

> it is not mandatory to have a=3Drtpmap like you have in your examples =20=

> since payload type number 0 is a static payload type number assigned =20=

> to pcmu
>

I'm sorry, I did not explain this very well.  I understood your =20
comment.  My reply was that of the three examples, example 2 did =20
actually provide the a=3Drtpmap line for 96.  Since the payload type of =20=

96 should not even have been included in the first and third examples, =20=

once I removed it from those two examples all three contained the =20
proper a=3Drtpmap lines.
Although not necessary to have an a=3Drtpmap line for payload type 0, =20=

others in the past had requested it so I left it in.

>
> Roni Even
>
> From: Dan Burnett [mailto:dburnett@voxeo.com]
> Sent: Tuesday, August 11, 2009 9:22 PM
> To: Roni Even
> Cc: sarvi@cisco.com; oran@cisco.com; 'Eric Burger'; =20
> speechsc@ietf.org; rai@ietf.org
> Subject: Re: RAI review of draft-ietf-speechsc-mrcpv2-19
>
>
> On Jul 7, 2009, at 3:40 PM, Roni Even wrote:
>
>
> Hi,
>
> I was assigned to do a RAI review of the draft.  The draft looks =20
> ready for publication to me. I have some comments mostly editorial.
>
> The only issue I see that is not pure editorial is the issue of the =20=

> different parameters like confidence threshold, sensitivity level =20
> (see comments 11, 13, 15, 16 and 17). I think that some =20
> clarification on the semantics and the scale (for example are the =20
> values linearly spaced) as well as when they are useful will be =20
> helpful to implementers.
>
> 1.       In figure 1 Expand the abbreviations TTS, ASR, SV , SI and =20=

> how they are related to the media resource types in 3.1
>
>
> Done.  Added some text explaining Figure 1 and enhanced Figure 1 =20
> slightly for clarification.
>
> 2.       In figure 1 there is a SIP dialog between the MRCPv2 client =20=

> and the media source/sink, what is this dialog, I only saw in =20
> section 4 a dialog between the client and server.
>
> Clarified in the first example of section 4.2 that the SIP dialog =20
> with the media source/sink is not shown.
> 3.       In section 3.2 you have =93For example: =20
> sip:mrcpv2@example.net=94 twice one after the other.
>
> Fixed.
>
>
> 4.       In the example in section 4.2 you =93a=3Dcmid:1=94, cmid is =20=

> specified later in the document so maybe you can add some reference =20=

> to where it is specified
>
> Done.
>
>
>
> 5.       In the example is section 4.2 and in following examples you =20=

> have =93m=3Daudio 49170 RTP/AVP 0 96=94 but do not have an rtpmap =20
> parameter for mapping 96 (dynamic payload type number) to a media =20
> encoding name.
>
> It is not in the first or third examples (Synthesizer only), but it =20=

> is in the second example (Recognizer).  I have removed 96 as an =20
> option for the Synthesizer-only examples but let it remain as an =20
> addition for the Recognizer example.
>
>
>
> 6.       In section 4.3 =93Also note that more that one media session =20=

> can be associated with a single resource if need be, but this =20
> scenario is not useful for the current set of resources=94. There is a =
=20
> typo the second =93that=94 should be =93than=94. I am also not sure if =
the =20
> current syntax in this document can support the mode.
>
> Fixed the typo.
>
>
>
> 7.       In section 4.3 =93The formatting of the"cmid" attribute in =20=

> SDP RFC3388 [RFC4566]=94. I think you meant SDP grouping and need the =20=

> reference to RFC 3388.
>
> I removed the reference altogether because it already exists =20
> (correctly) earlier in the paragraph.
>
>
>
> 8.       In section 5.1 =93The message-length field specifies the =20
> length of the message, including the start-line=94 is the length in =20=

> Bytes, there is no unit specified.
>
> Changed "length of the message" to "length of the message in bytes".
>
>
>
> 9.       In section 6.3.1, typo you have =93Verfication =93 instead of =
=20
> verification. It appears twice in the section.
>
> Fixed.
>
>
>
> 10.   In the example in section 7 you have =93m=3Daudio 0 RTP/AVP 0 1 =
3=94 =20
> payload type 1 was deleted from the IANA registry, maybe have =20
> another payload type number.
>
> I just removed that payload type.  It is not germane to the example.
>
>
>
> 11.   In section 9.4.1, 9.4.2 and 9.4.3 you specify confidence =20
> threshold, sensitivity level and speed vs accuracy. What is the =20
> scale here; is it linear between 0 and 1. What is the absolute value =20=

> of the number, if you receive the same confidence level from two =20
> recognizers are they the same (e.g. when using context block to =20
> switch servers).  For the speed vs accuracy, how does the client =20
> know what is the relation between the value and the number of =20
> available sessions, since this seems to be the reason for using this =20=

> parameter.
>
> The interpretation of all of these parameters is implementation-=20
> specific because the underlying technologies used to implement them =20=

> vary and can even be proprietary.  In practice the speech =20
> recognition and synthesis and speaker authentication communities =20
> have lived with this state of affairs for many years, and users of =20
> other APIs for this technology are well aware of and have built =20
> applications that accommodate this variability in interpretation.  =20
> It is outside the scope of this specification to attempt to =20
> standardize interpretations of these values.
>
>
> 12.   In 9.4.9 and in 10.4.8, 11.4.11 what are the values for media-=20=

> type-value, you also mention audio and video but it looks to me that =20=

> this document only discusses voice.
>
> Yes.  Although the original intent was to record speech, application =20=

> authors today are beginning to look at ways to incorporate other =20
> audio or video.  The intent of the sentences in these sections is to =20=

> clarify that the specification itself imposes no restriction on the =20=

> types of media that are allowed.
>
>
>
> 13.   In 9.4.35 and 9.4.36 what is the scale for the consistency =20
> here. How does one know what close means. What is the consistency =20
> between different recognizers.
>
> The answer to question 11, above, applies here as well.
>
>
>
> 14.   In section 9.6.3.3 in the example (figure 2) confidence should =20=

> be 0.75 and not 75
>
> Fixed.
>
>
>
> 15.   In section 10.4.1 it is not clear how you measure the =20
> sensitivity in order to specify, is it based on some SNR translated =20=

> to 0 to 1 scale?
>
> The answer to question 11, above, applies here as well.
>
>
>
> 16.   In 11.4.6 the same issue with the scale, how does the client =20
> know how to set a value when working with different speaker =20
> verification servers.
>
> Ditto.  I should point out that in all of these cases the parameters =20=

> are typically passed directly to the engine, and their =20
> interpretations are defined (and described) in the vendors' =20
> documentation.  The most common MRCPv2 server implementations are by =20=

> the technology vendors themselves (the providers of the synthesis, =20
> recognition, and verification engines).  This is commonly understood =20=

> in this technology industry (meaning those who use this technology =20
> regularly).
>
>
>
> 17.   In 11.5.2.9 you state that the verification-score is not a =20
> probability, so what is it. How can the client decide if, for =20
> example, 0 is a good score for specifying the threshold.  I also =20
> noticed that the values in the example in section 11.5.2.10 are very =20=

> precise like 0.98514 is this the expected precision. The examples =20
> here and in section 11.11 do not show the threshold, if the =20
> threshold is required for this flow why not show it in the example?
>
> This parameter, as others mentioned above, has only a vendor-=20
> specific interpretation.  In practice authors interpret these values =20=

> based both on guidance from the technology vendors and via =20
> experimentation on large sets of recorded data.
>
> The Min-Verification-Score threshold is not required to be set.  In =20=

> many cases the technology vendor has a fairly good understanding of =20=

> what the default threshold should be.  The verification-score is =20
> returned, however, in case the application author determines =20
> (through experimentation, as described above) that the default =20
> threshold is not producing optimal results for the application.  In =20=

> that case the author can set the threshold to a different value or =20
> can set it to -1 and make the determination within the application =20
> itself based on the verification-score values.
>
>
>
> 18.   In section 12.3 the suggestion is to use SRTP as the mandatory =20=

> interoperability mode. If the reason for mandating SRTP is for a =20
> common mode you should also decide on a key exchange mechanism. I =20
> suggest you look at =
http://tools.ietf.org/html/draft-ietf-avt-srtp-not-mandatory-02=20
>  for discussion on media security.
>
> Based on the discussion between you and Dan York on the list, I will =20=

> change this:
>
> 12.3. Media session protection
> Sensitive data is also carried on media sessions terminating on =20
> MRCPv2 servers (the other end of a media channel may or may not be =20
> on the MRCPv2 client). This data includes the user's spoken =20
> utterances and the output of text-to-speech operations. MRCPv2 =20
> servers MUST support SRTP for protection of audio media sessions. =20
> MRCPv2 clients that originate or consume audio similarly MUST =20
> support SRTP. Alternative media channel protection MAY be used if =20
> desired (e.g. IPSEC).
>
> to this:
>
> 12.3. Media session protection
> Sensitive data is also carried on media sessions terminating on =20
> MRCPv2 servers (the other end of a media channel may or may not be =20
> on the MRCPv2 client). This data includes the user's spoken =20
> utterances and the output of text-to-speech operations. MRCPv2 =20
> servers MUST support a security mechanism for protection of audio =20
> media sessions. MRCPv2 clients that originate or consume audio =20
> similarly MUST support a security mechanism for protection of the =20
> audio. If appropriate, usage of the Secure Real-time Transport =20
> Protocol (SRTP) [RFC3711] is recommended.
>
> 19.   In section13.7.2 you specify the attribute resource as session =20=

> level yet in the example in section 4.2 it is a media level =20
> attribute. The same goes for the channel attribute
>
> I have corrected both in section 13.7.2 to be media-level.
>
>
>
> Thanks
>
> Roni Even
>
>
>


--Apple-Mail-40-309409471
Content-Type: text/html;
	charset=WINDOWS-1252
Content-Transfer-Encoding: quoted-printable

<html><body style=3D"word-wrap: break-word; -webkit-nbsp-mode: space; =
-webkit-line-break: after-white-space; ">Hi =
Roni,<div><br></div><div>Just to finish up on your last comments . . =
.</div><div><br></div><div>-- dan</div><div><br><div><div><div>On Aug =
12, 2009, at 3:15 AM, Roni Even wrote:</div><br =
class=3D"Apple-interchange-newline"><blockquote type=3D"cite"><span =
class=3D"Apple-style-span" style=3D"border-collapse: separate; color: =
rgb(0, 0, 0); font-family: Helvetica; font-size: medium; font-style: =
normal; font-variant: normal; font-weight: normal; letter-spacing: =
normal; line-height: normal; orphans: 2; text-align: auto; text-indent: =
0px; text-transform: none; white-space: normal; widows: 2; word-spacing: =
0px; -webkit-border-horizontal-spacing: 0px; =
-webkit-border-vertical-spacing: 0px; =
-webkit-text-decorations-in-effect: none; -webkit-text-size-adjust: =
auto; -webkit-text-stroke-width: 0px; "><div lang=3D"EN-US" link=3D"blue" =
vlink=3D"purple" style=3D"word-wrap: break-word; -webkit-nbsp-mode: =
space; -webkit-line-break: after-white-space; "><div =
class=3D"Section1"><div style=3D"margin-top: 0in; margin-right: 0in; =
margin-bottom: 0.0001pt; margin-left: 0in; font-size: 12pt; font-family: =
'Times New Roman', serif; "><span style=3D"font-size: 11pt; font-family: =
Calibri, sans-serif; color: rgb(31, 73, 125); ">Hi =
Dan,<o:p></o:p></span></div><div style=3D"margin-top: 0in; margin-right: =
0in; margin-bottom: 0.0001pt; margin-left: 0in; font-size: 12pt; =
font-family: 'Times New Roman', serif; "><span style=3D"font-size: 11pt; =
font-family: Calibri, sans-serif; color: rgb(31, 73, 125); ">I =
understand your explanation about all these "vendor specific" parameter. =
I think that since this a standard track document there should be some =
text explaining the usage of these parameters as well as making a note =
that since these are vendor specific information you cannot compare the =
values coming from different =
vendors</span></div></div></div></span></blockquote><div><br></div>Thank =
you. &nbsp;I will note this in the next draft and suggest how these =
parameters may be used in light of their vendor =
dependence.</div><div><br><blockquote type=3D"cite"><span =
class=3D"Apple-style-span" style=3D"border-collapse: separate; color: =
rgb(0, 0, 0); font-family: Helvetica; font-size: medium; font-style: =
normal; font-variant: normal; font-weight: normal; letter-spacing: =
normal; line-height: normal; orphans: 2; text-align: auto; text-indent: =
0px; text-transform: none; white-space: normal; widows: 2; word-spacing: =
0px; -webkit-border-horizontal-spacing: 0px; =
-webkit-border-vertical-spacing: 0px; =
-webkit-text-decorations-in-effect: none; -webkit-text-size-adjust: =
auto; -webkit-text-stroke-width: 0px; "><div lang=3D"EN-US" link=3D"blue" =
vlink=3D"purple" style=3D"word-wrap: break-word; -webkit-nbsp-mode: =
space; -webkit-line-break: after-white-space; "><div =
class=3D"Section1"><div style=3D"margin-top: 0in; margin-right: 0in; =
margin-bottom: 0.0001pt; margin-left: 0in; font-size: 12pt; font-family: =
'Times New Roman', serif; "><span style=3D"font-size: 11pt; font-family: =
Calibri, sans-serif; color: rgb(31, 73, 125); =
"><o:p></o:p></span></div><div style=3D"margin-top: 0in; margin-right: =
0in; margin-bottom: 0.0001pt; margin-left: 0in; font-size: 12pt; =
font-family: 'Times New Roman', serif; "><span style=3D"font-size: 11pt; =
font-family: Calibri, sans-serif; color: rgb(31, 73, 125); =
"><o:p>&nbsp;</o:p></span></div><div style=3D"margin-top: 0in; =
margin-right: 0in; margin-bottom: 0.0001pt; margin-left: 0in; font-size: =
12pt; font-family: 'Times New Roman', serif; "><span style=3D"font-size: =
11pt; font-family: Calibri, sans-serif; color: rgb(31, 73, 125); =
"><o:p>&nbsp;</o:p></span></div><div style=3D"margin-top: 0in; =
margin-right: 0in; margin-bottom: 0.0001pt; margin-left: 0in; font-size: =
12pt; font-family: 'Times New Roman', serif; "><span style=3D"font-size: =
11pt; font-family: Calibri, sans-serif; color: rgb(31, 73, 125); ">As =
for my comment number 5 on payload type 96. My comment was that if the =
m-line has a payload type number of 96 you must have a a=3Drtpmap line =
mapping 96 to a specific subtype name while for pcmu it is not mandatory =
to have a=3Drtpmap like you have in your examples since payload type =
number 0 is a static payload type number assigned to =
pcmu<o:p></o:p></span></div><div style=3D"margin-top: 0in; margin-right: =
0in; margin-bottom: 0.0001pt; margin-left: 0in; font-size: 12pt; =
font-family: 'Times New Roman', serif; "><span style=3D"font-size: 11pt; =
font-family: Calibri, sans-serif; color: rgb(31, 73, 125); =
"><o:p>&nbsp;</o:p></span></div></div></div></span></blockquote><div><br><=
/div>I'm sorry, I did not explain this very well. &nbsp;I understood =
your comment. &nbsp;My reply was that of the three examples, example 2 =
did actually provide the a=3Drtpmap line for 96. &nbsp;Since the payload =
type of 96 should not even have been included in the first and third =
examples, once I removed it from those two examples all three contained =
the proper a=3Drtpmap lines.</div><div>Although not necessary to have an =
a=3Drtpmap line for payload type 0, others in the past had requested it =
so I left it in.</div><div><br><blockquote type=3D"cite"><span =
class=3D"Apple-style-span" style=3D"border-collapse: separate; color: =
rgb(0, 0, 0); font-family: Helvetica; font-size: medium; font-style: =
normal; font-variant: normal; font-weight: normal; letter-spacing: =
normal; line-height: normal; orphans: 2; text-align: auto; text-indent: =
0px; text-transform: none; white-space: normal; widows: 2; word-spacing: =
0px; -webkit-border-horizontal-spacing: 0px; =
-webkit-border-vertical-spacing: 0px; =
-webkit-text-decorations-in-effect: none; -webkit-text-size-adjust: =
auto; -webkit-text-stroke-width: 0px; "><div lang=3D"EN-US" link=3D"blue" =
vlink=3D"purple" style=3D"word-wrap: break-word; -webkit-nbsp-mode: =
space; -webkit-line-break: after-white-space; "><div =
class=3D"Section1"><div style=3D"margin-top: 0in; margin-right: 0in; =
margin-bottom: 0.0001pt; margin-left: 0in; font-size: 12pt; font-family: =
'Times New Roman', serif; "><span style=3D"font-size: 11pt; font-family: =
Calibri, sans-serif; color: rgb(31, 73, 125); =
"><o:p>&nbsp;</o:p></span></div><div style=3D"margin-top: 0in; =
margin-right: 0in; margin-bottom: 0.0001pt; margin-left: 0in; font-size: =
12pt; font-family: 'Times New Roman', serif; "><span style=3D"font-size: =
11pt; font-family: Calibri, sans-serif; color: rgb(31, 73, 125); ">Roni =
Even<o:p></o:p></span></div><div style=3D"margin-top: 0in; margin-right: =
0in; margin-bottom: 0.0001pt; margin-left: 0in; font-size: 12pt; =
font-family: 'Times New Roman', serif; "><span style=3D"font-size: 11pt; =
font-family: Calibri, sans-serif; color: rgb(31, 73, 125); =
"><o:p>&nbsp;</o:p></span></div><div style=3D"border-top-style: none; =
border-right-style: none; border-bottom-style: none; border-width: =
initial; border-color: initial; border-left-style: solid; =
border-left-color: blue; border-left-width: 1.5pt; padding-top: 0in; =
padding-right: 0in; padding-bottom: 0in; padding-left: 4pt; "><div><div =
style=3D"border-right-style: none; border-bottom-style: none; =
border-left-style: none; border-width: initial; border-color: initial; =
border-top-style: solid; border-top-color: rgb(181, 196, 223); =
border-top-width: 1pt; padding-top: 3pt; padding-right: 0in; =
padding-bottom: 0in; padding-left: 0in; position: static; z-index: auto; =
"><div style=3D"margin-top: 0in; margin-right: 0in; margin-bottom: =
0.0001pt; margin-left: 0in; font-size: 12pt; font-family: 'Times New =
Roman', serif; "><b><span style=3D"font-size: 10pt; font-family: Tahoma, =
sans-serif; ">From:</span></b><span style=3D"font-size: 10pt; =
font-family: Tahoma, sans-serif; "><span =
class=3D"Apple-converted-space">&nbsp;</span>Dan Burnett [<a =
href=3D"mailto:dburnett@voxeo.com" style=3D"color: blue; =
text-decoration: underline; ">mailto:dburnett@voxeo.com</a>]<span =
class=3D"Apple-converted-space">&nbsp;</span><br><b>Sent:</b><span =
class=3D"Apple-converted-space">&nbsp;</span>Tuesday, August 11, 2009 =
9:22 PM<br><b>To:</b><span =
class=3D"Apple-converted-space">&nbsp;</span>Roni =
Even<br><b>Cc:</b><span class=3D"Apple-converted-space">&nbsp;</span><a =
href=3D"mailto:sarvi@cisco.com" style=3D"color: blue; text-decoration: =
underline; ">sarvi@cisco.com</a>;<span =
class=3D"Apple-converted-space">&nbsp;</span><a =
href=3D"mailto:oran@cisco.com" style=3D"color: blue; text-decoration: =
underline; ">oran@cisco.com</a>; 'Eric Burger';<span =
class=3D"Apple-converted-space">&nbsp;</span><a =
href=3D"mailto:speechsc@ietf.org" style=3D"color: blue; text-decoration: =
underline; ">speechsc@ietf.org</a>;<span =
class=3D"Apple-converted-space">&nbsp;</span><a =
href=3D"mailto:rai@ietf.org" style=3D"color: blue; text-decoration: =
underline; ">rai@ietf.org</a><br><b>Subject:</b><span =
class=3D"Apple-converted-space">&nbsp;</span>Re: RAI review of =
draft-ietf-speechsc-mrcpv2-19<o:p></o:p></span></div></div></div><div =
style=3D"margin-top: 0in; margin-right: 0in; margin-bottom: 0.0001pt; =
margin-left: 0in; font-size: 12pt; font-family: 'Times New Roman', =
serif; "><o:p>&nbsp;</o:p></div><div style=3D"margin-top: 0in; =
margin-right: 0in; margin-bottom: 0.0001pt; margin-left: 0in; font-size: =
12pt; font-family: 'Times New Roman', serif; =
"><o:p>&nbsp;</o:p></div><div><div><div style=3D"margin-top: 0in; =
margin-right: 0in; margin-bottom: 0.0001pt; margin-left: 0in; font-size: =
12pt; font-family: 'Times New Roman', serif; ">On Jul 7, 2009, at 3:40 =
PM, Roni Even wrote:<o:p></o:p></div></div><div style=3D"margin-top: =
0in; margin-right: 0in; margin-bottom: 0.0001pt; margin-left: 0in; =
font-size: 12pt; font-family: 'Times New Roman', serif; =
"><br><br><o:p></o:p></div><div><div><p class=3D"MsoCommentText" =
style=3D"margin-right: 0in; margin-left: 0in; font-size: 12pt; =
font-family: 'Times New Roman', serif; margin-bottom: 10pt; line-height: =
18px; "><span style=3D"font-size: 11pt; line-height: 17px; font-family: =
Calibri, sans-serif; color: black; ">Hi,</span><span style=3D"font-size: =
10pt; line-height: 14px; font-family: Calibri, sans-serif; color: black; =
"><o:p></o:p></span></p><p class=3D"MsoCommentText" style=3D"margin-right:=
 0in; margin-left: 0in; font-size: 12pt; font-family: 'Times New Roman', =
serif; margin-bottom: 10pt; line-height: 18px; "><span style=3D"font-size:=
 11pt; line-height: 17px; font-family: Calibri, sans-serif; color: =
black; ">I was assigned to do a RAI review of the draft. &nbsp;The draft =
looks ready for publication to me. I have some comments mostly =
editorial.</span><span style=3D"font-size: 10pt; line-height: 14px; =
font-family: Calibri, sans-serif; color: black; =
"><o:p></o:p></span></p><p class=3D"MsoCommentText" style=3D"margin-right:=
 0in; margin-left: 0in; font-size: 12pt; font-family: 'Times New Roman', =
serif; margin-bottom: 10pt; line-height: 18px; "><span style=3D"font-size:=
 11pt; line-height: 17px; font-family: Calibri, sans-serif; color: =
black; ">The only issue I see that is not pure editorial is the issue of =
the different parameters like confidence threshold, sensitivity level =
(see comments 11, 13, 15, 16 and 17). I think that some clarification on =
the semantics and the scale (for example are the values linearly spaced) =
as well as when they are useful will be helpful to =
implementers.</span><span style=3D"font-size: 10pt; line-height: 14px; =
font-family: Calibri, sans-serif; color: black; =
"><o:p></o:p></span></p><p class=3D"MsoCommentText" style=3D"margin-right:=
 0in; margin-left: 0in; font-size: 12pt; font-family: 'Times New Roman', =
serif; margin-bottom: 10pt; text-indent: -0.25in; line-height: 18px; =
"><span style=3D"font-size: 11pt; line-height: 17px; font-family: =
Calibri, sans-serif; color: black; ">1.</span><span style=3D"font-size: =
7pt; line-height: 10px; color: black; =
">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;<span =
class=3D"apple-converted-space">&nbsp;</span></span><span =
style=3D"font-size: 11pt; line-height: 17px; font-family: Calibri, =
sans-serif; color: black; ">In figure 1 Expand the abbreviations TTS, =
ASR, SV , SI and how they are related to the media resource types in =
3.1</span><span style=3D"font-size: 10pt; line-height: 14px; =
font-family: Calibri, sans-serif; color: black; =
"><o:p></o:p></span></p></div></div><div><div style=3D"margin-top: 0in; =
margin-right: 0in; margin-bottom: 0.0001pt; margin-left: 0in; font-size: =
12pt; font-family: 'Times New Roman', serif; =
"><o:p>&nbsp;</o:p></div></div><div style=3D"margin-top: 0in; =
margin-right: 0in; margin-bottom: 0.0001pt; margin-left: 0in; font-size: =
12pt; font-family: 'Times New Roman', serif; ">Done. &nbsp;Added some =
text explaining Figure 1 and enhanced Figure 1 slightly for =
clarification.<br><br><o:p></o:p></div><div><div><p =
class=3D"MsoCommentText" style=3D"margin-right: 0in; margin-left: 0in; =
font-size: 12pt; font-family: 'Times New Roman', serif; margin-bottom: =
10pt; text-indent: -0.25in; line-height: 18px; "><span style=3D"font-size:=
 11pt; line-height: 17px; font-family: Calibri, sans-serif; color: =
black; ">2.</span><span style=3D"font-size: 7pt; line-height: 10px; =
color: black; ">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;<span =
class=3D"apple-converted-space">&nbsp;</span></span><span =
style=3D"font-size: 11pt; line-height: 17px; font-family: Calibri, =
sans-serif; color: black; ">In figure 1 there is a SIP dialog between =
the MRCPv2 client and the media source/sink, what is this dialog, I only =
saw in section 4 a dialog between the client and server.</span><span =
style=3D"font-size: 10pt; line-height: 14px; font-family: Calibri, =
sans-serif; color: black; "><o:p></o:p></span></p></div></div><div><div =
style=3D"margin-top: 0in; margin-right: 0in; margin-bottom: 0.0001pt; =
margin-left: 0in; font-size: 12pt; font-family: 'Times New Roman', =
serif; ">Clarified in&nbsp;the first example of section 4.2 that the SIP =
dialog with the media source/sink is not =
shown.<o:p></o:p></div></div><blockquote style=3D"margin-top: 5pt; =
margin-bottom: 5pt; "><div><div><div><div style=3D"margin-top: 0in; =
margin-right: 0in; margin-bottom: 0.0001pt; margin-left: 0in; font-size: =
12pt; font-family: 'Times New Roman', serif; text-indent: -0.25in; =
"><span style=3D"font-size: 11pt; font-family: Calibri, sans-serif; =
color: black; ">3.</span><span style=3D"font-size: 7pt; color: black; =
">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;<span =
class=3D"apple-converted-space">&nbsp;</span></span><span =
style=3D"font-size: 11pt; font-family: Calibri, sans-serif; color: =
black; ">In section 3.2 you have =93For example:<span =
class=3D"apple-converted-space">&nbsp;</span><a =
href=3D"sip:mrcpv2@example.net" style=3D"color: blue; text-decoration: =
underline; "><span style=3D"color: windowtext; text-decoration: none; =
">sip:mrcpv2@example.net</span></a>=94 twice one after the =
other.</span><span style=3D"font-size: 10.5pt; font-family: Consolas; =
color: black; "><o:p></o:p></span></div></div><div><div =
style=3D"margin-top: 0in; margin-right: 0in; margin-bottom: 0.0001pt; =
margin-left: 0in; font-size: 12pt; font-family: 'Times New Roman', =
serif; "><span style=3D"font-size: 11pt; font-family: Calibri, =
sans-serif; color: black; ">&nbsp;</span><span style=3D"font-size: =
10.5pt; font-family: Consolas; color: black; =
"><o:p></o:p></span></div></div></div></div></blockquote><div =
style=3D"margin-top: 0in; margin-right: 0in; margin-bottom: 0.0001pt; =
margin-left: 0in; font-size: 12pt; font-family: 'Times New Roman', =
serif; ">Fixed.<o:p></o:p></div></div><div><div style=3D"margin-top: =
0in; margin-right: 0in; margin-bottom: 0.0001pt; margin-left: 0in; =
font-size: 12pt; font-family: 'Times New Roman', serif; =
"><br><br><o:p></o:p></div><div><div><div><div style=3D"margin-top: 0in; =
margin-right: 0in; margin-bottom: 0.0001pt; margin-left: 0in; font-size: =
12pt; font-family: 'Times New Roman', serif; text-indent: -0.25in; =
"><span style=3D"font-size: 11pt; font-family: Calibri, sans-serif; =
color: black; ">4.</span><span style=3D"font-size: 7pt; color: black; =
">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;<span =
class=3D"apple-converted-space">&nbsp;</span></span><span =
style=3D"font-size: 11pt; font-family: Calibri, sans-serif; color: =
black; ">In the example in section 4.2 you =93a=3Dcmid:1=94, cmid is =
specified later in the document so maybe you can add some reference to =
where it is specified</span><span style=3D"font-size: 10.5pt; =
font-family: Consolas; color: black; =
"><o:p></o:p></span></div></div></div></div><div><div style=3D"margin-top:=
 0in; margin-right: 0in; margin-bottom: 0.0001pt; margin-left: 0in; =
font-size: 12pt; font-family: 'Times New Roman', serif; =
"><o:p>&nbsp;</o:p></div></div><div style=3D"margin-top: 0in; =
margin-right: 0in; margin-bottom: 0.0001pt; margin-left: 0in; font-size: =
12pt; font-family: 'Times New Roman', serif; =
">Done.<o:p></o:p></div></div><div><div style=3D"margin-top: 0in; =
margin-right: 0in; margin-bottom: 0.0001pt; margin-left: 0in; font-size: =
12pt; font-family: 'Times New Roman', serif; =
"><br><br><o:p></o:p></div><div><div><div><div style=3D"margin-top: 0in; =
margin-right: 0in; margin-bottom: 0.0001pt; margin-left: 0in; font-size: =
12pt; font-family: 'Times New Roman', serif; "><span style=3D"font-size: =
11pt; font-family: Calibri, sans-serif; color: black; =
">&nbsp;</span><span style=3D"font-size: 10.5pt; font-family: Consolas; =
color: black; "><o:p></o:p></span></div></div><div><div =
style=3D"margin-top: 0in; margin-right: 0in; margin-bottom: 0.0001pt; =
margin-left: 0in; font-size: 12pt; font-family: 'Times New Roman', =
serif; text-indent: -0.25in; "><span style=3D"font-size: 11pt; =
font-family: Calibri, sans-serif; color: black; ">5.</span><span =
style=3D"font-size: 7pt; color: black; =
">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;<span =
class=3D"apple-converted-space">&nbsp;</span></span><span =
style=3D"font-size: 11pt; font-family: Calibri, sans-serif; color: =
black; ">In the example is section 4.2 and in following examples you =
have =93m=3Daudio 49170 RTP/AVP 0 96=94 but do not have an rtpmap =
parameter for mapping 96 (dynamic payload type number) to a media =
encoding name.</span><span style=3D"font-size: 10.5pt; font-family: =
Consolas; color: black; =
"><o:p></o:p></span></div></div></div></div><div><div style=3D"margin-top:=
 0in; margin-right: 0in; margin-bottom: 0.0001pt; margin-left: 0in; =
font-size: 12pt; font-family: 'Times New Roman', serif; =
"><o:p>&nbsp;</o:p></div></div><div style=3D"margin-top: 0in; =
margin-right: 0in; margin-bottom: 0.0001pt; margin-left: 0in; font-size: =
12pt; font-family: 'Times New Roman', serif; ">It is not in the first or =
third examples (Synthesizer only), but it is in the second example =
(Recognizer). &nbsp;I have removed 96 as an option for the =
Synthesizer-only examples but let it remain as an addition for the =
Recognizer example.<o:p></o:p></div></div><div><div style=3D"margin-top: =
0in; margin-right: 0in; margin-bottom: 0.0001pt; margin-left: 0in; =
font-size: 12pt; font-family: 'Times New Roman', serif; =
"><br><br><o:p></o:p></div><div><div><div><div style=3D"margin-top: 0in; =
margin-right: 0in; margin-bottom: 0.0001pt; margin-left: 0in; font-size: =
12pt; font-family: 'Times New Roman', serif; "><span style=3D"font-size: =
11pt; font-family: Calibri, sans-serif; color: black; =
">&nbsp;</span><span style=3D"font-size: 10.5pt; font-family: Consolas; =
color: black; "><o:p></o:p></span></div></div><div><div =
style=3D"margin-top: 0in; margin-right: 0in; margin-bottom: 0.0001pt; =
margin-left: 0in; font-size: 12pt; font-family: 'Times New Roman', =
serif; text-indent: -0.25in; "><span style=3D"font-size: 11pt; =
font-family: Calibri, sans-serif; color: black; ">6.</span><span =
style=3D"font-size: 7pt; color: black; =
">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;<span =
class=3D"apple-converted-space">&nbsp;</span></span><span =
style=3D"font-size: 11pt; font-family: Calibri, sans-serif; color: =
black; ">In section 4.3 =93Also note that more that one media session =
can be associated with a single resource if need be, but this scenario =
is not useful for the current set of resources=94. There is a typo the =
second =93that=94 should be =93than=94. I am also not sure if the =
current syntax in this document can support the mode.</span><span =
style=3D"font-size: 10.5pt; font-family: Consolas; color: black; =
"><o:p></o:p></span></div></div><div><div style=3D"margin-top: 0in; =
margin-right: 0in; margin-bottom: 0.0001pt; margin-left: 0in; font-size: =
12pt; font-family: 'Times New Roman', serif; "><span style=3D"font-size: =
11pt; font-family: Calibri, sans-serif; color: black; =
">&nbsp;</span><span style=3D"font-size: 10.5pt; font-family: Consolas; =
color: black; "><o:p></o:p></span></div></div></div></div><div =
style=3D"margin-top: 0in; margin-right: 0in; margin-bottom: 0.0001pt; =
margin-left: 0in; font-size: 12pt; font-family: 'Times New Roman', =
serif; ">Fixed the typo.<o:p></o:p></div></div><div><div =
style=3D"margin-top: 0in; margin-right: 0in; margin-bottom: 0.0001pt; =
margin-left: 0in; font-size: 12pt; font-family: 'Times New Roman', =
serif; "><br><br><o:p></o:p></div><div><div><div><div style=3D"margin-top:=
 0in; margin-right: 0in; margin-bottom: 0.0001pt; margin-left: 0in; =
font-size: 12pt; font-family: 'Times New Roman', serif; "><span =
style=3D"font-size: 11pt; font-family: Calibri, sans-serif; color: =
black; ">&nbsp;</span><span style=3D"font-size: 10.5pt; font-family: =
Consolas; color: black; "><o:p></o:p></span></div></div><div><div =
style=3D"margin-top: 0in; margin-right: 0in; margin-bottom: 0.0001pt; =
margin-left: 0in; font-size: 12pt; font-family: 'Times New Roman', =
serif; text-indent: -0.25in; "><span style=3D"font-size: 11pt; =
font-family: Calibri, sans-serif; color: black; ">7.</span><span =
style=3D"font-size: 7pt; color: black; =
">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;<span =
class=3D"apple-converted-space">&nbsp;</span></span><span =
style=3D"font-size: 11pt; font-family: Calibri, sans-serif; color: =
black; ">In section 4.3 =93The formatting of the"cmid" attribute in SDP =
RFC3388 [RFC4566]=94. I think you meant SDP grouping and need the =
reference to RFC 3388.</span><span style=3D"font-size: 10.5pt; =
font-family: Consolas; color: black; =
"><o:p></o:p></span></div></div><div><div style=3D"margin-top: 0in; =
margin-right: 0in; margin-bottom: 0.0001pt; margin-left: 0in; font-size: =
12pt; font-family: 'Times New Roman', serif; "><span style=3D"font-size: =
11pt; font-family: Calibri, sans-serif; color: black; =
">&nbsp;</span><span style=3D"font-size: 10.5pt; font-family: Consolas; =
color: black; "><o:p></o:p></span></div></div></div></div><div =
style=3D"margin-top: 0in; margin-right: 0in; margin-bottom: 0.0001pt; =
margin-left: 0in; font-size: 12pt; font-family: 'Times New Roman', =
serif; ">I removed the reference altogether because it already exists =
(correctly) earlier in the paragraph.<o:p></o:p></div></div><div><div =
style=3D"margin-top: 0in; margin-right: 0in; margin-bottom: 0.0001pt; =
margin-left: 0in; font-size: 12pt; font-family: 'Times New Roman', =
serif; "><br><br><o:p></o:p></div><div><div><div><div style=3D"margin-top:=
 0in; margin-right: 0in; margin-bottom: 0.0001pt; margin-left: 0in; =
font-size: 12pt; font-family: 'Times New Roman', serif; "><span =
style=3D"font-size: 11pt; font-family: Calibri, sans-serif; color: =
black; ">&nbsp;</span><span style=3D"font-size: 10.5pt; font-family: =
Consolas; color: black; "><o:p></o:p></span></div></div><div><div =
style=3D"margin-top: 0in; margin-right: 0in; margin-bottom: 0.0001pt; =
margin-left: 0in; font-size: 12pt; font-family: 'Times New Roman', =
serif; text-indent: -0.25in; "><span style=3D"font-size: 11pt; =
font-family: Calibri, sans-serif; color: black; ">8.</span><span =
style=3D"font-size: 7pt; color: black; =
">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;<span =
class=3D"apple-converted-space">&nbsp;</span></span><span =
style=3D"font-size: 11pt; font-family: Calibri, sans-serif; color: =
black; ">In section 5.1 =93The message-length field specifies the length =
of the message, including the start-line=94 is the length in Bytes, =
there is no unit specified.</span><span style=3D"font-size: 10.5pt; =
font-family: Consolas; color: black; =
"><o:p></o:p></span></div></div></div></div><div><div style=3D"margin-top:=
 0in; margin-right: 0in; margin-bottom: 0.0001pt; margin-left: 0in; =
font-size: 12pt; font-family: 'Times New Roman', serif; =
"><o:p>&nbsp;</o:p></div></div><div style=3D"margin-top: 0in; =
margin-right: 0in; margin-bottom: 0.0001pt; margin-left: 0in; font-size: =
12pt; font-family: 'Times New Roman', serif; ">Changed "length of the =
message" to "length of the message in =
bytes".<o:p></o:p></div></div><div><div style=3D"margin-top: 0in; =
margin-right: 0in; margin-bottom: 0.0001pt; margin-left: 0in; font-size: =
12pt; font-family: 'Times New Roman', serif; =
"><br><br><o:p></o:p></div><div><div><div><div style=3D"margin-top: 0in; =
margin-right: 0in; margin-bottom: 0.0001pt; margin-left: 0in; font-size: =
12pt; font-family: 'Times New Roman', serif; "><span style=3D"font-size: =
11pt; font-family: Calibri, sans-serif; color: black; =
">&nbsp;</span><span style=3D"font-size: 10.5pt; font-family: Consolas; =
color: black; "><o:p></o:p></span></div></div><div><div =
style=3D"margin-top: 0in; margin-right: 0in; margin-bottom: 0.0001pt; =
margin-left: 0in; font-size: 12pt; font-family: 'Times New Roman', =
serif; text-indent: -0.25in; "><span style=3D"font-size: 11pt; =
font-family: Calibri, sans-serif; color: black; ">9.</span><span =
style=3D"font-size: 7pt; color: black; =
">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;<span =
class=3D"apple-converted-space">&nbsp;</span></span><span =
style=3D"font-size: 11pt; font-family: Calibri, sans-serif; color: =
black; ">In section 6.3.1, typo you have =93Verfication =93 instead of =
verification. It appears twice in the section.</span><span =
style=3D"font-size: 10.5pt; font-family: Consolas; color: black; =
"><o:p></o:p></span></div></div></div></div><div><div style=3D"margin-top:=
 0in; margin-right: 0in; margin-bottom: 0.0001pt; margin-left: 0in; =
font-size: 12pt; font-family: 'Times New Roman', serif; =
"><o:p>&nbsp;</o:p></div></div><div style=3D"margin-top: 0in; =
margin-right: 0in; margin-bottom: 0.0001pt; margin-left: 0in; font-size: =
12pt; font-family: 'Times New Roman', serif; =
">Fixed.<o:p></o:p></div></div><div><div style=3D"margin-top: 0in; =
margin-right: 0in; margin-bottom: 0.0001pt; margin-left: 0in; font-size: =
12pt; font-family: 'Times New Roman', serif; =
"><br><br><o:p></o:p></div><div><div><div><div style=3D"margin-top: 0in; =
margin-right: 0in; margin-bottom: 0.0001pt; margin-left: 0in; font-size: =
12pt; font-family: 'Times New Roman', serif; "><span style=3D"font-size: =
11pt; font-family: Calibri, sans-serif; color: black; =
">&nbsp;</span><span style=3D"font-size: 10.5pt; font-family: Consolas; =
color: black; "><o:p></o:p></span></div></div><div><div =
style=3D"margin-top: 0in; margin-right: 0in; margin-bottom: 0.0001pt; =
margin-left: 0in; font-size: 12pt; font-family: 'Times New Roman', =
serif; text-indent: -0.25in; "><span style=3D"font-size: 11pt; =
font-family: Calibri, sans-serif; color: black; ">10.</span><span =
style=3D"font-size: 7pt; color: black; ">&nbsp;&nbsp;<span =
class=3D"apple-converted-space">&nbsp;</span></span><span =
style=3D"font-size: 11pt; font-family: Calibri, sans-serif; color: =
black; ">In the example in section 7 you have =93m=3Daudio 0 RTP/AVP 0 1 =
3=94 payload type 1 was deleted from the IANA registry, maybe have =
another payload type number.</span><span style=3D"font-size: 10.5pt; =
font-family: Consolas; color: black; =
"><o:p></o:p></span></div></div></div></div><div><div style=3D"margin-top:=
 0in; margin-right: 0in; margin-bottom: 0.0001pt; margin-left: 0in; =
font-size: 12pt; font-family: 'Times New Roman', serif; =
"><o:p>&nbsp;</o:p></div></div><div style=3D"margin-top: 0in; =
margin-right: 0in; margin-bottom: 0.0001pt; margin-left: 0in; font-size: =
12pt; font-family: 'Times New Roman', serif; ">I just removed that =
payload type. &nbsp;It is not germane to the =
example.<o:p></o:p></div></div><div><div style=3D"margin-top: 0in; =
margin-right: 0in; margin-bottom: 0.0001pt; margin-left: 0in; font-size: =
12pt; font-family: 'Times New Roman', serif; =
"><br><br><o:p></o:p></div><div><div><div><div style=3D"margin-top: 0in; =
margin-right: 0in; margin-bottom: 0.0001pt; margin-left: 0in; font-size: =
12pt; font-family: 'Times New Roman', serif; "><span style=3D"font-size: =
11pt; font-family: Calibri, sans-serif; color: black; =
">&nbsp;</span><span style=3D"font-size: 10.5pt; font-family: Consolas; =
color: black; "><o:p></o:p></span></div></div><div><div =
style=3D"margin-top: 0in; margin-right: 0in; margin-bottom: 0.0001pt; =
margin-left: 0in; font-size: 12pt; font-family: 'Times New Roman', =
serif; text-indent: -0.25in; "><span style=3D"font-size: 11pt; =
font-family: Calibri, sans-serif; color: black; ">11.</span><span =
style=3D"font-size: 7pt; color: black; ">&nbsp;&nbsp;<span =
class=3D"apple-converted-space">&nbsp;</span></span><span =
style=3D"font-size: 11pt; font-family: Calibri, sans-serif; color: =
black; ">In section 9.4.1, 9.4.2 and 9.4.3 you specify confidence =
threshold, sensitivity level and speed vs accuracy. What is the scale =
here; is it linear between 0 and 1. What is the absolute value of the =
number, if you receive the same confidence level from two recognizers =
are they the same (e.g. when using context block to switch =
servers).&nbsp; For the speed vs accuracy, how does the client know what =
is the relation between the value and the number of available sessions, =
since this seems to be the reason for using this parameter.</span><span =
style=3D"font-size: 10.5pt; font-family: Consolas; color: black; =
"><o:p></o:p></span></div></div><div><div style=3D"margin-top: 0in; =
margin-right: 0in; margin-bottom: 0.0001pt; margin-left: 0in; font-size: =
12pt; font-family: 'Times New Roman', serif; "><span style=3D"font-size: =
11pt; font-family: Calibri, sans-serif; color: black; =
">&nbsp;</span><span style=3D"font-size: 10.5pt; font-family: Consolas; =
color: black; "><o:p></o:p></span></div></div></div></div><div =
style=3D"margin-top: 0in; margin-right: 0in; margin-bottom: 0.0001pt; =
margin-left: 0in; font-size: 12pt; font-family: 'Times New Roman', =
serif; ">The interpretation of all of these parameters is =
implementation-specific because the underlying technologies used to =
implement them vary and can even be proprietary. &nbsp;In practice the =
speech recognition and synthesis and speaker authentication communities =
have lived with this state of affairs for many years, and users of other =
APIs for this technology are well aware of and have built applications =
that accommodate this variability in interpretation. &nbsp;It is outside =
the scope of this specification to attempt to standardize =
interpretations of these values.<o:p></o:p></div></div><div><div =
style=3D"margin-top: 0in; margin-right: 0in; margin-bottom: 0.0001pt; =
margin-left: 0in; font-size: 12pt; font-family: 'Times New Roman', =
serif; "><br><br><o:p></o:p></div><div><div><div><div style=3D"margin-top:=
 0in; margin-right: 0in; margin-bottom: 0.0001pt; margin-left: 0in; =
font-size: 12pt; font-family: 'Times New Roman', serif; text-indent: =
-0.25in; "><span style=3D"font-size: 11pt; font-family: Calibri, =
sans-serif; color: black; ">12.</span><span style=3D"font-size: 7pt; =
color: black; ">&nbsp;&nbsp;<span =
class=3D"apple-converted-space">&nbsp;</span></span><span =
style=3D"font-size: 11pt; font-family: Calibri, sans-serif; color: =
black; ">In 9.4.9 and in 10.4.8, 11.4.11 what are the values for =
media-type-value, you also mention audio and video but it looks to me =
that this document only discusses voice.</span><span style=3D"font-size: =
10.5pt; font-family: Consolas; color: black; =
"><o:p></o:p></span></div></div></div></div><div><div style=3D"margin-top:=
 0in; margin-right: 0in; margin-bottom: 0.0001pt; margin-left: 0in; =
font-size: 12pt; font-family: 'Times New Roman', serif; =
"><o:p>&nbsp;</o:p></div></div><div style=3D"margin-top: 0in; =
margin-right: 0in; margin-bottom: 0.0001pt; margin-left: 0in; font-size: =
12pt; font-family: 'Times New Roman', serif; ">Yes. &nbsp;Although the =
original intent was to record speech, application authors today are =
beginning to look at ways to incorporate other audio or video. &nbsp;The =
intent of the sentences in these sections is to clarify that the =
specification itself imposes no restriction on the types of media that =
are allowed.<o:p></o:p></div></div><div><div style=3D"margin-top: 0in; =
margin-right: 0in; margin-bottom: 0.0001pt; margin-left: 0in; font-size: =
12pt; font-family: 'Times New Roman', serif; =
"><br><br><o:p></o:p></div><div><div><div><div style=3D"margin-top: 0in; =
margin-right: 0in; margin-bottom: 0.0001pt; margin-left: 0in; font-size: =
12pt; font-family: 'Times New Roman', serif; "><span style=3D"font-size: =
11pt; font-family: Calibri, sans-serif; color: black; =
">&nbsp;</span><span style=3D"font-size: 10.5pt; font-family: Consolas; =
color: black; "><o:p></o:p></span></div></div><div><div =
style=3D"margin-top: 0in; margin-right: 0in; margin-bottom: 0.0001pt; =
margin-left: 0in; font-size: 12pt; font-family: 'Times New Roman', =
serif; text-indent: -0.25in; "><span style=3D"font-size: 11pt; =
font-family: Calibri, sans-serif; color: black; ">13.</span><span =
style=3D"font-size: 7pt; color: black; ">&nbsp;&nbsp;<span =
class=3D"apple-converted-space">&nbsp;</span></span><span =
style=3D"font-size: 11pt; font-family: Calibri, sans-serif; color: =
black; ">In 9.4.35 and 9.4.36 what is the scale for the consistency =
here. How does one know what close means. What is the consistency =
between different recognizers.</span><span style=3D"font-size: 10.5pt; =
font-family: Consolas; color: black; =
"><o:p></o:p></span></div></div></div></div><div><div style=3D"margin-top:=
 0in; margin-right: 0in; margin-bottom: 0.0001pt; margin-left: 0in; =
font-size: 12pt; font-family: 'Times New Roman', serif; =
"><o:p>&nbsp;</o:p></div></div><div style=3D"margin-top: 0in; =
margin-right: 0in; margin-bottom: 0.0001pt; margin-left: 0in; font-size: =
12pt; font-family: 'Times New Roman', serif; ">The answer to question =
11, above, applies here as well.<o:p></o:p></div></div><div><div =
style=3D"margin-top: 0in; margin-right: 0in; margin-bottom: 0.0001pt; =
margin-left: 0in; font-size: 12pt; font-family: 'Times New Roman', =
serif; "><br><br><o:p></o:p></div><div><div><div><div style=3D"margin-top:=
 0in; margin-right: 0in; margin-bottom: 0.0001pt; margin-left: 0in; =
font-size: 12pt; font-family: 'Times New Roman', serif; "><span =
style=3D"font-size: 11pt; font-family: Calibri, sans-serif; color: =
black; ">&nbsp;</span><span style=3D"font-size: 10.5pt; font-family: =
Consolas; color: black; "><o:p></o:p></span></div></div><div><div =
style=3D"margin-top: 0in; margin-right: 0in; margin-bottom: 0.0001pt; =
margin-left: 0in; font-size: 12pt; font-family: 'Times New Roman', =
serif; text-indent: -0.25in; "><span style=3D"font-size: 11pt; =
font-family: Calibri, sans-serif; color: black; ">14.</span><span =
style=3D"font-size: 7pt; color: black; ">&nbsp;&nbsp;<span =
class=3D"apple-converted-space">&nbsp;</span></span><span =
style=3D"font-size: 11pt; font-family: Calibri, sans-serif; color: =
black; ">In section 9.6.3.3 in the example (figure 2) confidence should =
be 0.75 and not 75</span><span style=3D"font-size: 10.5pt; font-family: =
Consolas; color: black; =
"><o:p></o:p></span></div></div></div></div><div><div style=3D"margin-top:=
 0in; margin-right: 0in; margin-bottom: 0.0001pt; margin-left: 0in; =
font-size: 12pt; font-family: 'Times New Roman', serif; =
"><o:p>&nbsp;</o:p></div></div><div style=3D"margin-top: 0in; =
margin-right: 0in; margin-bottom: 0.0001pt; margin-left: 0in; font-size: =
12pt; font-family: 'Times New Roman', serif; =
">Fixed.<o:p></o:p></div></div><div><div style=3D"margin-top: 0in; =
margin-right: 0in; margin-bottom: 0.0001pt; margin-left: 0in; font-size: =
12pt; font-family: 'Times New Roman', serif; =
"><br><br><o:p></o:p></div><div><div><div><div style=3D"margin-top: 0in; =
margin-right: 0in; margin-bottom: 0.0001pt; margin-left: 0in; font-size: =
12pt; font-family: 'Times New Roman', serif; "><span style=3D"font-size: =
11pt; font-family: Calibri, sans-serif; color: black; =
">&nbsp;</span><span style=3D"font-size: 10.5pt; font-family: Consolas; =
color: black; "><o:p></o:p></span></div></div><div><div =
style=3D"margin-top: 0in; margin-right: 0in; margin-bottom: 0.0001pt; =
margin-left: 0in; font-size: 12pt; font-family: 'Times New Roman', =
serif; text-indent: -0.25in; "><span style=3D"font-size: 11pt; =
font-family: Calibri, sans-serif; color: black; ">15.</span><span =
style=3D"font-size: 7pt; color: black; ">&nbsp;&nbsp;<span =
class=3D"apple-converted-space">&nbsp;</span></span><span =
style=3D"font-size: 11pt; font-family: Calibri, sans-serif; color: =
black; ">In section 10.4.1 it is not clear how you measure the =
sensitivity in order to specify, is it based on some SNR translated to 0 =
to 1 scale?</span><span style=3D"font-size: 10.5pt; font-family: =
Consolas; color: black; =
"><o:p></o:p></span></div></div></div></div><div><div style=3D"margin-top:=
 0in; margin-right: 0in; margin-bottom: 0.0001pt; margin-left: 0in; =
font-size: 12pt; font-family: 'Times New Roman', serif; =
"><o:p>&nbsp;</o:p></div></div><div style=3D"margin-top: 0in; =
margin-right: 0in; margin-bottom: 0.0001pt; margin-left: 0in; font-size: =
12pt; font-family: 'Times New Roman', serif; ">The answer to question =
11, above, applies here as well.<o:p></o:p></div></div><div><div =
style=3D"margin-top: 0in; margin-right: 0in; margin-bottom: 0.0001pt; =
margin-left: 0in; font-size: 12pt; font-family: 'Times New Roman', =
serif; "><br><br><o:p></o:p></div><div><div><div><div style=3D"margin-top:=
 0in; margin-right: 0in; margin-bottom: 0.0001pt; margin-left: 0in; =
font-size: 12pt; font-family: 'Times New Roman', serif; "><span =
style=3D"font-size: 11pt; font-family: Calibri, sans-serif; color: =
black; ">&nbsp;</span><span style=3D"font-size: 10.5pt; font-family: =
Consolas; color: black; "><o:p></o:p></span></div></div><div><div =
style=3D"margin-top: 0in; margin-right: 0in; margin-bottom: 0.0001pt; =
margin-left: 0in; font-size: 12pt; font-family: 'Times New Roman', =
serif; text-indent: -0.25in; "><span style=3D"font-size: 11pt; =
font-family: Calibri, sans-serif; color: black; ">16.</span><span =
style=3D"font-size: 7pt; color: black; ">&nbsp;&nbsp;<span =
class=3D"apple-converted-space">&nbsp;</span></span><span =
style=3D"font-size: 11pt; font-family: Calibri, sans-serif; color: =
black; ">In 11.4.6 the same issue with the scale, how does the client =
know how to set a value when working with different speaker verification =
servers.</span><span style=3D"font-size: 10.5pt; font-family: Consolas; =
color: black; "><o:p></o:p></span></div></div></div></div><div><div =
style=3D"margin-top: 0in; margin-right: 0in; margin-bottom: 0.0001pt; =
margin-left: 0in; font-size: 12pt; font-family: 'Times New Roman', =
serif; "><o:p>&nbsp;</o:p></div></div><div style=3D"margin-top: 0in; =
margin-right: 0in; margin-bottom: 0.0001pt; margin-left: 0in; font-size: =
12pt; font-family: 'Times New Roman', serif; ">Ditto. &nbsp;I should =
point out that in all of these cases the parameters are typically passed =
directly to the engine, and their interpretations are defined (and =
described) in the vendors' documentation. &nbsp;The most common MRCPv2 =
server implementations are by the technology vendors themselves (the =
providers of the synthesis, recognition, and verification engines). =
&nbsp;This is commonly understood in this technology industry (meaning =
those who use this technology =
regularly).<o:p></o:p></div></div><div><div style=3D"margin-top: 0in; =
margin-right: 0in; margin-bottom: 0.0001pt; margin-left: 0in; font-size: =
12pt; font-family: 'Times New Roman', serif; =
"><br><br><o:p></o:p></div><div><div><div><div style=3D"margin-top: 0in; =
margin-right: 0in; margin-bottom: 0.0001pt; margin-left: 0in; font-size: =
12pt; font-family: 'Times New Roman', serif; "><span style=3D"font-size: =
11pt; font-family: Calibri, sans-serif; color: black; =
">&nbsp;</span><span style=3D"font-size: 10.5pt; font-family: Consolas; =
color: black; "><o:p></o:p></span></div></div><div><div =
style=3D"margin-top: 0in; margin-right: 0in; margin-bottom: 0.0001pt; =
margin-left: 0in; font-size: 12pt; font-family: 'Times New Roman', =
serif; text-indent: -0.25in; "><span style=3D"font-size: 11pt; =
font-family: Calibri, sans-serif; color: black; ">17.</span><span =
style=3D"font-size: 7pt; color: black; ">&nbsp;&nbsp;<span =
class=3D"apple-converted-space">&nbsp;</span></span><span =
style=3D"font-size: 11pt; font-family: Calibri, sans-serif; color: =
black; ">In 11.5.2.9 you state that the verification-score is not a =
probability, so what is it. How can the client decide if, for example, 0 =
is a good score for specifying the threshold.&nbsp; I also noticed that =
the values in the example in section 11.5.2.10 are very precise like =
0.98514 is this the expected precision. The examples here and in section =
11.11 do not show the threshold, if the threshold is required for this =
flow why not show it in the example?</span><span style=3D"font-size: =
10.5pt; font-family: Consolas; color: black; =
"><o:p></o:p></span></div></div></div></div><div><div style=3D"margin-top:=
 0in; margin-right: 0in; margin-bottom: 0.0001pt; margin-left: 0in; =
font-size: 12pt; font-family: 'Times New Roman', serif; =
"><o:p>&nbsp;</o:p></div></div><div style=3D"margin-top: 0in; =
margin-right: 0in; margin-bottom: 0.0001pt; margin-left: 0in; font-size: =
12pt; font-family: 'Times New Roman', serif; ">This parameter, as others =
mentioned above, has only a vendor-specific interpretation. &nbsp;In =
practice authors interpret these values based both on guidance from the =
technology vendors and via experimentation on large sets of recorded =
data.<o:p></o:p></div></div><div><div style=3D"margin-top: 0in; =
margin-right: 0in; margin-bottom: 0.0001pt; margin-left: 0in; font-size: =
12pt; font-family: 'Times New Roman', serif; =
"><o:p>&nbsp;</o:p></div></div><div><div style=3D"margin-top: 0in; =
margin-right: 0in; margin-bottom: 0.0001pt; margin-left: 0in; font-size: =
12pt; font-family: 'Times New Roman', serif; ">The =
Min-Verification-Score threshold is not required to be set. &nbsp;In =
many cases the technology vendor has a fairly good understanding of what =
the default threshold should be. &nbsp;The verification-score is =
returned, however, in case the application author determines (through =
experimentation, as described above) that the default threshold is not =
producing optimal results for the application. &nbsp;In that case the =
author can set the threshold to a different value or can set it to -1 =
and make the determination within the application itself based on the =
verification-score values.<o:p></o:p></div></div><div><div =
style=3D"margin-top: 0in; margin-right: 0in; margin-bottom: 0.0001pt; =
margin-left: 0in; font-size: 12pt; font-family: 'Times New Roman', =
serif; "><br><br><o:p></o:p></div><div><div><div><div style=3D"margin-top:=
 0in; margin-right: 0in; margin-bottom: 0.0001pt; margin-left: 0in; =
font-size: 12pt; font-family: 'Times New Roman', serif; "><span =
style=3D"font-size: 11pt; font-family: Calibri, sans-serif; color: =
black; ">&nbsp;</span><span style=3D"font-size: 10.5pt; font-family: =
Consolas; color: black; "><o:p></o:p></span></div></div><div><div =
style=3D"margin-top: 0in; margin-right: 0in; margin-bottom: 0.0001pt; =
margin-left: 0in; font-size: 12pt; font-family: 'Times New Roman', =
serif; text-indent: -0.25in; "><span style=3D"font-size: 11pt; =
font-family: Calibri, sans-serif; color: black; ">18.</span><span =
style=3D"font-size: 7pt; color: black; ">&nbsp;&nbsp;<span =
class=3D"apple-converted-space">&nbsp;</span></span><span =
style=3D"font-size: 11pt; font-family: Calibri, sans-serif; color: =
black; ">In section 12.3 the suggestion is to use SRTP as the mandatory =
interoperability mode. If the reason for mandating SRTP is for a common =
mode you should also decide on a key exchange mechanism. I suggest you =
look at<span class=3D"apple-converted-space">&nbsp;</span><a =
href=3D"http://tools.ietf.org/html/draft-ietf-avt-srtp-not-mandatory-02" =
style=3D"color: blue; text-decoration: underline; =
">http://tools.ietf.org/html/draft-ietf-avt-srtp-not-mandatory-02</a><span=
 class=3D"apple-converted-space">&nbsp;</span>for discussion on media =
security.</span><span style=3D"font-size: 10.5pt; font-family: Consolas; =
color: black; "><o:p></o:p></span></div></div></div></div><div><div =
style=3D"margin-top: 0in; margin-right: 0in; margin-bottom: 0.0001pt; =
margin-left: 0in; font-size: 12pt; font-family: 'Times New Roman', =
serif; "><o:p>&nbsp;</o:p></div></div><div style=3D"margin-top: 0in; =
margin-right: 0in; margin-bottom: 0.0001pt; margin-left: 0in; font-size: =
12pt; font-family: 'Times New Roman', serif; ">Based on the discussion =
between you and Dan York on the list, I will change =
this:<o:p></o:p></div></div><div><div style=3D"margin-top: 0in; =
margin-right: 0in; margin-bottom: 0.0001pt; margin-left: 0in; font-size: =
12pt; font-family: 'Times New Roman', serif; =
"><o:p>&nbsp;</o:p></div></div><div><pre style=3D"margin-top: 0in; =
margin-right: 0in; margin-bottom: 0.0001pt; margin-left: 0in; font-size: =
10pt; font-family: 'Courier New'; "><span class=3D"apple-style-span"><span=
 style=3D"font-size: 12pt; font-family: Helvetica, sans-serif; ">12.3. =
Media session protection&nbsp;</span></span><o:p></o:p></pre><pre =
style=3D"margin-top: 0in; margin-right: 0in; margin-bottom: 0.0001pt; =
margin-left: 0in; font-size: 10pt; font-family: 'Courier New'; "><span =
class=3D"apple-style-span"><span style=3D"font-size: 9pt; font-family: =
Helvetica, sans-serif; ">Sensitive data is also carried on media =
sessions terminating on MRCPv2 servers (the other end of a media channel =
may or may not be on the MRCPv2 client). This data includes the user's =
spoken utterances and the output of text-to-speech operations. MRCPv2 =
servers MUST support SRTP for protection of audio media sessions. MRCPv2 =
clients that originate or consume audio similarly MUST support SRTP. =
Alternative media channel protection MAY be used if desired (e.g. =
IPSEC).</span></span><o:p></o:p></pre></div><div><div style=3D"margin-top:=
 0in; margin-right: 0in; margin-bottom: 0.0001pt; margin-left: 0in; =
font-size: 12pt; font-family: 'Times New Roman', serif; =
"><o:p>&nbsp;</o:p></div></div><div><div style=3D"margin-top: 0in; =
margin-right: 0in; margin-bottom: 0.0001pt; margin-left: 0in; font-size: =
12pt; font-family: 'Times New Roman', serif; ">to =
this:<o:p></o:p></div></div><div><div style=3D"margin-top: 0in; =
margin-right: 0in; margin-bottom: 0.0001pt; margin-left: 0in; font-size: =
12pt; font-family: 'Times New Roman', serif; =
"><o:p>&nbsp;</o:p></div></div><div><pre style=3D"margin-top: 0in; =
margin-right: 0in; margin-bottom: 0.0001pt; margin-left: 0in; font-size: =
10pt; font-family: 'Courier New'; "><span class=3D"apple-style-span"><span=
 style=3D"font-size: 9pt; font-family: Helvetica, sans-serif; ">12.3. =
Media session protection&nbsp;</span></span><o:p></o:p></pre><pre =
style=3D"margin-top: 0in; margin-right: 0in; margin-bottom: 0.0001pt; =
margin-left: 0in; font-size: 10pt; font-family: 'Courier New'; "><span =
class=3D"apple-style-span"><span style=3D"font-size: 9pt; font-family: =
Helvetica, sans-serif; ">Sensitive data is also carried on media =
sessions terminating on MRCPv2 servers (the other end of a media channel =
may or may not be on the MRCPv2 client). This data includes the user's =
spoken utterances and the output of text-to-speech operations. MRCPv2 =
servers MUST support a security mechanism for protection of audio media =
sessions. MRCPv2 clients that originate or consume audio similarly MUST =
support a security mechanism for protection of the audio. If =
appropriate,&nbsp;usage of the Secure Real-time Transport Protocol =
(SRTP)&nbsp;[RFC3711] is =
recommended.</span></span><o:p></o:p></pre></div><div><blockquote =
style=3D"margin-top: 5pt; margin-bottom: 5pt; "><div><div><div><div =
style=3D"margin-top: 0in; margin-right: 0in; margin-bottom: 0.0001pt; =
margin-left: 0in; font-size: 12pt; font-family: 'Times New Roman', =
serif; "><span style=3D"font-size: 11pt; font-family: Calibri, =
sans-serif; color: black; ">&nbsp;</span><span style=3D"font-size: =
10.5pt; font-family: Consolas; color: black; =
"><o:p></o:p></span></div></div><div><div style=3D"margin-top: 0in; =
margin-right: 0in; margin-bottom: 0.0001pt; margin-left: 0in; font-size: =
12pt; font-family: 'Times New Roman', serif; text-indent: -0.25in; =
"><span style=3D"font-size: 11pt; font-family: Calibri, sans-serif; =
color: black; ">19.</span><span style=3D"font-size: 7pt; color: black; =
">&nbsp;&nbsp;<span =
class=3D"apple-converted-space">&nbsp;</span></span><span =
style=3D"font-size: 11pt; font-family: Calibri, sans-serif; color: =
black; ">In section13.7.2 you specify the attribute resource as session =
level yet in the example in section 4.2 it is a media level attribute. =
The same goes for the channel attribute</span><span style=3D"font-size: =
10.5pt; font-family: Consolas; color: black; =
"><o:p></o:p></span></div></div></div></div></blockquote><div><div =
style=3D"margin-top: 0in; margin-right: 0in; margin-bottom: 0.0001pt; =
margin-left: 0in; font-size: 12pt; font-family: 'Times New Roman', =
serif; "><o:p>&nbsp;</o:p></div></div><div style=3D"margin-top: 0in; =
margin-right: 0in; margin-bottom: 0.0001pt; margin-left: 0in; font-size: =
12pt; font-family: 'Times New Roman', serif; ">I have corrected both in =
section 13.7.2 to be media-level.<o:p></o:p></div></div><div><div =
style=3D"margin-top: 0in; margin-right: 0in; margin-bottom: 0.0001pt; =
margin-left: 0in; font-size: 12pt; font-family: 'Times New Roman', =
serif; "><br><br><o:p></o:p></div><div><div><div style=3D"margin-left: =
0.5in; "><div style=3D"margin-top: 0in; margin-right: 0in; =
margin-bottom: 0.0001pt; margin-left: 0in; font-size: 12pt; font-family: =
'Times New Roman', serif; "><span style=3D"font-size: 11pt; font-family: =
Calibri, sans-serif; color: black; =
">&nbsp;<o:p></o:p></span></div></div><div><div style=3D"margin-top: =
0in; margin-right: 0in; margin-bottom: 0.0001pt; margin-left: 0in; =
font-size: 12pt; font-family: 'Times New Roman', serif; "><span =
style=3D"font-size: 11pt; font-family: Calibri, sans-serif; color: =
black; ">Thanks</span><span style=3D"font-size: 10.5pt; font-family: =
Consolas; color: black; "><o:p></o:p></span></div></div><div><div =
style=3D"margin-top: 0in; margin-right: 0in; margin-bottom: 0.0001pt; =
margin-left: 0in; font-size: 12pt; font-family: 'Times New Roman', =
serif; "><span style=3D"font-size: 11pt; font-family: Calibri, =
sans-serif; color: black; ">&nbsp;</span><span style=3D"font-size: =
10.5pt; font-family: Consolas; color: black; =
"><o:p></o:p></span></div></div><div><div style=3D"margin-top: 0in; =
margin-right: 0in; margin-bottom: 0.0001pt; margin-left: 0in; font-size: =
12pt; font-family: 'Times New Roman', serif; "><span style=3D"font-size: =
11pt; font-family: Calibri, sans-serif; color: black; ">Roni =
Even</span><span style=3D"font-size: 10.5pt; font-family: Consolas; =
color: black; "><o:p></o:p></span></div></div><div><div =
style=3D"margin-top: 0in; margin-right: 0in; margin-bottom: 0.0001pt; =
margin-left: 0in; font-size: 12pt; font-family: 'Times New Roman', =
serif; "><span style=3D"font-size: 11pt; font-family: Calibri, =
sans-serif; color: black; ">&nbsp;</span><span style=3D"font-size: =
10.5pt; font-family: Consolas; color: black; =
"><o:p></o:p></span></div></div><div><div style=3D"margin-top: 0in; =
margin-right: 0in; margin-bottom: 0.0001pt; margin-left: 0in; font-size: =
12pt; font-family: 'Times New Roman', serif; "><span style=3D"font-size: =
11pt; font-family: Calibri, sans-serif; color: black; =
">&nbsp;<o:p></o:p></span></div></div></div></div></div><div =
style=3D"margin-top: 0in; margin-right: 0in; margin-bottom: 0.0001pt; =
margin-left: 0in; font-size: 12pt; font-family: 'Times New Roman', =
serif; =
"><o:p>&nbsp;</o:p></div></div></div></div></span></blockquote></div><br><=
/div></div></body></html>=

--Apple-Mail-40-309409471--
