Re: [payload] I-D Action: draft-ietf-payload-melpe-05.txt

"Victor Demjanenko, Ph.D." <victor.demjanenko@vocal.com> Fri, 20 January 2017 21:40 UTC

Return-Path: <victor.demjanenko@vocal.com>
X-Original-To: payload@ietfa.amsl.com
Delivered-To: payload@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 08E7F1294B5 for <payload@ietfa.amsl.com>; Fri, 20 Jan 2017 13:40:11 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -0.901
X-Spam-Level:
X-Spam-Status: No, score=-0.901 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, MSGID_MULTIPLE_AT=1, RCVD_IN_DNSWL_NONE=-0.0001, SPF_PASS=-0.001] autolearn=no autolearn_force=no
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id bCiz8Ud9TwgL for <payload@ietfa.amsl.com>; Fri, 20 Jan 2017 13:40:09 -0800 (PST)
Received: from cuda.olm1.com (cuda.olm1.com [72.236.255.32]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 06E2B1294B4 for <payload@ietf.org>; Fri, 20 Jan 2017 13:40:09 -0800 (PST)
X-ASG-Debug-ID: 1484948406-092fd3687519b40001-U2jSCT
Received: from host105.olm1.com (host105.olm1.com [72.236.255.15]) by cuda.olm1.com with ESMTP id sRhwlLI8jY1uAcHj (version=TLSv1.2 cipher=ECDHE-RSA-AES256-GCM-SHA384 bits=256 verify=NO) for <payload@ietf.org>; Fri, 20 Jan 2017 16:40:06 -0500 (EST)
X-Barracuda-Envelope-From: victor.demjanenko@vocal.com
X-Barracuda-Apparent-Source-IP: 72.236.255.15
Received: from ClintonLT (rrcs-72-43-202-98.nys.biz.rr.com [72.43.202.98]) by host105.olm1.com (Postfix) with ESMTPSA id 71BCBB451CD; Fri, 20 Jan 2017 16:40:06 -0500 (EST)
From: "Victor Demjanenko, Ph.D." <victor.demjanenko@vocal.com>
To: payload@ietf.org
References: <148494810935.13328.9215549711885172728.idtracker@ietfa.amsl.com>
In-Reply-To: <148494810935.13328.9215549711885172728.idtracker@ietfa.amsl.com>
Date: Fri, 20 Jan 2017 16:40:05 -0500
X-ASG-Orig-Subj: RE: [payload] I-D Action: draft-ietf-payload-melpe-05.txt
Message-ID: <028301d27365$c42e5e70$4c8b1b50$@demjanenko>
MIME-Version: 1.0
Content-Type: multipart/mixed; boundary="----=_NextPart_000_0284_01D2733B.DB585670"
X-Mailer: Microsoft Office Outlook 12.0
Thread-Index: AdJzZSP/vIqrBw62TQmfQm6mRj8WvQAABslw
Content-Language: en-us
X-Barracuda-Connect: host105.olm1.com[72.236.255.15]
X-Barracuda-Start-Time: 1484948406
X-Barracuda-Encrypted: ECDHE-RSA-AES256-GCM-SHA384
X-Barracuda-URL: https://72.236.255.32:443/cgi-mod/mark.cgi
X-Virus-Scanned: by bsmtpd at olm1.com
X-Barracuda-BRTS-Status: 1
X-Barracuda-Spam-Score: 0.01
X-Barracuda-Spam-Status: No, SCORE=0.01 using global scores of TAG_LEVEL=1000.0 QUARANTINE_LEVEL=1000.0 KILL_LEVEL=5.0 tests=MSGID_MULTIPLE_AT
X-Barracuda-Spam-Report: Code version 3.2, rules version 3.2.3.35951 Rule breakdown below pts rule name description ---- ---------------------- -------------------------------------------------- 0.01 MSGID_MULTIPLE_AT Message-ID contains multiple '@' characters
Archived-At: <https://mailarchive.ietf.org/arch/msg/payload/iI3nkAkd6cXIEOyzCGxuvBqPpik>
Subject: Re: [payload] I-D Action: draft-ietf-payload-melpe-05.txt
X-BeenThere: payload@ietf.org
X-Mailman-Version: 2.1.17
Precedence: list
List-Id: Audio/Video Transport Payloads working group discussion list <payload.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/payload>, <mailto:payload-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/payload/>
List-Post: <mailto:payload@ietf.org>
List-Help: <mailto:payload-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/payload>, <mailto:payload-request@ietf.org?subject=subscribe>
X-List-Received-Date: Fri, 20 Jan 2017 21:40:11 -0000

Hi Everyone,

We have reviewed and made a few more changes for RFC 2119 language.  The
note of our changes are below.  Please let us know if any other NITs or
changes are necessary.

Victor and Dave


1.	Corrected Spelling - (Declaritive -> Declarative)

	3 places (table of contents and section 4.3 twice)

2.	Section 3 Improvement (removed last sentence, combined paragraphs,
referenced 
	Figure 1 explicitly)

	(was)
	The RTP payload for MELPe has the format shown in the figure below.
No
	additional header specific to this payload format is required. 

	This format is intended for the situations where the sender and the
receiver
	send one or more codec data frames per packet. The RTP packet looks
as follows:

	(now)
	The RTP payload for MELPe has the format shown in Figure 1.  No
additional
	header specific to this payload format is required.  This format is
intended
	for the situations where the sender and the receiver send one or
more codec
	data frames per packet. 

3.	Section 3.1 Improvement (last sentence improved)

	(was)
	The total number of bits used to describe one frame of 2400 bps
speech is 54,
	which fits in 7 octets (with two unused bits). For the 1200 bps
speech the total
	number of bits used is 81, which fits in 11 octets (with seven
unused bits).
	For the 600 bps speech the total number of bits used is 54, which
fits in 7
	octets (with two unused bits).  Unused bits are coded as described
in 3.3 in
	support of dynamic bit-rate switching.

	(now)
	The total number of bits used to describe one frame of 2400 bps
speech is 54,
	which fits in 7 octets (with two unused bits). For the 1200 bps
speech the
	total number of bits used is 81, which fits in 11 octets (with seven
unused
	bits).  For the 600 bps speech the total number of bits used is 54,
which fits
	in 7 octets (with two unused bits).  Unused bits, shown below as
RSVA, RSVB,
	etc., are coded as described in 3.3 in support of dynamic bit-rate
switching.

4.	Table 3.4 Note (value shall be -> values are) (recommended ->
preferred)

	(was)
	Note: The default value shall be the respective parameters from the
vocoder
	frame.  It is recommended that msvq[0] and gain[1] values be derived
by
	averaging the respective parameter from some number of previous
vocoder frames.

	(now)
	Note: The default values are the respective parameters from the
vocoder
	frame.  It is preferred that msvq[0] and gain[1] values be derived
by
	averaging the respective parameter from some number of previous
vocoder frames.

5.	Section 3.3 Last Paragraph (recommended -> RECOMMENDED)

	(was)
	When dynamic bit-rate switching is used and more than one frame is
contained in a
	RTP packet, it is recommended to inspect the coder rate bits
contained in the 
	last octet.  If the coder bit rate indicates a Comfort Noise frame,
then inspect 
	the third last octet for the coder bit rate.  All MELPe speech
frames in the RTP
	packet will be of this same coder bit rate.

	(now)
	When dynamic bit-rate switching is used and more than one frame is
contained in a
	RTP packet, it is RECOMMENDED to inspect the coder rate bits
contained in the 
	last octet.  If the coder bit rate indicates a Comfort Noise frame,
then inspect 
	the third last octet for the coder bit rate.  All MELPe speech
frames in the RTP
	packet will be of this same coder bit rate.

6.	Section 3.4 First Paragraph (efficient congestion control ->
congestion 
	management) and (the transmission rate -> packet rate)

	(was)
	The target bitrate of MELPe can be adjusted at any point in time,
thus allowing
	efficient congestion control.  Furthermore, the amount of encoded
speech or
	audio data encoded in a single packet can be used for congestion
control, since
	the transmission rate is inversely proportional to the packet
duration.  A
	lower packet transmission rate reduces the amount of header
overhead, but at
	the same time increases latency and loss sensitivity, so it ought to
be used
	with care.

	(now)
	The target bitrate of MELPe can be adjusted at any point in time,
thus allowing
	congestion management.  Furthermore, the amount of encoded speech or
	audio data encoded in a single packet can be used for congestion
control, since
	packet rate is inversely proportional to the packet duration.  A
	lower packet transmission rate reduces the amount of header
overhead, but at
	the same time increases latency and loss sensitivity, so it ought to
be used
	with care.

7.	Section 4.2 Third Paragraph (may -> MAY)

	(was)
	When conveying information by SDP, the encoding name SHALL be "MELP"

	(the same as the media subtype).  Alternative encoding name types,
"MELP2400", 
	"MELP1200", and "MELP600", may be used in SDP to convey fixed
bit-rate 
	configurations.  These names have been observed in systems that do
not 
	support dynamic frame rate switching as specified by the parameter,
"bitrate".

	(now)
	When conveying information by SDP, the encoding name SHALL be "MELP"

	(the same as the media subtype).  Alternative encoding name types,
"MELP2400", 
	"MELP1200", and "MELP600", MAY be used in SDP to convey fixed
bit-rate 
	configurations.  These names have been observed in systems that do
not 
	support dynamic frame rate switching as specified by the parameter,
"bitrate".

8.	Section 6 First Paragraph (may utilize -> utilizes) and (recommended
-> preferred)

	(was)
	MELPe packet loss concealment (PLC) uses the special properties and
coding for
	the pitch/voicing parameter of the MELPe 2400 bps coder.  The PLC
erasure
	indication may utilize any of the errored encodings of a non-voiced
frame as
	identified in Table 1 of [MELPE].  For the sake of simplicity it is
recommended
	to use a code value of 3 for the pitch/voicing parameter
(represented by the
	bits P6 to P0 in Table 3.1).  Hence, set bits P0 and P1 to one and
bits
	P2, P3, P4, P5, and P6 to zero.

	(now)
	MELPe packet loss concealment (PLC) uses the special properties and
coding for
	the pitch/voicing parameter of the MELPe 2400 bps coder.  The PLC
erasure
	indication utilizes any of the errored encodings of a non-voiced
frame as
	identified in Table 1 of [MELPE].  For the sake of simplicity it is
preferred
	to use a code value of 3 for the pitch/voicing parameter
(represented by the
	bits P6 to P0 in Table 3.1).  Hence, set bits P0 and P1 to one and
bits
	P2, P3, P4, P5, and P6 to zero.


9.	References (MELP, MELPE and SCIP210 are now normative)

	(I could not find meaningful URI for documents.  Available from
various search 
	pages.)

10.	Section 2 Second to Last Paragraph (is recommended to follow ->
follows the procedures in)

	(was)
	Comfort noise handling for MELPe is recommended to follow SCIP-210
Appendix B
	[SCIP210].  After VAD no longer indicates the presence of
speech/voice, a grace
	period of a minimum of two comfort noise vocoder fames are to be
transmitted.
	The contents of the comfort noise frames is described in the next
section.

	(now)
	Comfort noise handling for MELPe follows the procedures in SCIP-210
Appendix B
	[SCIP210].  After VAD no longer indicates the presence of
speech/voice, a grace
	period of a minimum of two comfort noise vocoder fames are to be
transmitted.
	The contents of the comfort noise frames is described in the next
section.

11.	Section 3 Second Paragraph (required -> needed)

	(was)
	The RTP payload for MELPe has the format shown in Figure 1.  No
additional
	header specific to this payload format is required.  This format is
intended
	for the situations where the sender and the receiver send one or
more codec
	data frames per packet. 

	(now)
	The RTP payload for MELPe has the format shown in Figure 1.  No
additional
	header specific to this payload format is needed.  This format is
intended
	for the situations where the sender and the receiver send one or
more codec
	data frames per packet. 

12.	Text after Table 3.4 (required -> needed)

	(was)
	Since only msvq[0] (also known as LSF1x or the first LSP) and
gain[1] (also
	known as g2x or the second gain) are required, the following bit
order is used
	for comfort noise frames.

	(now)
	Since only msvq[0] (also known as LSF1x or the first LSP) and
gain[1] (also
	known as g2x or the second gain) are needed, the following bit order
is used
	for comfort noise frames.

13.	Section 3.3 First Paragraph (may -> MAY) (may be required -> is
used)

	(was)
	A MELPe RTP packet may consist of zero or more MELPe coder frames,
followed by
	zero or one MELPe Comfort Noise frames.  The presence of a comfort
noise frame
	can be deduced from the length of the RTP payload.  The default
packetization
	interval is one coder frame (22.5, 67.5 or 90 ms) according to the
coder bit rate
	(2400, 1200 or 600 bps).  For some applications, a longer
packetization
	interval may be required to reduce the packet rate.

	(now)
	A MELPe RTP packet MAY consist of zero or more MELPe coder frames,
followed by
	zero or one MELPe Comfort Noise frames.  The presence of a comfort
noise frame
	can be deduced from the length of the RTP payload.  The default
packetization
	interval is one coder frame (22.5, 67.5 or 90 ms) according to the
coder bit rate
	(2400, 1200 or 600 bps).  For some applications, a longer
packetization
	interval is used to reduce the packet rate.

14.	Section 3.3 Second Paragraph (may -> MAY)

	(was)
	A MELPe RTP packet comprised of no coder frame and no comfort noise
frame may be
	used periodically by an end point to indicate connectivity by an
otherwise idle
	receiver.

	(now)
	A MELPe RTP packet comprised of no coder frame and no comfort noise
frame MAY be
	used periodically by an end point to indicate connectivity by an
otherwise idle
	receiver.

15.	Section 3.3 Second to Last Paragraph (may -> can)

	(was)
	Information describing the number of frames contained in an RTP
packet is not
	transmitted as part of the RTP payload.  The way to determine the
number of
	MELPe frames is to count the total number of octets within the RTP
packet, and
	divide the octet count by the number of expected octets per frame
(7/11/7 per
	frame).  Keep in mind the last frame may be a 2 octet comfort noise
frame.   

	(now)
	Information describing the number of frames contained in an RTP
packet is not
	transmitted as part of the RTP payload.  The way to determine the
number of
	MELPe frames is to count the total number of octets within the RTP
packet, and
	divide the octet count by the number of expected octets per frame
(7/11/7 per
	frame).  Keep in mind the last frame can be a 2 octet comfort noise
frame.   

16.	Section 4.2 Last Paragraph (may -> can)

	(was)
	The value for "packet time" and "maximum packet time" parameters of
the "ptime"
	and "maxptime" SDP attributes respectively, SHALL use the nearest
rounded-up ms
	integer packet duration.  For MELPe, this corresponds to the values:
23, 45,
	68, 90, 112, 135, 156, and 180.  Larger values may be used as long
as they are
	properly rounded.

	(now)
	The value for "packet time" and "maximum packet time" parameters of
the "ptime"
	and "maxptime" SDP attributes respectively, SHALL use the nearest
rounded-up ms
	integer packet duration.  For MELPe, this corresponds to the values:
23, 45,
	68, 90, 112, 135, 156, and 180.  Larger values can be used as long
as they are
	properly rounded.

17.	Section 4.3 First Paragraph (may -> MAY)

	(was)
	For declarative media, the "bitrate" parameter specifes the possible
bit rates
	used by the sender.  Multiple MELPe rtpmap values (such as 97, 98,
and 99 as
	used below) may be used to convey MELPe coded voice at different bit
rates.
	The receiver can then select an appropriate MELPe codec by using 97,
98, or 99.

	(now)
	For declarative media, the "bitrate" parameter specifes the possible
bit rates
	used by the sender.  Multiple MELPe rtpmap values (such as 97, 98,
and 99 as
	used below) MAY be used to convey MELPe coded voice at different bit
rates.
	The receiver can then select an appropriate MELPe codec by using 97,
98, or 99.

18. Section 4.4 First Two Paragraphs (may -> MAY) (may -> MAY) (may -> MAY)

	(was)
	In an Offer/Answer mode [RFC3264], "bitrate" is a bi-directional
parameter.
	Both sides MUST use a common "bitrate" value or values.  The offer
contains the
	bit rates supported by the offerer listed in its preferred order.
The answerer
	may agree to any bit rate by listing the bit rate first in the
answerer
	response.  Additionally the answerer may indicate any secondary bit
rate or bit
	rates that it supports.  The initial bit rate used by both parties
SHALL be the
	first bit rate specified in the answerer response. 

	For example if offerer bit rates are "2400,600", and answer bit
rates are "600,2400",
	the initial bit rate is 600.  If other bit rates are provided by the
answerer, any
	common bit rate between offer and answer may be used at any time in
the future.  
	Activation of these other common bit rates is beyond the scope of
this document.

	(now)
	In an Offer/Answer mode [RFC3264], "bitrate" is a bi-directional
parameter.
	Both sides MUST use a common "bitrate" value or values.  The offer
contains the
	bit rates supported by the offerer listed in its preferred order.
The answerer
	MAY agree to any bit rate by listing the bit rate first in the
answerer
	response.  Additionally the answerer MAY indicate any secondary bit
rate or bit
	rates that it supports.  The initial bit rate used by both parties
SHALL be the
	first bit rate specified in the answerer response. 

	For example if offerer bit rates are "2400,600", and answer bit
rates are "600,2400",
	the initial bit rate is 600.  If other bit rates are provided by the
answerer, any
	common bit rate between offer and answer MAY be used at any time in
the future.  
	Activation of these other common bit rates is beyond the scope of
this document.

19.	Section 5 (review)

	(is)
	A primary application of MELPe is for radio communications of voice
	conversations and discontinuous transmissions are normal.  When
MELPe is used
	in an IP network, MELPe RTP packet transmissions may cease and
resume
	frequently.  RTP SSRC sequence number gaps indicate lost packets to
be filled 
	by PLC while abrupt loss of RTP packets indicate intended
discontinuous 
	transmission.

	If a MELPe coder so desires, it may send a comfort noise frame as
per SCIP-210
	Appendix B [SCIP210] prior to ceasing transmission.  A receiver may
optionally
	use comfort noise during its silence periods. No SDP negotiations
are
	required.

	(review)
	This uses "may" three times and "required" and "optionally" one
each.  Should any
	be capitalized or changed to a different word?



-----Original Message-----
From: payload [mailto:payload-bounces@ietf.org] On Behalf Of
internet-drafts@ietf.org
Sent: Friday, January 20, 2017 4:35 PM
To: i-d-announce@ietf.org
Cc: payload@ietf.org
Subject: [payload] I-D Action: draft-ietf-payload-melpe-05.txt


A New Internet-Draft is available from the on-line Internet-Drafts
directories.
This draft is a work item of the Audio/Video Transport Payloads of the IETF.

        Title           : RTP Payload Format for MELPe Codec
        Authors         : Victor Demjanenko
                          David Satterlee
	Filename        : draft-ietf-payload-melpe-05.txt
	Pages           : 28
	Date            : 2017-01-20

Abstract:
   This document describes the RTP payload format for the Mixed
   Excitation Linear Prediction Enhanced (MELPe) speech coder.  MELPe's
   three different speech encoding rates and sample frames sizes are
   supported.  Comfort noise procedures and packet loss concealment are
   detailed.

INTERNET DRAFT   RTP Payload Format for the MELPe Codec January 20, 2017



The IETF datatracker status page for this draft is:
https://datatracker.ietf.org/doc/draft-ietf-payload-melpe/

There's also a htmlized version available at:
https://tools.ietf.org/html/draft-ietf-payload-melpe-05

A diff from the previous version is available at:
https://www.ietf.org/rfcdiff?url2=draft-ietf-payload-melpe-05


Please note that it may take a couple of minutes from the time of submission
until the htmlized version and diff are available at tools.ietf.org.

Internet-Drafts are also available by anonymous FTP at:
ftp://ftp.ietf.org/internet-drafts/

_______________________________________________
payload mailing list
payload@ietf.org
https://www.ietf.org/mailman/listinfo/payload