Re: [payload] I-D Action: draft-ietf-payload-melpe-05.txt
"Victor Demjanenko, Ph.D." <victor.demjanenko@vocal.com> Fri, 20 January 2017 21:40 UTC
Return-Path: <victor.demjanenko@vocal.com>
X-Original-To: payload@ietfa.amsl.com
Delivered-To: payload@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 08E7F1294B5 for <payload@ietfa.amsl.com>; Fri, 20 Jan 2017 13:40:11 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -0.901
X-Spam-Level:
X-Spam-Status: No, score=-0.901 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, MSGID_MULTIPLE_AT=1, RCVD_IN_DNSWL_NONE=-0.0001, SPF_PASS=-0.001] autolearn=no autolearn_force=no
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id bCiz8Ud9TwgL for <payload@ietfa.amsl.com>; Fri, 20 Jan 2017 13:40:09 -0800 (PST)
Received: from cuda.olm1.com (cuda.olm1.com [72.236.255.32]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 06E2B1294B4 for <payload@ietf.org>; Fri, 20 Jan 2017 13:40:09 -0800 (PST)
X-ASG-Debug-ID: 1484948406-092fd3687519b40001-U2jSCT
Received: from host105.olm1.com (host105.olm1.com [72.236.255.15]) by cuda.olm1.com with ESMTP id sRhwlLI8jY1uAcHj (version=TLSv1.2 cipher=ECDHE-RSA-AES256-GCM-SHA384 bits=256 verify=NO) for <payload@ietf.org>; Fri, 20 Jan 2017 16:40:06 -0500 (EST)
X-Barracuda-Envelope-From: victor.demjanenko@vocal.com
X-Barracuda-Apparent-Source-IP: 72.236.255.15
Received: from ClintonLT (rrcs-72-43-202-98.nys.biz.rr.com [72.43.202.98]) by host105.olm1.com (Postfix) with ESMTPSA id 71BCBB451CD; Fri, 20 Jan 2017 16:40:06 -0500 (EST)
From: "Victor Demjanenko, Ph.D." <victor.demjanenko@vocal.com>
To: payload@ietf.org
References: <148494810935.13328.9215549711885172728.idtracker@ietfa.amsl.com>
In-Reply-To: <148494810935.13328.9215549711885172728.idtracker@ietfa.amsl.com>
Date: Fri, 20 Jan 2017 16:40:05 -0500
X-ASG-Orig-Subj: RE: [payload] I-D Action: draft-ietf-payload-melpe-05.txt
Message-ID: <028301d27365$c42e5e70$4c8b1b50$@demjanenko>
MIME-Version: 1.0
Content-Type: multipart/mixed; boundary="----=_NextPart_000_0284_01D2733B.DB585670"
X-Mailer: Microsoft Office Outlook 12.0
Thread-Index: AdJzZSP/vIqrBw62TQmfQm6mRj8WvQAABslw
Content-Language: en-us
X-Barracuda-Connect: host105.olm1.com[72.236.255.15]
X-Barracuda-Start-Time: 1484948406
X-Barracuda-Encrypted: ECDHE-RSA-AES256-GCM-SHA384
X-Barracuda-URL: https://72.236.255.32:443/cgi-mod/mark.cgi
X-Virus-Scanned: by bsmtpd at olm1.com
X-Barracuda-BRTS-Status: 1
X-Barracuda-Spam-Score: 0.01
X-Barracuda-Spam-Status: No, SCORE=0.01 using global scores of TAG_LEVEL=1000.0 QUARANTINE_LEVEL=1000.0 KILL_LEVEL=5.0 tests=MSGID_MULTIPLE_AT
X-Barracuda-Spam-Report: Code version 3.2, rules version 3.2.3.35951 Rule breakdown below pts rule name description ---- ---------------------- -------------------------------------------------- 0.01 MSGID_MULTIPLE_AT Message-ID contains multiple '@' characters
Archived-At: <https://mailarchive.ietf.org/arch/msg/payload/iI3nkAkd6cXIEOyzCGxuvBqPpik>
Subject: Re: [payload] I-D Action: draft-ietf-payload-melpe-05.txt
X-BeenThere: payload@ietf.org
X-Mailman-Version: 2.1.17
Precedence: list
List-Id: Audio/Video Transport Payloads working group discussion list <payload.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/payload>, <mailto:payload-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/payload/>
List-Post: <mailto:payload@ietf.org>
List-Help: <mailto:payload-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/payload>, <mailto:payload-request@ietf.org?subject=subscribe>
X-List-Received-Date: Fri, 20 Jan 2017 21:40:11 -0000
Hi Everyone, We have reviewed and made a few more changes for RFC 2119 language. The note of our changes are below. Please let us know if any other NITs or changes are necessary. Victor and Dave 1. Corrected Spelling - (Declaritive -> Declarative) 3 places (table of contents and section 4.3 twice) 2. Section 3 Improvement (removed last sentence, combined paragraphs, referenced Figure 1 explicitly) (was) The RTP payload for MELPe has the format shown in the figure below. No additional header specific to this payload format is required. This format is intended for the situations where the sender and the receiver send one or more codec data frames per packet. The RTP packet looks as follows: (now) The RTP payload for MELPe has the format shown in Figure 1. No additional header specific to this payload format is required. This format is intended for the situations where the sender and the receiver send one or more codec data frames per packet. 3. Section 3.1 Improvement (last sentence improved) (was) The total number of bits used to describe one frame of 2400 bps speech is 54, which fits in 7 octets (with two unused bits). For the 1200 bps speech the total number of bits used is 81, which fits in 11 octets (with seven unused bits). For the 600 bps speech the total number of bits used is 54, which fits in 7 octets (with two unused bits). Unused bits are coded as described in 3.3 in support of dynamic bit-rate switching. (now) The total number of bits used to describe one frame of 2400 bps speech is 54, which fits in 7 octets (with two unused bits). For the 1200 bps speech the total number of bits used is 81, which fits in 11 octets (with seven unused bits). For the 600 bps speech the total number of bits used is 54, which fits in 7 octets (with two unused bits). Unused bits, shown below as RSVA, RSVB, etc., are coded as described in 3.3 in support of dynamic bit-rate switching. 4. Table 3.4 Note (value shall be -> values are) (recommended -> preferred) (was) Note: The default value shall be the respective parameters from the vocoder frame. It is recommended that msvq[0] and gain[1] values be derived by averaging the respective parameter from some number of previous vocoder frames. (now) Note: The default values are the respective parameters from the vocoder frame. It is preferred that msvq[0] and gain[1] values be derived by averaging the respective parameter from some number of previous vocoder frames. 5. Section 3.3 Last Paragraph (recommended -> RECOMMENDED) (was) When dynamic bit-rate switching is used and more than one frame is contained in a RTP packet, it is recommended to inspect the coder rate bits contained in the last octet. If the coder bit rate indicates a Comfort Noise frame, then inspect the third last octet for the coder bit rate. All MELPe speech frames in the RTP packet will be of this same coder bit rate. (now) When dynamic bit-rate switching is used and more than one frame is contained in a RTP packet, it is RECOMMENDED to inspect the coder rate bits contained in the last octet. If the coder bit rate indicates a Comfort Noise frame, then inspect the third last octet for the coder bit rate. All MELPe speech frames in the RTP packet will be of this same coder bit rate. 6. Section 3.4 First Paragraph (efficient congestion control -> congestion management) and (the transmission rate -> packet rate) (was) The target bitrate of MELPe can be adjusted at any point in time, thus allowing efficient congestion control. Furthermore, the amount of encoded speech or audio data encoded in a single packet can be used for congestion control, since the transmission rate is inversely proportional to the packet duration. A lower packet transmission rate reduces the amount of header overhead, but at the same time increases latency and loss sensitivity, so it ought to be used with care. (now) The target bitrate of MELPe can be adjusted at any point in time, thus allowing congestion management. Furthermore, the amount of encoded speech or audio data encoded in a single packet can be used for congestion control, since packet rate is inversely proportional to the packet duration. A lower packet transmission rate reduces the amount of header overhead, but at the same time increases latency and loss sensitivity, so it ought to be used with care. 7. Section 4.2 Third Paragraph (may -> MAY) (was) When conveying information by SDP, the encoding name SHALL be "MELP" (the same as the media subtype). Alternative encoding name types, "MELP2400", "MELP1200", and "MELP600", may be used in SDP to convey fixed bit-rate configurations. These names have been observed in systems that do not support dynamic frame rate switching as specified by the parameter, "bitrate". (now) When conveying information by SDP, the encoding name SHALL be "MELP" (the same as the media subtype). Alternative encoding name types, "MELP2400", "MELP1200", and "MELP600", MAY be used in SDP to convey fixed bit-rate configurations. These names have been observed in systems that do not support dynamic frame rate switching as specified by the parameter, "bitrate". 8. Section 6 First Paragraph (may utilize -> utilizes) and (recommended -> preferred) (was) MELPe packet loss concealment (PLC) uses the special properties and coding for the pitch/voicing parameter of the MELPe 2400 bps coder. The PLC erasure indication may utilize any of the errored encodings of a non-voiced frame as identified in Table 1 of [MELPE]. For the sake of simplicity it is recommended to use a code value of 3 for the pitch/voicing parameter (represented by the bits P6 to P0 in Table 3.1). Hence, set bits P0 and P1 to one and bits P2, P3, P4, P5, and P6 to zero. (now) MELPe packet loss concealment (PLC) uses the special properties and coding for the pitch/voicing parameter of the MELPe 2400 bps coder. The PLC erasure indication utilizes any of the errored encodings of a non-voiced frame as identified in Table 1 of [MELPE]. For the sake of simplicity it is preferred to use a code value of 3 for the pitch/voicing parameter (represented by the bits P6 to P0 in Table 3.1). Hence, set bits P0 and P1 to one and bits P2, P3, P4, P5, and P6 to zero. 9. References (MELP, MELPE and SCIP210 are now normative) (I could not find meaningful URI for documents. Available from various search pages.) 10. Section 2 Second to Last Paragraph (is recommended to follow -> follows the procedures in) (was) Comfort noise handling for MELPe is recommended to follow SCIP-210 Appendix B [SCIP210]. After VAD no longer indicates the presence of speech/voice, a grace period of a minimum of two comfort noise vocoder fames are to be transmitted. The contents of the comfort noise frames is described in the next section. (now) Comfort noise handling for MELPe follows the procedures in SCIP-210 Appendix B [SCIP210]. After VAD no longer indicates the presence of speech/voice, a grace period of a minimum of two comfort noise vocoder fames are to be transmitted. The contents of the comfort noise frames is described in the next section. 11. Section 3 Second Paragraph (required -> needed) (was) The RTP payload for MELPe has the format shown in Figure 1. No additional header specific to this payload format is required. This format is intended for the situations where the sender and the receiver send one or more codec data frames per packet. (now) The RTP payload for MELPe has the format shown in Figure 1. No additional header specific to this payload format is needed. This format is intended for the situations where the sender and the receiver send one or more codec data frames per packet. 12. Text after Table 3.4 (required -> needed) (was) Since only msvq[0] (also known as LSF1x or the first LSP) and gain[1] (also known as g2x or the second gain) are required, the following bit order is used for comfort noise frames. (now) Since only msvq[0] (also known as LSF1x or the first LSP) and gain[1] (also known as g2x or the second gain) are needed, the following bit order is used for comfort noise frames. 13. Section 3.3 First Paragraph (may -> MAY) (may be required -> is used) (was) A MELPe RTP packet may consist of zero or more MELPe coder frames, followed by zero or one MELPe Comfort Noise frames. The presence of a comfort noise frame can be deduced from the length of the RTP payload. The default packetization interval is one coder frame (22.5, 67.5 or 90 ms) according to the coder bit rate (2400, 1200 or 600 bps). For some applications, a longer packetization interval may be required to reduce the packet rate. (now) A MELPe RTP packet MAY consist of zero or more MELPe coder frames, followed by zero or one MELPe Comfort Noise frames. The presence of a comfort noise frame can be deduced from the length of the RTP payload. The default packetization interval is one coder frame (22.5, 67.5 or 90 ms) according to the coder bit rate (2400, 1200 or 600 bps). For some applications, a longer packetization interval is used to reduce the packet rate. 14. Section 3.3 Second Paragraph (may -> MAY) (was) A MELPe RTP packet comprised of no coder frame and no comfort noise frame may be used periodically by an end point to indicate connectivity by an otherwise idle receiver. (now) A MELPe RTP packet comprised of no coder frame and no comfort noise frame MAY be used periodically by an end point to indicate connectivity by an otherwise idle receiver. 15. Section 3.3 Second to Last Paragraph (may -> can) (was) Information describing the number of frames contained in an RTP packet is not transmitted as part of the RTP payload. The way to determine the number of MELPe frames is to count the total number of octets within the RTP packet, and divide the octet count by the number of expected octets per frame (7/11/7 per frame). Keep in mind the last frame may be a 2 octet comfort noise frame. (now) Information describing the number of frames contained in an RTP packet is not transmitted as part of the RTP payload. The way to determine the number of MELPe frames is to count the total number of octets within the RTP packet, and divide the octet count by the number of expected octets per frame (7/11/7 per frame). Keep in mind the last frame can be a 2 octet comfort noise frame. 16. Section 4.2 Last Paragraph (may -> can) (was) The value for "packet time" and "maximum packet time" parameters of the "ptime" and "maxptime" SDP attributes respectively, SHALL use the nearest rounded-up ms integer packet duration. For MELPe, this corresponds to the values: 23, 45, 68, 90, 112, 135, 156, and 180. Larger values may be used as long as they are properly rounded. (now) The value for "packet time" and "maximum packet time" parameters of the "ptime" and "maxptime" SDP attributes respectively, SHALL use the nearest rounded-up ms integer packet duration. For MELPe, this corresponds to the values: 23, 45, 68, 90, 112, 135, 156, and 180. Larger values can be used as long as they are properly rounded. 17. Section 4.3 First Paragraph (may -> MAY) (was) For declarative media, the "bitrate" parameter specifes the possible bit rates used by the sender. Multiple MELPe rtpmap values (such as 97, 98, and 99 as used below) may be used to convey MELPe coded voice at different bit rates. The receiver can then select an appropriate MELPe codec by using 97, 98, or 99. (now) For declarative media, the "bitrate" parameter specifes the possible bit rates used by the sender. Multiple MELPe rtpmap values (such as 97, 98, and 99 as used below) MAY be used to convey MELPe coded voice at different bit rates. The receiver can then select an appropriate MELPe codec by using 97, 98, or 99. 18. Section 4.4 First Two Paragraphs (may -> MAY) (may -> MAY) (may -> MAY) (was) In an Offer/Answer mode [RFC3264], "bitrate" is a bi-directional parameter. Both sides MUST use a common "bitrate" value or values. The offer contains the bit rates supported by the offerer listed in its preferred order. The answerer may agree to any bit rate by listing the bit rate first in the answerer response. Additionally the answerer may indicate any secondary bit rate or bit rates that it supports. The initial bit rate used by both parties SHALL be the first bit rate specified in the answerer response. For example if offerer bit rates are "2400,600", and answer bit rates are "600,2400", the initial bit rate is 600. If other bit rates are provided by the answerer, any common bit rate between offer and answer may be used at any time in the future. Activation of these other common bit rates is beyond the scope of this document. (now) In an Offer/Answer mode [RFC3264], "bitrate" is a bi-directional parameter. Both sides MUST use a common "bitrate" value or values. The offer contains the bit rates supported by the offerer listed in its preferred order. The answerer MAY agree to any bit rate by listing the bit rate first in the answerer response. Additionally the answerer MAY indicate any secondary bit rate or bit rates that it supports. The initial bit rate used by both parties SHALL be the first bit rate specified in the answerer response. For example if offerer bit rates are "2400,600", and answer bit rates are "600,2400", the initial bit rate is 600. If other bit rates are provided by the answerer, any common bit rate between offer and answer MAY be used at any time in the future. Activation of these other common bit rates is beyond the scope of this document. 19. Section 5 (review) (is) A primary application of MELPe is for radio communications of voice conversations and discontinuous transmissions are normal. When MELPe is used in an IP network, MELPe RTP packet transmissions may cease and resume frequently. RTP SSRC sequence number gaps indicate lost packets to be filled by PLC while abrupt loss of RTP packets indicate intended discontinuous transmission. If a MELPe coder so desires, it may send a comfort noise frame as per SCIP-210 Appendix B [SCIP210] prior to ceasing transmission. A receiver may optionally use comfort noise during its silence periods. No SDP negotiations are required. (review) This uses "may" three times and "required" and "optionally" one each. Should any be capitalized or changed to a different word? -----Original Message----- From: payload [mailto:payload-bounces@ietf.org] On Behalf Of internet-drafts@ietf.org Sent: Friday, January 20, 2017 4:35 PM To: i-d-announce@ietf.org Cc: payload@ietf.org Subject: [payload] I-D Action: draft-ietf-payload-melpe-05.txt A New Internet-Draft is available from the on-line Internet-Drafts directories. This draft is a work item of the Audio/Video Transport Payloads of the IETF. Title : RTP Payload Format for MELPe Codec Authors : Victor Demjanenko David Satterlee Filename : draft-ietf-payload-melpe-05.txt Pages : 28 Date : 2017-01-20 Abstract: This document describes the RTP payload format for the Mixed Excitation Linear Prediction Enhanced (MELPe) speech coder. MELPe's three different speech encoding rates and sample frames sizes are supported. Comfort noise procedures and packet loss concealment are detailed. INTERNET DRAFT RTP Payload Format for the MELPe Codec January 20, 2017 The IETF datatracker status page for this draft is: https://datatracker.ietf.org/doc/draft-ietf-payload-melpe/ There's also a htmlized version available at: https://tools.ietf.org/html/draft-ietf-payload-melpe-05 A diff from the previous version is available at: https://www.ietf.org/rfcdiff?url2=draft-ietf-payload-melpe-05 Please note that it may take a couple of minutes from the time of submission until the htmlized version and diff are available at tools.ietf.org. Internet-Drafts are also available by anonymous FTP at: ftp://ftp.ietf.org/internet-drafts/ _______________________________________________ payload mailing list payload@ietf.org https://www.ietf.org/mailman/listinfo/payload
- [payload] I-D Action: draft-ietf-payload-melpe-05… internet-drafts
- Re: [payload] I-D Action: draft-ietf-payload-melp… Victor Demjanenko, Ph.D.