Re: [MMUSIC] Alexey Melnikov's No Objection on draft-ietf-mmusic-rfc4566bis-35: (with COMMENT)

Alexey Melnikov <aamelnikov@fastmail.fm> Sat, 08 June 2019 09:38 UTC

Return-Path: <aamelnikov@fastmail.fm>
X-Original-To: mmusic@ietfa.amsl.com
Delivered-To: mmusic@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id DB57B12008B; Sat, 8 Jun 2019 02:38:46 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -2.7
X-Spam-Level:
X-Spam-Status: No, score=-2.7 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, FREEMAIL_FROM=0.001, RCVD_IN_DNSWL_LOW=-0.7, SPF_PASS=-0.001] autolearn=ham autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (2048-bit key) header.d=fastmail.fm header.b=iVVtXc8o; dkim=pass (2048-bit key) header.d=messagingengine.com header.b=DNo1Nmq0
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id 6lPYeFNYhAVG; Sat, 8 Jun 2019 02:38:44 -0700 (PDT)
Received: from out5-smtp.messagingengine.com (out5-smtp.messagingengine.com [66.111.4.29]) (using TLSv1.2 with cipher AECDH-AES256-SHA (256/256 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 3082012006E; Sat, 8 Jun 2019 02:38:44 -0700 (PDT)
Received: from compute7.internal (compute7.nyi.internal [10.202.2.47]) by mailout.nyi.internal (Postfix) with ESMTP id 63CFB21F9F; Sat, 8 Jun 2019 05:38:43 -0400 (EDT)
Received: from mailfrontend1 ([10.202.2.162]) by compute7.internal (MEProxy); Sat, 08 Jun 2019 05:38:43 -0400
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=fastmail.fm; h= content-type:mime-version:subject:from:in-reply-to:date:cc :content-transfer-encoding:message-id:references:to; s=fm3; bh=X jhCO5tVbBPqRZ8fa++uWmh35nbaVIHxC9dhA4OTcYY=; b=iVVtXc8oS/Z8tkuqp nm9B84Zv+4zY8clgP9TDaQbbiosL2V6y3lKeaXXCWRbJRr3oLc6VVMJ1W2T4HDnK vw/RwnjeGyUD/HqCT+7cL38WBm+jqGlI7lFnK0hEhjEfw+2mjlgelLxjXsuuFxMI ZBh8jza8hzOysGhUWTBd+ovmla75SuW3lwJ8bQS0IAmgUVD+ejymhlc/nblNXWdX jJEE0eihacRF4n5QivFcK/2bOn6Td+cjoeJFe0XCHfQ0hqofsyBpDzvidTz0+RgY Fzgwu693pTTZ+NMRW3PIJzdDwcgCmUhMIZAdINfydqsR6vm2Gp7B0QhTz/Iow2R+ W0RWw==
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d= messagingengine.com; h=cc:content-transfer-encoding:content-type :date:from:in-reply-to:message-id:mime-version:references :subject:to:x-me-proxy:x-me-proxy:x-me-sender:x-me-sender :x-sasl-enc; s=fm3; bh=XjhCO5tVbBPqRZ8fa++uWmh35nbaVIHxC9dhA4OTc YY=; b=DNo1Nmq0uEHhEbXD9PTqoY0ILN15NIQURiKCHVD24YYaTc93OmWTJ/JTg jJI8IoO23d1fkFsvZ1UMbVpky7k9F9+ItK3vOaKsDP/yMhvfRVyzt0Pig0etQw5x FGP7DrsVjDmpQ8I2qKUf/nAnV95dcnCWat+AVpbylISdcjy/vzXMoKteFWRmGaJR 0eUPZivEH1DbW/qilkRUJ75n+ogniXQhaVCzsx4juErNlOK6CMi9tjCHPoc7YNFB a8b88X+i07Ez/SnZuSZhbYybPy800il3/ZlwWXpTtd3ck0WsAtBJ5L2uAnsGZ35Z CZZ01sSKRcjh1Q52C07fMkmY1zarQ==
X-ME-Sender: <xms:IoL7XHYyhplMXuC0ES9p4iZkDjoW6Q3euo7KFWCTFemlrR-SFJaazQ>
X-ME-Proxy-Cause: gggruggvucftvghtrhhoucdtuddrgeduuddrudegkedgudekucetufdoteggodetrfdotf fvucfrrhhofhhilhgvmecuhfgrshhtofgrihhlpdfqfgfvpdfurfetoffkrfgpnffqhgen uceurghilhhouhhtmecufedttdenucesvcftvggtihhpihgvnhhtshculddquddttddmne cujfgurheptggguffhofgjfffgkfhfvfesthhqmhdthhdtjeenucfhrhhomheptehlvgig vgihucfovghlnhhikhhovhcuoegrrghmvghlnhhikhhovhesfhgrshhtmhgrihhlrdhfmh eqnecukfhppeejjedrleejrddugeehrdehheenucfrrghrrghmpehmrghilhhfrhhomhep rggrmhgvlhhnihhkohhvsehfrghsthhmrghilhdrfhhmnecuvehluhhsthgvrhfuihiivg eptd
X-ME-Proxy: <xmx:IoL7XI9g9foK-hDnNnrWwDNX4Jqgg13JEYALUKh8ROgwiG9ANtaJ_w> <xmx:IoL7XNlrLNzk-WqSKy96jCn5A7ysY_dHqG11Ead0sDUL-frSMEDPWg> <xmx:IoL7XNyWZ09096CfUp76UEagHGpjVUzZaVAbEq8tkbQyU0R1FMc4RA> <xmx:I4L7XA2fLDwdvcI5WUuZV5J_23DYcQ81Yq2vG8JZ9Q24oboGQ2gZUg>
Received: from [192.168.0.12] (cpc121086-nmal24-2-0-cust54.19-2.cable.virginm.net [77.97.145.55]) by mail.messagingengine.com (Postfix) with ESMTPA id 4B46B8005B; Sat, 8 Jun 2019 05:38:41 -0400 (EDT)
Content-Type: text/plain; charset="utf-8"
Mime-Version: 1.0 (1.0)
From: Alexey Melnikov <aamelnikov@fastmail.fm>
X-Mailer: iPad Mail (16D57)
In-Reply-To: <1f37b132-98b7-af5e-7997-e0fd095ff207@alum.mit.edu>
Date: Sat, 08 Jun 2019 10:38:39 +0100
Cc: Barry Leiba <barryleiba@computer.org>, The IESG <iesg@ietf.org>, Flemming Andreasen <fandreas@cisco.com>, mmusic-chairs@ietf.org, draft-ietf-mmusic-rfc4566bis@ietf.org, mmusic@ietf.org
Content-Transfer-Encoding: quoted-printable
Message-Id: <E2014981-8656-432E-82C2-2FA5ED110DD7@fastmail.fm>
References: <155922060388.22145.12090008162284261785.idtracker@ietfa.amsl.com> <5b944fc8-3f97-55e6-2faf-45bfd11c5837@alum.mit.edu> <CALaySJJjwG26NLCJqFdo2yW_JhYCYbY+ADHENa490XqM539U2A@mail.gmail.com> <d1954b5e-f7bb-40e1-88dc-5565212b517d@www.fastmail.com> <1f37b132-98b7-af5e-7997-e0fd095ff207@alum.mit.edu>
To: Paul Kyzivat <pkyzivat@alum.mit.edu>
Archived-At: <https://mailarchive.ietf.org/arch/msg/mmusic/B4M0nuXVVzfOmg0KsR4uYvi3R-k>
Subject: Re: [MMUSIC] Alexey Melnikov's No Objection on draft-ietf-mmusic-rfc4566bis-35: (with COMMENT)
X-BeenThere: mmusic@ietf.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: Multiparty Multimedia Session Control Working Group <mmusic.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/mmusic>, <mailto:mmusic-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/mmusic/>
List-Post: <mailto:mmusic@ietf.org>
List-Help: <mailto:mmusic-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/mmusic>, <mailto:mmusic-request@ietf.org?subject=subscribe>
X-List-Received-Date: Sat, 08 Jun 2019 09:38:47 -0000

Hi Paul,

> On 7 Jun 2019, at 15:24, Paul Kyzivat <pkyzivat@alum.mit.edu> wrote:
> 
> Alexy,
> 
> When we first realized the issues with charset we thought we were very near the end of these revisions, and there didn't seem to be much taste for opening this can of worms. But the collection of iesg comments have led me to do a fair number of revisions. So I will ask again if there is willingness to make this kind of change. The main concern is with backward compatibility - is there any use in the wild of other charsets. I doubt it, but don't have any data to back that up.

For new applications there is no need to support anything other than UTF-8.
But of course this doesn’t help you. In email space there is still quite a lot of non UTF-8 support, but this is mostly due to legacy code.

If it was me, I was just bite the bullet and at least start phasing out non UTF-8 (that is why I suggested “SHOULD NOT use any other charset”). If you don’t start now, you will never be able to deprecate them.

But of course this is for the WG to decide.
> 
> (The whole a=charset thing is a pain without much gain. Much trouble identifying things that are and aren't charset-dependent.)

I can see that :-)

Best Regards,
Alexey
> 
>    Thanks,
>    Paul
> 
>> On 6/7/19 8:09 AM, Alexey Melnikov wrote:
>> Hi Barry/Paul,
>>> On Mon, Jun 3, 2019, at 8:54 PM, Barry Leiba wrote:
>>> Hi, Paul.  Sticking my oar in with Alexey's here, just on a couple of items:
>>> 
>>>>> In Section 1:
>>>>> 
>>>>> electronic mail using the MIME   extensions [RFC5322]
>>>>> 
>>>>> This needs another reference for MIME. E.g. RFC 2045.
>>>> 
>>>> I don't understand. This paragraph is referencing examples of protocols
>>>> that can be used to *transport* SDP. RFC5322 references the mail message
>>>> format that would be used to encapsulate SDP if it were transported via
>>>> email. (Though it doesn't actually mention the *transport* protocols
>>>> used for mail messages.)
>>>> 
>>>> ISTM that it is the containing protocols that should reference rfc2045.
>>>> RFC5322 does so, and so says how to carry SDP in mail messages. SIP is
>>>> itself effectively an extension to RFC2045 though it doesn't say so.
>>> 
>>> Alexey's point is that you explicitly mention "MIME extensions" and
>>> don't provide a reference for it.  I'll go a bit farther to say that
>>> you're not just talking about message *format* here, but also SMTP as
>>> the transport (more correctly, application-layer) protocol, yes?  So
>>> this should say something more like, "electronic mail [RFC5321] using
>>> the MIME extensions [RFC2045]".  I don't think you need 5322, because
>>> 822 is cited by 2045, and that is obsoleted by 2822, and that by 5322.
>>> But I think you do need to cite SMTP and MIME.
>> Yes, exactly.
>>>>> In 6.10:
>>>>> 
>>>>>     Note that a character set specified MUST still prohibit the use of
>>>>>     bytes 0x00 (Nul), 0x0A (LF), and 0x0d (CR).
>>>>> 
>>>>> This doesn’t actually say what you intended. None of the common charsets
>>>>> prohibit these bytes. I think you meant that when using such charsets, these
>>>>> characters MUST NOT be used in values.
>>> 
>>> Adding to what Alexey says, and maybe clarifying a bit: Character set
>>> and encoding are different things.  The character set is the
>>> abstraction of the characters used, and the encoding is how they're
>>> represented.  The encoding is what creates the bytes on the wire.  One
>>> problem is that "ASCII" refers to both, so it's confusing.  But with
>>> Unicode, "Unicode" is the character set and "UTF-8" is (usually) the
>>> encoding.
>> Right. And the term "charset" is encoding of a particular character set. It might be worth using it below.
>>> But your point here is that the three byte values you list MUST NOT
>>> appear in the string, and that has nothing to do with the character
>>> set or the encoding.  Those three bytes are prohibited.
>>> 
>>> You say that quite well in Section 5:
>>> 
>>>    Text-containing fields such as the session-name-field and
>>>    information-field are octet strings that may contain any octet with
>>>    the exceptions of 0x00 (Nul), 0x0a (ASCII newline), and 0x0d (ASCII
>>>    carriage return).
>>> 
>>> ... and in 5.13:
>>> 
>>>    Attribute values are octet strings, and MAY use any octet value
>>>    except 0x00 (Nul), 0x0A (LF), and 0x0D (CR).
>>> 
>>> But in 6.10 I think you want something more like this:
>>> 
>>> OLD
>>>    Note that a character set specified MUST still prohibit the use of
>>>    bytes 0x00 (Nul), 0x0A (LF), and 0x0d (CR).  Character sets requiring
>>>    the use of these characters MUST define a quoting mechanism that
>>>    prevents these bytes from appearing within text fields.
>>> NEW
>>>    Note that the restriction specified in Section 5 applies: these strings
>>>    MUST NOT contain the bytes 0x00 (Nul), 0x0A (LF), and 0x0d (CR).
>>>    Character encodings that use these bytes MUST define a quoting
>>>    mechanism that prevents these bytes from appearing within the text
>>>    strings.
>>> END
>> I think this is much better, although "use these bytes" is still ambiguous. E.g. if these bytes are used to shift between encoding modes within a particular charset, then there is a problem. If they are just used to convey specific characters, it might not be.
>> However, see my comment below.
>>>> I don't recall what the state of character set definitions was in 1998
>>>> when this was first published. But it appears that they got carried away
>>>> and over-generalized. It is easy to understand how one might choose to
>>>> use ISO 8859-1 rather than UTF-8 since they are closely related and
>>>> byte-oriented. But it is unclear how one might use some other registered
>>>> charsets, such as EBCDIC, or other encodings of ISO 10646, such as UTF-16.
>>>> 
>>>> The bottom line is that use of alternate charsets other than 8859-1 is
>>>> underspecified. We considered revamping the definition of charset, but
>>>> didn't want to open that can of worms, since in practice it isn't an issue.
>>> 
>>> I appreciate that, and I think this isn't the place to tackle that.
>>> So we just need to get the text here to accurately reflect what you're
>>> trying to say.
>> I would actually suggest that the document should tighten the definition of which charsets are allowed. For textual media types we now recommend use of UTF-8 (which should be the default) and possibly allowing a few others.
>> So I suggest that the new definition of a=charset be along the lines of "MUST support UTF-8 and US-ASCII. MAY support ISO-8859-1. SHOULD NOT use any other charsets".
>>> Hoping to be helpful,
>>> Barry
>>> 
>