Re: [MMUSIC] Alexey Melnikov's No Objection on draft-ietf-mmusic-rfc4566bis-35: (with COMMENT)

"Alexey Melnikov" <aamelnikov@fastmail.fm> Fri, 07 June 2019 12:10 UTC

Return-Path: <aamelnikov@fastmail.fm>
X-Original-To: mmusic@ietfa.amsl.com
Delivered-To: mmusic@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 7B02312003F; Fri, 7 Jun 2019 05:10:31 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -2.7
X-Spam-Level:
X-Spam-Status: No, score=-2.7 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, FREEMAIL_FROM=0.001, RCVD_IN_DNSWL_LOW=-0.7, SPF_PASS=-0.001] autolearn=ham autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (2048-bit key) header.d=fastmail.fm header.b=eBHHNwVE; dkim=pass (2048-bit key) header.d=messagingengine.com header.b=RtVpyJnD
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id 9cLi9C5CeVFC; Fri, 7 Jun 2019 05:10:29 -0700 (PDT)
Received: from out3-smtp.messagingengine.com (out3-smtp.messagingengine.com [66.111.4.27]) (using TLSv1.2 with cipher AECDH-AES256-SHA (256/256 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 841881200A4; Fri, 7 Jun 2019 05:10:28 -0700 (PDT)
Received: from compute7.internal (compute7.nyi.internal [10.202.2.47]) by mailout.nyi.internal (Postfix) with ESMTP id 0B7A7221B2; Fri, 7 Jun 2019 08:10:27 -0400 (EDT)
Received: from imap1 ([10.202.2.51]) by compute7.internal (MEProxy); Fri, 07 Jun 2019 08:10:27 -0400
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=fastmail.fm; h= mime-version:message-id:in-reply-to:references:date:from:to:cc :subject:content-type:content-transfer-encoding; s=fm3; bh=SzQCh lVf5s5T2WTlTmcO1lDXYRAgPm31f7JYG9kuo98=; b=eBHHNwVEvLTKh9VzXcAVk +fSMsCDkkky7HuqLdWmnRo1aXPl3k8yCTjTqhKqYKmJu1r4qA3UMMm1HDYMo7jqI +Y7OdG9lkDpfpCcS9po0qk192O5TQlXOU7Etj5NyqGi8A69b/NO6QkAmDH+OOHGy Nvx3/E3To6qv3KTTDI+SvNiBBD6YdJC7H5SQ6GNjrp1hTqs/tGKxPb0gyoFuvD4F bvFuT8f0ItNlHUOKCmB/Y9Y/DeOBmZTW8nmczFi3YkrPggA3a7atWg2HC2qsxkii S1V1AH84NZHo/1P8bN9WbikbzFE16qpG9NG6KVO4dexqHRk1NM5YxpWXhEtA3Mq7 g==
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d= messagingengine.com; h=cc:content-transfer-encoding:content-type :date:from:in-reply-to:message-id:mime-version:references :subject:to:x-me-proxy:x-me-proxy:x-me-sender:x-me-sender :x-sasl-enc; s=fm2; bh=SzQChlVf5s5T2WTlTmcO1lDXYRAgPm31f7JYG9kuo 98=; b=RtVpyJnDzBkywM4McHqsIpR2lBcMsiU8mjNpp9bYVtN9ghrIpatWHEn2o gDpv8hHPAJyQ9w66xwpoYv22R/yYXp89VIBYwHm3mhkxWC37SlYMXYeS02Fn521k M32N6IpHTFpH0P+t9ofAwAjpDlJfK2siAptTXBy7WFEZCrel1LSJEyzEJGRWrFoZ xW0s+Y+BIhdRyAG/C3yksW0cygZUm+x0uQZn96rQhM93CtxrkcRilwFv9L2lir2F wzf5oZUMVxinDgTJnlUdeivYLyaE4igwOC7EUiRgWmQhI9sc7es1OwBnyJqaH5UY /aT2kWTQHi5fosS/Z8b8mr/a/3eCQ==
X-ME-Sender: <xms:MVT6XHRtT7kpHAC67IB7_dfTunvHzwVfNFXwFdVMAqX73TObTQ9I5g>
X-ME-Proxy-Cause: gggruggvucftvghtrhhoucdtuddrgeduuddrudegiedggeejucetufdoteggodetrfdotf fvucfrrhhofhhilhgvmecuhfgrshhtofgrihhlpdfqfgfvpdfurfetoffkrfgpnffqhgen uceurghilhhouhhtmecufedttdenucesvcftvggtihhpihgvnhhtshculddquddttddmne cujfgurhepofgfggfkjghffffhvffutgfgsehtqhertderreejnecuhfhrohhmpedftehl vgigvgihucfovghlnhhikhhovhdfuceorggrmhgvlhhnihhkohhvsehfrghsthhmrghilh drfhhmqeenucfrrghrrghmpehmrghilhhfrhhomheprggrmhgvlhhnihhkohhvsehfrghs thhmrghilhdrfhhmnecuvehluhhsthgvrhfuihiivgeptd
X-ME-Proxy: <xmx:MVT6XI_ubH8UeBYXji-0uWHj3_DamkilPhrfMGkSV2UNJscL5IWG0w> <xmx:MVT6XPdxbWH0gnncrHOMYy-CqtmQWOVFcte-HCXAIHTPfq2q6UEfxQ> <xmx:MVT6XClnMWgxlFCjBfeqbkBlXXdjiPp1NHtDjaTawigWP3En0kpLlA> <xmx:M1T6XBC_ACS02RojV7bUn1r9XBv3fHmPsQ10mYo5YhDuJqtFDXcotg>
Received: by mailuser.nyi.internal (Postfix, from userid 501) id D9DFEC200A3; Fri, 7 Jun 2019 08:10:25 -0400 (EDT)
X-Mailer: MessagingEngine.com Webmail Interface
User-Agent: Cyrus-JMAP/3.1.6-663-gf46ad30-fmstable-20190607v1
Mime-Version: 1.0
Message-Id: <d1954b5e-f7bb-40e1-88dc-5565212b517d@www.fastmail.com>
In-Reply-To: <CALaySJJjwG26NLCJqFdo2yW_JhYCYbY+ADHENa490XqM539U2A@mail.gmail.com>
References: <155922060388.22145.12090008162284261785.idtracker@ietfa.amsl.com> <5b944fc8-3f97-55e6-2faf-45bfd11c5837@alum.mit.edu> <CALaySJJjwG26NLCJqFdo2yW_JhYCYbY+ADHENa490XqM539U2A@mail.gmail.com>
Date: Fri, 07 Jun 2019 13:09:57 +0100
From: Alexey Melnikov <aamelnikov@fastmail.fm>
To: Barry Leiba <barryleiba@computer.org>, Paul Kyzivat <pkyzivat@alum.mit.edu>
Cc: The IESG <iesg@ietf.org>, Flemming Andreasen <fandreas@cisco.com>, mmusic-chairs@ietf.org, draft-ietf-mmusic-rfc4566bis@ietf.org, mmusic@ietf.org
Content-Type: text/plain; charset="utf-8"
Content-Transfer-Encoding: quoted-printable
Archived-At: <https://mailarchive.ietf.org/arch/msg/mmusic/xQEhXkyoSIBY1_pWnOExS_NsqPk>
Subject: Re: [MMUSIC] Alexey Melnikov's No Objection on draft-ietf-mmusic-rfc4566bis-35: (with COMMENT)
X-BeenThere: mmusic@ietf.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: Multiparty Multimedia Session Control Working Group <mmusic.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/mmusic>, <mailto:mmusic-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/mmusic/>
List-Post: <mailto:mmusic@ietf.org>
List-Help: <mailto:mmusic-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/mmusic>, <mailto:mmusic-request@ietf.org?subject=subscribe>
X-List-Received-Date: Fri, 07 Jun 2019 12:10:32 -0000

Hi Barry/Paul,

On Mon, Jun 3, 2019, at 8:54 PM, Barry Leiba wrote:
> Hi, Paul.  Sticking my oar in with Alexey's here, just on a couple of items:
> 
> > > In Section 1:
> > >
> > > electronic mail using the MIME   extensions [RFC5322]
> > >
> > > This needs another reference for MIME. E.g. RFC 2045.
> >
> > I don't understand. This paragraph is referencing examples of protocols
> > that can be used to *transport* SDP. RFC5322 references the mail message
> > format that would be used to encapsulate SDP if it were transported via
> > email. (Though it doesn't actually mention the *transport* protocols
> > used for mail messages.)
> >
> > ISTM that it is the containing protocols that should reference rfc2045.
> > RFC5322 does so, and so says how to carry SDP in mail messages. SIP is
> > itself effectively an extension to RFC2045 though it doesn't say so.
> 
> Alexey's point is that you explicitly mention "MIME extensions" and
> don't provide a reference for it.  I'll go a bit farther to say that
> you're not just talking about message *format* here, but also SMTP as
> the transport (more correctly, application-layer) protocol, yes?  So
> this should say something more like, "electronic mail [RFC5321] using
> the MIME extensions [RFC2045]".  I don't think you need 5322, because
> 822 is cited by 2045, and that is obsoleted by 2822, and that by 5322.
> But I think you do need to cite SMTP and MIME.

Yes, exactly.

> > > In 6.10:
> > >
> > >     Note that a character set specified MUST still prohibit the use of
> > >     bytes 0x00 (Nul), 0x0A (LF), and 0x0d (CR).
> > >
> > > This doesn’t actually say what you intended. None of the common charsets
> > > prohibit these bytes. I think you meant that when using such charsets, these
> > > characters MUST NOT be used in values.
> 
> Adding to what Alexey says, and maybe clarifying a bit: Character set
> and encoding are different things.  The character set is the
> abstraction of the characters used, and the encoding is how they're
> represented.  The encoding is what creates the bytes on the wire.  One
> problem is that "ASCII" refers to both, so it's confusing.  But with
> Unicode, "Unicode" is the character set and "UTF-8" is (usually) the
> encoding.

Right. And the term "charset" is encoding of a particular character set. It might be worth using it below.

> But your point here is that the three byte values you list MUST NOT
> appear in the string, and that has nothing to do with the character
> set or the encoding.  Those three bytes are prohibited.
> 
> You say that quite well in Section 5:
> 
>    Text-containing fields such as the session-name-field and
>    information-field are octet strings that may contain any octet with
>    the exceptions of 0x00 (Nul), 0x0a (ASCII newline), and 0x0d (ASCII
>    carriage return).
> 
> ... and in 5.13:
> 
>    Attribute values are octet strings, and MAY use any octet value
>    except 0x00 (Nul), 0x0A (LF), and 0x0D (CR).
> 
> But in 6.10 I think you want something more like this:
> 
> OLD
>    Note that a character set specified MUST still prohibit the use of
>    bytes 0x00 (Nul), 0x0A (LF), and 0x0d (CR).  Character sets requiring
>    the use of these characters MUST define a quoting mechanism that
>    prevents these bytes from appearing within text fields.
> NEW
>    Note that the restriction specified in Section 5 applies: these strings
>    MUST NOT contain the bytes 0x00 (Nul), 0x0A (LF), and 0x0d (CR).
>    Character encodings that use these bytes MUST define a quoting
>    mechanism that prevents these bytes from appearing within the text
>    strings.
> END

I think this is much better, although "use these bytes" is still ambiguous. E.g. if these bytes are used to shift between encoding modes within a particular charset, then there is a problem. If they are just used to convey specific characters, it might not be.

However, see my comment below.

> > I don't recall what the state of character set definitions was in 1998
> > when this was first published. But it appears that they got carried away
> > and over-generalized. It is easy to understand how one might choose to
> > use ISO 8859-1 rather than UTF-8 since they are closely related and
> > byte-oriented. But it is unclear how one might use some other registered
> > charsets, such as EBCDIC, or other encodings of ISO 10646, such as UTF-16.
> >
> > The bottom line is that use of alternate charsets other than 8859-1 is
> > underspecified. We considered revamping the definition of charset, but
> > didn't want to open that can of worms, since in practice it isn't an issue.
> 
> I appreciate that, and I think this isn't the place to tackle that.
> So we just need to get the text here to accurately reflect what you're
> trying to say.

I would actually suggest that the document should tighten the definition of which charsets are allowed. For textual media types we now recommend use of UTF-8 (which should be the default) and possibly allowing a few others.

So I suggest that the new definition of a=charset be along the lines of "MUST support UTF-8 and US-ASCII. MAY support ISO-8859-1. SHOULD NOT use any other charsets".

> Hoping to be helpful,
> Barry
>