UTF-8 issues (Re: Open issues in DHCP FQDN, DHCID and DDNS-DHCP Related RFCs)

Harald Tveit Alvestrand <harald@alvestrand.no> Thu, 02 February 2006 23:13 UTC

From: Harald Tveit Alvestrand <harald@alvestrand.no>
Subject: UTF-8 issues (Re: Open issues in DHCP FQDN, DHCID and DDNS-DHCP Related RFCs)
Date: Thu, 02 Feb 2006 15:13:16 -0800
Lines: 90
References: <C007E603.D137%rdroms@cisco.com>
Mime-Version: 1.0
Content-Type: multipart/signed; micalg="pgp-sha1"; protocol="application/pgp-signature"; boundary="==========26E6B02998774739D698=========="
Cc: Stig Venaas <Stig.Venaas@uninett.no>, Olaf Kolkman <olaf@nlnetlabs.nl>, Ólafur Gu›mundsson /DNSEXT co-chair <ogud@ogud.com>
X-From: owner-namedroppers@ops.ietf.org Fri Feb 03 00:24:40 2006
Return-path: <owner-namedroppers@ops.ietf.org>
X-Spam-Checker-Version: SpamAssassin 3.1.0 (2005-09-13) on psg.com
X-Spam-Level:
X-Spam-Status: No, score=-2.4 required=5.0 tests=AWL,BAYES_00, DNS_FROM_RFC_ABUSE,FORGED_RCVD_HELO autolearn=no version=3.1.0
To: Ralph Droms <rdroms@cisco.com>, namedroppers@ops.ietf.org, dhcwg <dhcwg@ietf.org>
In-Reply-To: <C007E603.D137%rdroms@cisco.com>
X-Mailer: Mulberry/4.0.3 (Win32)
X-Virus-Scanned: by amavisd-new at alvestrand.no
Sender: owner-namedroppers@ops.ietf.org
Precedence: bulk
X-Message-ID:
Message-ID: <20140418072128.2560.64369.ARCHIVE@ietfa.amsl.com>


--On 2. februar 2006 16:38 -0500 Ralph Droms <rdroms@cisco.com> wrote:

> Included below is a summary list of the open issues in this package of
> documents:
>
> draft-ietf-dnsext-dhcid-rr-10.txt
> draft-ietf-dhc-ddns-resolution-10.txt
> draft-ietf-dhc-dhcpv6-fqdn-03.txt
> draft-ietf-dhc-fqdn-option-11.txt
> 11. UTF-8 character set usage (Harald Alvestrand, gen-art)
>
> Issue 11 needs some clarification; Harald, I hope you'll kick off a
> separate thread to discuss how to resolve this issue.
>
> - Ralph, for Olafur, Stig and Olaf

I'll try....
back in the days (November - how time flies), my comments on 
draft-ietf-dhc-fqdn-option-11.txt said:

>A smaller piece of missing material is this, from section 3.3.1 of the v4
>document:
>
>   Clients and servers SHOULD follow the character-set recommendations
>   of RFC 1034 [2] and RFC 1035 [3].  However, implementers SHOULD also
>   be aware that some client software could be using UTF-8 [10]
>   character encoding.  This specification does not require any support
>   for UTF-8.
>
>Given DNS experts' insistence that the DNS protocol does not place any
>limits on character sets, it would be good to have a clearer pointer to
>what recommendations are intended here.
>Given that it's in section 3.3.1 (Deprecated ASCII encoding), one can
>assume that this refers to a character set recommendation for characters 
in
>the ASCII encoding; a DNS encoded name should probably be used "as is", no
>matter what it contains (although it would be good if the doc said so).
>
>
>It's also unclear from the text whether or not there's a trailing dot on
>the name in the deprecated ASCII encoding (there's no need for one, since 
a
>non-FQDN name can only be one component, but other fields of use have done
>this in the past, so it's best to be clear that this field does not do so).

On the point of character set recommendations, I think what you should be 
pointing at is not 1034 and 1035 in general, but say that they SHOULD 
follow the recommendation of RFC 1035 section 2.3.1.
Since the rule was actually relaxed in RFC 1123 section 2.1, I think the 
sentence should say "SHOULD follow the character set recommendation of RFC 
1035 section 2.3.1, as modified by RFC 1123 section 2.1".

That's not an UTF-8 issue, but is a charset issue. The UTF-8 text is a 
cop-out (I think), but it's appropriate to cop-out here; you're basically 
saying "don't be surprised if people don't do what this spec recommends".

Does this make sense now?

                         Harald