Re: [weirds] feedback on draft-designteam-weirds-using-http

Francisco Obispo <fobispo@isc.org> Thu, 17 May 2012 20:53 UTC

To: Julian Reschke <julian.reschke@gmx.de>
Cc: weirds@ietf.org

I did a bit more research, and http://www.w3.org/International/questions/qa-controls proposes what I had in mind: encoding the content with something like base64 and decoding it before presentation (ugly).
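
A rough sketch of that workaround, assuming Python's standard library. The element name "remark" and the "encoding" attribute are made up for illustration and do not come from the draft:

```python
import base64
import xml.etree.ElementTree as ET


def parses(xml_text: str) -> bool:
    """Return True if the XML 1.0 parser accepts the text."""
    try:
        ET.fromstring(xml_text)
        return True
    except ET.ParseError:
        return False


def wrap(raw: str) -> str:
    """Base64-encode raw text and place it in a (hypothetical) <remark> element."""
    encoded = base64.b64encode(raw.encode("utf-8")).decode("ascii")
    elem = ET.Element("remark", {"encoding": "base64"})
    elem.text = encoded
    return ET.tostring(elem, encoding="unicode")


def unwrap(xml_text: str) -> str:
    """Parse the element and decode its base64 payload before presentation."""
    elem = ET.fromstring(xml_text)
    return base64.b64decode(elem.text).decode("utf-8")


raw = "status\x07 with a BEL control code"

# The control code placed directly in the document is rejected by the parser.
raw_parses = parses("<remark>" + raw + "</remark>")

# The base64 form contains only ASCII letters, digits, "+", "/" and "=".
xml_text = wrap(raw)
restored = unwrap(xml_text)
```

The cost is the one already noted: the payload is no longer readable in the document and every consumer has to know to decode it.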

I might be missing something, but why are we arguing about this? I don't think we're planning to pass control codes within the XML.

Francisco


On May 17, 2012, at 1:24 PM, Julian Reschke wrote:

> On 2012-05-17 22:13, Francisco Obispo wrote:
>> If you need to specify non-ASCII characters in an XML document, you have to either:
>> 
>> a) Encode the characters as XML character references, e.g. ñ -> &#241;, or
> 
> 1) You can use those "as-is", if you declare the document encoding properly (such as by using UTF-8).
> 
> 2) That doesn't change the repertoire of characters you can use.
> 
> 3) Also, it's ASCII control characters which are a problem.
> 
>> b) Use a CDATA section (unparsed character data) [1]
>> 
>> [1] http://www.w3schools.com/xml/xml_cdata.asp
> 
> -> <http://w3fools.com/>
> 
>> The latter approach seems to be more efficient for some.
> 
> And it doesn't help at all. In CDATA, you don't need to escape "<" and "&", but in turn there are *other* sequences you can't use. See <http://www.w3.org/TR/REC-xml/#sec-cdata-sect> -- you can't have the string "]]>" inside CDATA because it ends the CDATA section.
> 
> My recommendation is never ever to use CDATA except when hand-editing *and* when you absolutely positively know what you are doing. If you generate with a program, don't ever use it.
> 
> Best regards, Julian
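
The CDATA point above can be sketched as follows, assuming Python's standard library. The element name "v" and the sample value are made up for illustration:

```python
import xml.etree.ElementTree as ET

# A value containing "<", "&", a non-ASCII character and the "]]>" sequence.
value = "señor a < b && c ]]> d"


def round_trips(xml_text: str, expected: str) -> bool:
    """Return True only if the document parses and yields the expected text."""
    try:
        return ET.fromstring(xml_text).text == expected
    except ET.ParseError:
        return False


# Naive CDATA wrapping: the "]]>" inside the value ends the section early,
# so the original text cannot be recovered from the document.
naive = "<v><![CDATA[" + value + "]]></v>"
naive_ok = round_trips(naive, value)

# Letting a serializer do the work: it escapes "<", "&" and ">" itself
# and leaves the non-ASCII character as-is.
elem = ET.Element("v")
elem.text = value
generated = ET.tostring(elem, encoding="unicode")
generated_ok = round_trips(generated, value)
```

Here the serializer-generated document round-trips and the hand-built CDATA one does not, which is the reason for not using CDATA in generated output.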

Francisco Obispo 
email: fobispo@isc.org
Phone: +1 650 423 1374 || INOC-DBA *3557* NOC
PGP KeyID = B38DB1BE