Re: [VCARDDAV] Default charset

Peter Saint-Andre <stpeter@stpeter.im> Mon, 22 February 2010 15:03 UTC

Return-Path: <stpeter@stpeter.im>
X-Original-To: vcarddav@core3.amsl.com
Delivered-To: vcarddav@core3.amsl.com
Received: from localhost (localhost [127.0.0.1]) by core3.amsl.com (Postfix) with ESMTP id F3FAD28C1E9 for <vcarddav@core3.amsl.com>; Mon, 22 Feb 2010 07:03:23 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -2.585
X-Spam-Level:
X-Spam-Status: No, score=-2.585 tagged_above=-999 required=5 tests=[AWL=0.014, BAYES_00=-2.599]
Received: from mail.ietf.org ([64.170.98.32]) by localhost (core3.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id Njz7cSY5QeUI for <vcarddav@core3.amsl.com>; Mon, 22 Feb 2010 07:03:22 -0800 (PST)
Received: from stpeter.im (stpeter.im [207.210.219.233]) by core3.amsl.com (Postfix) with ESMTP id A374D3A83EB for <vcarddav@ietf.org>; Mon, 22 Feb 2010 07:03:22 -0800 (PST)
Received: from squire.local (dsl-140-211.dynamic-dsl.frii.net [216.17.140.211]) (Authenticated sender: stpeter) by stpeter.im (Postfix) with ESMTPSA id 8106F40126 for <vcarddav@ietf.org>; Mon, 22 Feb 2010 08:05:20 -0700 (MST)
Message-ID: <4B829D1F.6030908@stpeter.im>
Date: Mon, 22 Feb 2010 08:05:03 -0700
From: Peter Saint-Andre <stpeter@stpeter.im>
User-Agent: Mozilla/5.0 (Macintosh; U; Intel Mac OS X 10.5; en-US; rv:1.9.1.7) Gecko/20100111 Thunderbird/3.0.1
MIME-Version: 1.0
To: vcarddav@ietf.org
References: <C1BC44B4FBF6BE3AB2FC712D@caldav.corp.apple.com>
In-Reply-To: <C1BC44B4FBF6BE3AB2FC712D@caldav.corp.apple.com>
X-Enigmail-Version: 1.0.1
OpenPGP: url=http://www.saint-andre.com/me/stpeter.asc
Content-Type: multipart/signed; protocol="application/pkcs7-signature"; micalg="sha1"; boundary="------------ms050100010902060206050808"
Subject: Re: [VCARDDAV] Default charset
X-BeenThere: vcarddav@ietf.org
X-Mailman-Version: 2.1.9
Precedence: list
List-Id: IETF vcarddav wg mailing list <vcarddav.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/listinfo/vcarddav>, <mailto:vcarddav-request@ietf.org?subject=unsubscribe>
List-Archive: <http://www.ietf.org/mail-archive/web/vcarddav>
List-Post: <mailto:vcarddav@ietf.org>
List-Help: <mailto:vcarddav-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/vcarddav>, <mailto:vcarddav-request@ietf.org?subject=subscribe>
X-List-Received-Date: Mon, 22 Feb 2010 15:03:24 -0000

On 2/22/10 7:58 AM, Cyrus Daboo wrote:
> Hi folks,
> I just came across the following link:
> <http://microformats.org/wiki/vcard-errata>. This claims that RFC2426
> was wrong to remove the CHARSET parameter on properties (which I
> personally do not believe - and no formal IETF errata has ever been
> posted for that). Apparently some clients require CHARSET parameters on
> properties for VERSION:3.0 too:
> <http://help.wugnet.com/office/Outlook-vCard-UTF-ftopict1163209.html>.
> 
> Clearly we need to clean up this mess. Now, the current 4.0 draft does
> state clearly that for a MIME object with a Content-Type header, the
> default charset is UTF-8 (Section 10.1) for the newly registered media
> type text/vcard. What it does not do is explain what charset to use for
> a "standalone" file (dealing with the microformats issue). iCalendar is
> explicit in stating that the default charset for an iCalendar object is
> UTF-8 (<http://tools.ietf.org/html/rfc5545> Section 3.1.4). I believe
> the 4.0 specification should be just as explicit about this.

+1

> Proposed changes:
> 
> - Add a new Section 3.4 with text matching that in RFC5545 Section 3.1.4:
> 
>    3.4.  Character Set
> 
>       There is not a property parameter to declare the charset used in a
>       property value.  The default charset for vCard data is UTF-8
>       as defined in [RFC3629].

The term "default" might be taken to imply that one can override the
default, but that seems not to be the case.

>       The "charset" Content-Type parameter MUST be used in MIME transports
>       to specify the charset being used.

Which is always UTF-8?

> - First paragraph of Section 4.1 needs to be adjusted to remove
> reference to charset given the addition of Section 3.4 as above.
> 
> - Add an "Internationalization Considerations" with text matching that
> in RFC5545 (note this may be stronger than we want in that it mandates
> use of utf-8 only):
> 
>    XX.  Internationalization Considerations
> 
>       Applications MUST generate vCard data in the UTF-8 charset and
>       MUST accept vCard data in the UTF-8 or US-ASCII charset.

Is this indeed stronger than we want? That is, do we ever want any
charset other than UTF-8? I don't think so.

Peter

-- 
Peter Saint-Andre
https://stpeter.im/