Re: [apps-discuss] Thoughts on text/* encoding defaults

Dzonatas Sol <dzonatas@gmail.com> Mon, 06 June 2011 18:57 UTC

Message-ID: <4DED22F1.6050809@gmail.com>
Date: Mon, 06 Jun 2011 11:56:49 -0700
From: Dzonatas Sol <dzonatas@gmail.com>
User-Agent: Mozilla/5.0 (X11; U; Linux x86_64; en-US; rv:1.9.1.16) Gecko/20110505 Icedove/3.0.11
MIME-Version: 1.0
To: apps-discuss@ietf.org
References: <4DECCB27.4030209@gmx.de>
In-Reply-To: <4DECCB27.4030209@gmx.de>
Content-Type: text/plain; charset="UTF-8"; format="flowed"
Content-Transfer-Encoding: 7bit
Subject: Re: [apps-discuss] Thoughts on text/* encoding defaults

On 06/06/2011 05:42 AM, Julian Reschke wrote:
>
> Left to do:
>
> a) Revise RFC 2046; allow text/* types that carry encoding information 
> inline to do the expected thing (overriding the US-ASCII default); 
> warn against doing so in new registrations (recommend to only support 
> UTF-8, and require to always explicitly include the charset parameter, 
> such as text/vcard is going to do it?)
>
> b) Revise RFC 3023 to delegate text/xml charset defaults to revision 
> of 2046?
>

This has probably been said before by someone else, yet something about
the above action concerns me: if we do not first assume "us-ascii",
there may be a loss of compatibility with 7bit communications (and
compression algorithms), especially in hardware that does not receive
updates as the known standards change. I know UTF-8 includes the ASCII
literals, yet if we strip the 8th bit from unicode-to-ascii escape
sequences, is the media content still compatible (even size-wise,
before and after compression)? Or would this cause unknown error
correction where UTF-8 is an explicit replacement for "us-ascii"? I
think there is a significant point to 7bit being a "must", with that
7bit mode being specifically implied by "ascii", because the 8th bit in
UTF-8 is not "human readable" with regard to plain XML. To only support
UTF-8 means one more bit is being added to the 7bit standard.
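
To make the 8th-bit question concrete, here is a minimal sketch in
Python. The helper strip_eighth_bit and the sample strings are
hypothetical and only for illustration; the sketch assumes a channel
that simply clears the high bit of every byte, which is one possible
failure mode and not something any of the RFCs above specifies.

```python
def strip_eighth_bit(data: bytes) -> bytes:
    """Clear the high bit of every byte, as a 7bit-only path might (assumption)."""
    return bytes(b & 0x7F for b in data)


# Hypothetical sample content: one pure-ASCII string, one with a non-ASCII letter.
ascii_text = "<name>plain</name>"
non_ascii_text = "<name>caf\u00e9</name>"  # U+00E9 is "e" with an acute accent

ascii_bytes = ascii_text.encode("utf-8")
non_ascii_bytes = non_ascii_text.encode("utf-8")

# Pure ASCII content is byte-for-byte identical in us-ascii and UTF-8,
# so clearing the 8th bit leaves it unchanged.
same_encoding = ascii_bytes == ascii_text.encode("us-ascii")
ascii_survives = strip_eighth_bit(ascii_bytes) == ascii_bytes

# Non-ASCII content uses bytes with the 8th bit set; clearing that bit
# keeps the length but silently changes the content.
stripped = strip_eighth_bit(non_ascii_bytes)
non_ascii_survives = stripped == non_ascii_bytes
same_length = len(stripped) == len(non_ascii_bytes)
stripped_text = stripped.decode("us-ascii")

# An XML numeric character reference keeps the same content within 7 bits.
escaped = non_ascii_text.encode("us-ascii", errors="xmlcharrefreplace")
escaped_survives = strip_eighth_bit(escaped) == escaped

print(same_encoding, ascii_survives, non_ascii_survives, same_length)
print(stripped_text)
print(escaped.decode("us-ascii"))
```

Under that assumption the ASCII-only content passes through untouched,
the non-ASCII content keeps its size but no longer decodes to the same
text, and the character-reference form stays within 7 bits at the cost
of a few extra bytes.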

I do recognize you want to resolve the conflict for further support with
ISO & I18N. I think what you want is why some tried to advance "code
pages" in this context.

-- 
--- https://twitter.com/Dzonatas_Sol ---
Web Development, Software Engineering, Virtual Reality, Consultant