Re: #428 Accept-Language ordering for identical qvalues

Amos Jeffries <squid3@treenet.co.nz> Thu, 24 January 2013 10:19 UTC

Return-Path: <ietf-http-wg-request@listhub.w3.org>
X-Original-To: ietfarch-httpbisa-archive-bis2Juki@ietfa.amsl.com
Delivered-To: ietfarch-httpbisa-archive-bis2Juki@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 9FDD621F8552 for <ietfarch-httpbisa-archive-bis2Juki@ietfa.amsl.com>; Thu, 24 Jan 2013 02:19:26 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -9.845
X-Spam-Level:
X-Spam-Status: No, score=-9.845 tagged_above=-999 required=5 tests=[AWL=0.754, BAYES_00=-2.599, RCVD_IN_DNSWL_HI=-8]
Received: from mail.ietf.org ([64.170.98.30]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id w9ifZw96BJWR for <ietfarch-httpbisa-archive-bis2Juki@ietfa.amsl.com>; Thu, 24 Jan 2013 02:19:26 -0800 (PST)
Received: from frink.w3.org (frink.w3.org [128.30.52.56]) by ietfa.amsl.com (Postfix) with ESMTP id 0078421F8941 for <httpbisa-archive-bis2Juki@lists.ietf.org>; Thu, 24 Jan 2013 02:19:25 -0800 (PST)
Received: from lists by frink.w3.org with local (Exim 4.72) (envelope-from <ietf-http-wg-request@listhub.w3.org>) id 1TyJt3-0006KR-AT for ietf-http-wg-dist@listhub.w3.org; Thu, 24 Jan 2013 10:18:25 +0000
Resent-Date: Thu, 24 Jan 2013 10:18:25 +0000
Resent-Message-Id: <E1TyJt3-0006KR-AT@frink.w3.org>
Received: from lisa.w3.org ([128.30.52.41]) by frink.w3.org with esmtp (Exim 4.72) (envelope-from <squid3@treenet.co.nz>) id 1TyJsy-0006Hy-OK for ietf-http-wg@listhub.w3.org; Thu, 24 Jan 2013 10:18:20 +0000
Received: from ip-58-28-153-233.static-xdsl.xnet.co.nz ([58.28.153.233] helo=treenet.co.nz) by lisa.w3.org with esmtp (Exim 4.72) (envelope-from <squid3@treenet.co.nz>) id 1TyJsx-0002el-58 for ietf-http-wg@w3.org; Thu, 24 Jan 2013 10:18:20 +0000
Received: from [192.168.1.103] (unknown [14.1.64.4]) by treenet.co.nz (Postfix) with ESMTP id EEB26E6ED4; Thu, 24 Jan 2013 23:17:53 +1300 (NZDT)
Message-ID: <51010A49.4010207@treenet.co.nz>
Date: Thu, 24 Jan 2013 23:17:45 +1300
From: Amos Jeffries <squid3@treenet.co.nz>
User-Agent: Mozilla/5.0 (Windows NT 6.1; WOW64; rv:17.0) Gecko/20130107 Thunderbird/17.0.2
MIME-Version: 1.0
To: Julian Reschke <julian.reschke@gmx.de>
CC: ietf-http-wg@w3.org
References: <50F6CD98.8080802@gmx.de> <99A8B4D1-BE1B-4965-9B78-1EC90455E102@mnot.net> <F4C2A095-50C7-451B-9AFF-A200592CCB4D@gbiv.com> <98F554C9-4FCB-47E4-A018-FE02558FEA49@mnot.net> <E5B8C951-9C05-4CA4-8A17-2636FEF2A9E9@mnot.net> <5100F038.6050902@gmx.de> <5100F6CC.9000105@treenet.co.nz> <510103E9.7060502@gmx.de>
In-Reply-To: <510103E9.7060502@gmx.de>
Content-Type: text/plain; charset=ISO-8859-1; format=flowed
Content-Transfer-Encoding: 7bit
Received-SPF: pass client-ip=58.28.153.233; envelope-from=squid3@treenet.co.nz; helo=treenet.co.nz
X-W3C-Hub-Spam-Status: No, score=-3.0
X-W3C-Hub-Spam-Report: AWL=-3.043, SPF_HELO_PASS=-0.001, SPF_PASS=-0.001
X-W3C-Scan-Sig: lisa.w3.org 1TyJsx-0002el-58 af5ccdcc0b715da419fa2980a80dc9e5
X-Original-To: ietf-http-wg@w3.org
Subject: Re: #428 Accept-Language ordering for identical qvalues
Archived-At: <http://www.w3.org/mid/51010A49.4010207@treenet.co.nz>
Resent-From: ietf-http-wg@w3.org
X-Mailing-List: <ietf-http-wg@w3.org> archive/latest/16146
X-Loop: ietf-http-wg@w3.org
Resent-Sender: ietf-http-wg-request@w3.org
Precedence: list
List-Id: <ietf-http-wg.w3.org>
List-Help: <http://www.w3.org/Mail/>
List-Post: <mailto:ietf-http-wg@w3.org>
List-Unsubscribe: <mailto:ietf-http-wg-request@w3.org?subject=unsubscribe>

On 24/01/2013 10:50 p.m., Julian Reschke wrote:
> On 2013-01-24 09:54, Amos Jeffries wrote:
>> On 24/01/2013 9:26 p.m., Julian Reschke wrote:
>>> On 2013-01-24 02:17, Mark Nottingham wrote:
>>>> So, does anyone have an issue with making ordering significant when
>>>> there's no qvalue for *all* headers that use qvalues?
>>>> ..
>>>
>>> I still do, and I'd prefer we go back to what the spec has been saying
>>> for well over a decade.
>>>
>>> What *real* problem are we solving with this change that justifies
>>> making current implementations broken?
>>
>> Problem 1) a lot of agents (~57% by unique U-A string) are using
>> q-values to specify ordering where the spec says "unordered".
>
> Not sure what you're trying to say here. If they send q-values there 
> is no doubt about the semantics, right?
>
> Also, counting unique UA strings generates a totally distorted statistic.

Somewhat yes. However IME it distorts the numbers to be more inline with 
total traffic load %'s each agent places on the network overall.

>
>> Problem 2) a majority of the remaining agents appear to be treating the
>> field-value as an ordered list of preferences even without q-values.
>
> Recipients, I assume? How is that a problem? They choose one plausible 
> interpretation where the spec doesn't define one.
>

Senders.

The pattern is clearly a country-specific code, a generic language code 
and possibly followed by a few similar languages in decreasing order of 
similarity, possibly followed by wildcards.
The pattern is strong and almost identical for both agents sending 
ordered q-values and agents sending no q-values at all.
The errors all appear to be at the interface between mechanisms with 
confusion evident when they are mixed in one request header.

Problem is that the original spec wording you are arguing to keep says 
it is a wrong interpretation. One which leads to all these being problems.

>> Problem 3)  ~1% of agents are incorectly implementing q-values. (see my
>> earlier post responding to your request for examples).
>
> Are these agents widely used? Can they be fixed? Did you report bugs 
> against them?
>
>> Problem 4) q-values being mandatory when preference order is wanted adds
>> complexity on both ends of the transaction, causing unnecessary CPU
>> burden on the recipient. Misunderstandings and a host of needless
>> mistakes by end-users and developers alike. (again see my earlier post
>> for examples).
>
> That is true, but we can't remove q values at this point. Note we are 
> discussing HTTP/1.1.

But we can make the primary need for them decrease by making the header 
ordered by default.

>
>> Problem 5) On the Accept-Language header the bandwidth required to
>> transmit q-values inflates the header size by at least 55% (ISO code:
>> 2-5 bytes, q-v component: 6 bytes).
>
> For that field value, yes.

That is the field-value we are discussing though.


Amos