Re: Extending news to EAI

Julien ÉLIE <julien@trigofacile.com> Wed, 03 February 2010 22:01 UTC

Return-Path: <owner-ietf-usefor@mail.imc.org>
X-Original-To: ietfarch-usefor-archive@core3.amsl.com
Delivered-To: ietfarch-usefor-archive@core3.amsl.com
Received: from localhost (localhost [127.0.0.1]) by core3.amsl.com (Postfix) with ESMTP id CA9F83A6B4F for <ietfarch-usefor-archive@core3.amsl.com>; Wed, 3 Feb 2010 14:01:32 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -2.598
X-Spam-Level:
X-Spam-Status: No, score=-2.598 tagged_above=-999 required=5 tests=[AWL=0.547, BAYES_50=0.001, HELO_MISMATCH_COM=0.553, MIME_8BIT_HEADER=0.3, RCVD_IN_DNSWL_MED=-4, STOX_REPLY_TYPE=0.001]
Received: from mail.ietf.org ([64.170.98.32]) by localhost (core3.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id DUBLy8qVwmLS for <ietfarch-usefor-archive@core3.amsl.com>; Wed, 3 Feb 2010 14:01:31 -0800 (PST)
Received: from balder-227.proper.com (Balder-227.Proper.COM [192.245.12.227]) by core3.amsl.com (Postfix) with ESMTP id 92B0F3A6B45 for <usefor-archive@ietf.org>; Wed, 3 Feb 2010 14:01:31 -0800 (PST)
Received: from balder-227.proper.com (localhost [127.0.0.1]) by balder-227.proper.com (8.14.2/8.14.2) with ESMTP id o13M0bMp067863 (version=TLSv1/SSLv3 cipher=DHE-RSA-AES256-SHA bits=256 verify=NO); Wed, 3 Feb 2010 15:00:37 -0700 (MST) (envelope-from owner-ietf-usefor@mail.imc.org)
Received: (from majordom@localhost) by balder-227.proper.com (8.14.2/8.13.5/Submit) id o13M0bsU067862; Wed, 3 Feb 2010 15:00:37 -0700 (MST) (envelope-from owner-ietf-usefor@mail.imc.org)
X-Authentication-Warning: balder-227.proper.com: majordom set sender to owner-ietf-usefor@mail.imc.org using -f
Received: from 30.mail-out.ovh.net (30.mail-out.ovh.net [213.186.62.213]) by balder-227.proper.com (8.14.2/8.14.2) with SMTP id o13M0ZZw067856 for <ietf-usefor@imc.org>; Wed, 3 Feb 2010 15:00:36 -0700 (MST) (envelope-from julien@trigofacile.com)
Received: (qmail 2983 invoked by uid 503); 3 Feb 2010 22:02:20 -0000
Received: from b6.ovh.net (HELO mail432.ha.ovh.net) (213.186.33.56) by 30.mail-out.ovh.net with SMTP; 3 Feb 2010 22:02:20 -0000
Received: from b0.ovh.net (HELO queueout) (213.186.33.50) by b0.ovh.net with SMTP; 3 Feb 2010 22:00:33 -0000
Received: from aaubervilliers-151-1-47-4.w83-114.abo.wanadoo.fr (HELO Iulius) (julien%trigofacile.com@83.114.133.4) by ns0.ovh.net with SMTP; 3 Feb 2010 22:00:32 -0000
Message-ID: <D599DB8FD691431D8E6C6E6B75B56400@Iulius>
From: Julien ÉLIE <julien@trigofacile.com>
To: Usefor WG <ietf-usefor@imc.org>
References: <Kx6CzM.12F@clerew.man.ac.uk>
In-Reply-To: <Kx6CzM.12F@clerew.man.ac.uk>
Subject: Re: Extending news to EAI
Date: Wed, 03 Feb 2010 23:00:37 +0100
Organization: TrigoFACILE -- http://www.trigofacile.com/
MIME-Version: 1.0
Content-Type: text/plain; format="flowed"; charset="iso-8859-1"; reply-type="original"
Content-Transfer-Encoding: 8bit
X-Priority: 3
X-MSMail-Priority: Normal
X-Mailer: Microsoft Windows Mail 6.0.6002.18005
X-MimeOLE: Produced By Microsoft MimeOLE V6.0.6002.18005
X-Ovh-Tracer-Id: 17437937757502176697
X-Ovh-Remote: 83.114.133.4 (aaubervilliers-151-1-47-4.w83-114.abo.wanadoo.fr)
X-Ovh-Local: 213.186.33.20 (ns0.ovh.net)
X-Spam-Check: DONE|U 0.5/N
Sender: owner-ietf-usefor@mail.imc.org
Precedence: bulk
List-Archive: <http://www.imc.org/ietf-usefor/mail-archive/>
List-Unsubscribe: <mailto:ietf-usefor-request@imc.org?body=unsubscribe>
List-ID: <ietf-usefor.imc.org>

Hi Charles,

> So the experimental protocol would start off with the extensions allowed
> by RFC5535, and then add UTF-8 in the Newsgroups header.

I think it will also require to change the possible values for "argument"
here:

   control         =  "Control:" SP *WSP control-command *WSP CRLF
   control-command =  verb *( 1*WSP argument )
   verb            =  token
   argument        =  1*( %x21-7E )

We need to be able to use UTF-8 in "argument".


Could it also be possible to extend dist-list to allow UTF-8 distributions?

   distribution    =  "Distribution:" SP dist-list CRLF
   dist-list       =  *WSP dist-name
                      *( [FWS] "," [FWS] dist-name ) *WSP
   dist-name       =  ALPHA / DIGIT
                      *( ALPHA / DIGIT / "+" / "-" / "_" )


Incidentally, may it exist hostnames in UTF-8 (for the Path: header field)
or are they always canonized in ASCII?



> It has already been established that the existing transport mechanisms
> will move such articles around without problem.

Is there any record of that?  Which news servers (and versions) were tested?



> Some early USEFOR drafts show how the Newsgroups header was to be
> extended. In particular, it required some very strict normalization, so
> that a simple byte-by-byte comparison of newsgroup-names would always
> work.

The last draft seems to be -09, Section 5.5:
    http://www.eyrie.org/~eagle/usefor/drafts/draft-ietf-usefor-article-09.txt

-- 
Julien ÉLIE

« Avez-vous remarqué qu'à table les mets que l'on vous sert vous mettent
  les mots à la bouche ? » (Raymond Devos)