Re: Extending news to EAI

"Charles Lindsey" <chl@clerew.man.ac.uk> Fri, 05 February 2010 17:13 UTC

Return-Path: <owner-ietf-usefor@mail.imc.org>
X-Original-To: ietfarch-usefor-archive@core3.amsl.com
Delivered-To: ietfarch-usefor-archive@core3.amsl.com
Received: from localhost (localhost [127.0.0.1]) by core3.amsl.com (Postfix) with ESMTP id 77FB73A6943 for <ietfarch-usefor-archive@core3.amsl.com>; Fri, 5 Feb 2010 09:13:26 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -5.475
X-Spam-Level:
X-Spam-Status: No, score=-5.475 tagged_above=-999 required=5 tests=[AWL=0.203, BAYES_00=-2.599, HELO_MISMATCH_COM=0.553, RCVD_IN_DNSWL_MED=-4, URI_HEX=0.368]
Received: from mail.ietf.org ([64.170.98.32]) by localhost (core3.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id P-H2iSiiD5yZ for <ietfarch-usefor-archive@core3.amsl.com>; Fri, 5 Feb 2010 09:13:24 -0800 (PST)
Received: from balder-227.proper.com (Balder-227.Proper.COM [192.245.12.227]) by core3.amsl.com (Postfix) with ESMTP id DC2003A6896 for <usefor-archive@ietf.org>; Fri, 5 Feb 2010 09:13:23 -0800 (PST)
Received: from balder-227.proper.com (localhost [127.0.0.1]) by balder-227.proper.com (8.14.2/8.14.2) with ESMTP id o15HC8DD031481 (version=TLSv1/SSLv3 cipher=DHE-RSA-AES256-SHA bits=256 verify=NO); Fri, 5 Feb 2010 10:12:08 -0700 (MST) (envelope-from owner-ietf-usefor@mail.imc.org)
Received: (from majordom@localhost) by balder-227.proper.com (8.14.2/8.13.5/Submit) id o15HC88D031480; Fri, 5 Feb 2010 10:12:08 -0700 (MST) (envelope-from owner-ietf-usefor@mail.imc.org)
X-Authentication-Warning: balder-227.proper.com: majordom set sender to owner-ietf-usefor@mail.imc.org using -f
Received: from v-smtp-auth-relay-3.gradwell.net (v-smtp-auth-relay-3.gradwell.net [79.135.125.42]) by balder-227.proper.com (8.14.2/8.14.2) with ESMTP id o15HC6Zg031472 for <ietf-usefor@imc.org>; Fri, 5 Feb 2010 10:12:07 -0700 (MST) (envelope-from news@clerew.man.ac.uk)
Received: from [80.175.135.89] ([80.175.135.89] helo=clerew.man.ac.uk country=GB ident=postmaster^pop3^clerew&man^ac*uk) by v-smtp-auth-relay-3.gradwell.net with esmtpa (Gradwell gwh-smtpd 1.290) id 4b6c5165.771.25e for ietf-usefor@imc.org; Fri, 5 Feb 2010 17:12:05 +0000 (envelope-sender <news@clerew.man.ac.uk>)
Received: from clerew.man.ac.uk (localhost [127.0.0.1]) by clerew.man.ac.uk (8.13.7/8.13.7) with ESMTP id o15HC1ji004053 for <ietf-usefor@imc.org>; Fri, 5 Feb 2010 17:12:01 GMT
Received: (from news@localhost) by clerew.man.ac.uk (8.13.7/8.13.7/Submit) id o15HC1NR004050 for ietf-usefor@imc.org; Fri, 5 Feb 2010 17:12:01 GMT
To: ietf-usefor@imc.org
Xref: clerew local.usefor:25239
Path: clerew!chl
From: Charles Lindsey <chl@clerew.man.ac.uk>
Subject: Re: Extending news to EAI
Content-Type: text/plain; charset="iso-8859-1"
Message-ID: <KxDIqE.Jnx@clerew.man.ac.uk>
Content-Transfer-Encoding: 8bit
X-Newsreader: NN version 6.5.2 (NOV)
References: <Kx6CzM.12F@clerew.man.ac.uk> <D599DB8FD691431D8E6C6E6B75B56400@Iulius>
Mime-Version: 1.0
Date: Fri, 05 Feb 2010 14:54:14 +0000
Lines: 77
Sender: owner-ietf-usefor@mail.imc.org
Precedence: bulk
List-Archive: <http://www.imc.org/ietf-usefor/mail-archive/>
List-Unsubscribe: <mailto:ietf-usefor-request@imc.org?body=unsubscribe>
List-ID: <ietf-usefor.imc.org>

In <D599DB8FD691431D8E6C6E6B75B56400@Iulius> Julien ÉLIE <julien@trigofacile.com> writes:

>Hi Charles,

>> So the experimental protocol would start off with the extensions allowed
>> by RFC5535, and then add UTF-8 in the Newsgroups header.

>I think it will also require to change the possible values for "argument"
>here:

>   control         =  "Control:" SP *WSP control-command *WSP CRLF
>   control-command =  verb *( 1*WSP argument )
>   verb            =  token
>   argument        =  1*( %x21-7E )

>We need to be able to use UTF-8 in "argument".


>Could it also be possible to extend dist-list to allow UTF-8 distributions?

>   distribution    =  "Distribution:" SP dist-list CRLF
>   dist-list       =  *WSP dist-name
>                      *( [FWS] "," [FWS] dist-name ) *WSP
>   dist-name       =  ALPHA / DIGIT
>                      *( ALPHA / DIGIT / "+" / "-" / "_" )

Yes, there are various minor things that would need to be extended apart
from the Newgroups header (including the newsgroups line and the
checkgroups message).

>Incidentally, may it exist hostnames in UTF-8 (for the Path: header field)
>or are they always canonized in ASCII?

That is an interesting question, which would need dicussion, but does not
look like a show stopper.

>> It has already been established that the existing transport mechanisms
>> will move such articles around without problem.

>Is there any record of that?  Which news servers (and versions) were tested?

It should be documented in the early archives of this list.

Here is a typical Path for an article that reached news.dotsrc.org:

Path:
  news.sunsite.dk!dotsrc.org!news.szn.dk!news.bunk.cc!pnx.dk!
  newsfeed101.telia.com!nf02.dk.telia.net!news.tele.dk!
  feed118.news.tele.dk!not-for-mail

It is really a question of whether a server has it in its active file. If
so, then there is no reason why it should not pass straight through it.
The flooding algorithm will, as usual, find a way around any servers that
don't participate, but is will simply be a configuration affair rather
than any failure of implementation.

The real difficulty will be in user agents. I can read the danish group in
Opera, but for some reason it doesn't like yours. Ensuring there is a
working Path between your server and news.dotsrc.org might be worth a try.

>> Some early USEFOR drafts show how the Newsgroups header was to be
>> extended. In particular, it required some very strict normalization, so
>> that a si...

>The last draft seems to be -09, Section 5.5:
>    http://www.eyrie.org/~eagle/usefor/drafts/draft-ietf-usefor-article-09.txt

-- 
Charles H. Lindsey ---------At Home, doing my own thing------------------------
Tel: +44 161 436 6131            Web: http://www.cs.man.ac.uk/~chl
Email: chl@clerew.man.ac.uk      Snail: 5 Clerewood Ave, CHEADLE, SK8 3JU, U.K.
PGP: 2C15F1A9      Fingerprint: 73 6D C2 51 93 A0 01 E7 65 E8 64 7E 14 A4 AB A5