Re: ietf-nntp Backfill

USENET news manager <newsmaster@ucs.cam.ac.uk> Tue, 17 December 1996 09:27 UTC

Received: from cnri by ietf.org id aa22048; 17 Dec 96 4:27 EST
Received: from ACADEM2.ACADEM.COM by CNRI.Reston.VA.US id aa05754; 17 Dec 96 4:27 EST
Received: (from majordomo@localhost) by academ2.academ.com (8.8.3/8.7.3) id DAA03772 for ietf-nntp-outgoing; Tue, 17 Dec 1996 03:25:28 -0600 (CST)
X-Authentication-Warning: academ2.academ.com: majordomo set sender to owner-ietf-nntp using -f
Received: from academ.com (root@ACADEM.COM [198.137.249.2]) by academ2.academ.com (8.8.3/8.7.3) with ESMTP id DAA03767 for <ietf-nntp@ACADEM2.ACADEM.COM>; Tue, 17 Dec 1996 03:25:25 -0600 (CST)
Received: from lyra.csx.cam.ac.uk (news@lyra.csx.cam.ac.uk [131.111.8.36]) by academ.com (8.8.3/8.7.1) with SMTP id DAA13296 for <ietf-nntp@academ.com>; Tue, 17 Dec 1996 03:25:23 -0600 (CST)
Received: by lyra.csx.cam.ac.uk (SMI-8.6/MDTG-V1.1.8@lyra.csx.cam.ac.uk) id JAA16350; Tue, 17 Dec 1996 09:25:16 GMT
Message-Id: <199612170925.JAA16350@lyra.csx.cam.ac.uk>
Subject: Re: ietf-nntp Backfill
To: clive@demon.net
Date: Tue, 17 Dec 1996 09:25:15 +0000 (GMT)
Cc: ietf-nntp@academ.com
In-Reply-To: <wriZDoApG0syEwVJ@on-the-train.demon.co.uk> from "Clive D.W. Feather" at Dec 15, 96 00:09:13 am
From: USENET news manager <newsmaster@ucs.cam.ac.uk>
X-Mailer: ELM [version 2.4 PL22]
Content-Type: text
Sender: owner-ietf-nntp@academ.com
Precedence: bulk

>Jon Ribbens <jon@oaktree.co.uk> writes
>>I am looking at INN 1.4 for this reply.
>
>>> [Q2] When will new articles become available and expired/cancelled ones
>>> become unavailable ? Is it:
>>New ones will become available and deleted ones will become unavailable
>>immediately. However 'NEXT' will not attempt to show deleted articles,
>>and it will not attempt to show newly-arrived articles. This seems
>>reasonable to me.
>
>So you are saying that NEXT from article 1200 might give article 1234,
>even though there is an article 1212 ?
>
>Hmm. I would never have thought of that as being a behaviour permitted
>by RFC 977. But I can see where it comes from.

Read it in the context of the following point! An article 1212 should not 
appear "from nowhere" if the high article number is already greater (e.g. 
1234).

>>> [Q5] Can an article appear with a number between the previous limits,
>>> such as 1250 ?
>>No.

Given that under normal circumstances (see below for the one exception I've 
encountered), article numbers within a group are monotonically increasing,
there should never be a newly-arrived article within the min-max bounds
determined earlier. NEXT should simply find successive articles that have
not expired or been cancelled (and may have large gaps to skip if there are
long-lived articles due to Expires: headers).

On a related point which I saw queried in an earlier message - if the highest 
numbered article is cancelled (or a number of articles at the end of the 
range), INN's implementation of the GROUP command will return the actual
low/high article numbers to the client *but* the high article number in the
active file (also available to the client, of course) will never decrease
and newly-arrived articles will be assigned numbers continuing on
from the highest used so far. The high numbers will not be reused simply
because articles have been cancelled, though they may not be mentioned as 
part of the currently-available range.

>Someone said at the BOF that this *does* happen with INN.

It could happen, but only in limited situations which would count as misuse
of the tools rather than working as intended.

I'm thinking of a situation such as an INN slave-mode feed (i.e. where the 
sending server specifies the article numbers to be used, to yield an 
identically-numbered spool on the slave) where for some reason batches of
articles were not sent across to the slave server in arrival order. 

I've tripped over that when using slave mode to copy over articles from
long-expiry groups while setting up a new server, with batch files
constructed by non-standard means (the old server was running C-news :-),
*but* the server didn't go live until later when it was running "normally"
with articles arriving in order (albeit still in slave mode, for a 
transitional period), and I considered it a trap to avoid during live
running, not something which client software ought to handle. Slave mode
would normally send articles in arrival order and hence, as usual, the
article numbers within a group would be monotonically increasing even though 
specified by the sending server.

[The problem that I actually encountered, and which drew my attention to the
issue, was that INN's overview data entries were written in article arrival
order even though that did not correspond to article number order within the
group because the sending server specified the numbering based on its spool.
In consequence, INN ignored the overview data and performance was poor
until I deleted and recreated the overview files.]

                                John Line
-- 
Cambridge University Computing Service - USENET news manager. Usually John Line
newsmaster@ucs.cam.ac.uk    (alias {newsmaster,news,usenet}@news.cam.ac.uk)