[Int-dir] Intdir last call review of draft-koster-rep-08

Ralf Weber via Datatracker <noreply@ietf.org> Sun, 29 May 2022 14:21 UTC

Return-Path: <noreply@ietf.org>
X-Original-To: int-dir@ietf.org
Delivered-To: int-dir@ietfa.amsl.com
Received: from ietfa.amsl.com (localhost [IPv6:::1]) by ietfa.amsl.com (Postfix) with ESMTP id E79E7C14F729; Sun, 29 May 2022 07:21:26 -0700 (PDT)
MIME-Version: 1.0
Content-Type: text/plain; charset="utf-8"
Content-Transfer-Encoding: 7bit
From: Ralf Weber via Datatracker <noreply@ietf.org>
To: int-dir@ietf.org
Cc: draft-koster-rep.all@ietf.org, last-call@ietf.org
X-Test-IDTracker: no
X-IETF-IDTracker: 8.3.1
Auto-Submitted: auto-generated
Precedence: bulk
Message-ID: <165383408693.50938.9762995976740517128@ietfa.amsl.com>
Reply-To: Ralf Weber <rweber@akamai.com>
Date: Sun, 29 May 2022 07:21:26 -0700
Archived-At: <https://mailarchive.ietf.org/arch/msg/int-dir/_kfSG899eSFcfGA4imf8aLCzPdI>
Subject: [Int-dir] Intdir last call review of draft-koster-rep-08
X-BeenThere: int-dir@ietf.org
X-Mailman-Version: 2.1.34
List-Id: "This list is for discussion between the members of the Internet Area directorate." <int-dir.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/int-dir>, <mailto:int-dir-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/int-dir/>
List-Post: <mailto:int-dir@ietf.org>
List-Help: <mailto:int-dir-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/int-dir>, <mailto:int-dir-request@ietf.org?subject=subscribe>
X-List-Received-Date: Sun, 29 May 2022 14:21:27 -0000

Reviewer: Ralf Weber
Review result: Ready with Issues

Moin!

I am an assigned INT directorate reviewer for draft-koster-rep.
These comments were written primarily for the benefit of the Internet Area
Directors. Document editors and shepherd(s) should treat these comments just
like they would treat comments from any other IETF contributors and resolve
them along with any other Last Call comments that have been received. For more
details on the INT Directorate, see
https://datatracker.ietf.org/group/intdir/about/

While the document technically defines the content of the robots.txt files it
could do a better job in describing with examples the semantic of how robots
interpret them. Especially in 2.2.1 the notation of "Crawlers MUST find the
group that matches the product token exactly" should be better explained. I
assume it does not mean being fully equal but instead a substring match in the
User-Agent Header, so in the example would also match a http user agent of
ExampleBotnet/1.2. Is that understanding correct at least?

Also the examples in 5 seem a lot more arbirtary than what the ROBOTSTXT
website has and it should explain all the outcomes, e.g in 5.1  it would allow
access to all crawlers and all paths, but the foobot, barbot and bazbot
/example/disallowed.gif. An example with a * group would be better and more
realistic.

So long
-Ralf