Re: [Tools-discuss] Author detection (Fwd: I-D Action: draft-normington-jsonpath-00.txt)

Carsten Bormann <cabo@tzi.org> Wed, 09 December 2020 05:24 UTC

Content-Type: text/plain; charset="utf-8"
Mime-Version: 1.0 (Mac OS X Mail 13.4 \(3608.120.23.2.4\))
From: Carsten Bormann <cabo@tzi.org>
In-Reply-To: <a4b7605f-71e9-c22d-dd2a-9f2c3a08cf89@levkowetz.com>
Date: Wed, 09 Dec 2020 06:24:14 +0100
Cc: tools-discuss <tools-discuss@ietf.org>, Robert Sparks <rjsparks@nostrum.com>
X-Mao-Original-Outgoing-Id: 629184254.7794321-724bb2634e43848120431aa338613d91
Content-Transfer-Encoding: quoted-printable
Message-Id: <4DEC0D9B-DFEA-4D2E-A260-3D629545DA00@tzi.org>
References: <160735048347.21240.1003511101868024192@ietfa.amsl.com> <EE7A775C-07B9-440B-9589-EC1B46B54D94@tzi.org> <dd41a64b-6be3-9b39-80f1-f61093306201@nostrum.com> <41D0AF3B-7A92-4A20-8A89-A667230FB465@tzi.org> <86c081bd-f4fa-001a-9bc2-9e69445db630@levkowetz.com> <3A35451F-6B5C-447F-A014-902265D7B241@tzi.org> <a4b7605f-71e9-c22d-dd2a-9f2c3a08cf89@levkowetz.com>
To: Henrik Levkowetz <henrik@levkowetz.com>
X-Mailer: Apple Mail (2.3608.120.23.2.4)
Archived-At: <https://mailarchive.ietf.org/arch/msg/tools-discuss/Q4vTPVcp-EYdcOWi4To6EaHY_PQ>
Subject: Re: [Tools-discuss] Author detection (Fwd: I-D Action: draft-normington-jsonpath-00.txt)


> On 2020-12-08, at 15:52, Henrik Levkowetz <henrik@levkowetz.com> wrote:
> 
> Hi Carsten,
> 
> On 2020-12-08 15:18, Carsten Bormann wrote:
>> Hi Henrik,
>> 
>>> On 2020-12-08, at 14:21, Henrik Levkowetz <henrik@levkowetz.com> wrote:
>>> 
>>> In order to use \p{L} and \p{Lu} the code also has to be changed to use
>>> the external 'regex' module.
>> 
>> Thanks, I didn’t know that.
>> I thought I knew a bit about regexes and their support in various platforms;
>> I’m seriously stunned.
>> 
>> So, would adding a mechanical substitution of
>> 
>> regex.  for  re.
>> 
>> within extract_authors (plus adding the import) to the set I gave
>> solve the problem?
> 
> Probably, yes; but one would want to run a comprehensive test round to
> be sure that there are no side effects of the change.

Indeed.  (A feature that doesn’t have tests doesn’t exist.)
Doing that (and writing some additional tests for beyond-ASCII author/org names) sounds like a great little Codesprint project.
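
For illustration, a minimal Python sketch of the difference discussed above: whether the standard `re` module accepts a Unicode property class such as \p{L}, and how the third-party `regex` module can be used in its place. The function names and the pattern here are hypothetical and are not taken from extract_authors; the fallback branch is only an approximation for when `regex` is not installed.

```python
import re

try:
    import regex  # third-party module (pip install regex); supports \p{L}, \p{Lu}
except ImportError:
    regex = None


def stdlib_re_accepts(pattern):
    """Report whether the standard-library re module can compile `pattern`."""
    try:
        re.compile(pattern)
        return True
    except re.error:
        return False


def compile_name_pattern():
    """Hypothetical pattern for one capitalised word; not the real extract_authors regex."""
    if regex is not None:
        # Uppercase letter followed by one or more letters, in any script.
        return regex.compile(r"\p{Lu}\p{L}+")
    # Assumption: approximate fallback. [^\W\d_] matches any Unicode letter
    # in stdlib re, but it cannot require an uppercase first letter.
    return re.compile(r"[^\W\d_]+")


def find_capitalised_words(text):
    """Return the words in `text` that start with an uppercase letter."""
    pattern = compile_name_pattern()
    # The isupper() filter matters only for the fallback pattern.
    return [w for w in pattern.findall(text) if w[0].isupper()]


if __name__ == "__main__":
    print("re accepts \\p{L}:", stdlib_re_accepts(r"\p{L}"))
    print(find_capitalised_words("\u00c9mile Zola and \u0141ukasz"))
```

The two branches can disagree on mixed-case words, so tests with beyond-ASCII author and organisation names would still be needed before relying on either.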

Regards, Carsten