Re: [apps-discuss] Fun with URLs and regex

Larry Masinter <masinter@adobe.com> Thu, 29 January 2015 00:23 UTC

Return-Path: <masinter@adobe.com>
X-Original-To: apps-discuss@ietfa.amsl.com
Delivered-To: apps-discuss@ietfa.amsl.com
Received: from localhost (ietfa.amsl.com [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 3E5F71A88D4 for <apps-discuss@ietfa.amsl.com>; Wed, 28 Jan 2015 16:23:27 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -1.902
X-Spam-Level:
X-Spam-Status: No, score=-1.902 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, RCVD_IN_DNSWL_NONE=-0.0001, SPF_HELO_PASS=-0.001, SPF_PASS=-0.001] autolearn=ham
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id omqxGf2ocJzN for <apps-discuss@ietfa.amsl.com>; Wed, 28 Jan 2015 16:23:25 -0800 (PST)
Received: from na01-by2-obe.outbound.protection.outlook.com (mail-by2on0053.outbound.protection.outlook.com [207.46.100.53]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-SHA384 (256/256 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id E5A721A88B2 for <apps-discuss@ietf.org>; Wed, 28 Jan 2015 16:23:12 -0800 (PST)
Received: from DM2PR0201MB0960.namprd02.prod.outlook.com (25.160.216.28) by DM2PR0201MB0959.namprd02.prod.outlook.com (25.160.216.27) with Microsoft SMTP Server (TLS) id 15.1.65.19; Thu, 29 Jan 2015 00:23:11 +0000
Received: from DM2PR0201MB0960.namprd02.prod.outlook.com ([25.160.216.28]) by DM2PR0201MB0960.namprd02.prod.outlook.com ([25.160.216.28]) with mapi id 15.01.0065.013; Thu, 29 Jan 2015 00:23:11 +0000
From: Larry Masinter <masinter@adobe.com>
To: Matthew Kerwin <matthew@kerwin.net.au>, "julian.reschke@gmx.de" <julian.reschke@gmx.de>
Thread-Topic: [apps-discuss] Fun with URLs and regex
Thread-Index: AQHQKsH3fKm+oZQcQU+UGfBvMHQS95y1SJCAgAELpgCAAB3uAIAectsAgAAIrACAADWBAIABBhoAgAAHRYCAAARggIAAImqAgAACm+A=
Date: Thu, 29 Jan 2015 00:23:10 +0000
Message-ID: <DM2PR0201MB0960A331A366CFAD5346648AC3300@DM2PR0201MB0960.namprd02.prod.outlook.com>
References: <C5B10293-E6F6-4348-9782-C9C00A4476CE@mnot.net> <CACweHNBVOrVMesB7HOjPNHe5FtzL1k9XDGAHUXAx5DbOSYv5jA@mail.gmail.com> <A1E5B0EC-FAD5-4178-8C7B-540BEB61DC06@mnot.net> <54AEB660.1020701@intertwingly.net> <F122ADA8-4A96-4F88-BB9F-3C5C6A544067@mnot.net> <54C84872.5040902@intertwingly.net> <EF1E36FA-6A30-4A65-9520-5A31571EE445@mnot.net> <54C95132.2060402@gmx.de> <154ABFBB-AB8C-447A-89A3-D1746EFBF1C6@gbiv.com> <54C95AF7.6030703@gmx.de> <CACweHNBHiEGUwLB3z6YoTexF=b9ApwsUy6-DVCf9vnBSD+L5Rw@mail.gmail.com>
In-Reply-To: <CACweHNBHiEGUwLB3z6YoTexF=b9ApwsUy6-DVCf9vnBSD+L5Rw@mail.gmail.com>
Accept-Language: en-US
Content-Language: en-US
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
x-originating-ip: [50.184.24.49]
authentication-results: kerwin.net.au; dkim=none (message not signed) header.d=none; kerwin.net.au; dmarc=none action=none header.from=adobe.com;
x-dmarcaction-test: None
x-microsoft-antispam: BCL:0;PCL:0;RULEID:(3005004);SRVR:DM2PR0201MB0959;
x-exchange-antispam-report-test: UriScan:;
x-exchange-antispam-report-cfa-test: BCL:0; PCL:0; RULEID:; SRVR:DM2PR0201MB0959;
x-forefront-prvs: 0471B73328
x-forefront-antispam-report: SFV:NSPM; SFS:(10009020)(6009001)(66066001)(33656002)(122556002)(2656002)(87936001)(54356999)(76576001)(2501002)(92566002)(46102003)(93886004)(77156002)(76176999)(50986999)(15975445007)(62966003)(106116001)(102836002)(1720100001)(74316001)(99286002)(54206007)(40100003)(86362001)(19580395003)(2950100001)(2900100001)(7059030); DIR:OUT; SFP:1101; SCL:1; SRVR:DM2PR0201MB0959; H:DM2PR0201MB0960.namprd02.prod.outlook.com; FPR:; SPF:None; MLV:sfv; LANG:en;
Content-Type: text/plain; charset="utf-8"
Content-Transfer-Encoding: base64
MIME-Version: 1.0
X-OriginatorOrg: adobe.com
X-MS-Exchange-CrossTenant-originalarrivaltime: 29 Jan 2015 00:23:10.8742 (UTC)
X-MS-Exchange-CrossTenant-fromentityheader: Hosted
X-MS-Exchange-CrossTenant-id: fa7b1b5a-7b34-4387-94ae-d2c178decee1
X-MS-Exchange-Transport-CrossTenantHeadersStamped: DM2PR0201MB0959
Archived-At: <http://mailarchive.ietf.org/arch/msg/apps-discuss/_K-_76xH9IzBvo9Po4sQlJT5UoU>
Cc: "Roy T. Fielding" <fielding@gbiv.com>, Mark Nottingham <mnot@mnot.net>, IETF Apps Discuss <apps-discuss@ietf.org>
Subject: Re: [apps-discuss] Fun with URLs and regex
X-BeenThere: apps-discuss@ietf.org
X-Mailman-Version: 2.1.15
Precedence: list
List-Id: General discussion of application-layer protocols <apps-discuss.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/apps-discuss>, <mailto:apps-discuss-request@ietf.org?subject=unsubscribe>
List-Archive: <http://www.ietf.org/mail-archive/web/apps-discuss/>
List-Post: <mailto:apps-discuss@ietf.org>
List-Help: <mailto:apps-discuss-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/apps-discuss>, <mailto:apps-discuss-request@ietf.org?subject=subscribe>
X-List-Received-Date: Thu, 29 Jan 2015 00:23:27 -0000

> The answer to this affects what I write in the 'file' scheme draft. I
> was advised early on to not mention fragments (which I took as
> "disallow by omission") because, while it's easy to define syntax, the
> scheme also has to define semantics, and fragment semantics are tied
> to content type, and dereferenced 'file' URIs don't have a
> well-defined content type.

This leads to the (currently abandoned) work on MIME sniffing
which attempts to well-define a content-type for 'file' URIs:

https://mimesniff.spec.whatwg.org/ 

(bug list 
https://www.w3.org/Bugs/Public/buglist.cgi?component=MIME&list_id=51221&product=WHATWG&resolution=---)

based on (the abandoned)
https://tools.ietf.org/html/draft-ietf-websec-mime-sniff-03


Note https://www.w3.org/Bugs/Public/show_bug.cgi?id=20003
"Clarify scope of MIME sniffing" from IETF tracker #15

Based on
http://trac.tools.ietf.org/wg/websec/trac/ticket/15