Re: Last Call: <draft-nottingham-safe-hint-05.txt> (The "safe" HTTP Preference) to Proposed Standard

Yoav Nir <ynir.ietf@gmail.com> Fri, 24 October 2014 19:49 UTC

Return-Path: <ynir.ietf@gmail.com>
X-Original-To: ietf@ietfa.amsl.com
Delivered-To: ietf@ietfa.amsl.com
Received: from localhost (ietfa.amsl.com [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id B89A71A1AF8 for <ietf@ietfa.amsl.com>; Fri, 24 Oct 2014 12:49:16 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: 0.7
X-Spam-Level:
X-Spam-Status: No, score=0.7 tagged_above=-999 required=5 tests=[BAYES_50=0.8, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, FREEMAIL_FROM=0.001, SPF_PASS=-0.001] autolearn=ham
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id utB95dbbWNDO for <ietf@ietfa.amsl.com>; Fri, 24 Oct 2014 12:49:14 -0700 (PDT)
Received: from mail-wg0-x22d.google.com (mail-wg0-x22d.google.com [IPv6:2a00:1450:400c:c00::22d]) (using TLSv1 with cipher ECDHE-RSA-RC4-SHA (128/128 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id BF1CD1A03A2 for <ietf@ietf.org>; Fri, 24 Oct 2014 12:49:13 -0700 (PDT)
Received: by mail-wg0-f45.google.com with SMTP id l18so1789422wgh.4 for <ietf@ietf.org>; Fri, 24 Oct 2014 12:49:11 -0700 (PDT)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=content-type:mime-version:subject:from:in-reply-to:date :content-transfer-encoding:message-id:references:to; bh=U59bgg0WdbztcawprFXQZktEJol+Qru+VH9W+r6MfVs=; b=w8X/UPpq7OiP6gNaoZUtVaJqaZR5X7Oe4caQ82Fbhaf02cgg8/G58OpmgI6fAOScBQ j4pEBZxhwVk10hhVBgkNpp3SUCOnVxuCJY1TrcZ4mNVBMN+yGA7T85alwC8Se7h35Yz0 tN0fijeV0GP1UCEcui0yfOKl9XfbcWU+kPN4Wa2L6vvc6qjprt3Ab8LfLy2Bcpz1ufSY CUI0IwhGes+BduzfvJ3izs224ShOYRFUNF34j+Co9xOlmXl8e2IgtJXEhc1JivRn/lMR 81LCzT+JplyV1FoS9Jju4DsRe8pFfPZnjwx7TryaE95pLmz6p89ePv9diyhIqNisI1TN JYIQ==
X-Received: by 10.180.12.195 with SMTP id a3mr6121355wic.73.1414180151421; Fri, 24 Oct 2014 12:49:11 -0700 (PDT)
Received: from [192.168.1.102] (IGLD-84-228-87-161.inter.net.il. [84.228.87.161]) by mx.google.com with ESMTPSA id mc4sm3047914wic.6.2014.10.24.12.49.10 for <ietf@ietf.org> (version=TLSv1 cipher=ECDHE-RSA-RC4-SHA bits=128/128); Fri, 24 Oct 2014 12:49:10 -0700 (PDT)
Content-Type: text/plain; charset="utf-8"
Mime-Version: 1.0 (Mac OS X Mail 8.0 \(1990.1\))
Subject: Re: Last Call: <draft-nottingham-safe-hint-05.txt> (The "safe" HTTP Preference) to Proposed Standard
From: Yoav Nir <ynir.ietf@gmail.com>
In-Reply-To: <20141023140635.10188.qmail@ary.lan>
Date: Fri, 24 Oct 2014 22:49:08 +0300
Content-Transfer-Encoding: quoted-printable
Message-Id: <87A3AD2B-5747-46EB-A165-50D35A29DBA7@gmail.com>
References: <20141023140635.10188.qmail@ary.lan>
To: ietf <ietf@ietf.org>
X-Mailer: Apple Mail (2.1990.1)
Archived-At: http://mailarchive.ietf.org/arch/msg/ietf/oEp3ASBk3B6DZslg07sjAlIqXkc
X-BeenThere: ietf@ietf.org
X-Mailman-Version: 2.1.15
Precedence: list
List-Id: IETF-Discussion <ietf.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/ietf>, <mailto:ietf-request@ietf.org?subject=unsubscribe>
List-Archive: <http://www.ietf.org/mail-archive/web/ietf/>
List-Post: <mailto:ietf@ietf.org>
List-Help: <mailto:ietf-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/ietf>, <mailto:ietf-request@ietf.org?subject=subscribe>
X-List-Received-Date: Fri, 24 Oct 2014 19:49:16 -0000

Hi

I have read the draft, and I do not support its publication. What worries me about it is not speculation about extensions that we’ll be asked to do or mis-use by content providers. It is that there is no way to use it properly. The drafts does not specify what “safe” content is and what “unsafe” content it, and some people treat this as an advantage. The result is that there’s no way for a content provider to know what a user means when their browser emits the “safe” hint, and no way for the user to know what kind of content they are going to get.

Stephen Farrell has made some interesting points about what “safe” might mean in other cultures. I think the failure is much closer to home, so let’s assume for the sake of argument that everyone affected is mainstream American (although neither Stephen nor I are Americans). So obviously anyone would consider porn to be “unsafe”, because we don’t want the children to see it and we don’t want it at work. A rabbit teaching the alphabet to kids OTOH is “safe” ([1]). But those are the easy types of content. What about political content? What about political content that is non-mainstream? Even inflammatory? Is it safe? For whom?  A signal is useless if there is no agreed-upon semantic to it. Yet the draft punts on attaching such a semantic. 

Section 3 mentions YouTube. That is actually a perfect example of what I mean. Sites like YouTube, deviantArt, Flickr, and Wattpad, even Wikipedia provide user-generated content.How are they to decide what is and isn’t “safe”? They have several choices:
 - They can ask the contributors to mark their content. Many of them do that. When you upload something, they require you to mark it as “mature”, “strictly mature” or not, and you even get to pick one of several categories of mature: "nudity", "sexual themes", "violence/gore", "strong language", and “ideologically sensitive”.([2])  They can use those indications in filtering content for people who prefer their content “safe”. But what happens when a user complains that the content they got was non-safe even thought it doesn’t fall into those categories?  (although “ideologically sensitive”?  Anything can fall into that!) 
 - They can assume everything is safe until someone complains. That makes sense. Everything that someone objects to is by definition objectionable. Pretty soon, only the alphabet-teaching rabbit is considered safe, and people have to turn off the hint to get anything done. 
 - They can intelligently (through a combination of heuristics by computers and human intervention) actually judge all content to figure out some rational definition of safe. There’s two problems with that. First, it’s hugely expensive. Wikipedia would die if it needed to do that, and I doubt even Google can afford to have people watch all of Youtube’s videos to rate them.. Second, whatever definition of “safe” they came up with, the users may not agree with them. 

IMO this does more harm than good, and I think we should not publish it.

Yoav

[1] yes, there are some who view talking animals as a violation of the second commandment. I did say “mainstream” Americans.
[2] that is also a great argument against the claim that content providers want just one bit. I think it is the browser vendors who aren’t willing to emit more than one bit.