Re: Declarative HTTP Spec Test Suite

David Benjamin <davidben@chromium.org> Tue, 28 May 2024 13:49 UTC

Return-Path: <ietf-http-wg-request+bounce-httpbisa-archive-bis2juki=ietf.org@listhub.w3.org>
X-Original-To: ietfarch-httpbisa-archive-bis2Juki@ietfa.amsl.com
Delivered-To: ietfarch-httpbisa-archive-bis2Juki@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 7F137C14F60D for <ietfarch-httpbisa-archive-bis2Juki@ietfa.amsl.com>; Tue, 28 May 2024 06:49:09 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -2.848
X-Spam-Level:
X-Spam-Status: No, score=-2.848 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIMWL_WL_HIGH=-0.001, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1, HEADER_FROM_DIFFERENT_DOMAINS=0.249, HTML_MESSAGE=0.001, MAILING_LIST_MULTI=-1, RCVD_IN_ZEN_BLOCKED_OPENDNS=0.001, SPF_PASS=-0.001, URIBL_BLOCKED=0.001, URIBL_DBL_BLOCKED_OPENDNS=0.001, URIBL_ZEN_BLOCKED_OPENDNS=0.001] autolearn=ham autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (2048-bit key) header.d=w3.org header.b="auMM84Yo"; dkim=pass (2048-bit key) header.d=w3.org header.b="YRBqmpG3"; dkim=pass (1024-bit key) header.d=chromium.org header.b="IutOA4cS"
Received: from mail.ietf.org ([50.223.129.194]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id VrcWePDc7ulr for <ietfarch-httpbisa-archive-bis2Juki@ietfa.amsl.com>; Tue, 28 May 2024 06:49:05 -0700 (PDT)
Received: from mab.w3.org (mab.w3.org [IPv6:2600:1f18:7d7a:2700:d091:4b25:8566:8113]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits) key-exchange ECDHE (P-256) server-signature RSA-PSS (2048 bits) server-digest SHA256) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id B48CEC14F699 for <httpbisa-archive-bis2Juki@ietf.org>; Tue, 28 May 2024 06:49:04 -0700 (PDT)
DKIM-Signature: v=1; a=rsa-sha256; q=dns/txt; c=relaxed/relaxed; d=w3.org; s=s1; h=Subject:Content-Type:Cc:To:Message-ID:Date:From:In-Reply-To: References:MIME-Version:Reply-To; bh=fKadCisdbDioXrjemXiO3SeKb7DT+2A3Ada1Oji+ffw=; b=auMM84Yo6R+hI1EJK/QnFttEET w1qJdNt00fkGxBjSsJgqrn777ioSWA5V2DJaTGJrWInDGXyAXdp1hAr02IJw/kpyZJTtyfu0iv15Y MqyvzmhlSlt0GtpBuJ5EQ4BkYQSxeu8Vv0sZCUHL4C3YuNNqynAh0jRPN2PemqS5a1qKScjMq0u5+ qhC3NQw1XTcr/ygCXnAZZDFPLTZe33ztByUKI4meO6Pwh6udpGBXGch9CtGNhRCzanTCosCEdA1Cb Osjbfph/pan9SWTkmBk9qqhy8YaH+35kdvL5nxRc1BklQOMXQzLAKpAKwSfeKBh0grmiE+dUQqzno oSTeMsuw==;
Received: from lists by mab.w3.org with local (Exim 4.96) (envelope-from <ietf-http-wg-request@listhub.w3.org>) id 1sBxBP-00F2ol-1j for ietf-http-wg-dist@listhub.w3.org; Tue, 28 May 2024 13:48:07 +0000
Resent-Date: Tue, 28 May 2024 13:48:07 +0000
Resent-Message-Id: <E1sBxBP-00F2ol-1j@mab.w3.org>
Received: from ip-10-0-0-144.ec2.internal ([10.0.0.144] helo=pan.w3.org) by mab.w3.org with esmtps (TLS1.3) tls TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384 (Exim 4.96) (envelope-from <davidben@google.com>) id 1sBxBM-00F2Xn-0p for ietf-http-wg@listhub.w3.internal; Tue, 28 May 2024 13:48:04 +0000
DKIM-Signature: v=1; a=rsa-sha256; q=dns/txt; c=relaxed/relaxed; d=w3.org; s=s1; h=Content-Type:Cc:To:Subject:Message-ID:Date:From:In-Reply-To: References:MIME-Version:Reply-To; bh=fKadCisdbDioXrjemXiO3SeKb7DT+2A3Ada1Oji+ffw=; t=1716904084; x=1717768084; b=YRBqmpG3qxZ5JYclfmRB3yNa0EUz0uDtumb1+qkx8p1d9JmZ9qft8Sf67EB+8ou3grvtYhG7w8j YbUVU3Y4DcR9AnopfSblCphoWFf5FkJzEkbkHeT+ULouSS/e1iHrShhRODFkrAFanyxwos7lQuvqS QHZfL7ibEPHroeS3xj94u8AxDeRBkO4EzqJySBOINQpHnQGDbfZvDsugjxjnD2hS89vz6lDX86TOG FoY0L4SXfCZDZSPofjt5bDjRa3V+AqaBFxzXrAvu46rErg3uR8rMCHcwO3uc3t532qgRxE6XYHPW4 gOpCHa4czLxSa6Ey6LhGO2XGrAh8xMMD3DVA==;
Received-SPF: pass (pan.w3.org: domain of google.com designates 2607:f8b0:4864:20::112b as permitted sender) client-ip=2607:f8b0:4864:20::112b; envelope-from=davidben@google.com; helo=mail-yw1-x112b.google.com;
Received: from mail-yw1-x112b.google.com ([2607:f8b0:4864:20::112b]) by pan.w3.org with esmtps (TLS1.3) tls TLS_ECDHE_RSA_WITH_AES_128_GCM_SHA256 (Exim 4.96) (envelope-from <davidben@google.com>) id 1sBxBL-008psw-0V for ietf-http-wg@w3.org; Tue, 28 May 2024 13:48:04 +0000
Received: by mail-yw1-x112b.google.com with SMTP id 00721157ae682-627eb38eb1bso6820107b3.1 for <ietf-http-wg@w3.org>; Tue, 28 May 2024 06:48:03 -0700 (PDT)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=chromium.org; s=google; t=1716904079; x=1717508879; darn=w3.org; h=cc:to:subject:message-id:date:from:in-reply-to:references :mime-version:from:to:cc:subject:date:message-id:reply-to; bh=fKadCisdbDioXrjemXiO3SeKb7DT+2A3Ada1Oji+ffw=; b=IutOA4cS7pPnLCkLmCWIT9cep2vn1uHxj+p337mr373S9ntVbYzqEbbP4LXrNFIYe8 08qrdqcywdJUw/DY++aMYYORCwSNEeWr+xdse+DcEMzLpGeN+Xl9hmZQ8oFPCrHet/fw aRoIhs6V6lAIeu9sN93uPvREIiq0RgaeWwj00=
X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20230601; t=1716904079; x=1717508879; h=cc:to:subject:message-id:date:from:in-reply-to:references :mime-version:x-gm-message-state:from:to:cc:subject:date:message-id :reply-to; bh=fKadCisdbDioXrjemXiO3SeKb7DT+2A3Ada1Oji+ffw=; b=pjiC8ZA6dGN1AMOpCAttrOEhZoMIbB31XyuFwlfG01cHvrCowLt6/LWOZdUHg7xPVj xTvHcU8BgLVFpjvxAiSMXeUmzs5Eu3Fa5qJxthPEuBoM2gtf+e5vnBcHubd92xDMPMAO rxAEVFyC1HGd3j6qOm0aGQXHIfnXQHTkXfG6bONarsEvfnp/f9YDoiZG7NO7rD8z454s h9n7DfiZfT/51QS7uLVjlDiGWXyHYaG5gWzrDQEs3ag3V1XShYmE4ru656CdtF/Fton3 hMSzp14S1oZHlYU6MUcyvIDBzalbbeYVUoKhbblHKG3LA3TMmq5rRpv09+WNnUhRKoCA +T8g==
X-Gm-Message-State: AOJu0YyxvKj3rz30hNBSRPOq60V2y+VW1A++GCepsXSX9tFr/0yaHXue RpCRydfV80NGUks+MAMwWy/HhwX8UY+3QIJJzTGLFVfDnqeRkPCxWnGXE7frceJ5t+ZjHohsHVn vUWzRGGvP136MQi7Yx5fg/U+VPvDnYksYvg0=
X-Google-Smtp-Source: AGHT+IFDCCr6Vnn4FaGdAt53FFQBAhpQw9eKnYFsD2NIO+qvgaBzdRv8kttlrtWJFnEC/KZcTAWcsHKNeZntInhrkCk=
X-Received: by 2002:a81:ac49:0:b0:627:e16f:ecf with SMTP id 00721157ae682-62a08dcf93amr123729587b3.30.1716904079275; Tue, 28 May 2024 06:47:59 -0700 (PDT)
MIME-Version: 1.0
References: <tPxvQGS7_bxKqrFish00m9LtTT_JQfE0RU1bthEcxYNTZR_uzbkEWKNOTg9c5azoZSYuDWt9pLCiS7Kd4nIWK8DPN1sOtcgbDGx4L7BeeU4=@caffeinatedwonders.com>
In-Reply-To: <tPxvQGS7_bxKqrFish00m9LtTT_JQfE0RU1bthEcxYNTZR_uzbkEWKNOTg9c5azoZSYuDWt9pLCiS7Kd4nIWK8DPN1sOtcgbDGx4L7BeeU4=@caffeinatedwonders.com>
From: David Benjamin <davidben@chromium.org>
Date: Tue, 28 May 2024 09:47:41 -0400
Message-ID: <CAF8qwaD7j=_9JgVS8jfDMJhJeo6cYq0QT4q-QB_TJ8W7uYCokw@mail.gmail.com>
To: Mohammed Al Sahaf <mohammed@caffeinatedwonders.com>
Cc: "ietf-http-wg@w3.org" <ietf-http-wg@w3.org>
Content-Type: multipart/alternative; boundary="000000000000c2d5dc061983e1b1"
X-W3C-Hub-DKIM-Status: validation passed: (address=davidben@google.com domain=chromium.org), signature is good
X-W3C-Hub-Spam-Status: No, score=-11.3
X-W3C-Hub-Spam-Report: BAYES_00=-1.9, DKIMWL_WL_HIGH=-0.034, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DMARC_PASS=-0.001, HEADER_FROM_DIFFERENT_DOMAINS=0.249, HTML_MESSAGE=0.001, RCVD_IN_ZEN_BLOCKED_OPENDNS=0.001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001, T_SCC_BODY_TEXT_LINE=-0.01, URIBL_DBL_BLOCKED_OPENDNS=0.001, USER_IN_DEF_SPF_WL=-7.5, W3C_AA=-1, W3C_WL=-1
X-W3C-Scan-Sig: pan.w3.org 1sBxBL-008psw-0V 931c0fd4f3d132a170781fefde52bf4b
X-Original-To: ietf-http-wg@w3.org
Subject: Re: Declarative HTTP Spec Test Suite
Archived-At: <https://www.w3.org/mid/CAF8qwaD7j=_9JgVS8jfDMJhJeo6cYq0QT4q-QB_TJ8W7uYCokw@mail.gmail.com>
Resent-From: ietf-http-wg@w3.org
X-Mailing-List: <ietf-http-wg@w3.org> archive/latest/51977
X-Loop: ietf-http-wg@w3.org
Resent-Sender: ietf-http-wg-request@w3.org
Precedence: list
List-Id: <ietf-http-wg.w3.org>
List-Help: <https://www.w3.org/email/>
List-Post: <mailto:ietf-http-wg@w3.org>
List-Unsubscribe: <mailto:ietf-http-wg-request@w3.org?subject=unsubscribe>

The results in scenario 2 sound off. Chrome shouldn't treat
ERR_CONTENT_LENGTH_MISMATCH differently between HTTP and HTTPS. I'm not
familiar with their implementation, but the Firefox results similarly don't
make sense. Indeed it's quite important to enforce this over HTTPS, for
HTTP/1.1, because that is what defends against truncation attacks. (In
principle, TLS has the close_notify alert, but close_notify is, in
practice, a fiction for HTTPS. Instead we must rely on in-protocol
termination signals. For HTTP/1.1, one of those signals is Content-Length.)
Also it's generally on HTTPS that one can be more strict, not less, because
there are fewer intermediaries to worry about.

Given some of your errors mention HTTP/2, I suspect you are comparing
apples to oranges, and your HTTPS tests are testing HTTP/2. You mention the
Go standard library, but keep in mind that Go automatically enables HTTP/2.
The Content-Length header means very different things between HTTP/1.1 and
HTTP/2. In HTTP/1.1, it is a critical part of framing and needs to be
checked at that layer. (HTTP/1.1's framing is incredibly fragile. "Text"
protocols are wonderful.) In HTTP/2, it has no impact on framing and was
historically[0] considered advisory. The spec now considers it invalid,
otherwise an h2-to-h1 intermediary will have problems. [1] discusses this.
But that's where this mess with receivers enforcing dates to. Provided it
doesn't cause you to turn around and send mis-framed HTTP/1.1, it is
more-or-less safe, if sloppy, to accept it in HTTP/2.

David

[0]
https://datatracker.ietf.org/doc/html/draft-ietf-httpbis-http2-00#section-3.2.2
[1] https://www.rfc-editor.org/rfc/rfc9110#section-8.6-11

On Mon, May 27, 2024 at 7:30 AM Mohammed Al Sahaf <
mohammed@caffeinatedwonders.com> wrote:

> Hello,
>
> This is a proposal that is triggered by some of my involvement with the Caddy
> web server <https://caddyserver.com/> project. We (Caddy team) have been
> working towards developing a declarative test suite for the Caddy server.
> The discussions (particularly a comment
> <https://github.com/caddyserver/caddy/pull/6255#issuecomment-2088632219>
> by a user) led me to believe it's best to bring up the HTTP spec compliance
> parts with the HTTP WG for better insight and to have a common well-being
> check for all members of the HTTP community.
>
> There are numerous RFCs governing HTTP and the behavior of its citizens.
> Compliance to the RFCs is only validated through interoperability or manual
> eyeing of the RFCs against the implementation. The RFCs, for good reasons,
> are walls of texts and are akin to legalese when it comes to
> interpretation. Consequently, nuanced sections are possibly missed without
> visible failures due to being an edge-case. Having a specification defined
> as a test suite in a declarative language removes much of ambiguity and
> enables validation of conformance by the HTTP citizens. I hope to turn this
> into an official proposal, but I'd like to put the draft forward for
> discussion to solidify the approach and the scope first.
>
> *Motivation*:
>
> Conformance is an assurance of compatibility across the various components
> of the web and gives confidence of breakage if any of them were to change
> behavior or if the HTTP semantics were to change. Conformity can also
> assist in optimization efforts. If the behavior is known for sure in
> advance, certain optimizations can be applied.
>
> Secondly, it unifies the expectations of the community. Let's take for
> example the HTTP semantics of the Content-Length header as defined in RFC
> 9110 <https://www.rfc-editor.org/rfc/rfc9110.html#name-content-length>.
> The RFC states when servers and user-agents SHOULD, SHOULD NOT, MAY, MAY
> NOT, AND MUST NOT send the Content-Length header, but it does not specify
> how should either of them (server and user-agent) handle cases of mismatch
> between content-length header value and actual content length of the
> payload. I have run an unscientific poll on Twitter about the assumed ideal
> behavior of a client if the Content-Length value does not match the
> actual content-length of the body.
>
> First poll <https://twitter.com/MohammedSahaf/status/1792267681032253683>:
> What's the ideal HTTP client (e.g. curl, browser) behavior when the server
> includes more bytes in response body than stated in the content-length
> header? e.g. "Content-Length: 2", actual body length: 3.
>
> *Responses* (19 responses):
>
>    - Ignore header; read fully (4 votes; 21.1%)
>
>
>    - Read till content-length value (6 votes; 31.6%)
>    -
> *Abort/reject (9 votes; 47.4%) *
>
>
> *Reality*:
> When testing this scenario, I found the following:
>
>
>    - curl aborts the connection, reporting "(18) transfer closed with 1
>    bytes remaining to read" for *plaintext HTTP* connection, and "(92)
>    HTTP/2 stream 1 was not closed cleanly: PROTOCOL_ERROR (err 1)" for
>    *HTTPS* connections.
>
>
>
>    - Firefox fails the transfer on *plaintext HTTP* with
>    "NS_ERROR_NET_PARTIAL_TRANSFER"; but with *HTTPS* connection, it only
>    reads and displays payload per the number of bytes stated in the header
>    value.
>
>
>
>    - Chrome fails the transfer on *plaintext HTTP* with
>    "(failed)net::ERR_CONTENT_LENGTH_MISMATCH"; but with *HTTPS*
>    connection, it ignores the header value and displays the full payload.
>
>
> Second Poll <https://twitter.com/MohammedSahaf/status/1792267687411831063>:
> What's the ideal HTTP client (e.g. curl, browser) behavior when the server
> includes fewer bytes in response body than stated in the content-length
> header? e.g. "Content-Length: 5", actual body length: 3
>
> *Responses* (18 responses):
>
>    - Ignore header; read 3 (7 votes; 38.9%)
>
>
>    - Pad; with what? (0 votes; 0%)
>
>
>    - *Reject/abort (9 votes; 50%)*
>
>
>    - Other; comment (2 votes; 11.1%, none of them elaborated)
>
>
> *Reality*:
> When testing this scenario, I found the following:
>
>    - curl aborts the connection, reporting "(18) transfer closed with 1
>    bytes remaining to read" for *plaintext HTTP* connection; *for HTTPS*,
>    it prints the payload in full preceded by the message "(92) HTTP/2
>    stream 1 was not closed cleanly: PROTOCOL_ERROR (err 1)".
>
>
>
>    - Firefox displays the full payload *for HTTPS* connections without
>    reporting any errors. For *plaintext HTTP,* it displays the full
>    payload but reports an error "NS_ERROR_NET_PARTIAL_TRANSFER"
>
>
>
>    - Chrome displays the full payload *for HTTPS* connections without
>    reporting any errors. For *plaintext HTTP,* it fails to load the
>    content and reports the error
>    "(failed)net::ERR_CONTENT_LENGTH_MISMATCH".
>
>
> Third Poll <https://twitter.com/MohammedSahaf/status/1792267692700827955>:
> Assuming it's possible... What's the ideal HTTP client (e.g. curl,
> browser) behavior when the server includes negative value in the
> content-length header? e.g. "Content-Length: -2"
>
> *Responses* (21 responses):
>
>    - Ignore value (7 votes; 33.3%)
>
>
>    -
> *Reject/abort (14 votes; 66.7%) *
>
>
> *Reality*:
> I couldn't in effect test the scenario. I'm using Caddy for all the
> scenarios, and the Go standard library doesn't set the Content-Length
> header if it's set to a value of less than 0 (source
> <https://github.com/golang/go/blob/377646589d5fb0224014683e0d1f1db35e60c3ac/src/net/http/server.go#L1201>
> )
>
> Conclusion: The variation observed in the user agents and the, albeit
> unscientific, poll responses show a lack of consensus on the expected
> behavior. Each agent (human or machine) apply their own interpretations and
> assumptions to the protocol. The disagreement makes the evolution of the
> protocol difficult because of the varying expectations.
>
> *Method*:
>
> The test suite should be defined in declarative format that can be easily
> composed by humans and read by machines. The declarative,
> programming-language-agnostic format allows developers (RFC developers and
> software developers) of all backgrounds to contribute without a
> programming-language-based gate. In my research, I have found the
> open-source tool hurl <https://hurl.dev/> to be a suitable tool for
> defining HTTP server specification and to test it. It defines its own DSL
> for the request/response patterns and may be run with --test flag along
> with --report-{json,html,tap} to produce test results.
>
> The testing effort may be implemented in phases. The first phase is to
> author the test suite in a common, public repository. This makes it
> accessible for web server developers to clone it and run the test suite
> against their own software. The second phase is to provide an interface for
> automated testing and a UI to display the conformance summary of each web
> server submitted to the list.
>
> *Challenges*:
>
> *Agnostic Tooling*: The first challenge is to find an HTTP client that
> implements HTTP Semantics RFCs perfectly, otherwise its own idiosyncrasies
> will get in the way of the validation. One would be tempted to default to
> curl, especially that hurl uses curl under the hood. However, there is a
> chance curl may have its own set of HTTP idiosyncrasies that may affect
> the results of the test execution. Changes to curl are probably not
> desired unless the subject behavior is confirmed to be a defect.
> Involvement of curl is voluntary to curl, and the team may be looped and
> involved into this initiative for comments if desired.
>
> *Suitable DSL (Domain Specific Language)*: The hurl DSL is decent. In my
> experiment for a proof-of-concept, I found it lacking a few functions or
> operations to be perfectly suitable, e.g. indicating optionality. hurl
> has the advantage of its DSL  grammar being defined as a spec with
> deterministic parsing. Extensions and/or changes to hurl to accommodate
> this effort is up to the hurl developers.
>
> *Prior Art*:
>
> There's a wiki page title HTTP Testing Resources
> <https://github.com/httpwg/wiki/wiki/HTTP-Testing-Resources> under the
> github.com/httpwg/wiki repository. The page contains the following note:
>
>    -
>
>    *Note that there is no official conformance test suite. These tests
>    have not been vetted for correctness by the HTTP Working Group, and the
>    authority for conformance is always the relevant specification.*
>
> Indeed some of the listed projects have not been updated for a while.
> Worthy of note on the page is the cache-tests.fyi project, REDBot, and
> httplint (which REDBot uses).
>
> The REDBot project (by Mark Nottingham) was used by one of Caddy users to
> report a gap <https://github.com/caddyserver/caddy/issues/5849> in the
> conformance, which was subsequently fixed. Using REDBot requires pointing
> at a particular resource.
>
> The cache-tests.fyi is of keen interest for some inspiration of design.
> The test suite is close in essence to this proposal, which is a declarative
> suite that can be run by cache system developers to validate their
> conformance to the cache related RFCs. The Souin
> <https://github.com/darkweak/souin> caching system runs the
> cache-tests.fyi test suite on every pull request to note its conformance
> level and to watch for variations. The display of each system, the use
> case, and its conformance status allows the users and the developers to
> take appropriate actions. The website and the UI can be a phase-2
> (long-term goal) of this proposal, but the details of how to set up the
> system and run the tests can be postponed until more information is known
> about the test suite itself.
>
> *Humble Attempt*:
>
> To test the idea and develop a proof-of-concept, I have managed to convert
> 4 tests from REDBot (procedural, Python-based) to Hurl (declarative
> format). The test suite contains a collection of test sets in nested
> directory structure. The suite declares the URL it will call for each test
> case so web servers can be configured accordingly for the subject URL. The
> GitHub repository for the PoC is here: github.com/mohammed90/
> http-semantics-test-suite. The repository currently does not have a
> license applied as it's only for display of PoC, though I am inclined to
> apply Apache-2 or any other open-source-compatible license once the
> approach is agreed and finalized.
>
> All the best,
> Mohammed
> Blog <https://www.caffeinatedwonders.com/> | LinkedIn
> <https://www.linkedin.com/in/mohammedalsahaf/>
>