Re: [dmarc-ietf] DMARC aggregate reports XML Schema inconsistencies

Alessandro Vesely <vesely@tana.it> Tue, 06 August 2019 17:36 UTC

Return-Path: <vesely@tana.it>
X-Original-To: dmarc@ietfa.amsl.com
Delivered-To: dmarc@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id E46401205DB for <dmarc@ietfa.amsl.com>; Tue, 6 Aug 2019 10:36:47 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -4.301
X-Spam-Level:
X-Spam-Status: No, score=-4.301 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, RCVD_IN_DNSWL_MED=-2.3, SPF_PASS=-0.001] autolearn=ham autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (1152-bit key) header.d=tana.it
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id Gsp9hQ4DrLcu for <dmarc@ietfa.amsl.com>; Tue, 6 Aug 2019 10:36:45 -0700 (PDT)
Received: from wmail.tana.it (wmail.tana.it [62.94.243.226]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-SHA (256/256 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 9710812028B for <dmarc@ietf.org>; Tue, 6 Aug 2019 10:36:45 -0700 (PDT)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=tana.it; s=delta; t=1565113003; bh=ncb9TIAOFZ88zpKWA9/a9KBO7M0rbfnTgUywIpNCkV4=; l=1571; h=To:References:From:Date:In-Reply-To; b=BpWBXYwFx+uzs/g0muhhLtqJzhA9kxYzCDIH7FLFhLBGHZXYeaJpqYho+LewQAZjd b9BDhCkUvo3xmNjudBWeHMoLY9Fm3tUEf1m1v1bIl9ei+7KJBwY1M+rVNjxfDNyW0O WQrZ9W19UKIjZQEvSJgTKaRgZD4ddYgT71Gy4BGbBDvQa+AqbMgJ24nH4YO9g
Authentication-Results: tana.it; auth=pass (details omitted)
Received: from [172.25.197.111] (pcale.tana [172.25.197.111]) (AUTH: CRAM-MD5 uXDGrn@SYT0/k) by wmail.tana.it with ESMTPA id 00000000005DC042.000000005D49BAAB.0000283F; Tue, 06 Aug 2019 19:36:43 +0200
To: dmarc@ietf.org
References: <008401d54784$f8300750$e89015f0$@leemankuiper.nl>
From: Alessandro Vesely <vesely@tana.it>
Openpgp: id=0A5B4BB141A53F7F55FC8CBCB6ACF44490D17C00
Message-ID: <e1fa3716-39b0-4de5-99df-10ed6fd91724@tana.it>
Date: Tue, 6 Aug 2019 19:36:43 +0200
User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:60.0) Gecko/20100101 Thunderbird/60.8.0
MIME-Version: 1.0
In-Reply-To: <008401d54784$f8300750$e89015f0$@leemankuiper.nl>
Content-Type: text/plain; charset=us-ascii
Content-Language: en-US
Content-Transfer-Encoding: 7bit
Archived-At: <https://mailarchive.ietf.org/arch/msg/dmarc/8kD52Ie8yWxxaaEsXPOG2as_caQ>
Subject: Re: [dmarc-ietf] DMARC aggregate reports XML Schema inconsistencies
X-BeenThere: dmarc@ietf.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: "Domain-based Message Authentication, Reporting, and Compliance \(DMARC\)" <dmarc.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/dmarc>, <mailto:dmarc-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/dmarc/>
List-Post: <mailto:dmarc@ietf.org>
List-Help: <mailto:dmarc-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/dmarc>, <mailto:dmarc-request@ietf.org?subject=subscribe>
X-List-Received-Date: Tue, 06 Aug 2019 17:36:48 -0000

On Wed 31/Jul/2019 11:47:29 +0200 Freddie Leeman wrote:
> [...]
> 
> DMARC reporting capabilities are a valuable aspect of the DMARC mechanism. It
> can help domain owners in setting up and hardening their DKIM/SPF/DMARC policy.
> But unless these reports follow strict guidelines they just pile up to a lot of
> inconsistent data open to interpretation and guesswork. Domain owners should be
> able to understand the data without the need for a spiritual voodoo DMARC guru
> (trademark pending) to make sense of it all.


I had tried and programmed carefully, but never formally checked what I was
sending.  Too bad.  Now that I did, I see my reports miss the <pct> and <fo>[*]
elements, and some other nuisance.

However, the most striking difference is that, after some tinkering, to be able
to formally validate a report, it has to be rewritten like so:

    <?xml version="1.0" encoding="UTF-8"?>
    <dmarc:feedback xmlns:xs="http://www.w3.org/2001/XMLSchema-instance"
        xmlns:dmarc="http://dmarc.org/dmarc-xml/0.1"
        xs:schemaLocation="http://dmarc.org/dmarc-xml/0.1 rua.xsd">
        <report_metadata>
            <org_name>example.com</org_name>
            <email>postmaster@example.com</email>;
            [...]

Is that correct?  Is that how reports should be written?  I ask because
checking some aggregate report I received, I found no mention of namespaces and
schema locations.  XSLT works well even without those.  Validation doesn't.

What do you reckon?


Best
Ale

-- 
[*] <fo> is present in Appendix C of the spec, but not in
https://dmarc.org/dmarc-xml/0.1/rua.xsd