Re: [rtcweb] Script to extract the SDP from draft-ietf-rtcweb-sdp-08

Harald Alvestrand <harald@alvestrand.no> Wed, 15 November 2017 23:36 UTC

To: Nils Ohlmeier <nohlmeier@mozilla.com>, rtcweb@ietf.org
Cc: draft-ietf-rtcweb-sdp@ietf.org
References: <5F4CA14A-A894-4EA4-B402-AB1B655E26CA@mozilla.com>
From: Harald Alvestrand <harald@alvestrand.no>
Message-ID: <24a418cd-5f57-f560-f582-80c7af0500ad@alvestrand.no>
Date: Thu, 16 Nov 2017 00:35:41 +0100
Archived-At: <https://mailarchive.ietf.org/arch/msg/rtcweb/q1agjzTGJzfStPPcDhVI0c8IE0Y>
Subject: Re: [rtcweb] Script to extract the SDP from draft-ietf-rtcweb-sdp-08

On 11/15/2017 11:26 AM, Nils Ohlmeier wrote:
> Hello,
>
> I quickly created a Python script to extract the SDP from
> draft-ietf-rtcweb-sdp-08.
> You can find it here https://github.com/nils-ohlmeier/sdpextractor
>
> But I realized that the text version of the draft actually contains
> spaces in places where they don’t belong.
> I believe these are caused by line breaks in the XML source of the draft.
>
> Best regards
>   Nils Ohlmeier
>
Would it be easier to extract from the XML than from the text?
At least we would only get spurious stuff that was present in the XML,
not spurious stuff that was added during the XML-to-text conversion.
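
A minimal sketch of what extracting from the XML might look like, using only
the Python standard library. It rests on assumptions that have not been checked
against the draft's actual source: that the SDP sits inside <artwork> elements
(the tag name is a parameter, so another element can be tried), and that the
file parses as plain XML without external entity definitions. The function
name extract_sdp_lines is hypothetical and is not taken from the sdpextractor
repository.

```python
import re
import xml.etree.ElementTree as ET

# An SDP line is one lowercase letter, "=", then a non-empty value.
SDP_LINE = re.compile(r"^[a-z]=\S.*$")


def extract_sdp_lines(xml_text, tag="artwork"):
    """Return the SDP-looking lines found in every <tag> element.

    Assumption: the SDP lives in <tag> elements of the draft's XML source.
    """
    root = ET.fromstring(xml_text)
    lines = []
    for element in root.iter(tag):
        # itertext() also collects text nested inside child elements.
        for raw in "".join(element.itertext()).splitlines():
            line = raw.strip()  # drop indentation that comes from the XML layout
            if SDP_LINE.match(line):
                lines.append(line)
    return lines


if __name__ == "__main__":
    # Made-up sample input, not copied from the draft.
    sample = """<rfc><middle><section>
      <figure><artwork>
        v=0
        o=- 20518 0 IN IP4 0.0.0.0
        this line is not SDP
        a=group:BUNDLE audio video
      </artwork></figure>
    </section></middle></rfc>"""
    for sdp_line in extract_sdp_lines(sample):
        print(sdp_line)
```

This only strips leading and trailing whitespace. An SDP line that was wrapped
across several lines in the XML source would still have to be rejoined by hand
or by an extra step.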


-- 
Surveillance is pervasive. Go Dark.