Re: [sip-clf] Problem Statement - Source/Dest Fields & Contact

Hadriel Kaplan <HKaplan@acmepacket.com> Thu, 07 October 2010 21:19 UTC

Return-Path: <HKaplan@acmepacket.com>
X-Original-To: sip-clf@core3.amsl.com
Delivered-To: sip-clf@core3.amsl.com
Received: from localhost (localhost [127.0.0.1]) by core3.amsl.com (Postfix) with ESMTP id 44EC63A715D for <sip-clf@core3.amsl.com>; Thu, 7 Oct 2010 14:19:50 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -2.483
X-Spam-Level:
X-Spam-Status: No, score=-2.483 tagged_above=-999 required=5 tests=[AWL=0.116, BAYES_00=-2.599]
Received: from mail.ietf.org ([64.170.98.32]) by localhost (core3.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id 4wT-Z3Pontj9 for <sip-clf@core3.amsl.com>; Thu, 7 Oct 2010 14:19:48 -0700 (PDT)
Received: from etmail.acmepacket.com (etmail.acmepacket.com [216.41.24.6]) by core3.amsl.com (Postfix) with ESMTP id 8740A3A7139 for <sip-clf@ietf.org>; Thu, 7 Oct 2010 14:19:48 -0700 (PDT)
Received: from mail.acmepacket.com (216.41.24.7) by etmail.acmepacket.com (216.41.24.6) with Microsoft SMTP Server (TLS) id 8.2.254.0; Thu, 7 Oct 2010 17:20:51 -0400
Received: from mail.acmepacket.com ([127.0.0.1]) by mail ([127.0.0.1]) with mapi; Thu, 7 Oct 2010 17:20:48 -0400
From: Hadriel Kaplan <HKaplan@acmepacket.com>
To: "Vijay K. Gurbani" <vkg@bell-labs.com>
Date: Thu, 07 Oct 2010 17:20:31 -0400
Thread-Topic: [sip-clf] Problem Statement - Source/Dest Fields & Contact
Thread-Index: ActmZX9Zx0rC2JFJSQOAlKDUZUlFiQ==
Message-ID: <9E773F85-84C0-4347-8A12-55A6A2BD0747@acmepacket.com>
References: <AANLkTime-SfqMydvu8gKENvk=iBZsNq_q_4J-DpTh1aX@mail.gmail.com> <4CACA5BD.7080807@bell-labs.com> <DD2D25EE-FF9A-4358-A0F0-813D60F4922E@acmepacket.com> <4CACBF31.7070708@bell-labs.com> <6B351410-A5F8-4FA5-A7B5-497B2365A822@acmepacket.com> <4CACDC36.1060203@bell-labs.com> <E5B0520E-103F-4A23-A591-941D70384F73@magorcorp.com> <4CACE5D8.3010605@bell-labs.com> <72348B72-4112-406C-B792-0ECD3C1E4918@acmepacket.com> <4CACF433.90508@bell-labs.com> <6DBCDD04-7F34-4FB7-80F0-721F42C173D3@acmepacket.com> <40C4DA24-8A5E-4E73-B0FE-91770E3AB68F@acmepacket.com> <AANLkTinxxO+YEe5GZ-SmMhRLfdk3eRyCA2ch1=uP9Qcc@mail.gmail.com> <4CADC8A7.1090807@bell-labs.com> <60EC243B-B7E3-4301-A824-98F14A7A7630@acmepacket.com> <4CADE86E.6030303@bell-labs.com> <7ACD1E49-D529-470E-B80D-1A45A42A2EC7@acmepacket.com> <4CAE1A1A.60109@bell-labs.com>
In-Reply-To: <4CAE1A1A.60109@bell-labs.com>
Accept-Language: en-US
Content-Language: en-US
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
acceptlanguage: en-US
Content-Type: text/plain; charset="us-ascii"
Content-Transfer-Encoding: quoted-printable
MIME-Version: 1.0
Cc: List SIP-CLF Mailing <sip-clf@ietf.org>
Subject: Re: [sip-clf] Problem Statement - Source/Dest Fields & Contact
X-BeenThere: sip-clf@ietf.org
X-Mailman-Version: 2.1.9
Precedence: list
List-Id: SIP Common Log File format discussion list <sip-clf.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/listinfo/sip-clf>, <mailto:sip-clf-request@ietf.org?subject=unsubscribe>
List-Archive: <http://www.ietf.org/mail-archive/web/sip-clf>
List-Post: <mailto:sip-clf@ietf.org>
List-Help: <mailto:sip-clf-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/sip-clf>, <mailto:sip-clf-request@ietf.org?subject=subscribe>
X-List-Received-Date: Thu, 07 Oct 2010 21:19:50 -0000

On Oct 7, 2010, at 3:06 PM, Vijay K. Gurbani wrote:
> 
> Unambiguous how?  In finding the exact host?  What happens if it
> was a DHCP address and re-assigned later on to someone else?  What
> does the unambiguity of an IP address get you then?  I don't think
> that just because we have a raw IP address, all other problems
> associated with that will simply vanish.

It's unambiguous in the sense that it identifies the server you sent the SIP Message to, at the time the record was generated.  Of course the IP could change at some later time (and hostnames can be reassigned), but there is no time-invariant identifier available.  It's not ambiguous to what it refers to, namely the layer-3 IP destination address - whereas if it's a hostname, it can have one of multiple meanings: the host-domain portion of the target URI, the target name of an SRV response entry, or a hostname from a local policy.

For example, Alice generates a "INVITE" to sip:bob@voip.ssp.com, and sends it to her local proxy.  The Proxy's target URI is "sip:bob@voip.ss.com".  It generates a NAPTR for "voip.ssp.com", which returns an entry for "_sip._udp.voip.ssp.com".  It does an SRV lookup of that and gets "sip-access.voip.ssp.com", with A records for 1.1.1.1 and 1.1.1.2.  Does it encode "voip.ss.com" or "sip-access.voip.ssp.com"?  
Suppose instead, voip.ssp.com did not have a NAPTR entry nor SRV, so it performs an A/AAAA lookup for "voip.ssp.com" (per 3263 section 4.2).  Does it encode "voip.ssp.com" as the destination this time?
Now suppose instead the proxy has a local policy to use an on-box hosts-file, which has an entry for "voip.ss.com" to IP Address 2.2.2.2, bypassing 3263.  Does it encode "voip.ss.com"?  

In all the cases above, none of the domain names are the "Destination" next-hop server.  They're domain names.  They're just names used to resolve to a destination.  They may be hostnames, or sub-domain names, or whatever.

Don't get me wrong - I'm not against recording a domain name - I just think it's of a different "type" than a destination IP Address, and not the same field.  I mean it doesn't represent the same ultimate property.  You're losing important details by replacing one with the other, because they're not synonymous.


> First off, I never claimed that an intermediate step of rfc3263 be
> used as a DNS name, it is the target name that is of interest here.

Umm... but you did. :)
You said: "I am making a case to log the final (resolved) DNS name ---
either cocoa.cc.columbia.edu or eclair.cc.columbia.edu,
NOT columbia.edu."

Those names are actually from a step in the middle of RFC 3263, afaict.  The input to RFC 3263 is the SIP Target URI ("sip:columbia.edu" in your example).  The "TARGET" name in 3263 is the host-domain portion of that, "columbia.edu" in your example.  The output of 3263 is the triple {transport protocol, port, IP address}.  Not a name received in an SRV.  "cocoa.cc.columbia.edu" was just from one of the possible middle steps inside 3263.  No?


> Second, if the name is taken from a hostfile instead of rfc3263/DNS,
> then what difference does it make?  The contents of the hostfile are
> pertinent to and of interest to the administrative domain
> interpreting the SIPCLF log file since we are not claiming that log
> files be sent from one domain to another.

Because a hosts-file, like DNS, is just a mapping from a name to one or more IP Addresses - the name it maps from is *not* actually guaranteed to be a "hostname" of a specific server/next-hop.  It's just a name.  And what's worse, hosts-files are famous for being wrong or maliciously subvert-able, so if the SIP-CLF only records the domain name, you can't tell the UAC is actually sending the Message to the wrong place.

-hadriel