Re: [sip-clf] Problem Statement - Source/Dest Fields & Contact

"Vijay K. Gurbani" <vkg@bell-labs.com> Thu, 07 October 2010 19:03 UTC

Return-Path: <vkg@bell-labs.com>
X-Original-To: sip-clf@core3.amsl.com
Delivered-To: sip-clf@core3.amsl.com
Received: from localhost (localhost [127.0.0.1]) by core3.amsl.com (Postfix) with ESMTP id 90A273A701B for <sip-clf@core3.amsl.com>; Thu, 7 Oct 2010 12:03:19 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -102.495
X-Spam-Level:
X-Spam-Status: No, score=-102.495 tagged_above=-999 required=5 tests=[AWL=0.104, BAYES_00=-2.599, USER_IN_WHITELIST=-100]
Received: from mail.ietf.org ([64.170.98.32]) by localhost (core3.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id K7oBuu285rO4 for <sip-clf@core3.amsl.com>; Thu, 7 Oct 2010 12:03:18 -0700 (PDT)
Received: from ihemail4.lucent.com (ihemail4.lucent.com [135.245.0.39]) by core3.amsl.com (Postfix) with ESMTP id CA54C3A700A for <sip-clf@ietf.org>; Thu, 7 Oct 2010 12:03:17 -0700 (PDT)
Received: from umail.lucent.com (h135-3-40-63.lucent.com [135.3.40.63]) by ihemail4.lucent.com (8.13.8/IER-o) with ESMTP id o97J4Hai003686 (version=TLSv1/SSLv3 cipher=DHE-RSA-AES256-SHA bits=256 verify=NO); Thu, 7 Oct 2010 14:04:17 -0500 (CDT)
Received: from shoonya.ih.lucent.com (Knoppix-135185238233.ih.lucent.com [135.185.238.233]) by umail.lucent.com (8.13.8/TPES) with ESMTP id o97J4Gc3009410; Thu, 7 Oct 2010 14:04:16 -0500 (CDT)
Message-ID: <4CAE1A1A.60109@bell-labs.com>
Date: Thu, 07 Oct 2010 14:06:02 -0500
From: "Vijay K. Gurbani" <vkg@bell-labs.com>
Organization: Bell Laboratories, Alcatel-Lucent
User-Agent: Mozilla/5.0 (X11; U; Linux i686; en-US; rv:1.9.1.12) Gecko/20100907 Fedora/3.0.7-1.fc12 Thunderbird/3.0.7
MIME-Version: 1.0
To: Hadriel Kaplan <HKaplan@acmepacket.com>
References: <AANLkTime-SfqMydvu8gKENvk=iBZsNq_q_4J-DpTh1aX@mail.gmail.com> <4CACA5BD.7080807@bell-labs.com> <DD2D25EE-FF9A-4358-A0F0-813D60F4922E@acmepacket.com> <4CACBF31.7070708@bell-labs.com> <6B351410-A5F8-4FA5-A7B5-497B2365A822@acmepacket.com> <4CACDC36.1060203@bell-labs.com> <E5B0520E-103F-4A23-A591-941D70384F73@magorcorp.com> <4CACE5D8.3010605@bell-labs.com> <72348B72-4112-406C-B792-0ECD3C1E4918@acmepacket.com> <4CACF433.90508@bell-labs.com> <6DBCDD04-7F34-4FB7-80F0-721F42C173D3@acmepacket.com> <40C4DA24-8A5E-4E73-B0FE-91770E3AB68F@acmepacket.com> <AANLkTinxxO+YEe5GZ-SmMhRLfdk3eRyCA2ch1=uP9Qcc@mail.gmail.com> <4CADC8A7.1090807@bell-labs.com> <60EC243B-B7E3-4301-A824-98F14A7A7630@acmepacket.com> <4CADE86E.6030303@bell-labs.com> <7ACD1E49-D529-470E-B80D-1A45A42A2EC7@acmepacket.com>
In-Reply-To: <7ACD1E49-D529-470E-B80D-1A45A42A2EC7@acmepacket.com>
Content-Type: text/plain; charset="ISO-8859-1"; format="flowed"
Content-Transfer-Encoding: 7bit
X-Scanned-By: MIMEDefang 2.57 on 135.245.2.39
Cc: List SIP-CLF Mailing <sip-clf@ietf.org>
Subject: Re: [sip-clf] Problem Statement - Source/Dest Fields & Contact
X-BeenThere: sip-clf@ietf.org
X-Mailman-Version: 2.1.9
Precedence: list
List-Id: SIP Common Log File format discussion list <sip-clf.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/listinfo/sip-clf>, <mailto:sip-clf-request@ietf.org?subject=unsubscribe>
List-Archive: <http://www.ietf.org/mail-archive/web/sip-clf>
List-Post: <mailto:sip-clf@ietf.org>
List-Help: <mailto:sip-clf-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/sip-clf>, <mailto:sip-clf-request@ietf.org?subject=subscribe>
X-List-Received-Date: Thu, 07 Oct 2010 19:03:19 -0000

On 10/07/2010 12:27 PM, Hadriel Kaplan wrote:
> No, that's not really the issue.  Nobody's saying "you must not
> record the hostname".  What we're saying is "you must record the IP
> Address, and may optionally *also* record the hostname separately".

No surprise, I disagree on the word "separately" above ;-)

> It's that word "instead" you used above that's the issue.  That
> implies the same *field* carries two different types of information.
> And it's not just syntactically different fields - they're
> *Semantically* different.  A Layer-3 IP destination address is
> unambiguous when we say it's the layer-3 IP.

Unambiguous how?  In finding the exact host?  What happens if it
was a DHCP address and re-assigned later on to someone else?  What
does the unambiguity of an IP address get you then?  I don't think
that just because we have a raw IP address, all other problems
associated with that will simply vanish.

> But using a hostname is _ambiguous_: which hostname is it?  Is it
> the input into the 3263 process (ie, the domain portion of the
> target URI), or is it an intermediate step of 3263? or if 3263 isn't
> used, is it one from a local-policy of proxy-routing which might use
> a hostfile instead of 3263/DNS?  In other words, can the reader of
> the file know which hostname it is without knowing the internals of
> the recorder?

First off, I never claimed that an intermediate step of rfc3263 be
used as a DNS name, it is the target name that is of interest here.
Second, if the name is taken from a hostfile instead of rfc3263/DNS,
then what difference does it make?  The contents of the hostfile are
pertinent to and of interest to the administrative domain
interpreting the SIPCLF log file since we are not claiming that log
files be sent from one domain to another.

> No, not simple.  The semantic of the IP Address field is one of being
> a Layer-3 IP.  For example, getting the layer-3 IP in a SIP-CLF lets
> me go create an ACL or BPF for that... to block or capture packets
> from it in more detail, for example.

Yes, you can do that but if the IP address was re-assigned by DHCP
you will inadvertently block good traffic.

> You can't do that with a hostname.  Furthermore, it means in a
> database/table storing the individual fields I have to have a string
> or blob or complex type instead of a uint32[4] for ip.  And it also
> means my reader/tool/grep has to deal with determining which type it
> is (ip vs hostname) by its content instead of by it's position or
> type value.

The position is invariant irrespective of the contents, so I do not
see why the reader/tool/grep cannot simply interpret it.  If you
want to store the SIPCLF record contents in a database, yes, you'd
have to do a bit more work, but by no means is this extra work
a showstopper, is it?

Thanks,

- vijay
-- 
Vijay K. Gurbani, Bell Laboratories, Alcatel-Lucent
1960 Lucent Lane, Rm. 9C-533, Naperville, Illinois 60566 (USA)
Email: vkg@{alcatel-lucent.com,bell-labs.com,acm.org}
Web:   http://ect.bell-labs.com/who/vkg/