Re: [sip-clf] Problem Statement - Source/Dest Fields & Contact

"Vijay K. Gurbani" <vkg@bell-labs.com> Fri, 08 October 2010 16:14 UTC

Return-Path: <vkg@bell-labs.com>
X-Original-To: sip-clf@core3.amsl.com
Delivered-To: sip-clf@core3.amsl.com
Received: from localhost (localhost [127.0.0.1]) by core3.amsl.com (Postfix) with ESMTP id 97AB13A6899 for <sip-clf@core3.amsl.com>; Fri, 8 Oct 2010 09:14:56 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -102.496
X-Spam-Level:
X-Spam-Status: No, score=-102.496 tagged_above=-999 required=5 tests=[AWL=0.103, BAYES_00=-2.599, USER_IN_WHITELIST=-100]
Received: from mail.ietf.org ([64.170.98.32]) by localhost (core3.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id a75Ctyqa43w9 for <sip-clf@core3.amsl.com>; Fri, 8 Oct 2010 09:14:45 -0700 (PDT)
Received: from ihemail3.lucent.com (ihemail3.lucent.com [135.245.0.37]) by core3.amsl.com (Postfix) with ESMTP id 621E43A686B for <sip-clf@ietf.org>; Fri, 8 Oct 2010 09:14:41 -0700 (PDT)
Received: from umail.lucent.com (h135-3-40-63.lucent.com [135.3.40.63]) by ihemail3.lucent.com (8.13.8/IER-o) with ESMTP id o98GFiDo006825 (version=TLSv1/SSLv3 cipher=DHE-RSA-AES256-SHA bits=256 verify=NO); Fri, 8 Oct 2010 11:15:44 -0500 (CDT)
Received: from shoonya.ih.lucent.com (Knoppix-135185238233.ih.lucent.com [135.185.238.233]) by umail.lucent.com (8.13.8/TPES) with ESMTP id o98GFhTi003202; Fri, 8 Oct 2010 11:15:44 -0500 (CDT)
Message-ID: <4CAF4419.4040401@bell-labs.com>
Date: Fri, 08 Oct 2010 11:17:29 -0500
From: "Vijay K. Gurbani" <vkg@bell-labs.com>
Organization: Bell Laboratories, Alcatel-Lucent
User-Agent: Mozilla/5.0 (X11; U; Linux i686; en-US; rv:1.9.1.12) Gecko/20100907 Fedora/3.0.7-1.fc12 Thunderbird/3.0.7
MIME-Version: 1.0
To: Hadriel Kaplan <HKaplan@acmepacket.com>
References: <AANLkTime-SfqMydvu8gKENvk=iBZsNq_q_4J-DpTh1aX@mail.gmail.com> <4CACA5BD.7080807@bell-labs.com> <DD2D25EE-FF9A-4358-A0F0-813D60F4922E@acmepacket.com> <4CACBF31.7070708@bell-labs.com> <6B351410-A5F8-4FA5-A7B5-497B2365A822@acmepacket.com> <4CACDC36.1060203@bell-labs.com> <E5B0520E-103F-4A23-A591-941D70384F73@magorcorp.com> <4CACE5D8.3010605@bell-labs.com> <72348B72-4112-406C-B792-0ECD3C1E4918@acmepacket.com> <4CACF433.90508@bell-labs.com> <6DBCDD04-7F34-4FB7-80F0-721F42C173D3@acmepacket.com> <40C4DA24-8A5E-4E73-B0FE-91770E3AB68F@acmepacket.com> <AANLkTinxxO+YEe5GZ-SmMhRLfdk3eRyCA2ch1=uP9Qcc@mail.gmail.com> <4CADC8A7.1090807@bell-labs.com> <60EC243B-B7E3-4301-A824-98F14A7A7630@acmepacket.com> <4CADE86E.6030303@bell-labs.com> <7ACD1E49-D529-470E-B80D-1A45A42A2EC7@acmepacket.com> <4CAE1A1A.60109@bell-labs.com> <9E773F85-84C0-4347-8A12-55A6A2BD0747@acmepacket.com>
In-Reply-To: <9E773F85-84C0-4347-8A12-55A6A2BD0747@acmepacket.com>
Content-Type: text/plain; charset="ISO-8859-1"; format="flowed"
Content-Transfer-Encoding: 7bit
X-Scanned-By: MIMEDefang 2.57 on 135.245.2.37
Cc: List SIP-CLF Mailing <sip-clf@ietf.org>
Subject: Re: [sip-clf] Problem Statement - Source/Dest Fields & Contact
X-BeenThere: sip-clf@ietf.org
X-Mailman-Version: 2.1.9
Precedence: list
List-Id: SIP Common Log File format discussion list <sip-clf.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/listinfo/sip-clf>, <mailto:sip-clf-request@ietf.org?subject=unsubscribe>
List-Archive: <http://www.ietf.org/mail-archive/web/sip-clf>
List-Post: <mailto:sip-clf@ietf.org>
List-Help: <mailto:sip-clf-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/sip-clf>, <mailto:sip-clf-request@ietf.org?subject=subscribe>
X-List-Received-Date: Fri, 08 Oct 2010 16:14:58 -0000

On 10/07/2010 04:20 PM, Hadriel Kaplan wrote:
> It's unambiguous in the sense that it identifies the server you sent
> the SIP Message to, at the time the record was generated.  Of course
> the IP could change at some later time (and hostnames can be
> reassigned), but there is no time-invariant identifier available.
> It's not ambiguous to what it refers to, namely the layer-3 IP
> destination address - whereas if it's a hostname, it can have one of
> multiple meanings: the host-domain portion of the target URI, the
> target name of an SRV response entry, or a hostname from a local
> policy.

Hadriel: Our disconnect is that you seem to think that I want
to log the host-domain portion of the target URI --- I do not.  I
am pushing for logging the Target field of the SRV lookup, which
will correspond to one IP address.  To be more specific, the last
field in the output below:

$ host -t SRV _sip._udp.sip.columbia.edu
_sip._udp.sip.columbia.edu has SRV record 10 10 5060 cocoa.cc.columbia.edu.
_sip._udp.sip.columbia.edu has SRV record 10 10 5060 eclair.cc.columbia.edu.

Now, I realize that you have made a case that the DNS lookup of
the Target field may result in multiple RRs.  However, my
contention is that the multiple RRs are of different types (e.g.,
A and AAAA) and not multiple RRs of A (a la round-robin DNS).

> For example, Alice generates a "INVITE" to sip:bob@voip.ssp.com, and
> sends it to her local proxy.  The Proxy's target URI is
> "sip:bob@voip.ss.com".  It generates a NAPTR for "voip.ssp.com",
> which returns an entry for "_sip._udp.voip.ssp.com".  It does an SRV
> lookup of that and gets "sip-access.voip.ssp.com", with A records
> for 1.1.1.1 and 1.1.1.2.

Why do you presume that a A lookup for a Target returned on a SRV
search will result in multiple A records.  If the service provider
wanted to do load balancing, multiple SRV records are a much better
option than to have multiple A records corresponding to a SRV target.

> Does it encode "voip.ss.com" or "sip-access.voip.ssp.com"?

sip-access.voip.ssp.com.

> Suppose instead, voip.ssp.com did not have a NAPTR entry nor SRV, so
> it performs an A/AAAA lookup for "voip.ssp.com" (per 3263 section
> 4.2).  Does it encode "voip.ssp.com" as the destination this time?

Yes.  Because anyone doing a DNS lookup on voip.ssp.com will get
the IP address (again, assuming round-robin DNS is not being used.
I don't think round-robin DNS for SIP servers is very prevalent,
and from what I understand there are other problems with round-robin
DNS as well.)

> Now suppose instead the proxy has a local policy to use an on-box
> hosts-file, which has an entry for "voip.ss.com" to IP Address
> 2.2.2.2, bypassing 3263.  Does it encode "voip.ss.com"?

Yes.  Whoever analyzes the logfile is not going to do so in isolation.
I presume that they will be aware that the DNS resolution was done in
the context of local policy.

> Don't get me wrong - I'm not against recording a domain name - I
> just think it's of a different "type" than a destination IP Address,
> and not the same field.  I mean it doesn't represent the same
> ultimate property.  You're losing important details by replacing one
> with the other, because they're not synonymous.

I am not replacing one by the other, I am making a case for logging
either format depending on what the implementer wants to do.

>> First off, I never claimed that an intermediate step of rfc3263 be
>>  used as a DNS name, it is the target name that is of interest
>> here.
>
> Umm... but you did. :) You said: "I am making a case to log the
> final (resolved) DNS name --- either cocoa.cc.columbia.edu or
> eclair.cc.columbia.edu, NOT columbia.edu." Those names are actually
> from a step in the middle of RFC 3263, afaict.  The input to RFC 3263
> is the SIP Target URI ("sip:columbia.edu" in your example).  The
> "TARGET" name in 3263 is the host-domain portion of that,
> "columbia.edu" in your example.  The output of 3263 is the triple
> {transport protocol, port, IP address}. Not a name received in an
> SRV.  "cocoa.cc.columbia.edu" was just from one of the possible
> middle steps inside 3263.  No?

No.  columbia.edu is a high-level identifier used as input to
rfc3263; _sip._udp.columbia.edu and _sip._tcp.columbia.edu are
intermediary names,  but cocoa.cc.columbia.edu and
eclair.cc.columbia.edu are the final destinations to which the IP
packet is sent to over a certain transport and on a certain port.

The disconnect is that you assume multiple A RRs for
cocoa.cc.columbia.edu and I do not.

> Because a hosts-file, like DNS, is just a mapping from a name to one
> or more IP Addresses - the name it maps from is *not* actually
> guaranteed to be a "hostname" of a specific server/next-hop.  It's
> just a name.  And what's worse, hosts-files are famous for being
> wrong or maliciously subvert-able, so if the SIP-CLF only records
> the domain name, you can't tell the UAC is actually sending the
> Message to the wrong place.

Whether or not an administrative domain decides to use host-files
is their prerogative.  If they do use them, they should understand
that any interpretation of the names in the SIPCLF records is a
function of their host-files.  Beyond that, I don't think there is
value in discussing host-files.

Thanks,

- vijay
-- 
Vijay K. Gurbani, Bell Laboratories, Alcatel-Lucent
1960 Lucent Lane, Rm. 9C-533, Naperville, Illinois 60566 (USA)
Email: vkg@{alcatel-lucent.com,bell-labs.com,acm.org}
Web:   http://ect.bell-labs.com/who/vkg/