Re: [SIP] draft-camarillo-sip-sdp-00.txt

Gonzalo Camarillo <Gonzalo.Camarillo@lmf.ericsson.se> Fri, 13 October 2000 08:30 UTC

Return-Path: <owner-confctrl>
Received: (from majordom@localhost) by zephyr.isi.edu (8.9.3/8.9.3) id BAA04371 for confctrl-outgoing; Fri, 13 Oct 2000 01:30:07 -0700 (PDT)
Received: from gamma.isi.edu (gamma.isi.edu [128.9.144.145]) by zephyr.isi.edu (8.9.3/8.9.3) with ESMTP id BAA04365 for <confctrl@zephyr.isi.edu>; Fri, 13 Oct 2000 01:30:05 -0700 (PDT)
Received: from penguin-ext.wise.edt.ericsson.se (penguin-ext.wise.edt.ericsson.se [194.237.142.110]) by gamma.isi.edu (8.9.3/8.9.3) with ESMTP id BAA08280 for <confctrl@ISI.EDU>; Fri, 13 Oct 2000 01:30:17 -0700 (PDT)
Received: from fogerty.lmf.ericsson.se (fogerty.lmf.ericsson.se [131.160.11.6]) by penguin.wise.edt.ericsson.se (8.11.0/8.10.1/WIREfire-1.3) with ESMTP id e9D8UBZ02592; Fri, 13 Oct 2000 10:30:12 +0200 (MEST)
Received: from lmf.ericsson.se (E005004B57CE1.lmf.ericsson.se [131.160.30.148]) by fogerty.lmf.ericsson.se (8.9.3+Sun/8.9.3) with ESMTP id LAA12740; Fri, 13 Oct 2000 11:30:07 +0300 (EET DST)
Message-ID: <39E6C80D.4E614098@lmf.ericsson.se>
Date: Fri, 13 Oct 2000 11:30:05 +0300
From: Gonzalo Camarillo <Gonzalo.Camarillo@lmf.ericsson.se>
Organization: Oy L M Ericsson Ab
X-Mailer: Mozilla 4.61 [en] (WinNT; I)
X-Accept-Language: en
MIME-Version: 1.0
To: "Culpepper, Bert" <bert.culpepper@intervoice-brite.com>
CC: Michael Thomas <mat@cisco.com>, Rohan Mahy <rohan@cisco.com>, Jonathan Rosenberg <jdrosen@dynamicsoft.com>, sip@lists.bell-labs.com, Henning Schulzrinne <hgs@cs.columbia.edu>, mmusic <confctrl@ISI.EDU>
Subject: Re: [SIP] draft-camarillo-sip-sdp-00.txt
References: <DBD1CC7CE357D211AECC009027158FD1034848A3@itmail-ict1-imc.wichi>
Content-Type: text/plain; charset="us-ascii"
Content-Transfer-Encoding: 7bit
Sender: owner-confctrl@zephyr.isi.edu
Precedence: bulk

Hi,

I will try to summarize the discussions so far:

The issue I am looking at is: "What happens when I need to receive a
single media stream in different ports (or even different IP addresses)
depending on the codec used at any moment".

A media stream here is, for instance, the audio transmitted in a regular
telephone call.

With this issue in mind, it is good to know scenarios where this can
happen and, of course, possible solutions.

There are multiple scenarios where we can have this situation.

-The device receiving voice is not the same as the device that handles
DTMF tones. Thus, the media stream will have to be sent to the device
handling voice when the RTP packets contain PCM voice (for instance) and
to the device handling DTFM tones when they carry tones.

-I need to use a transcoder because I have a cheap client that does not
support all the audio codecs in the world. I want to receive PCM in my
IP address, but I would like to receive GSM in the IP address of the
transcoder. The transcoder will take care of sending the stream to my IP
address once it has been transcoded.

-Cellular terminals that have to configure radio bearers with specific
codec information because some optimizations are applied. They want to
receive media on different ports depending on the codec used.

Folks have pointed out more scenarios where this might happen.

The first solution I thought of was to send a single RTP stream to
different ports (or different IP addresses). As the AVT guys pointed
out, this would cause lots of problems. Mainly related to RTCP.
Everybody agreed that a much better solution was to establish different
RTP sessions.

If different RTP sessions are established, this issue is transparent to
RTP and RTCP. They are thus not affected.

SDP backwards compatibility was also an issue, so a solution that does
not break existing implementations has to be found.

The fid (flow identifier) attribute I was describing in my previous mail
would just be an identifier for the flow. That is, we might have several
RTP streams, but all of them are related, because we are dealing with
just one flow.

So, an application receiving this parameter would know that all the m
lines in the SDP with the same fid attribute (a=fid:12, for instance)
are related. I believe this is useful information for the application.

If an application does not understand the fid attribute, everything
works properly. The only issue is that the application is not aware of
this relation between m lines.


Regarding Mike's question, this was already presented as input for SDPng
in the last IETF. I am now after a solution for SDP. I agree that SDPng
will solve all these issues. The quesion is: when? Everybody working on
it is currently pretty busy.

Best regards,

Gonzalo



"Culpepper, Bert" wrote:
> 
> Hi:
> 
> I've only followed this thread on the SIP list and read the recent
> discussion on the AVT WG list,  I've probably missed some related discussion
> in other WGs.  It appears to me, and as Gonzalo pointed out, separate RTP
> streams are OK.  The session description below describes two RTP streams
> with only one active at any point in time (I believe the semantics of the
> fid attribute shown indicate that I can choose to send only one) vs. a
> single RTP stream with either of two encoding schemes.  However, as a SIP UA
> and having received that session description I know to send the two
> encodings separately if I want to make the other endpoint happy.  (And only
> one at a time if I support the fid attribute.)  It does seem reasonable that
> it might complicate QoS, but I expect the benefit of the separate media
> streams is justified when used.
> 
> Anyway, segregating media streams (where exact synchronization isn't needed
> and RTP is good enough) enables certain efficiencies when providing
> telecommunications and like services in an IP network.  This is only one
> advantage I suspect.
> 
> Regards,
> Bert
> 
> > -----Original Message-----
> > From: Michael Thomas [mailto:mat@cisco.com]
> > Sent: Thursday, October 12, 2000 10:22 AM
> > To: Gonzalo Camarillo
> > Cc: Culpepper, Bert; Rohan Mahy; Jonathan Rosenberg;
> > sip@lists.bell-labs.com; Henning Schulzrinne; mmusic
> > Subject: Re: [SIP] draft-camarillo-sip-sdp-00.txt
> >
> >
> > Gonzalo Camarillo writes:
> >  > I would like my session description to look like:
> >  >
> >  >          v=0
> >  >          o=John 289085535 289085535 IN IP4 first.example.com
> >  >          t=0 0
> >  >          c=IN IP4 111.111.111.111
> >  >          m=audio 20000 RTP/AVP 0
> >  >          a=fid:1
> >  >          m=audio 20002 RTP/AVP 8
> >  >          a=fid:1
> >  >
> >  > If an implementation does not understand the fid attribute it will try
> >  > to mix the audio streams. Since one is "empty" all the time, doing this
> >  > is not really a big deal. Thus, backwards compatibility is not a
> >  > concern.
> >  >
> >  > Do you have any concerns with this approach?
> >
> >    The main problem I see is with the interaction with
> >    QoS reservations. Namely, that you'd expect the receiver
> >    to book reservations for both flows. Doing:
> >
> >    m=audio 20000 RTP/AVP 0 8
> >
> >    on the other hand only implies that you book the ceil()
> >    of both flows.
> >
> >    Why is it so advantageous to segregate these into two
> >    streams? The implicit coupling you're after seems like
> >    it's not well handled by SDP right now, and I'd bet
> >    that this would be better taken up as a working item
> >    for SDPng (if not already there) because my understanding
> >    is that explicit audio synch with video is also problematic in
> >    SDP right now, which is sort of similar.
> >
> >               Mike
> >

-- 
Gonzalo Camarillo         Phone :  +358  9 299 33 71
Oy L M Ericsson Ab        Mobile:  +358 40 702 35 35
Telecom R&D               Fax   :  +358  9 299 30 52
FIN-02420 Jorvas          Email :  Gonzalo.Camarillo@ericsson.com
Finland                   http://www.hut.fi/~gonzalo