Re: [p2pi] Real life torrent statistics

Marshall Eubanks <tme@multicasttech.com> Tue, 19 August 2008 14:35 UTC

Return-Path: <p2pi-bounces@ietf.org>
X-Original-To: p2pi-archive@ietf.org
Delivered-To: ietfarch-p2pi-archive@core3.amsl.com
Received: from [127.0.0.1] (localhost [127.0.0.1]) by core3.amsl.com (Postfix) with ESMTP id CD7A23A6ABA; Tue, 19 Aug 2008 07:35:47 -0700 (PDT)
X-Original-To: p2pi@core3.amsl.com
Delivered-To: p2pi@core3.amsl.com
Received: from localhost (localhost [127.0.0.1]) by core3.amsl.com (Postfix) with ESMTP id D4AB03A6B33 for <p2pi@core3.amsl.com>; Tue, 19 Aug 2008 07:35:46 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -101.564
X-Spam-Level:
X-Spam-Status: No, score=-101.564 tagged_above=-999 required=5 tests=[AWL=-1.504, BANG_GUAR=0.939, BAYES_50=0.001, RCVD_IN_DNSWL_LOW=-1, USER_IN_WHITELIST=-100]
Received: from mail.ietf.org ([64.170.98.32]) by localhost (core3.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id kKUXRfO8Wm+Q for <p2pi@core3.amsl.com>; Tue, 19 Aug 2008 07:35:46 -0700 (PDT)
Received: from multicasttech.com (lennon.multicasttech.com [63.105.122.7]) by core3.amsl.com (Postfix) with ESMTP id B0CED3A6A34 for <p2pi@ietf.org>; Tue, 19 Aug 2008 07:35:45 -0700 (PDT)
Received: from [63.105.122.7] (account marshall_eubanks HELO [IPv6:::1]) by multicasttech.com (CommuniGate Pro SMTP 3.4.8) with ESMTP-TLS id 12564687; Tue, 19 Aug 2008 10:35:53 -0400
Message-Id: <FD4F1F02-0CAF-4A34-A49F-3AB58C9A26A2@multicasttech.com>
From: Marshall Eubanks <tme@multicasttech.com>
To: Stas Khirman <stas@khirman.com>
In-Reply-To: <00d701c901c7$9f0340c0$140aa8c0@viceroy>
Mime-Version: 1.0 (Apple Message framework v926)
Date: Tue, 19 Aug 2008 10:35:52 -0400
References: <004601c90109$844b9890$6500a8c0@viceroy> <8B2A8C57-F3D9-48FB-B02A-E3F424B71F02@multicasttech.com> <00d701c901c7$9f0340c0$140aa8c0@viceroy>
X-Mailer: Apple Mail (2.926)
Cc: p2pi@ietf.org, p4pwg@yahoogroups.com
Subject: Re: [p2pi] Real life torrent statistics
X-BeenThere: p2pi@ietf.org
X-Mailman-Version: 2.1.9
Precedence: list
List-Id: P2P Infrastructure Discussion <p2pi.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/listinfo/p2pi>, <mailto:p2pi-request@ietf.org?subject=unsubscribe>
List-Archive: <http://www.ietf.org/pipermail/p2pi>
List-Post: <mailto:p2pi@ietf.org>
List-Help: <mailto:p2pi-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/p2pi>, <mailto:p2pi-request@ietf.org?subject=subscribe>
Content-Transfer-Encoding: quoted-printable
Content-Type: text/plain; charset="windows-1252"; Format="flowed"; DelSp="yes"
Sender: p2pi-bounces@ietf.org
Errors-To: p2pi-bounces@ietf.org

On Aug 19, 2008, at 2:48 AM, Stas Khirman wrote:

> > Here is a plot of the top 20 with a power law superimposed:
> >
> > http://www.americafree.tv/papers/torrent.top_20.png
> >
> > Regards
> > Marshall
> >
> [Stas Khirman]
>
> Marshall,
>
> Thank for your investigation, but unfortunately, when considering  
> tail of the distribution I'm not sure it power distribution can be  
> claimed - please check
> http://www.khirman.com/system/files/image/blog/torrent_distribution_blog_2.gif

Well, you generally can't tell anything from a linear-linear plots in  
these cases.

However, it is common in these cases for there to be a flattening with  
small rank
(thus the Pareto-Mandelbrot or Zipf-Mandelbrot distribution), and a  
lack of power as the numbers per bin get to
be of order unity. Both of these can be seen with the full data set

http://www.americafree.tv/papers/asn_groups.png
http://www.americafree.tv/papers/asn_groups.2.png

With only 3 degrees of freedom, the modified power law fits well for  
about 2 decades of rank.

This is not too surprising to me - similar Zipf type modified power  
laws can be seen in the selection
of video content (see http://www.imconf.net/imc-2007/papers/ 
imc78.pdf ) and I would
expect node selection to trace that.

Regards
Marshall

>>
> By popular demand, I just published major points and raw data on my  
> blog :
> http://www.khirman.com/blog/p2p_localisation
>
> Working notes available at http://www.khirman.com/system/files/image/blog/torrent_distribution.pdf
>
> Raw AS data http://www.khirman.com/system/files/image/blog/asn_groups.csv 
>  - please feel free to “twist” it – will appreciate any input.
>
>
> Regards
> Stas Khirman
> P.S. For those who in Silicon Valley – would be glad if you join our  
> Club event August 21 - http://www.khirman.com/ctc/20080821. Hot  
> controversial discussion and cold beer – guaranteed! (please sign-up  
> if you plan to attend)
>
>

_______________________________________________
p2pi mailing list
p2pi@ietf.org
https://www.ietf.org/mailman/listinfo/p2pi