Message-ID: <539BD1DE.3080008@riseup.net>
Date: Fri, 13 Jun 2014 22:38:54 -0600
From: Mirimir <mirimir@riseup.net>
To: tor-talk@lists.torproject.org
Subject: Re: [tor-talk] Craigslist now giving Tor the slows, lol

On 06/13/2014 08:43 PM, grarpamp wrote:
> On Fri, Jun 13, 2014 at 12:54 AM, Mirimir <mirimir@riseup.net> wrote:
>> http://bayimg.com/lAoiMAAfL
>>
>>> For three of the ten exits used (default client choices) in the first
>>> test series, http://craigslist.org/ loaded in about eight seconds (30-50
>>> Kbps). For the other seven, it took several minutes (~1 Kbps).
>>>
>>> I don't see any obvious correlation with blacklist status.
> 
> Since you seem to be letting tor pick its exits, sampling for
> days might give a wider, more representative spread of exits. And
> you could plot things for each exit over time.

Yes, I'm letting Tor pick. I want to get results that are directly
relevant for actual users. There will still be some artifacts due to
console-based snapshotting, however. And yes, I'll be running this for a
while, perhaps with multiple VMs in parallel. So far, I'm collecting data
for the top 50 websites. I will look at behavior over time.

> I see a major prevalence of variation in your returned page lengths
> in bytes; almost every exit varied. Only about 1% of my single fetches
> across 1200+ exits varied from the exact normal byte count. It should
> be determined whether the tor software somehow causes this when
> carrying 'slow' packet streams, by running the fetch in a loop, bound
> to the exit IP of an exit relay affected by both slowblocking and
> byte variance.

As I understand it, you're just getting the HTML. I'm getting the entire
page, or at least whatever Midori grabs while pretending to be Firefox.
For example, I get http://xvideos.com/ with numerous (X-rated) images ;)

Also, I was hitting sites at 1-2 minute intervals, and successive hits
sometimes used the same exit (and perhaps the same circuit). Excluding
craigslist, the greatest loading time was about 500 seconds, which is
longer than those intervals, so page loads may have overlapped at times.
I've increased the sleep between site loads to 8-12 minutes, and decreased
the sleep between 50-site runs from 30-60 minutes to 20-40 minutes. That
may reduce page-size variance.
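
As a rough illustration of the revised timing, here is a minimal Python
sketch of one 50-site run with randomized sleeps. Only the 8-12 and 20-40
minute ranges come from the message; the uniform distribution, the names
run_once and pick_sleep_seconds, and the load_site callback are
assumptions made for the example, not the actual test script.

```python
import random
import time

SITE_SLEEP_MIN = (8, 12)   # minutes between site loads (from the message)
RUN_SLEEP_MIN = (20, 40)   # minutes between 50-site runs (from the message)


def pick_sleep_seconds(bounds_min, rng=random):
    """Return a random delay in seconds within bounds given in minutes.

    Assumption: delays are drawn uniformly; the message gives only ranges.
    """
    low, high = bounds_min
    return rng.uniform(low * 60, high * 60)


def run_once(sites, load_site, sleep=time.sleep, rng=random):
    """Load each site once, sleeping a random interval between loads.

    load_site is a hypothetical callback that fetches one site and
    returns whatever measurement is wanted (e.g. load time in seconds).
    """
    results = []
    for i, site in enumerate(sites):
        results.append((site, load_site(site)))
        if i < len(sites) - 1:          # no sleep after the last site
            sleep(pick_sleep_seconds(SITE_SLEEP_MIN, rng))
    return results
```

Between runs, the caller would sleep for
pick_sleep_seconds(RUN_SLEEP_MIN). Passing sleep and rng as parameters
keeps the loop easy to dry-run without actually waiting.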

> No more variance = tor issue.
> Still variance = IP <--> CL stack/path issue, or CL issue alone.
> 
> Will look at your Midori tool and maybe more of this type of project
> sometime later.
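
The check proposed in the quoted text could be sketched roughly as
follows. This assumes that the "normal byte count" is the most common
size seen, and that the looped fetches are made directly from the exit
relay's IP rather than through tor; both are readings of the quoted
text, not confirmed details. The function names and the tolerance
parameter are hypothetical.

```python
from collections import Counter


def fraction_varying(byte_counts):
    """Fraction of fetches whose size differs from the most common size."""
    if not byte_counts:
        raise ValueError("no fetches recorded")
    _modal_size, modal_hits = Counter(byte_counts).most_common(1)[0]
    return 1 - modal_hits / len(byte_counts)


def interpret(direct_fetch_sizes, tolerance=0.0):
    """Apply the quoted rule to sizes from fetches made without tor.

    tolerance is an assumed allowance for incidental variation.
    """
    if fraction_varying(direct_fetch_sizes) <= tolerance:
        return "tor issue"
    return "IP <--> CL stack/path issue, or CL issue alone"
```

For example, fraction_varying([100, 100, 100, 101]) gives 0.25, since one
of four fetches differs from the modal size.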

There's also wkhtmltopdf. It may do a better job, being even lighter than
Midori, but I worry that it may also look less like a browser than
command-line Midori does.

Once I work out kinks, and collect enough data, I'll write this up
somewhere with results for all 50 top sites.

