Re: [Storagesync] Some preliminary investigations on ownCloud

Jakub Moscicki <Jakub.Moscicki@cern.ch> Thu, 26 November 2015 07:29 UTC

Return-Path: <Jakub.Moscicki@cern.ch>
X-Original-To: storagesync@ietfa.amsl.com
Delivered-To: storagesync@ietfa.amsl.com
Received: from localhost (ietfa.amsl.com [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 23EBB1A1BF8 for <storagesync@ietfa.amsl.com>; Wed, 25 Nov 2015 23:29:48 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -1.9
X-Spam-Level:
X-Spam-Status: No, score=-1.9 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, HTML_MESSAGE=0.001, SPF_HELO_PASS=-0.001] autolearn=ham
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id wLEgufBNUWr2 for <storagesync@ietfa.amsl.com>; Wed, 25 Nov 2015 23:29:42 -0800 (PST)
Received: from emea01-db3-obe.outbound.protection.outlook.com (mail-db3on0669.outbound.protection.outlook.com [IPv6:2a01:111:f400:fe04::669]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-SHA384 (256/256 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 7E70F1B2A7D for <storagesync@ietf.org>; Wed, 25 Nov 2015 23:29:41 -0800 (PST)
Received: from AM3PR06CA051.eurprd06.prod.outlook.com (10.141.192.169) by DB5PR06MB1416.eurprd06.prod.outlook.com (10.163.103.22) with Microsoft SMTP Server (TLS) id 15.1.331.20; Thu, 26 Nov 2015 07:29:20 +0000
Received: from DB3FFO11FD056.protection.gbl (2a01:111:f400:7e04::195) by AM3PR06CA051.outlook.office365.com (2a01:111:e400:882b::41) with Microsoft SMTP Server (TLS) id 15.1.331.20 via Frontend Transport; Thu, 26 Nov 2015 07:29:20 +0000
Authentication-Results: spf=pass (sender IP is 188.184.36.50) smtp.mailfrom=cern.ch; ietf.org; dkim=none (message not signed) header.d=none;ietf.org; dmarc=bestguesspass action=none header.from=cern.ch;
Received-SPF: Pass (protection.outlook.com: domain of cern.ch designates 188.184.36.50 as permitted sender) receiver=protection.outlook.com; client-ip=188.184.36.50; helo=CERNMX11.cern.ch;
Received: from CERNMX11.cern.ch (188.184.36.50) by DB3FFO11FD056.mail.protection.outlook.com (10.47.217.128) with Microsoft SMTP Server (TLS) id 15.1.331.11 via Frontend Transport; Thu, 26 Nov 2015 07:29:20 +0000
Received: from cernfe01.cern.ch (188.184.36.42) by cernmxgwlb4.cern.ch (188.184.36.50) with Microsoft SMTP Server (TLS) id 14.3.158.1; Thu, 26 Nov 2015 08:29:18 +0100
Received: from CERNXCHG51.cern.ch ([fe80::20f7:8173:2da8:398a]) by CERNFE01.cern.ch ([fe80::885b:f831:a970:75e1%13]) with mapi id 14.03.0174.001; Thu, 26 Nov 2015 08:29:18 +0100
From: Jakub Moscicki <Jakub.Moscicki@cern.ch>
To: Linhui Sun <lh.sunlinh@gmail.com>
Thread-Topic: [Storagesync] Some preliminary investigations on ownCloud
Thread-Index: AQHRJ/vKifQcOZIR00eK6XsP2b71eZ6t148A
Date: Thu, 26 Nov 2015 07:29:17 +0000
Message-ID: <03A07C3E-9B3B-4886-9131-2CAF0A7B3F85@cern.ch>
References: <CAO_YprZTz+O-e82hsTgMBOLr645jJqbhtVKngubnLhimyfB2cg@mail.gmail.com> <71E522FC-C622-4DDE-B444-5CE902980823@cern.ch> <2015112611280303192311@bjtu.edu.cn> <CAO_YprbsHsDYewHh38uDJ6gOoSZKwO8MoCR+UHz21iarEqEQOA@mail.gmail.com>
In-Reply-To: <CAO_YprbsHsDYewHh38uDJ6gOoSZKwO8MoCR+UHz21iarEqEQOA@mail.gmail.com>
Accept-Language: en-GB, en-US
Content-Language: en-US
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
x-originating-ip: [191.242.168.97]
Content-Type: multipart/alternative; boundary="_000_03A07C3E9B3B488691312CAF0A7B3F85cernch_"
MIME-Version: 1.0
X-EOPAttributedMessage: 0
X-Microsoft-Exchange-Diagnostics: 1; DB3FFO11FD056; 1:RogARR9MqF4a2JuwMGhtx6Cno17HyFVwP5SQ52Ge4ZEQodGVGN1UC0HqtBeIsKUgLoZ8yDHzCQSuxJLNoMgtSrooVB9WLkLrZ7QtcscqjqFqEchaNN2yWEpjWaedHMCXttUh8f8jkQey5WGh7Q1P0RDM1tmLMM30F8rpuZTZwnlPnh7jtJj7aBUChe8ATxpo8QFV/3CdXuzfT/WkcV0POfNfdFvHQ30xxiwLDg2v1bNgrLe6/86lODZ2ETxw/Yko7HQiOlWW6+GbsUENMrHxHv++GbeP1aLkE5LGYUiSU51kH352SIcmBUMT2RiLjbIZaRWcpH9OaqXNQodc+DEbN51WkpXzAVIVg+mTF+lFDRs7KYC738oatArqRIZZbu7x0txuEWUiTFJ10EsQh1VoAJ9TcAUG7QCGdmB2xdqKvlY=
X-Forefront-Antispam-Report: CIP:188.184.36.50; CTRY:CH; IPV:CAL; IPV:NLI; EFV:NLI; SFV:NSPM; SFS:(10009020)(3000300001)(438002)(189002)(199003)(377424004)(24454002)(53754006)(1220700001)(16236675004)(19580405001)(11100500001)(53416004)(189998001)(84326002)(33656002)(106466001)(5001970100001)(93886004)(74482002)(4001150100001)(50986999)(5004730100002)(512874002)(1096002)(76176999)(66066001)(106116001)(92566002)(19580395003)(3846002)(16297215004)(6806005)(87936001)(5003600100002)(19617315012)(110136002)(102836003)(6116002)(586003)(83716003)(16796002)(2900100001)(2950100001)(86362001)(54356999)(36756003)(15975445007)(26826002)(82746002)(5250100002)(300700001)(5008740100001)(104396002); DIR:OUT; SFP:1101; SCL:1; SRVR:DB5PR06MB1416; H:CERNMX11.cern.ch; FPR:; SPF:Pass; PTR:cernmx11.cern.ch; MX:1; A:1; LANG:en;
X-Microsoft-Exchange-Diagnostics: 1; DB5PR06MB1416; 2:kZBxkuEOTqYkaf4qMrRUV/UzTxMNESUh+flY5+DsFqckmRdk1rHnZpoY+zz6A0qgT3T4ezI92LBeHLvDQV9OlDnnIoZnzcP9HEkT3m3//2V7ShvhJN9P18VeYkRmeK0LuX2iZKahwzYkDKG1iiGTHw==; 3:+5EqIz92jeg/6VTsS3BJndOjt5yZizQWy5NrXXoRSVczlhMKXKLxrJEDSQqAPbm9Xj04+UmQ/CVNFPUayC7ef0xpIL3DTIjvKl5GcEHS6pXVD9KaUQ5d8Vu8gfAT+nad7wSvC+2wBySqU0685y8OdFopTQGlK9l6CV0ngOrW5/5368Ho6Cd58sew4xS9TSShvZy43T2Vq83r18BjgolDdkdqNmsI1P9OHB9iAWibKJGXWf88sf44htNq0kX7hmTRHg4rjtZGK5cPlvFSoheTrg==; 25:TdJtWCTHwJSWdMDCzV2d/+HDfbc/k6YyOmFr8H47TuR1VnO3V3Tf0TifPqC9+LvSd0+WdR9xkMGsl97UMUDmMbm3YecFpdQTBpFvbsVBD/7PubkaK9qZy/qYGJ+85hKZnhJTidK6t4Ycl3tyKNSQ9SD9lBcA69N7ah8wD8BGE90bWJYeQ5psF234uGIq+iO+oODTUo1LV7q4LykUyzv0TjDHBQIbesVOK85peYd48WpfRVdUWRYZGkh4JKZw5f5/54g9/3KCYZct8fZVk/MOnw==
X-Microsoft-Antispam: UriScan:; BCL:0; PCL:0; RULEID:(8251501001); SRVR:DB5PR06MB1416;
X-Microsoft-Exchange-Diagnostics: 1; DB5PR06MB1416; 20:FCddSNzrzZdrzdrFgg6jgvWO82t5ajSCjMFBrB5HvO9tZWH9fICYi/WvdjgwHlpCIx+R6Mmi/qRhgbjueOe6riCxf5Bj7MkLHc/arR1To6EJQY2ZXyp8I7DK1O4k2Ec5SJx3Hw8XceLEPBSTdSOm9KeXGKkmb6esZEi5CVfYmUQ1Hup1gh7WsTr2vVS+wxH5wEol807pQpEddH7SXHYgQef3A7vPnsYa+l+giHGvBSjFUYbR0bAy64zx2Xi5v+xNoRnEJHQJFQ4+hE+6hroK5uAjAluZjZlRlgnM6FpT72UK/olgqYKFJag6qvhvw9PWbmvRb6i2NYWizsEQWFbo42e3aMJqHL8nvliacrrENy6wJIwGogvr1yz7mBbRFX+GNmBA7OZiFnLpvhctePXtZHRO0u/waDFfNsYtW/JhLu+ATgUIu0sRAnAxqAWEONVpHPl12fMzrAUzrF7f879lS2K/lyaZfrNU6ju+allY186zpR9pk11rqgO98/sUN0tR; 4:gpsIXT44M407zWe/Oc+69GNVexIwS8Zu3lQD0jJ4HVuKCBOtuy4wMkPa7lCbKUCQ6h+eZnzRlQjTkzgyq+bL+ICet7p0mjONUtrESkZUGS40jgJfHWWKKevEK+UTnRpr17u7pqgaKweII1PDa9cqvcnSL5zGbZPNTwgasJ/oVnxolC+uLqFNXtsd4N8q2qZovS1P6Yc4diUogn2AYvO2o3smBrFWkgNPqKZctCrM68elT/D38HT6Js/j9lBAA2slF5Ev3bh66TSvSU/W1RSK4oZ+VZGCy4AbOMCSs+8T60FoFveWcUjP4/5hebRTf/uAa6lP2+I3cHXTV9nhyMWrXckkxXFkqAzGYr+fg5GJpj3ncPyNx4HxIDmRWdnQWRysidOM9vrCk0pZ+52qruoAfSDFNna1ANt+/glJi2Z0Is6D4/i7Jr2dGXbJZZ7DcE3S
X-Microsoft-Antispam-PRVS: <DB5PR06MB141627F2E15EF41DB475E9A486040@DB5PR06MB1416.eurprd06.prod.outlook.com>
X-Exchange-Antispam-Report-Test: UriScan:(109460225580195);
X-Exchange-Antispam-Report-CFA-Test: BCL:0; PCL:0; RULEID:(601004)(2401047)(520078)(5005006)(8121501046)(3002001)(10201501046); SRVR:DB5PR06MB1416; BCL:0; PCL:0; RULEID:; SRVR:DB5PR06MB1416;
X-Forefront-PRVS: 0772E5DAD5
X-Microsoft-Exchange-Diagnostics: 1; DB5PR06MB1416; 23:/tZLRsQ/KtfMBuqSyUv/KhY2nH5jj1OrsmaiJouqJ01oFCtEj4E7cfnnpGO19E2njila/Zbj8d5FW0WQjYgtbzoPBTmw0zsDCf/OSl4xXYWcE/62/jt0b2oUzfGHYJ8iXlXMTwXYyAJ3HCYsXZ7goCz6T5TlWOwiIodRvuQYobsWJsHBVciJ7XHpcsgqSiHP60iz5UDm7XXHCM7rf8ypQhRCQcGp/qDPJbuyTyPuFCeGTvQWhZDcZ4QEA6iQdVhqenSABkKfUR+dPhReK/Z8BCLWJbjdRuNykya3fqe3/BeXCU6G3SIKmfOeAJ5AWzKtMAQ8rIyvnY9W1+rdP54rotNWSY0FRsGRtqwfxPBVzDYG++Yx6qlXkA1TZbSPTlT2CycYhQyrL/QcfDJXF6ZG4ww958izNm9QzmpROV/voDVUCoa4i0fGr5x2bNAmcvU75DMXjBtK9f6EZZvD5+c7/OaW2Ok70SKtOi/MYbIAjH3b2wwNp1RAw9WdXVrH1+vXLqQblyUgJlff5tiizS3UXr7t3JE3nEJwOdiyw3DywasvdXnrVJshUw4nmxqWLsSmTWhPMbuAAUse4Izvl6u7fJLfIysnI7E3N85oUPDAZt+u5BOBWSTS+WNHV48BFBWDLVgr3BFaDlVdb15fd5glzSHTBbhW6NU0bgJqBtoJig4qL/C5CVEQVCDatTaVW+Fd4/BmHimNM/sgXs4bgczXdN4iXlWhe6z3KXtWkszRyO4tTS9YOr4/qKie09S7qdAAhI9sYqSDZt3UwT8lXVnqJCA5YsicDBavHQ4fBjL8AP21nXs3o1IlEfWTYmoPyvyGrY0ThSmg566fsLZMxwk3PxJJJkrN6e9RGjMaf+G5TPrtnAgDSaJyp6vcIDim1DG3d4pnZQ75TY17lFlMZYLsoXfvo/LfHAC9/4xztN8Ziu3u7Jm/oUetww16YDqdljLBvtm0PkM+mfPK0UUCMkVvwcE61ZtlltO2C7Y4IrKFC5E4hEpF2M6Q8cN8hL+3ogeg2Zae9E4djy40GHIa1+ZrVsDmDDkx7HxarPC674e+pQ1MIolXkJ7prch3qPFqgS0cxqgMTZ9Cr9l4ZP9V93RGA1WQBCJjo6OigL40EYUOTUikeROWcxUurxL5GlxpSJ176YtNeI4EATwKSQLrGNoha7Xai5JzcGJu4xuss0ai3piaIp/QlfRYA26epnxX+cOwYQYK/4gu+V5dAUhiHLo0V0Kn3lhAOw+LuTIfYZVmDs73dR4kO6CqMFzSgTDK2k/FI4kD43MM9cqSrc+ghOqfvaxBq0CzVYzpOpEmXDV3B1PdaoIixcReDaFZlPPN3P0ShqjvE5MLAgIbUnKlmlYlMVeNjBLwhwL9zuvyPwsZSalbn8vcegOgvhJVZuNRthyu
X-Microsoft-Exchange-Diagnostics: 1; DB5PR06MB1416; 5:kG9XRfgUKRcvocnoSAyrpLpJbIEcsP/SF+QSrn6xvavDOMrEAEjNVwodZirqW1+u5OKfkkIc8+5OfXNgAFCgTgPtsqP+wH7O7ftBgDFDaHuG87m5Fb3YVCJBou6iZeLhtFMFQDDZ7rav1YoSPKbr1A==; 24:cds0gQ4U5dBRBvpvWQF0nqWp5zRpEuIJhbWGNR3RlStn6tS0XBz8/HdDMBvWQhQJIKQv0M7WW7aKc69vbrf+7QPoa/rKz5LdtgGvT9x72Pk=
SpamDiagnosticOutput: 1:23
SpamDiagnosticMetadata: NSPM
X-OriginatorOrg: cern.ch
X-MS-Exchange-CrossTenant-OriginalArrivalTime: 26 Nov 2015 07:29:20.3051 (UTC)
X-MS-Exchange-CrossTenant-Id: c80d3499-4a40-4a8c-986e-abce017d6b19
X-MS-Exchange-CrossTenant-OriginalAttributedTenantConnectingIp: TenantId=c80d3499-4a40-4a8c-986e-abce017d6b19; Ip=[188.184.36.50]; Helo=[CERNMX11.cern.ch]
X-MS-Exchange-CrossTenant-FromEntityHeader: HybridOnPrem
X-MS-Exchange-Transport-CrossTenantHeadersStamped: DB5PR06MB1416
Archived-At: <http://mailarchive.ietf.org/arch/msg/storagesync/f0lI75kmtwa0L5JNkNXt9Y10q2U>
Cc: fsong <fsong@bjtu.edu.cn>, storagesync <storagesync@ietf.org>
Subject: Re: [Storagesync] Some preliminary investigations on ownCloud
X-BeenThere: storagesync@ietf.org
X-Mailman-Version: 2.1.15
Precedence: list
List-Id: Mechanisms to synchronize client file systems with Internet-based data storage services <storagesync.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/storagesync>, <mailto:storagesync-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/storagesync/>
List-Post: <mailto:storagesync@ietf.org>
List-Help: <mailto:storagesync-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/storagesync>, <mailto:storagesync-request@ietf.org?subject=subscribe>
X-List-Received-Date: Thu, 26 Nov 2015 07:29:48 -0000

2015-11-26 11:28 GMT+08:00 Fei Song <fsong@bjtu.edu.cn<mailto:fsong@bjtu.edu.cn>>:
BTW, Based on the last sentence of last email:"The intention of this message is to investigate the current state of using WebDAV for sync purposes to see what needs to be improved here and whether we need new protocols"

The outcome he/she wanted might be just the links like http://cs3.ethz.ch/program.html :)
In another word, I think what I want finally might be a discussion about "What is needed for the ISS: a sync protocol or a generalized API". Sorry for the poor expression : )

I think this is a relevant question indeed (well, actually also if there is any standard needed at all in the first place). By the generalized API do you mean a HTTP-style API (like REST)? In that case you may consider WebDAV quite close and the difference between the protocol and the API blurs for practical purposes.

While I agree that WebDAV may have disadvantages there is a good number of installations using this already in our community (research labs mainly in, but not limited to, Europe). I think the interesting point about WebDAV/HTTP is extensibility and maybe it is worth is that these extensions go by a “standard”. However, you should consider that for reasonably efficient sync scenario the server should also exhibit certain behaviour. That is, in case of owncloud for example, a server should be able to efficiently propagate the ETAG changes up the directory tree (like the Merkle Tree) so that the client may use propfind efficiently. This is not a hard spec requirement but otherwise propfinding the entire remote tree each time would be impractically inefficient. So this really goes a little bit beyond just an API.

You should also consider that the OpenCloudMesh (under GEANT umbrella) is an initiative with the intent is to make cross-service sharing very easy. These shares may also be synchronized automatically. I currently have no evidence, however, that other software providers with the exception of owncloud are interested in developing such standard. If there is no bottom up interest from the users then it won’t happen (in my opinion).

With the link I wanted to point you to the fact that what you discuss in this mailing list will be also discussed at the upcoming CS3 event I linked in. The intent is to do this discussing together between service providers, developers and researchers together — so that it does not only end up as an academic exercise but backed up by a critical mass if there is some potential in standardisation, at least in our community. I hope this could be of interest to IETF community, as mentioned on the program page.

Selected papers will be published in FGCS after the event, so please stand by, or attend the event if you want to be part of this discussion here. BTW. the abstract submission deadline is past but one exceptionally good contribution could still possibly be accommodated if submitted rapidly.

I would be nonetheless happy to continue contributing to the discussion in this mailing list.

Best regards,

Jakub Moscicki

--


Regards,
Linhui


--------------
Fei Song
>Hello,
>
>What kind of outcome are you looking for with this analysis? Some research in this area has already been done or is being done as we speak
>
>e.g. "A study of delta-sync and other optimisation in HTTP/Webdav synchonisation protocols"
>
>see "Technology and Research":
>
>http://cs3.ethz.ch/program.html
>
>It would be interesting to see if there is a potential for collaboration. Or maybe we already have some information you are looking for.
>
>Best regards,
>
>Jakub Moscicki
>
>—
>
>
>On 25 Nov 2015, at 11:45, Linhui Sun <lh.sunlinh@gmail.com<mailto:lh.sunlinh@gmail.com><mailto:lh.sunlinh@gmail.com<mailto:lh.sunlinh@gmail.com>>> wrote:
>
>Hi all,
>
>As I mentioned before, I think the developers could benefit from the IETF standards. The ownCloud (https://owncloud.org/) is just an example. It is developed for those who do not trust commercial storage services and want to build their own network-based storage services. The ownCloud is using WebDAV (RFC4918) to achieve the data sync. IMO, the WebDAV is designed for distributed work but not for the sync. Thus, I made some preliminary investigations on how the ownCloud uses WebDAV for sync purposes. A brief summary of what I've found is in the following, please correct me if I am wrong.
>
>I installed the ownCloud server (v8.2.1) on the CentOS7, and the client is a desktop client on Windows.
>
>1. To find whether there is a change to the synced directory, the client continuously sends PROPFIND to the server at regular intervals (around 34 seconds under my observation). The server will respond a 207 Multi-Status Response to tell whether the main directory has been changed. To perform this regular check, the client will open a new TCP connection to send the PROPFIND, the server will close the existing TCP connection after responding the 207 Multi-Status Response. For the next check, the client will open another new TCP connection.
>
>2. Every time adding (or creating) a new file to the local folder, the client will open a new TCP connection (if there is no connection existing) to send the file asap. The client will first send several PROPFINDs to find out which sub-directory has been changed. And then it sends the file using PUT. The server will respond a 201 Created Response and then terminate the connection. Currently, I haven't found any application layer chunking, all the segmentation are performed by TCP.
>
>3. Every time I delete (or rename) a file locally, the client will also open a new TCP connection to send several PROPFINDs to find out which file has been removed (or renamed). Then it will send DELETE (or MOVE). The server will respond a 204 No Content Response (or 201 Created Response) and then terminate the connection.
>
>4. I open a file and frequently edit and save it (actually this is what I usually do with the Dropbox). The client will send the whole file to the server every time I save the file.
>
>To summarize, it seems that the ownCloud makes heavily use of PROPFIND to achieve the sync process. Each sync operation (e.g. upload, modify and etc.) will start with sending one or more PROPFINDs. And currently, if I add a file to the server (directly from the server side via web interface), the client cannot find the change. I need to interrupt the sync and recover it to make the client be aware of the change and download the newly added file. I'm not sure whether this is caused by the sync mechanism or an improper server configuration. I need to investigate this further and also how the ownCloud works for multiple clients (or devices).
>
>For ISS, I think ownCloud has demonstrated to some extent that similar IETF protocols could be deployed and employed. The intention of this message is to investigate the current state of using WebDAV for sync purposes to see what needs to be improved here and whether we need new protocols.
>
>Comments are welcome : )
>
>Regards,
>Linhui
>
>
>_______________________________________________
>Storagesync mailing list
>Storagesync@ietf.org<mailto:Storagesync@ietf.org><mailto:Storagesync@ietf.org<mailto:Storagesync@ietf.org>>
>https://www.ietf.org/mailman/listinfo/storagesync
>
_______________________________________________
Storagesync mailing list
Storagesync@ietf.org<mailto:Storagesync@ietf.org>
https://www.ietf.org/mailman/listinfo/storagesync