Re: [nfsv4] RFC 7530: Filehandle of opened file after the REMOVE

Trond Myklebust <trondmy@gmail.com> Tue, 27 December 2016 20:24 UTC

Return-Path: <trondmy@gmail.com>
X-Original-To: nfsv4@ietfa.amsl.com
Delivered-To: nfsv4@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id A1D72129B86 for <nfsv4@ietfa.amsl.com>; Tue, 27 Dec 2016 12:24:43 -0800 (PST)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -1.699
X-Spam-Level:
X-Spam-Status: No, score=-1.699 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, FREEMAIL_FROM=0.001, FREEMAIL_REPLY=1, HTML_MESSAGE=0.001, RCVD_IN_DNSWL_LOW=-0.7, SPF_PASS=-0.001] autolearn=no autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (2048-bit key) header.d=gmail.com
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id ZDT68gJBy7fU for <nfsv4@ietfa.amsl.com>; Tue, 27 Dec 2016 12:24:41 -0800 (PST)
Received: from mail-lf0-x241.google.com (mail-lf0-x241.google.com [IPv6:2a00:1450:4010:c07::241]) (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id D1B40129B81 for <nfsv4@ietf.org>; Tue, 27 Dec 2016 12:24:40 -0800 (PST)
Received: by mail-lf0-x241.google.com with SMTP id d16so16048690lfb.1 for <nfsv4@ietf.org>; Tue, 27 Dec 2016 12:24:40 -0800 (PST)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=from:message-id:mime-version:subject:date:in-reply-to:cc:to :references; bh=abdu7oAggEARrntW7/ZE4DFUxlc4LvpHruTFqm7m8Mg=; b=iuRuL/YaF7t+/qIwV9p2+ZezSYbIMZzBaqUr6Ne2pdLpPSrBb4HTLWcHh+80kgX2s1 EwjZyOzM8RPy14rxkhZ/jhm0RCVXv6JG40JDvjOON2/6SaY/VpbNfjH2ynua3JtbNKlF AGuyfy+kFSq5O77ZwxCyrYKu6QJkhnDUarNPTz+wg9oMZZrRK+gAOpl5IQsoAVCfdH8p j3JcdE+ucumDp4urm3G21LvyOtvB2t0CXEfPusP+9hwXAmho+AEhhGlMJ722r5QwiJYF ATtuJQ6yabuUnfRVndEl7rkNWiFtORPH1/cKppyA7p036rdnsWh/gaVQV2rXKIZhxgkw LrYg==
X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:from:message-id:mime-version:subject:date :in-reply-to:cc:to:references; bh=abdu7oAggEARrntW7/ZE4DFUxlc4LvpHruTFqm7m8Mg=; b=Fb7sEjtwzXwKEg3B+gGkVx9svbb6o6yuGja0xbZmU4+ucbaW3OpuW0DFjHkRa7kgn8 oDwfW2xwXNfbAy8vXt8UUYt2mMJWKs0mxQqif/rBmV6BuKyk0yUjc4bzhp1dgyiu1QFK VX5UBUhfefVzZx23MkPwbXZkkp+TPdNOBVNWby5vgrDMHMF6Q9A/9Phk5EV4nXrk2ydn DsG6QYPoNkIDMKMvXFfdOdWVSGNakI7N1Jlcu6LBVSbrDcQQbTfaTAdmb14B6st2dEl9 JdLf0QbOoCAb7grHe07ljzSf2cq6AsQrQAf6NBSyYQ4hg/Y27GVoCBL7Qsd0Fsv9wQH0 QpfA==
X-Gm-Message-State: AIkVDXKlEh7JWIUdRvFY/Dzxi60luKN2hyzNw676OKCQ174dPxoL7W2wTJWA0YO0PyyG4A==
X-Received: by 10.25.76.194 with SMTP id z185mr12302431lfa.182.1482870278871; Tue, 27 Dec 2016 12:24:38 -0800 (PST)
Received: from [10.0.1.13] (9.42.202.84.customer.cdi.no. [84.202.42.9]) by smtp.gmail.com with ESMTPSA id c66sm11348987ljd.44.2016.12.27.12.24.37 (version=TLS1_2 cipher=ECDHE-RSA-AES128-GCM-SHA256 bits=128/128); Tue, 27 Dec 2016 12:24:38 -0800 (PST)
From: Trond Myklebust <trondmy@gmail.com>
Message-Id: <C496AE44-0F27-4B66-A1F6-A76AEAFD7A90@gmail.com>
Content-Type: multipart/alternative; boundary="Apple-Mail=_D5AA62C2-FA6C-4413-9CC7-DF6FDE429C4E"
Mime-Version: 1.0 (Mac OS X Mail 10.2 \(3259\))
Date: Tue, 27 Dec 2016 21:24:36 +0100
In-Reply-To: <CADaq8jck14SKL6Ua9QxbqPyX1=1aaA7+76wv-__EWFvh7ZcEJA@mail.gmail.com>
To: Dave Noveck <davenoveck@gmail.com>
References: <20161213155825.Horde.vsqZuNSZ9hIXlcHQYxmgRC7@mail.telka.sk> <CADaq8jeiwGwgV=_HHjR2D4uNaKq9zY96hJOVXp4Q0H-3OgH2qA@mail.gmail.com> <20161213165639.Horde.t6BGVBJqifWKHucfa069yT8@mail.telka.sk> <CAABAsM579kGU4VzZfqWPUMPJ14QDBheJ8eMAk7DrYUSGscfVkQ@mail.gmail.com> <20161213171902.Horde.MkS1YMOM6VpxA0Z7rSMTe7P@mail.telka.sk> <CAABAsM5L0xdKodxk1dRSugLyROzn2JzgDkq6kdHE0LuGcfh++A@mail.gmail.com> <20161213181734.Horde.EqgB09El8rupnkesIQaBwJ3@mail.telka.sk> <CADaq8jcq2C0o8EWXoGjxDn58sV_J+-SP-=rj934Se-DV69b-pw@mail.gmail.com> <20161214112112.Horde.aPh8AjT6iWRl37CULwihyV7@mail.telka.sk> <CAABAsM7v6y0bsb0jKzfvobkUjniTLhM3uv8FYjo07HcLD2004w@mail.gmail.com> <20161227144414.GA32002@fieldses.org> <CADaq8jck14SKL6Ua9QxbqPyX1=1aaA7+76wv-__EWFvh7ZcEJA@mail.gmail.com>
X-Mailer: Apple Mail (2.3259)
Archived-At: <https://mailarchive.ietf.org/arch/msg/nfsv4/dt_b2y4UrQ08jPC3iWb5CxTKQmg>
Cc: Bruce James Fields <bfields@fieldses.org>, IETF NFSv4 WG Mailing List <nfsv4@ietf.org>
Subject: Re: [nfsv4] RFC 7530: Filehandle of opened file after the REMOVE
X-BeenThere: nfsv4@ietf.org
X-Mailman-Version: 2.1.17
Precedence: list
List-Id: NFSv4 Working Group <nfsv4.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/nfsv4>, <mailto:nfsv4-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/nfsv4/>
List-Post: <mailto:nfsv4@ietf.org>
List-Help: <mailto:nfsv4-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/nfsv4>, <mailto:nfsv4-request@ietf.org?subject=subscribe>
X-List-Received-Date: Tue, 27 Dec 2016 20:24:43 -0000

> On Dec 27, 2016, at 19:13, David Noveck <davenoveck@gmail.com> wrote:
> 
> Bruce wrote:
> 
> > I think this is what OPEN4_RESULT_PRESERVE_UNLINKED in 5661 was meant to
> > do.  
> 
> In fact it was.  See the sixth bullet in section 1.8 of RFC5661.
> 
> The problem I see is that it doesn't solve the problem that Marcel has pointed out.  If client A
> opens the file and client B removes it, there is no way it can decide whether or not to do a 
> silly rename.  Since he hasn't opened, the file he doesn't see the flag indicating it is not 
> needed.  Also, since the file is not open he has no indication that it is needed either.  So client
> B will do a REMOVE in all cases.  

As far as I can tell, Marcel is mainly interested in NFSv4.0. RFC5661 does not address any of the cases he was describing.

> 
> Server implementations may or may not preserve the file until last close.  For those that do, 
> everything will work out OK, while for those that don't, there isn't much that the client can 
> do.  If he did find out the server did not have the support, he has o way to defer the remove 
> until last close because he has no way of finding out when this happens,
> 
> Trond wrote:
> 
> > There are 4 main problems that are inadequately discussed in any of the existing RFCs, 
> > and that you need to address before we can consider replacing sillyrename 
> 
> It appears it has already been considered and spec'd but the spec may not be
> adequate.  Sigh!
> 
> > (which is well established today, and well understood by most users).
> 
> Most people understand why it works when it does but don't understand why it doesn't
> work in some cases.  So they are happy until they aren’t.

Umm…. If the solution works with unlink-on-close then you all do realize that it MUST always also work with sillyrename, right? Having a third party client remove a file named ‘.nfsXXXXX’ is just a special case of having it remove a file with a generic name.

IOW: I could argue that the problem here can be considered to be purely a server problem and irrelevant to the actual delete strategy chosen by the client.

> > 1) How does the client identify that the server supports this functionality?
> 
> The problem is that the client who doesn't open file, the troublesome case, is exactly the one in which
> he receives no information.  The client who does the open finds out silly rename is not needed, but, in
> this case, silly rename seems to work OK.
> 
> > 2) What functionality is needed on the underlying filesystems on the server?
> 
> Almost all filesystems have the ability to defer deletion until the last close.  If they don't, they
> are not very usable.  Servers which don't support this are not suffering from a lack of filesystem
> functionality.  Instead, the problem seems to be that some servers have an NFSv4 open which 
> either doesn't connect the to the underlying open functionality or there is no underlying open functionality.

Sure, but in order to be reboot safe, you need to go well beyond the functionality that you describe. As far as I can tell, you need to defer the garbage collection that would normally occur when the server comes back up again until after the state reclaim has occurred (as you describe below).

> 
> > 3) How does the server function in the case of a reboot? What can the client expect in terms of recoverability?
> 
> This is addressed b section 18.16.3 of rfc5661.  The third bullet on page  449 says:
> 
> Furthermore, the server promises to preserve the file through the grace period after server restart, thereby giving the client the opportunity to reclaim its open.

Again, this is specified for NFSv4.x (x>0) but not for x=0. It is not clear to me that older NFSv4 servers will have any of this functionality.

> 
> > 4) How does the client in practice perform recovery?
> >   a) In the case of server reboots.
> 
> I think he just recovers this opens within the grace period.
> 
> >   b) In the case of lease timeouts/network partitions
> 
> This case is not addressed RFC5661, so far as I can see.  The typical handling, in which
> the locks are all dropped would cause the file to go away.  Courtesy locks could address the
> problem, but the spec doesn't that have to be implemented.
> 
> > Note that the lack of open-by-filehandle in NFSv4.0 makes 4.b) more difficult than it should otherwise be.
> 
> True. In v4.1, a client can try an open-by-filehandle and take advantage of any courtesy locks that the server 
> has retained.
> 
> On Tue, Dec 27, 2016 at 9:44 AM, J. Bruce Fields <bfields@fieldses.org <mailto:bfields@fieldses.org>> wrote:
> On Wed, Dec 14, 2016 at 09:28:41AM -0500, Trond Myklebust wrote:
> > On Wed, Dec 14, 2016 at 6:21 AM, Marcel Telka <marcel@telka.sk <mailto:marcel@telka.sk>> wrote:
> >
> > > Citát David Noveck <davenoveck@gmail.com <mailto:davenoveck@gmail.com>>:
> > >
> > >> It appears that you want an informational document saying, more or less:
> > >>
> > >>    - If the server does not want clients to be discomfited by open files
> > >>    being removed, since such behavior is disallowed by typical OS
> > >> (e.g.Unix)
> > >>    semantics, the server can avoid this situation by delaying the actual
> > >>    removal of the file until last close, as allowed by RFC7530.
> > >>    - The use of rename by clients as a substitute for remove, normally
> > >>    known as "silly rename", has significant problems, since removes can
> > >> happen
> > >>    on nodes that do not have the file open.
> > >>
> > >> If this is what you want, then you can write an I-D and submit it.
> > >>
> > >
> > > Yes, this is exactly what I want to see.
> > >
> > >
> > There are 4 main problems that are inadequately discussed in any of the
> > existing RFCs, and that you need to address before we can consider
> > replacing sillyrename (which is well established today, and well understood
> > by most users).
> >
> > 1) How does the client identify that the server supports this functionality?
> 
> I think this is what OPEN4_RESULT_PRESERVE_UNLINKED in 5661 was meant to
> do.  It'd be interesting to try an implementation and see if your other
> points are addressed, but I haven't thought about it in a long time.
> 
> --b.
> 
> > 2) What functionality is needed on the underlying filesystems on the server?
> > 3) How does the server function in the case of a reboot? What can the
> > client expect in terms of recoverability?
> > 4) How does the client in practice perform recovery?
> >    a) In the case of server reboots.
> >    b) In the case of lease timeouts/network partitions
> >
> > Note that the lack of open-by-filehandle in NFSv4.0 makes 4.b) more
> > difficult than it should otherwise be.
> 
> > _______________________________________________
> > nfsv4 mailing list
> > nfsv4@ietf.org <mailto:nfsv4@ietf.org>
> > https://www.ietf.org/mailman/listinfo/nfsv4 <https://www.ietf.org/mailman/listinfo/nfsv4>
> 
> _______________________________________________
> nfsv4 mailing list
> nfsv4@ietf.org <mailto:nfsv4@ietf.org>
> https://www.ietf.org/mailman/listinfo/nfsv4 <https://www.ietf.org/mailman/listinfo/nfsv4>
>