Re: [nfsv4] Review comments for WGLC of draft-ietf-nfsv4-layout-types

Thomas Haynes <loghyr@primarydata.com> Tue, 15 August 2017 18:51 UTC

Return-Path: <loghyr@primarydata.com>
X-Original-To: nfsv4@ietfa.amsl.com
Delivered-To: nfsv4@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id E3BF7132356 for <nfsv4@ietfa.amsl.com>; Tue, 15 Aug 2017 11:51:32 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -2.7
X-Spam-Level:
X-Spam-Status: No, score=-2.7 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, RCVD_IN_DNSWL_LOW=-0.7, SPF_PASS=-0.001, URIBL_BLOCKED=0.001] autolearn=ham autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (1024-bit key) header.d=primarydata.com
Received: from mail.ietf.org ([4.31.198.44]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id OS0hgUXvn5Cy for <nfsv4@ietfa.amsl.com>; Tue, 15 Aug 2017 11:51:29 -0700 (PDT)
Received: from us-smtp-delivery-194.mimecast.com (us-smtp-delivery-194.mimecast.com [216.205.24.194]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-SHA384 (256/256 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 969B4132256 for <nfsv4@ietf.org>; Tue, 15 Aug 2017 11:51:29 -0700 (PDT)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=primarydata.com; s=mimecast20170802; t=1502823088; h=from:subject:date:message-id:to:cc:mime-version:content-type:content-transfer-encoding:in-reply-to:references; bh=D2z6bFro6ZKaWtf1hjIMQPtXBJEPGzwK9IHdyeYXFtA=; b=LamscHnwi2W5z4TTTJpRxKqheL1rOObkpRoW6AG7zZYmk8fNJonssHDiNj6zm+7qTaUagifzDPbVeR2/ok3ucF9aqReOxpLC4koRJgT3g8povxxOt3dZGdeWptp2ggeP7f9LqQ8VMUjkt9w54VZ3GfRH+w4Ji7AfxBdHOClVc2s=
Received: from NAM03-CO1-obe.outbound.protection.outlook.com (mail-co1nam03lp0022.outbound.protection.outlook.com [216.32.181.22]) (Using TLS) by us-smtp-1.mimecast.com with ESMTP id us-mta-101-rWkeazBHPpmECmSVAT81xg-1; Tue, 15 Aug 2017 14:51:25 -0400
Received: from BY2PR1101MB1093.namprd11.prod.outlook.com (10.164.166.21) by BY2PR1101MB1095.namprd11.prod.outlook.com (10.164.166.23) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_CBC_SHA384_P256) id 15.1.1341.21; Tue, 15 Aug 2017 18:51:22 +0000
Received: from BY2PR1101MB1093.namprd11.prod.outlook.com ([10.164.166.21]) by BY2PR1101MB1093.namprd11.prod.outlook.com ([10.164.166.21]) with mapi id 15.01.1341.020; Tue, 15 Aug 2017 18:51:22 +0000
From: Thomas Haynes <loghyr@primarydata.com>
To: Chuck Lever <chuck.lever@oracle.com>
CC: "nfsv4@ietf.org" <nfsv4@ietf.org>, hch <hch@lst.de>
Thread-Topic: [nfsv4] Review comments for WGLC of draft-ietf-nfsv4-layout-types
Thread-Index: AQHTDHrZiISsN7s3MU64UU3+W1eRPaKF1gEA
Date: Tue, 15 Aug 2017 18:51:22 +0000
Message-ID: <7572D8F6-D1FF-4723-AF89-1AD2B8C0F4F8@primarydata.com>
References: <418E308E-5057-499A-B403-38486BDEF772@oracle.com>
In-Reply-To: <418E308E-5057-499A-B403-38486BDEF772@oracle.com>
Accept-Language: en-US
Content-Language: en-US
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
x-originating-ip: [63.157.6.18]
x-ms-publictraffictype: Email
x-microsoft-exchange-diagnostics: 1; BY2PR1101MB1095; 20:/PqEF7E9EX1kjuEXRfWln6W74uN35Jf6IgORYtc1QaAKOfhU071KxbJfCK4wHZZuOBmGta3K4E0uFKBUWE+35pkHo+C+DtGtIkOr749sNFbM9OPWQHQhzhe1r6SEkk3E6fVcyKv08MOa+yN26HIrxr/eQqQhXh1rJgHdOesqFcc=
x-ms-exchange-antispam-srfa-diagnostics: SSOS;
x-ld-processed: 03193ed6-8726-4bb3-a832-18ab0d28adb7,ExtAddr
x-ms-office365-filtering-correlation-id: d850177a-f349-4c91-8a39-08d4e40e9f9a
x-microsoft-antispam: UriScan:; BCL:0; PCL:0; RULEID:(300000500095)(300135000095)(300000501095)(300135300095)(22001)(300000502095)(300135100095)(2017030254152)(2017082002075)(300000503095)(300135400095)(2017052603031)(201703131423075)(201702281549075)(300000504095)(300135200095)(300000505095)(300135600095)(300000506095)(300135500095); SRVR:BY2PR1101MB1095;
x-ms-traffictypediagnostic: BY2PR1101MB1095:
x-exchange-antispam-report-test: UriScan:(158342451672863)(120809045254105)(192374486261705)(146099531331640);
x-microsoft-antispam-prvs: <BY2PR1101MB10953A1528435A6C656012C4CE8D0@BY2PR1101MB1095.namprd11.prod.outlook.com>
x-exchange-antispam-report-cfa-test: BCL:0; PCL:0; RULEID:(100000700101)(100105000095)(100000701101)(100105300095)(100000702101)(100105100095)(6040450)(601004)(2401047)(8121501046)(5005006)(10201501046)(3002001)(100000703101)(100105400095)(93006095)(93001095)(6041248)(20161123562025)(20161123555025)(20161123564025)(2016111802025)(201703131423075)(201702281528075)(201703061421075)(201703061406153)(20161123560025)(20161123558100)(6072148)(6043046)(201708071742011)(100000704101)(100105200095)(100000705101)(100105500095); SRVR:BY2PR1101MB1095; BCL:0; PCL:0; RULEID:(100000800101)(100110000095)(100000801101)(100110300095)(100000802101)(100110100095)(100000803101)(100110400095)(100000804101)(100110200095)(100000805101)(100110500095); SRVR:BY2PR1101MB1095;
x-forefront-prvs: 04004D94E2
x-forefront-antispam-report: SFV:NSPM; SFS:(10019020)(6009001)(39830400002)(24454002)(377454003)(189002)(199003)(8936002)(33656002)(2906002)(230783001)(6512007)(54906002)(99286003)(6306002)(6246003)(6436002)(53936002)(110136004)(5660300001)(68736007)(189998001)(101416001)(50986999)(76176999)(54356999)(966005)(345774005)(86362001)(102836003)(6116002)(3846002)(53546010)(77096006)(105586002)(106356001)(3280700002)(14454004)(2950100002)(66066001)(3660700001)(6486002)(6506006)(6916009)(36756003)(478600001)(229853002)(25786009)(4326008)(8676002)(81166006)(305945005)(7736002)(97736004)(2900100001)(82746002)(83716003)(81156014)(42262002); DIR:OUT; SFP:1102; SCL:1; SRVR:BY2PR1101MB1095; H:BY2PR1101MB1093.namprd11.prod.outlook.com; FPR:; SPF:None; PTR:InfoNoRecords; MX:1; A:1; LANG:en;
spamdiagnosticoutput: 1:99
spamdiagnosticmetadata: NSPM
Content-ID: <756BA3B315F25F4A91302E9D0769D5E4@namprd11.prod.outlook.com>
MIME-Version: 1.0
X-OriginatorOrg: primarydata.com
X-MS-Exchange-CrossTenant-originalarrivaltime: 15 Aug 2017 18:51:22.2275 (UTC)
X-MS-Exchange-CrossTenant-fromentityheader: Hosted
X-MS-Exchange-CrossTenant-id: 03193ed6-8726-4bb3-a832-18ab0d28adb7
X-MS-Exchange-Transport-CrossTenantHeadersStamped: BY2PR1101MB1095
X-MC-Unique: rWkeazBHPpmECmSVAT81xg-1
Content-Type: text/plain; charset="UTF-8"
Content-Transfer-Encoding: base64
Archived-At: <https://mailarchive.ietf.org/arch/msg/nfsv4/utmA_JACJH0Lrpy5iwCG7ntIXAc>
Subject: Re: [nfsv4] Review comments for WGLC of draft-ietf-nfsv4-layout-types
X-BeenThere: nfsv4@ietf.org
X-Mailman-Version: 2.1.22
Precedence: list
List-Id: NFSv4 Working Group <nfsv4.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/nfsv4>, <mailto:nfsv4-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/nfsv4/>
List-Post: <mailto:nfsv4@ietf.org>
List-Help: <mailto:nfsv4-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/nfsv4>, <mailto:nfsv4-request@ietf.org?subject=subscribe>
X-List-Received-Date: Tue, 15 Aug 2017 18:51:33 -0000

> On Aug 3, 2017, at 10:06 AM, Chuck Lever <chuck.lever@oracle.com> wrote:
> 
> Thanks to Tom for writing this one.

No, thank you for the review!


> 
> I'm not qualified to address the technical issues covered by this
> document, thus my comments are limited to editorial and mechanical
> suggestions. 
> 
> It would be great if current authors of new layouts (other than
> the author of this document) could review this one. Christoph,
> can you take a look at this one? It's short.
> 
> The document is a valuable addition to the canon of pNFS
> specifications, but is still rough around the edges IMO. Nothing
> that is impossible to fix, but seems like section 4 needs some
> face lifting.
> 
> 
> Document status:
> 
> This is a "Standards Track" document. Section 4 appears to update
> three RFCs which are Standards Track. Only one of these is mentioned
> in the document header and Abstract.
> 
> Section 3 appears to be advice to authors of layout specifications,
> and advice to the WG on how to evaluate new layouts. Is the use of
> RFC 2119 terms appropriate for this section of this document?
> 

I think it is clear I am not a paragon of virtue when it comes to 
the correct use of RFC 2119 terms.


> 
> == Abstract ==
> 
> "as a whole and those those specifically directed"
> 
> Consider:
> 
> "as a whole and those specifically directed”

Ugh, thanks

> 
> 
> == 1. Introduction ==
> 
> "As a consequence, new internet drafts (see [FlexFiles] and [Lustre])
> may struggle to meet the requirements to be a pNFS layout type."
> 
> The Lustre document referenced here does not appear to be an RFC or an
> active I-D:
> 
> https://datatracker.ietf.org/doc/draft-faibish-nfsv4-pnfs-lustre-layout/
> 
> Perhaps the reference should be removed or replaced.


Which is why it is not normative. Note that the URL is stable.

This may come out during the last editorial phase. It lays the groundwork
for why we want the document.


> 
> "This document specifies the layout type independent requirements placed
> on all layout types, whether one of the original three or any new variant."
> 
> Consider this simplification:
> 
> "This document specifies layout type-independent requirements placed on
> all layout types.”

We have after Dave’s review:

   This document specifies the requirements placed on all layout types
   independent of the particular type chosen, whether one of the
   original three or any new additional one.

and I think you would argue for:

   This document specifies the requirements placed on all layout types
   independent of the particular type chosen.

And taking in mind your last point about the Intro, how about this as a
last paragraph:

   As a consequence, new internet drafts (see [FlexFiles] and [Lustre])
   may struggle to meet the requirements to be a pNFS layout type.  This
   document gathers the requirements from all of the original layout
   type standard documents and then specifies the requirements placed on
   all layout types independent of the particular type chosen.



> 
> The introduction should also announce what is happening in section 4,
> which provides updates to existing layout types.
> 
> 
> == 2. Definitions ==
> 
> In general the syntax of each definition should be consistent. You
> have "defines yada", "is yada", and sometimes complete sentences.
> Some examples follow:
> 
> "control communication requirements:  define for a layout type the
>    details regarding information on layouts, stateids, file metadata,
>    and file data which must be communicated between the metadata
>    server and the storage devices."
> 
> Consider:
> 
> "control communication requirements:  layout type requirements
>    regarding information on layouts, stateids, file metadata,
>    and file data which must be communicated between the metadata
>    server and storage devices."
> 
> 
> "(file) data:  is that part of the file system object which contains
>    the data to read or writen.  It is the contents of the object and
>    not the attributes of the object."
> 
> Consider:
> 
> "(file) data:  data content that is opaque to storage services.
>    File data that is written is expected to be unaltered when it is
>    subsequently read back. In particular, data content is not the
>    file's attributes."
> 
> "fencing:  is the process by which the metadata server prevents the
>    storage devices from processing I/O from a specific client to a
>    specific file.
> 
> Consider:
> 
> "fencing:  the mechanism by which a metadata server prevents
>    storage devices from processing I/O from a specific client to a
>    specific file."
> 
> And so on.

Went with <def>: [is | are] and complete sentences.


> 
> "layout iomode:  see Section 1."
> 
> Strikes me as needing expansion. If a single paragraph is not
> adequate, perhaps another subsection of 2 can be added.

This had already been fixed.


> 
> "loose coupling:  describes when the control protocol, between a
>    metadata server and storage device, is a storage protocol."
> 
> Loose and tight coupling might be better served in a separate
> subsection. This definition strikes me as vague for a first time
> reader. At least refer to section 3 here.
> 
> The terms "layout" and "layout type" are key to the entire
> discussion. They might benefit greatly from an independent
> subsection in section 2, or even some discussion in section 1.
> 


I think there is a difference of opinion on what the definitions
section means. To me, the section provides the smallest
possible definition that the reader can look at to
get the gist of the term. It is not a comprehensive
definition of the term.

When the reader comes across “layout type” later in
the document, they can flip back to the definitions
section to see the broad intent of the term.



> You might find it handy to have distinct definitions of "file
> access protocols" and "block access protocols" as separate classes
> of storage protocols.
> 
> Throughout the document, you might consider using a more abstract
> term than "I/O operation". "File data access" for example might
> be more independent of which storage protocol is in use, or
> whether the access is direct to a storage device or through the
> MDS.
> 

I’m fine with I/O.


> 
> == 2.1. Use of the terms ... ==
> 
> "Note that every data server is a storage device but
>   that storage devices which use protocols which are not file access
>   protocol are not data servers.”
> 

Ouch, in isolation this reads very badly.


> Consider:
> 
> ... "is a storage device, but storage devices which use protocols
> which are not file access protocols (such as NFS) are not data
> servers.”
> 


Taken.


> "while simultaneously acting as a non-data-server storage device for
> others."
> 
> I'm not quite sure what you are saying here. I don't see a definition
> for "non-data-server storage device". In context, perhaps you are
> referring to a storage device that does not simultaneously provide
> file access?


A storage device can be a data server for one storage protocol
and at the same time, not a data server for another storage
protocol.

Changed it to:

      Since a given storage device may support multiple layout types, a
      given device can potentially act as a data server for some set of
      storage protocols while simultaneously acting as a storage device
      for others. 


> 
> This discussion could benefit from adding definitions of "file
> data access" and "block data access", for instance.
> 
> 
> == 3. The Control Protocol ==
> 
> "In Section 12.2.6 of [RFC5661], the control protocol was introduced."
> 
> Consider:
> 
> "In Section 12.2.6 of [RFC5661], the concept of a layout type control
> protocol was introduced.”

I’m struggling with adding this differentiation and I think I have a fix:

   One of the key requirements of a layout type is the need for a
   mechanism to be used to meet the requirements that apply to the
   interaction between the metadata server and the storage device such
   that they present a consistent interface to the client
   (Section 12.2.6 of [RFC5661]).  Particular implementations may
   satisfy this requirement in any manner they choose and the mechanism
   chosen may not be described as a protocol.  Specifications defining
   layout types need to clearly show how implementations can meet the
   requirements discussed below, especially with respect to those that
   have security implications.  In addition, such specifications may
   find it necessary to impose requirements on implementations of the
   layout type to ensure appropriate interoperability.

Notice I also deleted a some sentences.


> 
> This text is repeated in front of the third and fourth paragraphs:
> 
> "In some cases, there may be no control protocol other than the
> storage”

Fixed elsewhere



> 
> "for metadata serverss and storage devices"
> 
> Consider:
> 
> "for metadata servers and storage devices”

Fixed elsewhere

> 
> "(2)  The metadata server MUST be able to restrict access to a file on
>      the storage devices when it revokes a layout."
> 
> This section should make clear that after layout revocation, a client
> is still allowed to access file data via the MDS.

Stating that is redundant to point 1.


> 
> "(4)  Locking MUST be respected."
> 
> This section feels like it needs expansion. Even "NFSv4.1 file locking
> semantics MUST be respected." Otherwise, this is not an enforceable
> MUST.


Already reworked.

> 
> "(5)  The metadata server and the storage devices MUST agree on
>        attributes like modify time, the change attribute, and the end-
>        of-file (EOF) position."
> 
> It would be better to attempt to provide a list of the attributes where
> this requirement is necessary instead of providing examples.
> 
> I think this section would be more clear if the requirement is
> constructed from solely the client's view.

I’m fine with the current POV.

> 
> This item and its discussion don't make sense for storage protocols that
> are not file access protocols. It needs to be rephrased to apply strictly
> to storage devices that are accessed via a file access protocol, which
> would expose file attribute information to clients.
> 

Oh, good catch!


> 
> == 3.2. Undocumented ... ==
> 
> "(4)  Clients MUST NOT perform I/O operations which would be
>      inconsistent with the iomode specified in the layout segments it
>      holds."
> 
> It's not immediately obvious to me what you are getting at in this
> item.
> 

If a client has a READ iomode, then it cannot WRITE using it.


> "(5)  The metadata server MUST be able to do allocation and
>      deallocation of storage.  I.e., creating and deleting files."
> 
> Perhaps the language here could be refined. "storage" can mean anything
> and hasn't been previously defined.

I think the rewording from Dave’s review satisfies this?

   (3)  The metadata server MUST be able to do storage allocation,
        whether that is to create, delete, extend, or truncate files.


> 
> I think you are talking about managing layout extents on storage
> devices?
> 
> 
> == 4.1 File Layout Type ==
> 
> Throughout this section, the discussion is not clear when it switches
> between describing behavior specified by RFC 5661, current
> implementation behavior, and fresh advice to new implementers.


I’m torn here, despite this statement:

   This section is not normative with regards to each of the presented
   types.  This document does not update the specification of either the
   block layout type (see [RFC5663]) or the object layout type (see
   [RFC5664]).  Nor does it update Section 13 of [RFC5661], but rather
   Section 12 of that document.  In other words, it is the pNFS
   requirements being updated rather than the specification of the file
   layout type.

Section 4 is not updating the pNFS requirements. That occurs in
Section 3.

I can see a rewhack of the draft to have in order:

Section 3.0
Section 4.0
Section 4.1
Section 4.2
Section 4.3
A new Section 5 to introduce the following
Section 3.1 as Section 5.1
Section 3.2 as Section 5.2
Section 3.3 as Section 5.3
Section 5 as Section 6
…

So the new Section 3 declares the main issue.
The new Section 4 uses the existing layout types to illustrate the issues.
The new Section 5 presents the resulting requirements.

Hmm, the old Section 5 could be the new Section 5.0.

Thoughts?


> 
> There are no reasons given about why these changes are being made (was
> the spec language inadequate, or are you addressing bugs in the protocol,
> for example)?
> 
> Please make it clear what the original spec says, what it should now
> say, and what is new advice to implementers of the file layout type.
> And separately discuss why the update is necessary.
> 
> See Noveck's migration update doc series for examples of how to
> clearly describe an update to specification language.
> 
> Similar comments apply to sections 4.2 and 4.3: it's not clear what
> is being updated.
> 
> I'm not at all arguing about the technical content of this section.
> These are strictly observations about how this material is presented.
> 
> 
> == 5. Summary ==
> 
> I'm not sure what purpose this section would play once this document
> becomes an RFC.
> 
> 
> == 6. Security Considerations ==
> 
> "It may do this directly, by providing that appropriate checks be
> performed at the time the access is performed."
> 
> Consider:
> 
> ... "at the time each access is performed.”

Ack

> 
> "The client the has the opportunity to re-acquire the layout"
> 
> Consider:
> 
> "The client has a subsequent opportunity to re-acquire the layout”
> 

Ack and thanks


> There is a strong discussion of client-side access checking in
> section 4.2. Would that be more appropriate to move into this
> section in a more generic form?
> 


I’m fine with it as is.


> 
> --
> Chuck Lever
> 
> 
> 
> _______________________________________________
> nfsv4 mailing list
> nfsv4@ietf.org
> https://www.ietf.org/mailman/listinfo/nfsv4
>