From bhalevy@panasas.com Mon Dec 15 23:01:00 2003
Return-Path: <bhalevy@panasas.com>
X-Sender: bhalevy@panasas.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 65678 invoked from network); 16 Dec 2003 07:01:00 -0000
Received: from unknown (66.218.66.216)
by m7.grp.scd.yahoo.com with QMQP; 16 Dec 2003 07:01:00 -0000
Received: from unknown (HELO PIKES.panasas.com) (65.194.124.178)
by mta1.grp.scd.yahoo.com with SMTP; 16 Dec 2003 07:00:59 -0000
Received: from yang ([172.17.19.46]) by PIKES.panasas.com with SMTP (Microsoft Exchange Internet Mail Service Version 5.5.2653.13)
id SVSY1CV6; Tue, 16 Dec 2003 02:00:52 -0500
To: <pnfs-reqs@yahoogroups.com>
Date: Tue, 16 Dec 2003 02:01:20 -0500
Message-ID: <LCEAJMHHKPKEPAIDBBEKGENBCAAA.bhalevy@panasas.com>
MIME-Version: 1.0
Content-Type: multipart/mixed;
boundary="----=_NextPart_000_0002_01C3C378.7FB8B890"
X-Priority: 3 (Normal)
X-MSMail-Priority: Normal
X-Mailer: Microsoft Outlook IMO, Build 9.0.6604 (9.0.2911.0)
Importance: Normal
X-MimeOLE: Produced By Microsoft MimeOLE V6.00.2800.1165
X-eGroups-Remote-IP: 65.194.124.178
From: "Benny Halevy" <bhalevy@panasas.com>
Subject: FW: NEPS-REQS: getting started
X-Yahoo-Group-Post: member; u=169276676
X-Yahoo-Profile: benny_halevy

-----Original Message-----
From: Garth Gibson [mailto:garth@panasas.com]
Sent: Wednesday, December 10, 2003 22:27
To: Craig Everhart; John Muth; Brian Pawlowski; David Pease; Julian
Satran; Spencer Shepler; Gary Grider; Brent Welch; Benny Halevy; Jon
Haswell; Dean Hildebrand; Peter Honeyman; Jim Carlson; Garth Gibson;
Andy Adamson; Tyce McLarty; Peter Corbett; David Black
Cc: Garth Gibson
Subject: NEPS-REQS: getting started


So we are the requirements/problem statement subgroup of the NFS
extension for parallel storage effort.

Our job is to create the paper trail justification for adding something

to NFS and provide a conceptual framework by which to identify possible

solutions.

In the beginning this document is used to justify in the IETF process
that there are problems that people take seriously that cannot be
handled well in the scope of NFS today and that should be.

I asked around for examples to help us construct this document and I
was pointed at the problem statement used to start the RDMA over IP
effort (attached below). I was told that this was a particularly well
done problem statement, and that we should not necessarily work this
hard before giving the IETF something to look at.

ftp://ftp.rfc-editor.org/in-notes/internet-drafts/draft-ietf-rddp-
problem-statement-02.txt

RDDP Abstract: This draft addresses an IP-based solution to the problem

of high system costs due to network I/O copying in end-hosts at high
speeds. The problem is due to the high cost of memory bandwidth, and
it can be substantially improved using "copy avoidance." The high
overhead has limited the use of TCP/IP in interconnection networks
especially where high bandwidth, low latency and/or low overhead of
end-system data movement are required by the hosted application.

So I suppose we could start with

pNFS Abstract: This draft addresses an NFS-based solution to the
problem of high system costs due to store-and-forward copying of
storage data from storage devices through a file server mount point to
high-speed end-hosts that also have connectivity to source storage
devices. The problem is due to the high cost of funneling large
storage bandwidths through NFS on single IP addresses, and it can be
substantially improved using "out-of-band access." The high cost of
high-bandwidth NFS servers has limited the use of NFS in data centers
especially where high storage bandwidths are required and numerous
storage serving devices are already networked together.

A pNFS table of contents might be:

1. Introduction
2. The high cost of high bandwidth storage through NFS
2.1 Out-of-band access decreases bandwidth requirements in central file

servers
3. Application level routing of storage data packets is the root cause
of the problem
4. Storage bandwidth bottlenecks are problematic for many key file
system applications
5. Out-of-band access techniques
5.1 A conceptual framework: pNFS delegated maps for distributing files
over SBC, OSD and NFS storage subsystems
6. Security considerations
7. Acknowledgements
8. Informative references

Please have a look at the RDDP problem statement draft and comment on
my simplistic strategy of monkey-see-monkey-do :-)

garth



Begin forwarded message:

> From: Garth Gibson <garth@panasas.com>
> Date: Wed Dec 10, 2003 9:34:58 PM Canada/Eastern
> To: Andy Adamson <andros@citi.umich.edu>, David Black
> <Black_David@emc.com>, Don Cameron <don.cameron@intel.com>, Jim
> Carlson <jvc@us.ibm.com>, Peter Corbett <pcorbett@netapp.com>, Craig
> Everhart <craigev@us.ibm.com>, Steve Fridella
> <fridella_stephen@emc.com>, Garth Gibson <garth.gibson@panasas.com>,
> Gary Grider <ggrider@lanl.gov>, Benny Halevy <bhalevy@panasas.com>,
> Jon Haswell <haswell@us.ibm.com>, Dean Hildebrand
> <dhildebranz@eecs.umich.edu>, Peter Honeyman <honey@citi.umich.edu>,
> Xiaoye Jiang <xjiang@emc.com>, Mike Kazar <kazar@spinnakernet.com>,
> Tyce McLarty <mclarty3@llnl.gov>, John Muth <john.muth@veritas.com>,
> Dave Noveck <Dave.Noveck@netapp.com>, Brian Pawlowski
> <Brian.Pawlowski@netapp.com>, David Pease <pease@almaden.ibm.com>,
> Julian Satran <Julian_Satran@il.ibm.com>, Spencer Shepler
> <spencer.shepler@sun.com>, Brent Welch <bwelch@panasas.com>
> Subject: NFS Extensions for Parallel Storage, subgroup membership
>
> Folks,
>
> Thanks for a great workshop last Thursday!
>
> Materials presented that day are online:
> http://www.citi.umich.edu/NEPS/agenda.html
>
> Below are the workshop followup subgroup memberships as they are now.

> I think I heard Peter say that he would construct auto-managed email
> lists, which from the additions I've received this week, I have
> already decided would be great. Please Peter! Names like neps-all,
> neps-reqs, neps-ops, neps-sbc, neps-osd, neps-nfs would be great.
>
> Our goals, to reprise, are to sketch a set of requirements for NFS
> Extensions for Parallel Storage, or pNFS extensions, sketch a set of
> NFS operation extensions (possibly including alternatives), sketch a
> set of metadata definitions (possibly including alternatives) for
> out-of-band data access over fixed block (SBC) SCSI protocols, object

> (OSD) SCSI protocols and file (NFS) ONCRPC protocols.
>
> We want to do this quickly, over the next few months, and to take it
> into the IETF NFS process as a set of suggestions and strawman
> protocols. The current plan is that at that point those of us that
> follow through with this will to it in the IETF NFS working group. In

> order to convince the IETF and the NFS working group that we have
> important, useful and viable ideas, we are taking a little time to
> pull together starting material.
>
> The timelines discussed at the end of the workshop "heir of the dog"
> session were:
> - get workshop notes put together and out in December (Peter and
Garth)
> - 0th draft of a requirements/problem statement internet draft by mid

> January
> - IETF submission of an internet draft by first week of Feb, so it can

> be part of the March IETF meeting and used as evidence for inclusion
> of extensions for parallel storage into the NFS working group charter
> - one or more documents (not necessarily fully agreeing) from each
> subgroup into the IETF NFS email discussion for early to mid March
> - a face-to-face followup workshop, open to the IETF NFS group at the

> FAST 2004 conference, in San Francisco Mar 31 - Apr 2, at which all
> further plans are proposed, argued and ratified (e.g. shall we be
> absorbed into the IETF NFS group)
>
> To help move this along, we have asked one person in each subgroup to

> push, prod and pull ideas and words out of us. Please help these
> sacrificial volunteers with by contributing text, criticizing
> constructively with alternative text, and finding the time to read
> materials.
>
> These are volunteers in an unofficial process. We have no rules to be

> applied by arbitration, no membership to take votes from. If this
> consensus process, or these people, are not working out, then I
> suggest grass roots alternatives be suggested and explored as a group.

> Lets not get bogged down in process this early :-)
>
> But there are always going to be logistical and procedural issues that

> we need to deal with as a group. The suggestion at the workshop was
> that these multi-subgroup issues be taken into the requirements group.

> For example, I suggest that "scope" issues -- what we include and
> what we exclude from our agenda -- be dealt with in the requirements
> group, where we would need to add/delete requirements for each
> distinct aspect of our scope.
>
> I'm sure I'm way over the line giving this much direction :-) so I'll

> leave it to the subgroups to decide mechanisms for progress. For
> example, weekly conference calls, document exchange formats,
> editorship delegation and/or rotation, agreement achieving processes,

> ....
>
> And with that I'll go off and get to work on suggesting what our
> problem statement needs to say.
>
> garth
> 412-805-9878 (cell)
>
> -------------------------------------------------------
>
> pNFS requirements: Garth Gibson
> -----------------
> Andy Adamson <andros@citi.umich.edu>
> David Black <Black_David@emc.com>
> Jim Carlson <jvc@us.ibm.com>
> Peter Corbett <pcorbett@netapp.com>
> Craig Everhart <craigev@us.ibm.com>
> Garth Gibson <garth@panasas.com>
> Gary Grider <ggrider@lanl.gov>
> Benny Halevy <bhalevy@panasas.com>
> Jon Haswell <haswell@us.ibm.com>
> Dean Hildebrand <dhildebranz@eecs.umich.edu>
> Peter Honeyman <honey@citi.umich.edu>
> Tyce McLarty <mclarty3@llnl.gov>
> John Muth <john.muth@veritas.com>
> Brian Pawlowski <Brian.Pawlowski@netapp.com>
> David Pease <pease@almaden.ibm.com>
> Julian Satran <Julian_Satran@il.ibm.com>
> Spencer Shepler <spencer.shepler@sun.com>
> Brent Welch <bwelch@panasas.com>









Allyn Romanow (Cisco)
Internet-Draft Jeff Mogul (HP)
Expires: December 2003 Tom Talpey (NetApp)
Stephen Bailey (Sandburst)

RDMA over IP Problem Statement
draft-ietf-rddp-problem-statement-02


Status of this Memo

This document is an Internet-Draft and is in full conformance with
all provisions of Section 10 of RFC2026.

Internet-Drafts are working documents of the Internet Engineering
Task Force (IETF), its areas, and its working groups. Note that
other groups may also distribute working documents as Internet-
Drafts.

Internet-Drafts are draft documents valid for a maximum of six
months and may be updated, replaced, or obsoleted by other
documents at any time. It is inappropriate to use Internet-Drafts
as reference material or to cite them other than as "work in
progress."

The list of current Internet-Drafts can be accessed at
http://www.ietf.org/ietf/1id-abstracts.txt

The list of Internet-Draft Shadow Directories can be accessed at
http://www.ietf.org/shadow.html.

Copyright Notice

Copyright (C) The Internet Society (2003). All Rights Reserved.

Abstract

This draft addresses an IP-based solution to the problem of high
system costs due to network I/O copying in end-hosts at high
speeds. The problem is due to the high cost of memory bandwidth,
and it can be substantially improved using "copy avoidance." The
high overhead has limited the use of TCP/IP in interconnection
networks especially where high bandwidth, low latency and/or low
overhead of end-system data movement are required by the hosted
application.






Romanow, et al Expires December 2003 [Page 1]

Internet-Draft RDMA Over IP Problem Statement June 2003


Table Of Contents

1. Introduction . . . . . . . . . . . . . . . . . . . . . . . 2
2. The high cost of data movement operations in network I/O . 3
2.1. Copy avoidance improves processing overhead . . . . . . . 5
3. Memory bandwidth is the root cause of the problem . . . . 6
4. High copy overhead is problematic for many key Internet
applications . . . . . . . . . . . . . . . . . . . . . . . 7
5. Copy Avoidance Techniques . . . . . . . . . . . . . . . . 9
5.1. A Conceptual Framework: DDP and RDMA . . . . . . . . . . . 11
6. Security Considerations . . . . . . . . . . . . . . . . . 11
7. Acknowledgements . . . . . . . . . . . . . . . . . . . . . 12
Informative References . . . . . . . . . . . . . . . . . . 12
Authors' Addresses . . . . . . . . . . . . . . . . . . . . 17
Full Copyright Statement . . . . . . . . . . . . . . . . . 18


1. Introduction

This draft considers the problem of high host processing overhead
associated with network I/O that occurs under high speed
conditions. This problem is often referred to as the "I/O
bottleneck" [CT90]. More specifically, the source of high overhead
that is of interest here is data movement operations - copying.
This issue is not be confused with TCP offload, which is not
addressed here. High speed refers to conditions where the network
link speed is high relative to the bandwidths of the host CPU and
memory. With today's computer systems, one Gbits/s and over is
considered high speed.

High costs associated with copying are an issue primarily for large
scale systems. Although smaller systems such as rack-mounted PCs
and small workstations would benefit from a reduction in copying
overhead, the benefit to smaller machines will be primarily in the
next few years as they scale in the amount of bandwidth they
handle. Today it is large system machines with high bandwidth
feeds, usually multiprocessors and clusters, that are adversely
affected by copying overhead. Examples of such machines include
all varieties of servers: database servers, storage servers,
application servers for transaction processing, for e-commerce, and
web serving, content distribution, video distribution, backups,
data mining and decision support, and scientific computing.

Note that such servers almost exclusively service many concurrent
sessions (transport connections), which, in aggregate, are
responsible for > 1 Gbits/s of communication. Nonetheless, the
cost of copying overhead for a particular load is the same whether
from few or many sessions.



Romanow, et al Expires December 2003 [Page 2]

Internet-Draft RDMA Over IP Problem Statement June 2003


The I/O bottleneck, and the role of data movement operations, have
been widely studied in research and industry over the last
approximately 14 years, and we draw freely on these results.
Historically, the I/O bottleneck has received attention whenever
new networking technology has substantially increased line rates -
100 Mbits/s FDDI and Fast Ethernet, 155 Mbits/s ATM, 1 Gbits/s
Ethernet. In earlier speed transitions, the availability of memory
bandwidth allowed the I/O bottleneck issue to be deferred. Now
however, this is no longer the case. While the I/O problem is
significant at 1 Gbits/s, it is the introduction of 10 Gbits/s
Ethernet which is motivating an upsurge of activity in industry and
research [DAFS, IB, VI, CGZ01, Ma02, MAF+02].

Because of high overhead of end-host processing in current
implementations, the TCP/IP protocol stack is not used for high
speed transfer. Instead, special purpose network fabrics, using a
technology generally known as remote direct memory access (RDMA),
have been developed and are widely used. RDMA is a set of
mechanisms that allow the network adapter, under control of the
application, to steer data directly into and out of application
buffers. Examples of such interconnection fabrics include Fibre
Channel [FIBRE] for block storage transfer, Virtual Interface
Architecture [VI] for database clusters, Infiniband [IB], Compaq
Servernet [SRVNET], Quadrics [QUAD] for System Area Networks.
These link level technologies limit application scaling in both
distance and size, meaning that the number of nodes cannot be
arbitrarily large.

This problem statement substantiates the claim that in network I/O
processing, high overhead results from data movement operations,
specifically copying; and that copy avoidance significantly
decreases the processing overhead. It describes when and why the
high processing overheads occur, explains why the overhead is
problematic, and points out which applications are most affected.

In addition, this document introduces an architectural approach to
solving the problem, which is developed in detail in [BT02]. It
also discusses how the proposed technology may introduce security
concerns and how they should be addressed.

2. The high cost of data movement operations in network I/O

A wealth of data from research and industry shows that copying is
responsible for substantial amounts of processing overhead. It
further shows that even in carefully implemented systems,
eliminating copies significantly reduces the overhead, as
referenced below.




Romanow, et al Expires December 2003 [Page 3]

Internet-Draft RDMA Over IP Problem Statement June 2003


Clark et al. [CJRS89] in 1989 shows that TCP [Po81] overhead
processing is attributable to both operating system costs such as
interrupts, context switches, process management, buffer
management, timer management, and to the costs associated with
processing individual bytes, specifically computing the checksum
and moving data in memory. They found moving data in memory is the
more important of the costs, and their experiments show that memory
bandwidth is the greatest source of limitation. In the data
presented [CJRS89], 64% of the measured microsecond overhead was
attributable to data touching operations, and 48% was accounted for
by copying. The system measured Berkeley TCP on a Sun-3/60 using
1460 Byte Ethernet packets.

In a well-implemented system, copying can occur between the network
interface and the kernel, and between the kernel and application
buffers - two copies, each of which are two memory bus crossings -
for read and write. Although in certain circumstances it is
possible to do better, usually two copies are required on receive.

Subsequent work has consistently shown the same phenomenon as the
earlier Clark study. A number of studies report results that data-
touching operations, checksumming and data movement, dominate the
processing costs for messages longer than 128 Bytes [BS96, CGY01,
Ch96, CJRS89, DAPP93, KP96]. For smaller sized messages, per-
packet overheads dominate [KP96, CGY01].

The percentage of overhead due to data-touching operations
increases with packet size, since time spent on per-byte operations
scales linearly with message size [KP96]. For example, Chu [Ch96]
reported substantial per-byte latency costs as a percentage of
total networking software costs for an MTU size packet on
SPARCstation/20 running memory-to-memory TCP tests over networks
with 3 different MTU sizes. The percentage of total software costs
attributable to per-byte operations were:

1500 Byte Ethernet 18-25%
4352 Byte FDDI 35-50%
9180 Byte ATM 55-65%


Although many studies report results for data-touching operations
including checksumming and data movement together, much work has
focused just on copying [BS96, B99, Ch96, TK95]. For example,
[KP96] reports results that separate processing times for checksum
from data movement operations. For the 1500 Byte Ethernet size,
20% of total processing overhead time is attributable to copying.
The study used 2 DECstations 5000/200 connected by an FDDI network.
(In this study checksum accounts for 30% of the processing time.)



Romanow, et al Expires December 2003 [Page 4]

Internet-Draft RDMA Over IP Problem Statement June 2003


2.1. Copy avoidance improves processing overhead

A number of studies show that eliminating copies substantially
reduces overhead. For example, results from copy-avoidance in the
IO-Lite system [PDZ99], which aimed at improving web server
performance, show a throughput increase of 43% over an optimized
web server, and 137% improvement over an Apache server. The system
was implemented in a 4.4BSD derived UNIX kernel, and the
experiments used a server system based on a 333MHz Pentium II PC
connected to a switched 100 Mbits/s Fast Ethernet.

There are many other examples where elimination of copying using a
variety of different approaches showed significant improvement in
system performance [CFF+94, DP93, EBBV95, KSZ95, TK95, Wa97]. We
will discuss the results of one of these studies in detail in order
to clarify the significant degree of improvement produced by copy
avoidance [Ch02].

Recent work by Chase et al. [CGY01], measuring CPU utilization,
shows that avoiding copies reduces CPU time spent on data access
from 24% to 15% at 370 Mbits/s for a 32 KBytes MTU using an
AlphaStation XP1000 and a Myrinet adapter [BCF+95]. This is an
absolute improvement of 9% due to copy avoidance.

The total CPU utilization was 35%, with data access accounting for
24%. Thus the relative importance of reducing copies is 26%. At
370 Mbits/s, the system is not very heavily loaded. The relative
improvement in achievable bandwidth is 34%. This is the
improvement we would see if copy avoidance were added when the
machine was saturated by network I/O.

Note that improvement from the optimization becomes more important
if the overhead it targets is a larger share of the total cost.
This is what happens if other sources of overhead, such as
checksumming, are eliminated. In [CGY01], after removing checksum
overhead, copy avoidance reduces CPU utilization from 26% to 10%.
This is a 16% absolute reduction, a 61% relative reduction, and a
160% relative improvement in achievable bandwidth.

In fact, today's network interface hardware commonly offloads the
checksum, which removes the other source of per-byte overhead.
They also coalesce interrupts to reduce per-packet costs. Thus,
today copying costs account for a relatively larger part of CPU
utilization than previously, and therefore relatively more benefit
is to be gained in reducing them. (Of course this argument would
be specious if the amount of overhead were insignificant, but it
has been shown to be substantial.)




Romanow, et al Expires December 2003 [Page 5]

Internet-Draft RDMA Over IP Problem Statement June 2003


3. Memory bandwidth is the root cause of the problem

Data movement operations are expensive because memory bandwidth is
scarce relative to network bandwidth and CPU bandwidth [PAC+97].
This trend existed in the past and is expected to continue into the
future [HP97, STREAM], especially in large multiprocessor systems.

With copies crossing the bus twice per copy, network processing
overhead is high whenever network bandwidth is large in comparison
to CPU and memory bandwidths. Generally with today's end-systems,
the effects are observable at network speeds over 1 Gbits/s.

A common question is whether increase in CPU processing power
alleviates the problem of high processing costs of network I/O.
The answer is no, it is the memory bandwidth that is the issue.
Faster CPUs do not help if the CPU spends most of its time waiting
for memory [CGY01].

The widening gap between microprocessor performance and memory
performance has long been a widely recognized and well-understood
problem [PAC+97]. Hennessy [HP97] shows microprocessor performance
grew from 1980-1998 at 60% per year, while the access time to DRAM
improved at 10% per year, giving rise to an increasing "processor-
memory performance gap".

Another source of relevant data is the STREAM Benchmark Reference
Information website which provides information on the STREAM
benchmark [STREAM]. The benchmark is a simple synthetic benchmark
program that measures sustainable memory bandwidth (in MBytes/s)
and the corresponding computation rate for simple vector kernels
measured in MFLOPS. The website tracks information on sustainable
memory bandwidth for hundreds of machines and all major vendors.

Results show measured system performance statistics. Processing
performance from 1985-2001 increased at 50% per year on average,
and sustainable memory bandwidth from 1975 to 2001 increased at 35%
per year on average over all the systems measured. A similar 15%
per year lead of processing bandwidth over memory bandwidth shows
up in another statistic, machine balance [Mc95], a measure of the
relative rate of CPU to memory bandwidth (FLOPS/cycle) / (sustained
memory ops/cycle) [STREAM].

Network bandwidth has been increasing about 10-fold roughly every 8
years, which is a 40% per year growth rate.

A typical example illustrates that the memory bandwidth compares
unfavorably with link speed. The STREAM benchmark shows that a
modern uniprocessor PC, for example the 1.2 GHz Athlon in 2001,



Romanow, et al Expires December 2003 [Page 6]

Internet-Draft RDMA Over IP Problem Statement June 2003


will move the data 3 times in doing a receive operation - 1 for the
network interface to deposit the data in memory, and 2 for the CPU
to copy the data. With 1 GBytes/s of memory bandwidth, meaning one
read or one write, the machine could handle approximately 2.67
Gbits/s of network bandwidth, one third the copy bandwidth. But
this assumes 100% utilization, which is not possible, and more
importantly the machine would be totally consumed! (A rule of
thumb for databases is that 20% of the machine should be required
to service I/O, leaving 80% for the database application. And, the
less the better.)

In 2001, 1 Gbits/s links were common. An application server may
typically have two 1 Gbits/s connections - one connection backend
to a storage server and one front-end, say for serving HTTP
[FGM+99]. Thus the communications could use 2 Gbits/s. In our
typical example, the machine could handle 2.7 Gbits/s at its
theoretical maximum while doing nothing else. This means that the
machine basically could not keep up with the communication demands
in 2001, with the relative growth trends the situation only gets
worse.

4. High copy overhead is problematic for many key Internet applications

If a significant portion of resources on an application machine is
consumed in network I/O rather than in application processing, it
makes it difficult for the application to scale - to handle more
clients, to offer more services.

Several years ago the most affected applications were streaming
multimedia, parallel file systems and supercomputing on clusters
[BS96]. In addition, today the applications that suffer from
copying overhead are more central in Internet computing - they
store, manage, and distribute the information of the Internet and
the enterprise. They include database applications doing
transaction processing, e-commerce, web serving, decision support,
content distribution, video distribution, and backups. Clusters
are typically used for this category of application, since they
have advantages of availability and scalability.

Today these applications, which provide and manage Internet and
corporate information, are typically run in data centers that are
organized into three logical tiers. One tier is typically a set of
web servers connecting to the WAN. The second tier is a set of
application servers that run the specific applications usually on
more powerful machines, and the third tier is backend databases.
Physically, the first two tiers - web server and application server
- are usually combined [Pi01]. For example an e-commerce server
communicates with a database server and with a customer site, or a



Romanow, et al Expires December 2003 [Page 7]

Internet-Draft RDMA Over IP Problem Statement June 2003


content distribution server connects to a server farm, or an OLTP
server connects to a database and a customer site.

When network I/O uses too much memory bandwidth, performance on
network paths between tiers can suffer. (There might also be
performance issues on SAN paths used either by the database tier or
the application tier.) The high overhead from network-related
memory copies diverts system resources from other application
processing. It also can create bottlenecks that limit total system
performance.

There are a large and growing number of these application servers
distributed throughout the Internet. In 1999 approximately 3.4
million server units were shipped, in 2000, 3.9 million units, and
the estimated annual growth rate for 2000-2004 was 17 percent
[Ne00, Pa01].

There is high motivation to maximize the processing capacity of
each CPU, as scaling by adding CPUs one way or another has
drawbacks. For example, adding CPUs to a multiprocessor will not
necessarily help, as a multiprocessor improves performance only
when the memory bus has additional bandwidth to spare. Clustering
can add additional complexity to handling the applications.

In order to scale a cluster or multiprocessor system, one must
proportionately scale the interconnect bandwidth. Interconnect
bandwidth governs the performance of communication-intensive
parallel applications; if this (often expressed in terms of
"bisection bandwidth") is too low, adding additional processors
cannot improve system throughput. Interconnect latency can also
limit the performance of applications that frequently share data
between processors.

So, excessive overheads on network paths in a "scalable" system
both can require the use of more processors than optimal, and can
reduce the marginal utility of those additional processors.

Copy avoidance scales a machine upwards by removing at least two-
thirds the bus bandwidth load from the "very best" 1-copy (on
receive) implementations, and removes at least 80% of the bandwidth
overhead from the 2-copy implementations.

An example showing poor performance with copies and improved
scaling with copy avoidance is illustrative. The IO-Lite work
[PDZ99] shows higher server throughput servicing more clients using
a zero-copy system. In an experiment designed to mimic real world
web conditions by simulating the effect of TCP WAN connections on
the server, the performance of 3 servers was compared. One server



Romanow, et al Expires December 2003 [Page 8]

Internet-Draft RDMA Over IP Problem Statement June 2003


was Apache, another an optimized server called Flash, and the third
the Flash server running IO-Lite, called Flash-Lite with zero copy.
The measurement was of throughput in requests/second as a function
of the number of slow background clients that could be served. As
the table shows, Flash-Lite has better throughput, especially as
the number of clients increases.

Apache Flash Flash-Lite
------ ----- ----------
#Clients Thruput reqs/s Thruput Thruput

0 520 610 890
16 390 490 890
32 360 490 850
64 360 490 890
128 310 450 880
256 310 440 820


Traditional Web servers (which mostly send data and can keep most
of their content in the file cache) are not the worst case for copy
overhead. Web proxies (which often receive as much data as they
send) and complex Web servers based on SANs or multi-tier systems
will suffer more from copy overheads than in the example above.

5. Copy Avoidance Techniques

There have been extensive research investigation and industry
experience with two main alternative approaches to eliminating data
movement overhead, often along with improving other Operating
System processing costs. In one approach, hardware and/or software
changes within a single host reduce processing costs. In another
approach, memory-to-memory networking [MAF+02], hosts communicate
via information that allows them to reduce processing costs.

The single host approaches range from new hardware and software
architectures [KSZ95, Wa97, DWB+93] to new or modified software
systems [BP96, Ch96, TK95, DP93, PDZ99]. In the approach based on
using a networking protocol to exchange information, the network
adapter, under control of the application, places data directly
into and out of application buffers, reducing the need for data
movement. Commonly this approach is called RDMA, Remote Direct
Memory Access.

As discussed below, research and industry experience has shown that
copy avoidance techniques within the receiver processing path alone
have proven to be problematic. The research special purpose host
adapter systems had good performance and can be seen as precursors



Romanow, et al Expires December 2003 [Page 9]

Internet-Draft RDMA Over IP Problem Statement June 2003


for the commercial RDMA-based NICs [KSZ95, DWB+93]. In software,
many implementations have successfully achieved zero-copy transmit,
but few have accomplished zero-copy receive. And those that have
done so make strict alignment and no-touch requirements on the
application, greatly reducing the portability and usefulness of the
implementation.

In contrast, experience has proven satisfactory with memory-to-
memory systems that permit RDMA - performance has been good and
there have not been system or networking difficulties. RDMA is a
single solution. Once implemented, it can be used with any OS and
machine architecture, and it does not need to be revised when
either of these changes.

In early work, one goal of the software approaches was to show that
TCP could go faster with appropriate OS support [CJR89, CFF+94].
While this goal was achieved, further investigation and experience
showed that, though possible to craft software solutions, specific
system optimizations have been complex, fragile, extremely
interdependent with other system parameters in complex ways, and
often of only marginal improvement [CFF+94, CGY01, Ch96, DAPP93,
KSZ95, PDZ99]. The network I/O system interacts with other aspects
of the Operating System such as machine architecture and file I/O,
and disk I/O [Br99, Ch96, DP93].

For example, the Solaris Zero-Copy TCP work [Ch96], which relies on
page remapping, shows that the results are highly interdependent
with other systems, such as the file system, and that the
particular optimizations are specific for particular architectures,
meaning for each variation in architecture optimizations must be
re-crafted [Ch96].

A number of research projects and industry products have been based
on the memory-to-memory approach to copy avoidance. These include
U-Net [EBBV95], SHRIMP [BLA+94], Hamlyn [BJM+96], Infiniband [IB],
Winsock Direct [Pi01]. Several memory-to-memory systems have been
widely used and have generally been found to be robust, to have
good performance, and to be relatively simple to implement. These
include VI [VI], Myrinet [BCF+95], Quadrics [QUAD], Compaq/Tandem
Servernet [SRVNET]. Networks based on these memory-to-memory
architectures have been used widely in scientific applications and
in data centers for block storage, file system access, and
transaction processing.

By exporting direct memory access "across the wire", applications
may direct the network stack to manage all data directly from
application buffers. A large and growing class of applications has
already emerged which takes advantage of such capabilities,



Romanow, et al Expires December 2003 [Page 10]

Internet-Draft RDMA Over IP Problem Statement June 2003


including all the major databases, as well as file systems such as
DAFS [DAFS] and network protocols such as Sockets Direct [SDP].

5.1. A Conceptual Framework: DDP and RDMA

An RDMA solution can be usefully viewed as being comprised of two
distinct components: "direct data placement (DDP)" and "remote
direct memory access (RDMA) semantics". They are distinct in
purpose and also in practice - they may be implemented as separate
protocols.

The more fundamental of the two is the direct data placement
facility. This is the means by which memory is exposed to the
remote peer in an appropriate fashion, and the means by which the
peer may access it, for instance reading and writing.

The RDMA control functions are semantically layered atop direct
data placement. Included are operations that provide "control"
features, such as connection and termination, and the ordering of
operations and signaling their completions. A "send" facility is
provided.

While the functions (and potentially protocols) are distinct,
historically both aspects taken together have been referred as
"RDMA". The facilities of direct data placement are useful in and
of themselves, and may be employed by other upper layer protocols
to facilitate data transfer. Therefore, it is often useful to
refer to DDP as the data placement functionality and RDMA as the
control aspect.

[BT02] develops an architecture for DDP and RDMA, and is a
companion draft to this problem statement.

6. Security Considerations

Solutions to the problem of reducing copying overhead in high
bandwidth transfers via one or more protocols may introduce new
security concerns. Any proposed solution must be analyzed for
security threats and any such threats addressed. Potential
security weaknesses due to resource issues that might lead to
denial-of-service attacks, overwrites and other concurrent
operations, the ordering of completions as required by the RDMA
protocol, the granularity of transfer, and any other identified
threats; need to be examined, described and an adequate solution to
them found.

Layered atop Internet transport protocols, the RDMA protocols will
gain leverage from and must permit integration with Internet



Romanow, et al Expires December 2003 [Page 11]

Internet-Draft RDMA Over IP Problem Statement June 2003


security standards, such as IPSec and TLS [IPSEC, TLS]. A thorough
analysis of the degree to which these protocols address potential
threats is required.

Security for an RDMA design requires more than just securing the
communication channel. While it is necessary to be able to
guarantee channel properties such as privacy, integrity, and
authentication, these properties cannot defend against all attacks
from properly authenticated peers, which might be malicious,
compromised, or buggy. For example, an RDMA peer should not be
able to read or write memory regions without prior consent.

Further, it must not be possible to evade consistency checks at the
recipient. The RDMA design must allow the recipient to rely on its
consistent memory contents by controlling peer access to memory
regions explicitly, and must disallow peer access to regions when
not authorized.

The RDMA protocols must ensure that regions addressable by RDMA
peers be under strict application control. Remote access to local
memory by a network peer introduces a number of potential security
concerns. This becomes particularly important in the Internet
context, where such access can be exported globally.

The RDMA protocols carry in part what is essentially user
information, explicitly including addressing information and
operation type (read or write), and implicitly including protection
and attributes. As such, the protocol requires checking of these
higher level aspects in addition to the basic formation of
messages. The semantics associated with each class of error must
be clearly defined, and the expected action to be taken on mismatch
be specified. In some cases, this will result in a catastrophic
error on the RDMA association, however in others a local or remote
error may be signalled. Certain of these errors may require
consideration of abstract local semantics, which must be carefully
specified so as to provide useful behavior while not constraining
the implementation.

7. Acknowledgements

Jeff Chase generously provided many useful insights and
information. Thanks to Jim Pinkerton for many helpful discussions.

8. Informative References

[BCF+95]
N. J. Boden, D. Cohen, R. E. Felderman, A. E. Kulawik, C. L.
Seitz, J. N. Seizovic, and W. Su. "Myrinet - A gigabit-per-



Romanow, et al Expires December 2003 [Page 12]

Internet-Draft RDMA Over IP Problem Statement June 2003


second local-area network", IEEE Micro, February 1995

[BJM+96]
G. Buzzard, D. Jacobson, M. Mackey, S. Marovich, J. Wilkes,
"An implementation of the Hamlyn send-managed interface
architecture", in Proceedings of the Second Symposium on
Operating Systems Design and Implementation, USENIX Assoc.,
October 1996

[BLA+94]
M. A. Blumrich, K. Li, R. Alpert, C. Dubnicki, E. W. Felten,
"A virtual memory mapped network interface for the SHRIMP
multicomputer", in Proceedings of the 21st Annual Symposium on
Computer Architecture, April 1994, pp. 142-153

[Br99]
J. C. Brustoloni, "Interoperation of copy avoidance in network
and file I/O", Proceedings of IEEE Infocom, 1999, pp. 534-542

[BS96]
J. C. Brustoloni, P. Steenkiste, "Effects of buffering
semantics on I/O performance", Proceedings OSDI'96, USENIX,
Seattle, WA October 1996, pp. 277-291

RFC Editor note:
Replace following architecture draft-ietf- name, status and date
with appropriate reference when assigned.

[BT02]
S. Bailey, T. Talpey, "The Architecture of Direct Data
Placement (DDP) And Remote Direct Memory Access (RDMA) On
Internet Protocols", Internet Draft Work in Progress, draft-
ietf-rddp-arch-02, June 2003

[CFF+94]
C-H Chang, D. Flower, J. Forecast, H. Gray, B. Hawe, A.
Nadkarni, K. K. Ramakrishnan, U. Shikarpur, K. Wilde, "High-
performance TCP/IP and UDP/IP networking in DEC OSF/1 for
Alpha AXP", Proceedings of the 3rd IEEE Symposium on High
Performance Distributed Computing, August 1994, pp. 36-42

[CGY01]
J. S. Chase, A. J. Gallatin, and K. G. Yocum, "End system
optimizations for high-speed TCP", IEEE Communications
Magazine, Volume: 39, Issue: 4 , April 2001, pp 68-74.
http://www.cs.duke.edu/ari/publications/end-system.{ps,pdf}





Romanow, et al Expires December 2003 [Page 13]

Internet-Draft RDMA Over IP Problem Statement June 2003


[Ch96]
H.K. Chu, "Zero-copy TCP in Solaris", Proc. of the USENIX 1996
Annual Technical Conference, San Diego, CA, January 1996

[Ch02]
Jeffrey Chase, Personal communication

[CJRS89]
D. D. Clark, V. Jacobson, J. Romkey, H. Salwen, "An analysis
of TCP processing overhead", IEEE Communications Magazine,
volume: 27, Issue: 6, June 1989, pp 23-29

[CT90]
D. D. Clark, D. Tennenhouse, "Architectural considerations for
a new generation of protocols", Proceedings of the ACM SIGCOMM
Conference, 1990

[DAFS]
DAFS Collaborative, "Direct Access File System Specification
v1.0", September 2001, available from
http://www.dafscollaborative.org

[DAPP93]
P. Druschel, M. B. Abbott, M. A. Pagels, L. L. Peterson,
"Network subsystem design", IEEE Network, July 1993, pp. 8-17

[DP93]
P. Druschel, L. L. Peterson, "Fbufs: a high-bandwidth cross-
domain transfer facility", Proceedings of the 14th ACM
Symposium of Operating Systems Principles, December 1993

[DWB+93]
C. Dalton, G. Watson, D. Banks, C. Calamvokis, A. Edwards, J.
Lumley, "Afterburner: architectural support for high-
performance protocols", Technical Report, HP Laboratories
Bristol, HPL-93-46, July 1993

[EBBV95]
T. von Eicken, A. Basu, V. Buch, and W. Vogels, "U-Net: A
user-level network interface for parallel and distributed
computing", Proc. of the 15th ACM Symposium on Operating
Systems Principles, Copper Mountain, Colorado, December 3-6,
1995

[FGM+99]
R. Fielding, J. Gettys, J. Mogul, F. Frystyk, L. Masinter, P.
Leach, T. Berners-Lee, "Hypertext Transfer Protocol -
HTTP/1.1", RFC 2616, June 1999



Romanow, et al Expires December 2003 [Page 14]

Internet-Draft RDMA Over IP Problem Statement June 2003


[FIBRE]
ANSI Technical Committee T10, "Fibre Channel Protocol (FCP)"
(and as revised and updated), ANSI X3.269:1996 [R2001],
committee draft available from
http://www.t10.org/drafts.htm#FibreChannel

[HP97]
J. L. Hennessy, D. A. Patterson, Computer Organization and
Design, 2nd Edition, San Francisco: Morgan Kaufmann
Publishers, 1997

[IB] InfiniBand Trade Association, "InfiniBand Architecture
Specification, Volumes 1 and 2", Release 1.1, November 2002,
available from http://www.infinibandta.org/specs

[KP96]
J. Kay, J. Pasquale, "Profiling and reducing processing
overheads in TCP/IP", IEEE/ACM Transactions on Networking, Vol
4, No. 6, pp.817-828, December 1996

[KSZ95]
K. Kleinpaste, P. Steenkiste, B. Zill, "Software support for
outboard buffering and checksumming", SIGCOMM'95

[Ma02]
K. Magoutis, "Design and Implementation of a Direct Access
File System (DAFS) Kernel Server for FreeBSD", in Proceedings
of USENIX BSDCon 2002 Conference, San Francisco, CA, February
11-14, 2002.

[MAF+02]
K. Magoutis, S. Addetia, A. Fedorova, M. I. Seltzer, J. S.
Chase, D. Gallatin, R. Kisley, R. Wickremesinghe, E. Gabber,
"Structure and Performance of the Direct Access File System
(DAFS)", accepted for publication at the 2002 USENIX Annual
Technical Conference, Monterey, CA, June 9-14, 2002.

[Mc95]
J. D. McCalpin, "A Survey of memory bandwidth and machine
balance in current high performance computers", IEEE TCCA
Newsletter, December 1995

[Ne00]
A. Newman, "IDC report paints conflicted picture of server
market circa 2004", ServerWatch, July 24, 2000
http://serverwatch.internet.com/news/2000_07_24_a.html





Romanow, et al Expires December 2003 [Page 15]

Internet-Draft RDMA Over IP Problem Statement June 2003


[Pa01]
M. Pastore, "Server shipments for 2000 surpass those in 1999",
ServerWatch, February 7, 2001
http://serverwatch.internet.com/news/2001_02_07_a.html

[PAC+97]
D. Patterson, T. Anderson, N. Cardwell, R. Fromm, K. Keeton,
C. Kozyrakis, R. Thomas, K. Yelick , "A case for intelligient
RAM: IRAM", IEEE Micro, April 1997

[PDZ99]
V. S. Pai, P. Druschel, W. Zwaenepoel, "IO-Lite: a unified I/O
buffering and caching system", Proc. of the 3rd Symposium on
Operating Systems Design and Implementation, New Orleans, LA,
February 1999

[Pi01]
J. Pinkerton, "Winsock Direct: The Value of System Area
Networks", May 2001, available from
http://www.microsoft.com/windows2000/techinfo/
howitworks/communications/winsock.asp

[Po81]
J. Postel, "Transmission Control Protocol - DARPA Internet
Program Protocol Specification", RFC 793, September 1981

[QUAD]
Quadrics Ltd., Quadrics QSNet product information, available
from http://www.quadrics.com/website/pages/02qsn.html

[SDP]
InfiniBand Trade Association, "Sockets Direct Protocol v1.0",
Annex A of InfiniBand Architecture Specification Volume 1,
Release 1.1, November 2002, available from
http://www.infinibandta.org/specs

[SRVNET]
R. Horst, "TNet: A reliable system area network", IEEE Micro,
pp. 37-45, February 1995

[STREAM]
J. D. McAlpin, The STREAM Benchmark Reference Information,
http://www.cs.virginia.edu/stream/

[TK95]
M. N. Thadani, Y. A. Khalidi, "An efficient zero-copy I/O
framework for UNIX", Technical Report, SMLI TR-95-39, May 1995




Romanow, et al Expires December 2003 [Page 16]

Internet-Draft RDMA Over IP Problem Statement June 2003


[VI] Compaq Computer Corp., Intel Corporation and Microsoft
Corporation, "Virtual Interface Architecture Specification
Version 1.0", December 1997, available from
http://www.vidf.org/info/04standards.html

[Wa97]
J. R. Walsh, "DART: Fast application-level networking via
data-copy avoidance", IEEE Network, July/August 1997, pp.
28-38

Authors' Addresses


Stephen Bailey
Sandburst Corporation
600 Federal Street
Andover, MA 01810 USA

Phone: +1 978 689 1614
Email: steph@sandburst.com


Jeffrey C. Mogul
Western Research Laboratory
Hewlett-Packard Company
1501 Page Mill Road, MS 1251
Palo Alto, CA 94304 USA

Phone: +1 650 857 2206 (email preferred)
Email: JeffMogul@acm.org


Allyn Romanow
Cisco Systems, Inc.
170 W. Tasman Drive
San Jose, CA 95134 USA

Phone: +1 408 525 8836
Email: allyn@cisco.com












Romanow, et al Expires December 2003 [Page 17]

Internet-Draft RDMA Over IP Problem Statement June 2003


Tom Talpey
Network Appliance
375 Totten Pond Road
Waltham, MA 02451 USA

Phone: +1 781 768 5329
Email: thomas.talpey@netapp.com


Full Copyright Statement

Copyright (C) The Internet Society (2003). All Rights Reserved.

This document and translations of it may be copied and furnished to
others, and derivative works that comment on or otherwise explain
it or assist in its implementation may be prepared, copied,
published and distributed, in whole or in part, without restriction
of any kind, provided that the above copyright notice and this
paragraph are included on all such copies and derivative works.
However, this document itself may not be modified in any way, such
as by removing the copyright notice or references to the Internet
Society or other Internet organizations, except as needed for the
purpose of developing Internet standards in which case the
procedures for copyrights defined in the Internet Standards process
must be followed, or as required to translate it into languages
other than English.

The limited permissions granted above are perpetual and will not be
revoked by the Internet Society or its successors or assigns.

This document and the information contained herein is provided on
an "AS IS" basis and THE INTERNET SOCIETY AND THE INTERNET
ENGINEERING TASK FORCE DISCLAIMS ALL WARRANTIES, EXPRESS OR
IMPLIED, INCLUDING BUT NOT LIMITED TO ANY WARRANTY THAT THE USE OF
THE INFORMATION HEREIN WILL NOT INFRINGE ANY RIGHTS OR ANY IMPLIED
WARRANTIES OF MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE.















Romanow, et al Expires December 2003 [Page 18]

From bhalevy@panasas.com Mon Dec 15 23:02:52 2003
Return-Path: <bhalevy@panasas.com>
X-Sender: bhalevy@panasas.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 63766 invoked from network); 16 Dec 2003 07:02:51 -0000
Received: from unknown (66.218.66.166)
by m18.grp.scd.yahoo.com with QMQP; 16 Dec 2003 07:02:51 -0000
Received: from unknown (HELO PIKES.panasas.com) (65.194.124.178)
by mta5.grp.scd.yahoo.com with SMTP; 16 Dec 2003 07:02:49 -0000
Received: from yang ([172.17.19.46]) by PIKES.panasas.com with SMTP (Microsoft Exchange Internet Mail Service Version 5.5.2653.13)
id SVSY1CWB; Tue, 16 Dec 2003 02:02:32 -0500
To: <pnfs-reqs@yahoogroups.com>
Date: Tue, 16 Dec 2003 02:03:00 -0500
Message-ID: <LCEAJMHHKPKEPAIDBBEKKENBCAAA.bhalevy@panasas.com>
MIME-Version: 1.0
Content-Type: multipart/mixed;
boundary="----=_NextPart_000_0007_01C3C378.BB3217E0"
X-Priority: 3 (Normal)
X-MSMail-Priority: Normal
X-Mailer: Microsoft Outlook IMO, Build 9.0.6604 (9.0.2911.0)
Importance: Normal
X-MimeOLE: Produced By Microsoft MimeOLE V6.00.2800.1165
X-eGroups-Remote-IP: 65.194.124.178
From: "Benny Halevy" <bhalevy@panasas.com>
Subject: FW: Re: NEPS-REQS: getting started
X-Yahoo-Group-Post: member; u=169276676
X-Yahoo-Profile: benny_halevy

-----Original Message-----
From: Gary Grider [mailto:ggrider@lanl.gov]
Sent: Saturday, December 13, 2003 00:02
To: Garth Gibson; Craig Everhart; John Muth; Brian Pawlowski; David
Pease; Julian Satran; Spencer Shepler; Brent Welch; Benny Halevy; Jon
Haswell; Dean Hildebrand; Peter Honeyman; Jim Carlson; Garth Gibson;
Andy Adamson; Tyce McLarty; Peter Corbett; David Black
Cc: Garth Gibson
Subject: Re: NEPS-REQS: getting started



I decided to toss out a very quick and dirty draft with a lot of parts
missing.
Nothing sacred, just thoughts as they occurred to me partially organized.

I put it in Word so I could get formatting, TOC, etc.

I am attaching a Word and PDF.

I would be happy to put this on a web site for us if you want. I also
would be happy to
centralize the edits and re-post it on the web etc.

Thanks
Gary

At 10:26 PM 12/10/2003 -0500, Garth Gibson wrote:
>So we are the requirements/problem statement subgroup of the NFS
>extension for parallel storage effort.
>
>Our job is to create the paper trail justification for adding something
>to NFS and provide a conceptual framework by which to identify possible
>solutions.
>
>In the beginning this document is used to justify in the IETF process
>that there are problems that people take seriously that cannot be
>handled well in the scope of NFS today and that should be.
>
>I asked around for examples to help us construct this document and I
>was pointed at the problem statement used to start the RDMA over IP
>effort (attached below). I was told that this was a particularly well
>done problem statement, and that we should not necessarily work this
>hard before giving the IETF something to look at.
>
>ftp://ftp.rfc-editor.org/in-notes/internet-drafts/draft-ietf-rddp-
>problem-statement-02.txt
>
>RDDP Abstract: This draft addresses an IP-based solution to the problem
>of high system costs due to network I/O copying in end-hosts at high
>speeds. The problem is due to the high cost of memory bandwidth, and
>it can be substantially improved using "copy avoidance." The high
>overhead has limited the use of TCP/IP in interconnection networks
>especially where high bandwidth, low latency and/or low overhead of
>end-system data movement are required by the hosted application.
>
>So I suppose we could start with
>
>pNFS Abstract: This draft addresses an NFS-based solution to the
>problem of high system costs due to store-and-forward copying of
>storage data from storage devices through a file server mount point to
>high-speed end-hosts that also have connectivity to source storage
>devices. The problem is due to the high cost of funneling large
>storage bandwidths through NFS on single IP addresses, and it can be
>substantially improved using "out-of-band access." The high cost of
>high-bandwidth NFS servers has limited the use of NFS in data centers
>especially where high storage bandwidths are required and numerous
>storage serving devices are already networked together.
>
>A pNFS table of contents might be:
>
>1. Introduction
>2. The high cost of high bandwidth storage through NFS
>2.1 Out-of-band access decreases bandwidth requirements in central file
>servers
>3. Application level routing of storage data packets is the root cause
>of the problem
>4. Storage bandwidth bottlenecks are problematic for many key file
>system applications
>5. Out-of-band access techniques
>5.1 A conceptual framework: pNFS delegated maps for distributing files
>over SBC, OSD and NFS storage subsystems
>6. Security considerations
>7. Acknowledgements
>8. Informative references
>
>Please have a look at the RDDP problem statement draft and comment on
>my simplistic strategy of monkey-see-monkey-do :-)
>
>garth
>
>
>
>Begin forwarded message:
>
>>From: Garth Gibson <garth@panasas.com>
>>Date: Wed Dec 10, 2003 9:34:58 PM Canada/Eastern
>>To: Andy Adamson <andros@citi.umich.edu>, David Black
>><Black_David@emc.com>, Don Cameron <don.cameron@intel.com>, Jim
>>Carlson <jvc@us.ibm.com>, Peter Corbett <pcorbett@netapp.com>, Craig
>>Everhart <craigev@us.ibm.com>, Steve Fridella
>><fridella_stephen@emc.com>, Garth Gibson <garth.gibson@panasas.com>,
>>Gary Grider <ggrider@lanl.gov>, Benny Halevy <bhalevy@panasas.com>,
>>Jon Haswell <haswell@us.ibm.com>, Dean Hildebrand
>><dhildebranz@eecs.umich.edu>, Peter Honeyman <honey@citi.umich.edu>,
>>Xiaoye Jiang <xjiang@emc.com>, Mike Kazar <kazar@spinnakernet.com>,
>>Tyce McLarty <mclarty3@llnl.gov>, John Muth <john.muth@veritas.com>,
>>Dave Noveck <Dave.Noveck@netapp.com>, Brian Pawlowski
>><Brian.Pawlowski@netapp.com>, David Pease <pease@almaden.ibm.com>,
>>Julian Satran <Julian_Satran@il.ibm.com>, Spencer Shepler
>><spencer.shepler@sun.com>, Brent Welch <bwelch@panasas.com>
>>Subject: NFS Extensions for Parallel Storage, subgroup membership
>>
>>Folks,
>>
>>Thanks for a great workshop last Thursday!
>>
>>Materials presented that day are online:
>>http://www.citi.umich.edu/NEPS/agenda.html
>>
>>Below are the workshop followup subgroup memberships as they are now.
>>I think I heard Peter say that he would construct auto-managed email
>>lists, which from the additions I've received this week, I have
>>already decided would be great. Please Peter! Names like neps-all,
>>neps-reqs, neps-ops, neps-sbc, neps-osd, neps-nfs would be great.
>>
>>Our goals, to reprise, are to sketch a set of requirements for NFS
>>Extensions for Parallel Storage, or pNFS extensions, sketch a set of
>>NFS operation extensions (possibly including alternatives), sketch a
>>set of metadata definitions (possibly including alternatives) for
>>out-of-band data access over fixed block (SBC) SCSI protocols, object
>>(OSD) SCSI protocols and file (NFS) ONCRPC protocols.
>>
>>We want to do this quickly, over the next few months, and to take it
>>into the IETF NFS process as a set of suggestions and strawman
>>protocols. The current plan is that at that point those of us that
>>follow through with this will to it in the IETF NFS working group. In
>>order to convince the IETF and the NFS working group that we have
>>important, useful and viable ideas, we are taking a little time to
>>pull together starting material.
>>
>>The timelines discussed at the end of the workshop "heir of the dog"
>>session were:
>>- get workshop notes put together and out in December (Peter and Garth)
>>- 0th draft of a requirements/problem statement internet draft by mid
>>January
>>- IETF submission of an internet draft by first week of Feb, so it can
>>be part of the March IETF meeting and used as evidence for inclusion
>>of extensions for parallel storage into the NFS working group charter
>>- one or more documents (not necessarily fully agreeing) from each
>>subgroup into the IETF NFS email discussion for early to mid March
>>- a face-to-face followup workshop, open to the IETF NFS group at the
>>FAST 2004 conference, in San Francisco Mar 31 - Apr 2, at which all
>>further plans are proposed, argued and ratified (e.g. shall we be
>>absorbed into the IETF NFS group)
>>
>>To help move this along, we have asked one person in each subgroup to
>>push, prod and pull ideas and words out of us. Please help these
>>sacrificial volunteers with by contributing text, criticizing
>>constructively with alternative text, and finding the time to read
>>materials.
>>
>>These are volunteers in an unofficial process. We have no rules to be
>>applied by arbitration, no membership to take votes from. If this
>>consensus process, or these people, are not working out, then I
>>suggest grass roots alternatives be suggested and explored as a group.
>> Lets not get bogged down in process this early :-)
>>
>>But there are always going to be logistical and procedural issues that
>>we need to deal with as a group. The suggestion at the workshop was
>>that these multi-subgroup issues be taken into the requirements group.
>> For example, I suggest that "scope" issues -- what we include and
>>what we exclude from our agenda -- be dealt with in the requirements
>>group, where we would need to add/delete requirements for each
>>distinct aspect of our scope.
>>
>>I'm sure I'm way over the line giving this much direction :-) so I'll
>>leave it to the subgroups to decide mechanisms for progress. For
>>example, weekly conference calls, document exchange formats,
>>editorship delegation and/or rotation, agreement achieving processes,
>>....
>>
>>And with that I'll go off and get to work on suggesting what our
>>problem statement needs to say.
>>
>>garth
>>412-805-9878 (cell)
>>
>>-------------------------------------------------------
>>
>>pNFS requirements: Garth Gibson
>>-----------------
>>Andy Adamson <andros@citi.umich.edu>
>>David Black <Black_David@emc.com>
>>Jim Carlson <jvc@us.ibm.com>
>>Peter Corbett <pcorbett@netapp.com>
>>Craig Everhart <craigev@us.ibm.com>
>>Garth Gibson <garth@panasas.com>
>>Gary Grider <ggrider@lanl.gov>
>>Benny Halevy <bhalevy@panasas.com>
>>Jon Haswell <haswell@us.ibm.com>
>>Dean Hildebrand <dhildebranz@eecs.umich.edu>
>>Peter Honeyman <honey@citi.umich.edu>
>>Tyce McLarty <mclarty3@llnl.gov>
>>John Muth <john.muth@veritas.com>
>>Brian Pawlowski <Brian.Pawlowski@netapp.com>
>>David Pease <pease@almaden.ibm.com>
>>Julian Satran <Julian_Satran@il.ibm.com>
>>Spencer Shepler <spencer.shepler@sun.com>
>>Brent Welch <bwelch@panasas.com>
>
>



Attachment (not stored)
draft-ietf-pNFS-problem-statement.pdf
Type: application/pdf

Attachment (not stored)
draft-ietf-pNFS-problem-statement.doc
Type: application/msword

From bhalevy@panasas.com Mon Dec 15 23:06:28 2003
Return-Path: <bhalevy@panasas.com>
X-Sender: bhalevy@panasas.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 27808 invoked from network); 16 Dec 2003 07:06:28 -0000
Received: from unknown (66.218.66.216)
by m10.grp.scd.yahoo.com with QMQP; 16 Dec 2003 07:06:28 -0000
Received: from unknown (HELO PIKES.panasas.com) (65.194.124.178)
by mta1.grp.scd.yahoo.com with SMTP; 16 Dec 2003 07:06:28 -0000
Received: from yang ([172.17.19.46]) by PIKES.panasas.com with SMTP (Microsoft Exchange Internet Mail Service Version 5.5.2653.13)
id SVSY1CW2; Tue, 16 Dec 2003 02:05:59 -0500
To: <pnfs-reqs@yahoogroups.com>
Date: Tue, 16 Dec 2003 02:06:27 -0500
Message-ID: <LCEAJMHHKPKEPAIDBBEKOENBCAAA.bhalevy@panasas.com>
MIME-Version: 1.0
Content-Type: text/plain;
charset="US-ASCII"
Content-Transfer-Encoding: 7bit
X-Priority: 3 (Normal)
X-MSMail-Priority: Normal
X-Mailer: Microsoft Outlook IMO, Build 9.0.6604 (9.0.2911.0)
Importance: Normal
X-MimeOLE: Produced By Microsoft MimeOLE V6.00.2800.1165
X-eGroups-Remote-IP: 65.194.124.178
From: "Benny Halevy" <bhalevy@panasas.com>
Subject: FW: Re: NEPS-REQS: getting started
X-Yahoo-Group-Post: member; u=169276676
X-Yahoo-Profile: benny_halevy

-----Original Message-----
From: Tyce McLarty [mailto:mclarty3@llnl.gov]
Sent: Monday, December 15, 2003 13:49
To: Gary Grider; Garth Gibson; Craig Everhart; John Muth; Brian
Pawlowski; David Pease; Julian Satran; Spencer Shepler; Brent Welch;
Benny Halevy; Jon Haswell; Dean Hildebrand; Peter Honeyman; Jim Carlson;
Garth Gibson; Andy Adamson; Peter Corbett; David Black
Cc: Garth Gibson
Subject: Re: NEPS-REQS: getting started


I've been wondering how important it is too cast the "problem" as one of
cost, rather than as the ability to do things that cannot be done today
with added benefits in cost reduction.

I liked the list that Garth put up at the workshop:

Scalable bandwidth
Scalable capacity
Load balancing
capacity balancing

plus the big winner - a standardized client.

So the Introduction would be basically two paragraphs with (in either
order):
1. proposal to extend NFSv4 to allow parallel out-of-band client access to
data separate from metadata operations.
2. why it's important to do using the reasons outlined above.

My question is - How close do we need to model the RDMA problem statement?
Is cost the best/only justification or can we use new & needed capability
plus value added?

I think Gary has slanted his additions this direction, but seems like we
should all agree on some basic principles before we get too deep in
word-smithing.

Thanks,
Tyce

At 10:02 PM 12/12/2003 -0700, Gary Grider wrote:

>I decided to toss out a very quick and dirty draft with a lot of parts
>missing.
>Nothing sacred, just thoughts as they occurred to me partially organized.
>
>I put it in Word so I could get formatting, TOC, etc.
>
>I am attaching a Word and PDF.
>
>I would be happy to put this on a web site for us if you want. I also
>would be happy to
>centralize the edits and re-post it on the web etc.
>
>Thanks
>Gary


From bhalevy@panasas.com Mon Dec 15 23:11:59 2003
Return-Path: <bhalevy@panasas.com>
X-Sender: bhalevy@panasas.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 45118 invoked from network); 16 Dec 2003 07:11:56 -0000
Received: from unknown (66.218.66.167)
by m9.grp.scd.yahoo.com with QMQP; 16 Dec 2003 07:11:56 -0000
Received: from unknown (HELO PIKES.panasas.com) (65.194.124.178)
by mta6.grp.scd.yahoo.com with SMTP; 16 Dec 2003 07:11:58 -0000
Received: from yang ([172.17.19.46]) by PIKES.panasas.com with SMTP (Microsoft Exchange Internet Mail Service Version 5.5.2653.13)
id SVSY1CXS; Tue, 16 Dec 2003 02:11:56 -0500
To: <pnfs-reqs@yahoogroups.com>
Date: Tue, 16 Dec 2003 02:12:23 -0500
Message-ID: <LCEAJMHHKPKEPAIDBBEKKENECAAA.bhalevy@panasas.com>
MIME-Version: 1.0
Content-Type: text/plain;
charset="US-ASCII"
Content-Transfer-Encoding: 7bit
X-Priority: 3 (Normal)
X-MSMail-Priority: Normal
X-Mailer: Microsoft Outlook IMO, Build 9.0.6604 (9.0.2911.0)
Importance: Normal
X-MimeOLE: Produced By Microsoft MimeOLE V6.00.2800.1165
X-eGroups-Remote-IP: 65.194.124.178
From: "Benny Halevy" <bhalevy@panasas.com>
Subject: FW: (Garth Gibson) NFS Extensions for Parallel Storage, subgroup membership
X-Yahoo-Group-Post: member; u=169276676
X-Yahoo-Profile: benny_halevy

ADVERTISEMENT


-----Original Message-----
From: Garth Gibson [mailto:garth@panasas.com]
Sent: Thursday, December 11, 2003 01:54
To: Andy Adamson; David Black; Don Cameron; Jim Carlson; Peter Corbett;
Craig Everhart; Steve Fridella; Garth Gibson; Gary Grider; Benny Halevy;
Jon Haswell; Dean Hildebranz; Peter Honeyman; Xiaoye Jiang; Mike Kazar;
Tyce McLarty; John Muth; Dave Noveck; Brian Pawlowski; David Pease;
Julian Satran; Spencer Shepler; Brent Welch
Cc: Garth Gibson
Subject: NFS Extensions for Parallel Storage, subgroup membership


Folks,

Thanks for a great workshop last Thursday!

Materials presented that day are online:
http://www.citi.umich.edu/NEPS/agenda.html

Below are the workshop followup subgroup memberships as they are now.
I think I heard Peter say that he would construct auto-managed email
lists, which from the additions I've received this week, I have already
decided would be great. Please Peter! Names like neps-all, neps-reqs,
neps-ops, neps-sbc, neps-osd, neps-nfs would be great.

Our goals, to reprise, are to sketch a set of requirements for NFS
Extensions for Parallel Storage, or pNFS extensions, sketch a set of
NFS operation extensions (possibly including alternatives), sketch a
set of metadata definitions (possibly including alternatives) for
out-of-band data access over fixed block (SBC) SCSI protocols, object
(OSD) SCSI protocols and file (NFS) ONCRPC protocols.

We want to do this quickly, over the next few months, and to take it
into the IETF NFS process as a set of suggestions and strawman
protocols. The current plan is that at that point those of us that
follow through with this will to it in the IETF NFS working group. In
order to convince the IETF and the NFS working group that we have
important, useful and viable ideas, we are taking a little time to pull
together starting material.

The timelines discussed at the end of the workshop "heir of the dog"
session were:
- get workshop notes put together and out in December (Peter and Garth)
- 0th draft of a requirements/problem statement internet draft by mid
January
- IETF submission of an internet draft by first week of Feb, so it can
be part of the March IETF meeting and used as evidence for inclusion of
extensions for parallel storage into the NFS working group charter
- one or more documents (not necessarily fully agreeing) from each
subgroup into the IETF NFS email discussion for early to mid March
- a face-to-face followup workshop, open to the IETF NFS group at the
FAST 2004 conference, in San Francisco Mar 31 - Apr 2, at which all
further plans are proposed, argued and ratified (e.g. shall we be
absorbed into the IETF NFS group)

To help move this along, we have asked one person in each subgroup to
push, prod and pull ideas and words out of us. Please help these
sacrificial volunteers with by contributing text, criticizing
constructively with alternative text, and finding the time to read
materials.

These are volunteers in an unofficial process. We have no rules to be
applied by arbitration, no membership to take votes from. If this
consensus process, or these people, are not working out, then I suggest
grass roots alternatives be suggested and explored as a group. Lets
not get bogged down in process this early :-)

But there are always going to be logistical and procedural issues that
we need to deal with as a group. The suggestion at the workshop was
that these multi-subgroup issues be taken into the requirements group.
For example, I suggest that "scope" issues -- what we include and what
we exclude from our agenda -- be dealt with in the requirements group,
where we would need to add/delete requirements for each distinct aspect
of our scope.

I'm sure I'm way over the line giving this much direction :-) so I'll
leave it to the subgroups to decide mechanisms for progress. For
example, weekly conference calls, document exchange formats, editorship
delegation and/or rotation, agreement achieving processes, ....

And with that I'll go off and get to work on suggesting what our
problem statement needs to say.

garth
412-805-9878 (cell)

-------------------------------------------------------

pNFS requirements: Garth Gibson
-----------------
Andy Adamson <andros@citi.umich.edu>
David Black <Black_David@emc.com>
Jim Carlson <jvc@us.ibm.com>
Peter Corbett <pcorbett@netapp.com>
Craig Everhart <craigev@us.ibm.com>
Garth Gibson <garth@panasas.com>
Gary Grider <ggrider@lanl.gov>
Benny Halevy <bhalevy@panasas.com>
Jon Haswell <haswell@us.ibm.com>
Dean Hildebranz <dhildebz@umich.edu>
Peter Honeyman <honey@citi.umich.edu>
Tyce McLarty <mclarty3@llnl.gov>
John Muth <john.muth@veritas.com>
Brian Pawlowski <Brian.Pawlowski@netapp.com>
David Pease <pease@almaden.ibm.com>
Julian Satran <Julian_Satran@il.ibm.com>
Spencer Shepler <spencer.shepler@sun.com>
Brent Welch <bwelch@panasas.com>

NFSv4 ops for pNFS: Peter Honeyman
------------------
Andy Adamson <andros@citi.umich.edu>
David Black <Black_David@emc.com>
Peter Corbett <pcorbett@netapp.com>
Craig Everhart <craigev@us.ibm.com>
Garth Gibson <garth@panasas.com>
Benny Halevy <bhalevy@panasas.com>
Jon Haswell <haswell@us.ibm.com>
Dean Hildebranz <dhildebz@umich.edu>
Peter Honeyman <honey@citi.umich.edu>
Xiaoye Jiang <xjiang@emc.com>
John Muth <john.muth@veritas.com>
Dave Noveck <Dave.Noveck@netapp.com>
Brian Pawlowski <Brian.Pawlowski@netapp.com>
Julian Satran <Julian_Satran@il.ibm.com>
Spencer Shepler <spencer.shepler@sun.com>
Brent Welch <bwelch@panasas.com>

SBC metadata for pNFS: David Black
---------------------
Andy Adamson <andros@citi.umich.edu>
David Black <Black_David@emc.com>
Jim Carlson <jvc@us.ibm.com>
Craig Everhart <craigev@us.ibm.com>
Steve Fridella <fridella_stephen@emc.com>
Garth Gibson <garth@panasas.com>
Xiaoye Jiang <xjiang@emc.com>
Mike Kazar <kazar@spinnakernet.com>
John Muth <john.muth@veritas.com>
David Pease <pease@almaden.ibm.com>
Julian Satran <Julian_Satran@il.ibm.com>
Spencer Shepler <spencer.shepler@sun.com>

OSD metadata for pNFS: Brent Welch
---------------------
Andy Adamson <andros@citi.umich.edu>
Don Cameron <don.cameron@intel.com>
Peter Corbett <pcorbett@netapp.com>
Garth Gibson <garth@panasas.com>
Benny Halevy <bhalevy@panasas.com>
John Muth <john.muth@veritas.com>
Julian Satran <Julian_Satran@il.ibm.com>
Spencer Shepler <spencer.shepler@sun.com>
Brent Welch <bwelch@panasas.com>

NFS metadata for pNFS: Peter Corbett
---------------------
Andy Adamson <andros@citi.umich.edu>
Peter Corbett <pcorbett@netapp.com>
Craig Everhart <craigev@us.ibm.com>
Garth Gibson <garth@panasas.com>
Jon Haswell <haswell@us.ibm.com>
Dean Hildebranz <dhildebz@umich.edu>
Peter Honeyman <honey@citi.umich.edu>
Xiaoye Jiang <xjiang@emc.com>
John Muth <john.muth@veritas.com>
Julian Satran <Julian_Satran@il.ibm.com>
Spencer Shepler <spencer.shepler@sun.com>

From pnfs-reqs@yahoogroups.com Mon Dec 15 23:51:51 2003
Return-Path: <notify@yahoogroups.com>
Received: (qmail 39098 invoked from network); 16 Dec 2003 07:51:50 -0000
Received: from unknown (66.218.66.216)
by m12.grp.scd.yahoo.com with QMQP; 16 Dec 2003 07:51:50 -0000
Received: from unknown (HELO n6.grp.scd.yahoo.com) (66.218.66.90)
by mta1.grp.scd.yahoo.com with SMTP; 16 Dec 2003 07:51:50 -0000
X-eGroups-Return: notify@yahoogroups.com
Received: from [66.218.67.252] by n6.grp.scd.yahoo.com with NNFMP; 16 Dec 2003 07:51:44 -0000
Date: 16 Dec 2003 07:51:43 -0000
Message-ID: <1071561103.2719.47454.w73@yahoogroups.com>
X-eGroups-Application: files
X-Yahoo-Group-Post: system
From: pnfs-reqs@yahoogroups.com
To: pnfs-reqs@yahoogroups.com
Subject: New file uploaded to pnfs-reqs
MIME-Version: 1.0
Content-Type: text/plain
Content-Transfer-Encoding: 7bit
X-eGroups-Remote-IP: 66.218.66.90

Hello,

This email message is a notification to let you know that
a file has been uploaded to the Files area of the pnfs-reqs
group.

File : /draft-ietf-pNFS-problem-statement.doc
Uploaded by : benny_halevy <bhalevy@panasas.com>
Description : Gary Grider's draft 2003-12-13

You can access this file at the URL

http://groups.yahoo.com/group/pnfs-reqs/files/draft-ietf-pNFS-problem-statement.doc

To learn more about file sharing for your group, please visit

http://help.yahoo.com/help/us/groups/files

Regards,

benny_halevy <bhalevy@panasas.com>




From garth@panasas.com Wed Dec 17 21:34:01 2003
Return-Path: <garth@panasas.com>
X-Sender: garth@panasas.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 58554 invoked from network); 18 Dec 2003 05:34:01 -0000
Received: from unknown (66.218.66.218)
by m3.grp.scd.yahoo.com with QMQP; 18 Dec 2003 05:34:01 -0000
Received: from unknown (HELO PIKES.panasas.com) (65.194.124.178)
by mta3.grp.scd.yahoo.com with SMTP; 18 Dec 2003 05:34:00 -0000
Received: from panasas.com ([172.17.133.207]) by PIKES.panasas.com with SMTP (Microsoft Exchange Internet Mail Service Version 5.5.2653.13)
id SVSY1NBZ; Thu, 18 Dec 2003 00:33:58 -0500
Date: Thu, 18 Dec 2003 00:34:04 -0500
Content-Type: text/plain; charset=US-ASCII; format=flowed
Mime-Version: 1.0 (Apple Message framework v553)
Cc: Garth Gibson <garth@panasas.com>
To: pnfs-reqs@yahoogroups.com
Content-Transfer-Encoding: 7bit
Message-Id: <CB2073A7-311B-11D8-BE66-000393754F12@panasas.com>
X-Mailer: Apple Mail (2.553)
X-eGroups-Remote-IP: 65.194.124.178
From: Garth Gibson <garth@panasas.com>
Subject: Re: NEPS-REQS: getting started
X-Yahoo-Group-Post: member; u=169457820
X-Yahoo-Profile: garth_a_gibson

Tyce,

[I've emailed this through the Yahoo group Benny set up,
http://groups.yahoo.com/group/pnfs-reqs. I will forward it to the
folks that have not yet joined this Yahoo group after I get it sent
back to me :-)]

The RDDP problem statement is similar and dissimilar to what we are
doing. It is similar in that it is about higher performance, which
always turns out to be cost-performance. It is dissimilar in that it
was fighting an uphill battle to get RDMA into the IETF, while we are
looking at no preconceived support or opposition in the IETF (that I am
aware of). And it is dissimilar in that what we are proposing helps in
the manageability of federated systems, which is not really a
performance issue.

I followed the RDDP example closely because it was easy -- our
arguments on strictly bandwidth are at least as strong, in my opinion.
And because I am not certain how to predict the IETF management's
reaction to a manageability argument. And the standardized client code
argument, although very import to some of us, seemed outside my notion
of the IETF scope.

Perhaps those with more experience selling ideas to the IETF could
educate us? Should we focus on a small number of the most easily
demonstrated problems or fill the problem statement out with all the
problems we can contribute to solving?

garth


On Monday, December 15, 2003, at 01:49 PM, Tyce McLarty wrote:
> I've been wondering how important it is too cast the "problem" as one
> of cost, rather than as the ability to do things that cannot be done
> today with added benefits in cost reduction.
>
> I liked the list that Garth put up at the workshop:
>
> Scalable bandwidth
> Scalable capacity
> Load balancing
> capacity balancing
>
> plus the big winner - a standardized client.
>
> So the Introduction would be basically two paragraphs with (in either
> order):
> 1. proposal to extend NFSv4 to allow parallel out-of-band client
> access to data separate from metadata operations.
> 2. why it's important to do using the reasons outlined above.
>
> My question is - How close do we need to model the RDMA problem
> statement? Is cost the best/only justification or can we use new &
> needed capability plus value added?
>
> I think Gary has slanted his additions this direction, but seems like
> we should all agree on some basic principles before we get too deep in
> word-smithing.
>
> Thanks,
> Tyce
>
> At 10:02 PM 12/12/2003 -0700, Gary Grider wrote:
>
>> I decided to toss out a very quick and dirty draft with a lot of
>> parts missing.
>> Nothing sacred, just thoughts as they occurred to me partially
>> organized.
>>
>> I put it in Word so I could get formatting, TOC, etc.
>>
>> I am attaching a Word and PDF.
>>
>> I would be happy to put this on a web site for us if you want. I
>> also would be happy to
>> centralize the edits and re-post it on the web etc.
>>
>> Thanks
>> Gary

From garth@panasas.com Wed Dec 17 21:42:23 2003
Return-Path: <garth@panasas.com>
X-Sender: garth@panasas.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 93406 invoked from network); 18 Dec 2003 05:42:23 -0000
Received: from unknown (66.218.66.167)
by m11.grp.scd.yahoo.com with QMQP; 18 Dec 2003 05:42:23 -0000
Received: from unknown (HELO PIKES.panasas.com) (65.194.124.178)
by mta6.grp.scd.yahoo.com with SMTP; 18 Dec 2003 05:42:22 -0000
Received: from panasas.com ([172.17.133.207]) by PIKES.panasas.com with SMTP (Microsoft Exchange Internet Mail Service Version 5.5.2653.13)
id SVSY1NCX; Thu, 18 Dec 2003 00:42:19 -0500
Date: Thu, 18 Dec 2003 00:42:22 -0500
Content-Type: text/plain; charset=US-ASCII; format=flowed
Mime-Version: 1.0 (Apple Message framework v553)
Cc: Garth Gibson <garth@panasas.com>
To: pnfs-reqs@yahoogroups.com,
pnfs-ops@yahoogroups.com
Content-Transfer-Encoding: 7bit
Message-Id: <F3F660FC-311C-11D8-BE66-000393754F12@panasas.com>
X-Mailer: Apple Mail (2.553)
X-eGroups-Remote-IP: 65.194.124.178
From: Garth Gibson <garth@panasas.com>
Subject: pNFS Discussion Summary 1: 12/18/03
X-Yahoo-Group-Post: member; u=169457820
X-Yahoo-Profile: garth_a_gibson

Summary 1: 12/18/03

pNFS-ops and pNFS-reqs folks,

Following on the conversation that has been going on in the pNFS-ops
list since Brent put out his notes on the heir-of-the-dog meeting of
Fri Dec 5, I have tried below to summarize what I see as broad issues.
Your additions, corrections or directions are requested.

One theme I see evolving quickly is the differing opinions of the
driving requirements and how these drive differing opinions of
implementation issues in the NFSv4 operations discussion. I have tried
to identify which issues are more about requirements than about "how"
to achieve a requirement in NFSv4. This is not intended to be a power
play, by taking the topic out of the reach of anyone. It is more to
clarify which topics we need to resolve by defining our scope and share
with the folks that are only on the requirements email list. I imagine
the resolution to these requirements-related issues will be more
customer oriented and feature set driven.

Topics:

0.0 Defining Requirements
1.0 Minimalism
1.1 Proxying
1.2 Cache consistency
1.3 Delegation promotion & reacquisition
1.4 Layout delegations
1.5 Concurrent write
1.6 Map revocation
1.7 Separability
1.8 NTFS application semantics

----------------------------------------

[0.0 Defining Requirements]: What is the scope of requirements subgroup
doing and how is it related to the ops subgroup discussions?

I am beginning to see a significant difference between a "problem
statement" document and a "requirements" document. I believe that in a
problem statement we can make a strong case for a set of properties and
applications that are currently underserved in NFSv4, and a direction
that could in one or more steps resolve some or all of the problem.
Alternatively I am coming to see the detailed requirements as a
compendium of the most contentious and impactful issues, how they were
argued and what resolution was accepted. I can see the problem
statement getting done before we have sorted out all the hard problems,
or even run into all of them, so it is a good document for establishing
our interests in the IETF. But I suspect that the requirements
document stays open well into agreement on the specification issues.

For comparison, the first NFSv4 document was called "Design
Considerations" (rfc2624): This document is to cover the "limitations
and deficiencies of NFS version 3". This document will also be used as
a mechanism to focus discussion and avenues of investigation as the
definition of NFS version 4 progresses. Therefore, the contents of
this document cover the general functional/feature areas that are
anticipated for NFS version 4.

I propose that what we have started into in the requirements subgroup
is the problem statement, and that we should be careful to not let it
get bogged down in the longer term requirements resolutions.

----------------------------------------

[1.0 Minimalism]: How much additional functionality do we sacrifice to
limit the changes we seek in NFSv4?

On one hand, some have said that getting to one true file system, with
the high performance and the manageability of federated systems that
might come with out-of-band access, is worth not matching *every*
feature of all existing out-of-band file systems with this first set of
extensions to NFSv4. That we should bite off what we can do quickly,
correctly, with a clear incremental value to NFSv4, and roadmap more
aggressive changes that could bog us down, or introduce so much
complexity that interoperability becomes elusive. And that we should
be mindful of the reception we may get from the IETF NFS working group
if we *appear* to use out-of-band as an excuse to ask for a brace of
changes in other aspects of NFSv4.

On the other hand, the other out-of-band file systems that are
inspiring the evolution of NFSv4 have customers that may not accept any
backward sets in an evolution to NFSv4. This could create the need to
develop, carry and differentiate all the diverse one-off out-of-band
files systems plus a new out-of-band NFSv4. Some think it makes more
sense to go far enough with this first NFSv4 to simplify the
marketplace by making it reasonable for various vendors to
deprecate/end-of-life/begin to wean from their proprietary offering.

While it is certainly conceivable that we could be designing a roadmap
of solutions in detail from the start, communication among standards
bodies is hard enough without the challenge of designing specs for both
with and without a requirement.

This is a central issue in defining the requirements for out-of-band
NFSv4, or at least for defining the scope of the first set of
extensions.

----------------------------------------

[1.1 Proxying]: Operations/work that can only be done out-of-band vs
alternative access through the NFSv4 server for all operations/work

On one hand, some suggest that a set of out-of-band clients should not
have to also have a data path through the NFSv4 metadata server. One
reason is that customers may not tolerate the large variability in
performance between out-of-band (when the going is good) and in-band
(when the server chooses not to grant or to take away a delegation)
accesses. Another reason, and I paraphrase someone else here, is that
it is possible to construct out-of-band metadata servers that do not
have access to the data servers except through the clients -- I
encourage the source of this scenario to replace my paraphrasing with a
correct use case, because I find it odd to design for file servers that
do not have access to the data servers.

On the other hand, others have suggested that any access or work that a
client can do out-of-band should be possible with one or more commands
applied to the metadata server's data path. This has been proposed for
coping with recalled delegations, including concurrent writing by
multiple clients; retry after client access errors, provided adequate
idempotency of out-of-band operations; and many alternative
implementations of out-of-band clients, including legacy clients that
use out-of-band never or rarely.

I think this is a topic that should be argued one way or the other in
the requirements document. Use cases and examples in other systems
would be best.

----------------------------------------

[1.2 Cache consistency]: NFSv4 delegations are not about client cache
consistency; does out-of-band access require stronger cache consistency
than NFSv4 provides

NFSv4 cache consistency is a client function, based on testing file
attributes on open and close. While a client holds a delegation, its
users can close and reopen a file without recourse to the server, so
inside a delegation a client cache contents for that file must be valid
and up to date. However, a client cannot mandate getting a delegation
on open, it must immediately (approximately) give up a delegation if it
is recalled and a client has no way to reacquire a delegation on an
open file after that delegation has been recalled. So we must not
confuse delegations with strong cache consistency.

Many of the various proprietary out-of-band file systems have much
stronger client cache consistency, involving more different types and
interactions of cache callbacks. Some of these differences may have
been motivated by desire for differentiation, some by apps underserved
by NFS cache consistency semantics, and some by the long standing
designer belief that stronger semantics are theoretically better.

The question we must resolve, and argue in the requirements document,
is whether out-of-band access only within the NFSv4 cache consistency
and delegations is not sufficient, why and how much more must/should be
added before such a product is valuable.

I think that application use cases should be discussed. And I caution
us that most of us are the converted, coming to NFSv4 from one of these
proprietary file systems, so gaining agreement amongst ourselves easily
is not a good predictor of the challenge of gaining the agreement of
the NFS standards working group.

----------------------------------------

[1.3 Delegation promotion & reacquisition]: must/should NFSv4 offer
mechanisms for clients to possess a delegations more than once per open

Delegations in NFSv4 are new, and came with significant concern about
lots of complexity for not much performance, as they may do as little
as avoid the client waiting for one round trip to the server on open.
So, as described above with respect to cache consistency, the
limitations on delegations can mean great difficulties for clients
having performance requirements calling for out-of-band access mostly,
or exclusively.

So we have begun to propose mechanisms for clients to be more
aggressive about seeking, obtaining, reobtaining after a recall, and
even waiting for a signal that a denied delegation is now available.
This could lead to discussions of transitioning from a write delegation
to a read delegation, rather than no delegation, when a second
delegation is requested.

We all know, or can imagine, plenty of mechanism for this type of logic
-- after all, it is not far from what some systems do for cache
consistency. But all of this comes with complexity, that threat to
interoperability, and chips away at minimalism.

I would suggest that capture use cases to drive requirements for
controversial steps down this path.

----------------------------------------

[1.4 Layout delegations]: can/should layout metadata "ride" on NFSv4
delegations or are new "layout" delegations needed

If the delegations currently provided by NFSv4 are insufficient, for
reasons of cache consistency or the needed to be able to reacquire a
delegation in order to ensure that performance degradations can be
limited, then some are suggesting that rather than proposing to change
the semantics of the current delegations, we add new delegations
tailored to the purpose, so called layout delegations.

This is consistent with the advice we heard Dec 4 that it is much
easier, and more welcomed, to add new things to NFSv4 than to change
what is already there.

Assuming that in response to requirements arguments, we find the
existing NFSv4 delegations insufficient, then I think this topic is an
implementation issue for the NFSv4 operations subgroup. But I for one
would like to err on the side of fewer NFSv4 changes and slightly
weaker semantics, where possible.

----------------------------------------

[1.5 Concurrent write]: write delegations now are held by exactly one
client, if any; should/must NFS support multiple clients holding
concurrent layout delegations

One specifically excluded use case for out-of-band access is concurrent
write, actually concurrent read and write, or write and write, by
different clients. This is normally associated with expensive client
cache consistency algorithms, but for our purposes here, the issue is
managing the ordering, grouping/atomicity, and failure recovery of
changes on data servers, not updating/invalidating the contents of
client caches. It is certainly feasible to address out-of-band
concurrent writing to data servers without addressing client cache
consistency, if we so choose.

I believe three folks with experience with different existing file
systems referred to databases as the use case for needing concurrent
write.

I believe out-of-band concurrent write is an important use case to call
out carefully, because a ambitious implementation of it could lead to a
lot of state-maintaining messaging.

Some have said that, allowing multiple clients to hold the same lock is
a current need in NFSv4, and that a solution to this can provide the
infrastructure for concurrent delegation of layout maps for read and
overwrite (when growing the size of the file is not needed). This
seems like a good operations discussion topic.

----------------------------------------

[1.6 Map revocation]: can/must the NFS server be able to revoke a
client's use of a map, and enforce no future use (fence off the map)

NFSv4 delegations allow a broken or malicious client no additional
power to damage the stored file system because state changes must go
through the server. But a delegated layout map that is held and used
by a broken or malicious client after the delegation has been recalled
could damage the stored file system in a way that the server, by not
being on the data path, has no obvious way to protect against.

So there has been a call for the ability for the server to fence out a
client or enforce the revocation of a client's access to a specific
file or filesystem. At first glance all three data server
technologies, blocks, objects and files have some solution (blocks: lun
masking/acls or SAN zoning; objects: capability revocation, key
replacement; files: component file acls, volatile file handles). The
scope and cost of each of these mechanisms maybe dramatically different.

Some would say that this is going to end up being a differentiating
property of the choice of underlying data server. For example, many
would say that in systems that allow out-of-band block access, the
client machines must be trustworthy to respect the delegation recall
message (and lease timeouts). Others would object to this weakening of
the NFS server integrity.

I also see this as a requirements argument.

----------------------------------------

[1.7 Separability]: Independence vs co-dependence of layout metadata
access and NFSv4

On one hand, simple "an address per block/object/file" maps could be
represented as an array of NFSv4 attributes, manipulated using existing
NFSv4 attribute accessing commands, so to reduce the amount of change
to NFSv4.

On the other hand, particularly for block maps of large files composed
of extents, simple array indexing may be cumbersome and much bulkier
than necessary.

And also on the other hand, some suggest that it is desirable for the
metadata access protocol to be separate from NFSv4 attribute access, so
that the same metadata access protocol might be reusable under other
file services.

I think this topic would benefit from proposed metadata formats,
particularly the SBC (block) maps.

----------------------------------------

[1.8 NTFS application semantics]: applications coded to NTFS semantics
are different from those coded to POSIX and UNIX semantics

NFS originated as a exported file system, whose semantics were defined
by the underlying local filesystem on the file server. But since that
local filesystem has almost always been UNIX or UNIX like, customers
have come to think of NFS semantics as a well defined thing, not far
from UNIX semantics (but with a customary list of POSIX exceptions).
The semantics NTFS presents to applications using its storage is
different in significant ways.

Some of us see an evolution to better support for clients trying to
support NTFS well to be very desirable. Others see chasing this as
more than the NFS group as a whole is likely to bite off.

This, and any other issues about wire protocol support for important
semantics needed by different application file system interfaces
(middleware exploited API extensions in databases or parallel
programming systems such as MPI-IO) are also requirements topics.

End summary 1.

From bhalevy@panasas.com Wed Dec 17 22:13:53 2003
Return-Path: <bhalevy@panasas.com>
X-Sender: bhalevy@panasas.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 96391 invoked from network); 18 Dec 2003 06:13:52 -0000
Received: from unknown (66.218.66.217)
by m5.grp.scd.yahoo.com with QMQP; 18 Dec 2003 06:13:52 -0000
Received: from unknown (HELO PIKES.panasas.com) (65.194.124.178)
by mta2.grp.scd.yahoo.com with SMTP; 18 Dec 2003 06:13:52 -0000
Received: from yang ([172.17.19.55]) by PIKES.panasas.com with SMTP (Microsoft Exchange Internet Mail Service Version 5.5.2653.13)
id SVSY1NF3; Thu, 18 Dec 2003 01:13:50 -0500
To: <garth@panasas.com>
Cc: <pnfs-reqs@yahoogroups.com>
Date: Thu, 18 Dec 2003 01:13:41 -0500
Message-ID: <LCEAJMHHKPKEPAIDBBEKIENGCAAA.bhalevy@panasas.com>
MIME-Version: 1.0
Content-Type: text/plain;
charset="us-ascii"
Content-Transfer-Encoding: 7bit
X-Priority: 3 (Normal)
X-MSMail-Priority: Normal
X-Mailer: Microsoft Outlook IMO, Build 9.0.6604 (9.0.2911.0)
X-MimeOLE: Produced By Microsoft MimeOLE V6.00.2800.1165
Importance: Normal
In-reply-to: <CB2073A7-311B-11D8-BE66-000393754F12@panasas.com>
X-eGroups-Remote-IP: 65.194.124.178
From: "Benny Halevy" <bhalevy@panasas.com>
Subject: RE: [pnfs-reqs] Re: NEPS-REQS: getting started
X-Yahoo-Group-Post: member; u=169276676
X-Yahoo-Profile: benny_halevy

Garth,

In case you guys want to broaden the problem statement...
There are a couple of arguments I believe may be appealing to
the IETF:

1. Interoperability.
Several of the existing non monolithic file systems mentioned
use proprietary protocols carried over Internet protocols.
Standardizing their access protocols within NFS will allow
interoperability between heterogeneous client hosts and
heterogeneous server systems.

The standardized client argument may fall into the interoperability
category from the IETF point of view.

2. Taking advantage of IP SANs
With the introduction of iSCSI, block and object based storage systems
become accessible over IP based networks. NEPS takes advantage of this
paradigm be allowing clients direct (yet moderated and secure) access
to networked storage and therefore it enhances the value proposition
of IP SANs.

Benny

> -----Original Message-----
> From: Garth Gibson [mailto:garth@Panasas.Com]
> Sent: Thursday, December 18, 2003 00:34
> To: pnfs-reqs@yahoogroups.com
> Cc: Garth Gibson
> Subject: [pnfs-reqs] Re: NEPS-REQS: getting started
>
>
> Tyce,
>
> [I've emailed this through the Yahoo group Benny set up,
> http://groups.yahoo.com/group/pnfs-reqs. I will forward it to the
> folks that have not yet joined this Yahoo group after I get it sent
> back to me :-)]
>
> The RDDP problem statement is similar and dissimilar to what we are
> doing. It is similar in that it is about higher performance, which
> always turns out to be cost-performance. It is dissimilar in that it
> was fighting an uphill battle to get RDMA into the IETF, while we are
> looking at no preconceived support or opposition in the IETF (that I am
> aware of). And it is dissimilar in that what we are proposing helps in
> the manageability of federated systems, which is not really a
> performance issue.
>
> I followed the RDDP example closely because it was easy -- our
> arguments on strictly bandwidth are at least as strong, in my opinion.
> And because I am not certain how to predict the IETF management's
> reaction to a manageability argument. And the standardized client code
> argument, although very import to some of us, seemed outside my notion
> of the IETF scope.
>
> Perhaps those with more experience selling ideas to the IETF could
> educate us? Should we focus on a small number of the most easily
> demonstrated problems or fill the problem statement out with all the
> problems we can contribute to solving?
>
> garth
>
>
> On Monday, December 15, 2003, at 01:49 PM, Tyce McLarty wrote:
> > I've been wondering how important it is too cast the "problem" as one
> > of cost, rather than as the ability to do things that cannot be done
> > today with added benefits in cost reduction.
> >
> > I liked the list that Garth put up at the workshop:
> >
> > Scalable bandwidth
> > Scalable capacity
> > Load balancing
> > capacity balancing
> >
> > plus the big winner - a standardized client.
> >
> > So the Introduction would be basically two paragraphs with (in either
> > order):
> > 1. proposal to extend NFSv4 to allow parallel out-of-band client
> > access to data separate from metadata operations.
> > 2. why it's important to do using the reasons outlined above.
> >
> > My question is - How close do we need to model the RDMA problem
> > statement? Is cost the best/only justification or can we use new &
> > needed capability plus value added?
> >
> > I think Gary has slanted his additions this direction, but seems like
> > we should all agree on some basic principles before we get too deep in
> > word-smithing.
> >
> > Thanks,
> > Tyce
> >
> > At 10:02 PM 12/12/2003 -0700, Gary Grider wrote:
> >
> >> I decided to toss out a very quick and dirty draft with a lot of
> >> parts missing.
> >> Nothing sacred, just thoughts as they occurred to me partially
> >> organized.
> >>
> >> I put it in Word so I could get formatting, TOC, etc.
> >>
> >> I am attaching a Word and PDF.
> >>
> >> I would be happy to put this on a web site for us if you want. I
> >> also would be happy to
> >> centralize the edits and re-post it on the web etc.
> >>
> >> Thanks
> >> Gary
>
>
> To unsubscribe from this group, send an email to:
> pnfs-reqs-unsubscribe@yahoogroups.com
>
>
>
> Yahoo! Groups Links
>
> To visit your group on the web, go to:
> http://groups.yahoo.com/group/pnfs-reqs/
>
> To unsubscribe from this group, send an email to:
> pnfs-reqs-unsubscribe@yahoogroups.com
>
> Your use of Yahoo! Groups is subject to:
> http://docs.yahoo.com/info/terms/
>
> 

From garth@panasas.com Thu Dec 18 14:37:55 2003
Return-Path: <garth@panasas.com>
X-Sender: garth@panasas.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 89807 invoked from network); 18 Dec 2003 22:37:54 -0000
Received: from unknown (66.218.66.216)
by m13.grp.scd.yahoo.com with QMQP; 18 Dec 2003 22:37:54 -0000
Received: from unknown (HELO PIKES.panasas.com) (65.194.124.178)
by mta1.grp.scd.yahoo.com with SMTP; 18 Dec 2003 22:37:54 -0000
Received: from panasas.com ([172.17.133.207]) by PIKES.panasas.com with SMTP (Microsoft Exchange Internet Mail Service Version 5.5.2653.13)
id SVSY1RMV; Thu, 18 Dec 2003 17:37:52 -0500
Date: Thu, 18 Dec 2003 17:37:50 -0500
Content-Type: text/plain; charset=US-ASCII; format=flowed
Mime-Version: 1.0 (Apple Message framework v553)
To: pNFS Operations <pnfs-ops@yahoogroups.com>,
pNFS Requirements <pnfs-reqs@yahoogroups.com>
Content-Transfer-Encoding: 7bit
In-Reply-To: <C8CF60CFC4D8A74E9945E32CF096548A6D361F@silver.nane.netapp.com>
Message-Id: <CF94E7DF-31AA-11D8-996E-000393754F12@panasas.com>
X-Mailer: Apple Mail (2.553)
X-eGroups-Remote-IP: 65.194.124.178
From: Garth Gibson <garth@panasas.com>
Subject: Re: [pnfs-ops] pNFS Discussion Summary 1: 12/18/03: subtopic: proxying
X-Yahoo-Group-Post: member; u=169457820
X-Yahoo-Profile: garth_a_gibson

ADVERTISEMENT
Thanks Dave. I agree. Lets refine the proxying issues: Legacy,
strict, functional and recovery proxying.

[1.1.0 Legacy proxying]: an NFS-v4.x server must be able to execute the
full NFS-v4.0 or NFS-v4.1 protocol.

I think Dave has given the case for this strongly. I do not see any
case against this.

-------------------------------------------

[1.1.1 Strict proxying]: does an NFS-v4.x server have to be able to
execute exactly the wire packet that an NFS-v4.x client might have sent
to a SBC/OSD/NFS data server?

This captures the notion that a metadata server must also be a
store-and-forward proxy for every data server it manages. It requires
NFS-v4.x servers implement SCSI SBC over FC, if their data servers
implement it; and the same for objects and files.

This only makes sense to me for NFS data servers. And it is not what I
intended in my prior summary, although it is a relevant question. I
would say that pNFS requirements not require Strict Proxying.

-------------------------------------------

[1.1.2 Functional proxying]: a file transformation achievable by an
NFS-v4.x client using a set of data server operations must be a
equivalently achievable using a (probably different) set of NFS-v4.x
server operations

This is the topic I intended to address in the last email. I believe
Dave is arguing that even with metadata servers that do not have access
to their data servers, the vendor of such a metadata server can
construct a proprietary protocol for the metadata server to (strict)
proxy data server accesses through clients that do have data server
access. I am not comfortable making up a counter to this, so I exhort
those that want a metadata server without data server access to speak
up if they disagree.

> On one hand, some suggest that a set of out-of-band clients should not
> have to also have a data path through the NFSv4 metadata server. One
> reason is that customers may not tolerate the large variability in
> performance between out-of-band (when the going is good) and in-band
> (when the server chooses not to grant or to take away a delegation)
> accesses. Another reason, and I paraphrase someone else here, is that
> it is possible to construct out-of-band metadata servers that do not
> have access to the data servers except through the clients -- I
> encourage the source of this scenario to replace my paraphrasing with
> a correct use case, because I find it odd to design for file servers
> that do not have access to the data servers.
>
> On the other hand, others have suggested that any access or work that
> a client can do out-of-band should be possible with one or more
> commands applied to the metadata server's data path. This has been
> proposed for coping with recalled delegations, including concurrent
> writing by multiple clients; retry after client access errors,
> provided adequate idempotency of out-of-band operations; and many
> alternative implementations of out-of-band clients, including legacy
> clients that use out-of-band never or rarely.
>
> I think this is a topic that should be argued one way or the other in
> the requirements document. Use cases and examples in other systems
> would be best.

-------------------------------------------

[1.1.3 Recovery proxying]: a file transformation begun by an NFS-v4.x
client using a set of data server operations, but interrupted before
completion, must be equivalently completable using a (probably
different) set of NFS-v4.x server operations

Some have suggested that having this property will greatly simplify the
amount of spec that is devoted to out-of-band error recovery. Others
have commented that a simple way to achieve this would be to require
that all operations on data servers should be idempotent.

-------------------------------------------

garth


On Thursday, December 18, 2003, at 12:21 PM, Noveck, Dave wrote:

> Good summary.
>
> I want to address the "proxying" issue.
>
>> [1.1 Proxying]: Operations/work that can only be done out-of-band vs
>> alternative access through the NFSv4 server for all operations/work
>
> If you are talking about operations in the extension (let's call it
> NFS-v4.x), that are not in the previous minor version (let's assume
> that is nfs-v4.1), then you have a choice of whether these are
> supported
> for access through the server, or only for access by the client with
> the
> data server. Let's call this the issue of proxying in the strict
> sense.
>
> There is another issue that people are calling "proxying" but is really
> logically distinct. That is the issue of access by the previous minor
> version, e.g. nfs-v4.0 or nfs-v4.1. Those versions have no concept of
> separate data servers and they need to be able to work. End of story.
> If you can't read files stored in nfs-v4.x with nfs-v4.0, you do not
> have a minor version without proxying. You don't have a minor version
> at all. I believe the working group is never going to accept that.
> Even if I'm wrong and you can get the working group to accept that,
> it is going to be very contentious and thus take up a lot of time.
> Anybody, who really wants to go down this path should seriously
> consider
> the trade-off between supporting something they find objectionable and
> getting a standard a lot later, if at all.
>
>> On one hand, some suggest that a set of out-of-band clients should not
>> have to also have a data path through the NFSv4 metadata server. One
>> reason is that customers may not tolerate the large variability in
>> performance between out-of-band (when the going is good) and in-band
>> (when the server chooses not to grant or to take away a delegation)
>> accesses.
>
> Then such customers will use clients that access things out-of-band
> whenever possible, and servers that never refuse to give out layout
> delegations. You have a number of quality-of-implementations issues
> for v4.x clients and servers. If a particular client only supports
> access via v4.0, then performance will suck, and the working group
> will understand that, but it won't accept not being able to use
> v4.0 at all. The customer is going to be motivated to upgrade his
> clients for those that need high-performance access, but he may be
> OK with some clients using v4.0 for a long time, depending on the
> particular performance those clients need. (And some will want v2/v3
> access but that is a matter that the working group has no say about).
>
>> Another reason, and I paraphrase someone else here, is that
>> it is possible to construct out-of-band metadata servers that do not
>> have access to the data servers except through the clients -- I
>> encourage the source of this scenario to replace my paraphrasing with
>> a
>> correct use case, because I find it odd to design for file servers
>> that
>> do not have access to the data servers.
>
> So let's grant that it is possible (and we'll pass over the issue of
> whether it is desirable, and in fact so desirable that one is willing
> to
> not get a standard and or get it much later).
>
> So we have a metadata server and it, for whatever reason, does not have
> access to the data servers. However, by hypothesis, there are machines
> (e.g. clients), that can communicate with both. So, if one has such an
> architecture, then one can take such a machine, give it a
> communication path
> to the meta-data server and the data server and have the meta-data
> server
> transfer v4.0 READ requests to it, let it read the data from the data
> server and send it back to the meta-data server who send it back to the
> original requestor. Is that a very good solution? No. Is it likely
> to be performant? No. Will it satisfy any particular customer? I
> don't
> know and that is the implementer's business decision. Will it satisfy
> the hypothetical customer who doesn't care about v4.0 access? Clearly.
> Will it satisfy the v4 working group? Yes, because they are not in the
> business of telling you how performant v4.0 access has got to be.
>
>> On the other hand, others have suggested that any access or work that
>> a
>> client can do out-of-band should be possible with one or more commands
>> applied to the metadata server's data path. This has been proposed
>> for
>> coping with recalled delegations, including concurrent writing by
>> multiple clients; retry after client access errors, provided adequate
>> idempotency of out-of-band operations; and many alternative
>> implementations of out-of-band clients, including legacy clients that
>> use out-of-band never or rarely.
>
> This effort is going to take a while, but if we manage it correctly, it
> is not going to take so long that v3 clients are going to be rare
> things,
> and they have to be supported. But v3 clients are not an issue for the
> working group. V4.0 clients are and they will be around and you will
> have to support them, and I believe the working group is not going to
> be disposed to cut you a lot of slack on this issue (and I don't see
> why it should).
>
>> I think this is a topic that should be argued one way or the other in
>> the requirements document. Use cases and examples in other systems
>> would be best.
>
> I think the requirement should be that this work should be done as a
> set of extensions to nfs-v4 delivered as a v4 minor version. If there
> is some feature/requirement that conflicts with that model (and it is a
> pretty flexible one), then you have to think long and hard before
> deciding
> that that requirement is more important than this basic deivery
> vehicle,
> because it seems to me that it is, in almost all respects, the ideal
> way
> to make this sort of technology available for widespread use.
>
>
>
>
>
>
> To unsubscribe from this group, send an email to:
> pnfs-ops-unsubscribe@yahoogroups.com
>
>
>
> Yahoo! Groups Links
>
> To visit your group on the web, go to:
> http://groups.yahoo.com/group/pnfs-ops/
>
> To unsubscribe from this group, send an email to:
> pnfs-ops-unsubscribe@yahoogroups.com
>
> Your use of Yahoo! Groups is subject to:
> http://docs.yahoo.com/info/terms/
>



From julian_satran@il.ibm.com Mon Dec 22 02:26:02 2003
Return-Path: <Julian_Satran@il.ibm.com>
X-Sender: Julian_Satran@il.ibm.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 86299 invoked from network); 22 Dec 2003 10:26:01 -0000
Received: from unknown (66.218.66.218)
by m11.grp.scd.yahoo.com with QMQP; 22 Dec 2003 10:26:01 -0000
Received: from unknown (HELO mtagate3.de.ibm.com) (195.212.29.152)
by mta3.grp.scd.yahoo.com with SMTP; 22 Dec 2003 10:26:00 -0000
Received: from d12relay01.megacenter.de.ibm.com (d12relay01.megacenter.de.ibm.com [9.149.165.180] (may be forged))
by mtagate3.de.ibm.com (8.12.10/8.12.10) with ESMTP id hBMAPxn0031456;
Mon, 22 Dec 2003 10:25:59 GMT
Received: from d10ml001.telaviv.ibm.com (d12av02.megacenter.de.ibm.com [9.149.165.228])
by d12relay01.megacenter.de.ibm.com (8.12.10/NCO/VER6.6) with ESMTP id hBMAPwG4256428;
Mon, 22 Dec 2003 11:25:58 +0100
In-Reply-To: <C8CF60CFC4D8A74E9945E32CF096548A6D361F@silver.nane.netapp.com>
To: pnfs-ops@yahoogroups.com
Cc: pnfs-ops@yahoogroups.com, pnfs-reqs@yahoogroups.com
MIME-Version: 1.0
X-Mailer: Lotus Notes Release 6.5 September 26, 2003
Message-ID: <OF41F5BC67.10F2AEA8-ONC2256E04.003885DC-C2256E04.00394C09@il.ibm.com>
Date: Mon, 22 Dec 2003 12:25:57 +0200
X-MIMETrack: Serialize by Router on D10ML001/10/M/IBM(Release 6.0.2CF2|July 23, 2003) at
22/12/2003 12:25:58,
Serialize complete at 22/12/2003 12:25:58
Content-Type: multipart/alternative; boundary="=_alternative 00394B9DC2256E04_="
X-eGroups-Remote-IP: 195.212.29.152
From: Julian Satran <julian_satran@il.ibm.com>
Subject: RE: [pnfs-ops] pNFS Discussion Summary 1: 12/18/03
X-Yahoo-Group-Post: member; u=64714603

Since I raised the issue of the metadata server not having access to all it's data servers (or at least not with adequate bandwidth) I feel compelled to say that Dave's arguments about supporting 4.0 are compelling enough to make it mandatory. The open issue is if it is legal for a "compliant server" to have serving data disabled by a local administrative function (the old "must implement but may use"). Otherwise an organization that wants to discourage use of data serving through the metadata server has very little it can do to enforce policy in a way that will not affect other clients (it may do serve poorly but this still affects other clients).

Julo


"Noveck, Dave" <dnoveck@netapp.com>

18/12/2003 19:21
Please respond to
pnfs-ops@yahoogroups.com

	
To
	<pnfs-ops@yahoogroups.com>, <pnfs-reqs@yahoogroups.com>
cc
	
Subject
	RE: [pnfs-ops] pNFS Discussion Summary 1: 12/18/03

	




Good summary.

I want to address the "proxying" issue.

> [1.1 Proxying]: Operations/work that can only be done out-of-band vs
> alternative access through the NFSv4 server for all operations/work

If you are talking about operations in the extension (let's call it
NFS-v4.x), that are not in the previous minor version (let's assume
that is nfs-v4.1), then you have a choice of whether these are supported
for access through the server, or only for access by the client with the
data server.  Let's call this the issue of proxying in the strict sense.

There is another issue that people are calling "proxying" but is really
logically distinct.  That is the issue of access by the previous minor
version, e.g. nfs-v4.0 or nfs-v4.1.  Those versions have no concept of
separate data servers and they need to be able to work.  End of story.
If you can't read files stored in nfs-v4.x with nfs-v4.0, you do not
have a minor version without proxying.  You don't have a minor version
at all.  I believe the working group is never going to accept that.
Even if I'm wrong and you can get the working group to accept that,
it is going to be very contentious and thus take up a lot of time.
Anybody, who really wants to go down this path should seriously consider
the trade-off between supporting something they find objectionable and
getting a standard a lot later, if at all.

> On one hand, some suggest that a set of out-of-band clients should not
> have to also have a data path through the NFSv4 metadata server.  One
> reason is that customers may not tolerate the large variability in
> performance between out-of-band (when the going is good) and in-band
> (when the server chooses not to grant or to take away a delegation)
> accesses.  

Then such customers will use clients that access things out-of-band
whenever possible, and servers that never refuse to give out layout
delegations.  You have a number of quality-of-implementations issues
for v4.x clients and servers.  If a particular client only supports
access via v4.0, then performance will suck, and the working group
will understand that, but it won't accept not being able to use
v4.0 at all.  The customer is going to be motivated to upgrade his
clients for those that need high-performance access, but he may be
OK with some clients using v4.0 for a long time, depending on the
particular performance those clients need.  (And some will want v2/v3
access but that is a matter that the working group has no say about).

> Another reason, and I paraphrase someone else here, is that
> it is possible to construct out-of-band metadata servers that do not
> have access to the data servers except through the clients -- I
> encourage the source of this scenario to replace my paraphrasing with a
> correct use case, because I find it odd to design for file servers that
> do not have access to the data servers.

So let's grant that it is possible (and we'll pass over the issue of
whether it is desirable, and in fact so desirable that one is willing to
not get a standard and or get it much later).

So we have a metadata server and it, for whatever reason, does not have
access to the data servers.  However, by hypothesis, there are machines
(e.g. clients), that can communicate with both.  So, if one has such an
architecture, then one can take such a machine, give it a communication path
to the meta-data server and the data server and have the meta-data server
transfer v4.0 READ requests to it, let it read the data from the data
server and send it back to the meta-data server who send it back to the
original requestor.  Is that a very good solution?  No.  Is it likely
to be performant?  No.  Will it satisfy any particular customer?  I don't
know and that is the implementer's business decision.  Will it satisfy
the hypothetical customer who doesn't care about v4.0 access?  Clearly.
Will it satisfy the v4 working group?  Yes, because they are not in the
business of telling you how performant v4.0 access has got to be.

> On the other hand, others have suggested that any access or work that a
> client can do out-of-band should be possible with one or more commands
> applied to the metadata server's data path.  This has been proposed for
> coping with recalled delegations, including concurrent writing by
> multiple clients; retry after client access errors, provided adequate
> idempotency of out-of-band operations; and many alternative
> implementations of out-of-band clients, including legacy clients that
> use out-of-band never or rarely.

This effort is going to take a while, but if we manage it correctly, it
is not going to take so long that v3 clients are going to be rare things,
and they have to be supported.  But v3 clients are not an issue for the
working group.  V4.0 clients are and they will be around and you will
have to support them, and I believe the working group is not going to
be disposed to cut you a lot of slack on this issue (and I don't see
why it should).

> I think this is a topic that should be argued one way or the other in
> the requirements document.  Use cases and examples in other systems
> would be best.

I think the requirement should be that this work should be done as a
set of extensions to nfs-v4 delivered as a v4 minor version.  If there
is some feature/requirement that conflicts with that model (and it is a
pretty flexible one), then you have to think long and hard before deciding
that that requirement is more important than this basic deivery vehicle,
because it seems to me that it is, in almost all respects, the ideal way
to make this sort of technology available for widespread use.






To unsubscribe from this group, send an email to:
pnfs-ops-unsubscribe@yahoogroups.com



Yahoo! Groups Links

To visit your group on the web, go to:
http://groups.yahoo.com/group/pnfs-ops/

To unsubscribe from this group, send an email to:
pnfs-ops-unsubscribe@yahoogroups.com

Your use of Yahoo! Groups is subject to:
http://docs.yahoo.com/info/terms/ 

From bhalevy@panasas.com Mon Dec 22 11:42:01 2003
Return-Path: <bhalevy@panasas.com>
X-Sender: bhalevy@panasas.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 94145 invoked from network); 22 Dec 2003 19:41:59 -0000
Received: from unknown (66.218.66.166)
by m6.grp.scd.yahoo.com with QMQP; 22 Dec 2003 19:41:59 -0000
Received: from unknown (HELO PIKES.panasas.com) (65.194.124.178)
by mta5.grp.scd.yahoo.com with SMTP; 22 Dec 2003 19:41:59 -0000
Received: by PIKES.panasas.com with Internet Mail Service (5.5.2653.19)
id <SVSYFB9C>; Mon, 22 Dec 2003 14:41:57 -0500
Message-ID: <30489F1321F5C343ACF6872B2CF7942A05D38733@PIKES.panasas.com>
To: "'julian_satran@il.ibm.com'" <julian_satran@il.ibm.com>,
"'pnfs-ops@yahoogroups.com'" <pnfs-ops@yahoogroups.com>
Cc: "'pnfs-reqs@yahoogroups.com'" <pnfs-reqs@yahoogroups.com>
Date: Mon, 22 Dec 2003 14:41:53 -0500
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2653.19)
Content-Type: text/plain;
charset="iso-8859-1"
X-eGroups-Remote-IP: 65.194.124.178
From: "Halevy, Benny" <bhalevy@panasas.com>
Subject: RE: [pnfs-ops] delegation arguments summary
X-Yahoo-Group-Post: member; u=169276676
X-Yahoo-Profile: benny_halevy

> > * layout delegation revocation (and enforcement of)
> > This issue is orthogonal. We dicussed volatile file handles, OSD
> > capabilities, and SAN LUN mapping techniques.
> >
>
> Almost orthogonal. There is a subtle problem of sharing layout delegations if one of clientts is doing writes or appends.

This falls under CW (concurrent write) sharing since there is one or more writers.
By saying "this issue is orthogonal" I meant that the mechanism for revoking the
layout delegation is orthogonal to whether we need a complete new set of
delegations or extend the current model.

I agree that when the layout changes due to writes, appends, or for any other
reason the server has to recall layout delegations, at least from those clients
that requested layout for region that's about to be the changed. Hopefully,
all clients behave nicely and their delegations do not have to be revoked.
You want to revoke the layout delegation from unresponsive clients since allowing
them to use the stale layout may end up with data corruption.

Speaking of append, I always thought it'd be really nice to have an NFS APPEND
operation... This seems like something we can propose right away on nfsv4@ietf.org
How does people on this list feel about that?

A use case I encountered is a customer that use a shared file as a log and have
multiple nodes in the cluster appending to that file with some coordination
(right now, NFSv3 + NLM). They don't care about ordering of the appended records
and they even accept records written more than once to the file, but they do care
about the consistency of each record so writers can't just silently overwrite
each other.

> The issue is furthermore complicated by the "sparse" layout that we all want to support (do we?)

Can you please turn the details knob on "sparse" layout and maybe give a
concrete example where this layout make the proposed model fall short?

> > layout delegation:
> > - returned on READ_IND, WRITE_IND, LAYOUT_DELEG_ASK
> >
> > Covers only layout (aggregation header, map, handles/caps).
> > Optional, recallable, revocable.
> > Assures the client that the layout information it has will not change.
>
> But the layout information may change even in the most trivial single writer case and definitely in RW cases.

Correct, when the layout is about to be changed (a writer calls COMMIT_IND)
or when there is a write-write conflict (two clients call WRITE_IND for
overlapping regions) some or all layout delegations must be recalled.

> > WRITE yes client can safely cache read and write data,
> > serve opens, and locks locally and can perform
> > out-of-band or server reads and writes.
> At least this requires mapping updates for block storage.
> For those souls that want strict local-FS semantics (UNIX) cache and map invalidations can be a side-effect of the byte-range locking mechanism.

This sounds like something that falls into the distributed cache coherency
realm - meaning multiple clients have a CW data delegation and a layout delegation.
My assumption was that in this case the logical block map changes
rarely when the clients are writing in place, otherwise they should fall back to
writing through the server. Having an efficient distributed cache coherency
mechanism in NFS seems to me like a stretch but it's worth a discussion to see
if block based SAN filesystems can or can't live without it.

Benny

From ggrider@lanl.gov Mon Dec 22 11:53:58 2003
Return-Path: <ggrider@lanl.gov>
X-Sender: ggrider@lanl.gov
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 3731 invoked from network); 22 Dec 2003 19:53:57 -0000
Received: from unknown (66.218.66.166)
by m11.grp.scd.yahoo.com with QMQP; 22 Dec 2003 19:53:57 -0000
Received: from unknown (HELO mailwasher-b.lanl.gov) (192.16.0.25)
by mta5.grp.scd.yahoo.com with SMTP; 22 Dec 2003 19:53:57 -0000
Received: from mailrelay3.lanl.gov (localhost.localdomain [127.0.0.1])
by mailwasher-b.lanl.gov (8.12.10/8.12.10/(ccn-5)) with ESMTP id hBMJrufK001673;
Mon, 22 Dec 2003 12:53:56 -0700
Received: from cic-mail.lanl.gov (localhost.localdomain [127.0.0.1])
by mailrelay3.lanl.gov (8.12.10/8.12.10/(ccn-5)) with ESMTP id hBMJrtIt031106;
Mon, 22 Dec 2003 12:53:55 -0700
Received: from cthulu.lanl.gov (vpn-client-189.lanl.gov [128.165.253.189])
by cic-mail.lanl.gov (8.12.10/8.12.10/(ccn-5)) with ESMTP id hBMJrqFR016230;
Mon, 22 Dec 2003 12:53:53 -0700
Message-Id: <5.2.0.9.2.20031222125146.018b3cc0@cic-mail.lanl.gov>
X-Sender: ggrider@cic-mail.lanl.gov
X-Mailer: QUALCOMM Windows Eudora Version 5.2.0.9
Date: Mon, 22 Dec 2003 12:53:51 -0700
To: pnfs-reqs@yahoogroups.com,
"'julian_satran@il.ibm.com'" <julian_satran@il.ibm.com>,
"'pnfs-ops@yahoogroups.com'" <pnfs-ops@yahoogroups.com>
Cc: "'pnfs-reqs@yahoogroups.com'" <pnfs-reqs@yahoogroups.com>
In-Reply-To: <30489F1321F5C343ACF6872B2CF7942A05D38733@PIKES.panasas.com
>
Mime-Version: 1.0
Content-Type: multipart/alternative;
boundary="=====================_15088946==.ALT"
X-Scanned-By: MIMEDefang 2.35
X-eGroups-Remote-IP: 192.16.0.25
From: Gary Grider <ggrider@lanl.gov>
Subject: Re: [pnfs-reqs] RE: [pnfs-ops] delegation arguments summary
X-Yahoo-Group-Post: member; u=169341185
X-Yahoo-Profile: ggriderpnfs

At 02:41 PM 12/22/2003 -0500, Halevy, Benny wrote:

> > > * layout delegation revocation (and enforcement of)
> > >   This issue is orthogonal. We dicussed volatile file handles, OSD
> > >   capabilities, and SAN LUN mapping techniques.
> > >
> >
> > Almost orthogonal. There is a subtle problem of sharing layout delegations if one of clientts is doing writes or appends.
>
> This falls under CW (concurrent write) sharing since there is one or more writers.
> By saying "this issue is orthogonal" I meant that the mechanism for revoking the
> layout delegation is orthogonal to whether we need a complete new set of
> delegations or extend the current model.
>
> I agree that when the layout changes due to writes, appends, or for any other
> reason the server has to recall layout delegations, at least from those clients
> that requested layout for region that's about to be the changed.  Hopefully,
> all clients behave nicely and their delegations do not have to be revoked.
> You want to revoke the layout delegation from unresponsive clients since allowing
> them to use the stale layout may end up with data corruption.
>
> Speaking of append,  I always thought it'd be really nice to have an NFS APPEND
> operation... This seems like something we can propose right away on nfsv4@ietf.org
> How does people on this list feel about that?
>
> A use case I encountered is a customer that use a shared file as a log and have
> multiple nodes in the cluster appending to that file with some coordination
> (right now, NFSv3 + NLM).  They don't care about ordering of the appended records
> and they even accept records written more than once to the file, but they do care
> about the consistency of each record so writers can't just silently overwrite
> each other.
>
> > The issue is furthermore complicated by the "sparse" layout that we all want to support (do we?)
>
> Can you please turn the details knob on "sparse" layout and maybe give a
> concrete example where this layout make the proposed model fall short?
>
> > > layout delegation:
> > > - returned on READ_IND, WRITE_IND, LAYOUT_DELEG_ASK
> > >
> > > Covers only layout (aggregation header, map, handles/caps).
> > > Optional, recallable, revocable.
> > > Assures the client that the layout information it has will not change.
> >
> > But the layout information may change even in the most trivial single writer case and definitely in RW cases.
>
> Correct, when the layout is about to be changed (a writer calls COMMIT_IND)
> or when there is a write-write conflict (two clients call WRITE_IND for
> overlapping regions) some or all layout delegations must be recalled.
>
> > > WRITE yes         client can safely cache read and write data,
> > >                   serve opens, and locks locally and can perform
> > >                   out-of-band or server reads and writes.
> > At least this requires mapping updates for block storage.
> > For those souls that want strict local-FS semantics (UNIX)  cache and map invalidations can be a side-effect of the byte-range locking mechanism.
>
> This sounds like something that falls into the distributed cache coherency
> realm - meaning multiple clients have a CW data delegation and a layout delegation.
> My assumption was that in this case the logical block map changes
> rarely when the clients are writing in place, otherwise they should fall back to
> writing through the server.  


As long as there is a way to get concurrent write to scale with reasonable behavior,
like non overlapped regions and any other reasonable promises.  I suppose we need
to pin down what those reasonable promises are.

Gary


> Having an efficient distributed cache coherency
> mechanism in NFS seems to me like a stretch but it's worth a discussion to see
> if block based SAN filesystems can or can't live without it.
>
> Benny
>
>
>
> To unsubscribe from this group, send an email to:
> pnfs-reqs-unsubscribe@yahoogroups.com
>
>
>
>
> Yahoo! Groups Links
>
>     * To visit your group on the web, go to:
>     * http://groups.yahoo.com/group/pnfs-reqs/
>     *  
>     * To unsubscribe from this group, send an email to:
>     * pnfs-reqs-unsubscribe@yahoogroups.com
>     *  
>     * Your use of Yahoo! Groups is subject to the Yahoo! Terms of Service. 

From black_david@emc.com Tue Dec 23 12:33:03 2003
Return-Path: <Black_David@emc.com>
X-Sender: Black_David@emc.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 82796 invoked from network); 23 Dec 2003 20:33:03 -0000
Received: from unknown (66.218.66.216)
by m20.grp.scd.yahoo.com with QMQP; 23 Dec 2003 20:33:03 -0000
Received: from unknown (HELO mxic2.corp.emc.com) (128.221.12.9)
by mta1.grp.scd.yahoo.com with SMTP; 23 Dec 2003 20:33:02 -0000
Received: by mxic2.corp.emc.com with Internet Mail Service (5.5.2653.19)
id <ZHD48G6J>; Tue, 23 Dec 2003 15:33:02 -0500
Message-ID: <B459CE1AFFC52D4688B2A5B842CA35EA7A53E4@corpmx14.corp.emc.com>
To: pnfs-reqs@yahoogroups.com
Date: Tue, 23 Dec 2003 15:32:59 -0500
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2653.19)
Content-Type: multipart/mixed;
boundary="----_=_NextPart_000_01C3C993.F4B068F2"
X-eGroups-Remote-IP: 128.221.12.9
From: black_david@emc.com
Subject: RE: NEPS-REQS: getting started
X-Yahoo-Group-Post: member; u=82420288
X-Yahoo-Profile: dlb237

ADVERTISEMENT
Garth Gibson wrote:

> > The RDDP problem statement is similar and dissimilar to what we are
> > doing. It is similar in that it is about higher performance, which
> > always turns out to be cost-performance. It is dissimilar in that it
> > was fighting an uphill battle to get RDMA into the IETF, while we are
> > looking at no preconceived support or opposition in the IETF (that I
> > am aware of). And it is dissimilar in that what we are proposing
> > helps in the manageability of federated systems, which is not really a
> > performance issue.
> >
> > I followed the RDDP example closely because it was easy -- our
> > arguments on strictly bandwidth are at least as strong, in my opinion.
> > And because I am not certain how to predict the IETF management's
> > reaction to a manageability argument. And the standardized client
> > code argument, although very import to some of us, seemed outside my
> > notion of the IETF scope.
> >
> > Perhaps those with more experience selling ideas to the IETF could
> > educate us? Should we focus on a small number of the most easily
> > demonstrated problems or fill the problem statement out with all the
> > problems we can contribute to solving?

Having been heavily involved in getting both IPS and RDDP work underway
in the IETF, I have a few observations:

- A problem statement draft is a good thing to have, but the folks in
charge of the IETF are looking for a concise summary of what the
problem is, how to go about solving it, and **why** the IETF should
solve it. The latter is of particular importance, as I'll
explain shortly.
- I've attached a slide deck that I used for RDDP at the Spring 2002
IETF BOF on this topic. This sort of "elevator pitch" style
coverage of the topics is needed in addition to the more in-depth
academic approach that is in the RDDP problem statement.
- Goals and battles need to be chosen carefully. One of the things
that delayed RDDP work is that the RDDP proponents were
absolutely
convinced that they needed to change TCP, and hence decided to go
to battle with the IETF Transport community which was equally
convinced that TCP should not be changed. In 20/20 hindsight,
this was a mistake, as the IETF Transport community turned out
to be correct that TCP does not require normative changes for RDDP.
- Nonetheless, there is somewhat of an "uphill battle" to be engaged, as
Beepy and/or Spencer described in Ann Arbor - the IETF has grown to
a potentially unwieldy size, and as a consequence has developed a
healthy institutional bias against new work. As a result, it is
necessary to have good reasons not only for why work should be
done, but also why it should be done in the IETF. The fact that
we want to extend an existing IETF protocol (NFSv4) in a way that
can take advantage of another (iSCSI) provides at least two reasons.
Beyond this, there is value in drawing on the IETF's network
expertise
in areas such as security.
- A draft WG statement/scope of work is very important at an early stage,
including not only what we want to do, but what we do *not* want to
do. I tend to view the latter as more important, as a shared view
of what will not be worked on is a significant sign that a technical
community has coalesced around a common effort and goals. For
example,
there are fairly strong statements about work that is out of scope
in
both the IPS and RDDP charters, and as a WG chair, I've found those
statements useful from time to time ...

I hope this helps,
--David
----------------------------------------------------
David L. Black, Senior Technologist
EMC Corporation, 176 South St., Hopkinton, MA 01748
+1 (508) 293-7953 FAX: +1 (508) 293-7786
black_david@emc.com Mobile: +1 (978) 394-7754
----------------------------------------------------



Attachment (not stored)
ROI-Problem-Scenario-0302.ppt
Type: application/vnd.ms-powerpoint

From black_david@emc.com Tue Dec 23 13:34:43 2003
Return-Path: <Black_David@emc.com>
X-Sender: Black_David@emc.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 93561 invoked from network); 23 Dec 2003 21:34:41 -0000
Received: from unknown (66.218.66.218)
by m2.grp.scd.yahoo.com with QMQP; 23 Dec 2003 21:34:41 -0000
Received: from unknown (HELO mxic2.corp.emc.com) (128.221.12.9)
by mta3.grp.scd.yahoo.com with SMTP; 23 Dec 2003 21:34:41 -0000
Received: by mxic2.corp.emc.com with Internet Mail Service (5.5.2653.19)
id <ZHD48JB9>; Tue, 23 Dec 2003 16:34:40 -0500
Message-ID: <B459CE1AFFC52D4688B2A5B842CA35EA7A53E5@corpmx14.corp.emc.com>
To: pnfs-reqs@yahoogroups.com, pnfs-ops@yahoogroups.com
Date: Tue, 23 Dec 2003 16:34:32 -0500
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2653.19)
Content-Type: text/plain
X-eGroups-Remote-IP: 128.221.12.9
From: black_david@emc.com
Subject: Re: pNFS Discussion Summary 1: Caching and Delegations
X-Yahoo-Group-Post: member; u=82420288
X-Yahoo-Profile: dlb237

ADVERTISEMENT
I've split this commentary on Garth's issues into two categories.
This is about caching, delegations, and layout info.

> > [1.2 Cache consistency]: NFSv4 delegations are not about client cache
> > consistency; does out-of-band access require stronger cache
> > consistency than NFSv4 provides

With a little care in defining the protocol extensions, this issue
can be left to server implementers, unless one wants to take the
(silly, IMHO) position that the protocol should be incapable of
providing stronger cache consistency.

HighRoad uses the same FMP protocol to provide both NFS-style
close-to-open consistency for NFS clients and the stronger forms
of consistency required by CIFS - as long as the server knows what
clients have which access rights to what blocks, cache consistency
strength comes down to server implementation decisions about
what outstanding access rights conflict with a new request. We've
actually built server prototypes that provide stronger consistency
for NFS without change to either the FMP protocol or clients, but
the shipped product only provides NFS-style consistency for NFS.

> > [1.3 Delegation promotion & reacquisition]: must/should NFSv4 offer
> > mechanisms for clients to possess a delegations more than once per open
> >
> > Delegations in NFSv4 are new, and came with significant concern about
> > lots of complexity for not much performance, as they may do as little
> > as avoid the client waiting for one round trip to the server on open.
> > So, as described above with respect to cache consistency, the
> > limitations on delegations can mean great difficulties for clients
> > having performance requirements calling for out-of-band access mostly,
> > or exclusively.

Yes, and this is a strong reason for separating "layout" delegations from
the existing "data" delegations, IMHO. Consider a web or video server
that is caching file opens for performance reasons - if updating the
content underneath the server makes it impossible to get the direct
access ("layout") delegations back, the result is that one has to shut
down and restart all the servers after the content update in order to
restore performance. The sysadmin responsible for this annoying
work will want to tar-and-feather the system designers who made
it necessary (that would be us if we get this wrong ...).

> > [1.4 Layout delegations]: can/should layout metadata "ride" on NFSv4
> > delegations or are new "layout" delegations needed

New "layout" delegations are needed for clean separation of functionality,
and so that "layout" delegations can be designed for direct access
requirements. See [1.3] above.

> > [1.5 Concurrent write]: write delegations now are held by exactly one
> > client, if any; should/must NFS support multiple clients holding
> > concurrent layout delegations.

I understand the value of this to the self-coordinating HPC applications,
but would like to see this functionality specified (assuming it is
specified) as a cleanly separable option, as I think the desire to
self-coordinate a shared write delegation will be limited to a small
number of application spaces, like HPC. I also note Gary's comment
that it's sufficient for parallel write to work in the non-overlapping
case, which does not require any new concurrent write delegation as
long as each client can hold an exclusive write delegation for its range.

Thanks,
--David
----------------------------------------------------
David L. Black, Senior Technologist
EMC Corporation, 176 South St., Hopkinton, MA 01748
+1 (508) 293-7953 FAX: +1 (508) 293-7786
black_david@emc.com Mobile: +1 (978) 394-7754
----------------------------------------------------


From black_david@emc.com Tue Dec 23 13:35:57 2003
Return-Path: <Black_David@emc.com>
X-Sender: Black_David@emc.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 44163 invoked from network); 23 Dec 2003 21:35:56 -0000
Received: from unknown (66.218.66.216)
by m17.grp.scd.yahoo.com with QMQP; 23 Dec 2003 21:35:56 -0000
Received: from unknown (HELO mxic2.corp.emc.com) (128.221.12.9)
by mta1.grp.scd.yahoo.com with SMTP; 23 Dec 2003 21:35:55 -0000
Received: by mxic2.corp.emc.com with Internet Mail Service (5.5.2653.19)
id <ZHD48JDT>; Tue, 23 Dec 2003 16:35:54 -0500
Message-ID: <B459CE1AFFC52D4688B2A5B842CA35EA7A53E6@corpmx14.corp.emc.com>
To: pnfs-reqs@yahoogroups.com, pnfs-ops@yahoogroups.com
Date: Tue, 23 Dec 2003 16:35:52 -0500
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2653.19)
Content-Type: text/plain
X-eGroups-Remote-IP: 128.221.12.9
From: black_david@emc.com
Subject: Re: pNFS Discussion Summary 1: Functionality
X-Yahoo-Group-Post: member; u=82420288
X-Yahoo-Profile: dlb237

ADVERTISEMENT
I've split this commentary on Garth's issues into two categories.
This is about a couple of topics that I would classify as desirable,
but not mandatory functionality.

> > [1.6 Map revocation]: can/must the NFS server be able to revoke a
> > client's use of a map, and enforce no future use (fence off the map)

[... snip ...]

> > Some would say that this is going to end up being a differentiating
> > property of the choice of underlying data server. For example, many
> > would say that in systems that allow out-of-band block access, the
> > client machines must be trustworthy to respect the delegation recall
> > message (and lease timeouts). Others would object to this weakening
> > of the NFS server integrity.

I tend to take the former position, as if one cannot fence off client
access, not allowing access to untrustworthy clients becomes a fallback.
In the block world, while mechanisms exist to fence off access, standard
means of invoking them are somewhat immature.

> > [1.8 NTFS application semantics]: applications coded to NTFS semantics
> > are different from those coded to POSIX and UNIX semantics

IMHO, this is an orthogonal tarpit we should stay out of. I strongly
believe that trying to extend NFSv4 so it can be just as good as CIFS
for applications coded to Windows APIs should be someone else's problem.

Thanks,
--David

----------------------------------------------------
David L. Black, Senior Technologist
EMC Corporation, 176 South St., Hopkinton, MA 01748
+1 (508) 293-7953 FAX: +1 (508) 293-7786
black_david@emc.com Mobile: +1 (978) 394-7754
----------------------------------------------------

From black_david@emc.com Tue Dec 23 14:00:01 2003
Return-Path: <Black_David@emc.com>
X-Sender: Black_David@emc.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 67601 invoked from network); 23 Dec 2003 21:59:57 -0000
Received: from unknown (66.218.66.172)
by m9.grp.scd.yahoo.com with QMQP; 23 Dec 2003 21:59:57 -0000
Received: from unknown (HELO MAHO3MSX2.corp.emc.com) (128.221.11.32)
by mta4.grp.scd.yahoo.com with SMTP; 23 Dec 2003 22:00:00 -0000
Received: by maho3msx2.corp.emc.com with Internet Mail Service (5.5.2653.19)
id <ZHDM1SA4>; Tue, 23 Dec 2003 16:59:59 -0500
Message-ID: <B459CE1AFFC52D4688B2A5B842CA35EA7A53E8@corpmx14.corp.emc.com>
To: pnfs-ops@yahoogroups.com
Cc: pnfs-reqs@yahoogroups.com
Date: Tue, 23 Dec 2003 16:59:55 -0500
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2653.19)
Content-Type: text/plain
X-eGroups-Remote-IP: 128.221.11.32
From: black_david@emc.com
Subject: Avoiding Delegation Recall
X-Yahoo-Group-Post: member; u=82420288
X-Yahoo-Profile: dlb237

Dave Noveck writes:

> I've been wondering whether we could avoid the recall in many cases in
which the
> layout is changing. I know this sounds like I've lost my mind (so what
else is
> new?) but hear me out.
>
> The idea is the layout delegation gives you the ability to rely on mapped
areas
> but not holes or areas past the eof, and that correspondingly converting
an area
> from a hole to being mapped should not necessitate recall of the layout
delegation.
> This would force some complexity in the case in which were about to read
from one
> of the data servers and found something unmappeed but it would mean that
layout
> delegations would not need to be recalled in many common cases.

That depends on the consistency model. For NFS-level consistency, I believe
that returning zeroes for a hole that another client has filled is allowed
by the consistency model, but this "negative caching" behavior is not
exactly
common. If one wants to be able to support stronger consistency, one must
be able to recall the (non-)layout delegation after the hole fill in order
to
force the other client to see the newly written data. Writing data that
moves
EOF is similar, but there are some subtleties in that EOF changes are not
identical to cache consistency. I think (and hope in the case of EOF) that
all of this falls under my previous comment that with a little attention to
detail in specification of the protocol, we can make cache consistency
solely
a server implementation decision (implementer picks model, protocol can
support
all the interesting ones). I strongly prefer that approach because I
believe
consistency model debates to be an attractive tarpit ...

Thanks,
--David
----------------------------------------------------
David L. Black, Senior Technologist
EMC Corporation, 176 South St., Hopkinton, MA 01748
+1 (508) 293-7953 FAX: +1 (508) 293-7786
black_david@emc.com Mobile: +1 (978) 394-7754
----------------------------------------------------

From julian_satran@il.ibm.com Fri Dec 26 01:36:19 2003
Return-Path: <Julian_Satran@il.ibm.com>
X-Sender: Julian_Satran@il.ibm.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 67407 invoked from network); 26 Dec 2003 09:36:18 -0000
Received: from unknown (66.218.66.218)
by m17.grp.scd.yahoo.com with QMQP; 26 Dec 2003 09:36:18 -0000
Received: from unknown (HELO mtagate2.de.ibm.com) (195.212.29.151)
by mta3.grp.scd.yahoo.com with SMTP; 26 Dec 2003 09:36:17 -0000
Received: from d12relay01.megacenter.de.ibm.com (d12relay01.megacenter.de.ibm.com [9.149.165.180] (may be forged))
by mtagate2.de.ibm.com (8.12.10/8.12.10) with ESMTP id hBQ9aGHf096778;
Fri, 26 Dec 2003 09:36:16 GMT
Received: from d10ml001.telaviv.ibm.com (d12av02.megacenter.de.ibm.com [9.149.165.228])
by d12relay01.megacenter.de.ibm.com (8.12.10/NCO/VER6.6) with ESMTP id hBQ9aF7Y254838;
Fri, 26 Dec 2003 10:36:15 +0100
In-Reply-To: <30489F1321F5C343ACF6872B2CF7942A05D38733@PIKES.panasas.com>
To: pnfs-reqs@yahoogroups.com
Cc: "'pnfs-ops@yahoogroups.com'" <pnfs-ops@yahoogroups.com>,
"'pnfs-reqs@yahoogroups.com'" <pnfs-reqs@yahoogroups.com>
MIME-Version: 1.0
X-Mailer: Lotus Notes Release 6.5 September 26, 2003
Message-ID: <OFADAAA89B.B9E88D3E-ONC2256E08.002B8E36-C2256E08.0034B9A6@il.ibm.com>
Date: Fri, 26 Dec 2003 11:36:12 +0200
X-MIMETrack: Serialize by Router on D10ML001/10/M/IBM(Release 6.0.2CF2|July 23, 2003) at
26/12/2003 11:36:15,
Serialize complete at 26/12/2003 11:36:15
Content-Type: multipart/alternative; boundary="=_alternative 002D4385C2256E08_="
X-eGroups-Remote-IP: 195.212.29.151
From: Julian Satran <julian_satran@il.ibm.com>
Subject: Re: [pnfs-reqs] RE: [pnfs-ops] delegation arguments summary
X-Yahoo-Group-Post: member; u=64714603

ADVERTISEMENT

Benny and all,

"Halevy, Benny" <bhalevy@panasas.com> wrote on 22/12/2003 21:41:53:

>
> > > * layout delegation revocation (and enforcement of)
> > >   This issue is orthogonal. We dicussed volatile file handles, OSD
> > >   capabilities, and SAN LUN mapping techniques.
> > >
> >
> > Almost orthogonal. There is a subtle problem of sharing layout
> delegations if one of clientts is doing writes or appends.
>
> This falls under CW (concurrent write) sharing since there is one or
> more writers.
> By saying "this issue is orthogonal" I meant that the mechanism for revoking the
> layout delegation is orthogonal to whether we need a complete new set of
> delegations or extend the current model.
>
> I agree that when the layout changes due to writes, appends, or for any other
> reason the server has to recall layout delegations, at least from those clients
> that requested layout for region that's about to be the changed.  Hopefully,
> all clients behave nicely and their delegations do not have to be revoked.
> You want to revoke the layout delegation from unresponsive clients
> since allowing
> them to use the stale layout may end up with data corruption.
>
> Speaking of append,  I always thought it'd be really nice to have an NFS APPEND
> operation... This seems like something we can propose right away on
> nfsv4@ietf.org
> How does people on this list feel about that?
>

I agree that supporting append is important data base and message queuing use frequently logs but so do many simple commercial applications.
 
> A use case I encountered is a customer that use a shared file as a log and have
> multiple nodes in the cluster appending to that file with some coordination
> (right now, NFSv3 + NLM).  They don't care about ordering of the
> appended records
> and they even accept records written more than once to the file, but
> they do care
> about the consistency of each record so writers can't just silently overwrite
> each other.
>
> > The issue is furthermore complicated by the "sparse" layout that we
> all want to support (do we?)
>
> Can you please turn the details knob on "sparse" layout and maybe give a
> concrete example where this layout make the proposed model fall short?
>

If you consider very large files very large files sparsely populated and being used by a well coordianted set of applications it makes more sense to have mapping information use and caching coordinated. The longer I think about it the more it looks that mapping and caching information are not distinct pieces of information and we better try to treat them as such.
 
> > > layout delegation:
> > > - returned on READ_IND, WRITE_IND, LAYOUT_DELEG_ASK
> > >
> > > Covers only layout (aggregation header, map, handles/caps).
> > > Optional, recallable, revocable.
> > > Assures the client that the layout information it has will not change.
> >
> > But the layout information may change even in the most trivial
> single writer case and definitely in RW cases.
>
> Correct, when the layout is about to be changed (a writer calls COMMIT_IND)
> or when there is a write-write conflict (two clients call WRITE_IND for
> overlapping regions) some or all layout delegations must be recalled.
>
> > > WRITE yes         client can safely cache read and write data,
> > >                   serve opens, and locks locally and can perform
> > >                   out-of-band or server reads and writes.
> > At least this requires mapping updates for block storage.
> > For those souls that want strict local-FS semantics (UNIX)  cache
> and map invalidations can be a side-effect of the byte-range locking mechanism.
>
> This sounds like something that falls into the distributed cache coherency
> realm - meaning multiple clients have a CW data delegation and a
> layout delegation.
> My assumption was that in this case the logical block map changes
> rarely when the clients are writing in place, otherwise they should fall back to
> writing through the server.  Having an efficient distributed cache coherency
> mechanism in NFS seems to me like a stretch but it's worth a discussion to see
> if block based SAN filesystems can or can't live without it.
>

I think that if we work towards common structures for mapping and caching we might end up letting the implementer or user decide about the consistency level he wants and support all. We certainly can't afford to ignore those that require consistency beyond the close-to-open level conventionally associated with NFS especially when there are distributed or cluster file-systems that got their customers use it today (GPFS, SAN-FS).

> Benny
>
>
> To unsubscribe from this group, send an email to:
> pnfs-reqs-unsubscribe@yahoogroups.com
>
>  
>
> Yahoo! Groups Links
>
> To visit your group on the web, go to:
>  http://groups.yahoo.com/group/pnfs-reqs/
>
> To unsubscribe from this group, send an email to:
>  pnfs-reqs-unsubscribe@yahoogroups.com
>
> Your use of Yahoo! Groups is subject to:
>  http://docs.yahoo.com/info/terms/
>
> 

From julian_satran@il.ibm.com Fri Dec 26 01:36:27 2003
Return-Path: <Julian_Satran@il.ibm.com>
X-Sender: Julian_Satran@il.ibm.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 74476 invoked from network); 26 Dec 2003 09:36:24 -0000
Received: from unknown (66.218.66.166)
by m12.grp.scd.yahoo.com with QMQP; 26 Dec 2003 09:36:24 -0000
Received: from unknown (HELO mtagate1.de.ibm.com) (195.212.29.150)
by mta5.grp.scd.yahoo.com with SMTP; 26 Dec 2003 09:36:23 -0000
Received: from d12relay02.megacenter.de.ibm.com (d12relay02.megacenter.de.ibm.com [9.149.165.196] (may be forged))
by mtagate1.de.ibm.com (8.12.10/8.12.10) with ESMTP id hBQ9aJjB127026;
Fri, 26 Dec 2003 09:36:19 GMT
Received: from d10ml001.telaviv.ibm.com (d12av02.megacenter.de.ibm.com [9.149.165.228])
by d12relay02.megacenter.de.ibm.com (8.12.10/NCO/VER6.6) with ESMTP id hBQ9aHO5258208;
Fri, 26 Dec 2003 10:36:18 +0100
In-Reply-To: <5.2.0.9.2.20031222125146.018b3cc0@cic-mail.lanl.gov>
To: Gary Grider <ggrider@lanl.gov>
Cc: "'pnfs-ops@yahoogroups.com'" <pnfs-ops@yahoogroups.com>,
pnfs-reqs@yahoogroups.com
MIME-Version: 1.0
X-Mailer: Lotus Notes Release 6.5 September 26, 2003
Message-ID: <OF0448A63E.49581E6E-ONC2256E08.002D69E8-C2256E08.0034BA46@il.ibm.com>
Date: Fri, 26 Dec 2003 11:36:14 +0200
X-MIMETrack: Serialize by Router on D10ML001/10/M/IBM(Release 6.0.2CF2|July 23, 2003) at
26/12/2003 11:36:18,
Serialize complete at 26/12/2003 11:36:18
Content-Type: multipart/alternative; boundary="=_alternative 002DC7EBC2256E08_="
X-eGroups-Remote-IP: 195.212.29.150
From: Julian Satran <julian_satran@il.ibm.com>
Subject: Re: [pnfs-reqs] RE: [pnfs-ops] delegation arguments summary
X-Yahoo-Group-Post: member; u=64714603

I agree with Gary that handling efficiently the "good-path" (e.g., concurrent writers with non-overlapping regions, or single writer with readers needing only close-to-open consistency) is essential. To me it looks as all those could be better handled if we could approach mapping and caching concurrently.

Regards,
Julo


Gary Grider <ggrider@lanl.gov>

22/12/2003 21:53
	
To
	pnfs-reqs@yahoogroups.com, Julian Satran/Haifa/IBM@IBMIL, "'pnfs-ops@yahoogroups.com'" <pnfs-ops@yahoogroups.com>
cc
	"'pnfs-reqs@yahoogroups.com'" <pnfs-reqs@yahoogroups.com>
Subject
	Re: [pnfs-reqs] RE: [pnfs-ops] delegation arguments summary

	




At 02:41 PM 12/22/2003 -0500, Halevy, Benny wrote:

> > * layout delegation revocation (and enforcement of)
> >   This issue is orthogonal. We dicussed volatile file handles, OSD
> >   capabilities, and SAN LUN mapping techniques.
> >
>
> Almost orthogonal. There is a subtle problem of sharing layout delegations if one of clientts is doing writes or appends.

This falls under CW (concurrent write) sharing since there is one or more writers.
By saying "this issue is orthogonal" I meant that the mechanism for revoking the
layout delegation is orthogonal to whether we need a complete new set of
delegations or extend the current model.

I agree that when the layout changes due to writes, appends, or for any other
reason the server has to recall layout delegations, at least from those clients
that requested layout for region that's about to be the changed.  Hopefully,
all clients behave nicely and their delegations do not have to be revoked.
You want to revoke the layout delegation from unresponsive clients since allowing
them to use the stale layout may end up with data corruption.

Speaking of append,  I always thought it'd be really nice to have an NFS APPEND
operation... This seems like something we can propose right away on nfsv4@ietf.org
How does people on this list feel about that?

A use case I encountered is a customer that use a shared file as a log and have
multiple nodes in the cluster appending to that file with some coordination
(right now, NFSv3 + NLM).  They don't care about ordering of the appended records
and they even accept records written more than once to the file, but they do care
about the consistency of each record so writers can't just silently overwrite
each other.

> The issue is furthermore complicated by the "sparse" layout that we all want to support (do we?)

Can you please turn the details knob on "sparse" layout and maybe give a
concrete example where this layout make the proposed model fall short?

> > layout delegation:
> > - returned on READ_IND, WRITE_IND, LAYOUT_DELEG_ASK
> >
> > Covers only layout (aggregation header, map, handles/caps).
> > Optional, recallable, revocable.
> > Assures the client that the layout information it has will not change.
>
> But the layout information may change even in the most trivial single writer case and definitely in RW cases.

Correct, when the layout is about to be changed (a writer calls COMMIT_IND)
or when there is a write-write conflict (two clients call WRITE_IND for
overlapping regions) some or all layout delegations must be recalled.

> > WRITE yes         client can safely cache read and write data,
> >                   serve opens, and locks locally and can perform
> >                   out-of-band or server reads and writes.
> At least this requires mapping updates for block storage.
> For those souls that want strict local-FS semantics (UNIX)  cache and map invalidations can be a side-effect of the byte-range locking mechanism.

This sounds like something that falls into the distributed cache coherency
realm - meaning multiple clients have a CW data delegation and a layout delegation.
My assumption was that in this case the logical block map changes
rarely when the clients are writing in place, otherwise they should fall back to
writing through the server.  

As long as there is a way to get concurrent write to scale with reasonable behavior,
like non overlapped regions and any other reasonable promises.  I suppose we need
to pin down what those reasonable promises are.

Gary


Having an efficient distributed cache coherency
mechanism in NFS seems to me like a stretch but it's worth a discussion to see
if block based SAN filesystems can or can't live without it.

Benny



To unsubscribe from this group, send an email to:
pnfs-reqs-unsubscribe@yahoogroups.com




Yahoo! Groups Links

    * To visit your group on the web, go to:
    * http://groups.yahoo.com/group/pnfs-reqs/
    *  
    * To unsubscribe from this group, send an email to:
    * pnfs-reqs-unsubscribe@yahoogroups.com
    *  
    * Your use of Yahoo! Groups is subject to the Yahoo! Terms of Service. 

From julian_satran@il.ibm.com Fri Dec 26 01:36:45 2003
Return-Path: <Julian_Satran@il.ibm.com>
X-Sender: Julian_Satran@il.ibm.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 68436 invoked from network); 26 Dec 2003 09:36:44 -0000
Received: from unknown (66.218.66.217)
by m17.grp.scd.yahoo.com with QMQP; 26 Dec 2003 09:36:44 -0000
Received: from unknown (HELO mtagate3.de.ibm.com) (195.212.29.152)
by mta2.grp.scd.yahoo.com with SMTP; 26 Dec 2003 09:36:43 -0000
Received: from d12relay02.megacenter.de.ibm.com (d12relay02.megacenter.de.ibm.com [9.149.165.196] (may be forged))
by mtagate3.de.ibm.com (8.12.10/8.12.10) with ESMTP id hBQ9agn0122204;
Fri, 26 Dec 2003 09:36:42 GMT
Received: from d10ml001.telaviv.ibm.com (d12av02.megacenter.de.ibm.com [9.149.165.228])
by d12relay02.megacenter.de.ibm.com (8.12.10/NCO/VER6.6) with ESMTP id hBQ9afO5284158;
Fri, 26 Dec 2003 10:36:42 +0100
In-Reply-To: <B459CE1AFFC52D4688B2A5B842CA35EA7A53E5@corpmx14.corp.emc.com>
To: pnfs-ops@yahoogroups.com
Cc: pnfs-ops@yahoogroups.com, pnfs-reqs@yahoogroups.com
MIME-Version: 1.0
X-Mailer: Lotus Notes Release 6.5 September 26, 2003
Message-ID: <OF1E391D2A.8543882B-ONC2256E08.0032EBD9-C2256E08.0034C3ED@il.ibm.com>
Date: Fri, 26 Dec 2003 11:36:39 +0200
X-MIMETrack: Serialize by Router on D10ML001/10/M/IBM(Release 6.0.2CF2|July 23, 2003) at
26/12/2003 11:36:41,
Serialize complete at 26/12/2003 11:36:41
Content-Type: multipart/alternative; boundary="=_alternative 0033FFCAC2256E08_="
X-eGroups-Remote-IP: 195.212.29.152
From: Julian Satran <julian_satran@il.ibm.com>
Subject: Re: [pnfs-ops] Re: pNFS Discussion Summary 1: Caching and Delegations
X-Yahoo-Group-Post: member; u=64714603

David & all,

black_david@emc.com wrote on 23/12/2003 23:34:32:

> I've split this commentary on Garth's issues into two categories.
> This is about caching, delegations, and layout info.
>
> > > [1.2 Cache consistency]: NFSv4 delegations are not about client cache
> > > consistency; does out-of-band access require stronger cache
> > > consistency than NFSv4 provides
>
> With a little care in defining the protocol extensions, this issue
> can be left to server implementers, unless one wants to take the
> (silly, IMHO) position that the protocol should be incapable of
> providing stronger cache consistency.
>

I agree. As a statement of direction we should say that the protocol should be capable of providing all level of consistency close-to-open or UNIX and it should be a client/server - needed/implemented decision on what to use. The issue we may want to discuss is what must be provided as a minimum in a compliant client/server.

> HighRoad uses the same FMP protocol to provide both NFS-style
> close-to-open consistency for NFS clients and the stronger forms
> of consistency required by CIFS - as long as the server knows what
> clients have which access rights to what blocks, cache consistency
> strength comes down to server implementation decisions about
> what outstanding access rights conflict with a new request.  We've
> actually built server prototypes that provide stronger consistency
> for NFS without change to either the FMP protocol or clients, but
> the shipped product only provides NFS-style consistency for NFS.
>
> > > [1.3 Delegation promotion & reacquisition]: must/should NFSv4 offer
> > > mechanisms for clients to possess a delegations more than once per open
> > >
> > > Delegations in NFSv4 are new, and came with significant concern about
> > > lots of complexity for not much performance, as they may do as little
> > > as avoid the client waiting for one round trip to the server on open.  
> > > So, as described above with respect to cache consistency, the
> > > limitations on delegations can mean great difficulties for clients
> > > having performance requirements calling for out-of-band access mostly,
> > > or exclusively.
>
> Yes, and this is a strong reason for separating "layout" delegations from
> the existing "data" delegations, IMHO.  Consider a web or video server
> that is caching file opens for performance reasons - if updating the
> content underneath the server makes it impossible to get the direct
> access ("layout") delegations back, the result is that one has to shut
> down and restart all the servers after the content update in order to
> restore performance.  The sysadmin responsible for this annoying
> work will want to tar-and-feather the system designers who made
> it necessary (that would be us if we get this wrong ...).
>

I have my doubts that this makes sense as I could not find a case in which those are not strongly related and doing them separately will force us into considering a myriad of invalid combinations and failure modes. The only good argument for doing them separately is that they are easier to implement and understand separately but this might be misleading (it may increase substantially the exception handling). This is why I would refrain from suggesting this as a requirement now.

> > > [1.4 Layout delegations]: can/should layout metadata "ride" on NFSv4
> > > delegations or are new "layout" delegations needed
>
> New "layout" delegations are needed for clean separation of functionality,
> and so that "layout" delegations can be designed for direct access
> requirements.  See [1.3] above.
>
> > > [1.5 Concurrent write]: write delegations now are held by exactly one
> > > client, if any; should/must NFS support multiple clients holding
> > > concurrent layout delegations.
>
> I understand the value of this to the self-coordinating HPC applications,
> but would like to see this functionality specified (assuming it is
> specified) as a cleanly separable option, as I think the desire to
> self-coordinate a shared write delegation will be limited to a small
> number of application spaces, like HPC.  I also note Gary's comment
> that it's sufficient for parallel write to work in the non-overlapping
> case, which does not require any new concurrent write delegation as
> long as each client can hold an exclusive write delegation for its range.
>
> Thanks,
> --David
> ----------------------------------------------------
> David L. Black, Senior Technologist
> EMC Corporation, 176 South St., Hopkinton, MA  01748
> +1 (508) 293-7953             FAX: +1 (508) 293-7786
> black_david@emc.com        Mobile: +1 (978) 394-7754
> ----------------------------------------------------
>
> To unsubscribe from this group, send an email to:
> pnfs-ops-unsubscribe@yahoogroups.com
>
>  
>
> Yahoo! Groups Links
>
> To visit your group on the web, go to:
>  http://groups.yahoo.com/group/pnfs-ops/
>
> To unsubscribe from this group, send an email to:
>  pnfs-ops-unsubscribe@yahoogroups.com
>
> Your use of Yahoo! Groups is subject to:
>  http://docs.yahoo.com/info/terms/
>
> 

From julian_satran@il.ibm.com Mon Dec 29 02:11:03 2003
Return-Path: <Julian_Satran@il.ibm.com>
X-Sender: Julian_Satran@il.ibm.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 82439 invoked from network); 29 Dec 2003 10:11:01 -0000
Received: from unknown (66.218.66.218)
by m14.grp.scd.yahoo.com with QMQP; 29 Dec 2003 10:11:01 -0000
Received: from unknown (HELO mtagate7.de.ibm.com) (195.212.29.156)
by mta3.grp.scd.yahoo.com with SMTP; 29 Dec 2003 10:11:00 -0000
Received: from d12relay01.megacenter.de.ibm.com (d12relay01.megacenter.de.ibm.com [9.149.165.180] (may be forged))
by mtagate7.de.ibm.com (8.12.10/8.12.10) with ESMTP id hBTAAqwj127908;
Mon, 29 Dec 2003 10:10:52 GMT
Received: from d10ml001.telaviv.ibm.com (d12av02.megacenter.de.ibm.com [9.149.165.228])
by d12relay01.megacenter.de.ibm.com (8.12.10/NCO/VER6.6) with ESMTP id hBTAApcZ253454;
Mon, 29 Dec 2003 11:10:51 +0100
In-Reply-To: <F3F660FC-311C-11D8-BE66-000393754F12@panasas.com>
To: pnfs-ops@yahoogroups.com
Cc: Garth Gibson <garth@panasas.com>, pnfs-ops@yahoogroups.com,
pnfs-reqs@yahoogroups.com
MIME-Version: 1.0
X-Mailer: Lotus Notes Release 6.5 September 26, 2003
Message-ID: <OF251283F9.854940E3-ONC2256E0B.00359657-C2256E0B.0037E963@il.ibm.com>
Date: Mon, 29 Dec 2003 12:10:49 +0200
X-MIMETrack: Serialize by Router on D10ML001/10/M/IBM(Release 6.0.2CF2|July 23, 2003) at
29/12/2003 12:10:51,
Serialize complete at 29/12/2003 12:10:51
Content-Type: multipart/alternative; boundary="=_alternative 0037E92AC2256E0B_="
X-eGroups-Remote-IP: 195.212.29.156
From: Julian Satran <julian_satran@il.ibm.com>
Subject: Re: [pnfs-ops] [minimalism] pNFS Discussion Summary 1: 12/18/03
X-Yahoo-Group-Post: member; u=64714603

Garth and All,



Garth Gibson <garth@panasas.com> wrote on 18/12/2003 07:42:22:




> ----------------------------------------
>
> [1.0 Minimalism]: How much additional functionality do we sacrifice to
> limit the changes we seek in NFSv4?
>
> On one hand, some have said that getting to one true file system, with
> the high performance and the manageability of federated systems that
> might come with out-of-band access, is worth not matching *every*
> feature of all existing out-of-band file systems with this first set of
> extensions to NFSv4.  That we should bite off what we can do quickly,
> correctly, with a clear incremental value to NFSv4, and roadmap more
> aggressive changes that could bog us down, or introduce so much
> complexity that interoperability becomes elusive.  And that we should
> be mindful of the reception we may get from the IETF NFS working group
> if we *appear* to use out-of-band as an excuse to ask for a brace of
> changes in other aspects of NFSv4.
>
> On the other hand, the other out-of-band file systems that are
> inspiring the evolution of NFSv4 have customers that may not accept any
> backward sets in an evolution to NFSv4.  This could create the need to
> develop, carry and differentiate all the diverse one-off out-of-band
> files systems plus a new out-of-band NFSv4.  Some think it makes more
> sense to go far enough with this first NFSv4 to simplify the
> marketplace by making it reasonable for various vendors to
> deprecate/end-of-life/begin to wean from their proprietary offering.
>
> While it is certainly conceivable that we could be designing a roadmap
> of solutions in detail from the start, communication among standards
> bodies is hard enough without the challenge of designing specs for both
> with and without a requirement.
>
> This is a central issue in defining the requirements for out-of-band
> NFSv4, or at least for defining the scope of the first set of
> extensions.
>
> ----------------------------------------
>

I am afraid that this text makes achieving compliance with existing out-of-band filesytems sound more complex than it might be.
I see several items that we should strive to keep even in a minimalist set of requirements:

    * attribute set rich enough to enable expressing the attributes of the major local-filesytems (Unix brands and Windows)
    * access control that accommodates the access control mechanisms of the major local-filesytems and some of the popular distributed file-systems (AFS?)
    * coherency mechanisms that enable vendors to optionally implement the two major flavor of coherent file access:
          o completely coherent
          o close-to-open coherent


      None of those seem to me as involving major departures from NFSv4.

      Julo 

From andros@citi.umich.edu Mon Dec 29 12:11:13 2003
Return-Path: <andros@citi.umich.edu>
X-Sender: andros@citi.umich.edu
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 99698 invoked from network); 29 Dec 2003 20:11:10 -0000
Received: from unknown (66.218.66.216)
by m5.grp.scd.yahoo.com with QMQP; 29 Dec 2003 20:11:10 -0000
Received: from unknown (HELO citi.umich.edu) (141.211.133.111)
by mta1.grp.scd.yahoo.com with SMTP; 29 Dec 2003 20:11:10 -0000
Received: from citi.umich.edu (citi.umich.edu [141.211.133.111])
by citi.umich.edu (Postfix) with ESMTP
id A223D20806; Mon, 29 Dec 2003 15:11:09 -0500 (EST)
X-Mailer: exmh version 2.5 07/13/2001 with version: MH 6.8.3 #74[UCI]
To: pnfs-ops@yahoogroups.com
Cc: pnfs-reqs@yahoogroups.com, andros@citi.umich.edu
Mime-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Date: Mon, 29 Dec 2003 15:11:09 -0500
Message-Id: <20031229201109.A223D20806@citi.umich.edu>
X-eGroups-Remote-IP: 141.211.133.111
From: "William A.(Andy) Adamson" <andros@citi.umich.edu>
Subject: why not use mandatory byte-range locking
X-Yahoo-Group-Post: member; u=169434965

the discussion of byte-range delegations and cache consistancy provoked this
thought: why not use existing mandatory byte-range locking?

the client opens a file, requests a (mandatory) lock on the region of the file
it's interested in. the resultant lock stateid is passed as an argument to the
READ/WRITE_IND request. we can require a mandatory lock stateid prior to
handing out layout maps for direct i/o. the layout map is 'good 'only for as
long as the byte-range lock.

the mandatory lock protects the layout, so no need for layout delegations.
mandatory locking also allows the client to cache and operate locally on the
locked data region with cache consistancy guarentees.

we already have the byte-range locking code written. so how far does this get
us? does it make sense to start with the locking code instead of the
delegation as far as extenstions?

-->Andy


From ggrider@lanl.gov Mon Dec 29 20:10:53 2003
Return-Path: <ggrider@lanl.gov>
X-Sender: ggrider@lanl.gov
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 30244 invoked from network); 30 Dec 2003 04:10:50 -0000
Received: from unknown (66.218.66.166)
by m11.grp.scd.yahoo.com with QMQP; 30 Dec 2003 04:10:50 -0000
Received: from unknown (HELO mailwasher-b.lanl.gov) (192.16.0.25)
by mta5.grp.scd.yahoo.com with SMTP; 30 Dec 2003 04:10:49 -0000
Received: from mailrelay2.lanl.gov (localhost.localdomain [127.0.0.1])
by mailwasher-b.lanl.gov (8.12.10/8.12.10/(ccn-5)) with ESMTP id hBU4AmfK005317;
Mon, 29 Dec 2003 21:10:49 -0700
Received: from cic-mail.lanl.gov (localhost.localdomain [127.0.0.1])
by mailrelay2.lanl.gov (8.12.10/8.12.10/(ccn-5)) with ESMTP id hBU4AmtS026425;
Mon, 29 Dec 2003 21:10:48 -0700
Received: from cthulu.lanl.gov (vpn-client-136.lanl.gov [128.165.253.136])
by cic-mail.lanl.gov (8.12.10/8.12.10/(ccn-5)) with ESMTP id hBU4AkFR002434;
Mon, 29 Dec 2003 21:10:46 -0700
Message-Id: <5.2.0.9.2.20031229210957.018956f0@cic-mail.lanl.gov>
X-Sender: ggrider@cic-mail.lanl.gov
X-Mailer: QUALCOMM Windows Eudora Version 5.2.0.9
Date: Mon, 29 Dec 2003 21:10:44 -0700
To: pnfs-reqs@yahoogroups.com, pnfs-ops@yahoogroups.com
Cc: pnfs-reqs@yahoogroups.com, andros@citi.umich.edu
In-Reply-To: <20031229201109.A223D20806@citi.umich.edu>
Mime-Version: 1.0
Content-Type: multipart/alternative;
boundary="=====================_3295098==.ALT"
X-Scanned-By: MIMEDefang 2.35
X-eGroups-Remote-IP: 192.16.0.25
From: Gary Grider <ggrider@lanl.gov>
Subject: Re: [pnfs-reqs] why not use mandatory byte-range locking
X-Yahoo-Group-Post: member; u=169341185
X-Yahoo-Profile: ggriderpnfs

ADVERTISEMENT

As long as there is a way to ask for higher level coordination so byte range locks
are not mandatory, default is ok.

Thanks
Gary

At 03:11 PM 12/29/2003 -0500, William A.(Andy) Adamson wrote:

> the discussion of byte-range delegations and cache consistancy provoked this
> thought: why not use existing mandatory byte-range locking?
>
> the client opens a file, requests a (mandatory) lock on the region of the file
> it's interested in. the resultant lock stateid is passed as an argument to the
> READ/WRITE_IND request. we can require a mandatory lock stateid prior to
> handing out layout maps for direct i/o. the layout map is 'good 'only for as
> long as the byte-range lock.
>
> the mandatory lock protects the layout, so no need for layout delegations.
> mandatory locking also allows the client to cache and operate locally on the
> locked data region with cache consistancy guarentees.
>
> we already have the byte-range locking code written. so how far does this get
> us? does it make sense to start with the locking code instead of the
> delegation as far as extenstions?
>
> -->Andy
>
>
>
> To unsubscribe from this group, send an email to:
> pnfs-reqs-unsubscribe@yahoogroups.com
>
>
>
>
> Yahoo! Groups Links
>
>     * To visit your group on the web, go to:
>     * http://groups.yahoo.com/group/pnfs-reqs/
>     *  
>     * To unsubscribe from this group, send an email to:
>     * pnfs-reqs-unsubscribe@yahoogroups.com
>     *  
>     * Your use of Yahoo! Groups is subject to the Yahoo! Terms of Service. 



From mclarty3@llnl.gov Tue Dec 30 09:16:21 2003
Return-Path: <mclarty3@llnl.gov>
X-Sender: mclarty3@llnl.gov
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 28819 invoked from network); 30 Dec 2003 17:16:19 -0000
Received: from unknown (66.218.66.172)
by m9.grp.scd.yahoo.com with QMQP; 30 Dec 2003 17:16:19 -0000
Received: from unknown (HELO smtp-3.llnl.gov) (128.115.41.83)
by mta4.grp.scd.yahoo.com with SMTP; 30 Dec 2003 17:16:19 -0000
Received: from poptop.llnl.gov (localhost [127.0.0.1])
by smtp-3.llnl.gov (8.12.3p2-20030917/8.12.3/LLNL evision: 1.13 $) with ESMTP id hBUHGH9S025152;
Tue, 30 Dec 2003 09:16:17 -0800 (PST)
Received: from POLARBEAR.llnl.gov ([134.9.18.59] verified)
by poptop.llnl.gov (CommuniGate Pro SMTP 4.0.6)
with ESMTP id 33235490; Tue, 30 Dec 2003 09:16:17 -0800
Message-Id: <5.0.0.25.2.20031230083936.02fba428@poptop.llnl.gov>
X-Sender: e002801@poptop.llnl.gov
X-Mailer: QUALCOMM Windows Eudora Version 5.0
Date: Tue, 30 Dec 2003 09:16:16 -0800
To: pnfs-reqs@yahoogroups.com, pnfs-ops@yahoogroups.com
Cc: andros@citi.umich.edu
In-Reply-To: <5.2.0.9.2.20031229210957.018956f0@cic-mail.lanl.gov>
References: <20031229201109.A223D20806@citi.umich.edu>
Mime-Version: 1.0
Content-Type: text/plain; charset="us-ascii"; format=flowed
X-eGroups-Remote-IP: 128.115.41.83
From: Tyce McLarty <mclarty3@llnl.gov>
Subject: Re: [pnfs-reqs] why not use mandatory byte-range locking
X-Yahoo-Group-Post: member; u=169320772

ADVERTISEMENT
I'm sure I do not understand all the subtleties of byte-range delgations
vs. byte-range locking, but I think the essential ingredient we are after
is the ability to use some coordination across thousands of clients, like
Gary says.

A single process in an HPC application frequently needs to access many
discontiguous byte-ranges, but the coordinated group of clients will access
a large contiguous byte-range. I think it was this example that led us to
the idea of layout delegations to begin with.

The key is to keep thinking in terms of many parallel clients, not a single
one.

Thanks,
Tyce

At 09:10 PM 12/29/2003 -0700, Gary Grider wrote:

>As long as there is a way to ask for higher level coordination so byte
>range locks
>are not mandatory, default is ok.
>
>Thanks
>Gary
>
>At 03:11 PM 12/29/2003 -0500, William A.(Andy) Adamson wrote:
>>the discussion of byte-range delegations and cache consistancy provoked this
>>thought: why not use existing mandatory byte-range locking?
>>
>>the client opens a file, requests a (mandatory) lock on the region of the
>>file
>>it's interested in. the resultant lock stateid is passed as an argument
>>to the
>>READ/WRITE_IND request. we can require a mandatory lock stateid prior to
>>handing out layout maps for direct i/o. the layout map is 'good 'only for as
>>long as the byte-range lock.
>>
>>the mandatory lock protects the layout, so no need for layout delegations.
>>mandatory locking also allows the client to cache and operate locally on the
>>locked data region with cache consistancy guarentees.
>>
>>we already have the byte-range locking code written. so how far does this
>>get
>>us? does it make sense to start with the locking code instead of the
>>delegation as far as extenstions?
>>
>>-->Andy
>>
>>
>>
>>To unsubscribe from this group, send an email to:
>>pnfs-reqs-unsubscribe@yahoogroups.com
>>
>>
>>
>>
>>
>>----------
>>Yahoo! Groups Links
>> * To visit your group on the web, go to:
>> *
>> <http://groups.yahoo.com/group/pnfs-reqs/>http://groups.yahoo.com/group/pnfs-reqs/
>>
>> *
>> * To unsubscribe from this group, send an email to:
>> *
>> <mailto:pnfs-reqs-unsubscribe@yahoogroups.com?subject=Unsubscribe>pnfs-reqs-unsubscribe@yahoogroups.com
>>
>> *
>> * Your use of Yahoo! Groups is subject to the
>> <http://docs.yahoo.com/info/terms/>Yahoo! Terms of Service.
>
>Yahoo! Groups Sponsor
>ADVERTISEMENT
>
>
>----------
>Yahoo! Groups Links
> * To visit your group on the web, go to:
> *
> <http://groups.yahoo.com/group/pnfs-reqs/>http://groups.yahoo.com/group/pnfs-reqs/
>
> *
> * To unsubscribe from this group, send an email to:
> *
> <mailto:pnfs-reqs-unsubscribe@yahoogroups.com?subject=Unsubscribe>pnfs-reqs-unsubscribe@yahoogroups.com
>
> *
> * Your use of Yahoo! Groups is subject to the
> <http://docs.yahoo.com/info/terms/>Yahoo! Terms of Service.


From dnoveck@netapp.com Tue Dec 30 12:08:08 2003
Return-Path: <Dave.Noveck@netapp.com>
X-Sender: Dave.Noveck@netapp.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 26134 invoked from network); 30 Dec 2003 20:08:07 -0000
Received: from unknown (66.218.66.167)
by m12.grp.scd.yahoo.com with QMQP; 30 Dec 2003 20:08:07 -0000
Received: from unknown (HELO mx01.netapp.com) (198.95.226.53)
by mta6.grp.scd.yahoo.com with SMTP; 30 Dec 2003 20:08:07 -0000
Received: from frejya.corp.netapp.com (frejya [10.57.157.119])
by mx01.netapp.com (8.12.10/8.12.10/NTAP-1.4) with ESMTP id hBUK86Kw003014;
Tue, 30 Dec 2003 12:08:06 -0800 (PST)
Received: from svlexc01.hq.netapp.com (svlexc01.corp.netapp.com [10.10.22.171])
by frejya.corp.netapp.com (8.12.9/8.12.9/NTAP-1.5) with ESMTP id hBUK86pr015864;
Tue, 30 Dec 2003 12:08:06 -0800 (PST)
content-class: urn:content-classes:message
MIME-Version: 1.0
Content-Type: multipart/alternative;
boundary="----_=_NextPart_001_01C3CF10.A12BC55E"
X-MimeOLE: Produced By Microsoft Exchange V6.0.6249.0
Date: Tue, 30 Dec 2003 12:08:02 -0800
Message-ID: <C8CF60CFC4D8A74E9945E32CF096548AB80A98@silver.nane.netapp.com>
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
Thread-Topic: [pnfs-ops] pNFS Discussion Summary 1: 12/18/03
Thread-Index: AcPIdgEt2ftUYTH2RMiubXEU2ZfttQGc/84A
To: <pnfs-ops@yahoogroups.com>
Cc: <pnfs-reqs@yahoogroups.com>
X-eGroups-Remote-IP: 198.95.226.53
X-eGroups-From: "Noveck, Dave" <Dave.Noveck@netapp.com>
From: "Noveck, Dave" <dnoveck@netapp.com>
Subject: RE: [pnfs-ops] pNFS Discussion Summary 1: 12/18/03
X-Yahoo-Group-Post: member; u=44152831
X-Yahoo-Profile: davidnoveck

It seems legal to me but I'm guessing that there are others that would think differently.
 
I tend to think that it is not a good idea, though.  There are going to be operations which, by their nature, are better done through the metadata server.  A two-byte write which spans multiple data servers is an example.  Another is append-writes, which have been mentioned (by whom I don't remember just now) as a desirable v4 extension, assuming the data to be written is of reasonable size.  In each case, we may create appropriate caching/locking primitives to allow the operation to be done without making any request of the metadata server that is officially denominated an "IO" request.
But can you really argue that this will be the best way for the client to do such operations?  And does it really make sense to force clients to invest the effort in terms of the code do such operations doing the IO with the data server only, when the performance benefit of that is going to be small, or zero, or negative?  You may wind up making as many requests of the meta-data server with the data-server-only approach.  It's just that they won't be IO operations (but instead locking and, in the case of append, getattr operations).

In complicated protocols (and v4 is a complicated protocol and is getting more complicated), there are going to be multiple ways of doing the same thing, which are going to differ in their performance characteristics.  An organization can be reasonably concerned about clients making the wrong choice, just as it is concerned about clients that are making excessive resource demands for other reasons.  There are two issues that I am worried about in taking such a drastic approach as simply refusing to support a valid piece of the protocol, even if that choice is made by the server administrator.  The first is that determining the better choice depends on a lot of variables and that a simple formula governing an option  (e.g. "IO through the metadata server is bad") is unlikely to completely match reality.  The second is that I-don't-like-your-IO-request-so-you-lose is kind of a blunt instrument to deal with the problem.
 
If you have identified some set of bad client practices, you can find the clients doing them, report the appropriate statistics, even, if the issue is critical, artificially give such clients (or specific requests) bad performance in a way that doesn't hurt other clients (unless they are waiting for the first set to do something.  Sigh!), by just delaying processing of their requests by millisecond or two.  That should be enough to preserve metadata-server bandwidth for more worthwhile purposes.  If that's insufficiently discouraging, you can raise the delay.  If you start rejecting requests because you would have done it differently, even if you are correct, you are on the road to creating your own sub-protocol, which is why this kind of thing is worrying, even if legal.
 
 

    -----Original Message-----
    From: Julian Satran [mailto:julian_satran@il.ibm.com]
    Sent: Monday, December 22, 2003 5:26 AM
    To: pnfs-ops@yahoogroups.com
    Cc: pnfs-ops@yahoogroups.com; pnfs-reqs@yahoogroups.com
    Subject: RE: [pnfs-ops] pNFS Discussion Summary 1: 12/18/03


    Since I raised the issue of the metadata server not having access to all it's data servers (or at least not with adequate bandwidth) I feel compelled to say that Dave's arguments about supporting 4.0 are compelling enough to make it mandatory. The open issue is if it is legal for a "compliant server" to have serving data disabled by a local administrative function (the old "must implement but may use"). Otherwise an organization that wants to discourage use of data serving through the metadata server has very little it can do to enforce policy in a way that will not affect other clients (it may do serve poorly but this still affects other clients).

    Julo


    "Noveck, Dave" <dnoveck@netapp.com>

    18/12/2003 19:21
    Please respond to
    pnfs-ops@yahoogroups.com

    	
    To
    	<pnfs-ops@yahoogroups.com>, <pnfs-reqs@yahoogroups.com>
    cc
    	
    Subject
    	RE: [pnfs-ops] pNFS Discussion Summary 1: 12/18/03

    	




    Good summary.

    I want to address the "proxying" issue.

    > [1.1 Proxying]: Operations/work that can only be done out-of-band vs
    > alternative access through the NFSv4 server for all operations/work

    If you are talking about operations in the extension (let's call it
    NFS-v4.x), that are not in the previous minor version (let's assume
    that is nfs-v4.1), then you have a choice of whether these are supported
    for access through the server, or only for access by the client with the
    data server.  Let's call this the issue of proxying in the strict sense.

    There is another issue that people are calling "proxying" but is really
    logically distinct.  That is the issue of access by the previous minor
    version, e.g. nfs-v4.0 or nfs-v4.1.  Those versions have no concept of
    separate data servers and they need to be able to work.  End of story.
    If you can't read files stored in nfs-v4.x with nfs-v4.0, you do not
    have a minor version without proxying.  You don't have a minor version
    at all.  I believe the working group is never going to accept that.
    Even if I'm wrong and you can get the working group to accept that,
    it is going to be very contentious and thus take up a lot of time.
    Anybody, who really wants to go down this path should seriously consider
    the trade-off between supporting something they find objectionable and
    getting a standard a lot later, if at all.

    > On one hand, some suggest that a set of out-of-band clients should not
    > have to also have a data path through the NFSv4 metadata server.  One
    > reason is that customers may not tolerate the large variability in
    > performance between out-of-band (when the going is good) and in-band
    > (when the server chooses not to grant or to take away a delegation)
    > accesses.  

    Then such customers will use clients that access things out-of-band
    whenever possible, and servers that never refuse to give out layout
    delegations.  You have a number of quality-of-implementations issues
    for v4.x clients and servers.  If a particular client only supports
    access via v4.0, then performance will suck, and the working group
    will understand that, but it won't accept not being able to use
    v4.0 at all.  The customer is going to be motivated to upgrade his
    clients for those that need high-performance access, but he may be
    OK with some clients using v4.0 for a long time, depending on the
    particular performance those clients need.  (And some will want v2/v3
    access but that is a matter that the working group has no say about).

    > Another reason, and I paraphrase someone else here, is that
    > it is possible to construct out-of-band metadata servers that do not
    > have access to the data servers except through the clients -- I
    > encourage the source of this scenario to replace my paraphrasing with a
    > correct use case, because I find it odd to design for file servers that
    > do not have access to the data servers.

    So let's grant that it is possible (and we'll pass over the issue of
    whether it is desirable, and in fact so desirable that one is willing to
    not get a standard and or get it much later).

    So we have a metadata server and it, for whatever reason, does not have
    access to the data servers.  However, by hypothesis, there are machines
    (e.g. clients), that can communicate with both.  So, if one has such an
    architecture, then one can take such a machine, give it a communication path
    to the meta-data server and the data server and have the meta-data server
    transfer v4.0 READ requests to it, let it read the data from the data
    server and send it back to the meta-data server who send it back to the
    original requestor.  Is that a very good solution?  No.  Is it likely
    to be performant?  No.  Will it satisfy any particular customer?  I don't
    know and that is the implementer's business decision.  Will it satisfy
    the hypothetical customer who doesn't care about v4.0 access?  Clearly.
    Will it satisfy the v4 working group?  Yes, because they are not in the
    business of telling you how performant v4.0 access has got to be.

    > On the other hand, others have suggested that any access or work that a
    > client can do out-of-band should be possible with one or more commands
    > applied to the metadata server's data path.  This has been proposed for
    > coping with recalled delegations, including concurrent writing by
    > multiple clients; retry after client access errors, provided adequate
    > idempotency of out-of-band operations; and many alternative
    > implementations of out-of-band clients, including legacy clients that
    > use out-of-band never or rarely.

    This effort is going to take a while, but if we manage it correctly, it
    is not going to take so long that v3 clients are going to be rare things,
    and they have to be supported.  But v3 clients are not an issue for the
    working group.  V4.0 clients are and they will be around and you will
    have to support them, and I believe the working group is not going to
    be disposed to cut you a lot of slack on this issue (and I don't see
    why it should).

    > I think this is a topic that should be argued one way or the other in
    > the requirements document.  Use cases and examples in other systems
    > would be best.

    I think the requirement should be that this work should be done as a
    set of extensions to nfs-v4 delivered as a v4 minor version.  If there
    is some feature/requirement that conflicts with that model (and it is a
    pretty flexible one), then you have to think long and hard before deciding
    that that requirement is more important than this basic deivery vehicle,
    because it seems to me that it is, in almost all respects, the ideal way
    to make this sort of technology available for widespread use.






    To unsubscribe from this group, send an email to:
    pnfs-ops-unsubscribe@yahoogroups.com



    Yahoo! Groups Links

    To visit your group on the web, go to:
    http://groups.yahoo.com/group/pnfs-ops/

    To unsubscribe from this group, send an email to:
    pnfs-ops-unsubscribe@yahoogroups.com

    Your use of Yahoo! Groups is subject to:
    http://docs.yahoo.com/info/terms/





    To unsubscribe from this group, send an email to:
    pnfs-ops-unsubscribe@yahoogroups.com





    Yahoo! Groups Links

        * To visit your group on the web, go to:
          http://groups.yahoo.com/group/pnfs-ops/
           
        * To unsubscribe from this group, send an email to:
          pnfs-ops-unsubscribe@yahoogroups.com
           
        * Your use of Yahoo! Groups is subject to the Yahoo! Terms of Service. 

From black_david@emc.com Fri Jan 02 08:45:29 2004
Return-Path: <Black_David@emc.com>
X-Sender: Black_David@emc.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 73456 invoked from network); 2 Jan 2004 16:45:24 -0000
Received: from unknown (66.218.66.166)
by m18.grp.scd.yahoo.com with QMQP; 2 Jan 2004 16:45:24 -0000
Received: from unknown (HELO mxic2.corp.emc.com) (128.221.12.9)
by mta5.grp.scd.yahoo.com with SMTP; 2 Jan 2004 16:45:23 -0000
Received: by mxic2.corp.emc.com with Internet Mail Service (5.5.2653.19)
id <ZHDVCWAC>; Fri, 2 Jan 2004 11:45:23 -0500
Message-ID: <B459CE1AFFC52D4688B2A5B842CA35EA7A53FB@corpmx14.corp.emc.com>
To: pnfs-ops@yahoogroups.com, pnfs-reqs@yahoogroups.com,
pnfs-sbc@yahoogroups.com
Date: Fri, 2 Jan 2004 11:45:21 -0500
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2653.19)
Content-Type: text/plain
X-eGroups-Remote-IP: 128.221.12.9
From: black_david@emc.com
Subject: Two Functionality issues
X-Yahoo-Group-Post: member; u=82420288
X-Yahoo-Profile: dlb237

In starting to look at design issues for block metadata,
I've run across a couple of issues around functionality
to be supported that could use wider discussion. This
is based on an initial review of the EMC High Road FMP
protocol and the IBM StorageTank SAN.FS protocol. I've
tried to just describe the issues here without taking a
position.

[4] Functionality

SAN.FS extents come with both read and write
extent mappings and block usage bitmaps. The separate read
and write mappings allow for clients to participate in copy-on-
write functionality - IIRC, Craig has described this.

Issue [4.1]: Should protocol include support for client participation
in copy-on-write?

A motivation for the separate arrays of block usage bits" appears
to be allowing clients to turn file data into holes (e.g.,
AIX fclear system call).

Issue [4.2]: Is the ability to turn valid data into a file "hole"
(e.g., AIX fclear) at the client important to support?

FMP does not support separate read mappings or usage bitmaps,
and hence is not capable of involving clients in copy-on-write
or allowing a client to turn valid data into a file "hole".

Comments? Thanks,
--David

----------------------------------------------------
David L. Black, Senior Technologist
EMC Corporation, 176 South St., Hopkinton, MA 01748
+1 (508) 293-7953 FAX: +1 (508) 293-7786
black_david@emc.com Mobile: +1 (978) 394-7754
----------------------------------------------------

From dnoveck@netapp.com Mon Jan 05 08:00:09 2004
Return-Path: <Dave.Noveck@netapp.com>
X-Sender: Dave.Noveck@netapp.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 73569 invoked from network); 5 Jan 2004 16:00:05 -0000
Received: from unknown (66.218.66.167)
by m3.grp.scd.yahoo.com with QMQP; 5 Jan 2004 16:00:05 -0000
Received: from unknown (HELO mx01.netapp.com) (198.95.226.53)
by mta6.grp.scd.yahoo.com with SMTP; 5 Jan 2004 16:00:05 -0000
Received: from hawk.corp.netapp.com (hawk [10.57.156.122])
by mx01.netapp.com (8.12.10/8.12.10/NTAP-1.4) with ESMTP id i05G04Kw001147;
Mon, 5 Jan 2004 08:00:04 -0800 (PST)
Received: from svlexc01.hq.netapp.com (svlexc01.corp.netapp.com [10.10.22.171])
by hawk.corp.netapp.com (8.12.9/8.12.9/NTAP-1.5) with ESMTP id i05G00SX005409;
Mon, 5 Jan 2004 08:00:04 -0800 (PST)
content-class: urn:content-classes:message
MIME-Version: 1.0
Content-Type: text/plain;
charset="iso-8859-1"
Content-Transfer-Encoding: quoted-printable
X-MimeOLE: Produced By Microsoft Exchange V6.0.6249.0
Date: Mon, 5 Jan 2004 07:59:50 -0800
Message-ID: <C8CF60CFC4D8A74E9945E32CF096548AB80AA8@silver.nane.netapp.com>
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
Thread-Topic: [pnfs-reqs] Two Functionality issues
Thread-Index: AcPRT9ZJdxsdz5loSIqJV3CopbvMEAB0UrSg
To: <pnfs-reqs@yahoogroups.com>, <pnfs-ops@yahoogroups.com>,
<pnfs-sbc@yahoogroups.com>
X-eGroups-Remote-IP: 198.95.226.53
X-eGroups-From: "Noveck, Dave" <Dave.Noveck@netapp.com>
From: "Noveck, Dave" <dnoveck@netapp.com>
Subject: RE: [pnfs-reqs] Two Functionality issues
X-Yahoo-Group-Post: member; u=44152831
X-Yahoo-Profile: davidnoveck

With regard to issue 4.2, the fclear operation, I don't
have a position on whether this is important to do but I
am pretty sure that if we do this, it should not be by
means of something limited to the block metadata. If
people need this, we should do this via an ordinary
v4.x operation, which I'll call FCLEAR for now.

The operation of turning a written area into a hole has
three visible consequences:
1) The written data within the targeted area vanishes
and is replaced by zeros as seen by ordinary v4.0
clients and also in pnfs environments where the
metadata format is file or object oriented.
2) Mod a whole bunch of server policy stuff (snapshots,
etc.) the disk space previously used is made available
(No real guarantees but clients may want to do this to
make space available and in many environments they will
be able reliably to get the results they desire).
3) The SAN metadata will show the targeted area as a hole.

So I would argue that, given that this has visible consequences
for all sorts of clients it should be done in a common way,
even though the most definitive manifestation of the function
is via the SAN metadata.

Consider a client implementing the fclear function. Even
though a test program might depend on 3), real applications
that want this functionality are going to be most interested
in 1) and 2). If this function were implemented only through
the SAN metadata, what is the client to do to give the application
the expected behavior? You can get 1) expensively by writing
lots of zeros, but for 2) you are stuck. The result is that
even applications that don't explicitly or implicitly depend
on 3) are burdened by the fact that fclear support in not
universally available.

We want to have a single protocol and not three protocols. So
I think this means that functionality should only be restricted
to a single form of metadata if the consequences of that
functionality can only be seen through that form of metadata,
which isn't the case here.



-----Original Message-----
From: black_david@emc.com [mailto:black_david@emc.com]
Sent: Friday, January 02, 2004 11:45 AM
To: pnfs-ops@yahoogroups.com; pnfs-reqs@yahoogroups.com;
pnfs-sbc@yahoogroups.com
Subject: [pnfs-reqs] Two Functionality issues


In starting to look at design issues for block metadata,
I've run across a couple of issues around functionality
to be supported that could use wider discussion. This
is based on an initial review of the EMC High Road FMP
protocol and the IBM StorageTank SAN.FS protocol. I've
tried to just describe the issues here without taking a
position.

[4] Functionality

SAN.FS extents come with both read and write
extent mappings and block usage bitmaps. The separate read
and write mappings allow for clients to participate in copy-on-
write functionality - IIRC, Craig has described this.

Issue [4.1]: Should protocol include support for client participation
in copy-on-write?

A motivation for the separate arrays of block usage bits" appears
to be allowing clients to turn file data into holes (e.g.,
AIX fclear system call).

Issue [4.2]: Is the ability to turn valid data into a file "hole"
(e.g., AIX fclear) at the client important to support?

FMP does not support separate read mappings or usage bitmaps,
and hence is not capable of involving clients in copy-on-write
or allowing a client to turn valid data into a file "hole".

Comments? Thanks,
--David,
+1 (508) 293-7953 FAX: +1 (508) 293-7786
black_david@emc.com Mobile: +1 (978) 394-7754
----------------------------------------------------





Yahoo! Groups Links

To visit your group on the web, go to:
http://groups.yahoo.com/group/pnfs-reqs/

To unsubscribe from this group, send an email to:
pnfs-reqs-unsubscribe@yahoogroups.com

Your use of Yahoo! Groups is subject to:
http://docs.yahoo.com/info/terms/ 

From andros@citi.umich.edu Mon Jan 05 10:27:41 2004
Return-Path: <andros@citi.umich.edu>
X-Sender: andros@citi.umich.edu
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 26235 invoked from network); 5 Jan 2004 18:27:40 -0000
Received: from unknown (66.218.66.217)
by m18.grp.scd.yahoo.com with QMQP; 5 Jan 2004 18:27:40 -0000
Received: from unknown (HELO citi.umich.edu) (141.211.133.111)
by mta2.grp.scd.yahoo.com with SMTP; 5 Jan 2004 18:27:39 -0000
Received: from citi.umich.edu (citi.umich.edu [141.211.133.111])
by citi.umich.edu (Postfix) with ESMTP
id 07168207D3; Mon, 5 Jan 2004 13:27:38 -0500 (EST)
X-Mailer: exmh version 2.5 07/13/2001 with version: MH 6.8.3 #74[UCI]
To: pnfs-ops@yahoogroups.com
Cc: pnfs-reqs@yahoogroups.com, andros@citi.umich.edu
In-reply-to: Your message of "Mon, 29 Dec 2003 13:00:02 PST."
<C8CF60CFC4D8A74E9945E32CF096548A6D3632@silver.nane.netapp.com>
Mime-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Date: Mon, 05 Jan 2004 13:27:37 -0500
Message-Id: <20040105182738.07168207D3@citi.umich.edu>
X-eGroups-Remote-IP: 141.211.133.111
From: "William A.(Andy) Adamson" <andros@citi.umich.edu>
Subject: Re: [pnfs-ops] why not use mandatory byte-range locking
X-Yahoo-Group-Post: member; u=169434965

> Andy Adamson wrote:
> > the discussion of byte-range delegations and cache consistancy provoked this
> > thought: why not use existing mandatory byte-range locking?
>
> > the client opens a file, requests a (mandatory) lock on the region of the file
> > it's interested in. the resultant lock stateid is passed as an argument to the
> > READ/WRITE_IND request. we can require a mandatory lock stateid prior to
> > handing out layout maps for direct i/o. the layout map is 'good 'only for as
> > long as the byte-range lock.
>
> One problem is that there is no way for the client to specify that he wants
> a mandatory (as opposed to advisory) byte-range lock, he just asks for one
> and the server gives him the type of byte-range that server is giving out
> for that fs. So, if you did that, applications that relied on the semantics
> (or lack of semantics) of advisory byte-range locks would break.

 From 3530
5.11.5. Mode Attribute
......

Note that in UNIX, if a file has the MODE4_SGID bit set and no
MODE4_XGRP bit set, then READ and WRITE must use mandatory file
locking.


so for unix, there is a way to specify mandatory vrs advisory locking. since
this is also in 3530:
8. File Locking and Share Reservations
.....
These mechanisms can implement policy ranging from advisory only locking to
full mandatory locking.

adding a flag to a LOCK/T/U to indicate mandatory locking vrs advisory is
within reason.


> Another issue is that while you say "'good' only for as long as the byte-range
> lock", the results of doing this are that the layout map and the data will
> be fixed for at least as long as the byte-range lock exists, i.e. sometimes
> too long. If I'm going to be reading directly from the data server, then
> I want the layout to stay constant for a long time, or at least I don't want
> to be forced to repeatedly get locks for small areas of the layout. The
> obvious (and desirable) thing for me to do is to get a shared lock for the
> whole file so the layout cannot change, but if we combine changes of layout
> and changes of data under a single sort of lock, mandatory byte-range locks
> in this case, we have stopped anybody writing in the file for a very long
> time, i.e. essentially forever since my lease will normally be continually
> renewed.
>
> When you combine a guarantee that the layout will not change with a guarantee
> that the data will not change, in such a way that they can't be separated,
> you artificially increase the amount of conflicts, in many cases to an
> unacceptable level.

perhaps i'm missing something, but isn't it the case the layout of the data
and the abilty to access the data are totally bound together? the layout
changes due to writes and appends (other??). if the layout changes, stale
layout maps are not only no longer any good, they can lead to data corruption.
it seems to me that the guarentee that the layout won't change is bound to the
guarentee that the data won't change. i can't think of any conflicts such as
you mention - could you give some examples?


> When you have a delegation model, the problem is
> excessive recalls, while when you have a locking model the problem is that
> some applications will slow to a crawl/halt.
>
> > the mandatory lock protects the layout, so no need for layout delegations.
> > mandatory locking also allows the client to cache and operate locally on the
> > locked data region with cache consistancy guarentees.
>
> If you are going to be doing some local operation, then short-term mandatory
> byte-range locking can help you. If need to do a lock/fetch/update/write/unlock
> cycle on a record, this is the ticket (and in v4 lock/fetch and write/unlock
> can be COMPOUND's :-). The record you hold while updating can be considered
> cached for that brief period. If, however, you are caching data generally,
> i.e. for a period outside the range of a short operation sequence, you are
> going to need something that is delegation-like, in that if I have the
> cached data and want to keep it until there is some reason to get rid of
> it, i.e. it is LRU'd out or there is a conflict, then I have to have some
> way of finding out that there is a conflict. Delegations do that via a
> recall and one can imagine it being done other ways. But the mandatory
> lock model is that I have a lock because I need it and so there is no
> provision to tell me that someone else has a conflict. The logic is that
> he will wait until I give the lock up, and waiting for the cached data to
> be LRU'd is going to be too long in most cases.
>

i agree that the ability for the server to recall is a required feature. i'm
simply suggesting that mandatory locking may have more features in common with
what we need for pnfs than delgations, and that we could extend the existing
mandatory byte-range locking model with fewer changes than extending the
existing delegation model.

so, how about estending the mandatory locking model with a recall mechanism?

>
> > we already have the byte-range locking code written.
>
> I only have advisory byte-range locking code written. Who has v4 mandatory
> byte-range locking implemented?
>
> > so how far does this get
> > us? does it make sense to start with the locking code instead of the
> > delegation as far as extenstions?
>
> I think if we define some form of byte-range delegations (at least for data
> and maybe for layout as well), there is going to be lots of code sharing with
> an existing mandatory byte-range locking implementation. The data structures
> and many of the interfaces are going to be the same.

and this is really why i brought this up. a new lock type that has the
features we desire (e.g. a recall mechanism) makes sense to me.

> The difference is going
> to be what you do about conflicts. Instead of saying to the second claimant,
> "You snoozed so you lose", in some cases you have to be prepared to recall the
> delegation so that, for example, an otherwise unexceptionable write can proceed.

From dnoveck@netapp.com Mon Jan 05 12:11:32 2004
Return-Path: <Dave.Noveck@netapp.com>
X-Sender: Dave.Noveck@netapp.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 69316 invoked from network); 5 Jan 2004 20:11:30 -0000
Received: from unknown (66.218.66.172)
by m5.grp.scd.yahoo.com with QMQP; 5 Jan 2004 20:11:30 -0000
Received: from unknown (HELO mx01.netapp.com) (198.95.226.53)
by mta4.grp.scd.yahoo.com with SMTP; 5 Jan 2004 20:11:30 -0000
Received: from hawk.corp.netapp.com (hawk [10.57.156.122])
by mx01.netapp.com (8.12.10/8.12.10/NTAP-1.4) with ESMTP id i05KBUKw005747;
Mon, 5 Jan 2004 12:11:30 -0800 (PST)
Received: from svlexc01.hq.netapp.com (svlexc01.corp.netapp.com [10.10.22.171])
by hawk.corp.netapp.com (8.12.9/8.12.9/NTAP-1.5) with ESMTP id i05KBUSR010661;
Mon, 5 Jan 2004 12:11:30 -0800 (PST)
content-class: urn:content-classes:message
MIME-Version: 1.0
Content-Type: text/plain;
charset="iso-8859-1"
Content-Transfer-Encoding: quoted-printable
X-MimeOLE: Produced By Microsoft Exchange V6.0.6249.0
Date: Mon, 5 Jan 2004 12:11:23 -0800
Message-ID: <C8CF60CFC4D8A74E9945E32CF096548A6D3642@silver.nane.netapp.com>
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
Thread-Topic: [pnfs-reqs] Re: [pnfs-ops] why not use mandatory byte-range locking
Thread-Index: AcPTuZy/JW5xIH2uTcip4yVASdheYgACMxhQ
To: <pnfs-reqs@yahoogroups.com>, <pnfs-ops@yahoogroups.com>
X-eGroups-Remote-IP: 198.95.226.53
X-eGroups-From: "Noveck, Dave" <Dave.Noveck@netapp.com>
From: "Noveck, Dave" <dnoveck@netapp.com>
Subject: RE: [pnfs-reqs] Re: [pnfs-ops] why not use mandatory byte-range locking
X-Yahoo-Group-Post: member; u=44152831
X-Yahoo-Profile: davidnoveck

Andy Adamson wrote:
[Dave Noveck wrote]:
> > When you combine a guarantee that the layout will not change with a guarantee
> > that the data will not change, in such a way that they can't be separated,
> > you artificially increase the amount of conflicts, in many cases to an
> > unacceptable level.

> perhaps i'm missing something, but isn't it the case the layout of the data
> and the abilty to access the data are totally bound together? the layout
> changes due to writes and appends (other??).

It is (almost) always the case that if the layout changes, it is because of
some data being written. The exceptions are so few that they can easily be
dealt with by considering that a fictitious write which just happened to
overwrite the same data has occurred (e.g. the data server scans its disks
and finds a bad spot making it advisable to move data stored there somewhere
else, thus changing the layout, even though there was no application-level
write).

However, the problem I see with your "totally bound together" formulation
is in the other direction. There are many many cases in which the data
changes and the layout does not change and they are important from a
performance point of view. In the SAN case, whenever a file is modified
by overwriting, the data changes but the layout des not. In the pnfs cases
in which the distribution is by files or objects, the layout changes
even less. The layout is normally established once ("this file is striped
among the following 64 data servers at 256K per stripe") and that hardly
ever changes. The protocol has to allow for the possibility that there
is a change (e.g. the administrator wants to add more data servers) but as
a practical matter the clients can go on their merry way using the layout
information they got when the file was first accessed.

> if the layout changes, stale
> layout maps are not only no longer any good, they can lead to data corruption.
> it seems to me that the guarentee that the layout won't change is bound to the
> guarentee that the data won't change. i can't think of any conflicts such as
> you mention - could you give some examples?

Thousands of Linux nodes in an application cluster are merrily reading and
writing, not changing the layout. The application is careful to not cache
inappropriately (and it knows how the file is used so it is reasonable that it
might do that), so callbacks will not be needed for cache invalidating. The
problem is, you want these nodes to get the layout information and use it
and not be bothered when the layout *isn't* changing (and when they are
the people doing the writes are bothered since they have to wait for the
delegation recalls from large numbers of clients). However, since it is
possible that the layout will change, the clients, since they have layout
info, will be notified when it changes. Since it is changing infrequently
(almost never) this is fine. But it isn't fine, if, whenever the data changes,
you act as if the layout is changing.

> i agree that the ability for the server to recall is a required feature. i'm
> simply suggesting that mandatory locking may have more features in common with
> what we need for pnfs than delgations, and that we could extend the existing
> mandatory byte-range locking model with fewer changes than extending the
> existing delegation model.

I think you are reading too much into my words. When I call such a thing a
delegation, I don't mean that it is very much like the delegations that
exist in v4.0 today. I mean simply that it is an optionally-granted recallable
lock. It makes sense in v4.x to do such a thing with a new OP (as I don't
think you can add parameters to existing ops) but GET_RANGE_DELEG is going
to look a whole lot more like the existing LOCK op than it does anything
related to current delegations in v4.0.

> so, how about estending the mandatory locking model with a recall mechanism?

I'd call the result a "range delegation".

The issue I have is the ability to lock (i.e. get a delegation for) the layout
for a given region without getting recalled when the data changes. I don't see
a need for the reverse (i.e. a lock on the data without getting recalled when the
layout changes). When we get to the detailed specification, we'll see if it
turns out better for these (the data lock/delegation and the layout lock/delegation)
to be conceptually independent or assembled into a hierarchy in which the
don't-change-the-data-or-layout lock/delegation is stronger than the don't-
change-the-layout lock/delegation.

-----Original Message-----
From: William A.(Andy) Adamson [mailto:andros@citi.umich.edu]
Sent: Monday, January 05, 2004 1:28 PM
To: pnfs-ops@yahoogroups.com
Cc: pnfs-reqs@yahoogroups.com; andros@citi.umich.edu
Subject: [pnfs-reqs] Re: [pnfs-ops] why not use mandatory byte-range
locking


> Andy Adamson wrote:
> > the discussion of byte-range delegations and cache consistancy provoked this
> > thought: why not use existing mandatory byte-range locking?
>
> > the client opens a file, requests a (mandatory) lock on the region of the file
> > it's interested in. the resultant lock stateid is passed as an argument to the
> > READ/WRITE_IND request. we can require a mandatory lock stateid prior to
> > handing out layout maps for direct i/o. the layout map is 'good 'only for as
> > long as the byte-range lock.
>
> One problem is that there is no way for the client to specify that he wants
> a mandatory (as opposed to advisory) byte-range lock, he just asks for one
> and the server gives him the type of byte-range that server is giving out
> for that fs. So, if you did that, applications that relied on the semantics
> (or lack of semantics) of advisory byte-range locks would break.

>From 3530
5.11.5. Mode Attribute
......

Note that in UNIX, if a file has the MODE4_SGID bit set and no
MODE4_XGRP bit set, then READ and WRITE must use mandatory file
locking.


so for unix, there is a way to specify mandatory vrs advisory locking. since
this is also in 3530:
8. File Locking and Share Reservations
.....
These mechanisms can implement policy ranging from advisory only locking to
full mandatory locking.

adding a flag to a LOCK/T/U to indicate mandatory locking vrs advisory is
within reason.


> Another issue is that while you say "'good' only for as long as the byte-range
> lock", the results of doing this are that the layout map and the data will
> be fixed for at least as long as the byte-range lock exists, i.e. sometimes
> too long. If I'm going to be reading directly from the data server, then
> I want the layout to stay constant for a long time, or at least I don't want
> to be forced to repeatedly get locks for small areas of the layout. The
> obvious (and desirable) thing for me to do is to get a shared lock for the
> whole file so the layout cannot change, but if we combine changes of layout
> and changes of data under a single sort of lock, mandatory byte-range locks
> in this case, we have stopped anybody writing in the file for a very long
> time, i.e. essentially forever since my lease will normally be continually
> renewed.
>
> When you combine a guarantee that the layout will not change with a guarantee
> that the data will not change, in such a way that they can't be separated,
> you artificially increase the amount of conflicts, in many cases to an
> unacceptable level.

perhaps i'm missing something, but isn't it the case the layout of the data
and the abilty to access the data are totally bound together? the layout
changes due to writes and appends (other??). if the layout changes, stale
layout maps are not only no longer any good, they can lead to data corruption.
it seems to me that the guarentee that the layout won't change is bound to the
guarentee that the data won't change. i can't think of any conflicts such as
you mention - could you give some examples?


> When you have a delegation model, the problem is
> excessive recalls, while when you have a locking model the problem is that
> some applications will slow to a crawl/halt.
>
> > the mandatory lock protects the layout, so no need for layout delegations.
> > mandatory locking also allows the client to cache and operate locally on the
> > locked data region with cache consistancy guarentees.
>
> If you are going to be doing some local operation, then short-term mandatory
> byte-range locking can help you. If need to do a lock/fetch/update/write/unlock
> cycle on a record, this is the ticket (and in v4 lock/fetch and write/unlock
> can be COMPOUND's :-). The record you hold while updating can be considered
> cached for that brief period. If, however, you are caching data generally,
> i.e. for a period outside the range of a short operation sequence, you are
> going to need something that is delegation-like, in that if I have the
> cached data and want to keep it until there is some reason to get rid of
> it, i.e. it is LRU'd out or there is a conflict, then I have to have some
> way of finding out that there is a conflict. Delegations do that via a
> recall and one can imagine it being done other ways. But the mandatory
> lock model is that I have a lock because I need it and so there is no
> provision to tell me that someone else has a conflict. The logic is that
> he will wait until I give the lock up, and waiting for the cached data to
> be LRU'd is going to be too long in most cases.
>

i agree that the ability for the server to recall is a required feature. i'm
simply suggesting that mandatory locking may have more features in common with
what we need for pnfs than delgations, and that we could extend the existing
mandatory byte-range locking model with fewer changes than extending the
existing delegation model.

so, how about estending the mandatory locking model with a recall mechanism?

>
> > we already have the byte-range locking code written.
>
> I only have advisory byte-range locking code written. Who has v4 mandatory
> byte-range locking implemented?
>
> > so how far does this get
> > us? does it make sense to start with the locking code instead of the
> > delegation as far as extenstions?
>
> I think if we define some form of byte-range delegations (at least for data
> and maybe for layout as well), there is going to be lots of code sharing with
> an existing mandatory byte-range locking implementation. The data structures
> and many of the interfaces are going to be the same.

and this is really why i brought this up. a new lock type that has the
features we desire (e.g. a recall mechanism) makes sense to me.

> The difference is going
> to be what you do about conflicts. Instead of saying to the second claimant,
> "You snoozed so you lose", in some cases you have to be prepared to recall the
> delegation so that, for example, an otherwise unexceptionable write can proceed.







Yahoo! Groups Links

To visit your group on the web, go to:
http://groups.yahoo.com/group/pnfs-reqs/

To unsubscribe from this group, send an email to:
pnfs-reqs-unsubscribe@yahoogroups.com

Your use of Yahoo! Groups is subject to:
http://docs.yahoo.com/info/terms/ 

From black_david@emc.com Mon Jan 05 16:39:00 2004
Return-Path: <Black_David@emc.com>
X-Sender: Black_David@emc.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 13162 invoked from network); 6 Jan 2004 00:38:58 -0000
Received: from unknown (66.218.66.216)
by m15.grp.scd.yahoo.com with QMQP; 6 Jan 2004 00:38:58 -0000
Received: from unknown (HELO MAHO3MSX2.corp.emc.com) (128.221.11.32)
by mta1.grp.scd.yahoo.com with SMTP; 6 Jan 2004 00:38:58 -0000
Received: by maho3msx2.corp.emc.com with Internet Mail Service (5.5.2653.19)
id <ZHDMNTRY>; Mon, 5 Jan 2004 19:38:57 -0500
Message-ID: <B459CE1AFFC52D4688B2A5B842CA35EA7A5414@corpmx14.corp.emc.com>
To: pnfs-reqs@yahoogroups.com, pnfs-ops@yahoogroups.com
Date: Mon, 5 Jan 2004 19:38:57 -0500
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2653.19)
Content-Type: text/plain
X-eGroups-Remote-IP: 128.221.11.32
From: black_david@emc.com
Subject: RE: [pnfs-reqs] Re: [pnfs-ops] why not use mandatory byte-range l
ocking
X-Yahoo-Group-Post: member; u=82420288
X-Yahoo-Profile: dlb237

> > Andy Adamson wrote:
> > > the discussion of byte-range delegations and cache consistancy
provoked this
> > > thought: why not use existing mandatory byte-range locking?

The "existing" locking cannot be reused - it has to be a new type of locking
that might share some operations with the existing locking, i.e.,

> and this is really why i brought this up. a new lock type that has the
> features we desire (e.g. a recall mechanism) makes sense to me.

Keep in mind that what's required is significantly more than locking.
For an example, take a look at FMP_Flush in the uploaded FMP spec to see the
things that may need to be done when releasing a write lock.

Thanks,
--David
----------------------------------------------------
David L. Black, Senior Technologist
EMC Corporation, 176 South St., Hopkinton, MA 01748
+1 (508) 293-7953 FAX: +1 (508) 293-7786
black_david@emc.com Mobile: +1 (978) 394-7754
----------------------------------------------------

From garth@panasas.com Tue Jan 06 20:31:21 2004
Return-Path: <garth@panasas.com>
X-Sender: garth@panasas.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 60427 invoked from network); 7 Jan 2004 04:31:19 -0000
Received: from unknown (66.218.66.218)
by m3.grp.scd.yahoo.com with QMQP; 7 Jan 2004 04:31:19 -0000
Received: from unknown (HELO PIKES.panasas.com) (65.194.124.178)
by mta3.grp.scd.yahoo.com with SMTP; 7 Jan 2004 04:31:19 -0000
Received: from [172.17.19.50] ([172.17.19.50]) by PIKES.panasas.com with SMTP (Microsoft Exchange Internet Mail Service Version 5.5.2653.13)
id SVSYGWFA; Tue, 6 Jan 2004 23:31:18 -0500
Mime-Version: 1.0 (Apple Message framework v609)
Content-Transfer-Encoding: 7bit
Message-Id: <54AEC7B6-40CA-11D8-B7B5-000A95A94F04@panasas.com>
Content-Type: text/plain; charset=US-ASCII; format=flowed
To: pnfs-reqs@yahoogroups.com
Date: Tue, 6 Jan 2004 23:31:15 -0500
X-Mailer: Apple Mail (2.609)
X-eGroups-Remote-IP: 65.194.124.178
From: Garth Gibson <garth@panasas.com>
Subject: Announcing a weekly pNFS requirements concall
X-Yahoo-Group-Post: member; u=169457820
X-Yahoo-Profile: garth_a_gibson

While our mailing lists are seeing a good flow of good comments, the
timeline we have set for ourselves, to give the IETF something in early
Feb, is short. So I've set up a weekly conference call for an hour,
for all that can make it. Notes from these calls will go out to the
Yahoo group reflector for those that can't make it.

Beginning this Friday, Jan 9, 12-1pm EST, hosted by Panasas. Contact
garth gibson if you would like to participate and do not know the dial
in numbers.

Thanks
garth

From julian_satran@il.ibm.com Fri Jan 09 23:04:02 2004
Return-Path: <Julian_Satran@il.ibm.com>
X-Sender: Julian_Satran@il.ibm.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 17499 invoked from network); 10 Jan 2004 07:04:01 -0000
Received: from unknown (66.218.66.167)
by m14.grp.scd.yahoo.com with QMQP; 10 Jan 2004 07:04:01 -0000
Received: from unknown (HELO mtagate3.de.ibm.com) (195.212.29.152)
by mta6.grp.scd.yahoo.com with SMTP; 10 Jan 2004 07:04:00 -0000
Received: from d12relay01.megacenter.de.ibm.com (d12relay01.megacenter.de.ibm.com [9.149.165.180])
by mtagate3.de.ibm.com (8.12.10/8.12.10) with ESMTP id i0A73vHI118250;
Sat, 10 Jan 2004 07:03:57 GMT
Received: from d10ml001.telaviv.ibm.com (d12av02.megacenter.de.ibm.com [9.149.165.228])
by d12relay01.megacenter.de.ibm.com (8.12.10/NCO/VER6.6) with ESMTP id i0A73tKG278370;
Sat, 10 Jan 2004 08:03:56 +0100
In-Reply-To: <20031229201109.A223D20806@citi.umich.edu>
To: pnfs-ops@yahoogroups.com
Cc: andros@citi.umich.edu, pnfs-ops@yahoogroups.com, pnfs-reqs@yahoogroups.com
MIME-Version: 1.0
X-Mailer: Lotus Notes Release 6.5 September 26, 2003
Message-ID: <OF180085D0.82D1232B-ONC2256E17.00235136-C2256E17.0026CF79@il.ibm.com>
Date: Sat, 10 Jan 2004 09:03:54 +0200
X-MIMETrack: Serialize by Router on D10ML001/10/M/IBM(Release 6.0.2CF2|July 23, 2003) at
10/01/2004 09:03:56,
Serialize complete at 10/01/2004 09:03:56
Content-Type: multipart/alternative; boundary="=_alternative 0023E6B2C2256E17_="
X-eGroups-Remote-IP: 195.212.29.152
From: Julian Satran <julian_satran@il.ibm.com>
Subject: Re: [pnfs-ops] why not use mandatory byte-range locking
X-Yahoo-Group-Post: member; u=64714603

It looks to me at least as a valid option. I think that the argument against it has to do with revocation and data access.
With NFS - data accesses go through the NFS server and a server that has revoked a lock will not let the client access data.

With the new scheme revocation has to be explicit  - as the client is accessing data by its own.

Delegation expresses this "new reality" better.  But perhaps for layout what is needed is combination of lock-stateID and delegation.

Julo


"William A.(Andy) Adamson" <andros@citi.umich.edu>

29/12/2003 22:11
Please respond to
pnfs-ops@yahoogroups.com

	
To
	pnfs-ops@yahoogroups.com
cc
	pnfs-reqs@yahoogroups.com, andros@citi.umich.edu
Subject
	[pnfs-ops] why not use mandatory byte-range locking

	




the discussion of byte-range delegations and cache consistancy provoked this
thought: why not use existing mandatory byte-range locking?

the client opens a file, requests a (mandatory) lock on the region of the file
it's interested in. the resultant lock stateid is passed as an argument to the
READ/WRITE_IND request. we can require a mandatory lock stateid prior to
handing out layout maps for direct i/o. the layout map is 'good 'only for as
long as the byte-range lock.

the mandatory lock protects the layout, so no need for layout delegations.
mandatory locking also allows the client to cache and operate locally on the
locked data region with cache consistancy guarentees.

we already have the byte-range locking code written. so how far does this get
us? does it make sense to start with the locking code instead of the
delegation as far as extenstions?

-->Andy


To unsubscribe from this group, send an email to:
pnfs-ops-unsubscribe@yahoogroups.com



Yahoo! Groups Links

To visit your group on the web, go to:
http://groups.yahoo.com/group/pnfs-ops/

To unsubscribe from this group, send an email to:
pnfs-ops-unsubscribe@yahoogroups.com

Your use of Yahoo! Groups is subject to:
http://docs.yahoo.com/info/terms/

From garth@panasas.com Wed Jan 14 16:16:12 2004
Return-Path: <garth@panasas.com>
X-Sender: garth@panasas.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 5528 invoked from network); 15 Jan 2004 00:16:11 -0000
Received: from unknown (66.218.66.218)
by m4.grp.scd.yahoo.com with QMQP; 15 Jan 2004 00:16:11 -0000
Received: from unknown (HELO PIKES.panasas.com) (65.194.124.178)
by mta3.grp.scd.yahoo.com with SMTP; 15 Jan 2004 00:16:11 -0000
Received: from [172.17.133.59] ([172.17.133.59]) by PIKES.panasas.com with SMTP (Microsoft Exchange Internet Mail Service Version 5.5.2653.13)
id SVSYHVN9; Wed, 14 Jan 2004 19:16:09 -0500
Mime-Version: 1.0 (Apple Message framework v609)
In-Reply-To: <OFBE7120C2.110DAF11-ONC2256E1B.004F757F-C2256E1B.00510874@il.ibm.com>
References: <OFBE7120C2.110DAF11-ONC2256E1B.004F757F-C2256E1B.00510874@il.ibm.com>
Content-Type: text/plain; charset=US-ASCII; format=flowed
Message-Id: <037B3245-46F0-11D8-AC67-000A95A94F04@panasas.com>
Content-Transfer-Encoding: 7bit
Date: Wed, 14 Jan 2004 16:16:07 -0800
To: pnfs-reqs@yahoogroups.com
X-Mailer: Apple Mail (2.609)
X-eGroups-Remote-IP: 65.194.124.178
From: Garth Gibson <garth@panasas.com>
Subject: Re: [pnfs-reqs] Announcing a weekly pNFS requirements concall
X-Yahoo-Group-Post: member; u=169457820
X-Yahoo-Profile: garth_a_gibson

Yes. As agreed in the last call, we moved the meeting time to Thursday
11-12 EST, 8-9 PST at the same number. Contact me if you do not have
the number.

Sorry for the late reminder. I'll get the notes from the last meeting
out this afternoon.

garth


On Jan 14, 2004, at 6:45 AM, Julian Satran wrote:
>
> Do we have a call this week? Julo


> From: Garth Gibson <garth@Panasas.Com>
> Date: January 6, 2004 8:31:15 PM PST
> To: pnfs-reqs@yahoogroups.com
> Subject: [pnfs-reqs] Announcing a weekly pNFS requirements concall
> Reply-To: pnfs-reqs@yahoogroups.com
>
> While our mailing lists are seeing a good flow of good comments, the
> timeline we have set for ourselves, to give the IETF something in early
> Feb, is short. So I've set up a weekly conference call for an hour,
> for all that can make it. Notes from these calls will go out to the
> Yahoo group reflector for those that can't make it.
>
> Beginning this Friday, Jan 9, 12-1pm EST, hosted by Panasas. Contact
> garth gibson if you would like to participate and do not know the dial
> in numbers.
>
> Thanks
> garth

From garth@panasas.com Wed Jan 14 23:19:17 2004
Return-Path: <garth@panasas.com>
X-Sender: garth@panasas.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 35625 invoked from network); 15 Jan 2004 07:19:16 -0000
Received: from unknown (66.218.66.167)
by m5.grp.scd.yahoo.com with QMQP; 15 Jan 2004 07:19:16 -0000
Received: from unknown (HELO PIKES.panasas.com) (65.194.124.178)
by mta6.grp.scd.yahoo.com with SMTP; 15 Jan 2004 07:19:16 -0000
Received: from [172.17.133.59] ([172.17.133.59]) by PIKES.panasas.com with SMTP (Microsoft Exchange Internet Mail Service Version 5.5.2653.13)
id SVSYHWJ2; Thu, 15 Jan 2004 02:19:11 -0500
Mime-Version: 1.0 (Apple Message framework v609)
Content-Transfer-Encoding: 7bit
Message-Id: <1B419D86-472B-11D8-AC67-000A95A94F04@panasas.com>
Content-Type: text/plain; charset=US-ASCII; delsp=yes; format=flowed
To: pnfs-reqs@yahoogroups.com
Date: Wed, 14 Jan 2004 23:19:07 -0800
X-Mailer: Apple Mail (2.609)
X-eGroups-Remote-IP: 65.194.124.178
From: Garth Gibson <garth@panasas.com>
Subject: pNFS requirements concall 2004-01-09 12-1 EST notes
X-Yahoo-Group-Post: member; u=169457820
X-Yahoo-Profile: garth_a_gibson

ADVERTISEMENT
click here
pNFS requirements concall 2004-01-09 12-1 EST notes


Participants
----------------------------------------------------
David Black, EMC
Tyce McLarty, LLNL
Dave Noveck, Tom Talpey, Peter Corbett, NetApp
Julian Satran, IBM
Andy Adamson, CITI
Garth Gibson, Benny Halevy, Panasas

Garth chaired, Benny took notes.


Logistics
----------------------------------------------------

A) This meeting time, Fri 12-1 EST is during Julian's weekend in
Israel. We agreed to move to Thursday 11-12 EST beginning Thurs Jan
14.

B) The next face-to-face meeting is proposed to be Wed Mar 31 8:30-12
in the morning, immediately before the Usenix FAST conference
(www.usenix.org/events/fast04) in the same hotel (Grand Hyatt, 345
Stockton Street, San Francisco, CA 94108, 415.398.1234, 1.800.633.7313,
http://grandsanfrancisco.hyatt.com/property/index.jhtml). We are
seeking USENIX help for setting this up (Peter Honeyman is asking
USENIX for help). FAST starts at 2pm Wed. Its sister conference, NSDI
(Network System Design and Implementation) is being held in the same
hotel Mon morning until Wed noon. It is proposed to hold this meeting
as a BOF, open and advertised in one or both conferences.


Requirements group action items
----------------------------------------------------

Garth asks all contributors to strive to include use cases, application
areas and other enduser oriented justification in all requirements
deliverables.


1) Problem statement Informational Internet Draft as a vehicle to
communicate to IETF
- Timeline: we would like our topic to be considered for an agenda at
Seoul Feb 29 - March 5 59th IETF meeting
- Deadline: IETF deadline is approximately Feb 7, working backwards and
allowing time for communication and errors, we plan to set a
within-the-group deadline for last comments on the document end of day
Jan 30
- Purpose: to explain the problem we seek to fix, why it should be
fixed and why it should be done in the IETF
- Example: "RDMA over IP Problem Statement",
draft-ietf-rddp-problem-statement-02, by Allyn Romanow, Jeff Mogul, Tom
Talpey, Stephen Bailey with help from Jeff Chase and Jim Pinkerton
(ftp://ftp.rfc-editor.org/in-notes/internet-drafts/draft-ietf-rddp-
problem-statement-02.txt)
- Audience: skill set like us, background more varied, expect to be
persuaded, although the academic citation list of the example is much
more than we need, we should add about three-five pages of good content
to the boilerplate internet draft document structure

See discussion below.


2) Elevator pitch for general external communication
- Purpose: ensure that the members of our community attending the Seoul
IETF are equipped with the essentials (at least David Black, Tom Talpey
and Julian Satran are attending Seoul)
- Deadline: Feb 19, first day of Connectathon 2004 (Feb 19-26, 180 Park
Ave., San Jose, CA 95113 San Jose, CA, www.connectathon.org), which I
guess creates another opportunity for a (subgroup?) face-to-face

Garth proposed, subset from his NEPS position paper: "striping all the
way to the clients; providing scalable bandwidth, scalable capacity,
load balancing and capacity balancing for federated servers and
consolidated storage." Additionally, there are a number of products
for a good range of companies offering proprietary solutions in this
area, some of which employ/extend/supplant the IETF's NFS and iSCSI, so
an IETF effort building on or working within NFS and iSCSI seems very
natural and compelling.

Some comments:
Black: striping is not inherent, prefers "direct access"
Garth: Direct access is a loaded term for NFS and IETF: for us it is
moving data from multiple servers to one client without proxying it
through one server network endpoint, while direct access for DAFS and
RDMA has to do with eliminating copies in the memory system of any
client or server.
Satran: you can give up load balancing and capacity balancing since
these are means and not ends.
Black: don't want to focus on one sentence, multiple sentences may
prove better.
Julian: what about security?
Black: would not bring it here in the elevator, but in the problem
statement section on "why IETF"
Multiple folks: Scalable capacity and scalable bandwidth are the core
ideas.
With respect to "federated storage and consolidated storage", federated
and consolidated are loaded words
Black: propose something more general as scalable storage systems
Multiple: given the many uses of scalable, probably just "storage
systems"


3) Slide deck for face-to-face external communications
- Purpose: if members of our community attending the Seoul IETF have a
chance to present to the NFS (or other) working groups, we should equip
them with a presentation of the problem statement document; this will
also be useful at the FAST BOF
- Example: ROI-Problem-Scenario-0302.ppt, presented by David Black at
the second IETF BOF on RDMA as a synopsis of the corresponding problem
statement, particularly because the first BOF had not achieved its
goals
- Deadline: Feb 19, same as elevator pitch


4) Draft requirements document
- Purpose: a working document to capture and justify the group's
decisions what to do and what not to do
- Timeline: not clear yet

Most of the discussions going on in the pNFS-reqs mailing list are
addressing issues that belong in this document. This is great, but
should not be confused with the problem statement. The problem
statement is for external communication, justifying the effort to
standardize something, summarizing the commonality achieved at the NEPS
workshop; while the requirements document is for resolution of issues,
and may not be complete until the standard draft is pretty much fleshed
out.


Discussion on Problem Statement
----------------------------------------------------

Garth: Lets start with comments on the beginnings of a problem
statement that Garth (Dec 10) and Gary Grider (Dec 12) contributed to
this group.
Garth: Here is what I put out to start the conversation, based on
nearly copying the RDDP problem statement abstract and table of
contents.
> A possible pNFS problem statement abstract:
> This draft addresses an NFS-based solution to the problem of high
> system costs due to store-and-forward copying of storage data from
> storage devices through a file server mount point to high-speed
> end-hosts that also have connectivity to source storage devices. The
> problem is due to the high cost of funneling large storage bandwidths
> through NFS on single IP addresses, and it can be substantially
> improved using "out-of-band access." The high cost of high-bandwidth
> NFS servers has limited the use of NFS in data centers especially
> where high storage bandwidths are required and numerous storage
> serving devices are already networked together.
>
> A pNFS table of contents might be:
> 1. Introduction
> 2. The high cost of high bandwidth storage through NFS
> 2.1 Out-of-band access decreases bandwidth requirements in central
> file servers
> 3. Application level routing of storage data packets is the root cause
> of the problem
> 4. Storage bandwidth bottlenecks are problematic for many key file
> system applications
> 5. Out-of-band access techniques
> 5.1 A conceptual framework: pNFS delegated maps for distributing files
> over SBC, OSD and NFS storage subsystems
> 6. Security considerations
> 7. Acknowledgements
> 8. Informative references

Garth: I started with the RDDP problem statement. RDDP affected the
design of servers therefore affected their costs. So they pitched a
cost problem with the design of communication protocol going forward in
time.
Garth: "Store and forward copying through the ip address that you
mounted" problem is a cost problem.
Talpey: I wouldn't lead with cost. RDDP is about system overhead.
Garth: by analogy the system overhead we are avoiding is the forwarding
all the data packets through a single IP address, a single server
endpoint (NFS mount)
Tom: elevator pitch for RDDP: data copy costs cycle and bus bandwidth,
avoiding data copy scales servers. In our case we have a bottleneck
moving the data through one point (Tom recommends avoid the term
"single IP address")
Garth: RDDP references moore's law. For us client and server machines
both follow moore's law, but the rate of growth of the number of
clients making demands on the servers is causing demand to exceed
server bandwidth.
Tom: be careful with mentioning Moore's law
Corbett: missing one point of "the clients are focusing on a narrow
part of the dataset"
Garth: The cluster phenomena drives the demand way ahead of the server
supply. [people liked the term "cluster phenomena"]
Tom: the fundamental thing is scaling access to a single object.
Garth: the market for that may be too small.
Tom: it was said that "RDDP is good only for databases" so, I agree, be
careful for narrowing down the scope of applicability.
Talpey: there are existing solution like trunking the NFS protocol to
achieve scalable bandwidth to multiple files but you still have the
single server issue. the real problem is achieving scalable bandwidth
to a single file.
Garth: this seems to narrow, as Corbett's NEPS presentation argues, the
same problem exists for "close" collections of files like a single
directory even if no file is itself spread over multiple servers --
SNIA talks about virtualizing a file (spreading the parts of one file)
and virtualizing a file system (spreading the parts of one volume)
while preserving the manageability implied by one file server
Tom: "out of band" is a bad label because it already means certain
things.
Garth: I didn't use "direct access" because of prior and different
definition by DAFS -- We need a new word for what we're proposing.
Multiple: separation of data and control is good -- maybe some variant
of this gives us the words we need: separated data path, parallel data
path

From Brian.Pawlowski@netapp.com Thu Jan 15 03:55:52 2004
Return-Path: <beepy@netapp.com>
X-Sender: beepy@netapp.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 3528 invoked from network); 15 Jan 2004 11:55:50 -0000
Received: from unknown (66.218.66.172)
by m14.grp.scd.yahoo.com with QMQP; 15 Jan 2004 11:55:50 -0000
Received: from unknown (HELO mx01.netapp.com) (198.95.226.53)
by mta4.grp.scd.yahoo.com with SMTP; 15 Jan 2004 11:55:50 -0000
Received: from hawk.corp.netapp.com (hawk [10.57.156.122])
by mx01.netapp.com (8.12.10/8.12.10/NTAP-1.4) with ESMTP id i0FBtnKw023061
for <pnfs-reqs@yahoogroups.com>; Thu, 15 Jan 2004 03:55:49 -0800 (PST)
Received: from tooting-fe.eng.netapp.com (tooting-fe.eng.netapp.com [10.56.10.118])
by hawk.corp.netapp.com (8.12.9/8.12.9/NTAP-1.5) with ESMTP id i0FBtnSR002121
for <pnfs-reqs@yahoogroups.com>; Thu, 15 Jan 2004 03:55:49 -0800 (PST)
Received: (from beepy@localhost)
by tooting-fe.eng.netapp.com (8.11.6+Sun/8.11.6) id i0FBtnl04249
for pnfs-reqs@yahoogroups.com; Thu, 15 Jan 2004 03:55:49 -0800 (PST)
Message-Id: <200401151155.i0FBtnl04249@tooting-fe.eng.netapp.com>
In-Reply-To: <1B419D86-472B-11D8-AC67-000A95A94F04@panasas.com> from Garth Gibson at "Jan 14, 4 11:19:07 pm"
To: pnfs-reqs@yahoogroups.com
Date: Thu, 15 Jan 2004 03:55:48 -0800 (PST)
X-Mailer: ELM [version 2.4ME++ PL40 (25)]
MIME-Version: 1.0
Content-Type: text/plain; charset=US-ASCII
Content-Transfer-Encoding: 7bit
X-eGroups-Remote-IP: 198.95.226.53
X-eGroups-From: Brian Pawlowski <beepy@netapp.com>
From: Brian Pawlowski <Brian.Pawlowski@netapp.com>
Subject: Re: [pnfs-reqs] pNFS requirements concall 2004-01-09 12-1 EST notes
X-Yahoo-Group-Post: member; u=169504717

ADVERTISEMENT
> A) This meeting time, Fri 12-1 EST is during Julian's weekend in
> Israel. We agreed to move to Thursday 11-12 EST beginning Thurs Jan
> 14.

You meant the other Thursday Jan 14.

From julian_satran@il.ibm.com Mon Jan 19 00:48:33 2004
Return-Path: <Julian_Satran@il.ibm.com>
X-Sender: Julian_Satran@il.ibm.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 72665 invoked from network); 19 Jan 2004 08:48:31 -0000
Received: from unknown (66.218.66.172)
by m17.grp.scd.yahoo.com with QMQP; 19 Jan 2004 08:48:31 -0000
Received: from unknown (HELO mtagate5.de.ibm.com) (195.212.29.154)
by mta4.grp.scd.yahoo.com with SMTP; 19 Jan 2004 08:48:30 -0000
Received: from d12relay01.megacenter.de.ibm.com (d12relay01.megacenter.de.ibm.com [9.149.165.180])
by mtagate5.de.ibm.com (8.12.10/8.12.10) with ESMTP id i0J8mSe2117726;
Mon, 19 Jan 2004 08:48:28 GMT
Received: from d10ml001.telaviv.ibm.com (d12av02.megacenter.de.ibm.com [9.149.165.228])
by d12relay01.megacenter.de.ibm.com (8.12.10/NCO/VER6.6) with ESMTP id i0J8mRJY181632;
Mon, 19 Jan 2004 09:48:27 +0100
In-Reply-To: <CF94E7DF-31AA-11D8-996E-000393754F12@panasas.com>
To: pnfs-reqs@yahoogroups.com
Cc: pNFS Operations <pnfs-ops@yahoogroups.com>,
pNFS Requirements <pnfs-reqs@yahoogroups.com>
MIME-Version: 1.0
X-Mailer: Lotus Notes Release 6.5 September 26, 2003
Message-ID: <OFC7A57BC7.593A949B-ONC2256E1F.0060FAE6-88256E20.00305ED2@il.ibm.com>
Date: Mon, 19 Jan 2004 00:48:22 -0800
X-MIMETrack: Serialize by Router on D10ML001/10/M/IBM(Release 6.0.2CF2|July 23, 2003) at
19/01/2004 10:48:27,
Serialize complete at 19/01/2004 10:48:27
Content-Type: multipart/alternative; boundary="=_alternative 0061A2A0C2256E1F_="
X-eGroups-Remote-IP: 195.212.29.154
From: Julian Satran <julian_satran@il.ibm.com>
Subject: Re: [pnfs-reqs] Re: [pnfs-ops] pNFS Discussion Summary 1: 12/18/03: subtopic: proxying
X-Yahoo-Group-Post: member; u=64714603
X-Yahoo-Profile: julian_satran

Garth Gibson <garth@panasas.com> wrote on 19/12/2003 00:37:50:

> Thanks Dave.  I agree.  Lets refine the proxying issues: Legacy,
> strict, functional and recovery proxying.
>
> [1.1.0 Legacy proxying]: an NFS-v4.x server must be able to execute the
> full NFS-v4.0 or NFS-v4.1 protocol.
>
> I think Dave has given the case for this strongly.  I do not see any
> case against this.
>
> -------------------------------------------
>
> [1.1.1 Strict proxying]: does an NFS-v4.x server have to be able to
> execute exactly the wire packet that an NFS-v4.x client might have sent
> to a SBC/OSD/NFS data server?
>
> This captures the notion that a metadata server must also be a
> store-and-forward proxy for every data server it manages.  It requires
> NFS-v4.x servers implement SCSI SBC over FC, if their data servers
> implement it; and the same for objects and files.
>
> This only makes sense to me for NFS data servers.  And it is not what I
> intended in my prior summary, although it is a relevant question.  I
> would say that pNFS requirements not require Strict Proxying.
>

Agree

> -------------------------------------------
>
> [1.1.2 Functional proxying]: a file transformation achievable by an
> NFS-v4.x client using a set of data server operations must be a
> equivalently achievable using a (probably different) set of NFS-v4.x
> server operations
>
> This is the topic I intended to address in the last email.  I believe
> Dave is arguing that even with metadata servers that do not have access
> to their data servers, the vendor of such a metadata server can
> construct a proprietary protocol for the metadata server to (strict)
> proxy data server accesses through clients that do have data server
> access.  I am not comfortable making up a counter to this, so I exhort
> those that want a metadata server without data server access to speak
> up if they disagree.
>
> > On one hand, some suggest that a set of out-of-band clients should not
> > have to also have a data path through the NFSv4 metadata server.  One
> > reason is that customers may not tolerate the large variability in
> > performance between out-of-band (when the going is good) and in-band
> > (when the server chooses not to grant or to take away a delegation)
> > accesses.  Another reason, and I paraphrase someone else here, is that
> > it is possible to construct out-of-band metadata servers that do not
> > have access to the data servers except through the clients -- I
> > encourage the source of this scenario to replace my paraphrasing with
> > a correct use case, because I find it odd to design for file servers
> > that do not have access to the data servers.
> >
> > On the other hand, others have suggested that any access or work that
> > a client can do out-of-band should be possible with one or more
> > commands applied to the metadata server's data path.  This has been
> > proposed for coping with recalled delegations, including concurrent
> > writing by multiple clients; retry after client access errors,
> > provided adequate idempotency of out-of-band operations; and many
> > alternative implementations of out-of-band clients, including legacy
> > clients that use out-of-band never or rarely.
> >
> > I think this is a topic that should be argued one way or the other in
> > the requirements document.  Use cases and examples in other systems
> > would be best.
>

I guess that proxying through a client should be recomended but not mandated.
We might the want to find how to do it while respecting restrictions removed the metadata server from the path.

> -------------------------------------------
>
> [1.1.3 Recovery proxying]: a file transformation begun by an NFS-v4.x
> client using a set of data server operations, but interrupted before
> completion, must be equivalently completable using a (probably
> different) set of NFS-v4.x server operations
>
> Some have suggested that having this property will greatly simplify the
> amount of spec that is devoted to out-of-band error recovery.  Others
> have commented that a simple way to achieve this would be to require
> that all operations on data servers should be idempotent.
>
> -------------------------------------------
>
> garth
>
>
> On Thursday, December 18, 2003, at 12:21  PM, Noveck, Dave wrote:
>
> > Good summary.
> >
> > I want to address the "proxying" issue.
> >
> >> [1.1 Proxying]: Operations/work that can only be done out-of-band vs
> >> alternative access through the NFSv4 server for all operations/work
> >
> > If you are talking about operations in the extension (let's call it
> > NFS-v4.x), that are not in the previous minor version (let's assume
> > that is nfs-v4.1), then you have a choice of whether these are
> > supported
> > for access through the server, or only for access by the client with
> > the
> > data server.  Let's call this the issue of proxying in the strict
> > sense.
> >
> > There is another issue that people are calling "proxying" but is really
> > logically distinct.  That is the issue of access by the previous minor
> > version, e.g. nfs-v4.0 or nfs-v4.1.  Those versions have no concept of
> > separate data servers and they need to be able to work.  End of story.
> > If you can't read files stored in nfs-v4.x with nfs-v4.0, you do not
> > have a minor version without proxying.  You don't have a minor version
> > at all.  I believe the working group is never going to accept that.
> > Even if I'm wrong and you can get the working group to accept that,
> > it is going to be very contentious and thus take up a lot of time.
> > Anybody, who really wants to go down this path should seriously
> > consider
> > the trade-off between supporting something they find objectionable and
> > getting a standard a lot later, if at all.
> >
> >> On one hand, some suggest that a set of out-of-band clients should not
> >> have to also have a data path through the NFSv4 metadata server.  One
> >> reason is that customers may not tolerate the large variability in
> >> performance between out-of-band (when the going is good) and in-band
> >> (when the server chooses not to grant or to take away a delegation)
> >> accesses.
> >
> > Then such customers will use clients that access things out-of-band
> > whenever possible, and servers that never refuse to give out layout
> > delegations.  You have a number of quality-of-implementations issues
> > for v4.x clients and servers.  If a particular client only supports
> > access via v4.0, then performance will suck, and the working group
> > will understand that, but it won't accept not being able to use
> > v4.0 at all.  The customer is going to be motivated to upgrade his
> > clients for those that need high-performance access, but he may be
> > OK with some clients using v4.0 for a long time, depending on the
> > particular performance those clients need.  (And some will want v2/v3
> > access but that is a matter that the working group has no say about).
> >
> >> Another reason, and I paraphrase someone else here, is that
> >> it is possible to construct out-of-band metadata servers that do not
> >> have access to the data servers except through the clients -- I
> >> encourage the source of this scenario to replace my paraphrasing with
> >> a
> >> correct use case, because I find it odd to design for file servers
> >> that
> >> do not have access to the data servers.
> >
> > So let's grant that it is possible (and we'll pass over the issue of
> > whether it is desirable, and in fact so desirable that one is willing
> > to
> > not get a standard and or get it much later).
> >
> > So we have a metadata server and it, for whatever reason, does not have
> > access to the data servers.  However, by hypothesis, there are machines
> > (e.g. clients), that can communicate with both.  So, if one has such an
> > architecture, then one can take such a machine, give it a
> > communication path
> > to the meta-data server and the data server and have the meta-data
> > server
> > transfer v4.0 READ requests to it, let it read the data from the data
> > server and send it back to the meta-data server who send it back to the
> > original requestor.  Is that a very good solution?  No.  Is it likely
> > to be performant?  No.  Will it satisfy any particular customer?  I
> > don't
> > know and that is the implementer's business decision.  Will it satisfy
> > the hypothetical customer who doesn't care about v4.0 access?  Clearly.
> > Will it satisfy the v4 working group?  Yes, because they are not in the
> > business of telling you how performant v4.0 access has got to be.
> >
> >> On the other hand, others have suggested that any access or work that
> >> a
> >> client can do out-of-band should be possible with one or more commands
> >> applied to the metadata server's data path.  This has been proposed
> >> for
> >> coping with recalled delegations, including concurrent writing by
> >> multiple clients; retry after client access errors, provided adequate
> >> idempotency of out-of-band operations; and many alternative
> >> implementations of out-of-band clients, including legacy clients that
> >> use out-of-band never or rarely.
> >
> > This effort is going to take a while, but if we manage it correctly, it
> > is not going to take so long that v3 clients are going to be rare
> > things,
> > and they have to be supported.  But v3 clients are not an issue for the
> > working group.  V4.0 clients are and they will be around and you will
> > have to support them, and I believe the working group is not going to
> > be disposed to cut you a lot of slack on this issue (and I don't see
> > why it should).
> >
> >> I think this is a topic that should be argued one way or the other in
> >> the requirements document.  Use cases and examples in other systems
> >> would be best.
> >
> > I think the requirement should be that this work should be done as a
> > set of extensions to nfs-v4 delivered as a v4 minor version.  If there
> > is some feature/requirement that conflicts with that model (and it is a
> > pretty flexible one), then you have to think long and hard before
> > deciding
> > that that requirement is more important than this basic deivery
> > vehicle,
> > because it seems to me that it is, in almost all respects, the ideal
> > way
> > to make this sort of technology available for widespread use.
> >
> >
> >
> >
> >
> >
> > To unsubscribe from this group, send an email to:
> > pnfs-ops-unsubscribe@yahoogroups.com
> >
> >
> >
> > Yahoo! Groups Links
> >
> > To visit your group on the web, go to:
> >  http://groups.yahoo.com/group/pnfs-ops/
> >
> > To unsubscribe from this group, send an email to:
> >  pnfs-ops-unsubscribe@yahoogroups.com
> >
> > Your use of Yahoo! Groups is subject to:
> >  http://docs.yahoo.com/info/terms/
> >
>
>
> To unsubscribe from this group, send an email to:
> pnfs-reqs-unsubscribe@yahoogroups.com
>
>  
>
> Yahoo! Groups Links
>
> To visit your group on the web, go to:
>  http://groups.yahoo.com/group/pnfs-reqs/
>
> To unsubscribe from this group, send an email to:
>  pnfs-reqs-unsubscribe@yahoogroups.com
>
> Your use of Yahoo! Groups is subject to:
>  http://docs.yahoo.com/info/terms/
>
> 

From julian_satran@il.ibm.com Mon Jan 19 00:48:44 2004
Return-Path: <Julian_Satran@il.ibm.com>
X-Sender: Julian_Satran@il.ibm.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 73823 invoked from network); 19 Jan 2004 08:48:44 -0000
Received: from unknown (66.218.66.217)
by m13.grp.scd.yahoo.com with QMQP; 19 Jan 2004 08:48:44 -0000
Received: from unknown (HELO mtagate7.de.ibm.com) (195.212.29.156)
by mta2.grp.scd.yahoo.com with SMTP; 19 Jan 2004 08:48:42 -0000
Received: from d12relay01.megacenter.de.ibm.com (d12relay01.megacenter.de.ibm.com [9.149.165.180])
by mtagate7.de.ibm.com (8.12.10/8.12.10) with ESMTP id i0J8mVRm094584;
Mon, 19 Jan 2004 08:48:31 GMT
Received: from d10ml001.telaviv.ibm.com (d12av02.megacenter.de.ibm.com [9.149.165.228])
by d12relay01.megacenter.de.ibm.com (8.12.10/NCO/VER6.6) with ESMTP id i0J8mTJY231014;
Mon, 19 Jan 2004 09:48:30 +0100
In-Reply-To: <20040105182738.07168207D3@citi.umich.edu>
To: pnfs-reqs@yahoogroups.com
Cc: andros@citi.umich.edu, pnfs-ops@yahoogroups.com, pnfs-reqs@yahoogroups.com
MIME-Version: 1.0
X-Mailer: Lotus Notes Release 6.5 September 26, 2003
Message-ID: <OFBB68BEED.8C13F0B4-ONC2256E1F.00620475-88256E20.0030601E@il.ibm.com>
Date: Mon, 19 Jan 2004 00:48:25 -0800
X-MIMETrack: Serialize by Router on D10ML001/10/M/IBM(Release 6.0.2CF2|July 23, 2003) at
19/01/2004 10:48:29,
Serialize complete at 19/01/2004 10:48:29
Content-Type: multipart/alternative; boundary="=_alternative 006221DBC2256E1F_="
X-eGroups-Remote-IP: 195.212.29.156
From: Julian Satran <julian_satran@il.ibm.com>
Subject: Re: [pnfs-reqs] Re: [pnfs-ops] why not use mandatory byte-range locking
X-Yahoo-Group-Post: member; u=64714603
X-Yahoo-Profile: julian_satran

I think that Andy has strong arguments. Julo


"William A.(Andy) Adamson" <andros@citi.umich.edu>

05/01/2004 20:27
Please respond to
pnfs-reqs

	
To
	pnfs-ops@yahoogroups.com
cc
	pnfs-reqs@yahoogroups.com, andros@citi.umich.edu
Subject
	[pnfs-reqs] Re: [pnfs-ops] why not use mandatory byte-range locking

	




> Andy Adamson wrote:
> > the discussion of byte-range delegations and cache consistancy provoked this
> > thought: why not use existing mandatory byte-range locking?
>
> > the client opens a file, requests a (mandatory) lock on the region of the file
> > it's interested in. the resultant lock stateid is passed as an argument to the
> > READ/WRITE_IND request. we can require a mandatory lock stateid prior to
> > handing out layout maps for direct i/o. the layout map is 'good 'only for as
> > long as the byte-range lock.
>
> One problem is that there is no way for the client to specify that he wants
> a mandatory (as opposed to advisory) byte-range lock, he just asks for one
> and the server gives him the type of byte-range that server is giving out
> for that fs.  So, if you did that, applications that relied on the semantics
> (or lack of semantics) of advisory byte-range locks would break.

 From 3530
5.11.5.  Mode Attribute
      ......

  Note that in UNIX, if a file has the MODE4_SGID bit set and no
  MODE4_XGRP bit set, then READ and WRITE must use mandatory file
  locking.


so for unix, there is a way to specify mandatory vrs advisory locking. since
this is also in 3530:
8.  File Locking and Share Reservations
 .....
  These mechanisms can implement policy ranging from advisory only locking to
full    mandatory locking.

adding a flag to a LOCK/T/U to indicate mandatory locking vrs advisory is
within reason.


> Another issue is that while you say "'good' only for as long as the byte-range
> lock", the results of doing this are that the layout map and the data will
> be fixed for at least as long as the byte-range lock exists, i.e. sometimes
> too long.  If I'm going to be reading directly from the data server, then
> I want the layout to stay constant for a long time, or at least I don't want
> to be forced to repeatedly get locks for small areas of the layout.  The
> obvious (and desirable) thing for me to do is to get a shared lock for the
> whole file so the layout cannot change, but if we combine changes of layout
> and changes of data under a single sort of lock, mandatory byte-range locks
> in this case, we have stopped anybody writing in the file for a very long
> time, i.e. essentially forever since my lease will normally be continually
> renewed.
>
> When you combine a guarantee that the layout will not change with a guarantee
> that the data will not change, in such a way that they can't be separated,
> you artificially increase the amount of conflicts, in many cases to an
> unacceptable level.

perhaps i'm missing something, but isn't it the case the layout of the data
and the abilty to access the data are totally bound together?  the layout
changes due to writes and appends (other??). if the layout changes, stale
layout maps are not only no longer any good, they can lead to data corruption.
it seems to me that the guarentee that the layout won't change is bound to the
guarentee that the data won't change. i can't think of any conflicts such as
you mention - could you give some examples?


> When you have a delegation model, the problem is
> excessive recalls, while when you have a locking model the problem is that
> some applications will slow to a crawl/halt.
>
> > the mandatory lock protects the layout, so no need for layout delegations.
> > mandatory locking also allows the client to cache and operate locally on the
> > locked data region with cache consistancy guarentees.
>
> If you are going to be doing some local operation, then short-term mandatory
> byte-range locking can help you.  If need to do a lock/fetch/update/write/unlock
> cycle on a record, this is the ticket (and in v4 lock/fetch and write/unlock
> can be COMPOUND's :-).  The record you hold while updating can be considered
> cached for that brief period.  If, however, you are caching data generally,
> i.e. for a period outside the range of a short operation sequence, you are
> going to need something that is delegation-like, in that if I have the
> cached data and want to keep it until there is some reason to get rid of
> it, i.e. it is LRU'd out or there is a conflict, then I have to have some
> way of finding out that there is a conflict.  Delegations do that via a
> recall and one can imagine it being done other ways.  But the mandatory
> lock model is that I have a lock because I need it and so there is no
> provision to tell me that someone else has a conflict.  The logic is that
> he will wait until I give the lock up, and waiting for the cached data to
> be LRU'd is going to be too long in most cases.
>

i agree that the ability for the server to recall is a required feature. i'm
simply suggesting that mandatory locking may have more features in common with
what we need for pnfs than delgations, and that we could extend the existing
mandatory byte-range locking model with fewer changes than extending the
existing delegation model.

so, how about estending the mandatory locking model with a recall mechanism?
                                   
>
> > we already have the byte-range locking code written.
>
> I only have advisory byte-range locking code written.  Who has v4 mandatory
> byte-range locking implemented?
>
> > so how far does this get
> > us? does it make sense to start with the locking code instead of the
> > delegation as far as extenstions?
>
> I think if we define some form of byte-range delegations (at least for data
> and maybe for layout as well), there is going to be lots of code sharing with
> an existing mandatory byte-range locking implementation.  The data structures
> and many of the interfaces are going to be the same.  

and this is really why i brought this up. a new lock type that has the
features we desire (e.g. a recall mechanism) makes sense to me.

> The difference is going
> to be what you do about conflicts.  Instead of saying to the second claimant,
> "You snoozed so you lose", in some cases you have to be prepared to recall the
> delegation so that, for example, an otherwise unexceptionable write can proceed.
 





------------------------ Yahoo! Groups Sponsor ---------------------~-->
Upgrade to 128-bit SSL Security!
http://us.click.yahoo.com/qZ0LdD/yjVHAA/TtwFAA/W6uqlB/TM
---------------------------------------------------------------------~->

Yahoo! Groups Links

To visit your group on the web, go to:
http://groups.yahoo.com/group/pnfs-reqs/

To unsubscribe from this group, send an email to:
pnfs-reqs-unsubscribe@yahoogroups.com

Your use of Yahoo! Groups is subject to:
http://docs.yahoo.com/info/terms/

From julian_satran@il.ibm.com Mon Jan 19 00:49:47 2004
Return-Path: <Julian_Satran@il.ibm.com>
X-Sender: Julian_Satran@il.ibm.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 8402 invoked from network); 19 Jan 2004 08:49:44 -0000
Received: from unknown (66.218.66.172)
by m1.grp.scd.yahoo.com with QMQP; 19 Jan 2004 08:49:44 -0000
Received: from unknown (HELO mtagate3.de.ibm.com) (195.212.29.152)
by mta4.grp.scd.yahoo.com with SMTP; 19 Jan 2004 08:49:43 -0000
Received: from d12relay02.megacenter.de.ibm.com (d12relay02.megacenter.de.ibm.com [9.149.165.196])
by mtagate3.de.ibm.com (8.12.10/8.12.10) with ESMTP id i0J8mPHI114860;
Mon, 19 Jan 2004 08:48:25 GMT
Received: from d10ml001.telaviv.ibm.com (d12av02.megacenter.de.ibm.com [9.149.165.228])
by d12relay02.megacenter.de.ibm.com (8.12.10/NCO/VER6.6) with ESMTP id i0J8mNmx112370;
Mon, 19 Jan 2004 09:48:24 +0100
In-Reply-To: <C8CF60CFC4D8A74E9945E32CF096548AB80A98@silver.nane.netapp.com>
To: pnfs-reqs@yahoogroups.com
Cc: pnfs-ops@yahoogroups.com, pnfs-reqs@yahoogroups.com
MIME-Version: 1.0
X-Mailer: Lotus Notes Release 6.5 September 26, 2003
Message-ID: <OF07A0F4D1.989233AA-ONC2256E1F.00601943-88256E20.00305D98@il.ibm.com>
Date: Mon, 19 Jan 2004 00:48:19 -0800
X-MIMETrack: Serialize by Router on D10ML001/10/M/IBM(Release 6.0.2CF2|July 23, 2003) at
19/01/2004 10:48:23,
Serialize complete at 19/01/2004 10:48:23
Content-Type: multipart/alternative; boundary="=_alternative 0060D9BEC2256E1F_="
X-eGroups-Remote-IP: 195.212.29.152
From: Julian Satran <julian_satran@il.ibm.com>
Subject: Re: [pnfs-reqs] RE: [pnfs-ops] pNFS Discussion Summary 1: 12/18/03
X-Yahoo-Group-Post: member; u=64714603
X-Yahoo-Profile: julian_satran

"Noveck, Dave" <dnoveck@netapp.com> wrote on 30/12/2003 22:08:02:

> It seems legal to me but I'm guessing that there are others that would
> think differently.
>  
> I tend to think that it is not a good idea, though.  There are going
> to be operations which, by their nature, are better done through the
> metadata server.  A two-byte write which spans multiple data servers
> is an example.  Another is append-writes, which have been mentioned
> (by whom I don't remember just now) as a desirable v4 extension,
> assuming the data to be written is of reasonable size.  In each case,
> we may create appropriate caching/locking primitives to allow the
> operation to be done without making any request of the metadata server
> that is officially denominated an "IO" request.  But can you really
> argue that this will be the best way for the client to do such
> operations?  And does it really make sense to force clients to invest
> the effort in terms of the code do such operations doing the IO with
> the data server only, when the performance benefit of that is going to
> be small, or zero, or negative?  You may wind up making as many
> requests of the meta-data server with the data-server-only approach.  
> It's just that they won't be IO operations (but instead locking and,
> in the case of append, getattr operations).
>

This can be argued both ways. For applications that share little and build files by append (all transaction loggers) doing them on the client is a distinct advantage. And so iit is for object storage that supports append.

> In complicated protocols (and v4 is a complicated protocol and is
> getting more complicated), there are going to be multiple ways of
> doing the same thing, which are going to differ in their performance
> characteristics.  An organization can be reasonably concerned about
> clients making the wrong choice, just as it is concerned about clients
> that are making excessive resource demands for other reasons.  There
> are two issues that I am worried about in taking such a drastic
> approach as simply refusing to support a valid piece of the protocol,
> even if that choice is made by the server administrator.  The first is
> that determining the better choice depends on a lot of variables and
> that a simple formula governing an option  (e.g. "IO through the
> metadata server is bad") is unlikely to completely match reality.  The
> second is that I-don't-like-your-IO-request-so-you-lose is kind of a
> blunt instrument to deal with the problem.
>  

I don't think this is a big issue or that the scenario I describe will be widely used but with Object Storage you may not have (or need to very  often) a channel between the metadata server and the data servers. This partial access scheme may be maintained also in block environments or federated filers for various reasons (security may be one - you don't trust your administrator with all the data).

> If you have identified some set of bad client practices, you can find
> the clients doing them, report the appropriate statistics, even, if
> the issue is critical, artificially give such clients (or specific
> requests) bad performance in a way that doesn't hurt other clients
> (unless they are waiting for the first set to do something.  Sigh!),
> by just delaying processing of their requests by millisecond or two.  
> That should be enough to preserve metadata-server bandwidth for more
> worthwhile purposes.  If that's insufficiently discouraging, you can
> raise the delay.  If you start rejecting requests because you would
> have done it differently, even if you are correct, you are on the road
> to creating your own sub-protocol, which is why this kind of thing is
> worrying, even if legal.
>  
>  
> -----Original Message-----
> From: Julian Satran [mailto:julian_satran@il.ibm.com]
> Sent: Monday, December 22, 2003 5:26 AM
> To: pnfs-ops@yahoogroups.com
> Cc: pnfs-ops@yahoogroups.com; pnfs-reqs@yahoogroups.com
> Subject: RE: [pnfs-ops] pNFS Discussion Summary 1: 12/18/03

>
> Since I raised the issue of the metadata server not having access to
> all it's data servers (or at least not with adequate bandwidth) I feel
> compelled to say that Dave's arguments about supporting 4.0 are
> compelling enough to make it mandatory. The open issue is if it is
> legal for a "compliant server" to have serving data disabled by a
> local administrative function (the old "must implement but may use").
> Otherwise an organization that wants to discourage use of data serving
> through the metadata server has very little it can do to enforce
> policy in a way that will not affect other clients (it may do serve
> poorly but this still affects other clients).
>
> Julo
>

>
> "Noveck, Dave" <dnoveck@netapp.com>
> 18/12/2003 19:21
>
> Please respond to
> pnfs-ops@yahoogroups.com
>
> To
>
> <pnfs-ops@yahoogroups.com>, <pnfs-reqs@yahoogroups.com>
>
> cc
>
> Subject
>
> RE: [pnfs-ops] pNFS Discussion Summary 1: 12/18/03
>
>
>
>
> Good summary.
>
> I want to address the "proxying" issue.
>
> > [1.1 Proxying]: Operations/work that can only be done out-of-band vs
> > alternative access through the NFSv4 server for all operations/work
>
> If you are talking about operations in the extension (let's call it
> NFS-v4.x), that are not in the previous minor version (let's assume
> that is nfs-v4.1), then you have a choice of whether these are supported
> for access through the server, or only for access by the client with the
> data server.  Let's call this the issue of proxying in the strict sense.
>
> There is another issue that people are calling "proxying" but is really
> logically distinct.  That is the issue of access by the previous minor
> version, e.g. nfs-v4.0 or nfs-v4.1.  Those versions have no concept of
> separate data servers and they need to be able to work.  End of story.
> If you can't read files stored in nfs-v4.x with nfs-v4.0, you do not
> have a minor version without proxying.  You don't have a minor version
> at all.  I believe the working group is never going to accept that.
> Even if I'm wrong and you can get the working group to accept that,
> it is going to be very contentious and thus take up a lot of time.
> Anybody, who really wants to go down this path should seriously consider
> the trade-off between supporting something they find objectionable and
> getting a standard a lot later, if at all.
>
> > On one hand, some suggest that a set of out-of-band clients should not
> > have to also have a data path through the NFSv4 metadata server.  One
> > reason is that customers may not tolerate the large variability in
> > performance between out-of-band (when the going is good) and in-band
> > (when the server chooses not to grant or to take away a delegation)
> > accesses.  
>
> Then such customers will use clients that access things out-of-band
> whenever possible, and servers that never refuse to give out layout
> delegations.  You have a number of quality-of-implementations issues
> for v4.x clients and servers.  If a particular client only supports
> access via v4.0, then performance will suck, and the working group
> will understand that, but it won't accept not being able to use
> v4.0 at all.  The customer is going to be motivated to upgrade his
> clients for those that need high-performance access, but he may be
> OK with some clients using v4.0 for a long time, depending on the
> particular performance those clients need.  (And some will want v2/v3
> access but that is a matter that the working group has no say about).
>
> > Another reason, and I paraphrase someone else here, is that
> > it is possible to construct out-of-band metadata servers that do not
> > have access to the data servers except through the clients -- I
> > encourage the source of this scenario to replace my paraphrasing with a
> > correct use case, because I find it odd to design for file servers that
> > do not have access to the data servers.
>
> So let's grant that it is possible (and we'll pass over the issue of
> whether it is desirable, and in fact so desirable that one is willing to
> not get a standard and or get it much later).
>
> So we have a metadata server and it, for whatever reason, does not have
> access to the data servers.  However, by hypothesis, there are machines
> (e.g. clients), that can communicate with both.  So, if one has such an
> architecture, then one can take such a machine, give it a communication path
> to the meta-data server and the data server and have the meta-data server
> transfer v4.0 READ requests to it, let it read the data from the data
> server and send it back to the meta-data server who send it back to the
> original requestor.  Is that a very good solution?  No.  Is it likely
> to be performant?  No.  Will it satisfy any particular customer?  I don't
> know and that is the implementer's business decision.  Will it satisfy
> the hypothetical customer who doesn't care about v4.0 access?  Clearly.
> Will it satisfy the v4 working group?  Yes, because they are not in the
> business of telling you how performant v4.0 access has got to be.
>
> > On the other hand, others have suggested that any access or work that a
> > client can do out-of-band should be possible with one or more commands
> > applied to the metadata server's data path.  This has been proposed for
> > coping with recalled delegations, including concurrent writing by
> > multiple clients; retry after client access errors, provided adequate
> > idempotency of out-of-band operations; and many alternative
> > implementations of out-of-band clients, including legacy clients that
> > use out-of-band never or rarely.
>
> This effort is going to take a while, but if we manage it correctly, it
> is not going to take so long that v3 clients are going to be rare things,
> and they have to be supported.  But v3 clients are not an issue for the
> working group.  V4.0 clients are and they will be around and you will
> have to support them, and I believe the working group is not going to
> be disposed to cut you a lot of slack on this issue (and I don't see
> why it should).
>
> > I think this is a topic that should be argued one way or the other in
> > the requirements document.  Use cases and examples in other systems
> > would be best.
>
> I think the requirement should be that this work should be done as a
> set of extensions to nfs-v4 delivered as a v4 minor version.  If there
> is some feature/requirement that conflicts with that model (and it is a
> pretty flexible one), then you have to think long and hard before deciding
> that that requirement is more important than this basic deivery vehicle,
> because it seems to me that it is, in almost all respects, the ideal way
> to make this sort of technology available for widespread use.
>
>
>
>
>
>
> To unsubscribe from this group, send an email to:
> pnfs-ops-unsubscribe@yahoogroups.com
>
>
>
> Yahoo! Groups Links
>
> To visit your group on the web, go to:
> http://groups.yahoo.com/group/pnfs-ops/
>
> To unsubscribe from this group, send an email to:
> pnfs-ops-unsubscribe@yahoogroups.com
>
> Your use of Yahoo! Groups is subject to:
> http://docs.yahoo.com/info/terms/
>
>
>
>
>
> To unsubscribe from this group, send an email to:
> pnfs-ops-unsubscribe@yahoogroups.com
>
>
>
>
>
> Yahoo! Groups Links
> To visit your group on the web, go to:
> http://groups.yahoo.com/group/pnfs-ops/
>  
> To unsubscribe from this group, send an email to:
> pnfs-ops-unsubscribe@yahoogroups.com
>  
> Your use of Yahoo! Groups is subject to the Yahoo! Terms of Service.
>
>
> Yahoo! Groups Links
> To visit your group on the web, go to:
> http://groups.yahoo.com/group/pnfs-reqs/
>  
> To unsubscribe from this group, send an email to:
> pnfs-reqs-unsubscribe@yahoogroups.com
>  
> Your use of Yahoo! Groups is subject to the Yahoo! Terms of Service.

From pcorbett@netapp.com Mon Jan 19 11:52:20 2004
Return-Path: <Peter.Corbett@netapp.com>
X-Sender: Peter.Corbett@netapp.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 13972 invoked from network); 19 Jan 2004 19:52:19 -0000
Received: from unknown (66.218.66.218)
by m19.grp.scd.yahoo.com with QMQP; 19 Jan 2004 19:52:19 -0000
Received: from unknown (HELO mx01.netapp.com) (198.95.226.53)
by mta3.grp.scd.yahoo.com with SMTP; 19 Jan 2004 19:52:19 -0000
Received: from frejya.corp.netapp.com (frejya [10.57.157.119])
by mx01.netapp.com (8.12.10/8.12.10/NTAP-1.4) with ESMTP id i0JJprKw007257
for <pnfs-reqs@yahoogroups.com>; Mon, 19 Jan 2004 11:51:53 -0800 (PST)
Received: from svlexc01.hq.netapp.com (svlexc01.corp.netapp.com [10.10.22.171])
by frejya.corp.netapp.com (8.12.9/8.12.9/NTAP-1.5) with ESMTP id i0JJprpr011490
for <pnfs-reqs@yahoogroups.com>; Mon, 19 Jan 2004 11:51:53 -0800 (PST)
content-class: urn:content-classes:message
MIME-Version: 1.0
Content-Type: multipart/alternative;
boundary="----_=_NextPart_001_01C3DEC5.AEA79DEC"
X-MimeOLE: Produced By Microsoft Exchange V6.0.6249.0
Date: Mon, 19 Jan 2004 11:51:51 -0800
Message-ID: <C8CF60CFC4D8A74E9945E32CF096548A015BF27D@silver.nane.netapp.com>
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
Thread-Topic: [pnfs-reqs] Re: [pnfs-ops] pNFS Discussion Summary 1: 12/18/03: subtopic: proxying
Thread-Index: AcPeaQdFnqO5JJSFTaWLR7tAnrV/QgAXHQqQ
To: <pnfs-reqs@yahoogroups.com>
X-eGroups-Remote-IP: 198.95.226.53
X-eGroups-From: "Corbett, Peter" <Peter.Corbett@netapp.com>
From: "Corbett, Peter" <pcorbett@netapp.com>
Subject: RE: [pnfs-reqs] Re: [pnfs-ops] pNFS Discussion Summary 1: 12/18/03: subtopic: proxying
X-Yahoo-Group-Post: member; u=44152959
X-Yahoo-Profile: pfcorbett2004

Julian,
For some reason, your messages always come to me in a microscopic font that I'm finding harder and harder to read.  I don't know if this is the case for all recipients, or it is peculiar to my client.   I am using Outlook.  I've never seen it on mail from anybody else.
Peter

    -----Original Message-----
    From: Julian Satran [mailto:julian_satran@il.ibm.com]
    Sent: Monday, January 19, 2004 3:48 AM
    To: pnfs-reqs@yahoogroups.com
    Cc: pNFS Operations; pNFS Requirements
    Subject: Re: [pnfs-reqs] Re: [pnfs-ops] pNFS Discussion Summary 1: 12/18/03: subtopic: proxying



    Garth Gibson <garth@panasas.com> wrote on 19/12/2003 00:37:50:

    > Thanks Dave.  I agree.  Lets refine the proxying issues: Legacy,
    > strict, functional and recovery proxying.
    >
    > [1.1.0 Legacy proxying]: an NFS-v4.x server must be able to execute the
    > full NFS-v4.0 or NFS-v4.1 protocol.
    >
    > I think Dave has given the case for this strongly.  I do not see any
    > case against this.
    >
    > -------------------------------------------
    >
    > [1.1.1 Strict proxying]: does an NFS-v4.x server have to be able to
    > execute exactly the wire packet that an NFS-v4.x client might have sent
    > to a SBC/OSD/NFS data server?
    >
    > This captures the notion that a metadata server must also be a
    > store-and-forward proxy for every data server it manages.  It requires
    > NFS-v4.x servers implement SCSI SBC over FC, if their data servers
    > implement it; and the same for objects and files.
    >
    > This only makes sense to me for NFS data servers.  And it is not what I
    > intended in my prior summary, although it is a relevant question.  I
    > would say that pNFS requirements not require Strict Proxying.
    >

    Agree

    > -------------------------------------------
    >
    > [1.1.2 Functional proxying]: a file transformation achievable by an
    > NFS-v4.x client using a set of data server operations must be a
    > equivalently achievable using a (probably different) set of NFS-v4.x
    > server operations
    >
    > This is the topic I intended to address in the last email.  I believe
    > Dave is arguing that even with metadata servers that do not have access
    > to their data servers, the vendor of such a metadata server can
    > construct a proprietary protocol for the metadata server to (strict)
    > proxy data server accesses through clients that do have data server
    > access.  I am not comfortable making up a counter to this, so I exhort
    > those that want a metadata server without data server access to speak
    > up if they disagree.
    >
    > > On one hand, some suggest that a set of out-of-band clients should not
    > > have to also have a data path through the NFSv4 metadata server.  One
    > > reason is that customers may not tolerate the large variability in
    > > performance between out-of-band (when the going is good) and in-band
    > > (when the server chooses not to grant or to take away a delegation)
    > > accesses.  Another reason, and I paraphrase someone else here, is that
    > > it is possible to construct out-of-band metadata servers that do not
    > > have access to the data servers except through the clients -- I
    > > encourage the source of this scenario to replace my paraphrasing with
    > > a correct use case, because I find it odd to design for file servers
    > > that do not have access to the data servers.
    > >
    > > On the other hand, others have suggested that any access or work that
    > > a client can do out-of-band should be possible with one or more
    > > commands applied to the metadata server's data path.  This has been
    > > proposed for coping with recalled delegations, including concurrent
    > > writing by multiple clients; retry after client access errors,
    > > provided adequate idempotency of out-of-band operations; and many
    > > alternative implementations of out-of-band clients, including legacy
    > > clients that use out-of-band never or rarely.
    > >
    > > I think this is a topic that should be argued one way or the other in
    > > the requirements document.  Use cases and examples in other systems
    > > would be best.
    >

    I guess that proxying through a client should be recomended but not mandated.
    We might the want to find how to do it while respecting restrictions removed the metadata server from the path.

    > -------------------------------------------
    >
    > [1.1.3 Recovery proxying]: a file transformation begun by an NFS-v4.x
    > client using a set of data server operations, but interrupted before
    > completion, must be equivalently completable using a (probably
    > different) set of NFS-v4.x server operations
    >
    > Some have suggested that having this property will greatly simplify the
    > amount of spec that is devoted to out-of-band error recovery.  Others
    > have commented that a simple way to achieve this would be to require
    > that all operations on data servers should be idempotent.
    >
    > -------------------------------------------
    >
    > garth
    >
    >
    > On Thursday, December 18, 2003, at 12:21  PM, Noveck, Dave wrote:
    >
    > > Good summary.
    > >
    > > I want to address the "proxying" issue.
    > >
    > >> [1.1 Proxying]: Operations/work that can only be done out-of-band vs
    > >> alternative access through the NFSv4 server for all operations/work
    > >
    > > If you are talking about operations in the extension (let's call it
    > > NFS-v4.x), that are not in the previous minor version (let's assume
    > > that is nfs-v4.1), then you have a choice of whether these are
    > > supported
    > > for access through the server, or only for access by the client with
    > > the
    > > data server.  Let's call this the issue of proxying in the strict
    > > sense.
    > >
    > > There is another issue that people are calling "proxying" but is really
    > > logically distinct.  That is the issue of access by the previous minor
    > > version, e.g. nfs-v4.0 or nfs-v4.1.  Those versions have no concept of
    > > separate data servers and they need to be able to work.  End of story.
    > > If you can't read files stored in nfs-v4.x with nfs-v4.0, you do not
    > > have a minor version without proxying.  You don't have a minor version
    > > at all.  I believe the working group is never going to accept that.
    > > Even if I'm wrong and you can get the working group to accept that,
    > > it is going to be very contentious and thus take up a lot of time.
    > > Anybody, who really wants to go down this path should seriously
    > > consider
    > > the trade-off between supporting something they find objectionable and
    > > getting a standard a lot later, if at all.
    > >
    > >> On one hand, some suggest that a set of out-of-band clients should not
    > >> have to also have a data path through the NFSv4 metadata server.  One
    > >> reason is that customers may not tolerate the large variability in
    > >> performance between out-of-band (when the going is good) and in-band
    > >> (when the server chooses not to grant or to take away a delegation)
    > >> accesses.
    > >
    > > Then such customers will use clients that access things out-of-band
    > > whenever possible, and servers that never refuse to give out layout
    > > delegations.  You have a number of quality-of-implementations issues
    > > for v4.x clients and servers.  If a particular client only supports
    > > access via v4.0, then performance will suck, and the working group
    > > will understand that, but it won't accept not being able to use
    > > v4.0 at all.  The customer is going to be motivated to upgrade his
    > > clients for those that need high-performance access, but he may be
    > > OK with some clients using v4.0 for a long time, depending on the
    > > particular performance those clients need.  (And some will want v2/v3
    > > access but that is a matter that the working group has no say about).
    > >
    > >> Another reason, and I paraphrase someone else here, is that
    > >> it is possible to construct out-of-band metadata servers that do not
    > >> have access to the data servers except through the clients -- I
    > >> encourage the source of this scenario to replace my paraphrasing with
    > >> a
    > >> correct use case, because I find it odd to design for file servers
    > >> that
    > >> do not have access to the data servers.
    > >
    > > So let's grant that it is possible (and we'll pass over the issue of
    > > whether it is desirable, and in fact so desirable that one is willing
    > > to
    > > not get a standard and or get it much later).
    > >
    > > So we have a metadata server and it, for whatever reason, does not have
    > > access to the data servers.  However, by hypothesis, there are machines
    > > (e.g. clients), that can communicate with both.  So, if one has such an
    > > architecture, then one can take such a machine, give it a
    > > communication path
    > > to the meta-data server and the data server and have the meta-data
    > > server
    > > transfer v4.0 READ requests to it, let it read the data from the data
    > > server and send it back to the meta-data server who send it back to the
    > > original requestor.  Is that a very good solution?  No.  Is it likely
    > > to be performant?  No.  Will it satisfy any particular customer?  I
    > > don't
    > > know and that is the implementer's business decision.  Will it satisfy
    > > the hypothetical customer who doesn't care about v4.0 access?  Clearly.
    > > Will it satisfy the v4 working group?  Yes, because they are not in the
    > > business of telling you how performant v4.0 access has got to be.
    > >
    > >> On the other hand, others have suggested that any access or work that
    > >> a
    > >> client can do out-of-band should be possible with one or more commands
    > >> applied to the metadata server's data path.  This has been proposed
    > >> for
    > >> coping with recalled delegations, including concurrent writing by
    > >> multiple clients; retry after client access errors, provided adequate
    > >> idempotency of out-of-band operations; and many alternative
    > >> implementations of out-of-band clients, including legacy clients that
    > >> use out-of-band never or rarely.
    > >
    > > This effort is going to take a while, but if we manage it correctly, it
    > > is not going to take so long that v3 clients are going to be rare
    > > things,
    > > and they have to be supported.  But v3 clients are not an issue for the
    > > working group.  V4.0 clients are and they will be around and you will
    > > have to support them, and I believe the working group is not going to
    > > be disposed to cut you a lot of slack on this issue (and I don't see
    > > why it should).
    > >
    > >> I think this is a topic that should be argued one way or the other in
    > >> the requirements document.  Use cases and examples in other systems
    > >> would be best.
    > >
    > > I think the requirement should be that this work should be done as a
    > > set of extensions to nfs-v4 delivered as a v4 minor version.  If there
    > > is some feature/requirement that conflicts with that model (and it is a
    > > pretty flexible one), then you have to think long and hard before
    > > deciding
    > > that that requirement is more important than this basic deivery
    > > vehicle,
    > > because it seems to me that it is, in almost all respects, the ideal
    > > way
    > > to make this sort of technology available for widespread use.
    > >
    > >
    > >
    > >
    > >
    > >
    > > To unsubscribe from this group, send an email to:
    > > pnfs-ops-unsubscribe@yahoogroups.com
    > >
    > >
    > >
    > > Yahoo! Groups Links
    > >
    > > To visit your group on the web, go to:
    > >  http://groups.yahoo.com/group/pnfs-ops/
    > >
    > > To unsubscribe from this group, send an email to:
    > >  pnfs-ops-unsubscribe@yahoogroups.com
    > >
    > > Your use of Yahoo! Groups is subject to:
    > >  http://docs.yahoo.com/info/terms/
    > >
    >
    >
    > To unsubscribe from this group, send an email to:
    > pnfs-reqs-unsubscribe@yahoogroups.com
    >
    >  
    >
    > Yahoo! Groups Links
    >
    > To visit your group on the web, go to:
    >  http://groups.yahoo.com/group/pnfs-reqs/
    >
    > To unsubscribe from this group, send an email to:
    >  pnfs-reqs-unsubscribe@yahoogroups.com
    >
    > Your use of Yahoo! Groups is subject to:
    >  http://docs.yahoo.com/info/terms/
    >
    >


    Yahoo! Groups Links

        * To visit your group on the web, go to:
          http://groups.yahoo.com/group/pnfs-reqs/
           
        * To unsubscribe from this group, send an email to:
          pnfs-reqs-unsubscribe@yahoogroups.com
           
        * Your use of Yahoo! Groups is subject to the Yahoo! Terms of Service. 

From dnoveck@netapp.com Mon Jan 19 13:16:06 2004
Return-Path: <Dave.Noveck@netapp.com>
X-Sender: Dave.Noveck@netapp.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 45217 invoked from network); 19 Jan 2004 21:16:04 -0000
Received: from unknown (66.218.66.216)
by m13.grp.scd.yahoo.com with QMQP; 19 Jan 2004 21:16:04 -0000
Received: from unknown (HELO mx01.netapp.com) (198.95.226.53)
by mta1.grp.scd.yahoo.com with SMTP; 19 Jan 2004 21:16:04 -0000
Received: from hawk.corp.netapp.com (hawk [10.57.156.122])
by mx01.netapp.com (8.12.10/8.12.10/NTAP-1.4) with ESMTP id i0JLFdKw019639
for <pnfs-reqs@yahoogroups.com>; Mon, 19 Jan 2004 13:15:39 -0800 (PST)
Received: from svlexc01.hq.netapp.com (svlexc01.corp.netapp.com [10.10.22.171])
by hawk.corp.netapp.com (8.12.9/8.12.9/NTAP-1.5) with ESMTP id i0JLFcSR029819
for <pnfs-reqs@yahoogroups.com>; Mon, 19 Jan 2004 13:15:38 -0800 (PST)
content-class: urn:content-classes:message
MIME-Version: 1.0
Content-Type: multipart/alternative;
boundary="----_=_NextPart_001_01C3DED1.62082C70"
X-MimeOLE: Produced By Microsoft Exchange V6.0.6249.0
Date: Mon, 19 Jan 2004 13:15:37 -0800
Message-ID: <C8CF60CFC4D8A74E9945E32CF096548A6D3665@silver.nane.netapp.com>
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
Thread-Topic: [pnfs-reqs] Re: [pnfs-ops] pNFS Discussion Summary 1: 12/18/03: subtopic: proxying
Thread-Index: AcPeaQdFnqO5JJSFTaWLR7tAnrV/QgAXHQqQAAKPEdA=
To: <pnfs-reqs@yahoogroups.com>
X-eGroups-Remote-IP: 198.95.226.53
X-eGroups-From: "Noveck, Dave" <Dave.Noveck@netapp.com>
From: "Noveck, Dave" <dnoveck@netapp.com>
Subject: RE: [pnfs-reqs] Re: [pnfs-ops] pNFS Discussion Summary 1: 12/18/03: subtopic: proxying
X-Yahoo-Group-Post: member; u=44152831
X-Yahoo-Profile: davidnoveck

That font is quite annoying.  Interesting fact: when I was replying to one of Julian's recent messages, I copied and pasted some of this message, anticipating that I would have to chnge it to a reasonable-size font, but what happened when I pasted it was that the part where Julian had quoted me, which was in the microscopic font, when pasted it was in what looked like a nrmal sized courier font, while the stuff that Julian had written himself was still in the microscpic font.  The other thing was that Outlook when you put the cursor somewhere, normally changes the font indication up top so you can find out what font something is, but not here.  It always said Courier 10pt, even in Julian's text which was clearly smaller than that.  And now the wierdest part!!  If I put the cursor in a paragraph that I originally wrote (and Julian incorporated with >'s and looks like it is a reasonable size), as soon as I type a single character, the
whole paragraph instantly switches to Julian's microscopic font! and no it doesn't go back when I delete that character, but if cut and past that paragraph it does go back to a reasonable size.

    -----Original Message-----
    From: Corbett, Peter
    Sent: Monday, January 19, 2004 2:52 PM
    To: pnfs-reqs@yahoogroups.com
    Subject: RE: [pnfs-reqs] Re: [pnfs-ops] pNFS Discussion Summary 1: 12/18/03: subtopic: proxying

    Julian,
    For some reason, your messages always come to me in a microscopic font that I'm finding harder and harder to read.  I don't know if this is the case for all recipients, or it is peculiar to my client.   I am using Outlook.  I've never seen it on mail from anybody else.
    Peter

        -----Original Message-----
        From: Julian Satran [mailto:julian_satran@il.ibm.com]
        Sent: Monday, January 19, 2004 3:48 AM
        To: pnfs-reqs@yahoogroups.com
        Cc: pNFS Operations; pNFS Requirements
        Subject: Re: [pnfs-reqs] Re: [pnfs-ops] pNFS Discussion Summary 1: 12/18/03: subtopic: proxying



        Garth Gibson <garth@panasas.com> wrote on 19/12/2003 00:37:50:

        > Thanks Dave.  I agree.  Lets refine the proxying issues: Legacy,
        > strict, functional and recovery proxying.
        >
        > [1.1.0 Legacy proxying]: an NFS-v4.x server must be able to execute the
        > full NFS-v4.0 or NFS-v4.1 protocol.
        >
        > I think Dave has given the case for this strongly.  I do not see any
        > case against this.
        >
        > -------------------------------------------
        >
        > [1.1.1 Strict proxying]: does an NFS-v4.x server have to be able to
        > execute exactly the wire packet that an NFS-v4.x client might have sent
        > to a SBC/OSD/NFS data server?
        >
        > This captures the notion that a metadata server must also be a
        > store-and-forward proxy for every data server it manages.  It requires
        > NFS-v4.x servers implement SCSI SBC over FC, if their data servers
        > implement it; and the same for objects and files.
        >
        > This only makes sense to me for NFS data servers.  And it is not what I
        > intended in my prior summary, although it is a relevant question.  I
        > would say that pNFS requirements not require Strict Proxying.
        >

        Agree

        > -------------------------------------------
        >
        > [1.1.2 Functional proxying]: a file transformation achievable by an
        > NFS-v4.x client using a set of data server operations must be a
        > equivalently achievable using a (probably different) set of NFS-v4.x
        > server operations
        >
        > This is the topic I intended to address in the last email.  I believe
        > Dave is arguing that even with metadata servers that do not have access
        > to their data servers, the vendor of such a metadata server can
        > construct a proprietary protocol for the metadata server to (strict)
        > proxy data server accesses through clients that do have data server
        > access.  I am not comfortable making up a counter to this, so I exhort
        > those that want a metadata server without data server access to speak
        > up if they disagree.
        >
        > > On one hand, some suggest that a set of out-of-band clients should not
        > > have to also have a data path through the NFSv4 metadata server.  One
        > > reason is that customers may not tolerate the large variability in
        > > performance between out-of-band (when the going is good) and in-band
        > > (when the server chooses not to grant or to take away a delegation)
        > > accesses.  Another reason, and I paraphrase someone else here, is that
        > > it is possible to construct out-of-band metadata servers that do not
        > > have access to the data servers except through the clients -- I
        > > encourage the source of this scenario to replace my paraphrasing with
        > > a correct use case, because I find it odd to design for file servers
        > > that do not have access to the data servers.
        > >
        > > On the other hand, others have suggested that any access or work that
        > > a client can do out-of-band should be possible with one or more
        > > commands applied to the metadata server's data path.  This has been
        > > proposed for coping with recalled delegations, including concurrent
        > > writing by multiple clients; retry after client access errors,
        > > provided adequate idempotency of out-of-band operations; and many
        > > alternative implementations of out-of-band clients, including legacy
        > > clients that use out-of-band never or rarely.
        > >
        > > I think this is a topic that should be argued one way or the other in
        > > the requirements document.  Use cases and examples in other systems
        > > would be best.
        >

        I guess that proxying through a client should be recomended but not mandated.
        We might the want to find how to do it while respecting restrictions removed the metadata server from the path.

        > -------------------------------------------
        >
        > [1.1.3 Recovery proxying]: a file transformation begun by an NFS-v4.x
        > client using a set of data server operations, but interrupted before
        > completion, must be equivalently completable using a (probably
        > different) set of NFS-v4.x server operations
        >
        > Some have suggested that having this property will greatly simplify the
        > amount of spec that is devoted to out-of-band error recovery.  Others
        > have commented that a simple way to achieve this would be to require
        > that all operations on data servers should be idempotent.
        >
        > -------------------------------------------
        >
        > garth
        >
        >
        > On Thursday, December 18, 2003, at 12:21  PM, Noveck, Dave wrote:
        >
        > > Good summary.
        > >
        > > I want to address the "proxying" issue.
        > >
        > >> [1.1 Proxying]: Operations/work that can only be done out-of-band vs
        > >> alternative access through the NFSv4 server for all operations/work
        > >
        > > If you are talking about operations in the extension (let's call it
        > > NFS-v4.x), that are not in the previous minor version (let's assume
        > > that is nfs-v4.1), then you have a choice of whether these are
        > > supported
        > > for access through the server, or only for access by the client with
        > > the
        > > data server.  Let's call this the issue of proxying in the strict
        > > sense.
        > >
        > > There is another issue that people are calling "proxying" but is really
        > > logically distinct.  That is the issue of access by the previous minor
        > > version, e.g. nfs-v4.0 or nfs-v4.1.  Those versions have no concept of
        > > separate data servers and they need to be able to work.  End of story.
        > > If you can't read files stored in nfs-v4.x with nfs-v4.0, you do not
        > > have a minor version without proxying.  You don't have a minor version
        > > at all.  I believe the working group is never going to accept that.
        > > Even if I'm wrong and you can get the working group to accept that,
        > > it is going to be very contentious and thus take up a lot of time.
        > > Anybody, who really wants to go down this path should seriously
        > > consider
        > > the trade-off between supporting something they find objectionable and
        > > getting a standard a lot later, if at all.
        > >
        > >> On one hand, some suggest that a set of out-of-band clients should not
        > >> have to also have a data path through the NFSv4 metadata server.  One
        > >> reason is that customers may not tolerate the large variability in
        > >> performance between out-of-band (when the going is good) and in-band
        > >> (when the server chooses not to grant or to take away a delegation)
        > >> accesses.
        > >
        > > Then such customers will use clients that access things out-of-band
        > > whenever possible, and servers that never refuse to give out layout
        > > delegations.  You have a number of quality-of-implementations issues
        > > for v4.x clients and servers.  If a particular client only supports
        > > access via v4.0, then performance will suck, and the working group
        > > will understand that, but it won't accept not being able to use
        > > v4.0 at all.  The customer is going to be motivated to upgrade his
        > > clients for those that need high-performance access, but he may be
        > > OK with some clients using v4.0 for a long time, depending on the
        > > particular performance those clients need.  (And some will want v2/v3
        > > access but that is a matter that the working group has no say about).
        > >
        > >> Another reason, and I paraphrase someone else here, is that
        > >> it is possible to construct out-of-band metadata servers that do not
        > >> have access to the data servers except through the clients -- I
        > >> encourage the source of this scenario to replace my paraphrasing with
        > >> a
        > >> correct use case, because I find it odd to design for file servers
        > >> that
        > >> do not have access to the data servers.
        > >
        > > So let's grant that it is possible (and we'll pass over the issue of
        > > whether it is desirable, and in fact so desirable that one is willing
        > > to
        > > not get a standard and or get it much later).
        > >
        > > So we have a metadata server and it, for whatever reason, does not have
        > > access to the data servers.  However, by hypothesis, there are machines
        > > (e.g. clients), that can communicate with both.  So, if one has such an
        > > architecture, then one can take such a machine, give it a
        > > communication path
        > > to the meta-data server and the data server and have the meta-data
        > > server
        > > transfer v4.0 READ requests to it, let it read the data from the data
        > > server and send it back to the meta-data server who send it back to the
        > > original requestor.  Is that a very good solution?  No.  Is it likely
        > > to be performant?  No.  Will it satisfy any particular customer?  I
        > > don't
        > > know and that is the implementer's business decision.  Will it satisfy
        > > the hypothetical customer who doesn't care about v4.0 access?  Clearly.
        > > Will it satisfy the v4 working group?  Yes, because they are not in the
        > > business of telling you how performant v4.0 access has got to be.
        > >
        > >> On the other hand, others have suggested that any access or work that
        > >> a
        > >> client can do out-of-band should be possible with one or more commands
        > >> applied to the metadata server's data path.  This has been proposed
        > >> for
        > >> coping with recalled delegations, including concurrent writing by
        > >> multiple clients; retry after client access errors, provided adequate
        > >> idempotency of out-of-band operations; and many alternative
        > >> implementations of out-of-band clients, including legacy clients that
        > >> use out-of-band never or rarely.
        > >
        > > This effort is going to take a while, but if we manage it correctly, it
        > > is not going to take so long that v3 clients are going to be rare
        > > things,
        > > and they have to be supported.  But v3 clients are not an issue for the
        > > working group.  V4.0 clients are and they will be around and you will
        > > have to support them, and I believe the working group is not going to
        > > be disposed to cut you a lot of slack on this issue (and I don't see
        > > why it should).
        > >
        > >> I think this is a topic that should be argued one way or the other in
        > >> the requirements document.  Use cases and examples in other systems
        > >> would be best.
        > >
        > > I think the requirement should be that this work should be done as a
        > > set of extensions to nfs-v4 delivered as a v4 minor version.  If there
        > > is some feature/requirement that conflicts with that model (and it is a
        > > pretty flexible one), then you have to think long and hard before
        > > deciding
        > > that that requirement is more important than this basic deivery
        > > vehicle,
        > > because it seems to me that it is, in almost all respects, the ideal
        > > way
        > > to make this sort of technology available for widespread use.
        > >
        > >
        > >
        > >
        > >
        > >
        > > To unsubscribe from this group, send an email to:
        > > pnfs-ops-unsubscribe@yahoogroups.com
        > >
        > >
        > >
        > > Yahoo! Groups Links
        > >
        > > To visit your group on the web, go to:
        > >  http://groups.yahoo.com/group/pnfs-ops/
        > >
        > > To unsubscribe from this group, send an email to:
        > >  pnfs-ops-unsubscribe@yahoogroups.com
        > >
        > > Your use of Yahoo! Groups is subject to:
        > >  http://docs.yahoo.com/info/terms/
        > >
        >
        >
        > To unsubscribe from this group, send an email to:
        > pnfs-reqs-unsubscribe@yahoogroups.com
        >
        >  
        >
        > Yahoo! Groups Links
        >
        > To visit your group on the web, go to:
        >  http://groups.yahoo.com/group/pnfs-reqs/
        >
        > To unsubscribe from this group, send an email to:
        >  pnfs-reqs-unsubscribe@yahoogroups.com
        >
        > Your use of Yahoo! Groups is subject to:
        >  http://docs.yahoo.com/info/terms/
        >
        >


        Yahoo! Groups Links

            * To visit your group on the web, go to:
              http://groups.yahoo.com/group/pnfs-reqs/
               
            * To unsubscribe from this group, send an email to:
              pnfs-reqs-unsubscribe@yahoogroups.com
               
            * Your use of Yahoo! Groups is subject to the Yahoo! Terms of Service. 




    Yahoo! Groups Links

        * To visit your group on the web, go to:
          http://groups.yahoo.com/group/pnfs-reqs/
           
        * To unsubscribe from this group, send an email to:
          pnfs-reqs-unsubscribe@yahoogroups.com
           
        * Your use of Yahoo! Groups is subject to the Yahoo! Terms of Service. 

From julian_satran@il.ibm.com Mon Jan 19 14:11:59 2004
Return-Path: <Julian_Satran@il.ibm.com>
X-Sender: Julian_Satran@il.ibm.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 80152 invoked from network); 19 Jan 2004 22:11:59 -0000
Received: from unknown (66.218.66.167)
by m5.grp.scd.yahoo.com with QMQP; 19 Jan 2004 22:11:59 -0000
Received: from unknown (HELO mtagate3.de.ibm.com) (195.212.29.152)
by mta6.grp.scd.yahoo.com with SMTP; 19 Jan 2004 22:11:58 -0000
Received: from d12relay02.megacenter.de.ibm.com (d12relay02.megacenter.de.ibm.com [9.149.165.196])
by mtagate3.de.ibm.com (8.12.10/8.12.10) with ESMTP id i0JMBuHI127844
for <pnfs-reqs@yahoogroups.com>; Mon, 19 Jan 2004 22:11:56 GMT
Received: from d10ml001.telaviv.ibm.com (d12av02.megacenter.de.ibm.com [9.149.165.228])
by d12relay02.megacenter.de.ibm.com (8.12.10/NCO/VER6.6) with ESMTP id i0JMBtmx249174
for <pnfs-reqs@yahoogroups.com>; Mon, 19 Jan 2004 23:11:56 +0100
In-Reply-To: <C8CF60CFC4D8A74E9945E32CF096548A6D3665@silver.nane.netapp.com>
To: pnfs-reqs@yahoogroups.com
Cc: pnfs-reqs@yahoogroups.com
MIME-Version: 1.0
X-Mailer: Lotus Notes Release 6.5 September 26, 2003
Message-ID: <OFFA3FA500.C4EBB703-ON88256E20.0078D701-88256E20.0079EEC0@il.ibm.com>
Date: Mon, 19 Jan 2004 14:11:52 -0800
X-MIMETrack: Serialize by Router on D10ML001/10/M/IBM(Release 6.0.2CF2|July 23, 2003) at
20/01/2004 00:11:56,
Serialize complete at 20/01/2004 00:11:56
Content-Type: text/plain; charset="US-ASCII"
X-eGroups-Remote-IP: 195.212.29.152
From: Julian Satran <julian_satran@il.ibm.com>
Subject: RE: [pnfs-reqs] Re: [pnfs-ops] pNFS Discussion Summary 1: 12/18/03: subtopic: proxying
X-Yahoo-Group-Post: member; u=64714603
X-Yahoo-Profile: julian_satran

My humble appologies to all.
I've changed my internet mail options to text only (instead of HTML and
text). I hope it works better.
Interestingly enough I use mozilla (thunderbird) for my private mail and
never experienced this behavior.
It must be a Lotus-Note-vs.-Outlook war!

Julo



"Noveck, Dave" <dnoveck@netapp.com>
19/01/2004 13:15
Please respond to
pnfs-reqs


To
<pnfs-reqs@yahoogroups.com>
cc

Subject
RE: [pnfs-reqs] Re: [pnfs-ops] pNFS Discussion Summary 1: 12/18/03:
subtopic: proxying






That font is quite annoying. Interesting fact: when I was replying to one
of Julian's recent messages, I copied and pasted some of this message,
anticipating that I would have to chnge it to a reasonable-size font, but
what happened when I pasted it was that the part where Julian had quoted
me, which was in the microscopic font, when pasted it was in what looked
like a nrmal sized courier font, while the stuff that Julian had written
himself was still in the microscpic font. The other thing was that
Outlook when you put the cursor somewhere, normally changes the font
indication up top so you can find out what font something is, but not
here. It always said Courier 10pt, even in Julian's text which was
clearly smaller than that. And now the wierdest part!! If I put the
cursor in a paragraph that I originally wrote (and Julian incorporated
with >'s and looks like it is a reasonable size), as soon as I type a
single character, the
whole paragraph instantly switches to Julian's microscopic font! and no it
doesn't go back when I delete that character, but if cut and past that
paragraph it does go back to a reasonable size.
-----Original Message-----
From: Corbett, Peter
Sent: Monday, January 19, 2004 2:52 PM
To: pnfs-reqs@yahoogroups.com
Subject: RE: [pnfs-reqs] Re: [pnfs-ops] pNFS Discussion Summary 1:
12/18/03: subtopic: proxying

Julian,
For some reason, your messages always come to me in a microscopic font
that I'm finding harder and harder to read. I don't know if this is the
case for all recipients, or it is peculiar to my client. I am using
Outlook. I've never seen it on mail from anybody else.
Peter
-----Original Message-----
From: Julian Satran [mailto:julian_satran@il.ibm.com]
Sent: Monday, January 19, 2004 3:48 AM
To: pnfs-reqs@yahoogroups.com
Cc: pNFS Operations; pNFS Requirements
Subject: Re: [pnfs-reqs] Re: [pnfs-ops] pNFS Discussion Summary 1:
12/18/03: subtopic: proxying



Garth Gibson <garth@panasas.com> wrote on 19/12/2003 00:37:50:

> Thanks Dave. I agree. Lets refine the proxying issues: Legacy,
> strict, functional and recovery proxying.
>
> [1.1.0 Legacy proxying]: an NFS-v4.x server must be able to execute the
> full NFS-v4.0 or NFS-v4.1 protocol.
>
> I think Dave has given the case for this strongly. I do not see any
> case against this.
>
> -------------------------------------------
>
> [1.1.1 Strict proxying]: does an NFS-v4.x server have to be able to
> execute exactly the wire packet that an NFS-v4.x client might have sent
> to a SBC/OSD/NFS data server?
>
> This captures the notion that a metadata server must also be a
> store-and-forward proxy for every data server it manages. It requires
> NFS-v4.x servers implement SCSI SBC over FC, if their data servers
> implement it; and the same for objects and files.
>
> This only makes sense to me for NFS data servers. And it is not what I
> intended in my prior summary, although it is a relevant question. I
> would say that pNFS requirements not require Strict Proxying.
>

Agree

> -------------------------------------------
>
> [1.1.2 Functional proxying]: a file transformation achievable by an
> NFS-v4.x client using a set of data server operations must be a
> equivalently achievable using a (probably different) set of NFS-v4.x
> server operations
>
> This is the topic I intended to address in the last email. I believe
> Dave is arguing that even with metadata servers that do not have access
> to their data servers, the vendor of such a metadata server can
> construct a proprietary protocol for the metadata server to (strict)
> proxy data server accesses through clients that do have data server
> access. I am not comfortable making up a counter to this, so I exhort
> those that want a metadata server without data server access to speak
> up if they disagree.
>
> > On one hand, some suggest that a set of out-of-band clients should not

> > have to also have a data path through the NFSv4 metadata server. One
> > reason is that customers may not tolerate the large variability in
> > performance between out-of-band (when the going is good) and in-band
> > (when the server chooses not to grant or to take away a delegation)
> > accesses. Another reason, and I paraphrase someone else here, is that

> > it is possible to construct out-of-band metadata servers that do not
> > have access to the data servers except through the clients -- I
> > encourage the source of this scenario to replace my paraphrasing with
> > a correct use case, because I find it odd to design for file servers
> > that do not have access to the data servers.
> >
> > On the other hand, others have suggested that any access or work that
> > a client can do out-of-band should be possible with one or more
> > commands applied to the metadata server's data path. This has been
> > proposed for coping with recalled delegations, including concurrent
> > writing by multiple clients; retry after client access errors,
> > provided adequate idempotency of out-of-band operations; and many
> > alternative implementations of out-of-band clients, including legacy
> > clients that use out-of-band never or rarely.
> >
> > I think this is a topic that should be argued one way or the other in
> > the requirements document. Use cases and examples in other systems
> > would be best.
>

I guess that proxying through a client should be recomended but not
mandated.
We might the want to find how to do it while respecting restrictions
removed the metadata server from the path.

> -------------------------------------------
>
> [1.1.3 Recovery proxying]: a file transformation begun by an NFS-v4.x
> client using a set of data server operations, but interrupted before
> completion, must be equivalently completable using a (probably
> different) set of NFS-v4.x server operations
>
> Some have suggested that having this property will greatly simplify the
> amount of spec that is devoted to out-of-band error recovery. Others
> have commented that a simple way to achieve this would be to require
> that all operations on data servers should be idempotent.
>
> -------------------------------------------
>
> garth
>
>
> On Thursday, December 18, 2003, at 12:21 PM, Noveck, Dave wrote:
>
> > Good summary.
> >
> > I want to address the "proxying" issue.
> >
> >> [1.1 Proxying]: Operations/work that can only be done out-of-band vs
> >> alternative access through the NFSv4 server for all operations/work
> >
> > If you are talking about operations in the extension (let's call it
> > NFS-v4.x), that are not in the previous minor version (let's assume
> > that is nfs-v4.1), then you have a choice of whether these are
> > supported
> > for access through the server, or only for access by the client with
> > the
> > data server. Let's call this the issue of proxying in the strict
> > sense.
> >
> > There is another issue that people are calling "proxying" but is
really
> > logically distinct. That is the issue of access by the previous minor
> > version, e.g. nfs-v4.0 or nfs-v4.1. Those versions have no concept of
> > separate data servers and they need to be able to work. End of story.
> > If you can't read files stored in nfs-v4.x with nfs-v4.0, you do not
> > have a minor version without proxying. You don't have a minor version
> > at all. I believe the working group is never going to accept that.
> > Even if I'm wrong and you can get the working group to accept that,
> > it is going to be very contentious and thus take up a lot of time.
> > Anybody, who really wants to go down this path should seriously
> > consider
> > the trade-off between supporting something they find objectionable and
> > getting a standard a lot later, if at all.
> >
> >> On one hand, some suggest that a set of out-of-band clients should
not
> >> have to also have a data path through the NFSv4 metadata server. One
> >> reason is that customers may not tolerate the large variability in
> >> performance between out-of-band (when the going is good) and in-band
> >> (when the server chooses not to grant or to take away a delegation)
> >> accesses.
> >
> > Then such customers will use clients that access things out-of-band
> > whenever possible, and servers that never refuse to give out layout
> > delegations. You have a number of quality-of-implementations issues
> > for v4.x clients and servers. If a particular client only supports
> > access via v4.0, then performance will suck, and the working group
> > will understand that, but it won't accept not being able to use
> > v4.0 at all. The customer is going to be motivated to upgrade his
> > clients for those that need high-performance access, but he may be
> > OK with some clients using v4.0 for a long time, depending on the
> > particular performance those clients need. (And some will want v2/v3
> > access but that is a matter that the working group has no say about).
> >
> >> Another reason, and I paraphrase someone else here, is that
> >> it is possible to construct out-of-band metadata servers that do not
> >> have access to the data servers except through the clients -- I
> >> encourage the source of this scenario to replace my paraphrasing with

> >> a
> >> correct use case, because I find it odd to design for file servers
> >> that
> >> do not have access to the data servers.
> >
> > So let's grant that it is possible (and we'll pass over the issue of
> > whether it is desirable, and in fact so desirable that one is willing
> > to
> > not get a standard and or get it much later).
> >
> > So we have a metadata server and it, for whatever reason, does not
have
> > access to the data servers. However, by hypothesis, there are
machines
> > (e.g. clients), that can communicate with both. So, if one has such
an
> > architecture, then one can take such a machine, give it a
> > communication path
> > to the meta-data server and the data server and have the meta-data
> > server
> > transfer v4.0 READ requests to it, let it read the data from the data
> > server and send it back to the meta-data server who send it back to
the
> > original requestor. Is that a very good solution? No. Is it likely
> > to be performant? No. Will it satisfy any particular customer? I
> > don't
> > know and that is the implementer's business decision. Will it satisfy
> > the hypothetical customer who doesn't care about v4.0 access? Clearly.
> > Will it satisfy the v4 working group? Yes, because they are not in
the
> > business of telling you how performant v4.0 access has got to be.
> >
> >> On the other hand, others have suggested that any access or work that

> >> a
> >> client can do out-of-band should be possible with one or more
commands
> >> applied to the metadata server's data path. This has been proposed
> >> for
> >> coping with recalled delegations, including concurrent writing by
> >> multiple clients; retry after client access errors, provided adequate
> >> idempotency of out-of-band operations; and many alternative
> >> implementations of out-of-band clients, including legacy clients that
> >> use out-of-band never or rarely.
> >
> > This effort is going to take a while, but if we manage it correctly,
it
> > is not going to take so long that v3 clients are going to be rare
> > things,
> > and they have to be supported. But v3 clients are not an issue for
the
> > working group. V4.0 clients are and they will be around and you will
> > have to support them, and I believe the working group is not going to
> > be disposed to cut you a lot of slack on this issue (and I don't see
> > why it should).
> >
> >> I think this is a topic that should be argued one way or the other in
> >> the requirements document. Use cases and examples in other systems
> >> would be best.
> >
> > I think the requirement should be that this work should be done as a
> > set of extensions to nfs-v4 delivered as a v4 minor version. If there
> > is some feature/requirement that conflicts with that model (and it is
a
> > pretty flexible one), then you have to think long and hard before
> > deciding
> > that that requirement is more important than this basic deivery
> > vehicle,
> > because it seems to me that it is, in almost all respects, the ideal
> > way
> > to make this sort of technology available for widespread use.
> >
> >
> >
> >
> >
> >
> > To unsubscribe from this group, send an email to:
> > pnfs-ops-unsubscribe@yahoogroups.com
> >
> >
> >
> > Yahoo! Groups Links
> >
> > To visit your group on the web, go to:
> > http://groups.yahoo.com/group/pnfs-ops/
> >
> > To unsubscribe from this group, send an email to:
> > pnfs-ops-unsubscribe@yahoogroups.com
> >
> > Your use of Yahoo! Groups is subject to:
> > http://docs.yahoo.com/info/terms/
> >
>
>
> To unsubscribe from this group, send an email to:
> pnfs-reqs-unsubscribe@yahoogroups.com
>
>
>
> Yahoo! Groups Links
>
> To visit your group on the web, go to:
> http://groups.yahoo.com/group/pnfs-reqs/
>
> To unsubscribe from this group, send an email to:
> pnfs-reqs-unsubscribe@yahoogroups.com
>
> Your use of Yahoo! Groups is subject to:
> http://docs.yahoo.com/info/terms/
>
>


Yahoo! Groups Links
To visit your group on the web, go to:
http://groups.yahoo.com/group/pnfs-reqs/

To unsubscribe from this group, send an email to:
pnfs-reqs-unsubscribe@yahoogroups.com

Your use of Yahoo! Groups is subject to the Yahoo! Terms of Service.



Yahoo! Groups Links
To visit your group on the web, go to:
http://groups.yahoo.com/group/pnfs-reqs/

To unsubscribe from this group, send an email to:
pnfs-reqs-unsubscribe@yahoogroups.com

Your use of Yahoo! Groups is subject to the Yahoo! Terms of Service.


Yahoo! Groups Links
To visit your group on the web, go to:
http://groups.yahoo.com/group/pnfs-reqs/

To unsubscribe from this group, send an email to:
pnfs-reqs-unsubscribe@yahoogroups.com

Your use of Yahoo! Groups is subject to the Yahoo! Terms of Service. 

From dhildebz@eecs.umich.edu Mon Jan 19 14:13:10 2004
Return-Path: <dhildebz@eecs.umich.edu>
X-Sender: dhildebz@eecs.umich.edu
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 71211 invoked from network); 19 Jan 2004 22:13:10 -0000
Received: from unknown (66.218.66.216)
by m12.grp.scd.yahoo.com with QMQP; 19 Jan 2004 22:13:10 -0000
Received: from unknown (HELO willow.eecs.umich.edu) (141.213.4.14)
by mta1.grp.scd.yahoo.com with SMTP; 19 Jan 2004 22:13:10 -0000
Received: from willow.eecs.umich.edu (localhost.eecs.umich.edu [127.0.0.1])
by willow.eecs.umich.edu (8.12.10/8.12.9) with ESMTP id i0JMD8vE011664
(version=TLSv1/SSLv3 cipher=DHE-RSA-AES256-SHA bits=256 verify=NO);
Mon, 19 Jan 2004 17:13:09 -0500
Received: from localhost (dhildebz@localhost)
by willow.eecs.umich.edu (8.12.10/8.12.9/Submit) with ESMTP id i0JMD8wL011661;
Mon, 19 Jan 2004 17:13:08 -0500
X-Authentication-Warning: willow.eecs.umich.edu: dhildebz owned process doing -bs
Date: Mon, 19 Jan 2004 17:13:08 -0500 (EST)
To: pNFS Requirements <pnfs-reqs@yahoogroups.com>
Cc: pNFS Operations <pnfs-ops@yahoogroups.com>
In-Reply-To: <OFC7A57BC7.593A949B-ONC2256E1F.0060FAE6-88256E20.00305ED2@il.ibm.com>
Message-ID: <Pine.LNX.4.44.0401191701400.4110-100000@willow.eecs.umich.edu>
MIME-Version: 1.0
Content-Type: TEXT/PLAIN; charset=US-ASCII
X-eGroups-Remote-IP: 141.213.4.14
From: Dean Hildebrand <dhildebz@eecs.umich.edu>
Subject: Re: [pnfs-reqs] Re: [pnfs-ops] pNFS Discussion Summary 1: 12/18/03:
subtopic: proxying
X-Yahoo-Group-Post: member; u=169352062
X-Yahoo-Profile: seattleplus

> > [1.1.2 Functional proxying]: a file transformation achievable by an
> > NFS-v4.x client using a set of data server operations must be a
> > equivalently achievable using a (probably different) set of NFS-v4.x
> > server operations
> >
> > This is the topic I intended to address in the last email. I believe
> > Dave is arguing that even with metadata servers that do not have access
> > to their data servers, the vendor of such a metadata server can
> > construct a proprietary protocol for the metadata server to (strict)
> > proxy data server accesses through clients that do have data server
> > access. I am not comfortable making up a counter to this, so I exhort
> > those that want a metadata server without data server access to speak
> > up if they disagree.
> >
> > > On one hand, some suggest that a set of out-of-band clients should not
>
> > > have to also have a data path through the NFSv4 metadata server. One
> > > reason is that customers may not tolerate the large variability in
> > > performance between out-of-band (when the going is good) and in-band
> > > (when the server chooses not to grant or to take away a delegation)
> > > accesses. Another reason, and I paraphrase someone else here, is that
>
> > > it is possible to construct out-of-band metadata servers that do not
> > > have access to the data servers except through the clients -- I
> > > encourage the source of this scenario to replace my paraphrasing with
> > > a correct use case, because I find it odd to design for file servers
> > > that do not have access to the data servers.
> > >
> > > On the other hand, others have suggested that any access or work that
> > > a client can do out-of-band should be possible with one or more
> > > commands applied to the metadata server's data path. This has been
> > > proposed for coping with recalled delegations, including concurrent
> > > writing by multiple clients; retry after client access errors,
> > > provided adequate idempotency of out-of-band operations; and many
> > > alternative implementations of out-of-band clients, including legacy
> > > clients that use out-of-band never or rarely.
> > >
> > > I think this is a topic that should be argued one way or the other in
> > > the requirements document. Use cases and examples in other systems
> > > would be best.
> >
>
> I guess that proxying through a client should be recomended but not
> mandated.
> We might the want to find how to do it while respecting restrictions
> removed the metadata server from the path.

I think relying on clients to do anything correctly is against the
inherent nature of NFS. Clients in NFS are transient and cannot be
trusted to do anything correctly. Therefore, the metadata server should
find its own way to write data to the data servers without relying on
clients. If proxying through a client is optional, it still seems
orthogonal to the behavior of existing installations and the spirit of
NFS. Maybe there is a valid use case someone could describe?

Dean

From dnoveck@netapp.com Mon Jan 19 15:23:15 2004
Return-Path: <Dave.Noveck@netapp.com>
X-Sender: Dave.Noveck@netapp.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 7197 invoked from network); 19 Jan 2004 23:23:13 -0000
Received: from unknown (66.218.66.216)
by m20.grp.scd.yahoo.com with QMQP; 19 Jan 2004 23:23:13 -0000
Received: from unknown (HELO mx01.netapp.com) (198.95.226.53)
by mta1.grp.scd.yahoo.com with SMTP; 19 Jan 2004 23:23:13 -0000
Received: from hawk.corp.netapp.com (hawk [10.57.156.122])
by mx01.netapp.com (8.12.10/8.12.10/NTAP-1.4) with ESMTP id i0JNNDKw008007;
Mon, 19 Jan 2004 15:23:13 -0800 (PST)
Received: from svlexc01.hq.netapp.com (svlexc01.corp.netapp.com [10.10.22.171])
by hawk.corp.netapp.com (8.12.9/8.12.9/NTAP-1.5) with ESMTP id i0JNN8ST005106;
Mon, 19 Jan 2004 15:23:12 -0800 (PST)
content-class: urn:content-classes:message
MIME-Version: 1.0
Content-Type: text/plain;
charset="iso-8859-1"
Content-Transfer-Encoding: quoted-printable
X-MimeOLE: Produced By Microsoft Exchange V6.0.6249.0
Date: Mon, 19 Jan 2004 15:23:08 -0800
Message-ID: <C8CF60CFC4D8A74E9945E32CF096548A6D3666@silver.nane.netapp.com>
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
Thread-Topic: [pnfs-reqs] Re: [pnfs-ops] pNFS Discussion Summary 1: 12/18/03: subtopic: proxying
Thread-Index: AcPe2XB/4kBYFENqR3OWwRD2koQeGAAByR0A
To: <pnfs-reqs@yahoogroups.com>
Cc: "pNFS Operations" <pnfs-ops@yahoogroups.com>
X-eGroups-Remote-IP: 198.95.226.53
X-eGroups-From: "Noveck, Dave" <Dave.Noveck@netapp.com>
From: "Noveck, Dave" <dnoveck@netapp.com>
Subject: RE: [pnfs-reqs] Re: [pnfs-ops] pNFS Discussion Summary 1: 12/18/03: subtopic: proxying
X-Yahoo-Group-Post: member; u=44152831
X-Yahoo-Profile: davidnoveck

Dean Hildebrand wrote:
> I think relying on clients to do anything correctly is against the
> inherent nature of NFS. Clients in NFS are transient and cannot be
> trusted to do anything correctly. Therefore, the metadata server should
> find its own way to write data to the data servers without relying on
> clients. If proxying through a client is optional, it still seems
> orthogonal to the behavior of existing installations and the spirit of
> NFS. Maybe there is a valid use case someone could describe?

I'm now totally confused. Before we talk about use cases for "proxying
through a client", I'd like to understand what it is.

My understanding is that when this discussion started, a number of people
were referring to a client writing data by sending a write to the meta-
data server (aka the NFS server) as "proxying", because, if your view is
that the proper/best/ideal way of doing data transfer operations is to
obtain mapping information and then do a write to the data server (i.e.
other NFS server or object data server or SAN-connected disk), then
the direct NFS write can be seen as the meta-data server acting as the
client's proxy. Is my understanding correct?

No matter how you come down on the quesion of the desirability of that,
I don't think there any way to argue that doing a write by sending an
NFS write request to an NFS server is against the inherent nature of
NFS. Nor does it ask the client do anything correctly that it hasn't
been doing all along.

At some point the phrase "proxying through the client" was used and I
realize I don't know what is meant by it. It doesn't seem to match
the "proxying" that was being discussed originally. How would the
client be a proxy for (presumably) the server? What am I missing?

-----Original Message-----
From: Dean Hildebrand [mailto:dhildebz@eecs.umich.edu]
Sent: Monday, January 19, 2004 5:13 PM
To: pNFS Requirements
Cc: pNFS Operations
Subject: Re: [pnfs-reqs] Re: [pnfs-ops] pNFS Discussion Summary 1:
12/18/03: subtopic: proxying


> > [1.1.2 Functional proxying]: a file transformation achievable by an
> > NFS-v4.x client using a set of data server operations must be a
> > equivalently achievable using a (probably different) set of NFS-v4.x
> > server operations
> >
> > This is the topic I intended to address in the last email. I believe
> > Dave is arguing that even with metadata servers that do not have access
> > to their data servers, the vendor of such a metadata server can
> > construct a proprietary protocol for the metadata server to (strict)
> > proxy data server accesses through clients that do have data server
> > access. I am not comfortable making up a counter to this, so I exhort
> > those that want a metadata server without data server access to speak
> > up if they disagree.
> >
> > > On one hand, some suggest that a set of out-of-band clients should not
>
> > > have to also have a data path through the NFSv4 metadata server. One
> > > reason is that customers may not tolerate the large variability in
> > > performance between out-of-band (when the going is good) and in-band
> > > (when the server chooses not to grant or to take away a delegation)
> > > accesses. Another reason, and I paraphrase someone else here, is that
>
> > > it is possible to construct out-of-band metadata servers that do not
> > > have access to the data servers except through the clients -- I
> > > encourage the source of this scenario to replace my paraphrasing with
> > > a correct use case, because I find it odd to design for file servers
> > > that do not have access to the data servers.
> > >
> > > On the other hand, others have suggested that any access or work that
> > > a client can do out-of-band should be possible with one or more
> > > commands applied to the metadata server's data path. This has been
> > > proposed for coping with recalled delegations, including concurrent
> > > writing by multiple clients; retry after client access errors,
> > > provided adequate idempotency of out-of-band operations; and many
> > > alternative implementations of out-of-band clients, including legacy
> > > clients that use out-of-band never or rarely.
> > >
> > > I think this is a topic that should be argued one way or the other in
> > > the requirements document. Use cases and examples in other systems
> > > would be best.
> >
>
> I guess that proxying through a client should be recomended but not
> mandated.
> We might the want to find how to do it while respecting restrictions
> removed the metadata server from the path.

I think relying on clients to do anything correctly is against the
inherent nature of NFS. Clients in NFS are transient and cannot be
trusted to do anything correctly. Therefore, the metadata server should
find its own way to write data to the data servers without relying on
clients. If proxying through a client is optional, it still seems
orthogonal to the behavior of existing installations and the spirit of
NFS. Maybe there is a valid use case someone could describe?

Dean





Yahoo! Groups Links

To visit your group on the web, go to:
http://groups.yahoo.com/group/pnfs-reqs/

To unsubscribe from this group, send an email to:
pnfs-reqs-unsubscribe@yahoogroups.com

Your use of Yahoo! Groups is subject to:
http://docs.yahoo.com/info/terms/


From bhalevy@panasas.com Mon Jan 19 15:31:01 2004
Return-Path: <bhalevy@panasas.com>
X-Sender: bhalevy@panasas.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 48321 invoked from network); 19 Jan 2004 23:31:01 -0000
Received: from unknown (66.218.66.166)
by m5.grp.scd.yahoo.com with QMQP; 19 Jan 2004 23:31:01 -0000
Received: from unknown (HELO PIKES.panasas.com) (65.194.124.178)
by mta5.grp.scd.yahoo.com with SMTP; 19 Jan 2004 23:31:00 -0000
Received: by PIKES.panasas.com with Internet Mail Service (5.5.2653.19)
id <SVSYKLVJ>; Mon, 19 Jan 2004 18:30:58 -0500
Message-ID: <30489F1321F5C343ACF6872B2CF7942A05D3879D@PIKES.panasas.com>
To: "'pnfs-reqs@yahoogroups.com'" <pnfs-reqs@yahoogroups.com>
Cc: pNFS Operations <pnfs-ops@yahoogroups.com>
Date: Mon, 19 Jan 2004 18:30:56 -0500
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2653.19)
Content-Type: text/plain;
charset="iso-8859-1"
X-eGroups-Remote-IP: 65.194.124.178
From: "Halevy, Benny" <bhalevy@panasas.com>
Subject: RE: [pnfs-reqs] Re: [pnfs-ops] pNFS Discussion Summary 1: 12/18/0
3: subtopic: proxying
X-Yahoo-Group-Post: member; u=169276676
X-Yahoo-Profile: benny_halevy

Dave Noveck wrote:
>At some point the phrase "proxying through the client" was used and I
>realize I don't know what is meant by it. It doesn't seem to match
>the "proxying" that was being discussed originally. How would the
>client be a proxy for (presumably) the server? What am I missing?

I think it was you who suggested (maybe in a rhetorical way) that
when the metadata server is not capable of accessing the storage
it manages it should still be able to perform I/O using a client.
Maybe this created the "proxying through the client" idea...

Benny

>-----Original Message-----
>From: Noveck, Dave [mailto:dnoveck@netapp.com]
>Sent: Monday, January 19, 2004 6:23 PM
>To: pnfs-reqs@yahoogroups.com
>Cc: pNFS Operations
>Subject: RE: [pnfs-reqs] Re: [pnfs-ops] pNFS Discussion Summary 1:
>12/18/03: subtopic: proxying
>
>
>Dean Hildebrand wrote:
>> I think relying on clients to do anything correctly is against the
>> inherent nature of NFS. Clients in NFS are transient and cannot be
>> trusted to do anything correctly. Therefore, the metadata
>server should
>> find its own way to write data to the data servers without relying on
>> clients. If proxying through a client is optional, it still seems
>> orthogonal to the behavior of existing installations and the
>spirit of
>> NFS. Maybe there is a valid use case someone could describe?
>
>I'm now totally confused. Before we talk about use cases for "proxying
>through a client", I'd like to understand what it is.
>
>My understanding is that when this discussion started, a
>number of people
>were referring to a client writing data by sending a write to the meta-
>data server (aka the NFS server) as "proxying", because, if
>your view is
>that the proper/best/ideal way of doing data transfer operations is to
>obtain mapping information and then do a write to the data server (i.e.
>other NFS server or object data server or SAN-connected disk), then
>the direct NFS write can be seen as the meta-data server acting as the
>client's proxy. Is my understanding correct?
>
>No matter how you come down on the quesion of the desirability of that,
>I don't think there any way to argue that doing a write by sending an
>NFS write request to an NFS server is against the inherent nature of
>NFS. Nor does it ask the client do anything correctly that it hasn't
>been doing all along.
>
>At some point the phrase "proxying through the client" was used and I
>realize I don't know what is meant by it. It doesn't seem to match
>the "proxying" that was being discussed originally. How would the
>client be a proxy for (presumably) the server? What am I missing?
>
>-----Original Message-----
>From: Dean Hildebrand [mailto:dhildebz@eecs.umich.edu]
>Sent: Monday, January 19, 2004 5:13 PM
>To: pNFS Requirements
>Cc: pNFS Operations
>Subject: Re: [pnfs-reqs] Re: [pnfs-ops] pNFS Discussion Summary 1:
>12/18/03: subtopic: proxying
>
>
>> > [1.1.2 Functional proxying]: a file transformation
>achievable by an
>> > NFS-v4.x client using a set of data server operations must be a
>> > equivalently achievable using a (probably different) set
>of NFS-v4.x
>> > server operations
>> >
>> > This is the topic I intended to address in the last email.
> I believe
>> > Dave is arguing that even with metadata servers that do
>not have access
>> > to their data servers, the vendor of such a metadata server can
>> > construct a proprietary protocol for the metadata server
>to (strict)
>> > proxy data server accesses through clients that do have
>data server
>> > access. I am not comfortable making up a counter to this,
>so I exhort
>> > those that want a metadata server without data server
>access to speak
>> > up if they disagree.
>> >
>> > > On one hand, some suggest that a set of out-of-band
>clients should not
>>
>> > > have to also have a data path through the NFSv4 metadata
>server. One
>> > > reason is that customers may not tolerate the large
>variability in
>> > > performance between out-of-band (when the going is good)
>and in-band
>> > > (when the server chooses not to grant or to take away a
>delegation)
>> > > accesses. Another reason, and I paraphrase someone else
>here, is that
>>
>> > > it is possible to construct out-of-band metadata servers
>that do not
>> > > have access to the data servers except through the clients -- I
>> > > encourage the source of this scenario to replace my
>paraphrasing with
>> > > a correct use case, because I find it odd to design for
>file servers
>> > > that do not have access to the data servers.
>> > >
>> > > On the other hand, others have suggested that any access
>or work that
>> > > a client can do out-of-band should be possible with one or more
>> > > commands applied to the metadata server's data path.
>This has been
>> > > proposed for coping with recalled delegations, including
>concurrent
>> > > writing by multiple clients; retry after client access errors,
>> > > provided adequate idempotency of out-of-band operations;
>and many
>> > > alternative implementations of out-of-band clients,
>including legacy
>> > > clients that use out-of-band never or rarely.
>> > >
>> > > I think this is a topic that should be argued one way or
>the other in
>> > > the requirements document. Use cases and examples in
>other systems
>> > > would be best.
>> >
>>
>> I guess that proxying through a client should be recomended but not
>> mandated.
>> We might the want to find how to do it while respecting restrictions
>> removed the metadata server from the path.
>
>I think relying on clients to do anything correctly is against the
>inherent nature of NFS. Clients in NFS are transient and cannot be
>trusted to do anything correctly. Therefore, the metadata
>server should
>find its own way to write data to the data servers without relying on
>clients. If proxying through a client is optional, it still seems
>orthogonal to the behavior of existing installations and the spirit of
>NFS. Maybe there is a valid use case someone could describe?
>
>Dean
>
>
>
>
>
>Yahoo! Groups Links
>
>To visit your group on the web, go to:
> http://groups.yahoo.com/group/pnfs-reqs/
>
>To unsubscribe from this group, send an email to:
> pnfs-reqs-unsubscribe@yahoogroups.com
>
>Your use of Yahoo! Groups is subject to:
> http://docs.yahoo.com/info/terms/
>
>
>
>
>
>------------------------ Yahoo! Groups Sponsor
>---------------------~-->
>Upgrade to 128-bit SSL Security!
>http://us.click.yahoo.com/qZ0LdD/yjVHAA/TtwFAA/W6uqlB/TM
>---------------------------------------------------------------
>------~->
>
>Yahoo! Groups Links
>
>To visit your group on the web, go to:
> http://groups.yahoo.com/group/pnfs-reqs/
>
>To unsubscribe from this group, send an email to:
> pnfs-reqs-unsubscribe@yahoogroups.com
>
>Your use of Yahoo! Groups is subject to:
> http://docs.yahoo.com/info/terms/
>
>

From dnoveck@netapp.com Mon Jan 19 15:32:39 2004
Return-Path: <Dave.Noveck@netapp.com>
X-Sender: Dave.Noveck@netapp.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 75759 invoked from network); 19 Jan 2004 23:32:37 -0000
Received: from unknown (66.218.66.167)
by m1.grp.scd.yahoo.com with QMQP; 19 Jan 2004 23:32:37 -0000
Received: from unknown (HELO mx01.netapp.com) (198.95.226.53)
by mta6.grp.scd.yahoo.com with SMTP; 19 Jan 2004 23:32:37 -0000
Received: from hawk.corp.netapp.com (hawk [10.57.156.122])
by mx01.netapp.com (8.12.10/8.12.10/NTAP-1.4) with ESMTP id i0JNWaKw009351;
Mon, 19 Jan 2004 15:32:37 -0800 (PST)
Received: from svlexc01.hq.netapp.com (svlexc01.corp.netapp.com [10.10.22.171])
by hawk.corp.netapp.com (8.12.9/8.12.9/NTAP-1.5) with ESMTP id i0JNWaSR007663;
Mon, 19 Jan 2004 15:32:36 -0800 (PST)
content-class: urn:content-classes:message
MIME-Version: 1.0
Content-Type: multipart/alternative;
boundary="----_=_NextPart_001_01C3DEE4.8419B3E0"
X-MimeOLE: Produced By Microsoft Exchange V6.0.6249.0
Date: Mon, 19 Jan 2004 15:32:34 -0800
Message-ID: <C8CF60CFC4D8A74E9945E32CF096548AB80B06@silver.nane.netapp.com>
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
Thread-Topic: [pnfs-reqs] RE: [pnfs-ops] pNFS Discussion Summary 1: 12/18/03
Thread-Index: AcPeaTkQQwHlD+iITeOyPo50R1swWAAZbAXA
To: <pnfs-ops@yahoogroups.com>, <pnfs-reqs@yahoogroups.com>
X-eGroups-Remote-IP: 198.95.226.53
X-eGroups-From: "Noveck, Dave" <Dave.Noveck@netapp.com>
From: "Noveck, Dave" <dnoveck@netapp.com>
Subject: RE: [pnfs-reqs] RE: [pnfs-ops] pNFS Discussion Summary 1: 12/18/03
X-Yahoo-Group-Post: member; u=44152831
X-Yahoo-Profile: davidnoveck

ADVERTISEMENT
Julian Satran wrote:
> "Noveck, Dave" <dnoveck@netapp.com> wrote on 30/12/2003 22:08:02:

> > It seems legal to me but I'm guessing that there are others that would
> > think differently.
>   
> > I tend to think that it is not a good idea, though.  There are going 
> > to be operations which, by their nature, are better done through the 
> > metadata server.  A two-byte write which span s multiple data servers 
> > is an example.  Another is append-writes, which have been mentioned 
> > (by whom I don't remember just now) as a desirable v4 extension, 
> > assuming the data to be written is of reasonable size.  In each case, 
> > we may create appropriate caching/locking primitives to allow the 
> > operation to be done without making any request of the metadata server
> > that is officially denominated an "IO" request.  But can you really 
> > argue that this will be the best way for the client to do such 
> > operations?  And does it really make sense to force clients to invest 
> > the effort in terms of the code do such operations doing the IO with 
> > the data server only, when the performance benefit of that is going to
> > be small, or zero, or negative?  You may wind up making as many 
> > requests of the meta-data server with the data-server-only approach.  
> > It's just that they won't be IO operations (but instead locking and, 
> > in the case of append, getattr operations).
 
> This can be argued both ways.  For applications that share little and
> build files by append (all transaction loggers) doing them on the client
> is a distinct advantage.
 
If you are talking about a situation in which there is little sharing
then the clients are going to have exclusive delegations for the files
that they are appending to, in which case the append write feature is
not really being used.  The client is best advised to simply gather up
its writes and write the whole file at once (or the part that it wrote 
until its delegation is recalled).  But in either of those cases, it
knows the eof of the file and can write to a specific offset and thus is
not depending on the fact that you have a write-append feature. 
 
Clients that are doing those writes in this situation may indeed wind up
being more efficient writing to the data server, but in other situations, 
where there is sharing, things will be different.  
 
This can be argued any number of ways depending on details of the implementation,
and applications.  The original concept here was that someone can decide in
advance that using the data server is better than using the metadata server to the
point that use of the metadata server to do IO can be *prohibited*.  The fact that
this is a complicated issue seems to me to argue strongly against that sort of
approach.  
 
> And so iit is for object storage that supports append.
 
This will depend on the details of the protocol as it evolves.  I had expected
that EOF would be something that is managed by the metadata server.  As you point
out, with object storage, it can be managed by the data server (and you could probably
do the same with the parallel-file option).  This gets to an important issue for
this effort that we will be coming back to again and again: how much advantage
to take of specific features of some data storage methods that are not shared by
all.  I don't see any general principle that is going to work for this all the time.
We are going to have to decide on a case-by-case basis.
 

    -----Original Message-----
    From: Julian Satran [mailto:julian_satran@il.ibm.com]
    Sent: Monday, January 19, 2004 3:48 AM
    To: pnfs-reqs@yahoogroups.com
    Cc: pnfs-ops@yahoogroups.com; pnfs-reqs@yahoogroups.com
    Subject: Re: [pnfs-reqs] RE: [pnfs-ops] pNFS Discussion Summary 1: 12/18/03



    "Noveck, Dave" <dnoveck@netapp.com> wrote on 30/12/2003 22:08:02:

    > It seems legal to me but I'm guessing that there are others that would
    > think differently.
    >  
    > I tend to think that it is not a good idea, though.  There are going
    > to be operations which, by their nature, are better done through the
    > metadata server.  A two-byte write which spans multiple data servers
    > is an example.  Another is append-writes, which have been mentioned
    > (by whom I don't remember just now) as a desirable v4 extension,
    > assuming the data to be written is of reasonable size.  In each case,
    > we may create appropriate caching/locking primitives to allow the
    > operation to be done without making any request of the metadata server
    > that is officially denominated an "IO" request.  But can you really
    > argue that this will be the best way for the client to do such
    > operations?  And does it really make sense to force clients to invest
    > the effort in terms of the code do such operations doing the IO with
    > the data server only, when the performance benefit of that is going to
    > be small, or zero, or negative?  You may wind up making as many
    > requests of the meta-data server with the data-server-only approach.  
    > It's just that they won't be IO operations (but instead locking and,
    > in the case of append, getattr operations).
    >

    This can be argued both ways. For applications that share little and build files by append (all transaction loggers) doing them on the client is a distinct advantage. And so iit is for object storage that supports append.

    > In complicated protocols (and v4 is a complicated protocol and is
    > getting more complicated), there are going to be multiple ways of
    > doing the same thing, which are going to differ in their performance
    > characteristics.  An organization can be reasonably concerned about
    > clients making the wrong choice, just as it is concerned about clients
    > that are making excessive resource demands for other reasons.  There
    > are two issues that I am worried about in taking such a drastic
    > approach as simply refusing to support a valid piece of the protocol,
    > even if that choice is made by the server administrator.  The first is
    > that determining the better choice depends on a lot of variables and
    > that a simple formula governing an option  (e.g. "IO through the
    > metadata server is bad") is unlikely to completely match reality.  The
    > second is that I-don't-like-your-IO-request-so-you-lose is kind of a
    > blunt instrument to deal with the problem.
    >  

    I don't think this is a big issue or that the scenario I describe will be widely used but with Object Storage you may not have (or need to very  often) a channel between the metadata server and the data servers. This partial access scheme may be maintained also in block environments or federated filers for various reasons (security may be one - you don't trust your administrator with all the data).

    > If you have identified some set of bad client practices, you can find
    > the clients doing them, report the appropriate statistics, even, if
    > the issue is critical, artificially give such clients (or specific
    > requests) bad performance in a way that doesn't hurt other clients
    > (unless they are waiting for the first set to do something.  Sigh!),
    > by just delaying processing of their requests by millisecond or two.  
    > That should be enough to preserve metadata-server bandwidth for more
    > worthwhile purposes.  If that's insufficiently discouraging, you can
    > raise the delay.  If you start rejecting requests because you would
    > have done it differently, even if you are correct, you are on the road
    > to creating your own sub-protocol, which is why this kind of thing is
    > worrying, even if legal.
    >  
    >  
    > -----Original Message-----
    > From: Julian Satran [mailto:julian_satran@il.ibm.com]
    > Sent: Monday, December 22, 2003 5:26 AM
    > To: pnfs-ops@yahoogroups.com
    > Cc: pnfs-ops@yahoogroups.com; pnfs-reqs@yahoogroups.com
    > Subject: RE: [pnfs-ops] pNFS Discussion Summary 1: 12/18/03

    >
    > Since I raised the issue of the metadata server not having access to
    > all it's data servers (or at least not with adequate bandwidth) I feel
    > compelled to say that Dave's arguments about supporting 4.0 are
    > compelling enough to make it mandatory. The open issue is if it is
    > legal for a "compliant server" to have serving data disabled by a
    > local administrative function (the old "must implement but may use").
    > Otherwise an organization that wants to discourage use of data serving
    > through the metadata server has very little it can do to enforce
    > policy in a way that will not affect other clients (it may do serve
    > poorly but this still affects other clients).
    >
    > Julo
    >

    >
    > "Noveck, Dave" <dnoveck@netapp.com>
    > 18/12/2003 19:21
    >
    > Please respond to
    > pnfs-ops@yahoogroups.com
    >
    > To
    >
    > <pnfs-ops@yahoogroups.com>, <pnfs-reqs@yahoogroups.com>
    >
    > cc
    >
    > Subject
    >
    > RE: [pnfs-ops] pNFS Discussion Summary 1: 12/18/03
    >
    >
    >
    >
    > Good summary.
    >
    > I want to address the "proxying" issue.
    >
    > > [1.1 Proxying]: Operations/work that can only be done out-of-band vs
    > > alternative access through the NFSv4 server for all operations/work
    >
    > If you are talking about operations in the extension (let's call it
    > NFS-v4.x), that are not in the previous minor version (let's assume
    > that is nfs-v4.1), then you have a choice of whether these are supported
    > for access through the server, or only for access by the client with the
    > data server.  Let's call this the issue of proxying in the strict sense.
    >
    > There is another issue that people are calling "proxying" but is really
    > logically distinct.  That is the issue of access by the previous minor
    > version, e.g. nfs-v4.0 or nfs-v4.1.  Those versions have no concept of
    > separate data servers and they need to be able to work.  End of story.
    > If you can't read files stored in nfs-v4.x with nfs-v4.0, you do not
    > have a minor version without proxying.  You don't have a minor version
    > at all.  I believe the working group is never going to accept that.
    > Even if I'm wrong and you can get the working group to accept that,
    > it is going to be very contentious and thus take up a lot of time.
    > Anybody, who really wants to go down this path should seriously consider
    > the trade-off between supporting something they find objectionable and
    > getting a standard a lot later, if at all.
    >
    > > On one hand, some suggest that a set of out-of-band clients should not
    > > have to also have a data path through the NFSv4 metadata server.  One
    > > reason is that customers may not tolerate the large variability in
    > > performance between out-of-band (when the going is good) and in-band
    > > (when the server chooses not to grant or to take away a delegation)
    > > accesses.  
    >
    > Then such customers will use clients that access things out-of-band
    > whenever possible, and servers that never refuse to give out layout
    > delegations.  You have a number of quality-of-implementations issues
    > for v4.x clients and servers.  If a particular client only supports
    > access via v4.0, then performance will suck, and the working group
    > will understand that, but it won't accept not being able to use
    > v4.0 at all.  The customer is going to be motivated to upgrade his
    > clients for those that need high-performance access, but he may be
    > OK with some clients using v4.0 for a long time, depending on the
    > particular performance those clients need.  (And some will want v2/v3
    > access but that is a matter that the working group has no say about).
    >
    > > Another reason, and I paraphrase someone else here, is that
    > > it is possible to construct out-of-band metadata servers that do not
    > > have access to the data servers except through the clients -- I
    > > encourage the source of this scenario to replace my paraphrasing with a
    > > correct use case, because I find it odd to design for file servers that
    > > do not have access to the data servers.
    >
    > So let's grant that it is possible (and we'll pass over the issue of
    > whether it is desirable, and in fact so desirable that one is willing to
    > not get a standard and or get it much later).
    >
    > So we have a metadata server and it, for whatever reason, does not have
    > access to the data servers.  However, by hypothesis, there are machines
    > (e.g. clients), that can communicate with both.  So, if one has such an
    > architecture, then one can take such a machine, give it a communication path
    > to the meta-data server and the data server and have the meta-data server
    > transfer v4.0 READ requests to it, let it read the data from the data
    > server and send it back to the meta-data server who send it back to the
    > original requestor.  Is that a very good solution?  No.  Is it likely
    > to be performant?  No.  Will it satisfy any particular customer?  I don't
    > know and that is the implementer's business decision.  Will it satisfy
    > the hypothetical customer who doesn't care about v4.0 access?  Clearly.
    > Will it satisfy the v4 working group?  Yes, because they are not in the
    > business of telling you how performant v4.0 access has got to be.
    >
    > > On the other hand, others have suggested that any access or work that a
    > > client can do out-of-band should be possible with one or more commands
    > > applied to the metadata server's data path.  This has been proposed for
    > > coping with recalled delegations, including concurrent writing by
    > > multiple clients; retry after client access errors, provided adequate
    > > idempotency of out-of-band operations; and many alternative
    > > implementations of out-of-band clients, including legacy clients that
    > > use out-of-band never or rarely.
    >
    > This effort is going to take a while, but if we manage it correctly, it
    > is not going to take so long that v3 clients are going to be rare things,
    > and they have to be supported.  But v3 clients are not an issue for the
    > working group.  V4.0 clients are and they will be around and you will
    > have to support them, and I believe the working group is not going to
    > be disposed to cut you a lot of slack on this issue (and I don't see
    > why it should).
    >
    > > I think this is a topic that should be argued one way or the other in
    > > the requirements document.  Use cases and examples in other systems
    > > would be best.
    >
    > I think the requirement should be that this work should be done as a
    > set of extensions to nfs-v4 delivered as a v4 minor version.  If there
    > is some feature/requirement that conflicts with that model (and it is a
    > pretty flexible one), then you have to think long and hard before deciding
    > that that requirement is more important than this basic deivery vehicle,
    > because it seems to me that it is, in almost all respects, the ideal way
    > to make this sort of technology available for widespread use.
    >
    >
    >
    >
    >
    >
    > To unsubscribe from this group, send an email to:
    > pnfs-ops-unsubscribe@yahoogroups.com
    >
    >
    >
    > Yahoo! Groups Links
    >
    > To visit your group on the web, go to:
    > http://groups.yahoo.com/group/pnfs-ops/
    >
    > To unsubscribe from this group, send an email to:
    > pnfs-ops-unsubscribe@yahoogroups.com
    >
    > Your use of Yahoo! Groups is subject to:
    > http://docs.yahoo.com/info/terms/
    >
    >
    >
    >
    >
    > To unsubscribe from this group, send an email to:
    > pnfs-ops-unsubscribe@yahoogroups.com
    >
    >
    >
    >
    >
    > Yahoo! Groups Links
    > To visit your group on the web, go to:
    > http://groups.yahoo.com/group/pnfs-ops/
    >  
    > To unsubscribe from this group, send an email to:
    > pnfs-ops-unsubscribe@yahoogroups.com
    >  
    > Your use of Yahoo! Groups is subject to the Yahoo! Terms of Service.
    >
    >
    > Yahoo! Groups Links
    > To visit your group on the web, go to:
    > http://groups.yahoo.com/group/pnfs-reqs/
    >  
    > To unsubscribe from this group, send an email to:
    > pnfs-reqs-unsubscribe@yahoogroups.com
    >  
    > Your use of Yahoo! Groups is subject to the Yahoo! Terms of Service.
    Yahoo! Groups Links

        * To visit your group on the web, go to:
          http://groups.yahoo.com/group/pnfs-ops/
           
        * To unsubscribe from this group, send an email to:
          pnfs-ops-unsubscribe@yahoogroups.com
           
        * Your use of Yahoo! Groups is subject to the Yahoo! Terms of Service. 

From garth@panasas.com Mon Jan 19 16:15:32 2004
Return-Path: <garth@panasas.com>
X-Sender: garth@panasas.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 70221 invoked from network); 20 Jan 2004 00:15:28 -0000
Received: from unknown (66.218.66.217)
by m20.grp.scd.yahoo.com with QMQP; 20 Jan 2004 00:15:28 -0000
Received: from unknown (HELO PIKES.panasas.com) (65.194.124.178)
by mta2.grp.scd.yahoo.com with SMTP; 20 Jan 2004 00:15:26 -0000
Received: from [172.17.2.81] ([172.17.2.81]) by PIKES.panasas.com with SMTP (Microsoft Exchange Internet Mail Service Version 5.5.2653.13)
id SVSYKMAZ; Mon, 19 Jan 2004 19:15:24 -0500
In-Reply-To: <30489F1321F5C343ACF6872B2CF7942A05D3879D@PIKES.panasas.com>
References: <30489F1321F5C343ACF6872B2CF7942A05D3879D@PIKES.panasas.com>
Mime-Version: 1.0 (Apple Message framework v609)
Content-Type: text/plain; charset=US-ASCII; format=flowed
Message-Id: <BC9EEF80-4ADD-11D8-AC14-000A95A94F04@panasas.com>
Content-Transfer-Encoding: 7bit
Cc: pNFS Operations <pnfs-ops@yahoogroups.com>
Date: Mon, 19 Jan 2004 19:15:22 -0500
To: pnfs-reqs@yahoogroups.com
X-Mailer: Apple Mail (2.609)
X-eGroups-Remote-IP: 65.194.124.178
From: Garth Gibson <garth@panasas.com>
Subject: Re: [pnfs-reqs] Re: [pnfs-ops] pNFS Discussion Summary 1: 12/18/0 3: subtopic: proxying
X-Yahoo-Group-Post: member; u=169457820
X-Yahoo-Profile: garth_a_gibson

I think there are multiple issues being reviewed here.

First, what I called Functional Proxying is what Dave Noveck
understands :-) It is a safety valve for clients that don't want to,
can't, or think it is slower to directly access the leaf storage
server. In this case a client can always accomplish any legal file
system state transformation using NFSv4 operations on the metadata
server.

For the other scenario, I believe it was Julian Satran who mentioned
that IBM has considered situations where the metadata server does not
have good access, if any, to the storage devices directly. For
example, FC disks connected to all clients with FC NICs, and a metadata
server connected to clients via Ethernet and not connected to FC at
all. In this case, the metadata wants all, or almost all, accesses to
be direct from client to storage. It was in this context that it was
mentioned that a client could proxy a command from a metadata server to
storage.

> Since I raised the issue of the metadata server not having access to
> all it's data servers (or at least not with adequate bandwidth) I feel
> compelled to say that Dave's arguments about supporting 4.0 are
> compelling enough to make it mandatory. The open issue is if it is
> legal for a "compliant server" to have serving data disabled by a
> local administrative function (the old "must implement but may use").
> Otherwise an organization that wants to discourage use of data serving
> through the metadata server has very little it can do to enforce
> policy in a way that will not affect other clients (it may do serve
> poorly but this still affects other clients).
> Julo

> I guess that proxying through a client should be recomended but not
> mandated.
> We might the want to find how to do it while respecting restrictions
> removed the metadata server from the path.
> Julo

My guess is that Dean is worried that a metadata server needs tighter
control over storage than can be achieved by asking a client to do work
on its behalf, in a trust model where clients are not trusted at the
same level as servers. Man-in-the-middle security attacks come to mind
very easily.

Channeling for Julian, while this is a valid issue for clear text
commands sent to untrusted clients, object storage has done command
level digital signature things to ensure untrusted clients can't
tamper, but denial of service remains a threat.

My own take on this would be to say that the "client proxy" should be
separated from the untrusted clients and pulled into the server trust
domain, making it is logical node in the server's box. In this case
the client proxying is an implementation artifact and we need not
concern ourselves with it.

Dave, Dean, Julian, please correct me if I am not representing your
position correctly.

garth

==============================================================
On Jan 19, 2004, at 6:30 PM, Halevy, Benny wrote:

> Dave Noveck wrote:
>> At some point the phrase "proxying through the client" was used and I
>> realize I don't know what is meant by it. It doesn't seem to match
>> the "proxying" that was being discussed originally. How would the
>> client be a proxy for (presumably) the server? What am I missing?
>
> I think it was you who suggested (maybe in a rhetorical way) that
> when the metadata server is not capable of accessing the storage
> it manages it should still be able to perform I/O using a client.
> Maybe this created the "proxying through the client" idea...
>
> Benny
>
>> -----Original Message-----
>> From: Noveck, Dave [mailto:dnoveck@netapp.com]
>> Sent: Monday, January 19, 2004 6:23 PM
>> To: pnfs-reqs@yahoogroups.com
>> Cc: pNFS Operations
>> Subject: RE: [pnfs-reqs] Re: [pnfs-ops] pNFS Discussion Summary 1:
>> 12/18/03: subtopic: proxying
>>
>>
>> Dean Hildebrand wrote:
>>> I think relying on clients to do anything correctly is against the
>>> inherent nature of NFS. Clients in NFS are transient and cannot be
>>> trusted to do anything correctly. Therefore, the metadata server
>>> should
>>> find its own way to write data to the data servers without relying on
>>> clients. If proxying through a client is optional, it still seems
>>> orthogonal to the behavior of existing installations and the spirit
>>> of
>>> NFS. Maybe there is a valid use case someone could describe?
>>
>> I'm now totally confused. Before we talk about use cases for
>> "proxying
>> through a client", I'd like to understand what it is.
>>
>> My understanding is that when this discussion started, a number of
>> people
>> were referring to a client writing data by sending a write to the
>> meta-
>> data server (aka the NFS server) as "proxying", because, if your view
>> is
>> that the proper/best/ideal way of doing data transfer operations is to
>> obtain mapping information and then do a write to the data server
>> (i.e.
>> other NFS server or object data server or SAN-connected disk), then
>> the direct NFS write can be seen as the meta-data server acting as the
>> client's proxy. Is my understanding correct?
>>
>> No matter how you come down on the quesion of the desirability of
>> that,
>> I don't think there any way to argue that doing a write by sending an
>> NFS write request to an NFS server is against the inherent nature of
>> NFS. Nor does it ask the client do anything correctly that it hasn't
>> been doing all along.
>>
>> At some point the phrase "proxying through the client" was used and I
>> realize I don't know what is meant by it. It doesn't seem to match
>> the "proxying" that was being discussed originally. How would the
>> client be a proxy for (presumably) the server? What am I missing?
>>
>> -----Original Message-----
>> From: Dean Hildebrand [mailto:dhildebz@eecs.umich.edu]
>> Sent: Monday, January 19, 2004 5:13 PM
>> To: pNFS Requirements
>> Cc: pNFS Operations
>> Subject: Re: [pnfs-reqs] Re: [pnfs-ops] pNFS Discussion Summary 1:
>> 12/18/03: subtopic: proxying
>>
>>
>>>> [1.1.2 Functional proxying]: a file transformation achievable by an
>>>> NFS-v4.x client using a set of data server operations must be a
>>>> equivalently achievable using a (probably different) set of NFS-v4.x
>>>> server operations
>>>>
>>>> This is the topic I intended to address in the last email. I believe
>>>> Dave is arguing that even with metadata servers that do not have
>>>> access
>>>> to their data servers, the vendor of such a metadata server can
>>>> construct a proprietary protocol for the metadata server to (strict)
>>>> proxy data server accesses through clients that do have data server
>>>> access. I am not comfortable making up a counter to this, so I
>>>> exhort
>>>> those that want a metadata server without data server access to
>>>> speak
>>>> up if they disagree.
>>>>
>>>>> On one hand, some suggest that a set of out-of-band clients should
>>>>> not
>>>
>>>>> have to also have a data path through the NFSv4 metadata server.
>>>>> One
>>>>> reason is that customers may not tolerate the large variability in
>>>>> performance between out-of-band (when the going is good) and
>>>>> in-band
>>>>> (when the server chooses not to grant or to take away a delegation)
>>>>> accesses. Another reason, and I paraphrase someone else here, is
>>>>> that
>>>
>>>>> it is possible to construct out-of-band metadata servers that do
>>>>> not
>>>>> have access to the data servers except through the clients -- I
>>>>> encourage the source of this scenario to replace my paraphrasing
>>>>> with
>>>>> a correct use case, because I find it odd to design for file
>>>>> servers
>>>>> that do not have access to the data servers.
>>>>>
>>>>> On the other hand, others have suggested that any access or work
>>>>> that
>>>>> a client can do out-of-band should be possible with one or more
>>>>> commands applied to the metadata server's data path. This has been
>>>>> proposed for coping with recalled delegations, including concurrent
>>>>> writing by multiple clients; retry after client access errors,
>>>>> provided adequate idempotency of out-of-band operations; and many
>>>>> alternative implementations of out-of-band clients, including
>>>>> legacy
>>>>> clients that use out-of-band never or rarely.
>>>>>
>>>>> I think this is a topic that should be argued one way or the other
>>>>> in
>>>>> the requirements document. Use cases and examples in other systems
>>>>> would be best.
>>>>
>>>
>>> I guess that proxying through a client should be recomended but not
>>> mandated.
>>> We might the want to find how to do it while respecting restrictions
>>> removed the metadata server from the path.
>>
>> I think relying on clients to do anything correctly is against the
>> inherent nature of NFS. Clients in NFS are transient and cannot be
>> trusted to do anything correctly. Therefore, the metadata
>> server should
>> find its own way to write data to the data servers without relying on
>> clients. If proxying through a client is optional, it still seems
>> orthogonal to the behavior of existing installations and the spirit of
>> NFS. Maybe there is a valid use case someone could describe?
>>
>> Dean
>>

From dnoveck@netapp.com Tue Jan 20 03:19:31 2004
Return-Path: <Dave.Noveck@netapp.com>
X-Sender: Dave.Noveck@netapp.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 21140 invoked from network); 20 Jan 2004 11:19:30 -0000
Received: from unknown (66.218.66.218)
by m11.grp.scd.yahoo.com with QMQP; 20 Jan 2004 11:19:30 -0000
Received: from unknown (HELO mx01.netapp.com) (198.95.226.53)
by mta3.grp.scd.yahoo.com with SMTP; 20 Jan 2004 11:19:30 -0000
Received: from hawk.corp.netapp.com (hawk [10.57.156.122])
by mx01.netapp.com (8.12.10/8.12.10/NTAP-1.4) with ESMTP id i0KBJPKw001618;
Tue, 20 Jan 2004 03:19:25 -0800 (PST)
Received: from svlexc01.hq.netapp.com (svlexc01.corp.netapp.com [10.10.22.171])
by hawk.corp.netapp.com (8.12.9/8.12.9/NTAP-1.5) with ESMTP id i0KBJPSR013716;
Tue, 20 Jan 2004 03:19:25 -0800 (PST)
content-class: urn:content-classes:message
MIME-Version: 1.0
Content-Type: text/plain;
charset="iso-8859-1"
Content-Transfer-Encoding: quoted-printable
X-MimeOLE: Produced By Microsoft Exchange V6.0.6249.0
Date: Tue, 20 Jan 2004 03:19:20 -0800
Message-ID: <C8CF60CFC4D8A74E9945E32CF096548A6D3667@silver.nane.netapp.com>
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
Thread-Topic: [pnfs-reqs] Re: [pnfs-ops] pNFS Discussion Summary 1: 12/18/03: subtopic: proxying
Thread-Index: AcPe5E3+ne+ylrX4RNKlmGvjXmcMYwAXzstw
To: <pnfs-ops@yahoogroups.com>, <pnfs-reqs@yahoogroups.com>
X-eGroups-Remote-IP: 198.95.226.53
X-eGroups-From: "Noveck, Dave" <Dave.Noveck@netapp.com>
From: "Noveck, Dave" <dnoveck@netapp.com>
Subject: RE: [pnfs-reqs] Re: [pnfs-ops] pNFS Discussion Summary 1: 12/18/03: subtopic: proxying
X-Yahoo-Group-Post: member; u=44152831
X-Yahoo-Profile: davidnoveck

You may be right about the origin of this but I did not suggest that
*NFS clients* take part when IO was done in this way. When
considering a situation in which there was no direct connection
between the meta-data server and the data server, I did note
that there was a large set of machines that had a connection to
both, making it possible/easy to provide a connection between
meta-data server and data server, albeit indirect.

While it is true that that large class of machines can have NFS
clients running on them (and many will), I don't think it is a
good idea to place the burden of effecting this communication
(to help servers without a direct communications path) on the
clients. This is as opposed to a server using the same hardware
that a client would use to effect an indirect communication
path, which seems quite reasonable to me, but does not affect
the client-server protocol.

In addition to the reasons that Dean cites for finding this
troublesome, let me add one more. Suppose we have IO from
a v4.0 client, necessitating access by the meta-data server
to the data server. If that function were imposed as a
requirement on v4.x clients, then how do you deal with the
case in which no v4.x clients are functioning? Previous V4
minor versions should just work and making them dependent
on v4.x clients is not going to fly. The server has to
support v4.0 and can use the same hardware as clients and
much of the same software, but effecting the necessary
communication is part of the server's responsibility.

-----Original Message-----
From: Halevy, Benny [mailto:bhalevy@panasas.com]
Sent: Monday, January 19, 2004 6:31 PM
To: 'pnfs-reqs@yahoogroups.com'
Cc: pNFS Operations
Subject: RE: [pnfs-reqs] Re: [pnfs-ops] pNFS Discussion Summary 1:
12/18/03: subtopic: proxying


Dave Noveck wrote:
>At some point the phrase "proxying through the client" was used and I
>realize I don't know what is meant by it. It doesn't seem to match
>the "proxying" that was being discussed originally. How would the
>client be a proxy for (presumably) the server? What am I missing?

I think it was you who suggested (maybe in a rhetorical way) that
when the metadata server is not capable of accessing the storage
it manages it should still be able to perform I/O using a client.
Maybe this created the "proxying through the client" idea...

Benny

>-----Original Message-----
>From: Noveck, Dave [mailto:dnoveck@netapp.com]
>Sent: Monday, January 19, 2004 6:23 PM
>To: pnfs-reqs@yahoogroups.com
>Cc: pNFS Operations
>Subject: RE: [pnfs-reqs] Re: [pnfs-ops] pNFS Discussion Summary 1:
>12/18/03: subtopic: proxying
>
>
>Dean Hildebrand wrote:
>> I think relying on clients to do anything correctly is against the
>> inherent nature of NFS. Clients in NFS are transient and cannot be
>> trusted to do anything correctly. Therefore, the metadata
>server should
>> find its own way to write data to the data servers without relying on
>> clients. If proxying through a client is optional, it still seems
>> orthogonal to the behavior of existing installations and the
>spirit of
>> NFS. Maybe there is a valid use case someone could describe?
>
>I'm now totally confused. Before we talk about use cases for "proxying
>through a client", I'd like to understand what it is.
>
>My understanding is that when this discussion started, a
>number of people
>were referring to a client writing data by sending a write to the meta-
>data server (aka the NFS server) as "proxying", because, if
>your view is
>that the proper/best/ideal way of doing data transfer operations is to
>obtain mapping information and then do a write to the data server (i.e.
>other NFS server or object data server or SAN-connected disk), then
>the direct NFS write can be seen as the meta-data server acting as the
>client's proxy. Is my understanding correct?
>
>No matter how you come down on the quesion of the desirability of that,
>I don't think there any way to argue that doing a write by sending an
>NFS write request to an NFS server is against the inherent nature of
>NFS. Nor does it ask the client do anything correctly that it hasn't
>been doing all along.
>
>At some point the phrase "proxying through the client" was used and I
>realize I don't know what is meant by it. It doesn't seem to match
>the "proxying" that was being discussed originally. How would the
>client be a proxy for (presumably) the server? What am I missing?
>
>-----Original Message-----
>From: Dean Hildebrand [mailto:dhildebz@eecs.umich.edu]
>Sent: Monday, January 19, 2004 5:13 PM
>To: pNFS Requirements
>Cc: pNFS Operations
>Subject: Re: [pnfs-reqs] Re: [pnfs-ops] pNFS Discussion Summary 1:
>12/18/03: subtopic: proxying
>
>
>> > [1.1.2 Functional proxying]: a file transformation
>achievable by an
>> > NFS-v4.x client using a set of data server operations must be a
>> > equivalently achievable using a (probably different) set
>of NFS-v4.x
>> > server operations
>> >
>> > This is the topic I intended to address in the last email.
> I believe
>> > Dave is arguing that even with metadata servers that do
>not have access
>> > to their data servers, the vendor of such a metadata server can
>> > construct a proprietary protocol for the metadata server
>to (strict)
>> > proxy data server accesses through clients that do have
>data server
>> > access. I am not comfortable making up a counter to this,
>so I exhort
>> > those that want a metadata server without data server
>access to speak
>> > up if they disagree.
>> >
>> > > On one hand, some suggest that a set of out-of-band
>clients should not
>>
>> > > have to also have a data path through the NFSv4 metadata
>server. One
>> > > reason is that customers may not tolerate the large
>variability in
>> > > performance between out-of-band (when the going is good)
>and in-band
>> > > (when the server chooses not to grant or to take away a
>delegation)
>> > > accesses. Another reason, and I paraphrase someone else
>here, is that
>>
>> > > it is possible to construct out-of-band metadata servers
>that do not
>> > > have access to the data servers except through the clients -- I
>> > > encourage the source of this scenario to replace my
>paraphrasing with
>> > > a correct use case, because I find it odd to design for
>file servers
>> > > that do not have access to the data servers.
>> > >
>> > > On the other hand, others have suggested that any access
>or work that
>> > > a client can do out-of-band should be possible with one or more
>> > > commands applied to the metadata server's data path.
>This has been
>> > > proposed for coping with recalled delegations, including
>concurrent
>> > > writing by multiple clients; retry after client access errors,
>> > > provided adequate idempotency of out-of-band operations;
>and many
>> > > alternative implementations of out-of-band clients,
>including legacy
>> > > clients that use out-of-band never or rarely.
>> > >
>> > > I think this is a topic that should be argued one way or
>the other in
>> > > the requirements document. Use cases and examples in
>other systems
>> > > would be best.
>> >
>>
>> I guess that proxying through a client should be recomended but not
>> mandated.
>> We might the want to find how to do it while respecting restrictions
>> removed the metadata server from the path.
>
>I think relying on clients to do anything correctly is against the
>inherent nature of NFS. Clients in NFS are transient and cannot be
>trusted to do anything correctly. Therefore, the metadata
>server should
>find its own way to write data to the data servers without relying on
>clients. If proxying through a client is optional, it still seems
>orthogonal to the behavior of existing installations and the spirit of
>NFS. Maybe there is a valid use case someone could describe?
>
>Dean
>
>
>
>
>
>Yahoo! Groups Links
>
>To visit your group on the web, go to:
> http://groups.yahoo.com/group/pnfs-reqs/
>
>To unsubscribe from this group, send an email to:
> pnfs-reqs-unsubscribe@yahoogroups.com
>
>Your use of Yahoo! Groups is subject to:
> http://docs.yahoo.com/info/terms/
>
>
>
>
>
>------------------------ Yahoo! Groups Sponsor
>---------------------~-->
>Upgrade to 128-bit SSL Security!
>http://us.click.yahoo.com/qZ0LdD/yjVHAA/TtwFAA/W6uqlB/TM
>---------------------------------------------------------------
>------~->
>
>Yahoo! Groups Links
>
>To visit your group on the web, go to:
> http://groups.yahoo.com/group/pnfs-reqs/
>
>To unsubscribe from this group, send an email to:
> pnfs-reqs-unsubscribe@yahoogroups.com
>
>Your use of Yahoo! Groups is subject to:
> http://docs.yahoo.com/info/terms/
>
>



Yahoo! Groups Links

To visit your group on the web, go to:
http://groups.yahoo.com/group/pnfs-ops/

To unsubscribe from this group, send an email to:
pnfs-ops-unsubscribe@yahoogroups.com

Your use of Yahoo! Groups is subject to:
http://docs.yahoo.com/info/terms/

From bhalevy@panasas.com Tue Jan 20 08:00:59 2004
Return-Path: <bhalevy@panasas.com>
X-Sender: bhalevy@panasas.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 44018 invoked from network); 20 Jan 2004 16:00:58 -0000
Received: from unknown (66.218.66.172)
by m4.grp.scd.yahoo.com with QMQP; 20 Jan 2004 16:00:58 -0000
Received: from unknown (HELO PIKES.panasas.com) (65.194.124.178)
by mta4.grp.scd.yahoo.com with SMTP; 20 Jan 2004 16:00:58 -0000
Received: by PIKES.panasas.com with Internet Mail Service (5.5.2653.19)
id <SVSYKVPR>; Tue, 20 Jan 2004 11:00:56 -0500
Message-ID: <30489F1321F5C343ACF6872B2CF7942A05D3879F@PIKES.panasas.com>
To: "'pnfs-reqs@yahoogroups.com'" <pnfs-reqs@yahoogroups.com>,
pnfs-ops@yahoogroups.com
Date: Tue, 20 Jan 2004 11:00:55 -0500
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2653.19)
Content-Type: text/plain;
charset="iso-8859-1"
X-eGroups-Remote-IP: 65.194.124.178
From: "Halevy, Benny" <bhalevy@panasas.com>
Subject: RE: [pnfs-reqs] Re: [pnfs-ops] pNFS Discussion Summary 1: 12/18/0
3: subtopic: proxying
X-Yahoo-Group-Post: member; u=169276676
X-Yahoo-Profile: benny_halevy

Dave, I completely agree with your assertions below.
One more reason not to provide support in the NFS protocol
for such servers is to guarantee interoperability with simple
MFSv4.x clients that do not support out-of-band I/O or
some optional extensions, e.g. write sharing (if we spec.
it). Without the ability to read and write via the NFS
server, sharing a file that's being written by one or more
writers needs complete support for write sharing by all
clients as well as the server.

I suggest we mention this issue in the problem statement
document and explain why we want to leave it open for the
server implementation to solve and don't want to solve
it within the NFS protocol.

Benny

>-----Original Message-----
>From: Noveck, Dave [mailto:dnoveck@netapp.com]
>Sent: Tuesday, January 20, 2004 6:19 AM
>To: pnfs-ops@yahoogroups.com; pnfs-reqs@yahoogroups.com
>Subject: RE: [pnfs-reqs] Re: [pnfs-ops] pNFS Discussion Summary 1:
>12/18/03: subtopic: proxying
>
>
>You may be right about the origin of this but I did not suggest that
>*NFS clients* take part when IO was done in this way. When
>considering a situation in which there was no direct connection
>between the meta-data server and the data server, I did note
>that there was a large set of machines that had a connection to
>both, making it possible/easy to provide a connection between
>meta-data server and data server, albeit indirect.
>
>While it is true that that large class of machines can have NFS
>clients running on them (and many will), I don't think it is a
>good idea to place the burden of effecting this communication
>(to help servers without a direct communications path) on the
>clients. This is as opposed to a server using the same hardware
>that a client would use to effect an indirect communication
>path, which seems quite reasonable to me, but does not affect
>the client-server protocol.
>
>In addition to the reasons that Dean cites for finding this
>troublesome, let me add one more. Suppose we have IO from
>a v4.0 client, necessitating access by the meta-data server
>to the data server. If that function were imposed as a
>requirement on v4.x clients, then how do you deal with the
>case in which no v4.x clients are functioning? Previous V4
>minor versions should just work and making them dependent
>on v4.x clients is not going to fly. The server has to
>support v4.0 and can use the same hardware as clients and
>much of the same software, but effecting the necessary
>communication is part of the server's responsibility.
>
>-----Original Message-----
>From: Halevy, Benny [mailto:bhalevy@panasas.com]
>Sent: Monday, January 19, 2004 6:31 PM
>To: 'pnfs-reqs@yahoogroups.com'
>Cc: pNFS Operations
>Subject: RE: [pnfs-reqs] Re: [pnfs-ops] pNFS Discussion Summary 1:
>12/18/03: subtopic: proxying
>
>
>Dave Noveck wrote:
>>At some point the phrase "proxying through the client" was used and I
>>realize I don't know what is meant by it. It doesn't seem to match
>>the "proxying" that was being discussed originally. How would the
>>client be a proxy for (presumably) the server? What am I missing?
>
>I think it was you who suggested (maybe in a rhetorical way) that
>when the metadata server is not capable of accessing the storage
>it manages it should still be able to perform I/O using a client.
>Maybe this created the "proxying through the client" idea...
>
>Benny
>
>>-----Original Message-----
>>From: Noveck, Dave [mailto:dnoveck@netapp.com]
>>Sent: Monday, January 19, 2004 6:23 PM
>>To: pnfs-reqs@yahoogroups.com
>>Cc: pNFS Operations
>>Subject: RE: [pnfs-reqs] Re: [pnfs-ops] pNFS Discussion Summary 1:
>>12/18/03: subtopic: proxying
>>
>>
>>Dean Hildebrand wrote:
>>> I think relying on clients to do anything correctly is against the
>>> inherent nature of NFS. Clients in NFS are transient and cannot be
>>> trusted to do anything correctly. Therefore, the metadata
>>server should
>>> find its own way to write data to the data servers without
>relying on
>>> clients. If proxying through a client is optional, it still seems
>>> orthogonal to the behavior of existing installations and the
>>spirit of
>>> NFS. Maybe there is a valid use case someone could describe?
>>
>>I'm now totally confused. Before we talk about use cases for
>"proxying
>>through a client", I'd like to understand what it is.
>>
>>My understanding is that when this discussion started, a
>>number of people
>>were referring to a client writing data by sending a write to
>the meta-
>>data server (aka the NFS server) as "proxying", because, if
>>your view is
>>that the proper/best/ideal way of doing data transfer operations is to
>>obtain mapping information and then do a write to the data
>server (i.e.
>>other NFS server or object data server or SAN-connected disk), then
>>the direct NFS write can be seen as the meta-data server acting as the
>>client's proxy. Is my understanding correct?
>>
>>No matter how you come down on the quesion of the
>desirability of that,
>>I don't think there any way to argue that doing a write by sending an
>>NFS write request to an NFS server is against the inherent nature of
>>NFS. Nor does it ask the client do anything correctly that it hasn't
>>been doing all along.
>>
>>At some point the phrase "proxying through the client" was used and I
>>realize I don't know what is meant by it. It doesn't seem to match
>>the "proxying" that was being discussed originally. How would the
>>client be a proxy for (presumably) the server? What am I missing?
>>
>>-----Original Message-----
>>From: Dean Hildebrand [mailto:dhildebz@eecs.umich.edu]
>>Sent: Monday, January 19, 2004 5:13 PM
>>To: pNFS Requirements
>>Cc: pNFS Operations
>>Subject: Re: [pnfs-reqs] Re: [pnfs-ops] pNFS Discussion Summary 1:
>>12/18/03: subtopic: proxying
>>
>>
>>> > [1.1.2 Functional proxying]: a file transformation
>>achievable by an
>>> > NFS-v4.x client using a set of data server operations must be a
>>> > equivalently achievable using a (probably different) set
>>of NFS-v4.x
>>> > server operations
>>> >
>>> > This is the topic I intended to address in the last email.
>> I believe
>>> > Dave is arguing that even with metadata servers that do
>>not have access
>>> > to their data servers, the vendor of such a metadata server can
>>> > construct a proprietary protocol for the metadata server
>>to (strict)
>>> > proxy data server accesses through clients that do have
>>data server
>>> > access. I am not comfortable making up a counter to this,
>>so I exhort
>>> > those that want a metadata server without data server
>>access to speak
>>> > up if they disagree.
>>> >
>>> > > On one hand, some suggest that a set of out-of-band
>>clients should not
>>>
>>> > > have to also have a data path through the NFSv4 metadata
>>server. One
>>> > > reason is that customers may not tolerate the large
>>variability in
>>> > > performance between out-of-band (when the going is good)
>>and in-band
>>> > > (when the server chooses not to grant or to take away a
>>delegation)
>>> > > accesses. Another reason, and I paraphrase someone else
>>here, is that
>>>
>>> > > it is possible to construct out-of-band metadata servers
>>that do not
>>> > > have access to the data servers except through the clients -- I
>>> > > encourage the source of this scenario to replace my
>>paraphrasing with
>>> > > a correct use case, because I find it odd to design for
>>file servers
>>> > > that do not have access to the data servers.
>>> > >
>>> > > On the other hand, others have suggested that any access
>>or work that
>>> > > a client can do out-of-band should be possible with one or more
>>> > > commands applied to the metadata server's data path.
>>This has been
>>> > > proposed for coping with recalled delegations, including
>>concurrent
>>> > > writing by multiple clients; retry after client access errors,
>>> > > provided adequate idempotency of out-of-band operations;
>>and many
>>> > > alternative implementations of out-of-band clients,
>>including legacy
>>> > > clients that use out-of-band never or rarely.
>>> > >
>>> > > I think this is a topic that should be argued one way or
>>the other in
>>> > > the requirements document. Use cases and examples in
>>other systems
>>> > > would be best.
>>> >
>>>
>>> I guess that proxying through a client should be recomended but not
>>> mandated.
>>> We might the want to find how to do it while respecting
>restrictions
>>> removed the metadata server from the path.
>>
>>I think relying on clients to do anything correctly is against the
>>inherent nature of NFS. Clients in NFS are transient and cannot be
>>trusted to do anything correctly. Therefore, the metadata
>>server should
>>find its own way to write data to the data servers without relying on
>>clients. If proxying through a client is optional, it still seems
>>orthogonal to the behavior of existing installations and the spirit of
>>NFS. Maybe there is a valid use case someone could describe?
>>
>>Dean
>>
>>
>>
>>
>>
>>Yahoo! Groups Links
>>
>>To visit your group on the web, go to:
>> http://groups.yahoo.com/group/pnfs-reqs/
>>
>>To unsubscribe from this group, send an email to:
>> pnfs-reqs-unsubscribe@yahoogroups.com
>>
>>Your use of Yahoo! Groups is subject to:
>> http://docs.yahoo.com/info/terms/
>>
>>
>>
>>
>>
>>------------------------ Yahoo! Groups Sponsor
>>---------------------~-->
>>Upgrade to 128-bit SSL Security!
>>http://us.click.yahoo.com/qZ0LdD/yjVHAA/TtwFAA/W6uqlB/TM
>>---------------------------------------------------------------
>>------~->
>>
>>Yahoo! Groups Links
>>
>>To visit your group on the web, go to:
>> http://groups.yahoo.com/group/pnfs-reqs/
>>
>>To unsubscribe from this group, send an email to:
>> pnfs-reqs-unsubscribe@yahoogroups.com
>>
>>Your use of Yahoo! Groups is subject to:
>> http://docs.yahoo.com/info/terms/
>>
>>
>
>
>
>Yahoo! Groups Links
>
>To visit your group on the web, go to:
> http://groups.yahoo.com/group/pnfs-ops/
>
>To unsubscribe from this group, send an email to:
> pnfs-ops-unsubscribe@yahoogroups.com
>
>Your use of Yahoo! Groups is subject to:
> http://docs.yahoo.com/info/terms/
>
>
>
>
>
>------------------------ Yahoo! Groups Sponsor
>---------------------~-->
>Upgrade to 128-bit SSL Security!
>http://us.click.yahoo.com/qZ0LdD/yjVHAA/TtwFAA/W6uqlB/TM
>---------------------------------------------------------------
>------~->
>
>Yahoo! Groups Links
>
>To visit your group on the web, go to:
> http://groups.yahoo.com/group/pnfs-reqs/
>
>To unsubscribe from this group, send an email to:
> pnfs-reqs-unsubscribe@yahoogroups.com
>
>Your use of Yahoo! Groups is subject to:
> http://docs.yahoo.com/info/terms/
>
>

From Thomas.Talpey@netapp.com Tue Jan 20 09:08:11 2004
Return-Path: <Thomas.Talpey@netapp.com>
X-Sender: Thomas.Talpey@netapp.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 83125 invoked from network); 20 Jan 2004 17:08:03 -0000
Received: from unknown (66.218.66.166)
by m20.grp.scd.yahoo.com with QMQP; 20 Jan 2004 17:08:02 -0000
Received: from unknown (HELO mx01.netapp.com) (198.95.226.53)
by mta5.grp.scd.yahoo.com with SMTP; 20 Jan 2004 17:08:00 -0000
Received: from frejya.corp.netapp.com (frejya [10.57.157.119])
by mx01.netapp.com (8.12.10/8.12.10/NTAP-1.4) with ESMTP id i0KH7xKw019662;
Tue, 20 Jan 2004 09:07:59 -0800 (PST)
Received: from svlexc01.hq.netapp.com (svlexc01.corp.netapp.com [10.10.22.171])
by frejya.corp.netapp.com (8.12.9/8.12.9/NTAP-1.5) with ESMTP id i0KH7xpr015087;
Tue, 20 Jan 2004 09:07:59 -0800 (PST)
Received: from tmt.netapp.com ([10.97.1.30]) by silver.hq.netapp.com with Microsoft SMTPSVC(5.0.2195.5329); Tue, 20 Jan 2004 12:07:53 -0500
MIME-Version: 1.0
Content-Type: multipart/alternative;
boundary="----_=_NextPart_001_01C3DF77.F0C43A80"
X-MimeOLE: Produced By Microsoft Exchange V6.0.6249.0
content-class: urn:content-classes:message
Date: Tue, 20 Jan 2004 09:07:43 -0800
Message-ID: <5.2.1.1.2.20040120115557.01f84da8@silver.nane.netapp.com>
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
Thread-Topic: [pnfs-reqs] Re: [pnfs-ops] pNFS Discussion Summary 1: 12/18/0 3: subtopic: proxying
Thread-Index: AcPfd/FIyXKQ2cP9TcmTvuQUbzySOg==
To: <pnfs-reqs@yahoogroups.com>
Cc: <pnfs-ops@yahoogroups.com>
X-eGroups-Remote-IP: 198.95.226.53
From: "Talpey, Thomas" <Thomas.Talpey@netapp.com>
Subject: Re: [pnfs-reqs] Re: [pnfs-ops] pNFS Discussion Summary 1: 12/18/0 3: subtopic: proxying
X-Yahoo-Group-Post: member; u=44154239
X-Yahoo-Profile: tmtymailu

At 07:15 PM 1/19/2004, Garth Gibson wrote:
>First, what I called Functional Proxying is what Dave Noveck
>understands :-)  It is a safety valve for clients that don't want to,
>can't, or think it is slower to directly access the leaf storage
>server.

I think we should avoid making up a new term for something that
NFS has been all about since day minus-one. And definitely let's
not call it a safety valve. It's just regular old NFS. Right?

>For the other scenario, I believe it was Julian Satran who mentioned
>that IBM has considered situations where the metadata server does not
>have good access, if any, to the storage devices directly.  For
>example, FC disks connected to all clients with FC NICs, and a metadata
>server connected to clients via Ethernet and not connected to FC at
>all.  In this case, the metadata wants all, or almost all, accesses to
>be direct from client to storage.

So, this is important and sets the tone for where complexity resides.
The issue is not so much whether the client chooses to perform a
direct transfer, but whether it is forced to. This is important, and isn't
a protocol issue, it's an implementation (or perhaps better, "deployment")
choice.

It's not going to be a popular proposal if we dwell on this. I view the
document as centering around what happens when clients negotiate
the advanced version, not how they fall back. Requirements, not
implementation.

Tom.


From julian_satran@il.ibm.com Tue Jan 20 16:54:29 2004
Return-Path: <Julian_Satran@il.ibm.com>
X-Sender: Julian_Satran@il.ibm.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 75408 invoked from network); 21 Jan 2004 00:54:28 -0000
Received: from unknown (66.218.66.218)
by m14.grp.scd.yahoo.com with QMQP; 21 Jan 2004 00:54:28 -0000
Received: from unknown (HELO mtagate2.de.ibm.com) (195.212.29.151)
by mta3.grp.scd.yahoo.com with SMTP; 21 Jan 2004 00:54:27 -0000
Received: from d12relay01.megacenter.de.ibm.com (d12relay01.megacenter.de.ibm.com [9.149.165.180])
by mtagate2.de.ibm.com (8.12.10/8.12.10) with ESMTP id i0L0sPRT089196;
Wed, 21 Jan 2004 00:54:25 GMT
Received: from d10ml001.telaviv.ibm.com (d12av02.megacenter.de.ibm.com [9.149.165.228])
by d12relay01.megacenter.de.ibm.com (8.12.10/NCO/VER6.6) with ESMTP id i0L0sNI3159660;
Wed, 21 Jan 2004 01:54:25 +0100
In-Reply-To: <30489F1321F5C343ACF6872B2CF7942A05D3879F@PIKES.panasas.com>
To: pnfs-ops@yahoogroups.com
Cc: pnfs-ops@yahoogroups.com,
"'pnfs-reqs@yahoogroups.com'" <pnfs-reqs@yahoogroups.com>
MIME-Version: 1.0
X-Mailer: Lotus Notes Release 6.5 September 26, 2003
Message-ID: <OF65A1A887.14195B02-ON88256E21.005FAFF0-88256E22.0004F719@il.ibm.com>
Date: Tue, 20 Jan 2004 16:54:23 -0800
X-MIMETrack: Serialize by Router on D10ML001/10/M/IBM(Release 6.0.2CF2|July 23, 2003) at
21/01/2004 02:54:25,
Serialize complete at 21/01/2004 02:54:25
Content-Type: text/plain; charset="US-ASCII"
X-eGroups-Remote-IP: 195.212.29.151
From: Julian Satran <julian_satran@il.ibm.com>
Subject: RE: [pnfs-reqs] Re: [pnfs-ops] pNFS Discussion Summary 1: 12/18/0 3: subtopic: proxying
X-Yahoo-Group-Post: member; u=64714603
X-Yahoo-Profile: julian_satran

ADVERTISEMENT
Benny,

Even "simple" NFSV4 clients support referrals (Moved). So a metadata
server may refer those requests to another server that has access to data.
The trouble I have is having to mandate this on all users or making an
"optional to use" feature and leave the "ERROR-DATA-ACCESS-NOT
supported-here" as a legal error (and that is the position I am taking).

Julo





"Halevy, Benny" <bhalevy@panasas.com>
20/01/2004 08:00
Please respond to
pnfs-ops


To
"'pnfs-reqs@yahoogroups.com'" <pnfs-reqs@yahoogroups.com>,
pnfs-ops@yahoogroups.com
cc

Subject
RE: [pnfs-reqs] Re: [pnfs-ops] pNFS Discussion Summary 1: 12/18/0 3:
subtopic: proxying






Dave, I completely agree with your assertions below.
One more reason not to provide support in the NFS protocol
for such servers is to guarantee interoperability with simple
MFSv4.x clients that do not support out-of-band I/O or
some optional extensions, e.g. write sharing (if we spec.
it). Without the ability to read and write via the NFS
server, sharing a file that's being written by one or more
writers needs complete support for write sharing by all
clients as well as the server.

I suggest we mention this issue in the problem statement
document and explain why we want to leave it open for the
server implementation to solve and don't want to solve
it within the NFS protocol.

Benny

>-----Original Message-----
>From: Noveck, Dave [mailto:dnoveck@netapp.com]
>Sent: Tuesday, January 20, 2004 6:19 AM
>To: pnfs-ops@yahoogroups.com; pnfs-reqs@yahoogroups.com
>Subject: RE: [pnfs-reqs] Re: [pnfs-ops] pNFS Discussion Summary 1:
>12/18/03: subtopic: proxying
>
>
>You may be right about the origin of this but I did not suggest that
>*NFS clients* take part when IO was done in this way. When
>considering a situation in which there was no direct connection
>between the meta-data server and the data server, I did note
>that there was a large set of machines that had a connection to
>both, making it possible/easy to provide a connection between
>meta-data server and data server, albeit indirect.
>
>While it is true that that large class of machines can have NFS
>clients running on them (and many will), I don't think it is a
>good idea to place the burden of effecting this communication
>(to help servers without a direct communications path) on the
>clients. This is as opposed to a server using the same hardware
>that a client would use to effect an indirect communication
>path, which seems quite reasonable to me, but does not affect
>the client-server protocol.
>
>In addition to the reasons that Dean cites for finding this
>troublesome, let me add one more. Suppose we have IO from
>a v4.0 client, necessitating access by the meta-data server
>to the data server. If that function were imposed as a
>requirement on v4.x clients, then how do you deal with the
>case in which no v4.x clients are functioning? Previous V4
>minor versions should just work and making them dependent
>on v4.x clients is not going to fly. The server has to
>support v4.0 and can use the same hardware as clients and
>much of the same software, but effecting the necessary
>communication is part of the server's responsibility.
>
>-----Original Message-----
>From: Halevy, Benny [mailto:bhalevy@panasas.com]
>Sent: Monday, January 19, 2004 6:31 PM
>To: 'pnfs-reqs@yahoogroups.com'
>Cc: pNFS Operations
>Subject: RE: [pnfs-reqs] Re: [pnfs-ops] pNFS Discussion Summary 1:
>12/18/03: subtopic: proxying
>
>
>Dave Noveck wrote:
>>At some point the phrase "proxying through the client" was used and I
>>realize I don't know what is meant by it. It doesn't seem to match
>>the "proxying" that was being discussed originally. How would the
>>client be a proxy for (presumably) the server? What am I missing?
>
>I think it was you who suggested (maybe in a rhetorical way) that
>when the metadata server is not capable of accessing the storage
>it manages it should still be able to perform I/O using a client.
>Maybe this created the "proxying through the client" idea...
>
>Benny
>
>>-----Original Message-----
>>From: Noveck, Dave [mailto:dnoveck@netapp.com]
>>Sent: Monday, January 19, 2004 6:23 PM
>>To: pnfs-reqs@yahoogroups.com
>>Cc: pNFS Operations
>>Subject: RE: [pnfs-reqs] Re: [pnfs-ops] pNFS Discussion Summary 1:
>>12/18/03: subtopic: proxying
>>
>>
>>Dean Hildebrand wrote:
>>> I think relying on clients to do anything correctly is against the
>>> inherent nature of NFS. Clients in NFS are transient and cannot be
>>> trusted to do anything correctly. Therefore, the metadata
>>server should
>>> find its own way to write data to the data servers without
>relying on
>>> clients. If proxying through a client is optional, it still seems
>>> orthogonal to the behavior of existing installations and the
>>spirit of
>>> NFS. Maybe there is a valid use case someone could describe?
>>
>>I'm now totally confused. Before we talk about use cases for
>"proxying
>>through a client", I'd like to understand what it is.
>>
>>My understanding is that when this discussion started, a
>>number of people
>>were referring to a client writing data by sending a write to
>the meta-
>>data server (aka the NFS server) as "proxying", because, if
>>your view is
>>that the proper/best/ideal way of doing data transfer operations is to
>>obtain mapping information and then do a write to the data
>server (i.e.
>>other NFS server or object data server or SAN-connected disk), then
>>the direct NFS write can be seen as the meta-data server acting as the
>>client's proxy. Is my understanding correct?
>>
>>No matter how you come down on the quesion of the
>desirability of that,
>>I don't think there any way to argue that doing a write by sending an
>>NFS write request to an NFS server is against the inherent nature of
>>NFS. Nor does it ask the client do anything correctly that it hasn't
>>been doing all along.
>>
>>At some point the phrase "proxying through the client" was used and I
>>realize I don't know what is meant by it. It doesn't seem to match
>>the "proxying" that was being discussed originally. How would the
>>client be a proxy for (presumably) the server? What am I missing?
>>
>>-----Original Message-----
>>From: Dean Hildebrand [mailto:dhildebz@eecs.umich.edu]
>>Sent: Monday, January 19, 2004 5:13 PM
>>To: pNFS Requirements
>>Cc: pNFS Operations
>>Subject: Re: [pnfs-reqs] Re: [pnfs-ops] pNFS Discussion Summary 1:
>>12/18/03: subtopic: proxying
>>
>>
>>> > [1.1.2 Functional proxying]: a file transformation
>>achievable by an
>>> > NFS-v4.x client using a set of data server operations must be a
>>> > equivalently achievable using a (probably different) set
>>of NFS-v4.x
>>> > server operations
>>> >
>>> > This is the topic I intended to address in the last email.
>> I believe
>>> > Dave is arguing that even with metadata servers that do
>>not have access
>>> > to their data servers, the vendor of such a metadata server can
>>> > construct a proprietary protocol for the metadata server
>>to (strict)
>>> > proxy data server accesses through clients that do have
>>data server
>>> > access. I am not comfortable making up a counter to this,
>>so I exhort
>>> > those that want a metadata server without data server
>>access to speak
>>> > up if they disagree.
>>> >
>>> > > On one hand, some suggest that a set of out-of-band
>>clients should not
>>>
>>> > > have to also have a data path through the NFSv4 metadata
>>server. One
>>> > > reason is that customers may not tolerate the large
>>variability in
>>> > > performance between out-of-band (when the going is good)
>>and in-band
>>> > > (when the server chooses not to grant or to take away a
>>delegation)
>>> > > accesses. Another reason, and I paraphrase someone else
>>here, is that
>>>
>>> > > it is possible to construct out-of-band metadata servers
>>that do not
>>> > > have access to the data servers except through the clients -- I
>>> > > encourage the source of this scenario to replace my
>>paraphrasing with
>>> > > a correct use case, because I find it odd to design for
>>file servers
>>> > > that do not have access to the data servers.
>>> > >
>>> > > On the other hand, others have suggested that any access
>>or work that
>>> > > a client can do out-of-band should be possible with one or more
>>> > > commands applied to the metadata server's data path.
>>This has been
>>> > > proposed for coping with recalled delegations, including
>>concurrent
>>> > > writing by multiple clients; retry after client access errors,
>>> > > provided adequate idempotency of out-of-band operations;
>>and many
>>> > > alternative implementations of out-of-band clients,
>>including legacy
>>> > > clients that use out-of-band never or rarely.
>>> > >
>>> > > I think this is a topic that should be argued one way or
>>the other in
>>> > > the requirements document. Use cases and examples in
>>other systems
>>> > > would be best.
>>> >
>>>
>>> I guess that proxying through a client should be recomended but not
>>> mandated.
>>> We might the want to find how to do it while respecting
>restrictions
>>> removed the metadata server from the path.
>>
>>I think relying on clients to do anything correctly is against the
>>inherent nature of NFS. Clients in NFS are transient and cannot be
>>trusted to do anything correctly. Therefore, the metadata
>>server should
>>find its own way to write data to the data servers without relying on
>>clients. If proxying through a client is optional, it still seems
>>orthogonal to the behavior of existing installations and the spirit of
>>NFS. Maybe there is a valid use case someone could describe?
>>
>>Dean
>>
>>
>>
>>
>>
>>Yahoo! Groups Links
>>
>>To visit your group on the web, go to:
>> http://groups.yahoo.com/group/pnfs-reqs/
>>
>>To unsubscribe from this group, send an email to:
>> pnfs-reqs-unsubscribe@yahoogroups.com
>>
>>Your use of Yahoo! Groups is subject to:
>> http://docs.yahoo.com/info/terms/
>>
>>
>>
>>
>>
>>------------------------ Yahoo! Groups Sponsor
>>---------------------~-->
>>Upgrade to 128-bit SSL Security!
>>http://us.click.yahoo.com/qZ0LdD/yjVHAA/TtwFAA/W6uqlB/TM
>>---------------------------------------------------------------
>>------~->
>>
>>Yahoo! Groups Links
>>
>>To visit your group on the web, go to:
>> http://groups.yahoo.com/group/pnfs-reqs/
>>
>>To unsubscribe from this group, send an email to:
>> pnfs-reqs-unsubscribe@yahoogroups.com
>>
>>Your use of Yahoo! Groups is subject to:
>> http://docs.yahoo.com/info/terms/
>>
>>
>
>
>
>Yahoo! Groups Links
>
>To visit your group on the web, go to:
> http://groups.yahoo.com/group/pnfs-ops/
>
>To unsubscribe from this group, send an email to:
> pnfs-ops-unsubscribe@yahoogroups.com
>
>Your use of Yahoo! Groups is subject to:
> http://docs.yahoo.com/info/terms/
>
>
>
>
>
>------------------------ Yahoo! Groups Sponsor
>---------------------~-->
>Upgrade to 128-bit SSL Security!
>http://us.click.yahoo.com/qZ0LdD/yjVHAA/TtwFAA/W6uqlB/TM
>---------------------------------------------------------------
>------~->
>
>Yahoo! Groups Links
>
>To visit your group on the web, go to:
> http://groups.yahoo.com/group/pnfs-reqs/
>
>To unsubscribe from this group, send an email to:
> pnfs-reqs-unsubscribe@yahoogroups.com
>
>Your use of Yahoo! Groups is subject to:
> http://docs.yahoo.com/info/terms/
>
>



Yahoo! Groups Links

To visit your group on the web, go to:
http://groups.yahoo.com/group/pnfs-ops/

To unsubscribe from this group, send an email to:
pnfs-ops-unsubscribe@yahoogroups.com

Your use of Yahoo! Groups is subject to:
http://docs.yahoo.com/info/terms/


From dnoveck@netapp.com Wed Jan 21 05:32:00 2004
Return-Path: <Dave.Noveck@netapp.com>
X-Sender: Dave.Noveck@netapp.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 16469 invoked from network); 21 Jan 2004 13:31:53 -0000
Received: from unknown (66.218.66.172)
by m14.grp.scd.yahoo.com with QMQP; 21 Jan 2004 13:31:53 -0000
Received: from unknown (HELO mx01.netapp.com) (198.95.226.53)
by mta4.grp.scd.yahoo.com with SMTP; 21 Jan 2004 13:31:52 -0000
Received: from frejya.corp.netapp.com (frejya [10.57.157.119])
by mx01.netapp.com (8.12.10/8.12.10/NTAP-1.4) with ESMTP id i0LDVpKw025463;
Wed, 21 Jan 2004 05:31:51 -0800 (PST)
Received: from svlexc01.hq.netapp.com (svlexc01.corp.netapp.com [10.10.22.171])
by frejya.corp.netapp.com (8.12.9/8.12.9/NTAP-1.5) with ESMTP id i0LDVppr022754;
Wed, 21 Jan 2004 05:31:51 -0800 (PST)
content-class: urn:content-classes:message
MIME-Version: 1.0
Content-Type: text/plain;
charset="iso-8859-1"
Content-Transfer-Encoding: quoted-printable
X-MimeOLE: Produced By Microsoft Exchange V6.0.6249.0
Date: Wed, 21 Jan 2004 05:31:41 -0800
Message-ID: <C8CF60CFC4D8A74E9945E32CF096548A6D366E@silver.nane.netapp.com>
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
Thread-Topic: [pnfs-reqs] Re: [pnfs-ops] pNFS Discussion Summary 1: 12/18/0 3: subtopic: proxying
Thread-Index: AcPfuSHl8H5HCtQBTWW1hkwg37TWRgAaJ0kw
To: <pnfs-reqs@yahoogroups.com>, <pnfs-ops@yahoogroups.com>
X-eGroups-Remote-IP: 198.95.226.53
X-eGroups-From: "Noveck, Dave" <Dave.Noveck@netapp.com>
From: "Noveck, Dave" <dnoveck@netapp.com>
Subject: RE: [pnfs-reqs] Re: [pnfs-ops] pNFS Discussion Summary 1: 12/18/0 3: subtopic: proxying
X-Yahoo-Group-Post: member; u=44152831
X-Yahoo-Profile: davidnoveck

> Even "simple" NFSV4 clients support referrals (Moved).

Unfortunately, the "simple" clients that actually exist today
don't :-(, but I suppose we can assume that time will correct
that problem.

> So a metadata server may refer those requests to another server
> that has access to data.

But that's referral of an entire filesystem (everything sharing a
given fsid value) to another nfsv4 server (i.e. a metadata server).
You can't (in v4.0) refer requests for a single file or separately
refer data IO requests and those that involve metadata.

> The trouble I have is having to mandate this on all users or making an
> "optional to use" feature and leave the "ERROR-DATA-ACCESS-NOT
> supported-here" as a legal error (and that is the position I am taking).

It can't be a legal error in v4.0.


-----Original Message-----
From: Julian Satran [mailto:julian_satran@il.ibm.com]
Sent: Tuesday, January 20, 2004 7:54 PM
To: pnfs-ops@yahoogroups.com
Cc: pnfs-ops@yahoogroups.com; 'pnfs-reqs@yahoogroups.com'
Subject: RE: [pnfs-reqs] Re: [pnfs-ops] pNFS Discussion Summary 1:
12/18/0 3: subtopic: proxying


Benny,

Even "simple" NFSV4 clients support referrals (Moved). So a metadata
server may refer those requests to another server that has access to data.
The trouble I have is having to mandate this on all users or making an
"optional to use" feature and leave the "ERROR-DATA-ACCESS-NOT
supported-here" as a legal error (and that is the position I am taking).

Julo





"Halevy, Benny" <bhalevy@panasas.com>
20/01/2004 08:00
Please respond to
pnfs-ops


To
"'pnfs-reqs@yahoogroups.com'" <pnfs-reqs@yahoogroups.com>,
pnfs-ops@yahoogroups.com
cc

Subject
RE: [pnfs-reqs] Re: [pnfs-ops] pNFS Discussion Summary 1: 12/18/0 3:
subtopic: proxying






Dave, I completely agree with your assertions below.
One more reason not to provide support in the NFS protocol
for such servers is to guarantee interoperability with simple
MFSv4.x clients that do not support out-of-band I/O or
some optional extensions, e.g. write sharing (if we spec.
it). Without the ability to read and write via the NFS
server, sharing a file that's being written by one or more
writers needs complete support for write sharing by all
clients as well as the server.

I suggest we mention this issue in the problem statement
document and explain why we want to leave it open for the
server implementation to solve and don't want to solve
it within the NFS protocol.

Benny

>-----Original Message-----
>From: Noveck, Dave [mailto:dnoveck@netapp.com]
>Sent: Tuesday, January 20, 2004 6:19 AM
>To: pnfs-ops@yahoogroups.com; pnfs-reqs@yahoogroups.com
>Subject: RE: [pnfs-reqs] Re: [pnfs-ops] pNFS Discussion Summary 1:
>12/18/03: subtopic: proxying
>
>
>You may be right about the origin of this but I did not suggest that
>*NFS clients* take part when IO was done in this way. When
>considering a situation in which there was no direct connection
>between the meta-data server and the data server, I did note
>that there was a large set of machines that had a connection to
>both, making it possible/easy to provide a connection between
>meta-data server and data server, albeit indirect.
>
>While it is true that that large class of machines can have NFS
>clients running on them (and many will), I don't think it is a
>good idea to place the burden of effecting this communication
>(to help servers without a direct communications path) on the
>clients. This is as opposed to a server using the same hardware
>that a client would use to effect an indirect communication
>path, which seems quite reasonable to me, but does not affect
>the client-server protocol.
>
>In addition to the reasons that Dean cites for finding this
>troublesome, let me add one more. Suppose we have IO from
>a v4.0 client, necessitating access by the meta-data server
>to the data server. If that function were imposed as a
>requirement on v4.x clients, then how do you deal with the
>case in which no v4.x clients are functioning? Previous V4
>minor versions should just work and making them dependent
>on v4.x clients is not going to fly. The server has to
>support v4.0 and can use the same hardware as clients and
>much of the same software, but effecting the necessary
>communication is part of the server's responsibility.
>
>-----Original Message-----
>From: Halevy, Benny [mailto:bhalevy@panasas.com]
>Sent: Monday, January 19, 2004 6:31 PM
>To: 'pnfs-reqs@yahoogroups.com'
>Cc: pNFS Operations
>Subject: RE: [pnfs-reqs] Re: [pnfs-ops] pNFS Discussion Summary 1:
>12/18/03: subtopic: proxying
>
>
>Dave Noveck wrote:
>>At some point the phrase "proxying through the client" was used and I
>>realize I don't know what is meant by it. It doesn't seem to match
>>the "proxying" that was being discussed originally. How would the
>>client be a proxy for (presumably) the server? What am I missing?
>
>I think it was you who suggested (maybe in a rhetorical way) that
>when the metadata server is not capable of accessing the storage
>it manages it should still be able to perform I/O using a client.
>Maybe this created the "proxying through the client" idea...
>
>Benny
>
>>-----Original Message-----
>>From: Noveck, Dave [mailto:dnoveck@netapp.com]
>>Sent: Monday, January 19, 2004 6:23 PM
>>To: pnfs-reqs@yahoogroups.com
>>Cc: pNFS Operations
>>Subject: RE: [pnfs-reqs] Re: [pnfs-ops] pNFS Discussion Summary 1:
>>12/18/03: subtopic: proxying
>>
>>
>>Dean Hildebrand wrote:
>>> I think relying on clients to do anything correctly is against the
>>> inherent nature of NFS. Clients in NFS are transient and cannot be
>>> trusted to do anything correctly. Therefore, the metadata
>>server should
>>> find its own way to write data to the data servers without
>relying on
>>> clients. If proxying through a client is optional, it still seems
>>> orthogonal to the behavior of existing installations and the
>>spirit of
>>> NFS. Maybe there is a valid use case someone could describe?
>>
>>I'm now totally confused. Before we talk about use cases for
>"proxying
>>through a client", I'd like to understand what it is.
>>
>>My understanding is that when this discussion started, a
>>number of people
>>were referring to a client writing data by sending a write to
>the meta-
>>data server (aka the NFS server) as "proxying", because, if
>>your view is
>>that the proper/best/ideal way of doing data transfer operations is to
>>obtain mapping information and then do a write to the data
>server (i.e.
>>other NFS server or object data server or SAN-connected disk), then
>>the direct NFS write can be seen as the meta-data server acting as the
>>client's proxy. Is my understanding correct?
>>
>>No matter how you come down on the quesion of the
>desirability of that,
>>I don't think there any way to argue that doing a write by sending an
>>NFS write request to an NFS server is against the inherent nature of
>>NFS. Nor does it ask the client do anything correctly that it hasn't
>>been doing all along.
>>
>>At some point the phrase "proxying through the client" was used and I
>>realize I don't know what is meant by it. It doesn't seem to match
>>the "proxying" that was being discussed originally. How would the
>>client be a proxy for (presumably) the server? What am I missing?
>>
>>-----Original Message-----
>>From: Dean Hildebrand [mailto:dhildebz@eecs.umich.edu]
>>Sent: Monday, January 19, 2004 5:13 PM
>>To: pNFS Requirements
>>Cc: pNFS Operations
>>Subject: Re: [pnfs-reqs] Re: [pnfs-ops] pNFS Discussion Summary 1:
>>12/18/03: subtopic: proxying
>>
>>
>>> > [1.1.2 Functional proxying]: a file transformation
>>achievable by an
>>> > NFS-v4.x client using a set of data server operations must be a
>>> > equivalently achievable using a (probably different) set
>>of NFS-v4.x
>>> > server operations
>>> >
>>> > This is the topic I intended to address in the last email.
>> I believe
>>> > Dave is arguing that even with metadata servers that do
>>not have access
>>> > to their data servers, the vendor of such a metadata server can
>>> > construct a proprietary protocol for the metadata server
>>to (strict)
>>> > proxy data server accesses through clients that do have
>>data server
>>> > access. I am not comfortable making up a counter to this,
>>so I exhort
>>> > those that want a metadata server without data server
>>access to speak
>>> > up if they disagree.
>>> >
>>> > > On one hand, some suggest that a set of out-of-band
>>clients should not
>>>
>>> > > have to also have a data path through the NFSv4 metadata
>>server. One
>>> > > reason is that customers may not tolerate the large
>>variability in
>>> > > performance between out-of-band (when the going is good)
>>and in-band
>>> > > (when the server chooses not to grant or to take away a
>>delegation)
>>> > > accesses. Another reason, and I paraphrase someone else
>>here, is that
>>>
>>> > > it is possible to construct out-of-band metadata servers
>>that do not
>>> > > have access to the data servers except through the clients -- I
>>> > > encourage the source of this scenario to replace my
>>paraphrasing with
>>> > > a correct use case, because I find it odd to design for
>>file servers
>>> > > that do not have access to the data servers.
>>> > >
>>> > > On the other hand, others have suggested that any access
>>or work that
>>> > > a client can do out-of-band should be possible with one or more
>>> > > commands applied to the metadata server's data path.
>>This has been
>>> > > proposed for coping with recalled delegations, including
>>concurrent
>>> > > writing by multiple clients; retry after client access errors,
>>> > > provided adequate idempotency of out-of-band operations;
>>and many
>>> > > alternative implementations of out-of-band clients,
>>including legacy
>>> > > clients that use out-of-band never or rarely.
>>> > >
>>> > > I think this is a topic that should be argued one way or
>>the other in
>>> > > the requirements document. Use cases and examples in
>>other systems
>>> > > would be best.
>>> >
>>>
>>> I guess that proxying through a client should be recomended but not
>>> mandated.
>>> We might the want to find how to do it while respecting
>restrictions
>>> removed the metadata server from the path.
>>
>>I think relying on clients to do anything correctly is against the
>>inherent nature of NFS. Clients in NFS are transient and cannot be
>>trusted to do anything correctly. Therefore, the metadata
>>server should
>>find its own way to write data to the data servers without relying on
>>clients. If proxying through a client is optional, it still seems
>>orthogonal to the behavior of existing installations and the spirit of
>>NFS. Maybe there is a valid use case someone could describe?
>>
>>Dean
>>
>>
>>
>>
>>
>>Yahoo! Groups Links
>>
>>To visit your group on the web, go to:
>> http://groups.yahoo.com/group/pnfs-reqs/
>>
>>To unsubscribe from this group, send an email to:
>> pnfs-reqs-unsubscribe@yahoogroups.com
>>
>>Your use of Yahoo! Groups is subject to:
>> http://docs.yahoo.com/info/terms/
>>
>>
>>
>>
>>
>>------------------------ Yahoo! Groups Sponsor
>>---------------------~-->
>>Upgrade to 128-bit SSL Security!
>>http://us.click.yahoo.com/qZ0LdD/yjVHAA/TtwFAA/W6uqlB/TM
>>---------------------------------------------------------------
>>------~->
>>
>>Yahoo! Groups Links
>>
>>To visit your group on the web, go to:
>> http://groups.yahoo.com/group/pnfs-reqs/
>>
>>To unsubscribe from this group, send an email to:
>> pnfs-reqs-unsubscribe@yahoogroups.com
>>
>>Your use of Yahoo! Groups is subject to:
>> http://docs.yahoo.com/info/terms/
>>
>>
>
>
>
>Yahoo! Groups Links
>
>To visit your group on the web, go to:
> http://groups.yahoo.com/group/pnfs-ops/
>
>To unsubscribe from this group, send an email to:
> pnfs-ops-unsubscribe@yahoogroups.com
>
>Your use of Yahoo! Groups is subject to:
> http://docs.yahoo.com/info/terms/
>
>
>
>
>
>------------------------ Yahoo! Groups Sponsor
>---------------------~-->
>Upgrade to 128-bit SSL Security!
>http://us.click.yahoo.com/qZ0LdD/yjVHAA/TtwFAA/W6uqlB/TM
>---------------------------------------------------------------
>------~->
>
>Yahoo! Groups Links
>
>To visit your group on the web, go to:
> http://groups.yahoo.com/group/pnfs-reqs/
>
>To unsubscribe from this group, send an email to:
> pnfs-reqs-unsubscribe@yahoogroups.com
>
>Your use of Yahoo! Groups is subject to:
> http://docs.yahoo.com/info/terms/
>
>



Yahoo! Groups Links

To visit your group on the web, go to:
http://groups.yahoo.com/group/pnfs-ops/

To unsubscribe from this group, send an email to:
pnfs-ops-unsubscribe@yahoogroups.com

Your use of Yahoo! Groups is subject to:
http://docs.yahoo.com/info/terms/







Yahoo! Groups Links

To visit your group on the web, go to:
http://groups.yahoo.com/group/pnfs-reqs/

To unsubscribe from this group, send an email to:
pnfs-reqs-unsubscribe@yahoogroups.com

Your use of Yahoo! Groups is subject to:
http://docs.yahoo.com/info/terms/

From julian_satran@il.ibm.com Wed Jan 21 17:58:27 2004
Return-Path: <Julian_Satran@il.ibm.com>
X-Sender: Julian_Satran@il.ibm.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 27962 invoked from network); 22 Jan 2004 01:58:20 -0000
Received: from unknown (66.218.66.218)
by m13.grp.scd.yahoo.com with QMQP; 22 Jan 2004 01:58:20 -0000
Received: from unknown (HELO mtagate7.de.ibm.com) (195.212.29.156)
by mta3.grp.scd.yahoo.com with SMTP; 22 Jan 2004 01:58:18 -0000
Received: from d12relay02.megacenter.de.ibm.com (d12relay02.megacenter.de.ibm.com [9.149.165.196])
by mtagate7.de.ibm.com (8.12.10/8.12.10) with ESMTP id i0M1uGRm080524;
Thu, 22 Jan 2004 01:56:16 GMT
Received: from d10ml001.telaviv.ibm.com (d12av02.megacenter.de.ibm.com [9.149.165.228])
by d12relay02.megacenter.de.ibm.com (8.12.10/NCO/VER6.6) with ESMTP id i0M1uFYk277030;
Thu, 22 Jan 2004 02:56:15 +0100
In-Reply-To: <C8CF60CFC4D8A74E9945E32CF096548A6D366E@silver.nane.netapp.com>
To: pnfs-ops@yahoogroups.com
Cc: pnfs-ops@yahoogroups.com, pnfs-reqs@yahoogroups.com
MIME-Version: 1.0
X-Mailer: Lotus Notes Release 6.5 September 26, 2003
Message-ID: <OF68F7A059.552F39AC-ON88256E22.0068BA98-88256E23.000A9ECB@il.ibm.com>
Date: Wed, 21 Jan 2004 17:56:11 -0800
X-MIMETrack: Serialize by Router on D10ML001/10/M/IBM(Release 6.0.2CF2|July 23, 2003) at
22/01/2004 03:56:15,
Serialize complete at 22/01/2004 03:56:15
Content-Type: text/plain; charset="US-ASCII"
X-eGroups-Remote-IP: 195.212.29.156
From: Julian Satran <julian_satran@il.ibm.com>
Subject: RE: [pnfs-reqs] Re: [pnfs-ops] pNFS Discussion Summary 1: 12/18/0 3: subtopic: proxying
X-Yahoo-Group-Post: member; u=64714603
X-Yahoo-Profile: julian_satran

Dave,

I assume that minor version can incorporate new function and new errors.
I see the point related to your first objection (selective about moved)
but we can work through it patiently.
And it is nothing bad if for some FS data requests force that server to
appear as moved for any client that ever uses a data request from the
original metadata server.
We will have then a metadata server + boxes that serve old NFSv4 clients
only (and/or for specific Fss to which the the metadata server has no data
access.

Julo



"Noveck, Dave" <dnoveck@netapp.com>
21/01/2004 05:31
Please respond to
pnfs-ops


To
<pnfs-reqs@yahoogroups.com>, <pnfs-ops@yahoogroups.com>
cc

Subject
RE: [pnfs-reqs] Re: [pnfs-ops] pNFS Discussion Summary 1: 12/18/0 3:
subtopic: proxying






> Even "simple" NFSV4 clients support referrals (Moved).

Unfortunately, the "simple" clients that actually exist today
don't :-(, but I suppose we can assume that time will correct
that problem.

> So a metadata server may refer those requests to another server
> that has access to data.

But that's referral of an entire filesystem (everything sharing a
given fsid value) to another nfsv4 server (i.e. a metadata server).
You can't (in v4.0) refer requests for a single file or separately
refer data IO requests and those that involve metadata.

> The trouble I have is having to mandate this on all users or making an
> "optional to use" feature and leave the "ERROR-DATA-ACCESS-NOT
> supported-here" as a legal error (and that is the position I am taking).

It can't be a legal error in v4.0.


-----Original Message-----
From: Julian Satran [mailto:julian_satran@il.ibm.com]
Sent: Tuesday, January 20, 2004 7:54 PM
To: pnfs-ops@yahoogroups.com
Cc: pnfs-ops@yahoogroups.com; 'pnfs-reqs@yahoogroups.com'
Subject: RE: [pnfs-reqs] Re: [pnfs-ops] pNFS Discussion Summary 1:
12/18/0 3: subtopic: proxying


Benny,

Even "simple" NFSV4 clients support referrals (Moved). So a metadata
server may refer those requests to another server that has access to data.
The trouble I have is having to mandate this on all users or making an
"optional to use" feature and leave the "ERROR-DATA-ACCESS-NOT
supported-here" as a legal error (and that is the position I am taking).

Julo





"Halevy, Benny" <bhalevy@panasas.com>
20/01/2004 08:00
Please respond to
pnfs-ops


To
"'pnfs-reqs@yahoogroups.com'" <pnfs-reqs@yahoogroups.com>,
pnfs-ops@yahoogroups.com
cc

Subject
RE: [pnfs-reqs] Re: [pnfs-ops] pNFS Discussion Summary 1: 12/18/0 3:

subtopic: proxying






Dave, I completely agree with your assertions below.
One more reason not to provide support in the NFS protocol
for such servers is to guarantee interoperability with simple
MFSv4.x clients that do not support out-of-band I/O or
some optional extensions, e.g. write sharing (if we spec.
it). Without the ability to read and write via the NFS
server, sharing a file that's being written by one or more
writers needs complete support for write sharing by all
clients as well as the server.

I suggest we mention this issue in the problem statement
document and explain why we want to leave it open for the
server implementation to solve and don't want to solve
it within the NFS protocol.

Benny

>-----Original Message-----
>From: Noveck, Dave [mailto:dnoveck@netapp.com]
>Sent: Tuesday, January 20, 2004 6:19 AM
>To: pnfs-ops@yahoogroups.com; pnfs-reqs@yahoogroups.com
>Subject: RE: [pnfs-reqs] Re: [pnfs-ops] pNFS Discussion Summary 1:
>12/18/03: subtopic: proxying
>
>
>You may be right about the origin of this but I did not suggest that
>*NFS clients* take part when IO was done in this way. When
>considering a situation in which there was no direct connection
>between the meta-data server and the data server, I did note
>that there was a large set of machines that had a connection to
>both, making it possible/easy to provide a connection between
>meta-data server and data server, albeit indirect.
>
>While it is true that that large class of machines can have NFS
>clients running on them (and many will), I don't think it is a
>good idea to place the burden of effecting this communication
>(to help servers without a direct communications path) on the
>clients. This is as opposed to a server using the same hardware
>that a client would use to effect an indirect communication
>path, which seems quite reasonable to me, but does not affect
>the client-server protocol.
>
>In addition to the reasons that Dean cites for finding this
>troublesome, let me add one more. Suppose we have IO from
>a v4.0 client, necessitating access by the meta-data server
>to the data server. If that function were imposed as a
>requirement on v4.x clients, then how do you deal with the
>case in which no v4.x clients are functioning? Previous V4
>minor versions should just work and making them dependent
>on v4.x clients is not going to fly. The server has to
>support v4.0 and can use the same hardware as clients and
>much of the same software, but effecting the necessary
>communication is part of the server's responsibility.
>
>-----Original Message-----
>From: Halevy, Benny [mailto:bhalevy@panasas.com]
>Sent: Monday, January 19, 2004 6:31 PM
>To: 'pnfs-reqs@yahoogroups.com'
>Cc: pNFS Operations
>Subject: RE: [pnfs-reqs] Re: [pnfs-ops] pNFS Discussion Summary 1:
>12/18/03: subtopic: proxying
>
>
>Dave Noveck wrote:
>>At some point the phrase "proxying through the client" was used and I
>>realize I don't know what is meant by it. It doesn't seem to match
>>the "proxying" that was being discussed originally. How would the
>>client be a proxy for (presumably) the server? What am I missing?
>
>I think it was you who suggested (maybe in a rhetorical way) that
>when the metadata server is not capable of accessing the storage
>it manages it should still be able to perform I/O using a client.
>Maybe this created the "proxying through the client" idea...
>
>Benny
>
>>-----Original Message-----
>>From: Noveck, Dave [mailto:dnoveck@netapp.com]
>>Sent: Monday, January 19, 2004 6:23 PM
>>To: pnfs-reqs@yahoogroups.com
>>Cc: pNFS Operations
>>Subject: RE: [pnfs-reqs] Re: [pnfs-ops] pNFS Discussion Summary 1:
>>12/18/03: subtopic: proxying
>>
>>
>>Dean Hildebrand wrote:
>>> I think relying on clients to do anything correctly is against the
>>> inherent nature of NFS. Clients in NFS are transient and cannot be
>>> trusted to do anything correctly. Therefore, the metadata
>>server should
>>> find its own way to write data to the data servers without
>relying on
>>> clients. If proxying through a client is optional, it still seems
>>> orthogonal to the behavior of existing installations and the
>>spirit of
>>> NFS. Maybe there is a valid use case someone could describe?
>>
>>I'm now totally confused. Before we talk about use cases for
>"proxying
>>through a client", I'd like to understand what it is.
>>
>>My understanding is that when this discussion started, a
>>number of people
>>were referring to a client writing data by sending a write to
>the meta-
>>data server (aka the NFS server) as "proxying", because, if
>>your view is
>>that the proper/best/ideal way of doing data transfer operations is to
>>obtain mapping information and then do a write to the data
>server (i.e.
>>other NFS server or object data server or SAN-connected disk), then
>>the direct NFS write can be seen as the meta-data server acting as the
>>client's proxy. Is my understanding correct?
>>
>>No matter how you come down on the quesion of the
>desirability of that,
>>I don't think there any way to argue that doing a write by sending an
>>NFS write request to an NFS server is against the inherent nature of
>>NFS. Nor does it ask the client do anything correctly that it hasn't
>>been doing all along.
>>
>>At some point the phrase "proxying through the client" was used and I
>>realize I don't know what is meant by it. It doesn't seem to match
>>the "proxying" that was being discussed originally. How would the
>>client be a proxy for (presumably) the server? What am I missing?
>>
>>-----Original Message-----
>>From: Dean Hildebrand [mailto:dhildebz@eecs.umich.edu]
>>Sent: Monday, January 19, 2004 5:13 PM
>>To: pNFS Requirements
>>Cc: pNFS Operations
>>Subject: Re: [pnfs-reqs] Re: [pnfs-ops] pNFS Discussion Summary 1:
>>12/18/03: subtopic: proxying
>>
>>
>>> > [1.1.2 Functional proxying]: a file transformation
>>achievable by an
>>> > NFS-v4.x client using a set of data server operations must be a
>>> > equivalently achievable using a (probably different) set
>>of NFS-v4.x
>>> > server operations
>>> >
>>> > This is the topic I intended to address in the last email.
>> I believe
>>> > Dave is arguing that even with metadata servers that do
>>not have access
>>> > to their data servers, the vendor of such a metadata server can
>>> > construct a proprietary protocol for the metadata server
>>to (strict)
>>> > proxy data server accesses through clients that do have
>>data server
>>> > access. I am not comfortable making up a counter to this,
>>so I exhort
>>> > those that want a metadata server without data server
>>access to speak
>>> > up if they disagree.
>>> >
>>> > > On one hand, some suggest that a set of out-of-band
>>clients should not
>>>
>>> > > have to also have a data path through the NFSv4 metadata
>>server. One
>>> > > reason is that customers may not tolerate the large
>>variability in
>>> > > performance between out-of-band (when the going is good)
>>and in-band
>>> > > (when the server chooses not to grant or to take away a
>>delegation)
>>> > > accesses. Another reason, and I paraphrase someone else
>>here, is that
>>>
>>> > > it is possible to construct out-of-band metadata servers
>>that do not
>>> > > have access to the data servers except through the clients -- I
>>> > > encourage the source of this scenario to replace my
>>paraphrasing with
>>> > > a correct use case, because I find it odd to design for
>>file servers
>>> > > that do not have access to the data servers.
>>> > >
>>> > > On the other hand, others have suggested that any access
>>or work that
>>> > > a client can do out-of-band should be possible with one or more
>>> > > commands applied to the metadata server's data path.
>>This has been
>>> > > proposed for coping with recalled delegations, including
>>concurrent
>>> > > writing by multiple clients; retry after client access errors,
>>> > > provided adequate idempotency of out-of-band operations;
>>and many
>>> > > alternative implementations of out-of-band clients,
>>including legacy
>>> > > clients that use out-of-band never or rarely.
>>> > >
>>> > > I think this is a topic that should be argued one way or
>>the other in
>>> > > the requirements document. Use cases and examples in
>>other systems
>>> > > would be best.
>>> >
>>>
>>> I guess that proxying through a client should be recomended but not
>>> mandated.
>>> We might the want to find how to do it while respecting
>restrictions
>>> removed the metadata server from the path.
>>
>>I think relying on clients to do anything correctly is against the
>>inherent nature of NFS. Clients in NFS are transient and cannot be
>>trusted to do anything correctly. Therefore, the metadata
>>server should
>>find its own way to write data to the data servers without relying on
>>clients. If proxying through a client is optional, it still seems
>>orthogonal to the behavior of existing installations and the spirit of
>>NFS. Maybe there is a valid use case someone could describe?
>>
>>Dean
>>
>>
>>
>>
>>
>>Yahoo! Groups Links
>>
>>To visit your group on the web, go to:
>> http://groups.yahoo.com/group/pnfs-reqs/
>>
>>To unsubscribe from this group, send an email to:
>> pnfs-reqs-unsubscribe@yahoogroups.com
>>
>>Your use of Yahoo! Groups is subject to:
>> http://docs.yahoo.com/info/terms/
>>
>>
>>
>>
>>
>>------------------------ Yahoo! Groups Sponsor
>>---------------------~-->
>>Upgrade to 128-bit SSL Security!
>>http://us.click.yahoo.com/qZ0LdD/yjVHAA/TtwFAA/W6uqlB/TM
>>---------------------------------------------------------------
>>------~->
>>
>>Yahoo! Groups Links
>>
>>To visit your group on the web, go to:
>> http://groups.yahoo.com/group/pnfs-reqs/
>>
>>To unsubscribe from this group, send an email to:
>> pnfs-reqs-unsubscribe@yahoogroups.com
>>
>>Your use of Yahoo! Groups is subject to:
>> http://docs.yahoo.com/info/terms/
>>
>>
>
>
>
>Yahoo! Groups Links
>
>To visit your group on the web, go to:
> http://groups.yahoo.com/group/pnfs-ops/
>
>To unsubscribe from this group, send an email to:
> pnfs-ops-unsubscribe@yahoogroups.com
>
>Your use of Yahoo! Groups is subject to:
> http://docs.yahoo.com/info/terms/
>
>
>
>
>
>------------------------ Yahoo! Groups Sponsor
>---------------------~-->
>Upgrade to 128-bit SSL Security!
>http://us.click.yahoo.com/qZ0LdD/yjVHAA/TtwFAA/W6uqlB/TM
>---------------------------------------------------------------
>------~->
>
>Yahoo! Groups Links
>
>To visit your group on the web, go to:
> http://groups.yahoo.com/group/pnfs-reqs/
>
>To unsubscribe from this group, send an email to:
> pnfs-reqs-unsubscribe@yahoogroups.com
>
>Your use of Yahoo! Groups is subject to:
> http://docs.yahoo.com/info/terms/
>
>



Yahoo! Groups Links

To visit your group on the web, go to:
http://groups.yahoo.com/group/pnfs-ops/

To unsubscribe from this group, send an email to:
pnfs-ops-unsubscribe@yahoogroups.com

Your use of Yahoo! Groups is subject to:
http://docs.yahoo.com/info/terms/







Yahoo! Groups Links

To visit your group on the web, go to:
http://groups.yahoo.com/group/pnfs-reqs/

To unsubscribe from this group, send an email to:
pnfs-reqs-unsubscribe@yahoogroups.com

Your use of Yahoo! Groups is subject to:
http://docs.yahoo.com/info/terms/





Yahoo! Groups Links

To visit your group on the web, go to:
http://groups.yahoo.com/group/pnfs-ops/

To unsubscribe from this group, send an email to:
pnfs-ops-unsubscribe@yahoogroups.com

Your use of Yahoo! Groups is subject to:
http://docs.yahoo.com/info/terms/ 

From pnfs-reqs@yahoogroups.com Thu Jan 22 04:29:30 2004
Return-Path: <notify@yahoogroups.com>
Received: (qmail 30598 invoked from network); 22 Jan 2004 12:29:29 -0000
Received: from unknown (66.218.66.216)
by m15.grp.scd.yahoo.com with QMQP; 22 Jan 2004 12:29:29 -0000
Received: from unknown (HELO n17.grp.scd.yahoo.com) (66.218.66.72)
by mta1.grp.scd.yahoo.com with SMTP; 22 Jan 2004 12:29:29 -0000
X-eGroups-Return: notify@yahoogroups.com
Received: from [66.218.67.139] by n17.grp.scd.yahoo.com with NNFMP; 22 Jan 2004 12:29:26 -0000
Date: 22 Jan 2004 12:29:25 -0000
Message-ID: <1074774565.648.17240.w2@yahoogroups.com>
X-eGroups-Application: files
X-Yahoo-Group-Post: system
From: pnfs-reqs@yahoogroups.com
To: pnfs-reqs@yahoogroups.com
Subject: New file uploaded to pnfs-reqs
MIME-Version: 1.0
Content-Type: text/plain
Content-Transfer-Encoding: 7bit
X-eGroups-Remote-IP: 66.218.66.72

Hello,

This email message is a notification to let you know that
a file has been uploaded to the Files area of the pnfs-reqs
group.

File : /draft-ietf-pNFS-problem-statement-v2.doc
Uploaded by : garth_a_gibson <garth@panasas.com>
Description : v0.2 of pNFS problem statement

You can access this file at the URL

http://groups.yahoo.com/group/pnfs-reqs/files/draft-ietf-pNFS-problem-statement-v2.doc

To learn more about file sharing for your group, please visit

http://help.yahoo.com/help/us/groups/files

Regards,

garth_a_gibson <garth@panasas.com>

From garth@panasas.com Thu Jan 22 04:38:24 2004
Return-Path: <garth@panasas.com>
X-Sender: garth@panasas.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 19877 invoked from network); 22 Jan 2004 12:38:23 -0000
Received: from unknown (66.218.66.218)
by m17.grp.scd.yahoo.com with QMQP; 22 Jan 2004 12:38:23 -0000
Received: from unknown (HELO PIKES.panasas.com) (65.194.124.178)
by mta3.grp.scd.yahoo.com with SMTP; 22 Jan 2004 12:38:23 -0000
Received: from [127.0.0.1] ([172.17.19.3]) by PIKES.panasas.com with SMTP (Microsoft Exchange Internet Mail Service Version 5.5.2653.13)
id SVSYLFGA; Thu, 22 Jan 2004 07:38:16 -0500
Mime-Version: 1.0 (Apple Message framework v609)
Content-Transfer-Encoding: 7bit
Message-Id: <D8149F76-4CD7-11D8-B71A-000A95A94F04@panasas.com>
Content-Type: text/plain; charset=US-ASCII; format=flowed
To: pnfs-reqs@yahoogroups.com
Date: Thu, 22 Jan 2004 04:38:13 -0800
X-Mailer: Apple Mail (2.609)
X-eGroups-Remote-IP: 65.194.124.178
From: Garth Gibson <garth@panasas.com>
Subject: uploaded a draft of a pNFS problem statement
X-Yahoo-Group-Post: member; u=169457820
X-Yahoo-Profile: garth_a_gibson

Based on the feedback in the last two concalls, I have taken a shot at
a pNFS problem statement. The notion of a bottleneck, driven by the
dramatic increase in bandwidth demand coming from clusters, and the
desire to continue to allow filesystems and namespaces to be big and
not specialized to data distribution, are central. I didn't do any
work in the application space sections -- and I have not put in any
citations yet -- sorry. Tom was right -- even with this thin version
it comes out at 8 pages.

I am happy to take comments and produce revisions, or to turn over the
document to anyone who wants to make a pass through it.

Talk to you in a couple of hours at the 11-12 EST concall.

garth

From Thomas.Talpey@netapp.com Thu Jan 22 05:09:26 2004
Return-Path: <Thomas.Talpey@netapp.com>
X-Sender: Thomas.Talpey@netapp.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 70873 invoked from network); 22 Jan 2004 13:09:25 -0000
Received: from unknown (66.218.66.172)
by m10.grp.scd.yahoo.com with QMQP; 22 Jan 2004 13:09:25 -0000
Received: from unknown (HELO mx01.netapp.com) (198.95.226.53)
by mta4.grp.scd.yahoo.com with SMTP; 22 Jan 2004 13:09:25 -0000
Received: from hawk.corp.netapp.com (hawk [10.57.156.122])
by mx01.netapp.com (8.12.10/8.12.10/NTAP-1.4) with ESMTP id i0MD9PKw022756;
Thu, 22 Jan 2004 05:09:25 -0800 (PST)
Received: from svlexc01.hq.netapp.com (svlexc01.corp.netapp.com [10.10.22.171])
by hawk.corp.netapp.com (8.12.9/8.12.9/NTAP-1.5) with ESMTP id i0MD9PSR024389;
Thu, 22 Jan 2004 05:09:25 -0800 (PST)
Received: from tmt.netapp.com ([10.97.1.30]) by silver.hq.netapp.com with Microsoft SMTPSVC(5.0.2195.5329); Thu, 22 Jan 2004 08:09:19 -0500
MIME-Version: 1.0
Content-Type: multipart/alternative;
boundary="----_=_NextPart_001_01C3E0E8.F1C8A980"
X-MimeOLE: Produced By Microsoft Exchange V6.0.6249.0
content-class: urn:content-classes:message
Date: Thu, 22 Jan 2004 05:09:07 -0800
Message-ID: <5.2.1.1.2.20040122080409.00c246d0@silver.nane.netapp.com>
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
Thread-Topic: [pnfs-reqs] Re: [pnfs-ops] pNFS Discussion Summary 1: 12/18/0 3: subtopic: proxying
Thread-Index: AcPg6PIwOdtHwUfvTAO5uqZggOHrXg==
To: <pnfs-reqs@yahoogroups.com>
Cc: <pnfs-ops@yahoogroups.com>, <pnfs-reqs@yahoogroups.com>
X-eGroups-Remote-IP: 198.95.226.53
From: "Talpey, Thomas" <Thomas.Talpey@netapp.com>
Subject: RE: [pnfs-reqs] Re: [pnfs-ops] pNFS Discussion Summary 1: 12/18/0 3: subtopic: proxying
X-Yahoo-Group-Post: member; u=44154239
X-Yahoo-Profile: tmtymailu

At 07:54 PM 1/20/2004, Julian Satran wrote:
>Benny,
>
>Even "simple" NFSV4 clients support referrals (Moved). So a metadata
>server may refer those requests to another server that has access to data.

Dave Noveck already pointed out that in fact most clients don't yet support
this, and that the referral is for a filesystem not a file.

But even apart form that, why wouldn't the referral happen the other way?
That is, why wouldn't a client connect to an NFSv4 server, the two would
determine that data bypass is desired and supported, and the server
would redirect the client to the metadata server?

This is a much cleaner and simpler upward migration story, and doesn't
break NFSv4. In any case, I still argue that this kind of interaction should
not be explored in the requirements document, except to point out that
it's a requirement to support "stock" NFSv4 without client modifications.

Tom.

>The trouble I have is having to mandate this on all users or making an
>"optional to use" feature and leave the "ERROR-DATA-ACCESS-NOT
>supported-here" as a legal error (and that is the position I am taking).
>
>Julo
>
>
>
>
>
>"Halevy, Benny" <bhalevy@panasas.com>
>20/01/2004 08:00
>Please respond to
>pnfs-ops
>
>
>To
>"'pnfs-reqs@yahoogroups.com'" <pnfs-reqs@yahoogroups.com>,
>pnfs-ops@yahoogroups.com
>cc
>
>Subject
>RE: [pnfs-reqs] Re: [pnfs-ops] pNFS Discussion Summary 1: 12/18/0       3:
>subtopic: proxying
>
>
>
>
>
>
>Dave, I completely agree with your assertions below.
>One more reason not to provide support in the NFS protocol
>for such servers is to guarantee interoperability with simple
>MFSv4.x clients that do not support out-of-band I/O or
>some optional extensions, e.g. write sharing (if we spec.
>it).  Without the ability to read and write via the NFS
>server, sharing a file that's being written by one or more
>writers needs complete support for write sharing by all
>clients as well as the server.
>
>I suggest we mention this issue in the problem statement
>document and explain why we want to leave it open for the
>server implementation to solve and don't want to solve
>it within the NFS protocol.
>
>Benny
>
>>-----Original Message-----
>>From: Noveck, Dave [mailto:dnoveck@netapp.com]
>>Sent: Tuesday, January 20, 2004 6:19 AM
>>To: pnfs-ops@yahoogroups.com; pnfs-reqs@yahoogroups.com
>>Subject: RE: [pnfs-reqs] Re: [pnfs-ops] pNFS Discussion Summary 1:
>>12/18/03: subtopic: proxying
>>
>>
>>You may be right about the origin of this but I did not suggest that
>>*NFS clients* take part when IO was done in this way.  When
>>considering a situation in which there was no direct connection
>>between the meta-data server and the data server, I did note
>>that there was a large set of machines that had a connection to
>>both, making it possible/easy to provide a connection between
>>meta-data server and data server, albeit indirect.
>>
>>While it is true that that large class of machines can have NFS
>>clients running on them (and many will), I don't think it is a
>>good idea to place the burden of effecting this communication
>>(to help servers without a direct communications path) on the
>>clients.  This is as opposed to a server using the same hardware
>>that a client would use to effect an indirect communication
>>path, which seems quite reasonable to me, but does not affect
>>the client-server protocol.
>>
>>In addition to the reasons that Dean cites for finding this
>>troublesome, let me add one more.  Suppose we have IO from
>>a v4.0 client, necessitating access by the meta-data server
>>to the data server.  If that function were imposed as a
>>requirement on v4.x clients, then how do you deal with the
>>case in which no v4.x clients are functioning?  Previous V4
>>minor versions should just work and making them dependent
>>on v4.x clients is not going to fly.  The server has to
>>support v4.0 and can use the same hardware as clients and
>>much of the same software, but effecting the necessary
>>communication is part of the server's responsibility.
>>
>>-----Original Message-----
>>From: Halevy, Benny [mailto:bhalevy@panasas.com]
>>Sent: Monday, January 19, 2004 6:31 PM
>>To: 'pnfs-reqs@yahoogroups.com'
>>Cc: pNFS Operations
>>Subject: RE: [pnfs-reqs] Re: [pnfs-ops] pNFS Discussion Summary 1:
>>12/18/03: subtopic: proxying
>>
>>
>>Dave Noveck wrote:
>>>At some point the phrase "proxying through the client" was used and I
>>>realize I don't know what is meant by it.  It doesn't seem to match
>>>the "proxying" that was being discussed originally.  How would the
>>>client be a proxy for (presumably) the server?  What am I missing?
>>
>>I think it was you who suggested (maybe in a rhetorical way) that
>>when the metadata server is not capable of accessing the storage
>>it manages it should still be able to perform I/O using a client.
>>Maybe this created the "proxying through the client" idea...
>>
>>Benny
>>
>>>-----Original Message-----
>>>From: Noveck, Dave [mailto:dnoveck@netapp.com]
>>>Sent: Monday, January 19, 2004 6:23 PM
>>>To: pnfs-reqs@yahoogroups.com
>>>Cc: pNFS Operations
>>>Subject: RE: [pnfs-reqs] Re: [pnfs-ops] pNFS Discussion Summary 1:
>>>12/18/03: subtopic: proxying
>>>
>>>
>>>Dean Hildebrand wrote:
>>>> I think relying on clients to do anything correctly is against the
>>>> inherent nature of NFS.  Clients in NFS are transient and cannot be
>>>> trusted to do anything correctly.  Therefore, the metadata
>>>server should
>>>> find its own way to write data to the data servers without
>>relying on
>>>> clients.  If proxying through a client is optional, it still seems
>>>> orthogonal to the behavior of existing installations and the
>>>spirit of
>>>> NFS.  Maybe there is a valid use case someone could describe?
>>>
>>>I'm now totally confused.  Before we talk about use cases for
>>"proxying
>>>through a client", I'd like to understand what it is.
>>>
>>>My understanding is that when this discussion started, a
>>>number of people
>>>were referring to a client writing data by sending a write to
>>the meta-
>>>data server (aka the NFS server) as "proxying", because, if
>>>your view is
>>>that the proper/best/ideal way of doing data transfer operations is to
>>>obtain mapping information and then do a write to the data
>>server (i.e.
>>>other NFS server or object data server or SAN-connected disk), then
>>>the direct NFS write can be seen as the meta-data server acting as the
>>>client's proxy.  Is my understanding correct?
>>>
>>>No matter how you come down on the quesion of the
>>desirability of that,
>>>I don't think there any way to argue that doing a write by sending an
>>>NFS write request to an NFS server is against the inherent nature of
>>>NFS.  Nor does it ask the client do anything correctly that it hasn't
>>>been doing all along.
>>>
>>>At some point the phrase "proxying through the client" was used and I
>>>realize I don't know what is meant by it.  It doesn't seem to match
>>>the "proxying" that was being discussed originally.  How would the
>>>client be a proxy for (presumably) the server?  What am I missing?
>>>
>>>-----Original Message-----
>>>From: Dean Hildebrand [mailto:dhildebz@eecs.umich.edu]
>>>Sent: Monday, January 19, 2004 5:13 PM
>>>To: pNFS Requirements
>>>Cc: pNFS Operations
>>>Subject: Re: [pnfs-reqs] Re: [pnfs-ops] pNFS Discussion Summary 1:
>>>12/18/03: subtopic: proxying
>>>
>>>
>>>> > [1.1.2 Functional proxying]: a file transformation
>>>achievable by an
>>>> > NFS-v4.x client using a set of data server operations must be a
>>>> > equivalently achievable using a (probably different) set
>>>of NFS-v4.x
>>>> > server operations
>>>> >
>>>> > This is the topic I intended to address in the last email.
>>> I believe
>>>> > Dave is arguing that even with metadata servers that do
>>>not have access
>>>> > to their data servers, the vendor of such a metadata server can
>>>> > construct a proprietary protocol for the metadata server
>>>to (strict)
>>>> > proxy data server accesses through clients that do have
>>>data server
>>>> > access.  I am not comfortable making up a counter to this,
>>>so I exhort
>>>> > those that want a metadata server without data server
>>>access to speak
>>>> > up if they disagree.
>>>> >
>>>> > > On one hand, some suggest that a set of out-of-band
>>>clients should not
>>>>
>>>> > > have to also have a data path through the NFSv4 metadata
>>>server.  One
>>>> > > reason is that customers may not tolerate the large
>>>variability in
>>>> > > performance between out-of-band (when the going is good)
>>>and in-band
>>>> > > (when the server chooses not to grant or to take away a
>>>delegation)
>>>> > > accesses.  Another reason, and I paraphrase someone else
>>>here, is that
>>>>
>>>> > > it is possible to construct out-of-band metadata servers
>>>that do not
>>>> > > have access to the data servers except through the clients -- I
>>>> > > encourage the source of this scenario to replace my
>>>paraphrasing with
>>>> > > a correct use case, because I find it odd to design for
>>>file servers
>>>> > > that do not have access to the data servers.
>>>> > >
>>>> > > On the other hand, others have suggested that any access
>>>or work that
>>>> > > a client can do out-of-band should be possible with one or more
>>>> > > commands applied to the metadata server's data path.
>>>This has been
>>>> > > proposed for coping with recalled delegations, including
>>>concurrent
>>>> > > writing by multiple clients; retry after client access errors,
>>>> > > provided adequate idempotency of out-of-band operations;
>>>and many
>>>> > > alternative implementations of out-of-band clients,
>>>including legacy
>>>> > > clients that use out-of-band never or rarely.
>>>> > >
>>>> > > I think this is a topic that should be argued one way or
>>>the other in
>>>> > > the requirements document.  Use cases and examples in
>>>other systems
>>>> > > would be best.
>>>> >
>>>>
>>>> I guess that proxying through a client should be recomended but not
>>>> mandated.
>>>> We might the want to find how to do it while respecting
>>restrictions
>>>> removed the metadata server from the path.
>>>
>>>I think relying on clients to do anything correctly is against the
>>>inherent nature of NFS.  Clients in NFS are transient and cannot be
>>>trusted to do anything correctly.  Therefore, the metadata
>>>server should
>>>find its own way to write data to the data servers without relying on
>>>clients.  If proxying through a client is optional, it still seems
>>>orthogonal to the behavior of existing installations and the spirit of
>>>NFS.  Maybe there is a valid use case someone could describe?
>>>
>>>Dean
>>>
>>>
>>>
>>>
>>>
>>>Yahoo! Groups Links
>>>
>>>To visit your group on the web, go to:
>>> http://groups.yahoo.com/group/pnfs-reqs/
>>>
>>>To unsubscribe from this group, send an email to:
>>> pnfs-reqs-unsubscribe@yahoogroups.com
>>>
>>>Your use of Yahoo! Groups is subject to:
>>> http://docs.yahoo.com/info/terms/
>>>
>>>
>>>
>>>
>>>
>>>------------------------ Yahoo! Groups Sponsor
>>>---------------------~-->
>>>Upgrade to 128-bit SSL Security!
>>>http://us.click.yahoo.com/qZ0LdD/yjVHAA/TtwFAA/W6uqlB/TM
>>>---------------------------------------------------------------
>>>------~->
>>>
>>>Yahoo! Groups Links
>>>
>>>To visit your group on the web, go to:
>>> http://groups.yahoo.com/group/pnfs-reqs/
>>>
>>>To unsubscribe from this group, send an email to:
>>> pnfs-reqs-unsubscribe@yahoogroups.com
>>>
>>>Your use of Yahoo! Groups is subject to:
>>> http://docs.yahoo.com/info/terms/
>>>
>>>
>>
>>
>>
>>Yahoo! Groups Links
>>
>>To visit your group on the web, go to:
>> http://groups.yahoo.com/group/pnfs-ops/
>>
>>To unsubscribe from this group, send an email to:
>> pnfs-ops-unsubscribe@yahoogroups.com
>>
>>Your use of Yahoo! Groups is subject to:
>> http://docs.yahoo.com/info/terms/
>>
>>
>>
>>
>>
>>------------------------ Yahoo! Groups Sponsor
>>---------------------~-->
>>Upgrade to 128-bit SSL Security!
>>http://us.click.yahoo.com/qZ0LdD/yjVHAA/TtwFAA/W6uqlB/TM
>>---------------------------------------------------------------
>>------~->
>>
>>Yahoo! Groups Links
>>
>>To visit your group on the web, go to:
>> http://groups.yahoo.com/group/pnfs-reqs/
>>
>>To unsubscribe from this group, send an email to:
>> pnfs-reqs-unsubscribe@yahoogroups.com
>>
>>Your use of Yahoo! Groups is subject to:
>> http://docs.yahoo.com/info/terms/
>>
>>
>
>
>
>Yahoo! Groups Links
>
>To visit your group on the web, go to:
> http://groups.yahoo.com/group/pnfs-ops/
>
>To unsubscribe from this group, send an email to:
> pnfs-ops-unsubscribe@yahoogroups.com
>
>Your use of Yahoo! Groups is subject to:
> http://docs.yahoo.com/info/terms/
>
>
>
>
>
>
>
>Yahoo! Groups Links
>
>To visit your group on the web, go to:
> http://groups.yahoo.com/group/pnfs-reqs/
>
>To unsubscribe from this group, send an email to:
> pnfs-reqs-unsubscribe@yahoogroups.com
>
>Your use of Yahoo! Groups is subject to:
> http://docs.yahoo.com/info/terms/ 

From ggrider@lanl.gov Thu Jan 22 07:41:17 2004
Return-Path: <ggrider@lanl.gov>
X-Sender: ggrider@lanl.gov
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 43525 invoked from network); 22 Jan 2004 15:41:10 -0000
Received: from unknown (66.218.66.172)
by m9.grp.scd.yahoo.com with QMQP; 22 Jan 2004 15:41:10 -0000
Received: from unknown (HELO mailwasher-b.lanl.gov) (192.16.0.25)
by mta4.grp.scd.yahoo.com with SMTP; 22 Jan 2004 15:41:12 -0000
Received: from mailrelay1.lanl.gov (localhost.localdomain [127.0.0.1])
by mailwasher-b.lanl.gov (8.12.10/8.12.10/(ccn-5)) with ESMTP id i0MFfBdj019460
for <pnfs-reqs@yahoogroups.com>; Thu, 22 Jan 2004 08:41:12 -0700
Received: from cic-mail.lanl.gov (localhost.localdomain [127.0.0.1])
by mailrelay1.lanl.gov (8.12.10/8.12.10/(ccn-5)) with ESMTP id i0MFfB82003006
for <pnfs-reqs@yahoogroups.com>; Thu, 22 Jan 2004 08:41:11 -0700
Received: from cthulu.lanl.gov (cthulu.lanl.gov [128.165.115.129])
by cic-mail.lanl.gov (8.12.10/8.12.10/(ccn-5)) with ESMTP id i0MFfAYi001296;
Thu, 22 Jan 2004 08:41:10 -0700
Message-Id: <5.2.0.9.2.20040122084042.015476c8@cic-mail.lanl.gov>
X-Sender: ggrider@cic-mail.lanl.gov
X-Mailer: QUALCOMM Windows Eudora Version 5.2.0.9
Date: Thu, 22 Jan 2004 08:41:36 -0700
To: pnfs-reqs@yahoogroups.com, pnfs-reqs@yahoogroups.com
In-Reply-To: <D8149F76-4CD7-11D8-B71A-000A95A94F04@panasas.com>
Mime-Version: 1.0
Content-Type: multipart/related;
type="multipart/alternative";
boundary="=====================_349192==.REL"
X-Scanned-By: MIMEDefang 2.35
X-eGroups-Remote-IP: 192.16.0.25
From: Gary Grider <ggrider@lanl.gov>
Subject: Re: [pnfs-reqs] uploaded a draft of a pNFS problem statement
X-Yahoo-Group-Post: member; u=169341185
X-Yahoo-Profile: ggriderpnfs

I get
This page is currently unavailable
Unfortunately, we are unable to process your request at this time. We apologize for the inconvenience. Please try again later.
when I tried
http://groups.yahoo.com/group/pnfs-reqs/files/draft-ietf-pNFS-problem-statement-v2.doc

Thanks
Gary

At 04:38 AM 1/22/2004 -0800, Garth Gibson wrote:

> Based on the feedback in the last two concalls, I have taken a shot at
> a pNFS problem statement.  The notion of a bottleneck, driven by the
> dramatic increase in bandwidth demand coming from clusters, and the
> desire to continue to allow filesystems and namespaces to be big and
> not specialized to data distribution, are central.  I didn't do any
> work in the application space sections -- and I have not put in any
> citations yet -- sorry.  Tom was right -- even with this thin version
> it comes out at 8 pages.
>
> I am happy to take comments and produce revisions, or to turn over the
> document to anyone who wants to make a pass through it.
>
> Talk to you in a couple of hours at the 11-12 EST concall.
>
> garth
>
>
> Yahoo! Groups Sponsor
> ADVERTISEMENT
> 54ef2.jpg
> 54f7e.jpg
>
> Yahoo! Groups Links
>
>     * To visit your group on the web, go to:
>     * http://groups.yahoo.com/group/pnfs-reqs/
>     *  
>     * To unsubscribe from this group, send an email to:
>     * pnfs-reqs-unsubscribe@yahoogroups.com
>     *  
>     * Your use of Yahoo! Groups is subject to the Yahoo! Terms of Service. 

From garth@panasas.com Thu Jan 22 07:51:59 2004
Return-Path: <garth@panasas.com>
X-Sender: garth@panasas.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 37100 invoked from network); 22 Jan 2004 15:51:56 -0000
Received: from unknown (66.218.66.217)
by m1.grp.scd.yahoo.com with QMQP; 22 Jan 2004 15:51:56 -0000
Received: from unknown (HELO PIKES.panasas.com) (65.194.124.178)
by mta2.grp.scd.yahoo.com with SMTP; 22 Jan 2004 15:51:55 -0000
Received: from [127.0.0.1] ([172.17.19.3]) by PIKES.panasas.com with SMTP (Microsoft Exchange Internet Mail Service Version 5.5.2653.13)
id SVSYLF8S; Thu, 22 Jan 2004 10:51:51 -0500
Mime-Version: 1.0 (Apple Message framework v609)
In-Reply-To: <1074774565.645.17240.w2@yahoogroups.com>
References: <1074774565.645.17240.w2@yahoogroups.com>
Content-Type: multipart/mixed; boundary=Apple-Mail-9--157255413
Message-Id: <E350C9F6-4CF2-11D8-B71A-000A95A94F04@panasas.com>
Date: Thu, 22 Jan 2004 07:51:48 -0800
To: pnfs-reqs@yahoogroups.com
X-Mailer: Apple Mail (2.609)
X-eGroups-Remote-IP: 65.194.124.178
From: Garth Gibson <garth@panasas.com>
Subject: Re: New file uploaded to pnfs-reqs
X-Yahoo-Group-Post: member; u=169457820
X-Yahoo-Profile: garth_a_gibson

ADVERTISEMENT
Clicking on the URL in this message worked for me. Clicking on the
file in in the web browser view of the file list did not.

So I'll send the file directly :-)

On Jan 22, 2004, at 4:29 AM, Yahoo! Groups Notification wrote:

>
> Hello,
>
> This email message is a notification to let you know that
> a file has been uploaded to the Files area of your pnfs-reqs
> group.
>
> File : /draft-ietf-pNFS-problem-statement-v2.doc
> Uploaded by : garth_a_gibson <garth@panasas.com>
> Description : v0.2 of pNFS problem statement
>
> You can access the file at the URL
>
> http://groups.yahoo.com/group/pnfs-reqs/files/draft-ietf-pNFS-problem-
> statement-v2.doc
>
> Your group is currently configured to send you email
> notification whenever a member uploads a file. To turn off
> notification, visit
>
> http://groups.yahoo.com/group/pnfs-reqs/join
>
> Thank you for choosing Yahoo! Groups as your email group
> service for the pnfs-reqs group.
>
> Regards,
>
> Yahoo! Groups Customer Care
>
> Your use of Yahoo! Groups is subject to
> http://docs.yahoo.com/info/terms/



Attachment (not stored)
draft-ietf-pNFS-problem-statement-v2.doc
Type: application/applefile

Attachment (not stored)
draft-ietf-pNFS-problem-statement-v2.doc
Type: application/msword

From garth@panasas.com Thu Jan 22 08:31:29 2004
Return-Path: <garth@panasas.com>
X-Sender: garth@panasas.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 64677 invoked from network); 22 Jan 2004 16:31:28 -0000
Received: from unknown (66.218.66.167)
by m5.grp.scd.yahoo.com with QMQP; 22 Jan 2004 16:31:28 -0000
Received: from unknown (HELO PIKES.panasas.com) (65.194.124.178)
by mta6.grp.scd.yahoo.com with SMTP; 22 Jan 2004 16:31:28 -0000
Received: from [127.0.0.1] ([172.17.19.3]) by PIKES.panasas.com with SMTP (Microsoft Exchange Internet Mail Service Version 5.5.2653.13)
id SVSYLGFS; Thu, 22 Jan 2004 11:31:27 -0500
Mime-Version: 1.0 (Apple Message framework v609)
Content-Transfer-Encoding: 7bit
Message-Id: <6B29A655-4CF8-11D8-B71A-000A95A94F04@panasas.com>
Content-Type: text/plain; charset=US-ASCII; format=flowed
To: pnfs-reqs@yahoogroups.com
Date: Thu, 22 Jan 2004 08:31:24 -0800
X-Mailer: Apple Mail (2.609)
X-eGroups-Remote-IP: 65.194.124.178
From: Garth Gibson <garth@panasas.com>
Subject: problem statement feedback concall: Mon Jan 26 8am PST, 11am EST
X-Yahoo-Group-Post: member; u=169457820
X-Yahoo-Profile: garth_a_gibson

Peter Corbett is making a pass through the problem statement tomorrow
and we are meeting again for further discussions Monday Jan 26 8am PST,
11am EST at the same conference call dialin numbers we have been using.

I believe this time may be inconvenient for some and would be will to
schedule other concalls next week, as needed.

garth

From craigev@us.ibm.com Thu Jan 22 09:48:03 2004
Return-Path: <craigev@us.ibm.com>
X-Sender: craigev@us.ibm.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 10650 invoked from network); 22 Jan 2004 17:48:00 -0000
Received: from unknown (66.218.66.216)
by m11.grp.scd.yahoo.com with QMQP; 22 Jan 2004 17:48:00 -0000
Received: from unknown (HELO e34.co.us.ibm.com) (32.97.110.132)
by mta1.grp.scd.yahoo.com with SMTP; 22 Jan 2004 17:48:00 -0000
Received: from westrelay02.boulder.ibm.com (westrelay02.boulder.ibm.com [9.17.195.11])
by e34.co.us.ibm.com (8.12.10/8.12.2) with ESMTP id i0MHlc6t361972;
Thu, 22 Jan 2004 12:47:48 -0500
Received: from d03nm130.boulder.ibm.com (d03av02.boulder.ibm.com [9.17.193.82])
by westrelay02.boulder.ibm.com (8.12.10/NCO/VER6.6) with ESMTP id i0MHlREO092548;
Thu, 22 Jan 2004 10:47:27 -0700
To: pnfs-sbc@yahoogroups.com
Cc: pnfs-ops@yahoogroups.com, pnfs-reqs@yahoogroups.com,
pnfs-sbc@yahoogroups.com
X-Mailer: Lotus Notes Release 5.0.11 July 24, 2002
Message-ID: <OF7F584003.58EEA118-ON85256E23.00612D42-85256E23.0061B907@us.ibm.com>
Date: Thu, 22 Jan 2004 12:47:23 -0500
X-MIMETrack: Serialize by Router on D03NM130/03/M/IBM(Release 6.0.2CF2|July 23, 2003) at
01/22/2004 10:47:26
MIME-Version: 1.0
Content-type: multipart/related;
Boundary="0__=0ABBE4B0DFF2ABD28f9e8a93df938690918c0ABBE4B0DFF2ABD2"
X-eGroups-Remote-IP: 32.97.110.132
From: Craig Everhart <craigev@us.ibm.com>
Subject: Re: [pnfs-sbc] Two Functionality issues
X-Yahoo-Group-Post: member; u=67958684

[Catching up slowly, slowly.]

Issue 4.1. Yes, I presented separate read and write mappings at CITI to allow clients to participate in a style of copy-on-write processing. (The concept is straight out of the Tank protocol spec.) There are some simple compression techniques that could be used, since for most virtual offsets, at most one block address is defined. It's only while a block is in the middle of an (uncommitted) copy-on-write operation that both would be defined. But I feel that the ability to make the distinction between read and write mappings, as well as the ability sometimes to offer both a read and a write mapping for a block, offers important functionality.

Issue 4.2: I agree with Dave Noveck that the functionality is likely useful more broadly than in SBC-mode out-of-band access.

Craig

Craig Everhart
+1 919 543 2169 (tie 441 2169)

Inactive hide details for black_david@emc.comblack_david@emc.com



	

                        black_david@emc.com

                        01/02/2004 11:45 AM
                        Please respond to pnfs-sbc

	

To: pnfs-ops@yahoogroups.com, pnfs-reqs@yahoogroups.com, pnfs-sbc@yahoogroups.com
cc:
Subject: [pnfs-sbc] Two Functionality issues


In starting to look at design issues for block metadata,
I've run across a couple of issues around functionality
to be supported that could use wider discussion. This
is based on an initial review of the EMC High Road FMP
protocol and the IBM StorageTank SAN.FS protocol. I've
tried to just describe the issues here without taking a
position.

[4] Functionality

SAN.FS extents come with both read and write
extent mappings and block usage bitmaps. The separate read
and write mappings allow for clients to participate in copy-on-
write functionality - IIRC, Craig has described this.

Issue [4.1]: Should protocol include support for client participation
in copy-on-write?

A motivation for the separate arrays of block usage bits" appears
to be allowing clients to turn file data into holes (e.g.,
AIX fclear system call).

Issue [4.2]: Is the ability to turn valid data into a file "hole"
(e.g., AIX fclear) at the client important to support?

FMP does not support separate read mappings or usage bitmaps,
and hence is not capable of involving clients in copy-on-write
or allowing a client to turn valid data into a file "hole".

Comments? Thanks,
--David

----------------------------------------------------
David L. Black, Senior Technologist
EMC Corporation, 176 South St., Hopkinton, MA 01748
+1 (508) 293-7953 FAX: +1 (508) 293-7786
black_david@emc.com Mobile: +1 (978) 394-7754
----------------------------------------------------




------------------------ Yahoo! Groups Sponsor ---------------------~-->
Upgrade to 128-bit SSL Security!
http://us.click.yahoo.com/qZ0LdD/yjVHAA/TtwFAA/26EolB/TM
---------------------------------------------------------------------~->

Yahoo! Groups Links

To visit your group on the web, go to:
http://groups.yahoo.com/group/pnfs-sbc/

To unsubscribe from this group, send an email to:
pnfs-sbc-unsubscribe@yahoogroups.com

Your use of Yahoo! Groups is subject to:
http://docs.yahoo.com/info/terms/






Attachment (not stored)
pic25436.gif
Type: image/gif

From pcorbett@netapp.com Sun Jan 25 15:08:58 2004
Return-Path: <Peter.Corbett@netapp.com>
X-Sender: Peter.Corbett@netapp.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 61063 invoked from network); 25 Jan 2004 23:08:58 -0000
Received: from unknown (66.218.66.218)
by m4.grp.scd.yahoo.com with QMQP; 25 Jan 2004 23:08:58 -0000
Received: from unknown (HELO mx01.netapp.com) (198.95.226.53)
by mta3.grp.scd.yahoo.com with SMTP; 25 Jan 2004 23:08:58 -0000
Received: from frejya.corp.netapp.com (frejya [10.57.157.119])
by mx01.netapp.com (8.12.10/8.12.10/NTAP-1.4) with ESMTP id i0PN8vKw022469
for <pnfs-reqs@yahoogroups.com>; Sun, 25 Jan 2004 15:08:57 -0800 (PST)
Received: from svlexc01.hq.netapp.com (svlexc01.corp.netapp.com [10.10.22.171])
by frejya.corp.netapp.com (8.12.9/8.12.9/NTAP-1.5) with ESMTP id i0PN8vDi021318
for <pnfs-reqs@yahoogroups.com>; Sun, 25 Jan 2004 15:08:57 -0800 (PST)
content-class: urn:content-classes:message
MIME-Version: 1.0
Content-Type: multipart/mixed;
boundary="----_=_NextPart_001_01C3E398.33E2214A"
X-MimeOLE: Produced By Microsoft Exchange V6.0.6249.0
Date: Sun, 25 Jan 2004 15:08:54 -0800
Message-ID: <C8CF60CFC4D8A74E9945E32CF096548A015BF2DD@silver.nane.netapp.com>
X-MS-Has-Attach: yes
X-MS-TNEF-Correlator:
Thread-Topic: [pnfs-reqs] Re: New file uploaded to pnfs-reqs
Thread-Index: AcPg/94CRT+QaWyhRKGEv16f3S+5fACl+54Q
To: <pnfs-reqs@yahoogroups.com>
X-eGroups-Remote-IP: 198.95.226.53
X-eGroups-From: "Corbett, Peter" <Peter.Corbett@netapp.com>
From: "Corbett, Peter" <pcorbett@netapp.com>
Subject: RE: [pnfs-reqs] Re: New file uploaded to pnfs-reqs
X-Yahoo-Group-Post: member; u=44152959
X-Yahoo-Profile: pfcorbett2004

Here is my set of revisions. I did not have quite as much time to work
on this as I had hoped to, and it still needs quite a bit of work.
Please critique it agressively. I'm not sure I'll be able to make the
call tomorrow, but I'll try to dial in for at least the first part of
it.
Thanks,
Peter


-----Original Message-----
From: Garth Gibson [mailto:garth@panasas.com]
Sent: Thursday, January 22, 2004 10:52 AM
To: pnfs-reqs@yahoogroups.com
Subject: [pnfs-reqs] Re: New file uploaded to pnfs-reqs


Clicking on the URL in this message worked for me. Clicking on the
file in in the web browser view of the file list did not.

So I'll send the file directly :-)

On Jan 22, 2004, at 4:29 AM, Yahoo! Groups Notification wrote:

>
> Hello,
>
> This email message is a notification to let you know that
> a file has been uploaded to the Files area of your pnfs-reqs group.
>
> File : /draft-ietf-pNFS-problem-statement-v2.doc
> Uploaded by : garth_a_gibson <garth@panasas.com>
> Description : v0.2 of pNFS problem statement
>
> You can access the file at the URL
>
> http://groups.yahoo.com/group/pnfs-reqs/files/draft-ietf-pNFS-problem-
> statement-v2.doc
>
> Your group is currently configured to send you email notification
> whenever a member uploads a file. To turn off notification, visit
>
> http://groups.yahoo.com/group/pnfs-reqs/join
>
> Thank you for choosing Yahoo! Groups as your email group service for
> the pnfs-reqs group.
>
> Regards,
>
> Yahoo! Groups Customer Care
>
> Your use of Yahoo! Groups is subject to
> http://docs.yahoo.com/info/terms/




Yahoo! Groups Links

To visit your group on the web, go to:
http://groups.yahoo.com/group/pnfs-reqs/

To unsubscribe from this group, send an email to:
pnfs-reqs-unsubscribe@yahoogroups.com

Your use of Yahoo! Groups is subject to:
http://docs.yahoo.com/info/terms/



Attachment (not stored)
draft-ietf-pNFS-problem-statement-v3.doc
Type: application/msword

From Thomas.Talpey@netapp.com Mon Jan 26 04:12:43 2004
Return-Path: <Thomas.Talpey@netapp.com>
X-Sender: Thomas.Talpey@netapp.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 73709 invoked from network); 26 Jan 2004 12:12:43 -0000
Received: from unknown (66.218.66.218)
by m12.grp.scd.yahoo.com with QMQP; 26 Jan 2004 12:12:43 -0000
Received: from unknown (HELO mx01.netapp.com) (198.95.226.53)
by mta3.grp.scd.yahoo.com with SMTP; 26 Jan 2004 12:12:43 -0000
Received: from frejya.corp.netapp.com (frejya [10.57.157.119])
by mx01.netapp.com (8.12.10/8.12.10/NTAP-1.4) with ESMTP id i0QCCgKw009103
for <pnfs-reqs@yahoogroups.com>; Mon, 26 Jan 2004 04:12:42 -0800 (PST)
Received: from svlexc01.hq.netapp.com (svlexc01.corp.netapp.com [10.10.22.171])
by frejya.corp.netapp.com (8.12.9/8.12.9/NTAP-1.5) with ESMTP id i0QCCfDi011215
for <pnfs-reqs@yahoogroups.com>; Mon, 26 Jan 2004 04:12:41 -0800 (PST)
Received: from tmt.netapp.com ([10.97.1.33]) by silver.hq.netapp.com with Microsoft SMTPSVC(5.0.2195.5329); Mon, 26 Jan 2004 07:12:39 -0500
MIME-Version: 1.0
Content-Type: multipart/alternative;
boundary="----_=_NextPart_001_01C3E405.B0E0D580"
X-MimeOLE: Produced By Microsoft Exchange V6.0.6249.0
content-class: urn:content-classes:message
Date: Mon, 26 Jan 2004 04:12:20 -0800
Message-ID: <5.2.1.1.2.20040126070910.00bf4328@silver.nane.netapp.com>
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
Thread-Topic: [pnfs-reqs] Re: New file uploaded to pnfs-reqs
Thread-Index: AcPkBbF5FIUKYsaPTPuUjgeSaM2FaQ==
To: <pnfs-reqs@yahoogroups.com>
X-eGroups-Remote-IP: 198.95.226.53
From: "Talpey, Thomas" <Thomas.Talpey@netapp.com>
Subject: RE: [pnfs-reqs] Re: New file uploaded to pnfs-reqs
X-Yahoo-Group-Post: member; u=44154239
X-Yahoo-Profile: tmtymailu

I did notice something important in the filename - it cannot be called
"draft-ietf-"something at this point. Only official workgroup documents
may be titled that way. The norm for an individual (or group) submission
is to name it with the principal author, subject and revision, such as
        draft-someone-pnfs-problem-statement-00.txt
The document submission would be rejected otherwise.

Tom.

At 06:08 PM 1/25/2004, Corbett, Peter wrote:
>Here is my set of revisions.  I did not have quite as much time to work
>on this as I had hoped to, and it still needs quite a bit of work.
>Please critique it agressively.  I'm not sure I'll be able to make the
>call tomorrow, but I'll try to dial in for at least the first part of
>it.
>Thanks,
>Peter
>
>
>-----Original Message-----
>From: Garth Gibson [mailto:garth@panasas.com]
>Sent: Thursday, January 22, 2004 10:52 AM
>To: pnfs-reqs@yahoogroups.com
>Subject: [pnfs-reqs] Re: New file uploaded to pnfs-reqs
>
>
>Clicking on the URL in this message worked for me.  Clicking on the 
>file in in the web browser view of the file list did not.
>
>So I'll send the file directly :-)
>
>On Jan 22, 2004, at 4:29 AM, Yahoo! Groups Notification wrote:
>
>>
>> Hello,
>>
>> This email message is a notification to let you know that
>> a file has been uploaded to the Files area of your pnfs-reqs group.
>>
>>   File        : /draft-ietf-pNFS-problem-statement-v2.doc
>>   Uploaded by : garth_a_gibson <garth@panasas.com>
>>   Description : v0.2 of pNFS problem statement
>>
>> You can access the file at the URL
>>
>> http://groups.yahoo.com/group/pnfs-reqs/files/draft-ietf-pNFS-problem-
>> statement-v2.doc
>>
>> Your group is currently configured to send you email notification
>> whenever a member uploads a file.  To turn off notification, visit
>>
>> http://groups.yahoo.com/group/pnfs-reqs/join
>>
>> Thank you for choosing Yahoo! Groups as your email group service for
>> the pnfs-reqs group.
>>
>> Regards,
>>
>> Yahoo! Groups Customer Care
>>
>> Your use of Yahoo! Groups is subject to
>> http://docs.yahoo.com/info/terms/
>
>
>
>
>Yahoo! Groups Links
>
>To visit your group on the web, go to:
>http://groups.yahoo.com/group/pnfs-reqs/
>
>To unsubscribe from this group, send an email to:
>pnfs-reqs-unsubscribe@yahoogroups.com
>
>Your use of Yahoo! Groups is subject to:
>http://docs.yahoo.com/info/terms/
>
>
>
>
>Yahoo! Groups Links
>
>To visit your group on the web, go to:
> http://groups.yahoo.com/group/pnfs-reqs/
>
>To unsubscribe from this group, send an email to:
> pnfs-reqs-unsubscribe@yahoogroups.com
>
>Your use of Yahoo! Groups is subject to:
> http://docs.yahoo.com/info/terms/
>
> 

From dnoveck@netapp.com Mon Jan 26 12:24:49 2004
Return-Path: <Dave.Noveck@netapp.com>
X-Sender: Dave.Noveck@netapp.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 53563 invoked from network); 26 Jan 2004 20:24:49 -0000
Received: from unknown (66.218.66.172)
by m8.grp.scd.yahoo.com with QMQP; 26 Jan 2004 20:24:49 -0000
Received: from unknown (HELO mx01.netapp.com) (198.95.226.53)
by mta4.grp.scd.yahoo.com with SMTP; 26 Jan 2004 20:24:49 -0000
Received: from hawk.corp.netapp.com (hawk [10.57.156.122])
by mx01.netapp.com (8.12.10/8.12.10/NTAP-1.4) with ESMTP id i0QKOmKw019872
for <pnfs-reqs@yahoogroups.com>; Mon, 26 Jan 2004 12:24:48 -0800 (PST)
Received: from svlexc01.hq.netapp.com (svlexc01.corp.netapp.com [10.10.22.171])
by hawk.corp.netapp.com (8.12.9/8.12.9/NTAP-1.5) with ESMTP id i0QKOmRh008134
for <pnfs-reqs@yahoogroups.com>; Mon, 26 Jan 2004 12:24:48 -0800 (PST)
content-class: urn:content-classes:message
MIME-Version: 1.0
Content-Type: text/plain;
charset="iso-8859-1"
Content-Transfer-Encoding: quoted-printable
X-MimeOLE: Produced By Microsoft Exchange V6.0.6249.0
Date: Mon, 26 Jan 2004 12:24:42 -0800
Message-ID: <C8CF60CFC4D8A74E9945E32CF096548A6D3683@silver.nane.netapp.com>
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
Thread-Topic: [pnfs-reqs] Re: New file uploaded to pnfs-reqs
Thread-Index: AcPg/94CRT+QaWyhRKGEv16f3S+5fACl+54QACeO4/A=
To: <pnfs-reqs@yahoogroups.com>
X-eGroups-Remote-IP: 198.95.226.53
X-eGroups-From: "Noveck, Dave" <Dave.Noveck@netapp.com>
From: "Noveck, Dave" <dnoveck@netapp.com>
Subject: RE: [pnfs-reqs] Re: New file uploaded to pnfs-reqs
X-Yahoo-Group-Post: member; u=44152831
X-Yahoo-Profile: davidnoveck

as we discussed at the call today, here is my suggestion for text
to replace the last two paragraphs of the introduction in the
current document. I basically added a bridge paragraph to
introduce the concept of separating data and control, swapped
the order of the two existing paragraphs and made some minor
adjustments in wording.

One way of increasing the bandwidth provided for access through a single file system is to enable access to be provided, in a coherent fashion, through multiple endpoints. Separation of control and data flows provides a straightforward framework to accomplish this, by allowing data transfer to proceed in parallel from many clients to many data storage endpoints. Control and file management operations, inherently difficult to parallelize, remain the province of the single NFS server, while the offloading of data transfer operations serves to provide the requisite bandwidth scalability. Data transfer may proceed using NFS or other protocols suitable for the purpose such as iSCSI.

Today the file system marketplace offers a number of proprietary alternatives to NFS servers that provide separated control and data flow. Examples include EMC High Road and IBM TotalStorage SAN FS. The lack of interoperability between these proprietary approaches hinders their adoption. An approach that solves the bandwidth problem using NFS is most desirable. By standardizing the key architectural features of separated control and data flows, a range of competitive and interoperable implementations can be provided. Moreover the industry's large investment in NFS would be protected. Without such an NFS-based solution to the bandwidth bottleneck, other file access approaches will compete with NFS (and probably with each other), causing a range of interoperability difficulties, compromising the benefits provided by a standard file access protocol.

An approach that separates control and data flow and provides for data access through other protocols has additional benefits. Even though NFS is widely used as a network file system protocol, most of the world's data resides in data stores that are not accessible through NFS. Much of this data is stored in Storage Area Networks, accessible by Fibre Channel Protocol, or increasingly, by iSCSI. Storage Area Networks do not have the simple management capability that comes from a file system, that associates data with named objects in a hierarchical namespace. Such capabilities can be provided by NFS, while leveraging the existing SAN data access infrastructure, all within a common architectural framework.


-----Original Message-----
From: Corbett, Peter
Sent: Sunday, January 25, 2004 6:09 PM
To: pnfs-reqs@yahoogroups.com
Subject: RE: [pnfs-reqs] Re: New file uploaded to pnfs-reqs


Here is my set of revisions. I did not have quite as much time to work
on this as I had hoped to, and it still needs quite a bit of work.
Please critique it agressively. I'm not sure I'll be able to make the
call tomorrow, but I'll try to dial in for at least the first part of
it.
Thanks,
Peter


-----Original Message-----
From: Garth Gibson [mailto:garth@panasas.com]
Sent: Thursday, January 22, 2004 10:52 AM
To: pnfs-reqs@yahoogroups.com
Subject: [pnfs-reqs] Re: New file uploaded to pnfs-reqs


Clicking on the URL in this message worked for me. Clicking on the
file in in the web browser view of the file list did not.

So I'll send the file directly :-)

On Jan 22, 2004, at 4:29 AM, Yahoo! Groups Notification wrote:

>
> Hello,
>
> This email message is a notification to let you know that
> a file has been uploaded to the Files area of your pnfs-reqs group.
>
> File : /draft-ietf-pNFS-problem-statement-v2.doc
> Uploaded by : garth_a_gibson <garth@panasas.com>
> Description : v0.2 of pNFS problem statement
>
> You can access the file at the URL
>
> http://groups.yahoo.com/group/pnfs-reqs/files/draft-ietf-pNFS-problem-
> statement-v2.doc
>
> Your group is currently configured to send you email notification
> whenever a member uploads a file. To turn off notification, visit
>
> http://groups.yahoo.com/group/pnfs-reqs/join
>
> Thank you for choosing Yahoo! Groups as your email group service for
> the pnfs-reqs group.
>
> Regards,
>
> Yahoo! Groups Customer Care
>
> Your use of Yahoo! Groups is subject to
> http://docs.yahoo.com/info/terms/




Yahoo! Groups Links

To visit your group on the web, go to:
http://groups.yahoo.com/group/pnfs-reqs/

To unsubscribe from this group, send an email to:
pnfs-reqs-unsubscribe@yahoogroups.com

Your use of Yahoo! Groups is subject to:
http://docs.yahoo.com/info/terms/




Yahoo! Groups Links

To visit your group on the web, go to:
http://groups.yahoo.com/group/pnfs-reqs/

To unsubscribe from this group, send an email to:
pnfs-reqs-unsubscribe@yahoogroups.com

Your use of Yahoo! Groups is subject to:
http://docs.yahoo.com/info/terms/

From garth@panasas.com Mon Jan 26 16:56:01 2004
Return-Path: <garth@panasas.com>
X-Sender: garth@panasas.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 69141 invoked from network); 27 Jan 2004 00:56:00 -0000
Received: from unknown (66.218.66.166)
by m1.grp.scd.yahoo.com with QMQP; 27 Jan 2004 00:56:00 -0000
Received: from unknown (HELO PIKES.panasas.com) (65.194.124.178)
by mta5.grp.scd.yahoo.com with SMTP; 27 Jan 2004 00:56:00 -0000
Received: from [127.0.0.1] ([172.17.19.3]) by PIKES.panasas.com with SMTP (Microsoft Exchange Internet Mail Service Version 5.5.2653.13)
id SVSYLZL4; Mon, 26 Jan 2004 19:55:58 -0500
Mime-Version: 1.0 (Apple Message framework v609)
Content-Transfer-Encoding: quoted-printable
Message-Id: <8EE09C50-5063-11D8-A540-000A95A94F04@panasas.com>
Content-Type: text/plain; charset=WINDOWS-1252; format=flowed
To: pnfs-reqs@yahoogroups.com
Date: Mon, 26 Jan 2004 16:55:53 -0800
X-Mailer: Apple Mail (2.609)
X-eGroups-Remote-IP: 65.194.124.178
From: Garth Gibson <garth@panasas.com>
Subject: here are some citations that may work for the problem statement
X-Yahoo-Group-Post: member; u=169457820
X-Yahoo-Profile: garth_a_gibson

Requirements for Bandwidth ========

SGS File System RFP, DOE NNCA and DOD NSA, April 25, 2001.

Knott, T., "Computing colossus," BP Frontiers magazine, Issue 6, April
2003, http://www.bp.com/frontiers.

striping over file servers =========

John H. Hartman and John K. Ousterhout, "The Zebra Striped Network File
System," ACM Transactions on Computer Systems 13, 3, August 1995,
279-310.

CMU NASD ===================

Gibson, G. A., et. al., �A Cost-Effective, High-Bandwidth Storage
Architecture,� International Conference on Architectural Support for
Programming Languages and Operating Systems (ASPLOS), October 1998.

Amiri, K., Gibson, G.A., Golding, R., "Highly Concurrent Shared
Storage," Int. Conf. on Distributed Computing Systems (ICDCS00), April
2000.


Panasas ======================

Garth A. Gibson, Brent B. Welch, David F. Nagle, Bruce C. Moxon,
"Object Storage: Scalable Bandwidth for HPC Clusters," Proc. of the
ClusterWorld Conference and Expo June 23-26, 2003, in San Jose, CA,
www.clusterworld.com.


IBM Objects ===================

Azagury, A., Dreizin, V., Factor, M., Henis, E., Naor, D., Rinetzky,
N., Satran, J., Tavory, A., Yerushalmi, L, �Towards an Object Store,�
IBM Storage Systems Technology Workshop, November 2002.

Rodeh, O., Schonfeld, U., Teperman, A., �zFS - A Scalable distributed
File System using Object Disks,� IBM Storage Systems Technology
Workshop, November 2002.

Miller, E. L., Freeman, W. E., Long, D. E., Reed, B. C., "Strong
Security for Network Attached Storage," USENIX Conference on File and
Storage Technologies (FAST), 2002.


Other object-like solutions =========

Lee, E., Thekkath, C. Petal, �Distributed virtual disks,� ACM 7th
International Conference on Architectural Support for Programming
Languages and Operating Systems (ASPLOS) October, 1996.

�Lustre: A Scalable, High Performance File System,� Cluster File
System, Inc., 2003, http://www.lustre.org/docs.html.


Other products =================

SAN FS

Sanergy

GPFS

High Road

Sistina

CXFS

QFS -- Harriet Coverston, "Enabling Advanced Data Management with Sun 
StorEdge(TM) QFS/SAM-FS 4.0", Twentieth IEEE / Eleventh NASA Goddard
Conference on Mass Storage Systems and Technologies, April 2003.

DAFS

====== MSST coming up

File System Workload Analysis For Large Scientific Computing
Applications, Feng Wang, Qin Xin, Bo Hong, Ethan L. Miller, Darrell D.
E. Long, Scott A. Brandt, University of California, Santa Cruz, Tyce T.
McLarty, Lawrence Livermore National Laboratory 

From pcorbett@netapp.com Tue Jan 27 06:24:33 2004
Return-Path: <Peter.Corbett@netapp.com>
X-Sender: Peter.Corbett@netapp.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 62706 invoked from network); 27 Jan 2004 14:16:28 -0000
Received: from unknown (66.218.66.172)
by m11.grp.scd.yahoo.com with QMQP; 27 Jan 2004 14:16:28 -0000
Received: from unknown (HELO mx01.netapp.com) (198.95.226.53)
by mta4.grp.scd.yahoo.com with SMTP; 27 Jan 2004 14:16:28 -0000
Received: from hawk.corp.netapp.com (hawk [10.57.156.122])
by mx01.netapp.com (8.12.10/8.12.10/NTAP-1.4) with ESMTP id i0REG3Kw014476
for <pnfs-reqs@yahoogroups.com>; Tue, 27 Jan 2004 06:16:03 -0800 (PST)
Received: from svlexc01.hq.netapp.com (svlexc01.corp.netapp.com [10.10.22.171])
by hawk.corp.netapp.com (8.12.9/8.12.9/NTAP-1.5) with ESMTP id i0REG3Rh021660
for <pnfs-reqs@yahoogroups.com>; Tue, 27 Jan 2004 06:16:03 -0800 (PST)
content-class: urn:content-classes:message
MIME-Version: 1.0
Content-Type: multipart/mixed;
boundary="----_=_NextPart_001_01C3E4E0.160A787C"
X-MimeOLE: Produced By Microsoft Exchange V6.0.6249.0
Date: Tue, 27 Jan 2004 06:15:59 -0800
Message-ID: <C8CF60CFC4D8A74E9945E32CF096548A015BF2E8@silver.nane.netapp.com>
X-MS-Has-Attach: yes
X-MS-TNEF-Correlator:
Thread-Topic: new version of problem statement
Thread-Index: AcPk4BMGlQX8HwJSRrO/v8ehVEFedQ==
To: <pnfs-reqs@yahoogroups.com>
X-eGroups-Remote-IP: 198.95.226.53
X-eGroups-From: "Corbett, Peter" <Peter.Corbett@netapp.com>
From: "Corbett, Peter" <pcorbett@netapp.com>
Subject: new version of problem statement
X-Yahoo-Group-Post: member; u=44152959
X-Yahoo-Profile: pfcorbett2004

Here is a new version of the problem statement.  There are still some gaps, especially in the application section.  I incorporated the paragraph Dave wrote in the introduction.  I made a large number of local changes, and a few broader changes, moving a few paragraphs around, and deleting some repetitive content.   I think it is getting better.  It is still making the same point over and over again.  And it is repeating the same point, making it several times.

Garth, I didn't add your references.  Can you do that?  Also, I couldn't track down the spelling of Benny's last name for the Ack section.  Garth, you will also need to add your address info.

I am going to pass the token now, as I don't think I'll have any more time to work on this before I leave on vacation Friday.  Please forward comments to the group.

Thanks,
Peter

<<draft-ietf-pNFS-problem-statement-v3.doc>>


Attachment (not stored)
draft-ietf-pNFS-problem-statement-v3.doc
Type: application/msword

From dnoveck@netapp.com Tue Jan 27 08:01:01 2004
Return-Path: <Dave.Noveck@netapp.com>
X-Sender: Dave.Noveck@netapp.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 19614 invoked from network); 27 Jan 2004 16:00:57 -0000
Received: from unknown (66.218.66.216)
by m15.grp.scd.yahoo.com with QMQP; 27 Jan 2004 16:00:57 -0000
Received: from unknown (HELO mx01.netapp.com) (198.95.226.53)
by mta1.grp.scd.yahoo.com with SMTP; 27 Jan 2004 16:00:57 -0000
Received: from frejya.corp.netapp.com (frejya [10.57.157.119])
by mx01.netapp.com (8.12.10/8.12.10/NTAP-1.4) with ESMTP id i0RFjlKw000958
for <pnfs-reqs@yahoogroups.com>; Tue, 27 Jan 2004 07:45:47 -0800 (PST)
Received: from svlexc01.hq.netapp.com (svlexc01.corp.netapp.com [10.10.22.171])
by frejya.corp.netapp.com (8.12.9/8.12.9/NTAP-1.5) with ESMTP id i0RFjkDi001953
for <pnfs-reqs@yahoogroups.com>; Tue, 27 Jan 2004 07:45:46 -0800 (PST)
content-class: urn:content-classes:message
MIME-Version: 1.0
Content-Type: multipart/alternative;
boundary="----_=_NextPart_001_01C3E4EC.9F2B86AF"
X-MimeOLE: Produced By Microsoft Exchange V6.0.6249.0
Date: Tue, 27 Jan 2004 07:45:43 -0800
Message-ID: <C8CF60CFC4D8A74E9945E32CF096548A6D3685@silver.nane.netapp.com>
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
Thread-Topic: new version of problem statement
Thread-Index: AcPk4BMGlQX8HwJSRrO/v8ehVEFedQAC3a3g
To: <pnfs-reqs@yahoogroups.com>
X-eGroups-Remote-IP: 198.95.226.53
X-eGroups-From: "Noveck, Dave" <Dave.Noveck@netapp.com>
From: "Noveck, Dave" <dnoveck@netapp.com>
Subject: RE: [pnfs-reqs] new version of problem statement
X-Yahoo-Group-Post: member; u=44152831
X-Yahoo-Profile: davidnoveck

ADVERTISEMENT
I just noticed that although Peter incorporated my paragraphs in the Introduction, the two paragraphs that they were intended to replace are still there as well.
 
Garth, when you make the next pass, could you delete the fourth and fifth paragraphs of the introduction.
 
One other issue is that I still think that the last and penultimate paragraphs of the introduction are better swapped.  What do other people think about this?

    -----Original Message-----
    From: Corbett, Peter
    Sent: Tuesday, January 27, 2004 9:16 AM
    To: pnfs-reqs@yahoogroups.com
    Subject: [pnfs-reqs] new version of problem statement

    Here is a new version of the problem statement.  There are still some gaps, especially in the application section.  I incorporated the paragraph Dave wrote in the introduction.  I made a large number of local changes, and a few broader changes, moving a few paragraphs around, and deleting some repetitive content.   I think it is getting better.  It is still making the same point over and over again.  And it is repeating the same point, making it several times.

    Garth, I didn't add your references.  Can you do that?  Also, I couldn't track down the spelling of Benny's last name for the Ack section.  Garth, you will also need to add your address info.

    I am going to pass the token now, as I don't think I'll have any more time to work on this before I leave on vacation Friday.  Please forward comments to the group.

    Thanks,
    Peter

    <<draft-ietf-pNFS-problem-statement-v3.doc>>


    Yahoo! Groups Links

        * To visit your group on the web, go to:
          http://groups.yahoo.com/group/pnfs-reqs/
           
        * To unsubscribe from this group, send an email to:
          pnfs-reqs-unsubscribe@yahoogroups.com
           
        * Your use of Yahoo! Groups is subject to the Yahoo! Terms of Service. 

From bhalevy@panasas.com Tue Jan 27 14:18:06 2004
Return-Path: <bhalevy@panasas.com>
X-Sender: bhalevy@panasas.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 35112 invoked from network); 27 Jan 2004 17:50:20 -0000
Received: from unknown (66.218.66.218)
by m4.grp.scd.yahoo.com with QMQP; 27 Jan 2004 17:50:20 -0000
Received: from unknown (HELO PIKES.panasas.com) (65.194.124.178)
by mta3.grp.scd.yahoo.com with SMTP; 27 Jan 2004 17:50:19 -0000
Received: by PIKES.panasas.com with Internet Mail Service (5.5.2653.19)
id <SVSYL7WN>; Tue, 27 Jan 2004 12:50:05 -0500
Message-ID: <30489F1321F5C343ACF6872B2CF7942A05D387D5@PIKES.panasas.com>
To: "'pnfs-reqs@yahoogroups.com'" <pnfs-reqs@yahoogroups.com>
Date: Tue, 27 Jan 2004 12:50:05 -0500
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2653.19)
Content-Type: multipart/alternative;
boundary="----_=_NextPart_001_01C3E4FD.FEDE34A0"
X-eGroups-Remote-IP: 65.194.124.178
From: "Halevy, Benny" <bhalevy@panasas.com>
Subject: RE: [pnfs-reqs] new version of problem statement
X-Yahoo-Group-Post: member; u=169276676
X-Yahoo-Profile: benny_halevy

What you suggest makes sense.  I'd move the last sentence of the "proprietary systems"
paragraph to the grand finale since it is pretty much repeated in the other paragraph.
 
That will make these two paragraphs look like this:
 

Today the file system marketplace offers a number of proprietary systems that provide separated control and data flow.  Examples include EMC High Road and IBM TotalStorage SAN FS.  The lack of interoperability between these proprietary systems hinders their adoption.  An approach that solves the bandwidth problem using NFS is desirable.  By standardizing the key architectural features of separated control and data flows, a range of competitive and interoperable implementations can be provided.

 

Such an approach has additional benefits.  Even though NFS is widely used as a network file system protocol, most of the world's data resides in data stores that are not accessible through NFS.  Much of this data is stored in Storage Area Networks, accessible by Fibre Channel Protocol, or increasingly, by iSCSI.  Storage Area Networks do not have the simple management capability that comes from a file system, which associates data with named objects in a hierarchical namespace.  Such capabilities can be provided by NFS, while leveraging the existing SAN data access infrastructure, all within a common architectural framework protecting
the industry's large investment both in NFS and in SAN storage infrastructure.

    -----Original Message-----
    From: Noveck, Dave [mailto:dnoveck@netapp.com]
    Sent: Tuesday, January 27, 2004 10:46 AM
    To: pnfs-reqs@yahoogroups.com
    Subject: RE: [pnfs-reqs] new version of problem statement

    I just noticed that although Peter incorporated my paragraphs in the Introduction, the two paragraphs that they were intended to replace are still there as well.
     
    Garth, when you make the next pass, could you delete the fourth and fifth paragraphs of the introduction.
     
    One other issue is that I still think that the last and penultimate paragraphs of the introduction are better swapped.  What do other people think about this?

        -----Original Message-----
        From: Corbett, Peter
        Sent: Tuesday, January 27, 2004 9:16 AM
        To: pnfs-reqs@yahoogroups.com
        Subject: [pnfs-reqs] new version of problem statement

        Here is a new version of the problem statement.  There are still some gaps, especially in the application section.  I incorporated the paragraph Dave wrote in the introduction.  I made a large number of local changes, and a few broader changes, moving a few paragraphs around, and deleting some repetitive content.   I think it is getting better.  It is still making the same point over and over again.  And it is repeating the same point, making it several times.

        Garth, I didn't add your references.  Can you do that?  Also, I couldn't track down the spelling of Benny's last name for the Ack section.  Garth, you will also need to add your address info.

        I am going to pass the token now, as I don't think I'll have any more time to work on this before I leave on vacation Friday.  Please forward comments to the group.

        Thanks,
        Peter

        <<draft-ietf-pNFS-problem-statement-v3.doc>>


        Yahoo! Groups Links

            * To visit your group on the web, go to:
              http://groups.yahoo.com/group/pnfs-reqs/
               
            * To unsubscribe from this group, send an email to:
              pnfs-reqs-unsubscribe@yahoogroups.com
               
            * Your use of Yahoo! Groups is subject to the Yahoo! Terms of Service. 



    Yahoo! Groups Links

        * To visit your group on the web, go to:
          http://groups.yahoo.com/group/pnfs-reqs/
           
        * To unsubscribe from this group, send an email to:
          pnfs-reqs-unsubscribe@yahoogroups.com
           
        * Your use of Yahoo! Groups is subject to the Yahoo! Terms of Service. 

From ggrider@lanl.gov Tue Jan 27 17:23:09 2004
Return-Path: <ggrider@lanl.gov>
X-Sender: ggrider@lanl.gov
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 71483 invoked from network); 28 Jan 2004 01:21:50 -0000
Received: from unknown (66.218.66.217)
by m9.grp.scd.yahoo.com with QMQP; 28 Jan 2004 01:21:50 -0000
Received: from unknown (HELO mailwasher-b.lanl.gov) (192.16.0.25)
by mta2.grp.scd.yahoo.com with SMTP; 28 Jan 2004 01:21:53 -0000
Received: from mailrelay2.lanl.gov (localhost.localdomain [127.0.0.1])
by mailwasher-b.lanl.gov (8.12.10/8.12.10/(ccn-5)) with ESMTP id i0S1K6hE030602
for <pnfs-reqs@yahoogroups.com>; Tue, 27 Jan 2004 18:20:06 -0700
Received: from cic-mail.lanl.gov (localhost.localdomain [127.0.0.1])
by mailrelay2.lanl.gov (8.12.10/8.12.10/(ccn-5)) with ESMTP id i0S1K5sI012576;
Tue, 27 Jan 2004 18:20:06 -0700
Received: from cthulu.lanl.gov (vpn-client-131.lanl.gov [128.165.253.131])
by cic-mail.lanl.gov (8.12.10/8.12.10/(ccn-5)) with ESMTP id i0S1JtYi024530;
Tue, 27 Jan 2004 18:19:59 -0700
Message-Id: <5.2.0.9.2.20040127181650.01609868@cic-mail.lanl.gov>
X-Sender: ggrider@cic-mail.lanl.gov
X-Mailer: QUALCOMM Windows Eudora Version 5.2.0.9
Date: Tue, 27 Jan 2004 18:19:52 -0700
To: pnfs-reqs@yahoogroups.com,
"'pnfs-reqs@yahoogroups.com'" <pnfs-reqs@yahoogroups.com>
Cc: garth Gibson <garth@panasas.com>
In-Reply-To: <30489F1321F5C343ACF6872B2CF7942A05D387D5@PIKES.panasas.com
>
Mime-Version: 1.0
Content-Type: multipart/alternative;
boundary="=====================_16215676==.ALT"
X-Scanned-By: MIMEDefang 2.35
X-eGroups-Remote-IP: 192.16.0.25
From: Gary Grider <ggrider@lanl.gov>
Subject: RE: [pnfs-reqs] new version of problem statement
X-Yahoo-Group-Post: member; u=169341185
X-Yahoo-Profile: ggriderpnfs

I worked on the cluster applications section a bit.

Here is where I am at:
---------------------------------------------------------------------------------------------------------------------------------------
Clustered Applications
There is a large number of clustered applications in many industry verticals that require bandwidth scaling of a single file system beyond what is possible with a single NFS server/network endpoint. Industries that have these applications include; industries that use high performance computation in science and engineering like universities and government laboratories, automotive, and aerospace; industries that do large scale data analysis like seismic, genomics, government intelligence, and business intelligence; and industries that create and use data for viewing and interacting with such as rendering, video production, video distribution, gaming, web serving, archiving etc.  Many different I/O models are used in these industries, but all require bandwidths that extend to tens of gigabytes/sec, sometimes to/from a single file, sometimes to multiple files in the same directory, and sometimes from multiple files in different directories.
With clustered computing becoming the prevalent way to address these applications needs, it will always be relatively easy to have scaled bandwidth needs that go well beyond a single NFS server/network endpoint, so the problem this proposal is addressing will not go away with time, in fact as scaling clusters to larger processor counts gets easier, the problem will get worse. 

In addition to the above data intensive cluster oriented applications, there has been increasing use of NFS file servers as the storage subsystem for databases.  The traditional alternative has been to store databases in raw storage partitions, either on locally attached disks, or more commonly, in a Fibre Channel attached Storage Area Network.  An advantage of the file-based approach is that it allows easier management of the data, especially in environments where there are very large numbers of database tables.  However, the bandwidth achievable by the database servers to the file server is limited. In a SAN-based environment, individual tables stored on individual devices can become a hotspot.
Tables can be distributed across a number of SAN devices connected to a number of Fibre Channels, to increase bandwidth.  However, this introduces a significant degree of complexity to determine the best layout.  This proposal addresses the issue of limited bandwidth from an NFS server by parallelizing data access to a single file system across a number of data servers.  This allows increased bandwidth, comparable to that achieved from SAN storage.  At the same time, it provides the benefits that accrue from using file-based storage.  The parallelization of the file data can be done in such a way that the bandwidth achievable is robust across a wide variety of workloads.  This can be accomplished without a large administrative burden.

There is no shortage of applications that stretch standard single end point NFS file servers.  NFS is well poised to assist in providing a standards based solution to help these applications that users and sites can deploy confidently.
---------------------------------------------------------------------------------------------------------------------------------
Hope this helps a bit.  If I missed an area you would like some words on, please let me know,
I could probably get to it tonight, I hope.

Thanks
Gary

At 12:50 PM 1/27/2004 -0500, Halevy, Benny wrote:

> What you suggest makes sense.  I'd move the last sentence of the "proprietary systems"
> paragraph to the grand finale since it is pretty much repeated in the other paragraph.
>  
> That will make these two paragraphs look like this:
>  <?xml:namespace prefix = o ns = "urn:schemas-microsoft-com:office:office" />
>
> Today the file system marketplace offers a number of proprietary systems that provide separated control and data flow.  Examples include EMC High Road and IBM TotalStorage SAN FS.  The lack of interoperability between these proprietary systems hinders their adoption.  An approach that solves the bandwidth problem using NFS is desirable.  By standardizing the key architectural features of separated control and data flows, a range of competitive and interoperable implementations can be provided.
>
>  
>
> Such an approach has additional benefits.  Even though NFS is widely used as a network file system protocol, most of the world's data resides in data stores that are not accessible through NFS.  Much of this data is stored in Storage Area Networks, accessible by Fibre Channel Protocol, or increasingly, by iSCSI.  Storage Area Networks do not have the simple management capability that comes from a file system, which associates data with named objects in a hierarchical namespace.  Such capabilities can be provided by NFS, while leveraging the existing SAN data access infrastructure, all within a common architectural framework protecting the industry's large investment both in NFS and in SAN storage infrastructure.
>
>     -----Original Message-----
>     From: Noveck, Dave [mailto:dnoveck@netapp.com]
>     Sent: Tuesday, January 27, 2004 10:46 AM
>     To: pnfs-reqs@yahoogroups.com
>     Subject: RE: [pnfs-reqs] new version of problem statement
>
>     I just noticed that although Peter incorporated my paragraphs in the Introduction, the two paragraphs that they were intended to replace are still there as well.
>      
>     Garth, when you make the next pass, could you delete the fourth and fifth paragraphs of the introduction.
>      
>     One other issue is that I still think that the last and penultimate paragraphs of the introduction are better swapped.  What do other people think about this?
>     -----Original Message-----
>     From: Corbett, Peter
>     Sent: Tuesday, January 27, 2004 9:16 AM
>     To: pnfs-reqs@yahoogroups.com
>     Subject: [pnfs-reqs] new version of problem statement
>
>     Here is a new version of the problem statement.  There are still some gaps, especially in the application section.  I incorporated the paragraph Dave wrote in the introduction.  I made a large number of local changes, and a few broader changes, moving a few paragraphs around, and deleting some repetitive content.   I think it is getting better.  It is still making the same point over and over again.  And it is repeating the same point, making it several times.
>
>     Garth, I didn't add your references.  Can you do that?  Also, I couldn't track down the spelling of Benny's last name for the Ack section.  Garth, you will also need to add your address info.
>
>     I am going to pass the token now, as I don't think I'll have any more time to work on this before I leave on vacation Friday.  Please forward comments to the group.
>
>     Thanks,
>     Peter
>
>     <<draft-ietf-pNFS-problem-statement-v3.doc>>
>
>
>     Yahoo! Groups Links 
>
>     * To visit your group on the web, go to:
>     * http://groups.yahoo.com/group/pnfs-reqs/
>     *  
>     * To unsubscribe from this group, send an email to:
>     * pnfs-reqs-unsubscribe@yahoogroups.com
>     *  
>     * Your use of Yahoo! Groups is subject to the Yahoo! Terms of Service.
>
>       Yahoo! Groups Links
>           o To visit your group on the web, go to:
>           o http://groups.yahoo.com/group/pnfs-reqs/
>           o  
>           o To unsubscribe from this group, send an email to:
>           o pnfs-reqs-unsubscribe@yahoogroups.com
>           o  
>           o Your use of Yahoo! Groups is subject to the Yahoo! Terms of Service.
>
>             Yahoo! Groups Links
>                 + To visit your group on the web, go to:
>                 + http://groups.yahoo.com/group/pnfs-reqs/
>                 +  
>                 + To unsubscribe from this group, send an email to:
>                 + pnfs-reqs-unsubscribe@yahoogroups.com
>                 +  
>                 + Your use of Yahoo! Groups is subject to the Yahoo! Terms of Service. 

From garth@panasas.com Thu Jan 29 06:05:31 2004
Return-Path: <garth@panasas.com>
X-Sender: garth@panasas.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 77144 invoked from network); 29 Jan 2004 14:05:30 -0000
Received: from unknown (66.218.66.167)
by m5.grp.scd.yahoo.com with QMQP; 29 Jan 2004 14:05:30 -0000
Received: from unknown (HELO PIKES.panasas.com) (65.194.124.178)
by mta6.grp.scd.yahoo.com with SMTP; 29 Jan 2004 14:05:30 -0000
Received: from [127.0.0.1] ([172.17.19.3]) by PIKES.panasas.com with SMTP (Microsoft Exchange Internet Mail Service Version 5.5.2653.13)
id SVSYMHA4; Thu, 29 Jan 2004 09:03:28 -0500
Mime-Version: 1.0 (Apple Message framework v609)
In-Reply-To: <5.2.0.9.2.20040127181650.01609868@cic-mail.lanl.gov>
References: <5.2.0.9.2.20040127181650.01609868@cic-mail.lanl.gov>
Content-Type: multipart/mixed; boundary=Apple-Mail-3-441032590
Message-Id: <E2C0F720-5263-11D8-A5D8-000A95A94F04@panasas.com>
Date: Thu, 29 Jan 2004 06:03:16 -0800
To: pnfs-reqs@yahoogroups.com
X-Mailer: Apple Mail (2.609)
X-eGroups-Remote-IP: 65.194.124.178
From: Garth Gibson <garth@panasas.com>
Subject: Re: [pnfs-reqs] new version of problem statement
X-Yahoo-Group-Post: member; u=169457820
X-Yahoo-Profile: garth_a_gibson

Here is my Wed pass -- most of the work was in the applications and
citations sections, but I also did quite a bit in the introduction and
a little bit in other places.

Weekly concall is at 11am EST today.

garth



Attachment (not stored)
draft-gibson-prob-st-00.doc
Type: application/applefile

Attachment (not stored)
draft-gibson-prob-st-00.doc
Type: application/msword

From Brian.Pawlowski@netapp.com Thu Jan 29 06:27:08 2004
Return-Path: <beepy@netapp.com>
X-Sender: beepy@netapp.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 33797 invoked from network); 29 Jan 2004 14:26:58 -0000
Received: from unknown (66.218.66.167)
by m20.grp.scd.yahoo.com with QMQP; 29 Jan 2004 14:26:58 -0000
Received: from unknown (HELO mx01.netapp.com) (198.95.226.53)
by mta6.grp.scd.yahoo.com with SMTP; 29 Jan 2004 14:26:55 -0000
Received: from frejya.corp.netapp.com (frejya [10.57.157.119])
by mx01.netapp.com (8.12.10/8.12.10/NTAP-1.4) with ESMTP id i0TEQhKw015463
for <pnfs-reqs@yahoogroups.com>; Thu, 29 Jan 2004 06:26:43 -0800 (PST)
Received: from tooting-fe.eng.netapp.com (tooting-fe.eng.netapp.com [10.56.10.118])
by frejya.corp.netapp.com (8.12.9/8.12.9/NTAP-1.5) with ESMTP id i0TEQhDi004514
for <pnfs-reqs@yahoogroups.com>; Thu, 29 Jan 2004 06:26:43 -0800 (PST)
Received: (from beepy@localhost)
by tooting-fe.eng.netapp.com (8.11.6+Sun/8.11.6) id i0TEQh717920
for pnfs-reqs@yahoogroups.com; Thu, 29 Jan 2004 06:26:43 -0800 (PST)
Message-Id: <200401291426.i0TEQh717920@tooting-fe.eng.netapp.com>
In-Reply-To: <CF94E7DF-31AA-11D8-996E-000393754F12@panasas.com> from Garth Gibson at "Dec 18, 3 05:37:50 pm"
To: pnfs-reqs@yahoogroups.com
Date: Thu, 29 Jan 2004 06:26:43 -0800 (PST)
X-Mailer: ELM [version 2.4ME++ PL40 (25)]
MIME-Version: 1.0
Content-Type: text/plain; charset=US-ASCII
Content-Transfer-Encoding: 7bit
X-eGroups-Remote-IP: 198.95.226.53
X-eGroups-From: Brian Pawlowski <beepy@netapp.com>
From: Brian Pawlowski <Brian.Pawlowski@netapp.com>
Subject: How am I identified in pnfs-reqs and -ops mail lists?
X-Yahoo-Group-Post: member; u=169504717

Can't get into archives - it's getting cranky.

My e-mail address is beepy@netapp.com

Did you enter me in some other way?

beepy




From Brian.Pawlowski@netapp.com Thu Jan 29 06:35:10 2004
Return-Path: <beepy@netapp.com>
X-Sender: beepy@netapp.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 52196 invoked from network); 29 Jan 2004 14:35:09 -0000
Received: from unknown (66.218.66.166)
by m12.grp.scd.yahoo.com with QMQP; 29 Jan 2004 14:35:09 -0000
Received: from unknown (HELO mx01.netapp.com) (198.95.226.53)
by mta5.grp.scd.yahoo.com with SMTP; 29 Jan 2004 14:35:09 -0000
Received: from hawk.corp.netapp.com (hawk [10.57.156.122])
by mx01.netapp.com (8.12.10/8.12.10/NTAP-1.4) with ESMTP id i0TEVKKw016076
for <pnfs-reqs@yahoogroups.com>; Thu, 29 Jan 2004 06:31:20 -0800 (PST)
Received: from tooting-fe.eng.netapp.com (tooting-fe.eng.netapp.com [10.56.10.118])
by hawk.corp.netapp.com (8.12.9/8.12.9/NTAP-1.5) with ESMTP id i0TEVKRh012071
for <pnfs-reqs@yahoogroups.com>; Thu, 29 Jan 2004 06:31:20 -0800 (PST)
Received: (from beepy@localhost)
by tooting-fe.eng.netapp.com (8.11.6+Sun/8.11.6) id i0TEVJa18450;
Thu, 29 Jan 2004 06:31:19 -0800 (PST)
Message-Id: <200401291431.i0TEVJa18450@tooting-fe.eng.netapp.com>
In-Reply-To: <200401291426.i0TEQh717920@tooting-fe.eng.netapp.com> from Brian Pawlowski at "Jan 29, 4 06:26:43 am"
To: pnfs-reqs@yahoogroups.com
Date: Thu, 29 Jan 2004 06:31:19 -0800 (PST)
Cc: pnfs-reqs@yahoogroups.com
X-Mailer: ELM [version 2.4ME++ PL40 (25)]
MIME-Version: 1.0
Content-Type: text/plain; charset=US-ASCII
Content-Transfer-Encoding: 7bit
X-eGroups-Remote-IP: 198.95.226.53
X-eGroups-From: Brian Pawlowski <beepy@netapp.com>
From: Brian Pawlowski <Brian.Pawlowski@netapp.com>
Subject: Re: [pnfs-reqs] How am I identified in pnfs-reqs and -ops mail lists?
X-Yahoo-Group-Post: member; u=169504717

Great - meant to send that to Garth - sorry:-)

> Can't get into archives - it's getting cranky.
>
> My e-mail address is beepy@netapp.com
>
> Did you enter me in some other way?
>
> beepy
>
>
>
>
> Yahoo! Groups Links
>
> To visit your group on the web, go to:
> http://groups.yahoo.com/group/pnfs-reqs/
>
> To unsubscribe from this group, send an email to:
> pnfs-reqs-unsubscribe@yahoogroups.com
>
> Your use of Yahoo! Groups is subject to:
> http://docs.yahoo.com/info/terms/
>
> 

From garth@panasas.com Thu Jan 29 08:16:52 2004
Return-Path: <garth@panasas.com>
X-Sender: garth@panasas.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 80097 invoked from network); 29 Jan 2004 16:16:51 -0000
Received: from unknown (66.218.66.172)
by m15.grp.scd.yahoo.com with QMQP; 29 Jan 2004 16:16:51 -0000
Received: from unknown (HELO PIKES.panasas.com) (65.194.124.178)
by mta4.grp.scd.yahoo.com with SMTP; 29 Jan 2004 16:16:43 -0000
Received: from [127.0.0.1] ([172.17.19.3]) by PIKES.panasas.com with SMTP (Microsoft Exchange Internet Mail Service Version 5.5.2653.13)
id SVSYMHPB; Thu, 29 Jan 2004 11:16:06 -0500
In-Reply-To: <E2C0F720-5263-11D8-A5D8-000A95A94F04@panasas.com>
References: <E2C0F720-5263-11D8-A5D8-000A95A94F04@panasas.com>
Mime-Version: 1.0 (Apple Message framework v609)
Content-Type: multipart/mixed; boundary=Apple-Mail-10-448991504
Message-Id: <6AA27F46-5276-11D8-A5D8-000A95A94F04@panasas.com>
Cc: Peter Corbett <pcorbett@netapp.com>
Date: Thu, 29 Jan 2004 08:15:55 -0800
To: pnfs-reqs@yahoogroups.com
X-Mailer: Apple Mail (2.609)
X-eGroups-Remote-IP: 65.194.124.178
From: Garth Gibson <garth@panasas.com>
Subject: Re: [pnfs-reqs] new version of problem statement
X-Yahoo-Group-Post: member; u=169457820
X-Yahoo-Profile: garth_a_gibson

Here is a PDF version.
garth

On Jan 29, 2004, at 6:03 AM, Garth Gibson wrote:

> Here is my Wed pass -- most of the work was in the applications and
> citations sections, but I also did quite a bit in the introduction and
> a little bit in other places.
>
> Weekly concall is at 11am EST today.
>
> garth
>
> Yahoo! Groups Links
>
> To visit your group on the web, go to:
> http://groups.yahoo.com/group/pnfs-reqs/
>
> To unsubscribe from this group, send an email to:
> pnfs-reqs-unsubscribe@yahoogroups.com
>
> Your use of Yahoo! Groups is subject to:
> http://docs.yahoo.com/info/terms/
>
> <draft-gibson-prob-st-00.doc>




Attachment (not stored)
draft-gibson-prob-st-00..pdf
Type: application/pdf

From dhildebz@eecs.umich.edu Sun Feb 01 19:52:38 2004
Return-Path: <dhildebz@eecs.umich.edu>
X-Sender: dhildebz@eecs.umich.edu
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 19117 invoked from network); 2 Feb 2004 03:52:34 -0000
Received: from unknown (66.218.66.217)
by m16.grp.scd.yahoo.com with QMQP; 2 Feb 2004 03:52:34 -0000
Received: from unknown (HELO willow.eecs.umich.edu) (141.213.4.14)
by mta2.grp.scd.yahoo.com with SMTP; 2 Feb 2004 03:52:34 -0000
Received: from willow.eecs.umich.edu (localhost.eecs.umich.edu [127.0.0.1])
by willow.eecs.umich.edu (8.12.11/8.12.11) with ESMTP id i123qLae013573
(version=TLSv1/SSLv3 cipher=DHE-RSA-AES256-SHA bits=256 verify=NO);
Sun, 1 Feb 2004 22:52:22 -0500
Received: from localhost (dhildebz@localhost)
by willow.eecs.umich.edu (8.12.11/8.12.11/Submit) with ESMTP id i123qLUu013570;
Sun, 1 Feb 2004 22:52:21 -0500
X-Authentication-Warning: willow.eecs.umich.edu: dhildebz owned process doing -bs
Date: Sun, 1 Feb 2004 22:52:21 -0500 (EST)
To: pnfs-reqs@yahoogroups.com
Cc: Peter Corbett <pcorbett@netapp.com>
In-Reply-To: <6AA27F46-5276-11D8-A5D8-000A95A94F04@panasas.com>
Message-ID: <Pine.LNX.4.44.0402012233450.29370-100000@willow.eecs.umich.edu>
MIME-Version: 1.0
Content-Type: TEXT/PLAIN; charset=iso-8859-1
Content-Transfer-Encoding: QUOTED-PRINTABLE
X-eGroups-Remote-IP: 141.213.4.14
From: Dean Hildebrand <dhildebz@eecs.umich.edu>
Subject: Re: [pnfs-reqs] new version of problem statement
X-Yahoo-Group-Post: member; u=169352062
X-Yahoo-Profile: seattleplus

A couple comments:

(1)
Page 3, paragraph 2
> Storage Area Networks routinely provide much higher data bandwidths than
>do NFS file servers. Unfortunately, the simple array of blocks interface
>into Storage Area Networks does not lend itself to sharing data among the
>clients in a cluster. NFS file service, with its hierarchical namespace
>of separately controlled files, offers simpler and more cost-effective
>management. One might conclude that users must chose between high
>bandwidth and data sharing.
I'm wondering if the concept of 'data sharing' is obvious here. I mean,
maybe it should be expanded to make it clear it is talking about file
consistency and such.

Page 6, 2nd paragraph (Eliminating the bottleneck)
There is no mention here of NFSv4 state information. I beleive the fact
that NFSv4 state information prevents exporting the same file via multiple
NFSv4 servers (as was done in v3) should be mentioned.

Dean

On Thu, 29 Jan 2004, Garth Gibson wrote:

> Here is a PDF version.
> garth
>
> On Jan 29, 2004, at 6:03 AM, Garth Gibson wrote:
>
> > Here is my Wed pass -- most of the work was in the applications and
> > citations sections, but I also did quite a bit in the introduction and
> > a little bit in other places.
> >
> > Weekly concall is at 11am EST today.
> >
> > garth
> >
> > Yahoo! Groups Links
> >
> > To visit your group on the web, go to:
> >  http://groups.yahoo.com/group/pnfs-reqs/
> >
> > To unsubscribe from this group, send an email to:
> >  pnfs-reqs-unsubscribe@yahoogroups.com
> >
> > Your use of Yahoo! Groups is subject to:
> >  http://docs.yahoo.com/info/terms/
> >
> > <draft-gibson-prob-st-00.doc>
>
>
>
> ________________________________________________________________________________
> Yahoo! Groups Links
> * To visit your group on the web, go to:
> http://groups.yahoo.com/group/pnfs-reqs/
>  
> * To unsubscribe from this group, send an email to:
> pnfs-reqs-unsubscribe@yahoogroups.com
>  
> * Your use of Yahoo! Groups is subject to the Yahoo! Terms of Service.
>
> 

From garth@panasas.com Tue Feb 03 08:30:00 2004
Return-Path: <garth@panasas.com>
X-Sender: garth@panasas.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 66954 invoked from network); 3 Feb 2004 16:29:46 -0000
Received: from unknown (66.218.66.172)
by m13.grp.scd.yahoo.com with QMQP; 3 Feb 2004 16:29:46 -0000
Received: from unknown (HELO PIKES.panasas.com) (65.194.124.178)
by mta4.grp.scd.yahoo.com with SMTP; 3 Feb 2004 16:29:42 -0000
Received: from [172.17.3.217] ([172.17.3.217]) by PIKES.panasas.com with SMTP (Microsoft Exchange Internet Mail Service Version 5.5.2653.13)
id SVSYM9F5; Tue, 3 Feb 2004 11:29:34 -0500
Mime-Version: 1.0 (Apple Message framework v612)
In-Reply-To: <Pine.LNX.4.44.0402012233450.29370-100000@willow.eecs.umich.edu>
References: <Pine.LNX.4.44.0402012233450.29370-100000@willow.eecs.umich.edu>
Content-Type: text/plain; charset=ISO-8859-1; delsp=yes; format=flowed
Message-Id: <231E5860-5666-11D8-8D65-000A95A94F04@panasas.com>
Content-Transfer-Encoding: quoted-printable
Date: Tue, 3 Feb 2004 11:29:28 -0500
To: pnfs-reqs@yahoogroups.com
X-Mailer: Apple Mail (2.612)
X-eGroups-Remote-IP: 65.194.124.178
From: Garth Gibson <garth@panasas.com>
Subject: Re: [pnfs-reqs] new version of problem statement
X-Yahoo-Group-Post: member; u=169457820
X-Yahoo-Profile: garth_a_gibson

ADVERTISEMENT
Dean,

Thanks!

I will clarify data sharing in this paragraph. In my experience these
two words "data sharing" are the most common way the industry
distinguishes any file system from a pure logical volume, so I don't
think there is much risk of confusion.

As to NFSv4 versus NFSv3, exporting a writable filesystem through
multiple server addresses is rarely done in v3, though a few vendors do
this. Those same vendors, at least, are very likely to export a
writable filesystem through v4 over multiple servers, The key thing as
I see it is that neither v3 nor v4 clients have a clue how to use more
than one server address to spread out their work.

I'd be happy to add that v4's additional statefulness adds to the
complexity of exporting the same filesystem through multiple servers,
though as I just argued, I think it is tangential to the main point.

garth


On Feb 1, 2004, at 10:52 PM, Dean Hildebrand wrote:

> A couple comments:
>
> (1)
> Page 3, paragraph 2
>> Storage Area Networks routinely provide much higher data bandwidths
>> than
>> do NFS file servers. Unfortunately, the simple array of blocks
>> interface
>> into Storage Area Networks does not lend itself to sharing data among
>> the
>> clients in a cluster. NFS file service, with its hierarchical
>> namespace
>> of separately controlled files, offers simpler and more cost-effective
>> management. One might conclude that users must chose between high
>> bandwidth and data sharing.
> I'm wondering if the concept of 'data sharing' is obvious here. I
> mean,
> maybe it should be expanded to make it clear it is talking about file
> consistency and such.
>
> Page 6, 2nd paragraph (Eliminating the bottleneck)
> There is no mention here of NFSv4 state information. I beleive the
> fact
> that NFSv4 state information prevents exporting the same file via
> multiple
> NFSv4 servers (as was done in v3) should be mentioned.
>
> Dean
>
> On Thu, 29 Jan 2004, Garth Gibson wrote:
>
>> Here is a PDF version.
>> garth
>>
>> On Jan 29, 2004, at 6:03 AM, Garth Gibson wrote:
>>
>>> Here is my Wed pass -- most of the work was in the applications and
>>> citations sections, but I also did quite a bit in the introduction
>>> and
>>> a little bit in other places.
>>>
>>> Weekly concall is at 11am EST today.
>>>
>>> garth
>>>
>>> Yahoo! Groups Links
>>>
>>> To visit your group on the web, go to:
>>>   http://groups.yahoo.com/group/pnfs-reqs/
>>>
>>> To unsubscribe from this group, send an email to:
>>>   pnfs-reqs-unsubscribe@yahoogroups.com
>>>
>>> Your use of Yahoo! Groups is subject to:
>>>   http://docs.yahoo.com/info/terms/
>>>
>>> <draft-gibson-prob-st-00.doc>
>>
>>
>>
>> ______________________________________________________________________
>> __________
>> Yahoo! Groups Links
>> * To visit your group on the web, go to:
>> http://groups.yahoo.com/group/pnfs-reqs/
>>  
>> * To unsubscribe from this group, send an email to:
>> pnfs-reqs-unsubscribe@yahoogroups.com
>>  
>> * Your use of Yahoo! Groups is subject to the Yahoo! Terms of
>> Service.
>>
>>
>
>
>
>
> ------------------------ Yahoo! Groups Sponsor
> ---------------------~-->
> Buy Ink Cartridges or Refill Kits for your HP, Epson, Canon or Lexmark
> Printer at MyInks.com. Free s/h on orders $50 or more to the US &
> Canada.
> http://www.c1tracking.com/l.asp?cid=5511
> http://us.click.yahoo.com/mOAaAA/3exGAA/qnsNAA/W6uqlB/TM
> ---------------------------------------------------------------------
> ~->
>
> Yahoo! Groups Links
>
> To visit your group on the web, go to:
> http://groups.yahoo.com/group/pnfs-reqs/
>
> To unsubscribe from this group, send an email to:
> pnfs-reqs-unsubscribe@yahoogroups.com
>
> Your use of Yahoo! Groups is subject to:
> http://docs.yahoo.com/info/terms/
>

From garth@panasas.com Tue Feb 03 08:58:31 2004
Return-Path: <garth@panasas.com>
X-Sender: garth@panasas.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 64240 invoked from network); 3 Feb 2004 16:57:51 -0000
Received: from unknown (66.218.66.166)
by m4.grp.scd.yahoo.com with QMQP; 3 Feb 2004 16:57:51 -0000
Received: from unknown (HELO PIKES.panasas.com) (65.194.124.178)
by mta5.grp.scd.yahoo.com with SMTP; 3 Feb 2004 16:57:51 -0000
Received: from [172.17.2.81] ([172.17.2.81]) by PIKES.panasas.com with SMTP (Microsoft Exchange Internet Mail Service Version 5.5.2653.13)
id SVSYM9KV; Tue, 3 Feb 2004 11:57:45 -0500
Mime-Version: 1.0 (Apple Message framework v612)
Content-Transfer-Encoding: 7bit
Message-Id: <138EA0C4-566A-11D8-8D65-000A95A94F04@panasas.com>
Content-Type: text/plain; charset=US-ASCII; format=flowed
To: pnfs-reqs@yahoogroups.com
Date: Tue, 3 Feb 2004 11:57:40 -0500
X-Mailer: Apple Mail (2.612)
X-eGroups-Remote-IP: 65.194.124.178
From: Garth Gibson <garth@panasas.com>
Subject: call today not needed
X-Yahoo-Group-Post: member; u=169457820
X-Yahoo-Profile: garth_a_gibson

ADVERTISEMENT
We had tentatively scheduled a pNFS concall today at 3pm EST if needed.
I heard from Spencer this morning and he thought the current draft
suited its purpose (still working to talk to Brian). And the only
comment on this mailing list was from Dean (thanks Dean!) -- I'll add
clarifications for his comments today.

garth

From dhildebz@eecs.umich.edu Tue Feb 03 09:51:22 2004
Return-Path: <dhildebz@eecs.umich.edu>
X-Sender: dhildebz@eecs.umich.edu
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 78745 invoked from network); 3 Feb 2004 17:51:22 -0000
Received: from unknown (66.218.66.167)
by m6.grp.scd.yahoo.com with QMQP; 3 Feb 2004 17:51:22 -0000
Received: from unknown (HELO willow.eecs.umich.edu) (141.213.4.14)
by mta6.grp.scd.yahoo.com with SMTP; 3 Feb 2004 17:51:21 -0000
Received: from willow.eecs.umich.edu (localhost.eecs.umich.edu [127.0.0.1])
by willow.eecs.umich.edu (8.12.11/8.12.11) with ESMTP id i13HnYkG006983
(version=TLSv1/SSLv3 cipher=DHE-RSA-AES256-SHA bits=256 verify=NO)
for <pnfs-reqs@yahoogroups.com>; Tue, 3 Feb 2004 12:49:35 -0500
Received: from localhost (dhildebz@localhost)
by willow.eecs.umich.edu (8.12.11/8.12.11/Submit) with ESMTP id i13HnYVV006980
for <pnfs-reqs@yahoogroups.com>; Tue, 3 Feb 2004 12:49:34 -0500
X-Authentication-Warning: willow.eecs.umich.edu: dhildebz owned process doing -bs
Date: Tue, 3 Feb 2004 12:49:34 -0500 (EST)
To: pnfs-reqs@yahoogroups.com
In-Reply-To: <231E5860-5666-11D8-8D65-000A95A94F04@panasas.com>
Message-ID: <Pine.LNX.4.44.0402031240410.5320-100000@willow.eecs.umich.edu>
MIME-Version: 1.0
Content-Type: TEXT/PLAIN; charset=iso-8859-1
Content-Transfer-Encoding: QUOTED-PRINTABLE
X-eGroups-Remote-IP: 141.213.4.14
From: Dean Hildebrand <dhildebz@eecs.umich.edu>
Subject: Re: [pnfs-reqs] new version of problem statement
X-Yahoo-Group-Post: member; u=169352062
X-Yahoo-Profile: seattleplus

I think you are right that the problem statement is not directly concerned
with the statefulness of NFSv4. Adding something about it, or even adding
what you said about spreading the work over the multiple servers might be
useful.

I also noticed some spelling/double word things that I can send along if
useful.
Dean

On Tue, 3 Feb 2004, Garth Gibson wrote:

> Dean,
>
> Thanks!
>
> I will clarify data sharing in this paragraph.  In my experience these 
> two words "data sharing" are the most common way the industry 
> distinguishes any file system from a pure logical volume, so I don't 
> think there is much risk of confusion.
>
> As to NFSv4 versus NFSv3, exporting a writable filesystem through 
> multiple server addresses is rarely done in v3, though a few vendors do 
> this.  Those same vendors, at least, are very likely to export a 
> writable filesystem through v4 over multiple servers,  The key thing as 
> I see it is that neither v3 nor v4 clients have a clue how to use more 
> than one server address to spread out their work.
>
> I'd be happy to add that v4's additional statefulness adds to the 
> complexity of exporting the same filesystem through multiple servers, 
> though as I just argued, I think it is tangential to the main point.
>
> garth
>
>
> On Feb 1, 2004, at 10:52 PM, Dean Hildebrand wrote:
>
> > A couple comments:
> >
> > (1)
> > Page 3, paragraph 2
> >> Storage Area Networks routinely provide much higher data bandwidths 
> >> than
> >> do NFS file servers.  Unfortunately, the simple array of blocks 
> >> interface
> >> into Storage Area Networks does not lend itself to sharing data among 
> >> the
> >> clients in a cluster.  NFS file service, with its hierarchical 
> >> namespace
> >> of separately controlled files, offers simpler and more cost-effective
> >> management.  One might conclude that users must chose between high
> >> bandwidth and data sharing.
> > I'm wondering if the concept of 'data sharing' is obvious here.  I 
> > mean,
> > maybe it should be expanded to make it clear it is talking about file
> > consistency and such.
> >
> > Page 6, 2nd paragraph (Eliminating the bottleneck)
> > There is no mention here of NFSv4 state information.  I beleive the 
> > fact
> > that NFSv4 state information prevents exporting the same file via 
> > multiple
> > NFSv4 servers (as was done in v3) should be mentioned.
> >
> > Dean
> >
> > On Thu, 29 Jan 2004, Garth Gibson wrote:
> >
> >> Here is a PDF version.
> >> garth
> >>
> >> On Jan 29, 2004, at 6:03 AM, Garth Gibson wrote:
> >>
> >>> Here is my Wed pass -- most of the work was in the applications and
> >>> citations sections, but I also did quite a bit in the introduction 
> >>> and
> >>> a little bit in other places.
> >>>
> >>> Weekly concall is at 11am EST today.
> >>>
> >>> garth
> >>>
> >>> Yahoo! Groups Links
> >>>
> >>> To visit your group on the web, go to:
> >>>   http://groups.yahoo.com/group/pnfs-reqs/
> >>>
> >>> To unsubscribe from this group, send an email to:
> >>>   pnfs-reqs-unsubscribe@yahoogroups.com
> >>>
> >>> Your use of Yahoo! Groups is subject to:
> >>>   http://docs.yahoo.com/info/terms/
> >>>
> >>> <draft-gibson-prob-st-00.doc>
> >>
> >>
> >>
> >> ______________________________________________________________________
> >> __________
> >> Yahoo! Groups Links
> >>  *  To visit your group on the web, go to:
> >>     http://groups.yahoo.com/group/pnfs-reqs/
> >>      
> >>  *  To unsubscribe from this group, send an email to:
> >>     pnfs-reqs-unsubscribe@yahoogroups.com
> >>      
> >>  *  Your use of Yahoo! Groups is subject to the Yahoo! Terms of 
> >> Service.
> >>
> >>
> >
> >
> >
> >
> > ------------------------ Yahoo! Groups Sponsor 
> > ---------------------~-->
> > Buy Ink Cartridges or Refill Kits for your HP, Epson, Canon or Lexmark
> > Printer at MyInks.com. Free s/h on orders $50 or more to the US & 
> > Canada.
> > http://www.c1tracking.com/l.asp?cid=5511
> > http://us.click.yahoo.com/mOAaAA/3exGAA/qnsNAA/W6uqlB/TM
> > ---------------------------------------------------------------------
> > ~->
> >
> > Yahoo! Groups Links
> >
> > To visit your group on the web, go to:
> >  http://groups.yahoo.com/group/pnfs-reqs/
> >
> > To unsubscribe from this group, send an email to:
> >  pnfs-reqs-unsubscribe@yahoogroups.com
> >
> > Your use of Yahoo! Groups is subject to:
> >  http://docs.yahoo.com/info/terms/
> >
>
>
> ________________________________________________________________________________
> Yahoo! Groups Links
> * To visit your group on the web, go to:
> http://groups.yahoo.com/group/pnfs-reqs/
>  
> * To unsubscribe from this group, send an email to:
> pnfs-reqs-unsubscribe@yahoogroups.com
>  
> * Your use of Yahoo! Groups is subject to the Yahoo! Terms of Service.
>
> 

From garth@panasas.com Tue Feb 03 10:07:25 2004
Return-Path: <garth@panasas.com>
X-Sender: garth@panasas.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 74656 invoked from network); 3 Feb 2004 18:07:22 -0000
Received: from unknown (66.218.66.166)
by m14.grp.scd.yahoo.com with QMQP; 3 Feb 2004 18:07:22 -0000
Received: from unknown (HELO PIKES.panasas.com) (65.194.124.178)
by mta5.grp.scd.yahoo.com with SMTP; 3 Feb 2004 18:07:21 -0000
Received: from [172.17.2.81] ([172.17.2.81]) by PIKES.panasas.com with SMTP (Microsoft Exchange Internet Mail Service Version 5.5.2653.13)
id SVSYM9Y4; Tue, 3 Feb 2004 13:06:33 -0500
Mime-Version: 1.0 (Apple Message framework v612)
In-Reply-To: <Pine.LNX.4.44.0402031240410.5320-100000@willow.eecs.umich.edu>
References: <Pine.LNX.4.44.0402031240410.5320-100000@willow.eecs.umich.edu>
Content-Type: text/plain; charset=ISO-8859-1; delsp=yes; format=flowed
Message-Id: <AFD66F8E-5673-11D8-8D65-000A95A94F04@panasas.com>
Content-Transfer-Encoding: quoted-printable
Date: Tue, 3 Feb 2004 13:06:27 -0500
To: pnfs-reqs@yahoogroups.com
X-Mailer: Apple Mail (2.612)
X-eGroups-Remote-IP: 65.194.124.178
From: Garth Gibson <garth@panasas.com>
Subject: Re: [pnfs-reqs] new version of problem statement
X-Yahoo-Group-Post: member; u=169457820
X-Yahoo-Profile: garth_a_gibson

Please do send me all spelling/grammar corrections. I want to finish
this asap :-)
garth

On Feb 3, 2004, at 12:49 PM, Dean Hildebrand wrote:

> I think you are right that the problem statement is not directly
> concerned
> with the statefulness of NFSv4. Adding something about it, or even
> adding
> what you said about spreading the work over the multiple servers might
> be
> useful.
>
> I also noticed some spelling/double word things that I can send along
> if
> useful.
> Dean
>
> On Tue, 3 Feb 2004, Garth Gibson wrote:
>
>> Dean,
>>
>> Thanks!
>>
>> I will clarify data sharing in this paragraph.  In my experience
>> these 
>> two words "data sharing" are the most common way the industry 
>> distinguishes any file system from a pure logical volume, so I don't 
>> think there is much risk of confusion.
>>
>> As to NFSv4 versus NFSv3, exporting a writable filesystem through 
>> multiple server addresses is rarely done in v3, though a few vendors
>> do 
>> this.  Those same vendors, at least, are very likely to export a 
>> writable filesystem through v4 over multiple servers,  The key thing
>> as 
>> I see it is that neither v3 nor v4 clients have a clue how to use
>> more 
>> than one server address to spread out their work.
>>
>> I'd be happy to add that v4's additional statefulness adds to the 
>> complexity of exporting the same filesystem through multiple servers, 
>> though as I just argued, I think it is tangential to the main point.
>>
>> garth
>>
>>
>> On Feb 1, 2004, at 10:52 PM, Dean Hildebrand wrote:
>>
>>> A couple comments:
>>>
>>> (1)
>>> Page 3, paragraph 2
>>>> Storage Area Networks routinely provide much higher data bandwidths 
>>>> than
>>>> do NFS file servers.  Unfortunately, the simple array of blocks 
>>>> interface
>>>> into Storage Area Networks does not lend itself to sharing data
>>>> among 
>>>> the
>>>> clients in a cluster.  NFS file service, with its hierarchical 
>>>> namespace
>>>> of separately controlled files, offers simpler and more
>>>> cost-effective
>>>> management.  One might conclude that users must chose between high
>>>> bandwidth and data sharing.
>>> I'm wondering if the concept of 'data sharing' is obvious here.  I 
>>> mean,
>>> maybe it should be expanded to make it clear it is talking about file
>>> consistency and such.
>>>
>>> Page 6, 2nd paragraph (Eliminating the bottleneck)
>>> There is no mention here of NFSv4 state information.  I beleive the 
>>> fact
>>> that NFSv4 state information prevents exporting the same file via 
>>> multiple
>>> NFSv4 servers (as was done in v3) should be mentioned.
>>>
>>> Dean
>>>
>>> On Thu, 29 Jan 2004, Garth Gibson wrote:
>>>
>>>> Here is a PDF version.
>>>> garth
>>>>
>>>> On Jan 29, 2004, at 6:03 AM, Garth Gibson wrote:
>>>>
>>>>> Here is my Wed pass -- most of the work was in the applications and
>>>>> citations sections, but I also did quite a bit in the introduction 
>>>>> and
>>>>> a little bit in other places.
>>>>>
>>>>> Weekly concall is at 11am EST today.
>>>>>
>>>>> garth
>>>>>
>>>>> Yahoo! Groups Links
>>>>>
>>>>> To visit your group on the web, go to:
>>>>>   http://groups.yahoo.com/group/pnfs-reqs/
>>>>>
>>>>> To unsubscribe from this group, send an email to:
>>>>>   pnfs-reqs-unsubscribe@yahoogroups.com
>>>>>
>>>>> Your use of Yahoo! Groups is subject to:
>>>>>   http://docs.yahoo.com/info/terms/
>>>>>
>>>>> <draft-gibson-prob-st-00.doc>
>>>>
>>>>
>>>>
>>>> ____________________________________________________________________
>>>> __
>>>> __________
>>>> Yahoo! Groups Links
>>>>   *  To visit your group on the web, go to:
>>>>      http://groups.yahoo.com/group/pnfs-reqs/
>>>>       
>>>>   *  To unsubscribe from this group, send an email to:
>>>>      pnfs-reqs-unsubscribe@yahoogroups.com
>>>>       
>>>>   *  Your use of Yahoo! Groups is subject to the Yahoo! Terms of 
>>>> Service.
>>>>
>>>>
>>>
>>>
>>>
>>>
>>> ------------------------ Yahoo! Groups Sponsor 
>>> ---------------------~-->
>>> Buy Ink Cartridges or Refill Kits for your HP, Epson, Canon or
>>> Lexmark
>>> Printer at MyInks.com. Free s/h on orders $50 or more to the US & 
>>> Canada.
>>> http://www.c1tracking.com/l.asp?cid=5511
>>> http://us.click.yahoo.com/mOAaAA/3exGAA/qnsNAA/W6uqlB/TM
>>> ---------------------------------------------------------------------
>>> ~->
>>>
>>> Yahoo! Groups Links
>>>
>>> To visit your group on the web, go to:
>>>   http://groups.yahoo.com/group/pnfs-reqs/
>>>
>>> To unsubscribe from this group, send an email to:
>>>   pnfs-reqs-unsubscribe@yahoogroups.com
>>>
>>> Your use of Yahoo! Groups is subject to:
>>>   http://docs.yahoo.com/info/terms/
>>>
>>
>>
>> ______________________________________________________________________
>> __________
>> Yahoo! Groups Links
>> * To visit your group on the web, go to:
>> http://groups.yahoo.com/group/pnfs-reqs/
>>  
>> * To unsubscribe from this group, send an email to:
>> pnfs-reqs-unsubscribe@yahoogroups.com
>>  
>> * Your use of Yahoo! Groups is subject to the Yahoo! Terms of
>> Service.
>>
>>
>
>
>
>
> Yahoo! Groups Links
>
> To visit your group on the web, go to:
> http://groups.yahoo.com/group/pnfs-reqs/
>
> To unsubscribe from this group, send an email to:
> pnfs-reqs-unsubscribe@yahoogroups.com
>
> Your use of Yahoo! Groups is subject to:
> http://docs.yahoo.com/info/terms/
>

From dhildebz@eecs.umich.edu Tue Feb 03 11:23:37 2004
Return-Path: <dhildebz@eecs.umich.edu>
X-Sender: dhildebz@eecs.umich.edu
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 19030 invoked from network); 3 Feb 2004 19:23:34 -0000
Received: from unknown (66.218.66.172)
by m9.grp.scd.yahoo.com with QMQP; 3 Feb 2004 19:23:34 -0000
Received: from unknown (HELO smtp.eecs.umich.edu) (141.213.4.43)
by mta4.grp.scd.yahoo.com with SMTP; 3 Feb 2004 19:23:34 -0000
Received: from oemcomputer (dh152.citi.umich.edu [141.211.133.152])
(authenticated bits=0)
by smtp.eecs.umich.edu (8.12.11/8.12.11) with ESMTP id i13JNTNm024614
(version=TLSv1/SSLv3 cipher=RC4-MD5 bits=128 verify=NO)
for <pnfs-reqs@yahoogroups.com>; Tue, 3 Feb 2004 14:23:29 -0500
Message-ID: <001401c3ea8a$ec1c7420$9885d38d@oemcomputer>
To: <pnfs-reqs@yahoogroups.com>
References: <Pine.LNX.4.44.0402031240410.5320-100000@willow.eecs.umich.edu> <AFD66F8E-5673-11D8-8D65-000A95A94F04@panasas.com>
Date: Tue, 3 Feb 2004 14:21:23 -0500
MIME-Version: 1.0
Content-Type: multipart/mixed; boundary="----=_NextPart_000_0010_01C3EA61.002A3800"
X-Priority: 3
X-MSMail-Priority: Normal
X-Mailer: Microsoft Outlook Express 6.00.2800.1158
X-MimeOLE: Produced By Microsoft MimeOLE V6.00.2800.1165
X-Scanned-By: MIMEDefang 2.39
X-eGroups-Remote-IP: 141.213.4.43
From: "Dean Hildebrand" <dhildebz@eecs.umich.edu>
Subject: Re: [pnfs-reqs] new version of problem statement
X-Yahoo-Group-Post: member; u=169352062
X-Yahoo-Profile: seattleplus

Here is a copy of the World doc with changes to spelling, grammar and such. Feel free to use or ignore any changes. If you View->Markup in Word you should be able to see what I did.
Dean
----- Original Message -----
From: Garth Gibson
To: pnfs-reqs@yahoogroups.com
Sent: Tuesday, February 03, 2004 1:06 PM
Subject: Re: [pnfs-reqs] new version of problem statement


Please do send me all spelling/grammar corrections. I want to finish
this asap :-)
garth

On Feb 3, 2004, at 12:49 PM, Dean Hildebrand wrote:

> I think you are right that the problem statement is not directly
> concerned
> with the statefulness of NFSv4. Adding something about it, or even
> adding
> what you said about spreading the work over the multiple servers might
> be
> useful.
>
> I also noticed some spelling/double word things that I can send along
> if
> useful.
> Dean
>
> On Tue, 3 Feb 2004, Garth Gibson wrote:
>
>> Dean,
>>
>> Thanks!
>>
>> I will clarify data sharing in this paragraph. In my experience
>> these
>> two words "data sharing" are the most common way the industry
>> distinguishes any file system from a pure logical volume, so I don't
>> think there is much risk of confusion.
>>
>> As to NFSv4 versus NFSv3, exporting a writable filesystem through
>> multiple server addresses is rarely done in v3, though a few vendors
>> do
>> this. Those same vendors, at least, are very likely to export a
>> writable filesystem through v4 over multiple servers, The key thing
>> as
>> I see it is that neither v3 nor v4 clients have a clue how to use
>> more
>> than one server address to spread out their work.
>>
>> I'd be happy to add that v4's additional statefulness adds to the
>> complexity of exporting the same filesystem through multiple servers,
>> though as I just argued, I think it is tangential to the main point.
>>
>> garth
>>
>>
>> On Feb 1, 2004, at 10:52 PM, Dean Hildebrand wrote:
>>
>>> A couple comments:
>>>
>>> (1)
>>> Page 3, paragraph 2
>>>> Storage Area Networks routinely provide much higher data bandwidths
>>>> than
>>>> do NFS file servers. Unfortunately, the simple array of blocks
>>>> interface
>>>> into Storage Area Networks does not lend itself to sharing data
>>>> among
>>>> the
>>>> clients in a cluster. NFS file service, with its hierarchical
>>>> namespace
>>>> of separately controlled files, offers simpler and more
>>>> cost-effective
>>>> management. One might conclude that users must chose between high
>>>> bandwidth and data sharing.
>>> I'm wondering if the concept of 'data sharing' is obvious here. I
>>> mean,
>>> maybe it should be expanded to make it clear it is talking about file
>>> consistency and such.
>>>
>>> Page 6, 2nd paragraph (Eliminating the bottleneck)
>>> There is no mention here of NFSv4 state information. I beleive the
>>> fact
>>> that NFSv4 state information prevents exporting the same file via
>>> multiple
>>> NFSv4 servers (as was done in v3) should be mentioned.
>>>
>>> Dean
>>>
>>> On Thu, 29 Jan 2004, Garth Gibson wrote:
>>>
>>>> Here is a PDF version.
>>>> garth
>>>>
>>>> On Jan 29, 2004, at 6:03 AM, Garth Gibson wrote:
>>>>
>>>>> Here is my Wed pass -- most of the work was in the applications and
>>>>> citations sections, but I also did quite a bit in the introduction
>>>>> and
>>>>> a little bit in other places.
>>>>>
>>>>> Weekly concall is at 11am EST today.
>>>>>
>>>>> garth
>>>>>
>>>>> Yahoo! Groups Links
>>>>>
>>>>> To visit your group on the web, go to:
>>>>> http://groups.yahoo.com/group/pnfs-reqs/
>>>>>
>>>>> To unsubscribe from this group, send an email to:
>>>>> pnfs-reqs-unsubscribe@yahoogroups.com
>>>>>
>>>>> Your use of Yahoo! Groups is subject to:
>>>>> http://docs.yahoo.com/info/terms/
>>>>>
>>>>> <draft-gibson-prob-st-00.doc>
>>>>
>>>>
>>>>
>>>> ____________________________________________________________________
>>>> __
>>>> __________
>>>> Yahoo! Groups Links
>>>> * To visit your group on the web, go to:
>>>> http://groups.yahoo.com/group/pnfs-reqs/
>>>>
>>>> * To unsubscribe from this group, send an email to:
>>>> pnfs-reqs-unsubscribe@yahoogroups.com
>>>>
>>>> * Your use of Yahoo! Groups is subject to the Yahoo! Terms of
>>>> Service.
>>>>
>>>>
>>>
>>>
>>>
>>>
>>> ------------------------ Yahoo! Groups Sponsor
>>> ---------------------~-->
>>> Buy Ink Cartridges or Refill Kits for your HP, Epson, Canon or
>>> Lexmark
>>> Printer at MyInks.com. Free s/h on orders $50 or more to the US &
>>> Canada.
>>> http://www.c1tracking.com/l.asp?cid=5511
>>> http://us.click.yahoo.com/mOAaAA/3exGAA/qnsNAA/W6uqlB/TM
>>> ---------------------------------------------------------------------
>>> ~->
>>>
>>> Yahoo! Groups Links
>>>
>>> To visit your group on the web, go to:
>>> http://groups.yahoo.com/group/pnfs-reqs/
>>>
>>> To unsubscribe from this group, send an email to:
>>> pnfs-reqs-unsubscribe@yahoogroups.com
>>>
>>> Your use of Yahoo! Groups is subject to:
>>> http://docs.yahoo.com/info/terms/
>>>
>>
>>
>> ______________________________________________________________________
>> __________
>> Yahoo! Groups Links
>> * To visit your group on the web, go to:
>> http://groups.yahoo.com/group/pnfs-reqs/
>>
>> * To unsubscribe from this group, send an email to:
>> pnfs-reqs-unsubscribe@yahoogroups.com
>>
>> * Your use of Yahoo! Groups is subject to the Yahoo! Terms of
>> Service.
>>
>>
>
>
>
>
> Yahoo! Groups Links
>
> To visit your group on the web, go to:
> http://groups.yahoo.com/group/pnfs-reqs/
>
> To unsubscribe from this group, send an email to:
> pnfs-reqs-unsubscribe@yahoogroups.com
>
> Your use of Yahoo! Groups is subject to:
> http://docs.yahoo.com/info/terms/
>



------------------------------------------------------------------------------
Yahoo! Groups Links

a.. To visit your group on the web, go to:
http://groups.yahoo.com/group/pnfs-reqs/

b.. To unsubscribe from this group, send an email to:
pnfs-reqs-unsubscribe@yahoogroups.com

c.. Your use of Yahoo! Groups is subject to the Yahoo! Terms of Service.



Attachment (not stored)
draft-gibson-prob-st-00.doc
Type: application/msword

From dhildebz@eecs.umich.edu Tue Feb 03 11:24:03 2004
Return-Path: <dhildebz@eecs.umich.edu>
X-Sender: dhildebz@eecs.umich.edu
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 847 invoked from network); 3 Feb 2004 19:24:01 -0000
Received: from unknown (66.218.66.172)
by m6.grp.scd.yahoo.com with QMQP; 3 Feb 2004 19:24:01 -0000
Received: from unknown (HELO smtp.eecs.umich.edu) (141.213.4.43)
by mta4.grp.scd.yahoo.com with SMTP; 3 Feb 2004 19:24:00 -0000
Received: from oemcomputer (dh152.citi.umich.edu [141.211.133.152])
(authenticated bits=0)
by smtp.eecs.umich.edu (8.12.11/8.12.11) with ESMTP id i13JMw1v024533
(version=TLSv1/SSLv3 cipher=RC4-MD5 bits=128 verify=NO)
for <pnfs-reqs@yahoogroups.com>; Tue, 3 Feb 2004 14:22:58 -0500
Message-ID: <000801c3ea8a$d9ec6300$9885d38d@oemcomputer>
To: <pnfs-reqs@yahoogroups.com>
References: <Pine.LNX.4.44.0402031240410.5320-100000@willow.eecs.umich.edu> <AFD66F8E-5673-11D8-8D65-000A95A94F04@panasas.com>
Date: Tue, 3 Feb 2004 14:20:52 -0500
MIME-Version: 1.0
Content-Type: multipart/mixed; boundary="----=_NextPart_000_0004_01C3EA60.EDD27AA0"
X-Priority: 3
X-MSMail-Priority: Normal
X-Mailer: Microsoft Outlook Express 6.00.2800.1158
X-MimeOLE: Produced By Microsoft MimeOLE V6.00.2800.1165
X-Scanned-By: MIMEDefang 2.39
X-eGroups-Remote-IP: 141.213.4.43
From: "Dean Hildebrand" <dhildebz@eecs.umich.edu>
Subject: Re: [pnfs-reqs] new version of problem statement
X-Yahoo-Group-Post: member; u=169352062
X-Yahoo-Profile: seattleplus

ADVERTISEMENT
click here
Here is a copy of the World doc with changes to spelling, grammar and such. Feel free to use or ignore any changes. If you View->Markup in Word you should be able to see what I did.
Dean
----- Original Message -----
From: Garth Gibson
To: pnfs-reqs@yahoogroups.com
Sent: Tuesday, February 03, 2004 1:06 PM
Subject: Re: [pnfs-reqs] new version of problem statement


Please do send me all spelling/grammar corrections. I want to finish
this asap :-)
garth

On Feb 3, 2004, at 12:49 PM, Dean Hildebrand wrote:

> I think you are right that the problem statement is not directly
> concerned
> with the statefulness of NFSv4. Adding something about it, or even
> adding
> what you said about spreading the work over the multiple servers might
> be
> useful.
>
> I also noticed some spelling/double word things that I can send along
> if
> useful.
> Dean
>
> On Tue, 3 Feb 2004, Garth Gibson wrote:
>
>> Dean,
>>
>> Thanks!
>>
>> I will clarify data sharing in this paragraph. In my experience
>> these
>> two words "data sharing" are the most common way the industry
>> distinguishes any file system from a pure logical volume, so I don't
>> think there is much risk of confusion.
>>
>> As to NFSv4 versus NFSv3, exporting a writable filesystem through
>> multiple server addresses is rarely done in v3, though a few vendors
>> do
>> this. Those same vendors, at least, are very likely to export a
>> writable filesystem through v4 over multiple servers, The key thing
>> as
>> I see it is that neither v3 nor v4 clients have a clue how to use
>> more
>> than one server address to spread out their work.
>>
>> I'd be happy to add that v4's additional statefulness adds to the
>> complexity of exporting the same filesystem through multiple servers,
>> though as I just argued, I think it is tangential to the main point.
>>
>> garth
>>
>>
>> On Feb 1, 2004, at 10:52 PM, Dean Hildebrand wrote:
>>
>>> A couple comments:
>>>
>>> (1)
>>> Page 3, paragraph 2
>>>> Storage Area Networks routinely provide much higher data bandwidths
>>>> than
>>>> do NFS file servers. Unfortunately, the simple array of blocks
>>>> interface
>>>> into Storage Area Networks does not lend itself to sharing data
>>>> among
>>>> the
>>>> clients in a cluster. NFS file service, with its hierarchical
>>>> namespace
>>>> of separately controlled files, offers simpler and more
>>>> cost-effective
>>>> management. One might conclude that users must chose between high
>>>> bandwidth and data sharing.
>>> I'm wondering if the concept of 'data sharing' is obvious here. I
>>> mean,
>>> maybe it should be expanded to make it clear it is talking about file
>>> consistency and such.
>>>
>>> Page 6, 2nd paragraph (Eliminating the bottleneck)
>>> There is no mention here of NFSv4 state information. I beleive the
>>> fact
>>> that NFSv4 state information prevents exporting the same file via
>>> multiple
>>> NFSv4 servers (as was done in v3) should be mentioned.
>>>
>>> Dean
>>>
>>> On Thu, 29 Jan 2004, Garth Gibson wrote:
>>>
>>>> Here is a PDF version.
>>>> garth
>>>>
>>>> On Jan 29, 2004, at 6:03 AM, Garth Gibson wrote:
>>>>
>>>>> Here is my Wed pass -- most of the work was in the applications and
>>>>> citations sections, but I also did quite a bit in the introduction
>>>>> and
>>>>> a little bit in other places.
>>>>>
>>>>> Weekly concall is at 11am EST today.
>>>>>
>>>>> garth
>>>>>
>>>>> Yahoo! Groups Links
>>>>>
>>>>> To visit your group on the web, go to:
>>>>> http://groups.yahoo.com/group/pnfs-reqs/
>>>>>
>>>>> To unsubscribe from this group, send an email to:
>>>>> pnfs-reqs-unsubscribe@yahoogroups.com
>>>>>
>>>>> Your use of Yahoo! Groups is subject to:
>>>>> http://docs.yahoo.com/info/terms/
>>>>>
>>>>> <draft-gibson-prob-st-00.doc>
>>>>
>>>>
>>>>
>>>> ____________________________________________________________________
>>>> __
>>>> __________
>>>> Yahoo! Groups Links
>>>> * To visit your group on the web, go to:
>>>> http://groups.yahoo.com/group/pnfs-reqs/
>>>>
>>>> * To unsubscribe from this group, send an email to:
>>>> pnfs-reqs-unsubscribe@yahoogroups.com
>>>>
>>>> * Your use of Yahoo! Groups is subject to the Yahoo! Terms of
>>>> Service.
>>>>
>>>>
>>>
>>>
>>>
>>>
>>> ------------------------ Yahoo! Groups Sponsor
>>> ---------------------~-->
>>> Buy Ink Cartridges or Refill Kits for your HP, Epson, Canon or
>>> Lexmark
>>> Printer at MyInks.com. Free s/h on orders $50 or more to the US &
>>> Canada.
>>> http://www.c1tracking.com/l.asp?cid=5511
>>> http://us.click.yahoo.com/mOAaAA/3exGAA/qnsNAA/W6uqlB/TM
>>> ---------------------------------------------------------------------
>>> ~->
>>>
>>> Yahoo! Groups Links
>>>
>>> To visit your group on the web, go to:
>>> http://groups.yahoo.com/group/pnfs-reqs/
>>>
>>> To unsubscribe from this group, send an email to:
>>> pnfs-reqs-unsubscribe@yahoogroups.com
>>>
>>> Your use of Yahoo! Groups is subject to:
>>> http://docs.yahoo.com/info/terms/
>>>
>>
>>
>> ______________________________________________________________________
>> __________
>> Yahoo! Groups Links
>> * To visit your group on the web, go to:
>> http://groups.yahoo.com/group/pnfs-reqs/
>>
>> * To unsubscribe from this group, send an email to:
>> pnfs-reqs-unsubscribe@yahoogroups.com
>>
>> * Your use of Yahoo! Groups is subject to the Yahoo! Terms of
>> Service.
>>
>>
>
>
>
>
> Yahoo! Groups Links
>
> To visit your group on the web, go to:
> http://groups.yahoo.com/group/pnfs-reqs/
>
> To unsubscribe from this group, send an email to:
> pnfs-reqs-unsubscribe@yahoogroups.com
>
> Your use of Yahoo! Groups is subject to:
> http://docs.yahoo.com/info/terms/
>



------------------------------------------------------------------------------
Yahoo! Groups Links

a.. To visit your group on the web, go to:
http://groups.yahoo.com/group/pnfs-reqs/

b.. To unsubscribe from this group, send an email to:
pnfs-reqs-unsubscribe@yahoogroups.com

c.. Your use of Yahoo! Groups is subject to the Yahoo! Terms of Service.



Attachment (not stored)
draft-gibson-prob-st-00.doc
Type: application/msword

From Thomas.Talpey@netapp.com Thu Feb 05 08:34:36 2004
Return-Path: <Thomas.Talpey@netapp.com>
X-Sender: Thomas.Talpey@netapp.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 64988 invoked from network); 5 Feb 2004 16:34:34 -0000
Received: from unknown (66.218.66.166)
by m15.grp.scd.yahoo.com with QMQP; 5 Feb 2004 16:34:34 -0000
Received: from unknown (HELO mx01.netapp.com) (198.95.226.53)
by mta5.grp.scd.yahoo.com with SMTP; 5 Feb 2004 16:34:34 -0000
Received: from hawk.corp.netapp.com (hawk [10.57.156.122])
by mx01.netapp.com (8.12.10/8.12.10/NTAP-1.4) with ESMTP id i15GY4Kw028307
for <pnfs-reqs@yahoogroups.com>; Thu, 5 Feb 2004 08:34:04 -0800 (PST)
Received: from svlexc01.hq.netapp.com (svlexc01.corp.netapp.com [10.10.22.171])
by hawk.corp.netapp.com (8.12.9/8.12.9/NTAP-1.5) with ESMTP id i15GXnS5024214
for <pnfs-reqs@yahoogroups.com>; Thu, 5 Feb 2004 08:34:03 -0800 (PST)
Received: from tmt.netapp.com ([10.97.6.31]) by silver.hq.netapp.com with Microsoft SMTPSVC(5.0.2195.5329); Thu, 5 Feb 2004 08:15:44 -0500
MIME-Version: 1.0
Content-Type: multipart/alternative;
boundary="----_=_NextPart_001_01C3EBEA.290B8000"
X-MimeOLE: Produced By Microsoft Exchange V6.0.6249.0
content-class: urn:content-classes:message
Date: Thu, 5 Feb 2004 05:15:29 -0800
Message-ID: <5.2.1.1.2.20040205080928.035852f8@silver.nane.netapp.com>
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
Thread-Topic: Do we have a concall today?
Thread-Index: AcPr6imDHfvfdLxESLynAqqz6p5XGQ==
To: <pnfs-reqs@yahoogroups.com>
X-eGroups-Remote-IP: 198.95.226.53
From: "Talpey, Thomas" <Thomas.Talpey@netapp.com>
Subject: Do we have a concall today?
X-Yahoo-Group-Post: member; u=44154239
X-Yahoo-Profile: tmtymailu
X-eGroups-Rocket-Track: -10 ; IPCR=n-w0,n100,g0

Are we on for a concall today (in a couple of hours) I assume?

The final submission deadline is Monday 9am Eastern, and we need
to wrap up the edits and send it pronto.

Tom.


From garth@panasas.com Thu Feb 05 08:42:27 2004
Return-Path: <garth@panasas.com>
X-Sender: garth@panasas.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 60107 invoked from network); 5 Feb 2004 16:42:26 -0000
Received: from unknown (66.218.66.172)
by m9.grp.scd.yahoo.com with QMQP; 5 Feb 2004 16:42:26 -0000
Received: from unknown (HELO PIKES.panasas.com) (65.194.124.178)
by mta4.grp.scd.yahoo.com with SMTP; 5 Feb 2004 16:42:26 -0000
Received: from [127.0.0.1] ([172.17.19.3]) by PIKES.panasas.com with SMTP (Microsoft Exchange Internet Mail Service Version 5.5.2653.13)
id SVSYNK52; Thu, 5 Feb 2004 11:42:22 -0500
Mime-Version: 1.0 (Apple Message framework v612)
In-Reply-To: <5.2.1.1.2.20040205080928.035852f8@silver.nane.netapp.com>
References: <5.2.1.1.2.20040205080928.035852f8@silver.nane.netapp.com>
Content-Type: text/plain; charset=US-ASCII; format=flowed
Message-Id: <41A40A7E-57FA-11D8-8D65-000A95A94F04@panasas.com>
Content-Transfer-Encoding: 7bit
Date: Thu, 5 Feb 2004 11:42:16 -0500
To: pnfs-reqs@yahoogroups.com
X-Mailer: Apple Mail (2.612)
X-eGroups-Remote-IP: 65.194.124.178
From: Garth Gibson <garth@panasas.com>
Subject: Re: [pnfs-reqs] Do we have a concall today?
X-Yahoo-Group-Post: member; u=169457820
X-Yahoo-Profile: garth_a_gibson
X-eGroups-Rocket-Track: -10

We had a concall, Tom and me, on getting this done.

I am doing the conversion to ASCI today. Thanks to Tom I have
instructions, but as I look at it, my confidence is not that high. If
anyone has a magic bullet, please speak up.

garth

On Feb 5, 2004, at 8:15 AM, Talpey, Thomas wrote:

> Are we on for a concall today (in a couple of hours) I assume?
>
> The final submission deadline is Monday 9am Eastern, and we need
> to wrap up the edits and send it pronto.
>
> Tom. 

From ggrider@lanl.gov Thu Feb 05 09:45:28 2004
Return-Path: <ggrider@lanl.gov>
X-Sender: ggrider@lanl.gov
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 28495 invoked from network); 5 Feb 2004 17:45:18 -0000
Received: from unknown (66.218.66.167)
by m20.grp.scd.yahoo.com with QMQP; 5 Feb 2004 17:45:18 -0000
Received: from unknown (HELO mailwasher-b.lanl.gov) (192.16.0.25)
by mta6.grp.scd.yahoo.com with SMTP; 5 Feb 2004 17:45:14 -0000
Received: from mailrelay1.lanl.gov (localhost.localdomain [127.0.0.1])
by mailwasher-b.lanl.gov (8.12.10/8.12.10/(ccn-5)) with ESMTP id i15Hj2HR023161
for <pnfs-reqs@yahoogroups.com>; Thu, 5 Feb 2004 10:45:03 -0700
Received: from cic-mail.lanl.gov (localhost.localdomain [127.0.0.1])
by mailrelay1.lanl.gov (8.12.10/8.12.10/(ccn-5)) with ESMTP id i15Hj2rl030787
for <pnfs-reqs@yahoogroups.com>; Thu, 5 Feb 2004 10:45:02 -0700
Received: from cthulu.lanl.gov (cthulu.lanl.gov [128.165.115.129])
by cic-mail.lanl.gov (8.12.10/8.12.10/(ccn-5)) with ESMTP id i15Hj2Yi025881
for <pnfs-reqs@yahoogroups.com>; Thu, 5 Feb 2004 10:45:02 -0700
Message-Id: <5.2.0.9.2.20040205104444.0154f488@cic-mail.lanl.gov>
X-Sender: ggrider@cic-mail.lanl.gov
X-Mailer: QUALCOMM Windows Eudora Version 5.2.0.9
Date: Thu, 05 Feb 2004 10:45:02 -0700
To: pnfs-reqs@yahoogroups.com
In-Reply-To: <41A40A7E-57FA-11D8-8D65-000A95A94F04@panasas.com>
References: <5.2.1.1.2.20040205080928.035852f8@silver.nane.netapp.com>
<5.2.1.1.2.20040205080928.035852f8@silver.nane.netapp.com>
Mime-Version: 1.0
Content-Type: multipart/alternative;
boundary="=====================_8417043==.ALT"
X-Scanned-By: MIMEDefang 2.35
X-eGroups-Remote-IP: 192.16.0.25
From: Gary Grider <ggrider@lanl.gov>
Subject: Re: [pnfs-reqs] Do we have a concall today?
X-Yahoo-Group-Post: member; u=169341185
X-Yahoo-Profile: ggriderpnfs
X-eGroups-Rocket-Track: 1: 100 ; IPCR=n-w0,n100,g0 ; SERVER=66.218.86.251

Your confidence is not high on?

Thanks
Gary

At 11:42 AM 2/5/2004 -0500, you wrote:

> We had a concall, Tom and me, on getting this done.
>
> I am doing the conversion to ASCI today.  Thanks to Tom I have
> instructions, but as I look at it, my confidence is not that high.  If
> anyone has a magic bullet, please speak up.
>
> garth
>
> On Feb 5, 2004, at 8:15 AM, Talpey, Thomas wrote:
>
> > Are we on for a concall today (in a couple of hours) I assume?
> >
> > The final submission deadline is Monday 9am Eastern, and we need
> > to wrap up the edits and send it pronto.
> >
> > Tom.
>
>
> Yahoo! Groups Links
>
>     * To visit your group on the web, go to:
>     * http://groups.yahoo.com/group/pnfs-reqs/
>     *  
>     * To unsubscribe from this group, send an email to:
>     * pnfs-reqs-unsubscribe@yahoogroups.com
>     *  
>     * Your use of Yahoo! Groups is subject to the Yahoo! Terms of Service. 

From garth@panasas.com Thu Feb 05 09:51:58 2004
Return-Path: <garth@panasas.com>
X-Sender: garth@panasas.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 33445 invoked from network); 5 Feb 2004 17:51:57 -0000
Received: from unknown (66.218.66.167)
by m13.grp.scd.yahoo.com with QMQP; 5 Feb 2004 17:51:56 -0000
Received: from unknown (HELO PIKES.panasas.com) (65.194.124.178)
by mta6.grp.scd.yahoo.com with SMTP; 5 Feb 2004 17:51:55 -0000
Received: from [127.0.0.1] ([172.17.19.3]) by PIKES.panasas.com with SMTP (Microsoft Exchange Internet Mail Service Version 5.5.2653.13)
id SVSYNLFF; Thu, 5 Feb 2004 12:51:18 -0500
Mime-Version: 1.0 (Apple Message framework v612)
In-Reply-To: <5.2.0.9.2.20040205104444.0154f488@cic-mail.lanl.gov>
References: <5.2.1.1.2.20040205080928.035852f8@silver.nane.netapp.com> <5.2.1.1.2.20040205080928.035852f8@silver.nane.netapp.com> <5.2.0.9.2.20040205104444.0154f488@cic-mail.lanl.gov>
Content-Type: text/plain; charset=ISO-8859-1; format=flowed
Message-Id: <E2C35E6F-5803-11D8-8D65-000A95A94F04@panasas.com>
Content-Transfer-Encoding: quoted-printable
Date: Thu, 5 Feb 2004 12:51:12 -0500
To: pnfs-reqs@yahoogroups.com
X-Mailer: Apple Mail (2.612)
X-eGroups-Remote-IP: 65.194.124.178
From: Garth Gibson <garth@panasas.com>
Subject: Re: [pnfs-reqs] Do we have a concall today?
X-Yahoo-Group-Post: member; u=169457820
X-Yahoo-Profile: garth_a_gibson
X-eGroups-Rocket-Track: 1: 100 ; SERVER=66.218.86.249

converting Word to IETF compliant ASCI :-)

On Feb 5, 2004, at 12:45 PM, Gary Grider wrote:

> Your confidence is not high on?
>
> Thanks
> Gary
>
> At 11:42 AM 2/5/2004 -0500, you wrote:
>
> We had a concall, Tom and me, on getting this done.
>
> I am doing the conversion to ASCI today.  Thanks to Tom I have
> instructions, but as I look at it, my confidence is not that high.  If
> anyone has a magic bullet, please speak up.
>
> garth

From garth@panasas.com Thu Feb 05 13:48:03 2004
Return-Path: <garth@panasas.com>
X-Sender: garth@panasas.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 18326 invoked from network); 5 Feb 2004 21:48:01 -0000
Received: from unknown (66.218.66.172)
by m12.grp.scd.yahoo.com with QMQP; 5 Feb 2004 21:48:01 -0000
Received: from unknown (HELO PIKES.panasas.com) (65.194.124.178)
by mta4.grp.scd.yahoo.com with SMTP; 5 Feb 2004 21:47:59 -0000
Received: from [127.0.0.1] ([172.17.19.3]) by PIKES.panasas.com with SMTP (Microsoft Exchange Internet Mail Service Version 5.5.2653.13)
id SVSYNMS9; Thu, 5 Feb 2004 16:46:40 -0500
Mime-Version: 1.0 (Apple Message framework v612)
To: pnfs-reqs@yahoogroups.com
Message-Id: <C47E2635-5824-11D8-8D65-000A95A94F04@panasas.com>
Content-Type: multipart/mixed; boundary=Apple-Mail-51-1073630512
Date: Thu, 5 Feb 2004 16:46:34 -0500
X-Mailer: Apple Mail (2.612)
X-eGroups-Remote-IP: 65.194.124.178
From: Garth Gibson <garth@panasas.com>
Subject: working on the Word to IETF conversion
X-Yahoo-Group-Post: member; u=169457820
X-Yahoo-Profile: garth_a_gibson
X-eGroups-Rocket-Track: 1: 100 ; SERVER=66.218.86.252

Here are two Word files. The first is the draft after all content
editing. The second is my attempt to reset margins and indents in
order to use print to generic/text-only file to get an IETF ASCI
document. I'm not doing well. Still working at it. Please help.



Attachment (not stored)
draft-gibson-prob-st-00.doc
Type: application/applefile

Attachment (not stored)
draft-gibson-prob-st-00.doc
Type: application/msword

Attachment (not stored)
draft-gibson-prob-st-00-1.doc
Type: application/applefile

Attachment (not stored)
draft-gibson-prob-st-00-1.doc
Type: application/msword

From bhalevy@panasas.com Thu Feb 05 13:56:05 2004
Return-Path: <bhalevy@panasas.com>
X-Sender: bhalevy@panasas.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 14661 invoked from network); 5 Feb 2004 21:55:25 -0000
Received: from unknown (66.218.66.218)
by m20.grp.scd.yahoo.com with QMQP; 5 Feb 2004 21:55:25 -0000
Received: from unknown (HELO PIKES.panasas.com) (65.194.124.178)
by mta3.grp.scd.yahoo.com with SMTP; 5 Feb 2004 21:55:21 -0000
Received: by PIKES.panasas.com with Internet Mail Service (5.5.2653.19)
id <SVSYNM4T>; Thu, 5 Feb 2004 16:54:54 -0500
Message-ID: <30489F1321F5C343ACF6872B2CF7942A05D38832@PIKES.panasas.com>
To: "'pnfs-reqs@yahoogroups.com'" <pnfs-reqs@yahoogroups.com>
Date: Thu, 5 Feb 2004 16:54:41 -0500
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2653.19)
Content-Type: text/plain;
charset="iso-8859-1"
X-eGroups-Remote-IP: 65.194.124.178
From: "Halevy, Benny" <bhalevy@panasas.com>
Subject: RE: [pnfs-reqs] working on the Word to IETF conversion
X-Yahoo-Group-Post: member; u=169276676
X-Yahoo-Profile: benny_halevy
X-eGroups-Rocket-Track: 1: 100 ; SERVER=66.218.86.251

Garth, I'll take a stab at it if no one else (with better
experience with submitting I-Ds) volunteers...

I'll need to do some homework going over
http://www.ietf.org/ietf/1id-guidelines.txt,
http://www.ietf.org/rfc/rfc2026.txt, and
http://www.ietf.org/ID-nits.html

Benny

> -----Original Message-----
> From: Garth Gibson [mailto:garth@Panasas.Com]
> Sent: Thursday, February 05, 2004 4:47 PM
> To: pnfs-reqs@yahoogroups.com
> Subject: [pnfs-reqs] working on the Word to IETF conversion
>
>
> Here are two Word files. The first is the draft after all content
> editing. The second is my attempt to reset margins and indents in
> order to use print to generic/text-only file to get an IETF ASCI
> document. I'm not doing well. Still working at it. Please help.
>
>
>
> ------------------------ Yahoo! Groups Sponsor
> ---------------------~-->
> Buy Ink Cartridges or Refill Kits for your HP, Epson, Canon or Lexmark
> Printer at MyInks.com. Free s/h on orders $50 or more to the
> US & Canada.
> http://www.c1tracking.com/l.asp?cid=5511
> http://us.click.yahoo.com/mOAaAA/3exGAA/qnsNAA/W6uqlB/TM
> --------------------------------------------------------------
> -------~->
>
>
> Yahoo! Groups Links
>
>
>
>
> 

From black_david@emc.com Thu Feb 05 22:03:14 2004
Return-Path: <Black_David@emc.com>
X-Sender: Black_David@emc.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 6396 invoked from network); 6 Feb 2004 06:03:14 -0000
Received: from unknown (66.218.66.217)
by m11.grp.scd.yahoo.com with QMQP; 6 Feb 2004 06:03:14 -0000
Received: from unknown (HELO srexchimc2.eng.emc.com) (168.159.100.11)
by mta2.grp.scd.yahoo.com with SMTP; 6 Feb 2004 06:03:13 -0000
Received: from MAHO3MSX2.corp.emc.com (maho3msx2.isus.emc.com [128.221.11.32]) by srexchimc2.eng.emc.com with SMTP (Microsoft Exchange Internet Mail Service Version 5.5.2657.72)
id DWRHDBH1; Fri, 6 Feb 2004 01:02:22 -0500
Received: by maho3msx2.corp.emc.com with Internet Mail Service (5.5.2653.19)
id <DK6RM5L1>; Fri, 6 Feb 2004 01:02:21 -0500
Message-ID: <B459CE1AFFC52D4688B2A5B842CA35EA7A5522@corpmx14.corp.emc.com>
X-Sybari-Trust: 2b481d31 b1a25add bdf41840 0000013d
To: pnfs-reqs@yahoogroups.com
Date: Fri, 6 Feb 2004 01:02:20 -0500
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2653.19)
Content-Type: text/plain
X-eGroups-Remote-IP: 168.159.100.11
From: black_david@emc.com
Subject: RE: [pnfs-reqs] working on the Word to IETF conversion
X-Yahoo-Group-Post: member; u=82420288
X-Yahoo-Profile: dlb237
X-eGroups-Rocket-Track: 1: 100 ; SERVER=66.218.86.248

ADVERTISEMENT
Garth,

> Here are two Word files. The first is the draft after all content
> editing. The second is my attempt to reset margins and indents in
> order to use print to generic/text-only file to get an IETF ASCI
> document. I'm not doing well. Still working at it. Please help.

You're fighting a losing battle - MS Word has entirely too many
tricks up its sleeve. Fortunately, it has been beaten into submission
by experts in the past, who have left instructions on how to do
it in RFC 3285 (http://www.ietf.org/rfc/rfc3285.txt). You want to
get the MS Word template from one of the locations provided in that
RFC, cut and paste your entire content as unformatted text into
a new file based on that template. It is crucial to use only the RFC
text styles in that template - there should be no text in *any* other
style (e.g., Normal, Heading) when you're done). Then follow the
instructions in the RFC to print to a file via a Text-only printer
("Save As" won't work, even when saving as a text file, as it
allows Word too much latitude to play games) and run the
CRLF utility over the resulting text file before submitting.

Thanks,
--David
----------------------------------------------------
David L. Black, Senior Technologist
EMC Corporation, 176 South St., Hopkinton, MA 01748
+1 (508) 293-7953 FAX: +1 (508) 293-7786
black_david@emc.com Mobile: +1 (978) 394-7754
----------------------------------------------------

> -----Original Message-----
> From: Garth Gibson [mailto:garth@panasas.com]
> Sent: Thursday, February 05, 2004 4:47 PM
> To: pnfs-reqs@yahoogroups.com
> Subject: [pnfs-reqs] working on the Word to IETF conversion
>
>
> Here are two Word files. The first is the draft after all content
> editing. The second is my attempt to reset margins and indents in
> order to use print to generic/text-only file to get an IETF ASCI
> document. I'm not doing well. Still working at it. Please help.
>
>
>
> ------------------------ Yahoo! Groups Sponsor
> ---------------------~-->
> Buy Ink Cartridges or Refill Kits for your HP, Epson, Canon or Lexmark
> Printer at MyInks.com. Free s/h on orders $50 or more to the
> US & Canada.
> http://www.c1tracking.com/l.asp?cid=5511
> http://us.click.yahoo.com/mOAaAA/3exGAA/qnsNAA/W6uqlB/TM
> --------------------------------------------------------------
> -------~->
>
>
> Yahoo! Groups Links
>
>
>
>
> 

From julian_satran@il.ibm.com Fri Feb 06 01:29:54 2004
Return-Path: <Julian_Satran@il.ibm.com>
X-Sender: Julian_Satran@il.ibm.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 78483 invoked from network); 6 Feb 2004 09:29:53 -0000
Received: from unknown (66.218.66.172)
by m12.grp.scd.yahoo.com with QMQP; 6 Feb 2004 09:29:53 -0000
Received: from unknown (HELO mtagate3.uk.ibm.com) (195.212.29.136)
by mta4.grp.scd.yahoo.com with SMTP; 6 Feb 2004 09:29:52 -0000
Received: from d06nrmr1407.portsmouth.uk.ibm.com (d06nrmr1407.portsmouth.uk.ibm.com [9.149.38.185])
by mtagate3.uk.ibm.com (8.12.10/8.12.10) with ESMTP id i169TiMf108972
for <pnfs-reqs@yahoogroups.com>; Fri, 6 Feb 2004 09:29:44 GMT
Received: from d12ml102.megacenter.de.ibm.com (d06av02.portsmouth.uk.ibm.com [9.149.37.228])
by d06nrmr1407.portsmouth.uk.ibm.com (8.12.10/NCO/VER6.6) with ESMTP id i169ThHh176618
for <pnfs-reqs@yahoogroups.com>; Fri, 6 Feb 2004 09:29:44 GMT
In-Reply-To: <B459CE1AFFC52D4688B2A5B842CA35EA7A5522@corpmx14.corp.emc.com>
To: pnfs-reqs@yahoogroups.com
Cc: pnfs-reqs@yahoogroups.com
MIME-Version: 1.0
X-Mailer: Lotus Notes Release 6.5 September 26, 2003
Message-ID: <OF8763D48A.054DAC1C-ONC2256E32.002EBFA5-C2256E32.00342921@il.ibm.com>
Date: Fri, 6 Feb 2004 11:29:42 +0200
X-MIMETrack: Serialize by Router on D12ML102/12/M/IBM(Release 6.0.2CF2|July 23, 2003) at
06/02/2004 11:29:43,
Serialize complete at 06/02/2004 11:29:43
Content-Type: text/plain; charset="US-ASCII"
X-eGroups-Remote-IP: 195.212.29.136
From: Julian Satran <julian_satran@il.ibm.com>
Subject: RE: [pnfs-reqs] working on the Word to IETF conversion
X-Yahoo-Group-Post: member; u=64714603
X-Yahoo-Profile: julian_satran

Garth,

Go for the file I sent you. It should have all you need (including
numbering not available in the RFC).
If it does not work switch to framemaker. It will take you around an hour
if you know your way and two-three if you don't.

Regards,
Julo



black_david@emc.com
06/02/2004 08:02
Please respond to
pnfs-reqs


To
pnfs-reqs@yahoogroups.com
cc

Subject
RE: [pnfs-reqs] working on the Word to IETF conversion






Garth,

> Here are two Word files. The first is the draft after all content
> editing. The second is my attempt to reset margins and indents in
> order to use print to generic/text-only file to get an IETF ASCI
> document. I'm not doing well. Still working at it. Please help.

You're fighting a losing battle - MS Word has entirely too many
tricks up its sleeve. Fortunately, it has been beaten into submission
by experts in the past, who have left instructions on how to do
it in RFC 3285 (http://www.ietf.org/rfc/rfc3285.txt). You want to
get the MS Word template from one of the locations provided in that
RFC, cut and paste your entire content as unformatted text into
a new file based on that template. It is crucial to use only the RFC
text styles in that template - there should be no text in *any* other
style (e.g., Normal, Heading) when you're done). Then follow the
instructions in the RFC to print to a file via a Text-only printer
("Save As" won't work, even when saving as a text file, as it
allows Word too much latitude to play games) and run the
CRLF utility over the resulting text file before submitting.

Thanks,
--David
----------------------------------------------------
David L. Black, Senior Technologist
EMC Corporation, 176 South St., Hopkinton, MA 01748
+1 (508) 293-7953 FAX: +1 (508) 293-7786
black_david@emc.com Mobile: +1 (978) 394-7754
----------------------------------------------------

> -----Original Message-----
> From: Garth Gibson [mailto:garth@panasas.com]
> Sent: Thursday, February 05, 2004 4:47 PM
> To: pnfs-reqs@yahoogroups.com
> Subject: [pnfs-reqs] working on the Word to IETF conversion
>
>
> Here are two Word files. The first is the draft after all content
> editing. The second is my attempt to reset margins and indents in
> order to use print to generic/text-only file to get an IETF ASCI
> document. I'm not doing well. Still working at it. Please help.
>
>
>
> ------------------------ Yahoo! Groups Sponsor
> ---------------------~-->
> Buy Ink Cartridges or Refill Kits for your HP, Epson, Canon or Lexmark
> Printer at MyInks.com. Free s/h on orders $50 or more to the
> US & Canada.
> http://www.c1tracking.com/l.asp?cid=5511
> http://us.click.yahoo.com/mOAaAA/3exGAA/qnsNAA/W6uqlB/TM
> --------------------------------------------------------------
> -------~->
>
>
> Yahoo! Groups Links
>
>
>
>
>




Yahoo! Groups Links

From julian_satran@il.ibm.com Fri Feb 06 01:30:43 2004
Return-Path: <Julian_Satran@il.ibm.com>
X-Sender: Julian_Satran@il.ibm.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 34778 invoked from network); 6 Feb 2004 09:30:39 -0000
Received: from unknown (66.218.66.217)
by m14.grp.scd.yahoo.com with QMQP; 6 Feb 2004 09:30:38 -0000
Received: from unknown (HELO mtagate3.uk.ibm.com) (195.212.29.136)
by mta2.grp.scd.yahoo.com with SMTP; 6 Feb 2004 09:30:32 -0000
Received: from d06nrmr1407.portsmouth.uk.ibm.com (d06nrmr1407.portsmouth.uk.ibm.com [9.149.38.185])
by mtagate3.uk.ibm.com (8.12.10/8.12.10) with ESMTP id i169TiMf126760
for <pnfs-reqs@yahoogroups.com>; Fri, 6 Feb 2004 09:29:44 GMT
Received: from d12ml102.megacenter.de.ibm.com (d06av02.portsmouth.uk.ibm.com [9.149.37.228])
by d06nrmr1407.portsmouth.uk.ibm.com (8.12.10/NCO/VER6.6) with ESMTP id i169ThHg176618
for <pnfs-reqs@yahoogroups.com>; Fri, 6 Feb 2004 09:29:43 GMT
In-Reply-To: <C47E2635-5824-11D8-8D65-000A95A94F04@panasas.com>
To: pnfs-reqs@yahoogroups.com
MIME-Version: 1.0
X-Mailer: Lotus Notes Release 6.5 September 26, 2003
Message-ID: <OF6E44FC82.C54480A3-ONC2256E32.002D268C-C2256E32.0034208D@il.ibm.com>
Date: Fri, 6 Feb 2004 11:29:19 +0200
X-MIMETrack: Serialize by Router on D12ML102/12/M/IBM(Release 6.0.2CF2|July 23, 2003) at
06/02/2004 11:29:42
Content-Type: multipart/mixed; boundary="=_mixed 002E9A2BC2256E32_="
X-eGroups-Remote-IP: 195.212.29.136
From: Julian Satran <julian_satran@il.ibm.com>
Subject: Re: [pnfs-reqs] working on the Word to IETF conversion
X-Yahoo-Group-Post: member; u=64714603
X-Yahoo-Profile: julian_satran

Garth,

I send you a draft in word that has a set of word formats named RFCxxx.
Use it as a template and change all your formats to it.
Do not change it. It has all the fonts in fixed pitch and the template if
you use only the RFC named styles is working (was with Word 2000).
Then you should be safe with the two conversion tools (print to a generic
printer and use crlf.exe).

If you still have trouble I can send you the Framemaker template and
instructions on how to use it (far better).

Regards,
Julo





Garth Gibson <garth@panasas.com>
05/02/2004 23:46
Please respond to
pnfs-reqs


To
pnfs-reqs@yahoogroups.com
cc

Subject
[pnfs-reqs] working on the Word to IETF conversion






Here are two Word files. The first is the draft after all content
editing. The second is my attempt to reset margins and indents in
order to use print to generic/text-only file to get an IETF ASCI
document. I'm not doing well. Still working at it. Please help.





Yahoo! Groups Links







Attachment (not stored)
draft-gibson-prob-st-00.doc.hqx
Type: application/mac-binhex40

From garth@panasas.com Fri Feb 06 10:37:03 2004
Return-Path: <garth@panasas.com>
X-Sender: garth@panasas.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 72863 invoked from network); 6 Feb 2004 18:36:59 -0000
Received: from unknown (66.218.66.172)
by m16.grp.scd.yahoo.com with QMQP; 6 Feb 2004 18:36:59 -0000
Received: from unknown (HELO PIKES.panasas.com) (65.194.124.178)
by mta4.grp.scd.yahoo.com with SMTP; 6 Feb 2004 18:36:57 -0000
Received: from [127.0.0.1] ([172.17.19.3]) by PIKES.panasas.com with SMTP (Microsoft Exchange Internet Mail Service Version 5.5.2653.13)
id SVSYNQ7H; Fri, 6 Feb 2004 13:35:43 -0500
Mime-Version: 1.0 (Apple Message framework v612)
To: pnfs-reqs@yahoogroups.com
Message-Id: <4138B0BA-58D3-11D8-825E-000A95A94F04@panasas.com>
Content-Type: multipart/mixed; boundary=Apple-Mail-6--998911455
Date: Fri, 6 Feb 2004 13:35:36 -0500
X-Mailer: Apple Mail (2.612)
X-eGroups-Remote-IP: 65.194.124.178
From: Garth Gibson <garth@panasas.com>
Subject: Have ASCII, approaching submission
X-Yahoo-Group-Post: member; u=169457820
X-Yahoo-Profile: garth_a_gibson

Okay, so with help from many of you we may have an ASCI internet draft.
Attached is the file that may be it.

Next, the file name. According to "Guidelines to Authors of
Internet-Drafts, Last modified September 5, 2002", says that we need to
get the file name from IETF. Assuming that this is not an NFS
document, not yet anyway, we are suggesting the name
"draft-gibson-pnfs-problem-statement-00.txt".

Unless I hear otherwise, I will submit a request for a name, giving
this name as a suggestion.

garth

----------- test from the Guidelines section ------

For those authors submitting updates to existing Internet-Drafts, the
choice of the file name is easily determined (up the version by 1).
For new documents, either suggest one or send a message to
"internet-drafts@ietf.org" with the document title, noting if it is a
product of a working group (and the name of the group), and an
abstract. The file name to be assigned will be included in a response.
Simply add the filename text to the document (ASCII and PostScript
versions) and submit the Internet-Draft.

If the document is a new one (i.e. starting with revision -00.txt) and
is submitted as a working group document, the IETF secretariat will ask
the chair(s) of the wg the permission to publish it as a working group
document. To expedite the process, authors are encouraged to send the
document to internet-drafts@ietf.org and at the same time cc: to the
chair(s) of the working group. If the document is accepted as a
working group document, then it will have the draft-ietf-<working group
acronym> file name and will be announced on the working group mailing
list by the IETF Secretariat. If the document is not accepted as a
working group document, it will be processed as an individual
submission, where the filename will be draft-<yourname>-....txt.

NOTE: Revision numbers are based on the filename (as in first, second,
or third version of this document). If there is a filename
change, the version number starts over at -00. Put another way, the
prior version number will NOT be incremented when an Internet-Draft
filename has changed. ALL FILES BEGIN at -00

Before each IETF meeting, a deadline is announced for submitting
documents ahead of time to be published for the meeting. For new
documents, he deadline is even sooner (one week). There is no accepted
delay. If you send at the very last minute, it is possible that it will
arrive too late because of congestion of your mail server queues. If
it is received too late, it will not be published on time for the IETF
meeting.

Note that if a filename is suggested, but not used, the document will
have to be resubmitted with the actual file name.



Begin forwarded message:
> From: "Benny Halevy" <bhalevy@panasas.com>
> Date: February 6, 2004 3:11:02 AM EST
> To: "Garth Gibson" <garth@panasas.com>
> Cc: "Benny Halevy" <bhalevy@panasas.com>
> Subject: RE: crlf.exe
>
> Garth, I followed David Black's instructions.
> Files attached.
>
> Benny
>




Internet Draft Garth Gibson
Expires: August 2004 Panasas Inc. & CMU
Peter Corbett
Network Appliance, Inc.

Document: draft-gibson-pnfs-problem-statement-00.txt February 2004




pNFS Problem Statement


Status of this Memo

This document is an Internet-Draft and is in full conformance with
all provisions of Section 10 of RFC2026.

Internet-Drafts are working documents of the Internet Engineering
Task Force (IETF), its areas, and its working groups. Note that
other groups may also distribute working documents as Internet-
Drafts.

Internet-Drafts are draft documents valid for a maximum of six months
and may be updated, replaced, or obsoleted by other documents at any
time. It is inappropriate to use Internet-Drafts as reference
material or to cite them other than as "work in progress."

The list of current Internet-Drafts can be accessed at
http://www.ietf.org/ietf/1id-abstracts.txt

The list of Internet-Draft Shadow Directories can be accessed at
http://www.ietf.org/shadow.html.





Copyright Notice

Copyright (C) The Internet Society (2003). All Rights Reserved.











Gibson et al Expires - August 2004 [Page 1]
Internet Draft pNFS Problem Statement February 2004



Abstract

This draft considers the problem of limited bandwidth to NFS servers.
The bandwidth limitation exists because an NFS server has limited
network, CPU, memory and disk I/O resources. Yet, access to any one
file system through the NFSv4 protocol requires that a single server
be accessed. While NFSv4 allows file system migration, it does not
provide a mechanism that supports multiple servers simultaneously
exporting a single writable file system.

This problem has become aggravated in recent years with the advent of
very cheap and easily expanded clusters of application servers that
are also NFS clients. The aggregate bandwidth demands of such
clustered clients, typically working on a shared data set
preferentially stored in a single file system, can increase much more
quickly than the bandwidth of any server. The proposed solution is
to provide for the parallelization of file services, by enhancing
NFSv4 in a minor version.


Table of Contents

1. Introduction...................................................2
2. Bandwidth Scaling in Clusters..................................4
3. Clustered Applications.........................................4
4. Existing File Systems for Clusters.............................6
5. Eliminating the Bottleneck.....................................7
6. Separated control and data access techniques...................8
7. Security Considerations........................................9
8. Informative References.........................................9
9. Acknowledgments...............................................11
10. Author's Addresses...........................................11
11. Full Copyright Statement.....................................11


1. Introduction

The storage I/O bandwidth requirements of clients are rapidly
outstripping the ability of network file servers to supply them.
Increasingly, this problem is being encountered in installations
running the NFS protocol. The problem can be solved by increasing
the server bandwidth. This draft suggests that an effort be mounted
to enable NFS file service to scale with its clusters of clients.
The proposed approach is to increase the aggregate bandwidth possible
to a single file system by parallelizing the file service, resulting
in multiple network connections to multiple server endpoints
participating in the transfer of requested data. This should be



Gibson et al Expires - August 2004 [Page 2]
Internet Draft pNFS Problem Statement February 2004


achievable within the framework of NFS, possibly in a minor version
of the NFSv4 protocol.

In many application areas, single system servers are rapidly being
replaced by clusters of inexpensive commodity computers. As
clustering technology has improved, the barriers to running
application codes on very large clusters have been lowered. Examples
of application areas that are seeing the rapid adoption of scalable
client clusters are data intensive applications such as genomics,
seismic processing, data mining, content and video distribution, and
high performance computing. The aggregate storage I/O requirements of
a cluster can scale proportionally to the number of computers in the
cluster. It is not unusual for clusters today to make bandwidth
demands that far outstrip the capabilities of traditional file
servers. A natural solution to this problem is to enable file
service to scale as well, by increasing the number of server nodes
that are able to service a single file system to a cluster of
clients.

Scalable bandwidth can be claimed by simply adding multiple
independent servers to the network. Unfortunately, this leaves to
file system users the task of spreading data across these independent
servers. Because the data processed by a given data-intensive
application is usually logically associated, users routinely co-
locate this data in a single file system, directory or even a single
file. The NFSv4 protocol currently requires that all the data in a
single file system be accessible through a single exported network
endpoint, constraining access to be through a single NFS server.

A better way of increasing the bandwidth to a single file system is
to enable access to be provided through multiple endpoints in a
coordinated or coherent fashion. Separation of control and data
flows provides a straightforward framework to accomplish this, by
allowing transfers of data to proceed in parallel from many clients
to many data storage endpoints. Control and file management
operations, inherently more difficult to parallelize, can remain the
province of a single NFS server, inheriting the simple management of
today's NFS file service, while offloading data transfer operations
allows bandwidth scalability. Data transfer may be done using NFS or
other protocols, such as iSCSI.

While NFS is a widely used network file system protocol, most of the
world's data resides in data stores that are not accessible through
NFS. Much of this data is stored in Storage Area Networks,
accessible by SCSI's Fibre Channel Protocol (FCP), or increasingly,
by iSCSI. Storage Area Networks routinely provide much higher data
bandwidths than do NFS file servers. Unfortunately, the simple array
of blocks interface into Storage Area Networks does not lend itself
to controlling multiple clients that are simultaneously reading and


Gibson et al Expires - August 2004 [Page 3]
Internet Draft pNFS Problem Statement February 2004


writing the blocks of the same or different files, a workload usually
referred to as data sharing. NFS file service, with its hierarchical
namespace of separately controlled files, offers simpler and more
cost-effective management. One might conclude that users must chose
between high bandwidth and data sharing. Not only is this conclusion
false, but it should also be possible to allow data stored in SAN
devices, FCP or iSCSI, to be accessed under the control of an NFS
server. Such an approach protects the industry's large investment in
NFS, since the bandwidth bottleneck no longer needs to drive users to
adopt a proprietary alternative solution, and leverages SAN storage
infrastructures, all within a common architectural framework.


2. Bandwidth Scaling in Clusters

When applied to data-intensive applications, clusters can generate
unprecedented demand for storage bandwidth. At present, each node in
the cluster is likely to be a dual processor, with each processor
running at multiple GHz, with gigabytes of DRAM. Depending on the
specific application, each node is capable of sustaining a demand of
10s to 100s of MB/s of data from storage. In addition, the number of
nodes in a cluster is commonly in the 100s, with many instances of
1000s to 10,000s of nodes. The result is that storage systems may be
called upon to provide an aggregate bandwidth of GB/s ranging upwards
toward TB/s.

The performance of a single NFS server has been improving, but it is
not able to keep pace with cluster demand. Directly connected storage
devices behind an NFS server have given way to disk arrays and
networked disk arrays, making it now possible for an NFS server to
directly access 100s to 1000s of disk drives whose aggregate capacity
reaches upwards to PBs and whose raw bandwidths range upwards to 10s
of GB/s.

An NFS server is interposed between the scalable storage subsystem
and the scalable client cluster. Multiple NIC endpoints help network
bandwidth keep up with DRAM bandwidth. However, the rate of
improvement of NFS server performance is not faster than the rate of
improvement in each client node. As long as an NFS file system is
associated with a single client-side network endpoint, the aggregate
capabilities of a single NFS server to move data between storage
networks and client networks will not be able to keep pace with the
aggregate demand of clustered clients and large disk subsystems.


3. Clustered Applications

Large datasets and high bandwidth processing of large datasets are
increasingly common in a wide variety of applications. As most


Gibson et al Expires - August 2004 [Page 4]
Internet Draft pNFS Problem Statement February 2004


computer users can affirm, the size of everyday presentations,
pictures and programs seems to grow continuously, and in fact average
file size does grow with time [Ousterhout85, Baker91]. Simple
copying, viewing, archiving and sharing of even this baseline use of
growing files in day-to-day business and personal computing drives up
the bandwidth demand on servers.

Some applications, however, make much larger demands on file and file
system capacity and bandwidth. Databases of DNA sequences, used in
bioinformatics search, range up to tens of GBs and are often in use
by all cluster users are the same time [NIH03]. These huge files may
experience bursts of many concurrent clients loading the whole file
independently.

Bioinformatics is an example of extensive search in science
application. Extensive search is much broader than science. Wall
Street has taken to collecting long-term transaction record
histories. Looking for patterns of unbilled transactions, fraud or
predictable market trends is a growing financial opportunity
[Agarwal95, Senator95].

Security and authentication are driving a need for image search, such
as face recognition [Flickner95]. Databasing the faces of approved
or suspected individuals and searching through many camera feeds
involves huge data and bandwidths. Traditional database indexing in
these high dimension data structures often fails to avoid full
database scans of these huge files [Berchtold97].

With huge storage repositories and fast computers, huge sensor
capture is increasingly used in many applications. Consumer digital
photography fits this model, with photo touch-up and slide show
generation tools driving bandwidth, although much more demanding
applications are not unusual.

Medical test imagery is being captured at very high resolution and
tools are being developed for automatic preliminary diagnosis, for
example [Afework98]. In the science world, even larger datasets are
captured from satellites, telescopes, and atom-smashers, for example
[Greiman97]. Preliminary processing of a sky survey suggests that
thousand node clusters may sustain GB/s storage bandwidths [Gray03].
Seismic trace data, often measured in helicopter loads, commands
large clusters for days to months [Knott03].

At the high end of science application, accurate physical simulation,
its visualization and fault-tolerance checkpointing, has been
estimated to need 10 GB/s bandwidth and 100 TB of capacity for every
thousand nodes in a cluster [SGPFS01].




Gibson et al Expires - August 2004 [Page 5]
Internet Draft pNFS Problem Statement February 2004


Most of these applications make heavy use of shared data across many
clients, users and applications, have limited budgets available to
fund aggressive computational goals, and have technical or scientific
users with strong preferences for file systems and no patience for
tuning storage. NFS file service, appropriately scaled up in
capacity and bandwidth, is highly desired.

In addition to these search, sensor and science applications,
traditional database applications are increasingly employing NFS
servers. These applications often have hotspot tables, leading to
high bandwidth storage demands. Yet SAN-based solutions are
sometimes harder to manage than NFS based solutions, especially in
databases with a large number of tables. NFS servers with scalable
bandwidth would accelerate the adoption of NFS for database
applications.

These examples suggest that there is no shortage of applications
frustrated by the limitations of a single network endpoint on a
single NFS server exporting a single file system or single huge file.


4. Existing File Systems for Clusters

The server bottleneck has induced various vendors to develop
proprietary alternatives to NFS.

Known variously as asymmetric, out-of-band, clustered or SAN file
systems, these proprietary alternatives exploit the scalability of
storage networks by attaching all nodes in the client cluster to the
storage network. Then, by reorganizing client and server code
functionality to separate data traffic from control traffic, client
nodes are able to access storage devices directly rather than
requesting all data from the same single network endpoint in the file
server that handles control traffic.

Most proprietary alternative solutions have been tailored to storage
area networks based on the fixed-sized block SCSI storage device
command set and its Fibrechannel SCSI transport. Examples in this
class include EMC's High Road (www.emc.com); IBM's TotalStorage SAN
FS, SANergy and GPFS (www.ibm.com); Sistina/Redhat's GFS
(www.readhat.com); SGI's CXFS (www.sgi.com); Veritas' SANPoint Direct
and CFS (www.veritas.com); and Sun's QFS (www.sun.com). The
Fibrechannel SCSI transport used in these systems may soon be
replaceable by a TCP/IP SCSI transport, iSCSI, enabling these
proprietary alternatives to operate on the same equipment and IETF
protocols commonly used by NFS servers.

While fixed-sized block SCSI storage devices are used in most file
systems with separated data and control paths, this is not the only


Gibson et al Expires - August 2004 [Page 6]
Internet Draft pNFS Problem Statement February 2004


alternative available today. SCSI's newly emerging command set, the
Object Storage Device (OSD) command set, transmits variable length
storage objects over SCSI transports [T10-03]. Panasas' ActiveScale
storage cluster employs a proto-OSD command set over iSCSI on its
separated data path (www.panasas.com). IBM's research is also
demonstrating a variant of their TotalStorage SAN FS employing proto-
OSD commands [Azagury02].

Even more distinctive is Zforce's File Switch technology
(www.zforce.com). Zforce virtualizes a CIFS file server spreading
the contents of a file share over many backend CIFS storage servers
and places their control path functionality inside a network switch
in order to have some of the properties of both separated and non-
separated data and control paths. However, striping files over
multiple file-based storage servers is not a new concept. Berkeley's
Zebra file system, the successor to the log-based file system
developed for RAID storage, had a separated data and control path
with file protocols to both [Hartman95].


5. Eliminating the Bottleneck

The restriction of a single network endpoint results from the way NFS
associates file servers and file systems. Essentially, each client
machine "mounts" each exported file system; these mount operations
bind a network endpoint to all files in the exported file system,
instructing the client to address that network endpoint with all
requests associated with all files in that file system. Mechanisms
intended for primarily for failover have been established for giving
clients a list of network endpoints associated with a given file
system.

Multiple NFS servers can be used instead of a single NFS server, and
many cluster administrators, programmers and end-users have
experimented with this alternative. The principle compromise
involved in exploiting multiple NFS servers is that a single file or
single file system is decomposed into multiple files or file systems,
respectively. For instance, a single file can be decomposed into many
files, each located in a part of the namespace that is exported by a
different NFS server; or the files of a single directory can be
linked to files in directories located in file systems exported by
different NFS servers. Because this decomposition is done without
NFS server support, the work of decomposing and recomposing and the
implications of the decomposition on capacity and load balancing,
backup consistency, error recovery, and namespace management all fall
to the customer. Moreover, the additional statefulness of NFSv4 makes
correct semantics for files decomposed over multiple services without
NFS support much more complex. Such extra work and extra problems are



Gibson et al Expires - August 2004 [Page 7]
Internet Draft pNFS Problem Statement February 2004


usually referred to as storage management costs, and are blamed for
causing a high total cost of ownership for storage.

Preserving the relative ease of use of NFS storage systems requires
solutions to the bandwidth bottleneck that do not decompose files and
directories in the file subtree namespace.
A solution to this problem should continue to use the existing single
network endpoint for control traffic, including namespace
manipulations. Decompositions of individual files and file systems
over multiple network endpoints can be provided via the separated
data paths, without separating the control and metadata paths.


6. Separated control and data access techniques

Separating storage data flow from file system control flow
effectively moves the bottleneck away from the single endpoint of an
NFS server and distributes it across the bisectional bandwidth of the
storage network between the cluster nodes and storage devices. Since
switch bandwidths of upwards of terabits per second are available
today, this bottleneck is at least two orders of magnitude better
than that of an NFS server network endpoint.

In an architecture that separates the storage data path from the NFS
control path there are choices of protocol for the data path. One
straightforward answer is to extend the NFS protocol so it can
accommodate can be used on both control and separated data paths.
Another straightforward answer is to capture the existing market's
dominant separated data path, fixed-sized block SCSI storage. A third
alternative is the emerging object storage SCSI command set, OSD,
which is appearing in new products with separate data and control
paths.

A solution that accommodates all of these approaches provides the
broadest applicability for NFS. Specifically, NFS extensions should
make minimal assumptions about the storage data server access
protocol. The clients in such an extended NFS system should be
compatible with the current NFSv4 protocol, and should be compatible
with earlier versions of NFS as well. A solution should be capable
of providing both asymmetric data access, with the data path
connected via NFS or other protocols and transports, and symmetric
parallel access to servers that run NFS on each server node.
Specifically, it is desirable to enable NFS to manage asymmetric
access to storage attached via iSCSI and Fibre Channel/SCSI storage
area networks.

As previously discussed, the root cause of the NFS server bottleneck
is the binding between one network endpoint and all the files in a
file system. NFS extensions can allow the association of additional


Gibson et al Expires - August 2004 [Page 8]
Internet Draft pNFS Problem Statement February 2004


network endpoints with specific files. These associations could be
represented layout maps [Gibson98]. NFS clients could be extended to
have the ability to retrieve and use these layout maps.

NFSv4 provides an excellent foundation for this. We may be able to
extend the current notion of file delegations to include the ability
to retrieve and utilize a file layout map. A number of ideas have
been proposed for storing, accessing, and acting upon layout
information stored by NFS servers to allow separate access to file
data over separate data paths. Data access can be supported over
multiple protocols, including NFSv4, iSCSI, and OSD.

7. Security Considerations

Bandwidth scaling solutions that employ separation of control and
data paths will introduce new security concerns. For example, the
data access methods will require authentication and access control
mechanisms that are consistent with the primary mechanisms on the
NFSv4 control paths. Object storage employs revocable cryptographic
restrictions on each object, which can be created and revoked in the
control path. With iSCSI access methods, iSCSI security capabilities
are available, but do not contain NFS access control. Fibre Channel
based SCSI access methods have less sophisticated security than
iSCSI. These access methods typically use private networks to
provide security.

Any proposed solution must be analyzed for security threats and any
such threats must be addressed. The IETF and the NFS working group
have significant expertise in this area.


8. Informative References

[Afework98] A. Afework, M. Beynon, F. Bustamonte, A. Demarzo, R.
Ferriera, R. Miller, M. Silberman, J. Saltz, A. Sussman, H. Tang,
"Digital dynamic telepathology - the virtual microscope," Proc. of
the AMIA'98 Fall Symposium 1998.

[Agarwal95] Agrawal, R. and Srikant, R. "Fast Algorithms for Mining
Association Rules" VLDB, September 1995.

[Azagury02] Azagury, A., Dreizin, V., Factor, M., Henis, E., Naor,
D., Rinetzky, N., Satran, J., Tavory, A., Yerushalmi, L, "Towards
an Object Store," IBM Storage Systems Technology Workshop,
November 2002.

[Baker91] Baker, M.G., Hartman, J.H., Kupfer, M.D., Shirriff, K.W.
and Ousterhout, J.K. "Measurements of a Distributed File System"
SOSP, October 1991.


Gibson et al Expires - August 2004 [Page 9]
Internet Draft pNFS Problem Statement February 2004



[Berchtold97] Berchtold, S., Boehm, C., Keim, D.A. and Kriegel, H. "A
Cost Model For Nearest Neighbor Search in High-Dimensional Data
Space" ACM PODS, May 1997.

[Fayyad98] Fayyad, U. "Taming the Giants and the Monsters: Mining
Large Databases for Nuggets of Knowledge" Database Programming and
Design, March 1998.

[Flickner95] Flickner, M., Sawhney, H., Niblack, W., Ashley, J.,
Huang, Q., Dom, B., Gorkani, M., Hafner, J., Lee, D., Petkovic,
D., Steele, D. and Yanker, P. "Query by Image and Video Content:
the QBIC System" IEEE Computer, September 1995.

[Gibson98] Gibson, G. A., et. al., "A Cost-Effective, High-Bandwidth
Storage Architecture," International Conference on Architectural
Support for Programming Languages and Operating Systems (ASPLOS),
October 1998.

[Gray03] Jim Gray, "Distributed Computing Economics," Technical
Report MSR-TR-2003-24, March 2003.

[Greiman97] Greiman, W., W. E. Johnston, C. McParland, D. Olson, B.
Tierney, C. Tull, "High-Speed Distributed Data Handling for HENP,"
Computing in High Energy Physics, April, 1997. Berlin, Germany.

[Hartman95] John H. Hartman and John K. Ousterhout, "The Zebra
Striped Network File System," ACM Transactions on Computer Systems
13, 3, August 1995.

[Knott03] Knott, T., "Computing colossus," BP Frontiers magazine,
Issue 6, April 2003, http://www.bp.com/frontiers.

[NIH03] "Easy Large-Scale Bioinformatics on the NIH Biowulf
Supercluster," http://biowulf.nih.gov/easy.html, 2003.

[Ousterhout85] Ousterhout, J.K., DaCosta, H., Harrison, D., Kunze,
J.A., Kupfer, M. and Thompson, J.G. "A Trace Drive Analysis of the
UNIX 4.2 BSD FIle System" SOSP, December 1985.

[Senator95] Senator, T.E., Goldberg, H.G., Wooten, J., Cottini, M.A.,
Khan, A.F.U., Klinger, C.D., Llamas, W.M., Marrone, M.P. and Wong,
R.W.H. "The Financial Crimes Enforcement Network AI System (FAIS):
Identifying potential money laundering from reports of large cash
transactions" AIMagazine 16 (4), Winter 1995.

[SGPFS01] SGS File System RFP, DOE NNCA and DOD NSA, April 25, 2001.




Gibson et al Expires - August 2004 [Page 10]
Internet Draft pNFS Problem Statement February 2004


[T10-03] Draft OSD Standard, T10 Committee, Storage Networking
Industry Association(SNIA),
ftp://www.t10.org/ftp/t10/drafts/osd/osd-r08.pdf


9. Acknowledgments

David Black, Gary Grider, Benny Halevy, Dean Hildebrand, Dave Noveck,
Julian Satran, Tom Talpey, and Brent Welch contributed to the
development of this problem statement.


10. Author's Addresses

Garth Gibson
Panasas Inc, and Carnegie Mellon University
1501 Reedsdale Street
Pittsburgh, PA 15233 USA
Phone: +1 412 323 3500
Email: ggibson@panasas.com

Peter Corbett
Network Appliance Inc.
375 Totten Pond Road
Waltham, MA 02451 USA
Phone: +1 781 768 5343
Email: peter@pcorbett.net


11. Full Copyright Statement

Copyright (C) The Internet Society (2004). All Rights Reserved.

This document and translations of it may be copied and furnished to
others, and derivative works that comment on or otherwise explain it
or assist in its implementation may be prepared, copied, published
and distributed, in whole or in part, without restriction of any
kind, provided that the above copyright notice and this paragraph are
included on all such copies and derivative works.
However, this document itself may not be modified in any way, such as
by removing the copyright notice or references to the Internet
Society or other Internet organizations, except as needed for the
purpose of developing Internet standards in which case the procedures
for copyrights defined in the Internet Standards process must be
followed, or as required to translate it into languages other than
English.

The limited permissions granted above are perpetual and will not be
revoked by the Internet Society or its successors or assigns.


Gibson et al Expires - August 2004 [Page 11]
Internet Draft pNFS Problem Statement February 2004



This document and the information contained herein is provided on an
"AS IS" basis and THE INTERNET SOCIETY AND THE INTERNET ENGINEERING
TASK FORCE DISCLAIMS ALL WARRANTIES, EXPRESS OR IMPLIED, INCLUDING
BUT NOT LIMITED TO ANY WARRANTY THAT THE USE OF THE INFORMATION
HEREIN WILL NOT INFRINGE ANY RIGHTS OR ANY IMPLIED WARRANTIES OF
MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE.












































Gibson et al Expires - August 2004 [Page 12] 


From ggrider@lanl.gov Fri Feb 06 10:43:54 2004
Return-Path: <ggrider@lanl.gov>
X-Sender: ggrider@lanl.gov
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 47383 invoked from network); 6 Feb 2004 18:43:54 -0000
Received: from unknown (66.218.66.216)
by m4.grp.scd.yahoo.com with QMQP; 6 Feb 2004 18:43:54 -0000
Received: from unknown (HELO mailwasher-b.lanl.gov) (192.16.0.25)
by mta1.grp.scd.yahoo.com with SMTP; 6 Feb 2004 18:43:53 -0000
Received: from mailrelay3.lanl.gov (localhost.localdomain [127.0.0.1])
by mailwasher-b.lanl.gov (8.12.10/8.12.10/(ccn-5)) with ESMTP id i16IgpHR017493
for <pnfs-reqs@yahoogroups.com>; Fri, 6 Feb 2004 11:42:51 -0700
Received: from cic-mail.lanl.gov (localhost.localdomain [127.0.0.1])
by mailrelay3.lanl.gov (8.12.10/8.12.10/(ccn-5)) with ESMTP id i16IgoeI012727
for <pnfs-reqs@yahoogroups.com>; Fri, 6 Feb 2004 11:42:50 -0700
Received: from cthulu.lanl.gov (vpn-client-187.lanl.gov [128.165.253.187])
by cic-mail.lanl.gov (8.12.10/8.12.10/(ccn-5)) with ESMTP id i16IgmYi006488;
Fri, 6 Feb 2004 11:42:48 -0700
Message-Id: <5.2.0.9.2.20040206114221.01582600@cic-mail.lanl.gov>
X-Sender: ggrider@cic-mail.lanl.gov
X-Mailer: QUALCOMM Windows Eudora Version 5.2.0.9
Date: Fri, 06 Feb 2004 11:42:45 -0700
To: pnfs-reqs@yahoogroups.com, pnfs-reqs@yahoogroups.com
In-Reply-To: <4138B0BA-58D3-11D8-825E-000A95A94F04@panasas.com>
Mime-Version: 1.0
Content-Type: multipart/related;
type="multipart/alternative";
boundary="=====================_9928526==.REL"
X-Scanned-By: MIMEDefang 2.35
X-eGroups-Remote-IP: 192.16.0.25
From: Gary Grider <ggrider@lanl.gov>
Subject: Re: [pnfs-reqs] Have ASCII, approaching submission
X-Yahoo-Group-Post: member; u=169341185
X-Yahoo-Profile: ggriderpnfs

Thanks Garth for the enormous effort.  Thanks to all that helped.

Gary

At 01:35 PM 2/6/2004 -0500, Garth Gibson wrote:

> Okay, so with help from many of you we may have an ASCI internet draft.
>   Attached is the file that may be it.
>
> Next, the file name.  According to "Guidelines to Authors of
> Internet-Drafts, Last modified September 5, 2002", says that we need to
> get the file name from IETF.  Assuming that this is not an NFS
> document, not yet anyway, we are suggesting the name
> "draft-gibson-pnfs-problem-statement-00.txt".
>
> Unless I hear otherwise, I will submit a request for a name, giving
> this name as a suggestion.
>
> garth
>
> ----------- test from the Guidelines section ------
>
> For those authors submitting updates to existing Internet-Drafts, the
> choice of the file name is easily determined (up the version by 1).
> For new documents, either suggest one or send a message to
> "internet-drafts@ietf.org" with the document title, noting if it is a
> product of a working group (and the name of the group), and an
> abstract. The file name to be assigned will be included in a response.
> Simply add the filename text to the document (ASCII and PostScript
> versions) and submit the Internet-Draft.
>
> If the document is a new one (i.e. starting with revision -00.txt) and
> is submitted as a working group document, the IETF secretariat will ask
> the chair(s) of the wg the permission to publish it as a working group
> document.  To expedite the process, authors are encouraged to send the
> document to internet-drafts@ietf.org and at the same time cc: to the
> chair(s) of the working group.  If the document is accepted as a
> working group document, then it will have the draft-ietf-<working group
> acronym> file name and will be announced on the working group mailing
> list by the IETF Secretariat.  If the document is not accepted as a
> working group document, it will be processed as an individual
> submission, where the filename will be draft-<yourname>-....txt.
>
> NOTE: Revision numbers are based on the filename (as in first, second,
>        or third version of this document). If there is a filename
> change, the version number starts over at -00. Put another way, the
> prior version number will NOT be incremented when an Internet-Draft
> filename has changed. ALL FILES BEGIN at -00
>
> Before each IETF meeting, a deadline is announced for submitting
> documents ahead of time to be published for the meeting. For new
> documents, he deadline is even sooner (one week). There is no accepted
> delay. If you send at the very last minute, it is possible that it will
> arrive too late because of congestion of your mail server queues.  If
> it is received too late, it will not be published on time for the IETF
> meeting.
>
> Note that if a filename is suggested, but not used, the document will
> have to be resubmitted with the actual file name.
>
>
>
> Begin forwarded message:
> > From: "Benny Halevy" <bhalevy@panasas.com>
> > Date: February 6, 2004 3:11:02 AM EST
> > To: "Garth Gibson" <garth@panasas.com>
> > Cc: "Benny Halevy" <bhalevy@panasas.com>
> > Subject: RE: crlf.exe
> >
> > Garth, I followed David Black's instructions.
> > Files attached.
> >
> > Benny
> >
>
> Yahoo! Groups Sponsor
> ADVERTISEMENT
> 977749.jpg
> 97778f.jpg
>
> Yahoo! Groups Links
>
>     * To visit your group on the web, go to:
>     * http://groups.yahoo.com/group/pnfs-reqs/
>     *  
>     * To unsubscribe from this group, send an email to:
>     * pnfs-reqs-unsubscribe@yahoogroups.com
>     *  
>     * Your use of Yahoo! Groups is subject to the Yahoo! Terms of Service. 

From Thomas.Talpey@netapp.com Fri Feb 06 10:54:54 2004
Return-Path: <Thomas.Talpey@netapp.com>
X-Sender: Thomas.Talpey@netapp.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 65270 invoked from network); 6 Feb 2004 18:54:53 -0000
Received: from unknown (66.218.66.218)
by m4.grp.scd.yahoo.com with QMQP; 6 Feb 2004 18:54:53 -0000
Received: from unknown (HELO mx01.netapp.com) (198.95.226.53)
by mta3.grp.scd.yahoo.com with SMTP; 6 Feb 2004 18:54:53 -0000
Received: from hawk.corp.netapp.com (hawk [10.57.156.122])
by mx01.netapp.com (8.12.10/8.12.10/NTAP-1.4) with ESMTP id i16IsRKw010580
for <pnfs-reqs@yahoogroups.com>; Fri, 6 Feb 2004 10:54:28 -0800 (PST)
Received: from svlexc01.hq.netapp.com (svlexc01.corp.netapp.com [10.10.22.171])
by hawk.corp.netapp.com (8.12.9/8.12.9/NTAP-1.5) with ESMTP id i16IsRRj003290
for <pnfs-reqs@yahoogroups.com>; Fri, 6 Feb 2004 10:54:27 -0800 (PST)
Received: from tmt.netapp.com ([10.97.6.32]) by silver.hq.netapp.com with Microsoft SMTPSVC(5.0.2195.5329); Fri, 6 Feb 2004 13:54:26 -0500
MIME-Version: 1.0
Content-Type: multipart/alternative;
boundary="----_=_NextPart_001_01C3ECE2.A4505D00"
X-MimeOLE: Produced By Microsoft Exchange V6.0.6249.0
content-class: urn:content-classes:message
Date: Fri, 6 Feb 2004 10:53:43 -0800
Message-ID: <5.2.1.1.2.20040206134617.00c3afd0@silver.nane.netapp.com>
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
Thread-Topic: [pnfs-reqs] Have ASCII, approaching submission
Thread-Index: AcPs4qTyDQAiLTf+S/uat3b2ahLQIA==
To: <pnfs-reqs@yahoogroups.com>
X-eGroups-Remote-IP: 198.95.226.53
From: "Talpey, Thomas" <Thomas.Talpey@netapp.com>
Subject: Re: [pnfs-reqs] Have ASCII, approaching submission
X-Yahoo-Group-Post: member; u=44154239
X-Yahoo-Profile: tmtymailu

At 01:35 PM 2/6/2004, Garth Gibson wrote:
>Okay, so with help from many of you we may have an ASCI internet draft.
>  Attached is the file that may be it.
>
>Next, the file name.  According to "Guidelines to Authors of
>Internet-Drafts, Last modified September 5, 2002", says that we need to
>get the file name from IETF.  Assuming that this is not an NFS
>document, not yet anyway, we are suggesting the name
>"draft-gibson-pnfs-problem-statement-00.txt".
>
>Unless I hear otherwise, I will submit a request for a name, giving
>this name as a suggestion.

I have only one comment - the 1st IETF copyright is the 2003 boilerplate.
It needs to be 2004! Interestingly, the second appearance, at the end
of the document, is fine - just the first one. I recommend fixing the .txt
and not re-formatting. :-)

You don't need to submit a request for a name, this is an "individual"
submission. Your suggested title is the correct form and will be fine,
you can go ahead and send it to internet-drafts@ietf.org with the
necessary e-cover letter. You'll get an automated response and
unless you hear otherwise, it will appear in a few days.

Congrats!

Tom. 

From garth@panasas.com Fri Feb 06 12:47:03 2004
Return-Path: <garth@panasas.com>
X-Sender: garth@panasas.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 6062 invoked from network); 6 Feb 2004 20:47:00 -0000
Received: from unknown (66.218.66.172)
by m6.grp.scd.yahoo.com with QMQP; 6 Feb 2004 20:47:00 -0000
Received: from unknown (HELO PIKES.panasas.com) (65.194.124.178)
by mta4.grp.scd.yahoo.com with SMTP; 6 Feb 2004 20:47:00 -0000
Received: from [127.0.0.1] ([172.17.19.3]) by PIKES.panasas.com with SMTP (Microsoft Exchange Internet Mail Service Version 5.5.2653.13)
id SVSYNRZZ; Fri, 6 Feb 2004 15:45:37 -0500
Mime-Version: 1.0 (Apple Message framework v612)
To: pnfs-reqs@yahoogroups.com
Message-Id: <65468135-58E5-11D8-825E-000A95A94F04@panasas.com>
Content-Type: multipart/mixed; boundary=Apple-Mail-14--991120022
Date: Fri, 6 Feb 2004 15:45:27 -0500
X-Mailer: Apple Mail (2.612)
X-eGroups-Remote-IP: 65.194.124.178
From: Garth Gibson <garth@panasas.com>
Subject: Fwd: submitting "draft-gibson-pnfs-problem-statement-00.txt" an informational internet draft
X-Yahoo-Group-Post: member; u=169457820
X-Yahoo-Profile: garth_a_gibson

It's submitted -- attached is the text that was finally sent (Tom found
a 2003 error and Craig found a missing "as")

Begin forwarded message:

> From: ietfauto@ietf.org (Internet Draft Submission Manager)
> Date: February 6, 2004 3:39:52 PM EST
> To: garth@panasas.com
> Subject: Re: submitting "draft-gibson-pnfs-problem-statement-00.txt"
> an informational internet draft
> Subject: Autoreply from Internet Draft Submission Manager
> Reply-To: dinaras@ietf.org
>
> Greetings:
>
> This message is being sent to acknowledge receipt of your
> Internet-Draft
> submission or message to internet-drafts@ietf.org.
> If you submitted an Internet-Draft, then it will be posted
> on the Internet-Drafts page of the IETF Web site, and an I-D
> Action message will be sent to the IETF Announcement List.
>
> Please note that all Internet-Drafts offered for publication
> as RFCs must conform to the requirements specified in ID Nits
> (http://www.ietf.org/ID-nits.html) or they will be returned
> to the author(s) for revision. Therefore, the IETF Secretariat
> strongly recommends that you address all of the issues raised
> in this document before submitting a request to publish your
> Internet-Draft to the IESG.
>
> The IETF Secretariat
>




Internet Draft Garth Gibson
Expires: August 2004 Panasas Inc. & CMU
Peter Corbett
Network Appliance, Inc.

Document: draft-gibson-pnfs-problem-statement-00.txt February 2004




pNFS Problem Statement


Status of this Memo

This document is an Internet-Draft and is in full conformance with
all provisions of Section 10 of RFC2026.

Internet-Drafts are working documents of the Internet Engineering
Task Force (IETF), its areas, and its working groups. Note that
other groups may also distribute working documents as Internet-
Drafts.

Internet-Drafts are draft documents valid for a maximum of six months
and may be updated, replaced, or obsoleted by other documents at any
time. It is inappropriate to use Internet-Drafts as reference
material or to cite them other than as "work in progress."

The list of current Internet-Drafts can be accessed at
http://www.ietf.org/ietf/1id-abstracts.txt

The list of Internet-Draft Shadow Directories can be accessed at
http://www.ietf.org/shadow.html.





Copyright Notice

Copyright (C) The Internet Society (2004). All Rights Reserved.











Gibson et al Expires - August 2004 [Page 1]
Internet Draft pNFS Problem Statement February 2004



Abstract

This draft considers the problem of limited bandwidth to NFS servers.
The bandwidth limitation exists because an NFS server has limited
network, CPU, memory and disk I/O resources. Yet, access to any one
file system through the NFSv4 protocol requires that a single server
be accessed. While NFSv4 allows file system migration, it does not
provide a mechanism that supports multiple servers simultaneously
exporting a single writable file system.

This problem has become aggravated in recent years with the advent of
very cheap and easily expanded clusters of application servers that
are also NFS clients. The aggregate bandwidth demands of such
clustered clients, typically working on a shared data set
preferentially stored in a single file system, can increase much more
quickly than the bandwidth of any server. The proposed solution is
to provide for the parallelization of file services, by enhancing
NFSv4 in a minor version.


Table of Contents

1. Introduction...................................................2
2. Bandwidth Scaling in Clusters..................................4
3. Clustered Applications.........................................4
4. Existing File Systems for Clusters.............................6
5. Eliminating the Bottleneck.....................................7
6. Separated control and data access techniques...................8
7. Security Considerations........................................9
8. Informative References.........................................9
9. Acknowledgments...............................................11
10. Author's Addresses...........................................11
11. Full Copyright Statement.....................................11


1. Introduction

The storage I/O bandwidth requirements of clients are rapidly
outstripping the ability of network file servers to supply them.
Increasingly, this problem is being encountered in installations
running the NFS protocol. The problem can be solved by increasing
the server bandwidth. This draft suggests that an effort be mounted
to enable NFS file service to scale with its clusters of clients.
The proposed approach is to increase the aggregate bandwidth possible
to a single file system by parallelizing the file service, resulting
in multiple network connections to multiple server endpoints
participating in the transfer of requested data. This should be



Gibson et al Expires - August 2004 [Page 2]
Internet Draft pNFS Problem Statement February 2004


achievable within the framework of NFS, possibly in a minor version
of the NFSv4 protocol.

In many application areas, single system servers are rapidly being
replaced by clusters of inexpensive commodity computers. As
clustering technology has improved, the barriers to running
application codes on very large clusters have been lowered. Examples
of application areas that are seeing the rapid adoption of scalable
client clusters are data intensive applications such as genomics,
seismic processing, data mining, content and video distribution, and
high performance computing. The aggregate storage I/O requirements of
a cluster can scale proportionally to the number of computers in the
cluster. It is not unusual for clusters today to make bandwidth
demands that far outstrip the capabilities of traditional file
servers. A natural solution to this problem is to enable file
service to scale as well, by increasing the number of server nodes
that are able to service a single file system to a cluster of
clients.

Scalable bandwidth can be claimed by simply adding multiple
independent servers to the network. Unfortunately, this leaves to
file system users the task of spreading data across these independent
servers. Because the data processed by a given data-intensive
application is usually logically associated, users routinely co-
locate this data in a single file system, directory or even a single
file. The NFSv4 protocol currently requires that all the data in a
single file system be accessible through a single exported network
endpoint, constraining access to be through a single NFS server.

A better way of increasing the bandwidth to a single file system is
to enable access to be provided through multiple endpoints in a
coordinated or coherent fashion. Separation of control and data
flows provides a straightforward framework to accomplish this, by
allowing transfers of data to proceed in parallel from many clients
to many data storage endpoints. Control and file management
operations, inherently more difficult to parallelize, can remain the
province of a single NFS server, inheriting the simple management of
today's NFS file service, while offloading data transfer operations
allows bandwidth scalability. Data transfer may be done using NFS or
other protocols, such as iSCSI.

While NFS is a widely used network file system protocol, most of the
world's data resides in data stores that are not accessible through
NFS. Much of this data is stored in Storage Area Networks,
accessible by SCSI's Fibre Channel Protocol (FCP), or increasingly,
by iSCSI. Storage Area Networks routinely provide much higher data
bandwidths than do NFS file servers. Unfortunately, the simple array
of blocks interface into Storage Area Networks does not lend itself
to controlling multiple clients that are simultaneously reading and


Gibson et al Expires - August 2004 [Page 3]
Internet Draft pNFS Problem Statement February 2004


writing the blocks of the same or different files, a workload usually
referred to as data sharing. NFS file service, with its hierarchical
namespace of separately controlled files, offers simpler and more
cost-effective management. One might conclude that users must chose
between high bandwidth and data sharing. Not only is this conclusion
false, but it should also be possible to allow data stored in SAN
devices, FCP or iSCSI, to be accessed under the control of an NFS
server. Such an approach protects the industry's large investment in
NFS, since the bandwidth bottleneck no longer needs to drive users to
adopt a proprietary alternative solution, and leverages SAN storage
infrastructures, all within a common architectural framework.


2. Bandwidth Scaling in Clusters

When applied to data-intensive applications, clusters can generate
unprecedented demand for storage bandwidth. At present, each node in
the cluster is likely to be a dual processor, with each processor
running at multiple GHz, with gigabytes of DRAM. Depending on the
specific application, each node is capable of sustaining a demand of
10s to 100s of MB/s of data from storage. In addition, the number of
nodes in a cluster is commonly in the 100s, with many instances of
1000s to 10,000s of nodes. The result is that storage systems may be
called upon to provide an aggregate bandwidth of GB/s ranging upwards
toward TB/s.

The performance of a single NFS server has been improving, but it is
not able to keep pace with cluster demand. Directly connected storage
devices behind an NFS server have given way to disk arrays and
networked disk arrays, making it now possible for an NFS server to
directly access 100s to 1000s of disk drives whose aggregate capacity
reaches upwards to PBs and whose raw bandwidths range upwards to 10s
of GB/s.

An NFS server is interposed between the scalable storage subsystem
and the scalable client cluster. Multiple NIC endpoints help network
bandwidth keep up with DRAM bandwidth. However, the rate of
improvement of NFS server performance is not faster than the rate of
improvement in each client node. As long as an NFS file system is
associated with a single client-side network endpoint, the aggregate
capabilities of a single NFS server to move data between storage
networks and client networks will not be able to keep pace with the
aggregate demand of clustered clients and large disk subsystems.


3. Clustered Applications

Large datasets and high bandwidth processing of large datasets are
increasingly common in a wide variety of applications. As most


Gibson et al Expires - August 2004 [Page 4]
Internet Draft pNFS Problem Statement February 2004


computer users can affirm, the size of everyday presentations,
pictures and programs seems to grow continuously, and in fact average
file size does grow with time [Ousterhout85, Baker91]. Simple
copying, viewing, archiving and sharing of even this baseline use of
growing files in day-to-day business and personal computing drives up
the bandwidth demand on servers.

Some applications, however, make much larger demands on file and file
system capacity and bandwidth. Databases of DNA sequences, used in
bioinformatics search, range up to tens of GBs and are often in use
by all cluster users are the same time [NIH03]. These huge files may
experience bursts of many concurrent clients loading the whole file
independently.

Bioinformatics is an example of extensive search in science
application. Extensive search is much broader than science. Wall
Street has taken to collecting long-term transaction record
histories. Looking for patterns of unbilled transactions, fraud or
predictable market trends is a growing financial opportunity
[Agarwal95, Senator95].

Security and authentication are driving a need for image search, such
as face recognition [Flickner95]. Databasing the faces of approved
or suspected individuals and searching through many camera feeds
involves huge data and bandwidths. Traditional database indexing in
these high dimension data structures often fails to avoid full
database scans of these huge files [Berchtold97].

With huge storage repositories and fast computers, huge sensor
capture is increasingly used in many applications. Consumer digital
photography fits this model, with photo touch-up and slide show
generation tools driving bandwidth, although much more demanding
applications are not unusual.

Medical test imagery is being captured at very high resolution and
tools are being developed for automatic preliminary diagnosis, for
example [Afework98]. In the science world, even larger datasets are
captured from satellites, telescopes, and atom-smashers, for example
[Greiman97]. Preliminary processing of a sky survey suggests that
thousand node clusters may sustain GB/s storage bandwidths [Gray03].
Seismic trace data, often measured in helicopter loads, commands
large clusters for days to months [Knott03].

At the high end of science application, accurate physical simulation,
its visualization and fault-tolerance checkpointing, has been
estimated to need 10 GB/s bandwidth and 100 TB of capacity for every
thousand nodes in a cluster [SGPFS01].




Gibson et al Expires - August 2004 [Page 5]
Internet Draft pNFS Problem Statement February 2004


Most of these applications make heavy use of shared data across many
clients, users and applications, have limited budgets available to
fund aggressive computational goals, and have technical or scientific
users with strong preferences for file systems and no patience for
tuning storage. NFS file service, appropriately scaled up in
capacity and bandwidth, is highly desired.

In addition to these search, sensor and science applications,
traditional database applications are increasingly employing NFS
servers. These applications often have hotspot tables, leading to
high bandwidth storage demands. Yet SAN-based solutions are
sometimes harder to manage than NFS based solutions, especially in
databases with a large number of tables. NFS servers with scalable
bandwidth would accelerate the adoption of NFS for database
applications.

These examples suggest that there is no shortage of applications
frustrated by the limitations of a single network endpoint on a
single NFS server exporting a single file system or single huge file.


4. Existing File Systems for Clusters

The server bottleneck has induced various vendors to develop
proprietary alternatives to NFS.

Known variously as asymmetric, out-of-band, clustered or SAN file
systems, these proprietary alternatives exploit the scalability of
storage networks by attaching all nodes in the client cluster to the
storage network. Then, by reorganizing client and server code
functionality to separate data traffic from control traffic, client
nodes are able to access storage devices directly rather than
requesting all data from the same single network endpoint in the file
server that handles control traffic.

Most proprietary alternative solutions have been tailored to storage
area networks based on the fixed-sized block SCSI storage device
command set and its Fibrechannel SCSI transport. Examples in this
class include EMC's High Road (www.emc.com); IBM's TotalStorage SAN
FS, SANergy and GPFS (www.ibm.com); Sistina/Redhat's GFS
(www.readhat.com); SGI's CXFS (www.sgi.com); Veritas' SANPoint Direct
and CFS (www.veritas.com); and Sun's QFS (www.sun.com). The
Fibrechannel SCSI transport used in these systems may soon be
replaceable by a TCP/IP SCSI transport, iSCSI, enabling these
proprietary alternatives to operate on the same equipment and IETF
protocols commonly used by NFS servers.

While fixed-sized block SCSI storage devices are used in most file
systems with separated data and control paths, this is not the only


Gibson et al Expires - August 2004 [Page 6]
Internet Draft pNFS Problem Statement February 2004


alternative available today. SCSI's newly emerging command set, the
Object Storage Device (OSD) command set, transmits variable length
storage objects over SCSI transports [T10-03]. Panasas' ActiveScale
storage cluster employs a proto-OSD command set over iSCSI on its
separated data path (www.panasas.com). IBM's research is also
demonstrating a variant of their TotalStorage SAN FS employing proto-
OSD commands [Azagury02].

Even more distinctive is Zforce's File Switch technology
(www.zforce.com). Zforce virtualizes a CIFS file server spreading
the contents of a file share over many backend CIFS storage servers
and places their control path functionality inside a network switch
in order to have some of the properties of both separated and non-
separated data and control paths. However, striping files over
multiple file-based storage servers is not a new concept. Berkeley's
Zebra file system, the successor to the log-based file system
developed for RAID storage, had a separated data and control path
with file protocols to both [Hartman95].


5. Eliminating the Bottleneck

The restriction of a single network endpoint results from the way NFS
associates file servers and file systems. Essentially, each client
machine "mounts" each exported file system; these mount operations
bind a network endpoint to all files in the exported file system,
instructing the client to address that network endpoint with all
requests associated with all files in that file system. Mechanisms
intended for primarily for failover have been established for giving
clients a list of network endpoints associated with a given file
system.

Multiple NFS servers can be used instead of a single NFS server, and
many cluster administrators, programmers and end-users have
experimented with this alternative. The principle compromise
involved in exploiting multiple NFS servers is that a single file or
single file system is decomposed into multiple files or file systems,
respectively. For instance, a single file can be decomposed into many
files, each located in a part of the namespace that is exported by a
different NFS server; or the files of a single directory can be
linked to files in directories located in file systems exported by
different NFS servers. Because this decomposition is done without
NFS server support, the work of decomposing and recomposing and the
implications of the decomposition on capacity and load balancing,
backup consistency, error recovery, and namespace management all fall
to the customer. Moreover, the additional statefulness of NFSv4 makes
correct semantics for files decomposed over multiple services without
NFS support much more complex. Such extra work and extra problems are



Gibson et al Expires - August 2004 [Page 7]
Internet Draft pNFS Problem Statement February 2004


usually referred to as storage management costs, and are blamed for
causing a high total cost of ownership for storage.

Preserving the relative ease of use of NFS storage systems requires
solutions to the bandwidth bottleneck that do not decompose files and
directories in the file subtree namespace.
A solution to this problem should continue to use the existing single
network endpoint for control traffic, including namespace
manipulations. Decompositions of individual files and file systems
over multiple network endpoints can be provided via the separated
data paths, without separating the control and metadata paths.


6. Separated control and data access techniques

Separating storage data flow from file system control flow
effectively moves the bottleneck away from the single endpoint of an
NFS server and distributes it across the bisectional bandwidth of the
storage network between the cluster nodes and storage devices. Since
switch bandwidths of upwards of terabits per second are available
today, this bottleneck is at least two orders of magnitude better
than that of an NFS server network endpoint.

In an architecture that separates the storage data path from the NFS
control path there are choices of protocol for the data path. One
straightforward answer is to extend the NFS protocol so it can
accommodate can be used on both control and separated data paths.
Another straightforward answer is to capture the existing market's
dominant separated data path, fixed-sized block SCSI storage. A third
alternative is the emerging object storage SCSI command set, OSD,
which is appearing in new products with separate data and control
paths.

A solution that accommodates all of these approaches provides the
broadest applicability for NFS. Specifically, NFS extensions should
make minimal assumptions about the storage data server access
protocol. The clients in such an extended NFS system should be
compatible with the current NFSv4 protocol, and should be compatible
with earlier versions of NFS as well. A solution should be capable
of providing both asymmetric data access, with the data path
connected via NFS or other protocols and transports, and symmetric
parallel access to servers that run NFS on each server node.
Specifically, it is desirable to enable NFS to manage asymmetric
access to storage attached via iSCSI and Fibre Channel/SCSI storage
area networks.

As previously discussed, the root cause of the NFS server bottleneck
is the binding between one network endpoint and all the files in a
file system. NFS extensions can allow the association of additional


Gibson et al Expires - August 2004 [Page 8]
Internet Draft pNFS Problem Statement February 2004


network endpoints with specific files. These associations could be
represented as layout maps [Gibson98]. NFS clients could be extended
to have the ability to retrieve and use these layout maps.

NFSv4 provides an excellent foundation for this. We may be able to
extend the current notion of file delegations to include the ability
to retrieve and utilize a file layout map. A number of ideas have
been proposed for storing, accessing, and acting upon layout
information stored by NFS servers to allow separate access to file
data over separate data paths. Data access can be supported over
multiple protocols, including NFSv4, iSCSI, and OSD.

7. Security Considerations

Bandwidth scaling solutions that employ separation of control and
data paths will introduce new security concerns. For example, the
data access methods will require authentication and access control
mechanisms that are consistent with the primary mechanisms on the
NFSv4 control paths. Object storage employs revocable cryptographic
restrictions on each object, which can be created and revoked in the
control path. With iSCSI access methods, iSCSI security capabilities
are available, but do not contain NFS access control. Fibre Channel
based SCSI access methods have less sophisticated security than
iSCSI. These access methods typically use private networks to
provide security.

Any proposed solution must be analyzed for security threats and any
such threats must be addressed. The IETF and the NFS working group
have significant expertise in this area.


8. Informative References

[Afework98] A. Afework, M. Beynon, F. Bustamonte, A. Demarzo, R.
Ferriera, R. Miller, M. Silberman, J. Saltz, A. Sussman, H. Tang,
"Digital dynamic telepathology - the virtual microscope," Proc. of
the AMIA'98 Fall Symposium 1998.

[Agarwal95] Agrawal, R. and Srikant, R. "Fast Algorithms for Mining
Association Rules" VLDB, September 1995.

[Azagury02] Azagury, A., Dreizin, V., Factor, M., Henis, E., Naor,
D., Rinetzky, N., Satran, J., Tavory, A., Yerushalmi, L, "Towards
an Object Store," IBM Storage Systems Technology Workshop,
November 2002.

[Baker91] Baker, M.G., Hartman, J.H., Kupfer, M.D., Shirriff, K.W.
and Ousterhout, J.K. "Measurements of a Distributed File System"
SOSP, October 1991.


Gibson et al Expires - August 2004 [Page 9]
Internet Draft pNFS Problem Statement February 2004



[Berchtold97] Berchtold, S., Boehm, C., Keim, D.A. and Kriegel, H. "A
Cost Model For Nearest Neighbor Search in High-Dimensional Data
Space" ACM PODS, May 1997.

[Fayyad98] Fayyad, U. "Taming the Giants and the Monsters: Mining
Large Databases for Nuggets of Knowledge" Database Programming and
Design, March 1998.

[Flickner95] Flickner, M., Sawhney, H., Niblack, W., Ashley, J.,
Huang, Q., Dom, B., Gorkani, M., Hafner, J., Lee, D., Petkovic,
D., Steele, D. and Yanker, P. "Query by Image and Video Content:
the QBIC System" IEEE Computer, September 1995.

[Gibson98] Gibson, G. A., et. al., "A Cost-Effective, High-Bandwidth
Storage Architecture," International Conference on Architectural
Support for Programming Languages and Operating Systems (ASPLOS),
October 1998.

[Gray03] Jim Gray, "Distributed Computing Economics," Technical
Report MSR-TR-2003-24, March 2003.

[Greiman97] Greiman, W., W. E. Johnston, C. McParland, D. Olson, B.
Tierney, C. Tull, "High-Speed Distributed Data Handling for HENP,"
Computing in High Energy Physics, April, 1997. Berlin, Germany.

[Hartman95] John H. Hartman and John K. Ousterhout, "The Zebra
Striped Network File System," ACM Transactions on Computer Systems
13, 3, August 1995.

[Knott03] Knott, T., "Computing colossus," BP Frontiers magazine,
Issue 6, April 2003, http://www.bp.com/frontiers.

[NIH03] "Easy Large-Scale Bioinformatics on the NIH Biowulf
Supercluster," http://biowulf.nih.gov/easy.html, 2003.

[Ousterhout85] Ousterhout, J.K., DaCosta, H., Harrison, D., Kunze,
J.A., Kupfer, M. and Thompson, J.G. "A Trace Drive Analysis of the
UNIX 4.2 BSD FIle System" SOSP, December 1985.

[Senator95] Senator, T.E., Goldberg, H.G., Wooten, J., Cottini, M.A.,
Khan, A.F.U., Klinger, C.D., Llamas, W.M., Marrone, M.P. and Wong,
R.W.H. "The Financial Crimes Enforcement Network AI System (FAIS):
Identifying potential money laundering from reports of large cash
transactions" AIMagazine 16 (4), Winter 1995.

[SGPFS01] SGS File System RFP, DOE NNCA and DOD NSA, April 25, 2001.




Gibson et al Expires - August 2004 [Page 10]
Internet Draft pNFS Problem Statement February 2004


[T10-03] Draft OSD Standard, T10 Committee, Storage Networking
Industry Association(SNIA),
ftp://www.t10.org/ftp/t10/drafts/osd/osd-r08.pdf


9. Acknowledgments

David Black, Gary Grider, Benny Halevy, Dean Hildebrand, Dave Noveck,
Julian Satran, Tom Talpey, and Brent Welch contributed to the
development of this problem statement.


10. Author's Addresses

Garth Gibson
Panasas Inc, and Carnegie Mellon University
1501 Reedsdale Street
Pittsburgh, PA 15233 USA
Phone: +1 412 323 3500
Email: ggibson@panasas.com

Peter Corbett
Network Appliance Inc.
375 Totten Pond Road
Waltham, MA 02451 USA
Phone: +1 781 768 5343
Email: peter@pcorbett.net


11. Full Copyright Statement

Copyright (C) The Internet Society (2004). All Rights Reserved.

This document and translations of it may be copied and furnished to
others, and derivative works that comment on or otherwise explain it
or assist in its implementation may be prepared, copied, published
and distributed, in whole or in part, without restriction of any
kind, provided that the above copyright notice and this paragraph are
included on all such copies and derivative works.
However, this document itself may not be modified in any way, such as
by removing the copyright notice or references to the Internet
Society or other Internet organizations, except as needed for the
purpose of developing Internet standards in which case the procedures
for copyrights defined in the Internet Standards process must be
followed, or as required to translate it into languages other than
English.

The limited permissions granted above are perpetual and will not be
revoked by the Internet Society or its successors or assigns.


Gibson et al Expires - August 2004 [Page 11]
Internet Draft pNFS Problem Statement February 2004



This document and the information contained herein is provided on an
"AS IS" basis and THE INTERNET SOCIETY AND THE INTERNET ENGINEERING
TASK FORCE DISCLAIMS ALL WARRANTIES, EXPRESS OR IMPLIED, INCLUDING
BUT NOT LIMITED TO ANY WARRANTY THAT THE USE OF THE INFORMATION
HEREIN WILL NOT INFRINGE ANY RIGHTS OR ANY IMPLIED WARRANTIES OF
MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE.












































Gibson et al Expires - August 2004 [Page 12]


From Thomas.Talpey@netapp.com Tue Feb 10 09:27:36 2004
Return-Path: <Thomas.Talpey@netapp.com>
X-Sender: Thomas.Talpey@netapp.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 11044 invoked from network); 10 Feb 2004 17:27:33 -0000
Received: from unknown (66.218.66.166)
by m13.grp.scd.yahoo.com with QMQP; 10 Feb 2004 17:27:33 -0000
Received: from unknown (HELO mx01.netapp.com) (198.95.226.53)
by mta5.grp.scd.yahoo.com with SMTP; 10 Feb 2004 17:27:32 -0000
Received: from hawk.corp.netapp.com (hawk [10.57.156.122])
by mx01.netapp.com (8.12.10/8.12.10/NTAP-1.4) with ESMTP id i1AHQXKw020644;
Tue, 10 Feb 2004 09:26:33 -0800 (PST)
Received: from svlexc01.hq.netapp.com (svlexc01.corp.netapp.com [10.10.22.171])
by hawk.corp.netapp.com (8.12.9/8.12.9/NTAP-1.5) with ESMTP id i1AHQWBr025977;
Tue, 10 Feb 2004 09:26:32 -0800 (PST)
Received: from tmt.netapp.com ([10.97.1.34]) by silver.hq.netapp.com with Microsoft SMTPSVC(5.0.2195.5329); Tue, 10 Feb 2004 12:26:21 -0500
MIME-Version: 1.0
Content-Type: multipart/alternative;
boundary="----_=_NextPart_001_01C3EFFA.FFDC5C80"
X-MimeOLE: Produced By Microsoft Exchange V6.0.6249.0
content-class: urn:content-classes:message
Date: Tue, 10 Feb 2004 09:25:25 -0800
Message-ID: <5.2.1.1.2.20040210122105.00c43e08@silver.nane.netapp.com>
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
Thread-Topic: I-D ACTION:draft-gibson-pnfs-problem-statement-00.txt
Thread-Index: AcPv+wA+PZxVNM4nRKmu+s4AMQ1+KQ==
To: <pnfs-reqs@yahoogroups.com>, "Garth Gibson" <garth@panasas.com>
X-eGroups-Remote-IP: 198.95.226.53
From: "Talpey, Thomas" <Thomas.Talpey@netapp.com>
Subject: Fwd: I-D ACTION:draft-gibson-pnfs-problem-statement-00.txt
X-Yahoo-Group-Post: member; u=44154239
X-Yahoo-Profile: tmtymailu

Garth, if you haven't already, I think it would make sense for
you to forward this to the nfsv4 wg alias, with the suggestion
that it be discussed at the upcoming Korea meeting and the list.

What's the status on putting together a presentation?

Tom.

> ---------- Forwarded Message ----------
>Subject: I-D ACTION:draft-gibson-pnfs-problem-statement-00.txt
>Date: Mon, 9 Feb 2004 16:15:13 -0500
>From: <Internet-Drafts@ietf.org>
>Reply-To: <Internet-Drafts@ietf.org>
>
>A New Internet-Draft is available from the on-line Internet-Drafts directories.
>
>
>       Title           : pNFS Problem Statement
>       Author(s)       : G. Gibson
>       Filename        : draft-gibson-pnfs-problem-statement-00.txt
>       Pages           : 12
>       Date            : 2004-2-9
>      
>This draft considers the problem of limited bandwidth to NFS servers. 
>   The bandwidth limitation exists because an NFS server has limited
>   network, CPU, memory and disk I/O resources.  Yet, access to any one
>   file system through the NFSv4 protocol requires that a single server
>   be accessed.  While NFSv4 allows file system migration, it does not
>   provide a mechanism that supports multiple servers simultaneously
>   exporting a single writable file system.
>   
>   This problem has become aggravated in recent years with the advent of
>   very cheap and easily expanded clusters of application servers that
>   are also NFS clients.  The aggregate bandwidth demands of such
>   clustered clients, typically working on a shared data set
>   preferentially stored in a single file system, can increase much more
>   quickly than the bandwidth of any server.  The proposed solution is
>   to provide for the parallelization of file services, by enhancing
>   NFSv4 in a minor version.
>
>A URL for this Internet-Draft is:
>http://www.ietf.org/internet-drafts/draft-gibson-pnfs-problem-statement-00.txt
>
>To remove yourself from the IETF Announcement list, send a message to
>ietf-announce-request with the word unsubscribe in the body of the message.
>
>Internet-Drafts are also available by anonymous FTP. Login with the username
>"anonymous" and a password of your e-mail address. After logging in,
>type "cd internet-drafts" and then
>       "get draft-gibson-pnfs-problem-statement-00.txt".
>
>A list of Internet-Drafts directories can be found in
>http://www.ietf.org/shadow.html
>or ftp://ftp.ietf.org/ietf/1shadow-sites.txt
>
>
>Internet-Drafts can also be obtained by e-mail.
>
>Send a message to:
>       mailserv@ietf.org.
>In the body type:
>       "FILE /internet-drafts/draft-gibson-pnfs-problem-statement-00.txt".
>      
>NOTE:  The mail server at ietf.org can return the document in
>       MIME-encoded form by using the "mpack" utility.  To use this
>       feature, insert the command "ENCODING mime" before the "FILE"
>       command.  To decode the response(s), you will need "munpack" or
>       a MIME-compliant mail reader.  Different MIME-compliant mail readers
>       exhibit different behavior, especially when dealing with
>       "multipart" MIME messages (i.e. documents which have been split
>       up into multiple messages), so check your local documentation on
>       how to manipulate these messages.
>              
>              
>Below is the data which will enable a MIME compliant mail reader
>implementation to automatically retrieve the ASCII version of the
>Internet-Draft.
>
>
> ---------- End of Forwarded Message ---------- 

From garth@panasas.com Wed Feb 11 16:11:10 2004
Return-Path: <garth@panasas.com>
X-Sender: garth@panasas.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 92622 invoked from network); 12 Feb 2004 00:11:09 -0000
Received: from unknown (66.218.66.172)
by m8.grp.scd.yahoo.com with QMQP; 12 Feb 2004 00:11:09 -0000
Received: from unknown (HELO PIKES.panasas.com) (65.194.124.178)
by mta4.grp.scd.yahoo.com with SMTP; 12 Feb 2004 00:11:08 -0000
Received: from [127.0.0.1] ([172.17.19.3]) by PIKES.panasas.com with SMTP (Microsoft Exchange Internet Mail Service Version 5.5.2653.13)
id SVSY3LX3; Wed, 11 Feb 2004 19:11:05 -0500
Mime-Version: 1.0 (Apple Message framework v612)
Content-Transfer-Encoding: 7bit
Message-Id: <EC98CBEA-5CEF-11D8-B5DB-000A95A94F04@panasas.com>
Content-Type: text/plain; charset=US-ASCII; format=flowed
To: pnfs-reqs@yahoogroups.com
Date: Wed, 11 Feb 2004 16:10:54 -0800
X-Mailer: Apple Mail (2.612)
X-eGroups-Remote-IP: 65.194.124.178
From: Garth Gibson <garth@panasas.com>
Subject: concall tomorrow
X-Yahoo-Group-Post: member; u=169457820
X-Yahoo-Profile: garth_a_gibson

We'll hold a concall tomorrow 11am EST.

Agenda items can include:

- we need to convert the problem statement into a presentation (our
target was Feb 19)

- we need to identify who is giving the presentation at Seoul, if our
topic is given time

- should we give it at Connectathon next week, and if so, who will give
it (who is going?)

- its time to get back to the original roles of the mailing lists:
- a draft of a requirements doc
- a draft of the operations we suggest for NFSv4 extension
- a draft of the wire format of layout metadata for SBC (FC/SCSI)
backends
- a draft of the wire format of layout metadata for OSD backends
- a draft of the wire format of layout metadata for NFS backends

- get some milestones and dates beside the above

garth

From garth@panasas.com Wed Feb 11 16:16:57 2004
Return-Path: <garth@panasas.com>
X-Sender: garth@panasas.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 57531 invoked from network); 12 Feb 2004 00:16:56 -0000
Received: from unknown (66.218.66.167)
by m20.grp.scd.yahoo.com with QMQP; 12 Feb 2004 00:16:56 -0000
Received: from unknown (HELO PIKES.panasas.com) (65.194.124.178)
by mta6.grp.scd.yahoo.com with SMTP; 12 Feb 2004 00:16:55 -0000
Received: from [127.0.0.1] ([172.17.19.3]) by PIKES.panasas.com with SMTP (Microsoft Exchange Internet Mail Service Version 5.5.2653.13)
id SVSY3LY1; Wed, 11 Feb 2004 19:16:52 -0500
Mime-Version: 1.0 (Apple Message framework v612)
To: pnfs-reqs@yahoogroups.com
Message-Id: <BC58308B-5CF0-11D8-B5DB-000A95A94F04@panasas.com>
Content-Type: multipart/mixed; boundary=Apple-Mail-3--546444835
Date: Wed, 11 Feb 2004 16:16:42 -0800
X-Mailer: Apple Mail (2.612)
X-eGroups-Remote-IP: 65.194.124.178
From: Garth Gibson <garth@panasas.com>
Subject: Fwd: [pnfs-reqs] RE: NEPS-REQS: getting started
X-Yahoo-Group-Post: member; u=169457820
X-Yahoo-Profile: garth_a_gibson

Reminder -- David's note on the presentation we need for IETF, and his
example.

Begin forwarded message:

> From: black_david@emc.com
> Date: December 23, 2003 12:32:59 PM PST
> To: pnfs-reqs@yahoogroups.com
> Subject: [pnfs-reqs] RE: NEPS-REQS: getting started
> Reply-To: pnfs-reqs@yahoogroups.com
>
> Garth Gibson wrote:
>
>>> The RDDP problem statement is similar and dissimilar to what we are
>>> doing. It is similar in that it is about higher performance, which
>>> always turns out to be cost-performance. It is dissimilar in that it
>>> was fighting an uphill battle to get RDMA into the IETF, while we are
>>> looking at no preconceived support or opposition in the IETF (that I
>>> am aware of). And it is dissimilar in that what we are proposing
>>> helps in the manageability of federated systems, which is not really
>>> a
>>> performance issue.
>>>
>>> I followed the RDDP example closely because it was easy -- our
>>> arguments on strictly bandwidth are at least as strong, in my
>>> opinion.
>>> And because I am not certain how to predict the IETF management's
>>> reaction to a manageability argument. And the standardized client
>>> code argument, although very import to some of us, seemed outside my
>>> notion of the IETF scope.
>>>
>>> Perhaps those with more experience selling ideas to the IETF could
>>> educate us? Should we focus on a small number of the most easily
>>> demonstrated problems or fill the problem statement out with all the
>>> problems we can contribute to solving?
>
> Having been heavily involved in getting both IPS and RDDP work underway
> in the IETF, I have a few observations:
>
> - A problem statement draft is a good thing to have, but the folks in
> charge of the IETF are looking for a concise summary of what the
> problem is, how to go about solving it, and **why** the IETF should
> solve it. The latter is of particular importance, as I'll
> explain shortly.
> - I've attached a slide deck that I used for RDDP at the Spring 2002
> IETF BOF on this topic. This sort of "elevator pitch" style
> coverage of the topics is needed in addition to the more in-depth
> academic approach that is in the RDDP problem statement.
> - Goals and battles need to be chosen carefully. One of the things
> that delayed RDDP work is that the RDDP proponents were
> absolutely
> convinced that they needed to change TCP, and hence decided to go
> to battle with the IETF Transport community which was equally
> convinced that TCP should not be changed. In 20/20 hindsight,
> this was a mistake, as the IETF Transport community turned out
> to be correct that TCP does not require normative changes for RDDP.
> - Nonetheless, there is somewhat of an "uphill battle" to be engaged,
> as
> Beepy and/or Spencer described in Ann Arbor - the IETF has grown to
> a potentially unwieldy size, and as a consequence has developed a
> healthy institutional bias against new work. As a result, it is
> necessary to have good reasons not only for why work should be
> done, but also why it should be done in the IETF. The fact that
> we want to extend an existing IETF protocol (NFSv4) in a way that
> can take advantage of another (iSCSI) provides at least two reasons.
> Beyond this, there is value in drawing on the IETF's network
> expertise
> in areas such as security.
> - A draft WG statement/scope of work is very important at an early
> stage,
> including not only what we want to do, but what we do *not* want to
> do. I tend to view the latter as more important, as a shared view
> of what will not be worked on is a significant sign that a technical
> community has coalesced around a common effort and goals. For
> example,
> there are fairly strong statements about work that is out of scope
> in
> both the IPS and RDDP charters, and as a WG chair, I've found those
> statements useful from time to time ...
>
> I hope this helps,
> --David
> ----------------------------------------------------
> David L. Black, Senior Technologist
> EMC Corporation, 176 South St., Hopkinton, MA 01748
> +1 (508) 293-7953 FAX: +1 (508) 293-7786
> black_david@emc.com Mobile: +1 (978) 394-7754
> ----------------------------------------------------
>
>
> To unsubscribe from this group, send an email to:
> pnfs-reqs-unsubscribe@yahoogroups.com
>
>
>
> ------------------------ Yahoo! Groups Sponsor
> ---------------------~-->
> Buy Ink Cartridges or Refill Kits for your HP, Epson, Canon or Lexmark
> Printer at MyInks.com. Free s/h on orders $50 or more to the US &
> Canada.
> http://www.c1tracking.com/l.asp?cid=5511
> http://us.click.yahoo.com/mOAaAA/3exGAA/qnsNAA/W6uqlB/TM
> ---------------------------------------------------------------------
> ~->
>
> Yahoo! Groups Links
>
> To visit your group on the web, go to:
> http://groups.yahoo.com/group/pnfs-reqs/
>
> To unsubscribe from this group, send an email to:
> pnfs-reqs-unsubscribe@yahoogroups.com
>
> Your use of Yahoo! Groups is subject to:
> http://docs.yahoo.com/info/terms/
>
>


Attachment (not stored)
ROI-Problem-Scenario-0302.ppt
Type: application/vnd.ms-powerpoint

From ggrider@lanl.gov Wed Feb 11 21:55:17 2004
Return-Path: <ggrider@lanl.gov>
X-Sender: ggrider@lanl.gov
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 3577 invoked from network); 12 Feb 2004 05:55:16 -0000
Received: from unknown (66.218.66.172)
by m13.grp.scd.yahoo.com with QMQP; 12 Feb 2004 05:55:16 -0000
Received: from unknown (HELO mailwasher-b.lanl.gov) (192.16.0.25)
by mta4.grp.scd.yahoo.com with SMTP; 12 Feb 2004 05:55:15 -0000
Received: from mailrelay3.lanl.gov (localhost.localdomain [127.0.0.1])
by mailwasher-b.lanl.gov (8.12.10/8.12.10/(ccn-5)) with ESMTP id i1C5tEHR023258
for <pnfs-reqs@yahoogroups.com>; Wed, 11 Feb 2004 22:55:14 -0700
Received: from cic-mail.lanl.gov (localhost.localdomain [127.0.0.1])
by mailrelay3.lanl.gov (8.12.10/8.12.10/(ccn-5)) with ESMTP id i1C5tEeI025221
for <pnfs-reqs@yahoogroups.com>; Wed, 11 Feb 2004 22:55:14 -0700
Received: from cthulu.lanl.gov (vpn-client-141.lanl.gov [128.165.253.141])
by cic-mail.lanl.gov (8.12.10/8.12.10/(ccn-5)) with ESMTP id i1C5t2Yi014519
for <pnfs-reqs@yahoogroups.com>; Wed, 11 Feb 2004 22:55:12 -0700
Message-Id: <5.2.0.9.2.20040211225331.015c28f0@cic-mail.lanl.gov>
X-Sender: ggrider@cic-mail.lanl.gov
X-Mailer: QUALCOMM Windows Eudora Version 5.2.0.9
Date: Wed, 11 Feb 2004 22:55:03 -0700
To: pnfs-reqs@yahoogroups.com
In-Reply-To: <BC58308B-5CF0-11D8-B5DB-000A95A94F04@panasas.com>
Mime-Version: 1.0
Content-Type: multipart/mixed;
boundary="=====================_9859727==_"
X-Scanned-By: MIMEDefang 2.35
X-eGroups-Remote-IP: 192.16.0.25
From: Gary Grider <ggrider@lanl.gov>
Subject: Re: Fwd: [pnfs-reqs] RE: NEPS-REQS: getting started
X-Yahoo-Group-Post: member; u=169341185
X-Yahoo-Profile: ggriderpnfs

elevator pitch

Gary

Attachment (not stored)
pNFS-elevator-pitch.ppt
Type: application/octet-stream


From pcorbett@netapp.com Thu Feb 12 07:22:44 2004
Return-Path: <Peter.Corbett@netapp.com>
X-Sender: Peter.Corbett@netapp.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 66471 invoked from network); 12 Feb 2004 15:22:43 -0000
Received: from unknown (66.218.66.218)
by m8.grp.scd.yahoo.com with QMQP; 12 Feb 2004 15:22:43 -0000
Received: from unknown (HELO mx01.netapp.com) (198.95.226.53)
by mta3.grp.scd.yahoo.com with SMTP; 12 Feb 2004 15:22:43 -0000
Received: from frejya.corp.netapp.com (frejya [10.57.157.119])
by mx01.netapp.com (8.12.10/8.12.10/NTAP-1.4) with ESMTP id i1CFMIJC026157
for <pnfs-reqs@yahoogroups.com>; Thu, 12 Feb 2004 07:22:18 -0800 (PST)
Received: from svlexc01.hq.netapp.com (svlexc01.corp.netapp.com [10.10.22.171])
by frejya.corp.netapp.com (8.12.9/8.12.9/NTAP-1.5) with ESMTP id i1CFMIiJ004972
for <pnfs-reqs@yahoogroups.com>; Thu, 12 Feb 2004 07:22:18 -0800 (PST)
content-class: urn:content-classes:message
MIME-Version: 1.0
Content-Type: text/plain;
charset="us-ascii"
Content-Transfer-Encoding: quoted-printable
X-MimeOLE: Produced By Microsoft Exchange V6.0.6249.0
Date: Thu, 12 Feb 2004 07:22:13 -0800
Message-ID: <C8CF60CFC4D8A74E9945E32CF096548A011DE886@silver.nane.netapp.com>
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
Thread-Topic: Fwd: [pnfs-reqs] RE: NEPS-REQS: getting started
Thread-Index: AcPxLMx2H5E+soocRACCb74eZ9kJkQATki0A
To: <pnfs-reqs@yahoogroups.com>
X-eGroups-Remote-IP: 198.95.226.53
X-eGroups-From: "Corbett, Peter" <Peter.Corbett@netapp.com>
From: "Corbett, Peter" <pcorbett@netapp.com>
Subject: RE: Fwd: [pnfs-reqs] RE: NEPS-REQS: getting started
X-Yahoo-Group-Post: member; u=44152959
X-Yahoo-Profile: pfcorbett2004

The slides mention server bypass twice, but don't talk about parallel
access. Our problem statement is much more focussed on parallel access.
We are trying to define a standard for parallel access, whether that is
to nfs based data servers, object servers, virtualized SAN or
non-virtualized SAN devices. So, I think we need to be clearer when
talking about bypassing the server that we are really talking about
direct access to a parallel data store from clustered clients, with a
shared NFS server acting as a metadata server. To me, that is the key
point that applies across the entire solution space, whereas direct
access to devices is a data access technique in part of the solution
space.

-----Original Message-----
From: Gary Grider [mailto:ggrider@lanl.gov]
Sent: Thursday, February 12, 2004 12:55 AM
To: pnfs-reqs@yahoogroups.com
Subject: Re: Fwd: [pnfs-reqs] RE: NEPS-REQS: getting started


elevator pitch

Gary


Yahoo! Groups Links

From margaret.susairaj@oracle.com Thu Feb 12 07:33:55 2004
Return-Path: <Margaret.Susairaj@oracle.com>
X-Sender: Margaret.Susairaj@oracle.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 94736 invoked from network); 12 Feb 2004 15:33:53 -0000
Received: from unknown (66.218.66.217)
by m20.grp.scd.yahoo.com with QMQP; 12 Feb 2004 15:33:53 -0000
Received: from unknown (HELO agminet02.oracle.com) (141.146.126.229)
by mta2.grp.scd.yahoo.com with SMTP; 12 Feb 2004 15:33:53 -0000
Received: from rgmgw4.us.oracle.com (rgmgw4.us.oracle.com [138.1.191.13])
by agminet02.oracle.com (Switch-3.1.2/Switch-3.1.0) with ESMTP id i1CFVEcq006415
for <pnfs-reqs@yahoogroups.com>; Thu, 12 Feb 2004 07:31:52 -0800
Received: from rgmgw4.us.oracle.com (localhost [127.0.0.1])
by rgmgw4.us.oracle.com (Switch-2.1.5/Switch-2.1.0) with ESMTP id i1CFVDb23530
for <pnfs-reqs@yahoogroups.com>; Thu, 12 Feb 2004 08:31:13 -0700 (MST)
Received: from oracle.com (dhcp-amer-vpn-gw2-east-141-144-81-15.vpn.oracle.com [141.144.81.15])
by rgmgw4.us.oracle.com (Switch-2.1.5/Switch-2.1.0) with ESMTP id i1CFVDb23508
for <pnfs-reqs@yahoogroups.com>; Thu, 12 Feb 2004 08:31:13 -0700 (MST)
Message-ID: <402B9E6C.FD2926D8@oracle.com>
Date: Thu, 12 Feb 2004 07:40:28 -0800
Organization: Oracle Corporation
X-Mailer: Mozilla 4.7 [en] (WinNT; I)
X-Accept-Language: en
MIME-Version: 1.0
To: pnfs-reqs@yahoogroups.com
References: <EC98CBEA-5CEF-11D8-B5DB-000A95A94F04@panasas.com>
Content-Type: multipart/mixed;
boundary="------------ABD85AA3E0FD053C29F098BA"
X-Brightmail-Tracker: AAAAAQAAAAI=
X-White-List-Member: TRUE
X-eGroups-Remote-IP: 141.146.126.229
From: Margaret Susairaj <margaret.susairaj@oracle.com>
Subject: Re: [pnfs-reqs] concall tomorrow
X-Yahoo-Group-Post: member; u=175561634
X-Yahoo-Profile: msusaira

ADVERTISEMENT
Garth,
    I joined this group recently. What is the number I can call to attend the concall?

Regards,
Margaret
 

Garth Gibson wrote:

>  We'll hold a concall tomorrow 11am EST.
>
> Agenda items can include:
>
> - we need to convert the problem statement into a presentation (our
> target was Feb 19)
>
> - we need to identify who is giving the presentation at Seoul, if our
> topic is given time
>
> - should we give it at Connectathon next week, and if so, who will give
> it (who is going?)
>
> - its time to get back to the original roles of the mailing lists:
>       - a draft of a requirements doc
>       - a draft of the operations we suggest for NFSv4 extension
>       - a draft of the wire format of layout metadata for SBC (FC/SCSI)
> backends
>       - a draft of the wire format of layout metadata for OSD backends
>       - a draft of the wire format of layout metadata for NFS backends
>
> - get some milestones and dates beside the above
>
> garth 

From bhalevy@panasas.com Thu Feb 12 07:58:28 2004
Return-Path: <bhalevy@panasas.com>
X-Sender: bhalevy@panasas.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 65082 invoked from network); 12 Feb 2004 15:58:24 -0000
Received: from unknown (66.218.66.217)
by m16.grp.scd.yahoo.com with QMQP; 12 Feb 2004 15:58:24 -0000
Received: from unknown (HELO PIKES.panasas.com) (65.194.124.178)
by mta2.grp.scd.yahoo.com with SMTP; 12 Feb 2004 15:58:23 -0000
Received: by PIKES.panasas.com with Internet Mail Service (5.5.2653.19)
id <SVSY33QL>; Thu, 12 Feb 2004 10:58:07 -0500
Message-ID: <30489F1321F5C343ACF6872B2CF7942A05D38863@PIKES.panasas.com>
To: "'pnfs-reqs@yahoogroups.com'" <pnfs-reqs@yahoogroups.com>
Date: Thu, 12 Feb 2004 10:58:07 -0500
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2653.19)
Content-Type: multipart/alternative;
boundary="----_=_NextPart_001_01C3F181.01463780"
X-eGroups-Remote-IP: 65.194.124.178
From: "Halevy, Benny" <bhalevy@panasas.com>
Subject: RE: [pnfs-reqs] concall tomorrow
X-Yahoo-Group-Post: member; u=169276676
X-Yahoo-Profile: benny_halevy

1-800-387-6159 conf. id #5370035
 
Benny

    -----Original Message-----
    From: Margaret Susairaj [mailto:margaret.susairaj@oracle.com]
    Sent: Thursday, February 12, 2004 10:40 AM
    To: pnfs-reqs@yahoogroups.com
    Subject: Re: [pnfs-reqs] concall tomorrow

    Garth,
        I joined this group recently. What is the number I can call to attend the concall?

    Regards,
    Margaret
     

    Garth Gibson wrote:

>      We'll hold a concall tomorrow 11am EST.
>
>     Agenda items can include:
>
>     - we need to convert the problem statement into a presentation (our
>     target was Feb 19)
>
>     - we need to identify who is giving the presentation at Seoul, if our
>     topic is given time
>
>     - should we give it at Connectathon next week, and if so, who will give
>     it (who is going?)
>
>     - its time to get back to the original roles of the mailing lists:
>           - a draft of a requirements doc
>           - a draft of the operations we suggest for NFSv4 extension
>           - a draft of the wire format of layout metadata for SBC (FC/SCSI)
>     backends
>           - a draft of the wire format of layout metadata for OSD backends
>           - a draft of the wire format of layout metadata for NFS backends
>
>     - get some milestones and dates beside the above
>
>     garth 

From bhalevy@panasas.com Thu Feb 12 09:04:58 2004
Return-Path: <bhalevy@panasas.com>
X-Sender: bhalevy@panasas.com
X-Apparently-To: pnfs-reqs@groups.yahoo.com
Received: (qmail 50691 invoked from network); 12 Feb 2004 17:04:56 -0000
Received: from unknown (66.218.66.218)
by m6.grp.scd.yahoo.com with QMQP; 12 Feb 2004 17:04:56 -0000
Received: from unknown (HELO PIKES.panasas.com) (65.194.124.178)
by mta3.grp.scd.yahoo.com with SMTP; 12 Feb 2004 17:04:56 -0000
Received: by PIKES.panasas.com with Internet Mail Service (5.5.2653.19)
id <SVSY336R>; Thu, 12 Feb 2004 12:04:41 -0500
Message-ID: <30489F1321F5C343ACF6872B2CF7942A05D38869@PIKES.panasas.com>
To: "'pnfs-reqs@groups.yahoo.com'" <pnfs-reqs@yahoogroups.com>
Date: Thu, 12 Feb 2004 12:04:40 -0500
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2653.19)
Content-Type: text/plain;
charset="iso-8859-1"
X-eGroups-Remote-IP: 65.194.124.178
From: "Halevy, Benny" <bhalevy@panasas.com>
Subject: access to pnfs-* mail archives
X-Yahoo-Group-Post: member; u=169276676
X-Yahoo-Profile: benny_halevy

Following up on our conference call from today, people asked
what are all the pnfs-* lists and how to access their e-mail
archives.

The groups are pnfs-reqs where we discussed the problem statement
and will get deeper into the requirements draft.
pnfs-ops where we discuss the common extensions.
and pnfs-nfs, pnfs-obj, and pnfs-sbc where we intend to discuss
the specifics of each flavor such as layout format, addressing
scheme, security details, etc.

We have limited access to the groups' email archives for members only.
If you are interested in the archive but not in getting each posting
in (soft) real time for that list the membership configuration for
"Group Messages Delivery" allows you to select either "Individual Emails"
(default. you get it all), "Special Notices", "Daily Digest", or "No email".

All the groups can be accessed via http://groups.yahoo.com/group/<group-name>

Benny

--
Benny Halevy
Software Architect, Panasas Inc.
Delivering the premier storage system for scalable Linux clusters
http://www.panasas.com
bhalevy@panasas.com
tel: 412-323-6437
cell: 412-580-2520


From bwelch@panasas.com Thu Feb 12 09:20:33 2004
Return-Path: <welch@panasas.com>
X-Sender: welch@panasas.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 65152 invoked from network); 12 Feb 2004 17:20:32 -0000
Received: from unknown (66.218.66.217)
by m19.grp.scd.yahoo.com with QMQP; 12 Feb 2004 17:20:32 -0000
Received: from unknown (HELO medlicott.panasas.com) (63.80.58.202)
by mta2.grp.scd.yahoo.com with SMTP; 12 Feb 2004 17:20:32 -0000
Received: from panasas.com (welch@localhost)
by medlicott.panasas.com (8.11.6/8.11.6) with ESMTP id i1CHKVU02851
for <pnfs-reqs@yahoogroups.com>; Thu, 12 Feb 2004 09:20:31 -0800
Message-Id: <200402121720.i1CHKVU02851@medlicott.panasas.com>
X-Authentication-Warning: medlicott.panasas.com: welch owned process doing -bs
X-Mailer: exmh version 2.6.3 04/02/2003 with nmh-1.0.4
To: pnfs-reqs@yahoogroups.com
In-reply-to: <EC98CBEA-5CEF-11D8-B5DB-000A95A94F04@panasas.com>
References: <EC98CBEA-5CEF-11D8-B5DB-000A95A94F04@panasas.com>
Comments: In-reply-to Garth Gibson <garth@Panasas.Com>
message dated "Wed, 11 Feb 2004 16:10:54 -0800."
X-URL: http://www.panasas.com/
X-Face: "HxE|?EnC9fVMV8f70H83&{fgLE.|FZ^$>@Q(yb#N,Eh~N]e&]=>
r5~UnRml1:4EglY{9B+
:'wJq$@c_C!l8@<$t,{YUr4K,QJGHSvS~U]H`<+L*x?eGzSk>XH\W:AK\j?@?c1o<k;j'Ei/UL)!*0
ILwSR)J\bc)gjz!rrGQ2#i*f:M:ydhK}jp4dWQW?;0{,#iWrCV$4~%e/3)$1/D
Mime-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Date: Thu, 12 Feb 2004 09:20:31 -0800
X-eGroups-Remote-IP: 63.80.58.202
X-eGroups-From: Brent Welch <welch@panasas.com>
From: Brent Welch <bwelch@panasas.com>
Subject: Re: [pnfs-reqs] concall tomorrow
X-Yahoo-Group-Post: member; u=169551413
X-Yahoo-Profile: brent_welch_1960

Here are my notes from the call today. Present were
Brent Welch, Benny Halevy, Peter Corbett,
Dave Noveck, Tom Talpey, David Black

>>>Garth Gibson said:

> - we need to convert the problem statement into a presentation (our
> target was Feb 19)

We will work on this via email today and tomorrow as next week is
full of travel and vacation days for several of us. Look for another
email from me with a sketch talk outline (cannon fodder)

> - we need to identify who is giving the presentation at Seoul, if our
> topic is given time

Tom will give the talk. BP has been informed we want the slot,
which will probably be 10 minutes. There is 2 hours for NFSv4,
so it seems there will be ample room. Haven't gotten confirmation, yet.

> - should we give it at Connectathon next week, and if so, who will give
> it (who is going?)

Tom will be there Monday and part of Tuesday, and will try to give an
informal talk - the talk schedule is currently full except for overflow
time on Wednesday. To prime the pump, Tom is going to send the
problem statement (or a pointer) to the nfsv4 working group mailing list.

We'll use the same material as the IETF talk, (heavy overlap) but
the focus of the two talks will be different. IETF more about why
IETF should be interested. Connectathon, why this is technically cool.

> - its time to get back to the original roles of the mailing lists:
> - a draft of a requirements doc
> - a draft of the operations we suggest for NFSv4 extension
> - a draft of the wire format of layout metadata for SBC (FC/SCSI)
> backends
> - a draft of the wire format of layout metadata for OSD backends
> - a draft of the wire format of layout metadata for NFS backends
>
> - get some milestones and dates beside the above

We seemed mostly interested in focusing on the requirements doc next,
although we should at least have a sketch in place for the ops and
the metadata formats for the block/object/file mechanisms.

There was also brief discussion about the mailing lists. All the
traffic is on this the pnfs-reqs list right now, with some older
traffic on the pnfs-ops group. Eventually we expect most stuff
to transition to the nfsv4 list, but not until we get a more
official charter.

--
Brent Welch
Software Architect, Panasas Inc
Delivering the premier storage system for scalable Linux clusters

www.panasas.com
welch@panasas.com

From ggrider@lanl.gov Thu Feb 12 15:31:38 2004
Return-Path: <ggrider@lanl.gov>
X-Sender: ggrider@lanl.gov
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 16743 invoked from network); 12 Feb 2004 23:31:38 -0000
Received: from unknown (66.218.66.218)
by m12.grp.scd.yahoo.com with QMQP; 12 Feb 2004 23:31:38 -0000
Received: from unknown (HELO mailwasher-b.lanl.gov) (192.16.0.25)
by mta3.grp.scd.yahoo.com with SMTP; 12 Feb 2004 23:31:37 -0000
Received: from mailrelay3.lanl.gov (localhost.localdomain [127.0.0.1])
by mailwasher-b.lanl.gov (8.12.10/8.12.10/(ccn-5)) with ESMTP id i1CNVaHR032738
for <pnfs-reqs@yahoogroups.com>; Thu, 12 Feb 2004 16:31:36 -0700
Received: from cic-mail.lanl.gov (localhost.localdomain [127.0.0.1])
by mailrelay3.lanl.gov (8.12.10/8.12.10/(ccn-5)) with ESMTP id i1CNVaeI021291
for <pnfs-reqs@yahoogroups.com>; Thu, 12 Feb 2004 16:31:36 -0700
Received: from cthulu.lanl.gov (vpn-client-224.lanl.gov [128.165.253.224])
by cic-mail.lanl.gov (8.12.10/8.12.10/(ccn-5)) with ESMTP id i1CNVYYi004263
for <pnfs-reqs@yahoogroups.com>; Thu, 12 Feb 2004 16:31:35 -0700
Message-Id: <5.2.0.9.2.20040212162218.0160a6c8@cic-mail.lanl.gov>
X-Sender: ggrider@cic-mail.lanl.gov
X-Mailer: QUALCOMM Windows Eudora Version 5.2.0.9
Date: Thu, 12 Feb 2004 16:31:34 -0700
To: pnfs-reqs@yahoogroups.com
In-Reply-To: <EC98CBEA-5CEF-11D8-B5DB-000A95A94F04@panasas.com>
Mime-Version: 1.0
Content-Type: multipart/alternative;
boundary="=====================_26921190==.ALT"
X-Scanned-By: MIMEDefang 2.35
X-eGroups-Remote-IP: 192.16.0.25
From: Gary Grider <ggrider@lanl.gov>
Subject: Re: [pnfs-reqs] concall tomorrow
X-Yahoo-Group-Post: member; u=169341185
X-Yahoo-Profile: ggriderpnfs

At 04:10 PM 2/11/2004 -0800, Garth Gibson wrote:

> We'll hold a concall tomorrow 11am EST.
>
> Agenda items can include:
>
> - we need to convert the problem statement into a presentation (our
> target was Feb 19)
>
> - we need to identify who is giving the presentation at Seoul, if our
> topic is given time
>
> - should we give it at Connectathon next week, and if so, who will give
> it (who is going?)
>
> - its time to get back to the original roles of the mailing lists:
>       - a draft of a requirements doc
>       - a draft of the operations we suggest for NFSv4 extension
>       - a draft of the wire format of layout metadata for SBC (FC/SCSI)
> backends
>       - a draft of the wire format of layout metadata for OSD backends
>       - a draft of the wire format of layout metadata for NFS backends


Ok, I have a big problem with the last three statements.  I know I have not been following
this stuff as closely as I should have, but why cant the maps be sent to from the server
in an agnostic way?  Why does the NFS server have to understand any of what is in the
map?  Why isnt there a plug in on the server side to get the map info for the server to
send to the client, and why isnt there a plug in on the client side to pass the map down to?

What if the world decides to add another way to do I/O besides Block, Object, and NFS?
What happens if the Object model evolves, does NFS have to change to stay in
sync with it?   Shouldnt all this stuff be as opaque as it can be?  You do need locks
for making sure the map doesnt change. 

I am confused why we gave up on trying to make this agnostic. 
Sorry for the question, but I have this built in alarm, that goes off when I see
any numbers besides 0,1, and N  (not 3).

Thanks
Gary


> - get some milestones and dates beside the above
>
> garth
>
>
> Yahoo! Groups Links
>
>     * To visit your group on the web, go to:
>     * http://groups.yahoo.com/group/pnfs-reqs/
>     *  
>     * To unsubscribe from this group, send an email to:
>     * pnfs-reqs-unsubscribe@yahoogroups.com
>     *  
>     * Your use of Yahoo! Groups is subject to the Yahoo! Terms of Service. 



From garth@panasas.com Thu Feb 12 16:17:25 2004
Return-Path: <garth@panasas.com>
X-Sender: garth@panasas.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 12811 invoked from network); 13 Feb 2004 00:17:18 -0000
Received: from unknown (66.218.66.172)
by m13.grp.scd.yahoo.com with QMQP; 13 Feb 2004 00:17:18 -0000
Received: from unknown (HELO PIKES.panasas.com) (65.194.124.178)
by mta4.grp.scd.yahoo.com with SMTP; 13 Feb 2004 00:17:18 -0000
Received: from [127.0.0.1] ([172.17.19.3]) by PIKES.panasas.com with SMTP (Microsoft Exchange Internet Mail Service Version 5.5.2653.13)
id SVSY3RG1; Thu, 12 Feb 2004 19:17:16 -0500
Mime-Version: 1.0 (Apple Message framework v612)
In-Reply-To: <5.2.0.9.2.20040212162218.0160a6c8@cic-mail.lanl.gov>
References: <5.2.0.9.2.20040212162218.0160a6c8@cic-mail.lanl.gov>
Content-Type: text/plain; charset=ISO-8859-1; format=flowed
Message-Id: <F5E5F812-5DB9-11D8-B5DB-000A95A94F04@panasas.com>
Content-Transfer-Encoding: quoted-printable
Date: Thu, 12 Feb 2004 16:17:08 -0800
To: pnfs-reqs@yahoogroups.com
X-Mailer: Apple Mail (2.612)
X-eGroups-Remote-IP: 65.194.124.178
From: Garth Gibson <garth@panasas.com>
Subject: Re: [pnfs-reqs] concall tomorrow
X-Yahoo-Group-Post: member; u=169457820
X-Yahoo-Profile: garth_a_gibson

On Feb 12, 2004, at 3:31 PM, Gary Grider wrote:
> At 04:10 PM 2/11/2004 -0800, Garth Gibson wrote:
>> - its time to get back to the original roles of the mailing lists:
>>       - a draft of a requirements doc
>>       - a draft of the operations we suggest for NFSv4 extension
>>       - a draft of the wire format of layout metadata for SBC
>> (FC/SCSI) backends
>>       - a draft of the wire format of layout metadata for OSD
>> backends
>>       - a draft of the wire format of layout metadata for NFS
>> backends
>
> Ok, I have a big problem with the last three statements.  I know I
> have not been following
> this stuff as closely as I should have, but why cant the maps be sent
> to from the server
> in an agnostic way?  Why does the NFS server have to understand any
> of what is in the
> map?  Why isnt there a plug in on the server side to get the map info
> for the server to
> send to the client, and why isnt there a plug in on the client side
> to pass the map down to?

The NFS server does not have to understand what is in the map, other
than enough to know what file the layout delegation and map pertain to
for recalling that delegation.

However, for any chance of interoperability, the map formats must be
documented. And since each backend server flavor has addressing
characteristics that will be visible in the map, the documented map
formats will be specific to the backend flavors.

The theory we started with was that the base NFSv4 extensions would
describe opaque "maps". And that we would propose separate internet
drafts for a wire format for each flavor of map. Hence the NFSv4
extensions are backend protocol agnostic, yet the client
implementations can be interoperable.

> What if the world decides to add another way to do I/O besides Block,
> Object, and NFS?
> What happens if the Object model evolves, does NFS have to change to
> stay in
> sync with it?   Shouldnt all this stuff be as opaque as it can be? 
> You do need locks
> for making sure the map doesnt change. 

A new backend protocol would cause a new map flavor, and a new document
for that map flavor's wire format.

The challenge for us is to make the typing and sizing of the opaque map
flexible enough to allow said extension. The fall back, if the new
backend is very much different from the scope of SBC, OSD and NFS,
would be do further extend NFSv4. I hope we do not have to do that.

> I am confused why we gave up on trying to make this agnostic. 
> Sorry for the question, but I have this built in alarm, that goes off
> when I see
> any numbers besides 0,1, and N  (not 3).

This is intended to be a structure for 1, 2 and 3, with an inductive
step enabling any positive whole number to be induced :-)

> Thanks
> Gary

garth

From ggrider@lanl.gov Thu Feb 12 16:44:30 2004
Return-Path: <ggrider@lanl.gov>
X-Sender: ggrider@lanl.gov
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 3162 invoked from network); 13 Feb 2004 00:44:27 -0000
Received: from unknown (66.218.66.167)
by m5.grp.scd.yahoo.com with QMQP; 13 Feb 2004 00:44:27 -0000
Received: from unknown (HELO mailwasher-b.lanl.gov) (192.16.0.25)
by mta6.grp.scd.yahoo.com with SMTP; 13 Feb 2004 00:44:26 -0000
Received: from mailrelay2.lanl.gov (localhost.localdomain [127.0.0.1])
by mailwasher-b.lanl.gov (8.12.10/8.12.10/(ccn-5)) with ESMTP id i1D0iQHR004985
for <pnfs-reqs@yahoogroups.com>; Thu, 12 Feb 2004 17:44:26 -0700
Received: from cic-mail.lanl.gov (localhost.localdomain [127.0.0.1])
by mailrelay2.lanl.gov (8.12.10/8.12.10/(ccn-5)) with ESMTP id i1D0iPq1026837
for <pnfs-reqs@yahoogroups.com>; Thu, 12 Feb 2004 17:44:25 -0700
Received: from cthulu.lanl.gov (vpn-client-224.lanl.gov [128.165.253.224])
by cic-mail.lanl.gov (8.12.10/8.12.10/(ccn-5)) with ESMTP id i1D0iNYi010035
for <pnfs-reqs@yahoogroups.com>; Thu, 12 Feb 2004 17:44:24 -0700
Message-Id: <5.2.0.9.2.20040212173055.015c5670@cic-mail.lanl.gov>
X-Sender: ggrider@cic-mail.lanl.gov
X-Mailer: QUALCOMM Windows Eudora Version 5.2.0.9
Date: Thu, 12 Feb 2004 17:44:21 -0700
To: pnfs-reqs@yahoogroups.com
In-Reply-To: <F5E5F812-5DB9-11D8-B5DB-000A95A94F04@panasas.com>
References: <5.2.0.9.2.20040212162218.0160a6c8@cic-mail.lanl.gov>
<5.2.0.9.2.20040212162218.0160a6c8@cic-mail.lanl.gov>
Mime-Version: 1.0
Content-Type: multipart/alternative;
boundary="=====================_31290192==.ALT"
X-Scanned-By: MIMEDefang 2.35
X-eGroups-Remote-IP: 192.16.0.25
From: Gary Grider <ggrider@lanl.gov>
Subject: Re: [pnfs-reqs] concall tomorrow
X-Yahoo-Group-Post: member; u=169341185
X-Yahoo-Profile: ggriderpnfs

At 04:17 PM 2/12/2004 -0800, Garth Gibson wrote:

> On Feb 12, 2004, at 3:31 PM, Gary Grider wrote:
> >  At 04:10 PM 2/11/2004 -0800, Garth Gibson wrote:
> >>  - its time to get back to the original roles of the mailing lists:
> >>        - a draft of a requirements doc
> >>        - a draft of the operations we suggest for NFSv4 extension
> >>        - a draft of the wire format of layout metadata for SBC
> >> (FC/SCSI) backends
> >>        - a draft of the wire format of layout metadata for OSD
> >> backends
> >>        - a draft of the wire format of layout metadata for NFS
> >> backends
> >
> >  Ok, I have a big problem with the last three statements.  I know I
> > have not been following
> >  this stuff as closely as I should have, but why cant the maps be sent
> > to from the server
> >  in an agnostic way?  Why does the NFS server have to understand any
> > of what is in the
> >  map?  Why isnt there a plug in on the server side to get the map info
> > for the server to
> >  send to the client, and why isnt there a plug in on the client side
> > to pass the map down to?
>
> The NFS server does not have to understand what is in the map, other
> than enough to know what file the layout delegation and map pertain to
> for recalling that delegation.
>
> However, for any chance of interoperability, the map formats must be
> documented.


agree, but does IETF care about this, its just information that NFS can pass on,
seems like if a consortia of T10 folks want a common set of plugins, thats great,
if SBC folks want a common plug in, thats great, but why should NFS/IETF care
about this?

>   And since each backend server flavor has addressing
> characteristics that will be visible in the map, the documented map
> formats will be specific to the backend flavors.
>
> The theory we started with was that the base NFSv4 extensions would
> describe opaque "maps".  And that we would propose separate internet
> drafts for a wire format for each flavor of map. 


Why do we need a different wire format?  Isnt it just a blob of data that is passed
through some normal NFS protocol mechanism?  Why does it need a wire format?
I agree it needs a published format of the stream of data.  Am I reading more into
"wire" than I should.  If we do a separate IETF process for each new format, wont it
be hard to keep up.  I agree we need to have a "type" of back end and maybe a version
or something, but what is in the map could be of no concern to the IETF, couldnt it?

>  Hence the NFSv4
> extensions are backend protocol agnostic, yet the client
> implementations can be interoperable.
>
> >  What if the world decides to add another way to do I/O besides Block,
> > Object, and NFS?
> >  What happens if the Object model evolves, does NFS have to change to
> > stay in
> >  sync with it?   Shouldnt all this stuff be as opaque as it can be? 
> > You do need locks
> >  for making sure the map doesnt change.
>
> A new backend protocol would cause a new map flavor, and a new document
> for that map flavor's wire format.


Do we want to get an IETF action every time we need to add a new flavor?
How easy is this process going to be?  How easy is it going to be to change.
Sounds like it will be very much harder than just changing out your plugins.
Are we linking several different communities to work in lock
step?  Is that good or bad?


Thanks
Gary


> The challenge for us is to make the typing and sizing of the opaque map
> flexible enough to allow said extension.  The fall back, if the new
> backend is very much different from the scope of SBC, OSD and NFS,
> would be do further extend NFSv4.  I hope we do not have to do that.
>
> >  I am confused why we gave up on trying to make this agnostic.
> >  Sorry for the question, but I have this built in alarm, that goes off
> > when I see
> >  any numbers besides 0,1, and N  (not 3).
>
> This is intended to be a structure for 1, 2 and 3, with an inductive
> step enabling any positive whole number to be induced :-)
>
> >  Thanks
> >  Gary
>
> garth
>
> Yahoo! Groups Links
>
>     * To visit your group on the web, go to:
>     * http://groups.yahoo.com/group/pnfs-reqs/
>     *  
>     * To unsubscribe from this group, send an email to:
>     * pnfs-reqs-unsubscribe@yahoogroups.com
>     *  
>     * Your use of Yahoo! Groups is subject to the Yahoo! Terms of Service. 

From bhalevy@panasas.com Thu Feb 12 16:52:15 2004
Return-Path: <bhalevy@panasas.com>
X-Sender: bhalevy@panasas.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 46343 invoked from network); 13 Feb 2004 00:52:15 -0000
Received: from unknown (66.218.66.216)
by m14.grp.scd.yahoo.com with QMQP; 13 Feb 2004 00:52:15 -0000
Received: from unknown (HELO PIKES.panasas.com) (65.194.124.178)
by mta1.grp.scd.yahoo.com with SMTP; 13 Feb 2004 00:52:14 -0000
Received: by PIKES.panasas.com with Internet Mail Service (5.5.2653.19)
id <SVSY3RK1>; Thu, 12 Feb 2004 19:52:13 -0500
Message-ID: <30489F1321F5C343ACF6872B2CF7942A05D38874@PIKES.panasas.com>
To: "'pnfs-reqs@yahoogroups.com'" <pnfs-reqs@yahoogroups.com>
Date: Thu, 12 Feb 2004 19:52:12 -0500
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2653.19)
Content-Type: text/plain;
charset="iso-8859-1"
Content-Transfer-Encoding: quoted-printable
X-eGroups-Remote-IP: 65.194.124.178
From: "Halevy, Benny" <bhalevy@panasas.com>
Subject: RE: [pnfs-reqs] concall tomorrow
X-Yahoo-Group-Post: member; u=169276676
X-Yahoo-Profile: benny_halevy

ADVERTISEMENT
I agree with everything Garth says below. Just as an example
of how an extensible mechanism was spec'ed before in the NFS
community you can take the RPC protocol security flavors.

RPC (rfc1831) specifies how RPC security flavors are encoded
in the RPC protocol. It specifies the basic flavors and the
wire format of the auth_flavor "frame" which contains a type
and an opaque_body.

It says:
enum auth_flavor {
AUTH_NONE = 0,
AUTH_SYS = 1,
AUTH_SHORT = 2
/* and more to be defined */
};

struct opaque_auth {
auth_flavor flavor;
opaque body<400>;
};
...
The interpretation and semantics of the data contained within the
authentication fields is specified by individual, independent
authentication protocol specifications. (Section 9 defines the
various authentication protocols.)
...
9. AUTHENTICATION PROTOCOLS

As previously stated, authentication parameters are opaque, but
open-ended to the rest of the RPC protocol. This section defines two
standard "flavors" of authentication. Implementors are free to
invent new authentication types, with the same rules of flavor number
assignment as there is for program number assignment. The "flavor"
of a credential or verifier refers to the value of the "flavor" field
in the opaque_auth structure. Flavor numbers, like RPC program
numbers, are also administered centrally, and developers may assign
new flavor numbers by applying through electronic mail to
"rpc@sun.com". Credentials and verifiers are represented as variable
length opaque data (the "body" field in the opaque_auth structure).

In this document, two flavors of authentication are described. Of
these, Null authentication (described in the next subsection) is
mandatory - it must be available in all implementations. System
authentication is described in Appendix A.


And it then defines the NULL auth flavor and later in appendix the
SYS auth.

Later, more auth flavors were added, and lately NFSv4 mandated a
new auth flavor, GSS-API, that is defined in a separate RFC (rfc2743)
which is referred to by the NFSv4 RFC.

The transport protocols, RPC and NFS, do not define the wire format
of all security flavors but provide enough metadata in the spec for
clients and servers to interoperate.

Benny

>-----Original Message-----
>From: Garth Gibson [mailto:garth@Panasas.Com]
>Sent: Thursday, February 12, 2004 7:17 PM
>To: pnfs-reqs@yahoogroups.com
>Subject: Re: [pnfs-reqs] concall tomorrow
>
>
>On Feb 12, 2004, at 3:31 PM, Gary Grider wrote:
>> At 04:10 PM 2/11/2004 -0800, Garth Gibson wrote:
>>> - its time to get back to the original roles of the mailing lists:
>>>       - a draft of a requirements doc
>>>       - a draft of the operations we suggest for NFSv4 extension
>>>       - a draft of the wire format of layout metadata for SBC
>>> (FC/SCSI) backends
>>>       - a draft of the wire format of layout metadata for OSD
>>> backends
>>>       - a draft of the wire format of layout metadata for NFS
>>> backends
>>
>> Ok, I have a big problem with the last three statements.  I know I
>> have not been following
>> this stuff as closely as I should have, but why cant the
>maps be sent
>> to from the server
>> in an agnostic way?  Why does the NFS server have to understand any
>> of what is in the
>> map?  Why isnt there a plug in on the server side to get
>the map info
>> for the server to
>> send to the client, and why isnt there a plug in on the client side
>> to pass the map down to?
>
>The NFS server does not have to understand what is in the map, other
>than enough to know what file the layout delegation and map pertain to
>for recalling that delegation.
>
>However, for any chance of interoperability, the map formats must be
>documented. And since each backend server flavor has addressing
>characteristics that will be visible in the map, the documented map
>formats will be specific to the backend flavors.
>
>The theory we started with was that the base NFSv4 extensions would
>describe opaque "maps". And that we would propose separate internet
>drafts for a wire format for each flavor of map. Hence the NFSv4
>extensions are backend protocol agnostic, yet the client
>implementations can be interoperable.
>
>> What if the world decides to add another way to do I/O
>besides Block,
>> Object, and NFS?
>> What happens if the Object model evolves, does NFS have to
>change to
>> stay in
>> sync with it?   Shouldnt all this stuff be as opaque as it can be? 
>> You do need locks
>> for making sure the map doesnt change. 
>
>A new backend protocol would cause a new map flavor, and a new
>document
>for that map flavor's wire format.
>
>The challenge for us is to make the typing and sizing of the
>opaque map
>flexible enough to allow said extension. The fall back, if the new
>backend is very much different from the scope of SBC, OSD and NFS,
>would be do further extend NFSv4. I hope we do not have to do that.
>
>> I am confused why we gave up on trying to make this agnostic. 
>> Sorry for the question, but I have this built in alarm,
>that goes off
>> when I see
>> any numbers besides 0,1, and N  (not 3).
>
>This is intended to be a structure for 1, 2 and 3, with an inductive
>step enabling any positive whole number to be induced :-)
>
>> Thanks
>> Gary
>
>garth
>
>
>
>Yahoo! Groups Links
>
>
>
>
>

From bhalevy@panasas.com Thu Feb 12 17:03:27 2004
Return-Path: <bhalevy@panasas.com>
X-Sender: bhalevy@panasas.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 61264 invoked from network); 13 Feb 2004 01:03:25 -0000
Received: from unknown (66.218.66.167)
by m3.grp.scd.yahoo.com with QMQP; 13 Feb 2004 01:03:25 -0000
Received: from unknown (HELO PIKES.panasas.com) (65.194.124.178)
by mta6.grp.scd.yahoo.com with SMTP; 13 Feb 2004 01:03:25 -0000
Received: by PIKES.panasas.com with Internet Mail Service (5.5.2653.19)
id <SVSY3RLA>; Thu, 12 Feb 2004 20:03:24 -0500
Message-ID: <30489F1321F5C343ACF6872B2CF7942A05D38875@PIKES.panasas.com>
To: "'pnfs-reqs@yahoogroups.com'" <pnfs-reqs@yahoogroups.com>
Date: Thu, 12 Feb 2004 20:03:23 -0500
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2653.19)
Content-Type: multipart/alternative;
boundary="----_=_NextPart_001_01C3F1CD.2DA4CA70"
X-eGroups-Remote-IP: 65.194.124.178
From: "Halevy, Benny" <bhalevy@panasas.com>
Subject: RE: [pnfs-reqs] concall tomorrow
X-Yahoo-Group-Post: member; u=169276676
X-Yahoo-Profile: benny_halevy

ADVERTISEMENT
>  If we do a separate IETF process for each new format, wont it
> be hard to keep up. 
 
I don't see that as necessarily a bad thing...
 
> I agree we need to have a "type" of back end and maybe a version
> or something, but what is in the map could be of no concern to the IETF, couldnt it?
The IETF is concerned about interoperability.  I believe we should be concerned with standardizing
the wire format of the layout maps.  Do we *have* to do that within the IETF?  There
could be other options but since you have to refer to some external standard to talk about
interoperability it'll be difficult to refer to non-existing standards that are being standardized
outside of the IETF.
 
T10-OSD is one such external standard that substantial enough so it can be referred to.
 
> Do we want to get an IETF action every time we need to add a new flavor?
 
Probably, in order to extend the vector of available flavors.  This could be done within
NFSv4's minor versioning model.

> How easy is this process going to be?  How easy is it going to be to change.
> Sounds like it will be very much harder than just changing out your plugins.
 
Again, there's a trade-off between versatility and interoperability.

> Are we linking several different communities to work in lock
> step?  Is that good or bad?
 
The hierarchical standard model is intended to allow each flavor to make progress at its own pace.

Benny

    -----Original Message-----
    From: Gary Grider [mailto:ggrider@lanl.gov]
    Sent: Thursday, February 12, 2004 7:44 PM
    To: pnfs-reqs@yahoogroups.com
    Subject: Re: [pnfs-reqs] concall tomorrow

    At 04:17 PM 2/12/2004 -0800, Garth Gibson wrote:

>     On Feb 12, 2004, at 3:31 PM, Gary Grider wrote:
>     >  At 04:10 PM 2/11/2004 -0800, Garth Gibson wrote:
>     >>  - its time to get back to the original roles of the mailing lists:
>     >>        - a draft of a requirements doc
>     >>        - a draft of the operations we suggest for NFSv4 extension
>     >>        - a draft of the wire format of layout metadata for SBC
>     >> (FC/SCSI) backends
>     >>        - a draft of the wire format of layout metadata for OSD
>     >> backends
>     >>        - a draft of the wire format of layout metadata for NFS
>     >> backends
>     >
>     >  Ok, I have a big problem with the last three statements.  I know I
>     > have not been following
>     >  this stuff as closely as I should have, but why cant the maps be sent
>     > to from the server
>     >  in an agnostic way?  Why does the NFS server have to understand any
>     > of what is in the
>     >  map?  Why isnt there a plug in on the server side to get the map info
>     > for the server to
>     >  send to the client, and why isnt there a plug in on the client side
>     > to pass the map down to?
>
>     The NFS server does not have to understand what is in the map, other
>     than enough to know what file the layout delegation and map pertain to
>     for recalling that delegation.
>
>     However, for any chance of interoperability, the map formats must be
>     documented.


    agree, but does IETF care about this, its just information that NFS can pass on,
    seems like if a consortia of T10 folks want a common set of plugins, thats great,
    if SBC folks want a common plug in, thats great, but why should NFS/IETF care
    about this?

>       And since each backend server flavor has addressing
>     characteristics that will be visible in the map, the documented map
>     formats will be specific to the backend flavors.
>
>     The theory we started with was that the base NFSv4 extensions would
>     describe opaque "maps".  And that we would propose separate internet
>     drafts for a wire format for each flavor of map. 


    Why do we need a different wire format?  Isnt it just a blob of data that is passed
    through some normal NFS protocol mechanism?  Why does it need a wire format?
    I agree it needs a published format of the stream of data.  Am I reading more into
    "wire" than I should.  If we do a separate IETF process for each new format, wont it
    be hard to keep up.  I agree we need to have a "type" of back end and maybe a version
    or something, but what is in the map could be of no concern to the IETF, couldnt it?

>      Hence the NFSv4
>     extensions are backend protocol agnostic, yet the client
>     implementations can be interoperable.
>
>     >  What if the world decides to add another way to do I/O besides Block,
>     > Object, and NFS?
>     >  What happens if the Object model evolves, does NFS have to change to
>     > stay in
>     >  sync with it?   Shouldnt all this stuff be as opaque as it can be? 
>     > You do need locks
>     >  for making sure the map doesnt change.
>
>     A new backend protocol would cause a new map flavor, and a new document
>     for that map flavor's wire format.


    Do we want to get an IETF action every time we need to add a new flavor?
    How easy is this process going to be?  How easy is it going to be to change.
    Sounds like it will be very much harder than just changing out your plugins.
    Are we linking several different communities to work in lock
    step?  Is that good or bad?


    Thanks
    Gary


>     The challenge for us is to make the typing and sizing of the opaque map
>     flexible enough to allow said extension.  The fall back, if the new
>     backend is very much different from the scope of SBC, OSD and NFS,
>     would be do further extend NFSv4.  I hope we do not have to do that.
>
>     >  I am confused why we gave up on trying to make this agnostic.
>     >  Sorry for the question, but I have this built in alarm, that goes off
>     > when I see
>     >  any numbers besides 0,1, and N  (not 3).
>
>     This is intended to be a structure for 1, 2 and 3, with an inductive
>     step enabling any positive whole number to be induced :-)
>
>     >  Thanks
>     >  Gary
>
>     garth
>
>     Yahoo! Groups Links
>
>         * To visit your group on the web, go to:
>         * http://groups.yahoo.com/group/pnfs-reqs/
>        *
>         * To unsubscribe from this group, send an email to:
>         * pnfs-reqs-unsubscribe@yahoogroups.com
>        *
>         * Your use of Yahoo! Groups is subject to the Yahoo! Terms of Service. 

From ggrider@lanl.gov Thu Feb 12 17:19:21 2004
Return-Path: <ggrider@lanl.gov>
X-Sender: ggrider@lanl.gov
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 24129 invoked from network); 13 Feb 2004 01:19:20 -0000
Received: from unknown (66.218.66.166)
by m7.grp.scd.yahoo.com with QMQP; 13 Feb 2004 01:19:20 -0000
Received: from unknown (HELO mailwasher-b.lanl.gov) (192.16.0.25)
by mta5.grp.scd.yahoo.com with SMTP; 13 Feb 2004 01:19:19 -0000
Received: from mailrelay3.lanl.gov (localhost.localdomain [127.0.0.1])
by mailwasher-b.lanl.gov (8.12.10/8.12.10/(ccn-5)) with ESMTP id i1D1JJHR007154
for <pnfs-reqs@yahoogroups.com>; Thu, 12 Feb 2004 18:19:19 -0700
Received: from cic-mail.lanl.gov (localhost.localdomain [127.0.0.1])
by mailrelay3.lanl.gov (8.12.10/8.12.10/(ccn-5)) with ESMTP id i1D1JIeI007886
for <pnfs-reqs@yahoogroups.com>; Thu, 12 Feb 2004 18:19:18 -0700
Received: from cthulu.lanl.gov (vpn-client-224.lanl.gov [128.165.253.224])
by cic-mail.lanl.gov (8.12.10/8.12.10/(ccn-5)) with ESMTP id i1D1JFYi012624
for <pnfs-reqs@yahoogroups.com>; Thu, 12 Feb 2004 18:19:16 -0700
Message-Id: <5.2.0.9.2.20040212181154.01621d60@cic-mail.lanl.gov>
X-Sender: ggrider@cic-mail.lanl.gov
X-Mailer: QUALCOMM Windows Eudora Version 5.2.0.9
Date: Thu, 12 Feb 2004 18:19:15 -0700
To: pnfs-reqs@yahoogroups.com
In-Reply-To: <30489F1321F5C343ACF6872B2CF7942A05D38875@PIKES.panasas.com
>
Mime-Version: 1.0
Content-Type: multipart/related;
type="multipart/alternative";
boundary="=====================_33382571==.REL"
X-Scanned-By: MIMEDefang 2.35
X-eGroups-Remote-IP: 192.16.0.25
From: Gary Grider <ggrider@lanl.gov>
Subject: RE: [pnfs-reqs] concall tomorrow
X-Yahoo-Group-Post: member; u=169341185
X-Yahoo-Profile: ggriderpnfs

So I think having a header that has the info that is needed for interop is fine and
having a payload that is opaque is fine. 

So explain why we need three different "wire formats"   for these three
different methods?  Why arent these things just 3 different types within the
same wire format?  Or is that what is envisioned, and I am just hung up on the
terminology, and what you are saying is that we need to
1) define the header for interop
    type, version, length perhaps etc.
and
2) define
a) opaque info for SBC type = 1 version = 1
b) opaque info for Object type = 2 version = 1
c) opaque info for NFS back ends type = 3 version = 1

and if someone wants to write an IETF request to have type = 4 added
they can write a draft and get it approved and document the opaque info format?

Thanks
Gary

At 08:03 PM 2/12/2004 -0500, Halevy, Benny wrote:

> >  If we do a separate IETF process for each new format, wont it
> > be hard to keep up. 
>  
> I don't see that as necessarily a bad thing...
>  
> > I agree we need to have a "type" of back end and maybe a version
> > or something, but what is in the map could be of no concern to the IETF, couldnt it?
> The IETF is concerned about interoperability.  I believe we should be concerned with standardizing
> the wire format of the layout maps.  Do we *have* to do that within the IETF?  There
> could be other options but since you have to refer to some external standard to talk about
> interoperability it'll be difficult to refer to non-existing standards that are being standardized
> outside of the IETF.
>  
> T10-OSD is one such external standard that substantial enough so it can be referred to.
>  
> > Do we want to get an IETF action every time we need to add a new flavor?
>  
> Probably, in order to extend the vector of available flavors.  This could be done within
> NFSv4's minor versioning model.
>
> > How easy is this process going to be?  How easy is it going to be to change.
> > Sounds like it will be very much harder than just changing out your plugins.
>  
> Again, there's a trade-off between versatility and interoperability.
>
> > Are we linking several different communities to work in lock
> > step?  Is that good or bad?
>  
> The hierarchical standard model is intended to allow each flavor to make progress at its own pace.
>
> Benny
>
>     -----Original Message-----
>     From: Gary Grider [mailto:ggrider@lanl.gov]
>     Sent: Thursday, February 12, 2004 7:44 PM
>     To: pnfs-reqs@yahoogroups.com
>     Subject: Re: [pnfs-reqs] concall tomorrow
>
>     At 04:17 PM 2/12/2004 -0800, Garth Gibson wrote:
>
>>         On Feb 12, 2004, at 3:31 PM, Gary Grider wrote:
>>         >  At 04:10 PM 2/11/2004 -0800, Garth Gibson wrote:
>>         >>  - its time to get back to the original roles of the mailing lists:
>>         >>        - a draft of a requirements doc
>>         >>        - a draft of the operations we suggest for NFSv4 extension
>>         >>        - a draft of the wire format of layout metadata for SBC
>>         >> (FC/SCSI) backends
>>         >>        - a draft of the wire format of layout metadata for OSD
>>         >> backends
>>         >>        - a draft of the wire format of layout metadata for NFS
>>         >> backends
>>         >
>>         >  Ok, I have a big problem with the last three statements.  I know I
>>         > have not been following
>>         >  this stuff as closely as I should have, but why cant the maps be sent
>>         > to from the server
>>         >  in an agnostic way?  Why does the NFS server have to understand any
>>         > of what is in the
>>         >  map?  Why isnt there a plug in on the server side to get the map info
>>         > for the server to
>>         >  send to the client, and why isnt there a plug in on the client side
>>         > to pass the map down to?
>>
>>         The NFS server does not have to understand what is in the map, other
>>         than enough to know what file the layout delegation and map pertain to
>>         for recalling that delegation.
>>
>>         However, for any chance of interoperability, the map formats must be
>>         documented.
>
>
>     agree, but does IETF care about this, its just information that NFS can pass on,
>     seems like if a consortia of T10 folks want a common set of plugins, thats great,
>     if SBC folks want a common plug in, thats great, but why should NFS/IETF care
>     about this?
>
>>           And since each backend server flavor has addressing
>>         characteristics that will be visible in the map, the documented map
>>         formats will be specific to the backend flavors.
>>
>>         The theory we started with was that the base NFSv4 extensions would
>>         describe opaque "maps".  And that we would propose separate internet
>>         drafts for a wire format for each flavor of map. 
>
>
>     Why do we need a different wire format?  Isnt it just a blob of data that is passed
>     through some normal NFS protocol mechanism?  Why does it need a wire format?
>     I agree it needs a published format of the stream of data.  Am I reading more into
>     "wire" than I should.  If we do a separate IETF process for each new format, wont it
>     be hard to keep up.  I agree we need to have a "type" of back end and maybe a version
>     or something, but what is in the map could be of no concern to the IETF, couldnt it?
>
>>          Hence the NFSv4
>>         extensions are backend protocol agnostic, yet the client
>>         implementations can be interoperable.
>>
>>         >  What if the world decides to add another way to do I/O besides Block,
>>         > Object, and NFS?
>>         >  What happens if the Object model evolves, does NFS have to change to
>>         > stay in
>>         >  sync with it?   Shouldnt all this stuff be as opaque as it can be? 
>>         > You do need locks
>>         >  for making sure the map doesnt change.
>>
>>         A new backend protocol would cause a new map flavor, and a new document
>>         for that map flavor's wire format.
>
>
>     Do we want to get an IETF action every time we need to add a new flavor?
>     How easy is this process going to be?  How easy is it going to be to change.
>     Sounds like it will be very much harder than just changing out your plugins.
>     Are we linking several different communities to work in lock
>     step?  Is that good or bad?
>
>
>     Thanks
>     Gary
>
>
>>         The challenge for us is to make the typing and sizing of the opaque map
>>         flexible enough to allow said extension.  The fall back, if the new
>>         backend is very much different from the scope of SBC, OSD and NFS,
>>         would be do further extend NFSv4.  I hope we do not have to do that.
>>
>>         >  I am confused why we gave up on trying to make this agnostic.
>>         >  Sorry for the question, but I have this built in alarm, that goes off
>>         > when I see
>>         >  any numbers besides 0,1, and N  (not 3).
>>
>>         This is intended to be a structure for 1, 2 and 3, with an inductive
>>         step enabling any positive whole number to be induced :-)
>>
>>         >  Thanks
>>         >  Gary
>>
>>         garth
>>
>>         Yahoo! Groups Links 
>
>     * To visit your group on the web, go to:
>     * http://groups.yahoo.com/group/pnfs-reqs/
>     * To unsubscribe from this group, send an email to:
>     * pnfs-reqs-unsubscribe@yahoogroups.com
>     * Your use of Yahoo! Groups is subject to the Yahoo! Terms of Service. 


Yahoo! Groups Sponsor
ADVERTISEMENT
1fd596e.jpgClick Here
1fd5ab9.jpg

Yahoo! Groups Links

    * To visit your group on the web, go to:
    * http://groups.yahoo.com/group/pnfs-reqs/
    *  
    * To unsubscribe from this group, send an email to:
    * pnfs-reqs-unsubscribe@yahoogroups.com
    *  
    * Your use of Yahoo! Groups is subject to the Yahoo! Terms of Service. 

From Thomas.Talpey@netapp.com Thu Feb 12 17:22:21 2004
Return-Path: <Thomas.Talpey@netapp.com>
X-Sender: Thomas.Talpey@netapp.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 8672 invoked from network); 13 Feb 2004 01:22:21 -0000
Received: from unknown (66.218.66.218)
by m11.grp.scd.yahoo.com with QMQP; 13 Feb 2004 01:22:21 -0000
Received: from unknown (HELO mx01.netapp.com) (198.95.226.53)
by mta3.grp.scd.yahoo.com with SMTP; 13 Feb 2004 01:22:21 -0000
Received: from frejya.corp.netapp.com (frejya [10.57.157.119])
by mx01.netapp.com (8.12.10/8.12.10/NTAP-1.4) with ESMTP id i1D1MKJC029608
for <pnfs-reqs@yahoogroups.com>; Thu, 12 Feb 2004 17:22:20 -0800 (PST)
Received: from svlexc01.hq.netapp.com (svlexc01.corp.netapp.com [10.10.22.171])
by frejya.corp.netapp.com (8.12.9/8.12.9/NTAP-1.5) with ESMTP id i1D1MKiH015003
for <pnfs-reqs@yahoogroups.com>; Thu, 12 Feb 2004 17:22:20 -0800 (PST)
Received: from tmt.netapp.com ([10.97.6.30]) by silver.hq.netapp.com with Microsoft SMTPSVC(5.0.2195.5329); Thu, 12 Feb 2004 20:22:14 -0500
MIME-Version: 1.0
Content-Type: multipart/alternative;
boundary="----_=_NextPart_001_01C3F1CF.CF99E700"
X-MimeOLE: Produced By Microsoft Exchange V6.0.6249.0
content-class: urn:content-classes:message
Date: Thu, 12 Feb 2004 17:22:04 -0800
Message-ID: <5.2.1.1.2.20040212202018.01e2aea0@silver.nane.netapp.com>
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
Thread-Topic: [pnfs-reqs] concall tomorrow
Thread-Index: AcPxz9A7m6fy2BsXR2KPKZkslTsDvQ==
To: <pnfs-reqs@yahoogroups.com>
X-eGroups-Remote-IP: 198.95.226.53
From: "Talpey, Thomas" <Thomas.Talpey@netapp.com>
Subject: Re: [pnfs-reqs] concall tomorrow
X-Yahoo-Group-Post: member; u=44154239
X-Yahoo-Profile: tmtymailu

At 07:17 PM 2/12/2004, Garth Gibson wrote:
>The NFS server does not have to understand what is in the map, other
>than enough to know what file the layout delegation and map pertain to
>for recalling that delegation.

What map?

Did we go from problem statement to protocol definition overnight?

Confused,
Tom.


From ggrider@lanl.gov Thu Feb 12 20:44:07 2004
Return-Path: <ggrider@lanl.gov>
X-Sender: ggrider@lanl.gov
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 30018 invoked from network); 13 Feb 2004 04:44:06 -0000
Received: from unknown (66.218.66.217)
by m18.grp.scd.yahoo.com with QMQP; 13 Feb 2004 04:44:06 -0000
Received: from unknown (HELO mailwasher-b.lanl.gov) (192.16.0.25)
by mta2.grp.scd.yahoo.com with SMTP; 13 Feb 2004 04:44:05 -0000
Received: from mailrelay1.lanl.gov (localhost.localdomain [127.0.0.1])
by mailwasher-b.lanl.gov (8.12.10/8.12.10/(ccn-5)) with ESMTP id i1D4i5HR020583
for <pnfs-reqs@yahoogroups.com>; Thu, 12 Feb 2004 21:44:05 -0700
Received: from cic-mail.lanl.gov (localhost.localdomain [127.0.0.1])
by mailrelay1.lanl.gov (8.12.10/8.12.10/(ccn-5)) with ESMTP id i1D4i4rl018790
for <pnfs-reqs@yahoogroups.com>; Thu, 12 Feb 2004 21:44:04 -0700
Received: from cthulu.lanl.gov (vpn-client-160.lanl.gov [128.165.253.160])
by cic-mail.lanl.gov (8.12.10/8.12.10/(ccn-5)) with ESMTP id i1D4hvYk027660;
Thu, 12 Feb 2004 21:44:00 -0700
Message-Id: <5.2.0.9.2.20040212213826.038a2208@cic-mail.lanl.gov>
X-Sender: ggrider@cic-mail.lanl.gov
X-Mailer: QUALCOMM Windows Eudora Version 5.2.0.9
Date: Thu, 12 Feb 2004 21:41:14 -0700
To: pnfs-reqs@yahoogroups.com, <pnfs-reqs@yahoogroups.com>
In-Reply-To: <C8CF60CFC4D8A74E9945E32CF096548A011DE886@silver.nane.netap
p.com>
Mime-Version: 1.0
Content-Type: multipart/mixed;
boundary="=====================_45664431==_"
X-Scanned-By: MIMEDefang 2.35
X-eGroups-Remote-IP: 192.16.0.25
From: Gary Grider <ggrider@lanl.gov>
Subject: RE: Fwd: [pnfs-reqs] RE: NEPS-REQS: getting started
X-Yahoo-Group-Post: member; u=169341185
X-Yahoo-Profile: ggriderpnfs

I agree.

I added some parallel stuff.

Thanks
Gary

At 07:22 AM 2/12/2004 -0800, Corbett, Peter wrote:

> The slides mention server bypass twice, but don't talk about parallel
> access.  Our problem statement is much more focussed on parallel access.
> We are trying to define a standard for parallel access, whether that is
> to nfs based data servers, object servers, virtualized SAN or
> non-virtualized SAN devices.  So, I think we need to be clearer when
> talking about bypassing the server that we are really talking about
> direct access to a parallel data store from clustered clients, with a
> shared NFS server acting as a metadata server.  To me, that is the key
> point that applies across the entire solution space, whereas direct
> access to devices is a data access technique in part of the solution
> space.
>
> -----Original Message-----
> From: Gary Grider [mailto:ggrider@lanl.gov]
> Sent: Thursday, February 12, 2004 12:55 AM
> To: pnfs-reqs@yahoogroups.com
> Subject: Re: Fwd: [pnfs-reqs] RE: NEPS-REQS: getting started
>
>
>   elevator pitch
>
> Gary
>
>
> Yahoo! Groups Links
>
>
>
>
>
> Yahoo! Groups Links
>
>     * To visit your group on the web, go to:
>     * http://groups.yahoo.com/group/pnfs-reqs/
>     *  
>     * To unsubscribe from this group, send an email to:
>     * pnfs-reqs-unsubscribe@yahoogroups.com
>     *  
>     * Your use of Yahoo! Groups is subject to the Yahoo! Terms of Service. 



Attachment (not stored)
pNFS-elevator-pitch.ppt
Type: application/octet-stream

From bhalevy@panasas.com Thu Feb 12 21:41:40 2004
Return-Path: <bhalevy@panasas.com>
X-Sender: bhalevy@panasas.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 48383 invoked from network); 13 Feb 2004 05:41:39 -0000
Received: from unknown (66.218.66.218)
by m14.grp.scd.yahoo.com with QMQP; 13 Feb 2004 05:41:38 -0000
Received: from unknown (HELO PIKES.panasas.com) (65.194.124.178)
by mta3.grp.scd.yahoo.com with SMTP; 13 Feb 2004 05:41:38 -0000
Received: from yang ([172.17.19.58]) by PIKES.panasas.com with SMTP (Microsoft Exchange Internet Mail Service Version 5.5.2653.13)
id SVSY3R0N; Fri, 13 Feb 2004 00:41:35 -0500
To: <pnfs-reqs@yahoogroups.com>
Date: Fri, 13 Feb 2004 00:41:27 -0500
Message-ID: <LCEAJMHHKPKEPAIDBBEKIEAHCBAA.bhalevy@panasas.com>
MIME-Version: 1.0
Content-Type: text/plain;
charset="US-ASCII"
Content-Transfer-Encoding: 7bit
X-Priority: 3 (Normal)
X-MSMail-Priority: Normal
X-Mailer: Microsoft Outlook IMO, Build 9.0.6604 (9.0.2911.0)
In-reply-to: <5.2.0.9.2.20040212181154.01621d60@cic-mail.lanl.gov>
X-MimeOLE: Produced By Microsoft MimeOLE V6.00.2800.1165
Importance: Normal
X-eGroups-Remote-IP: 65.194.124.178
From: "Benny Halevy" <bhalevy@panasas.com>
Subject: RE: [pnfs-reqs] concall tomorrow
X-Yahoo-Group-Post: member; u=169276676
X-Yahoo-Profile: benny_halevy

> Or is that what is envisioned, and I am just hung up on the
> terminology, and what you are saying is that we need to
> 1) define the header for interop
> type, version, length perhaps etc.
> and
> 2) define
> a) opaque info for SBC type = 1 version = 1
> b) opaque info for Object type = 2 version = 1
> c) opaque info for NFS back ends type = 3 version = 1
>

If I understand what you think I think then yes, this is
pretty much they way I see it :)
"define opaque info" is a little ambiguous. I believe we should
define the wire format of each flavor to a level necessary
to demonstrate interoperability. Some encapsulated data structures
such as device addresses, file handles or capabilities
can be defined "by reference" to other standards but the overall
framing protocol must be well defined so that it is parseable
by any *independent* implementation of a NFSv4.p client.

The client shouldn't require any knowledge about the underlying server
file system internals and it shouldn't require any external configuration
to help it parse the opaque structures. It needs to be able to
understand whatever it needs to know about the flavor's "opaque info"
in order to use it from the standard's types and versions.

> and if someone wants to write an IETF request to have type = 4 added
> they can write a draft and get it approved and document the opaque info
format?

Yes and there should be a process in place to register new types
in a central repository (likely to be IANA, see below).

So far I think this discussion exposed a couple requirements:

1) Interoperability
To conform with rfc2026, the extensions we propose must provide enough
details to eventually satisfy "The requirement for at least two
independent and interoperable implementations".

2) Extensibility
The NFSv4 protocol extensions should allow to add new pNFS flavors.
Some (or all) flavors may have a requirement for extensibility
in their own wire format.

IANA considerations
pNFS flavors should be registered with IANA. Need to define the process
requirements and add to NFSv4 IANA requirements section.
For example, see rfc3530 section 17.2. ONC RPC Network Identifiers
(netids):
... the registration of new Network Identifiers will
require the publication of an Information RFC with similar detail as
listed above for the Network Identifier itself and corresponding
Universal Address.

Benny

-----Original Message-----
From: Gary Grider [mailto:ggrider@lanl.gov]
Sent: Thursday, February 12, 2004 20:19
To: pnfs-reqs@yahoogroups.com
Subject: RE: [pnfs-reqs] concall tomorrow



So I think having a header that has the info that is needed for interop is
fine and
having a payload that is opaque is fine.

So explain why we need three different "wire formats" for these three
different methods? Why arent these things just 3 different types within the
same wire format? Or is that what is envisioned, and I am just hung up on
the
terminology, and what you are saying is that we need to
1) define the header for interop
type, version, length perhaps etc.
and
2) define
a) opaque info for SBC type = 1 version = 1
b) opaque info for Object type = 2 version = 1
c) opaque info for NFS back ends type = 3 version = 1

and if someone wants to write an IETF request to have type = 4 added
they can write a draft and get it approved and document the opaque info
format?

Thanks
Gary

At 08:03 PM 2/12/2004 -0500, Halevy, Benny wrote:

> If we do a separate IETF process for each new format, wont it
> be hard to keep up.

I don't see that as necessarily a bad thing...

> I agree we need to have a "type" of back end and maybe a version
> or something, but what is in the map could be of no concern to the IETF,
couldnt it?
The IETF is concerned about interoperability. I believe we should be
concerned with standardizing
the wire format of the layout maps. Do we *have* to do that within the
IETF? There
could be other options but since you have to refer to some external standard
to talk about
interoperability it'll be difficult to refer to non-existing standards that
are being standardized
outside of the IETF.

T10-OSD is one such external standard that substantial enough so it can be
referred to.

> Do we want to get an IETF action every time we need to add a new flavor?

Probably, in order to extend the vector of available flavors. This could be
done within
NFSv4's minor versioning model.

> How easy is this process going to be? How easy is it going to be to
change.
> Sounds like it will be very much harder than just changing out your
plugins.

Again, there's a trade-off between versatility and interoperability.

> Are we linking several different communities to work in lock
> step? Is that good or bad?

The hierarchical standard model is intended to allow each flavor to make
progress at its own pace.

Benny

-----Original Message-----

From: Gary Grider [mailto:ggrider@lanl.gov]

Sent: Thursday, February 12, 2004 7:44 PM

To: pnfs-reqs@yahoogroups.com

Subject: Re: [pnfs-reqs] concall tomorrow


At 04:17 PM 2/12/2004 -0800, Garth Gibson wrote:

On Feb 12, 2004, at 3:31 PM, Gary Grider wrote:

> At 04:10 PM 2/11/2004 -0800, Garth Gibson wrote:

>> - its time to get back to the original roles of the mailing lists:

>> - a draft of a requirements doc

>> - a draft of the operations we suggest for NFSv4 extension

>> - a draft of the wire format of layout metadata for SBC

>> (FC/SCSI) backends

>> - a draft of the wire format of layout metadata for OSD

>> backends

>> - a draft of the wire format of layout metadata for NFS

>> backends

>

> Ok, I have a big problem with the last three statements. I know I

> have not been following

> this stuff as closely as I should have, but why cant the maps be sent

> to from the server

> in an agnostic way? Why does the NFS server have to understand any

> of what is in the

> map? Why isnt there a plug in on the server side to get the map info

> for the server to

> send to the client, and why isnt there a plug in on the client side

> to pass the map down to?


The NFS server does not have to understand what is in the map, other

than enough to know what file the layout delegation and map pertain to

for recalling that delegation.


However, for any chance of interoperability, the map formats must be

documented.


agree, but does IETF care about this, its just information that NFS can pass
on,

seems like if a consortia of T10 folks want a common set of plugins, thats
great,

if SBC folks want a common plug in, thats great, but why should NFS/IETF
care

about this?


And since each backend server flavor has addressing

characteristics that will be visible in the map, the documented map

formats will be specific to the backend flavors.


The theory we started with was that the base NFSv4 extensions would

describe opaque "maps". And that we would propose separate internet

drafts for a wire format for each flavor of map.


Why do we need a different wire format? Isnt it just a blob of data that is
passed

through some normal NFS protocol mechanism? Why does it need a wire format?

I agree it needs a published format of the stream of data. Am I reading
more into

"wire" than I should. If we do a separate IETF process for each new format,
wont it

be hard to keep up. I agree we need to have a "type" of back end and maybe
a version

or something, but what is in the map could be of no concern to the IETF,
couldnt it?


Hence the NFSv4

extensions are backend protocol agnostic, yet the client

implementations can be interoperable.


> What if the world decides to add another way to do I/O besides Block,

> Object, and NFS?

> What happens if the Object model evolves, does NFS have to change to

> stay in

> sync with it? Shouldnt all this stuff be as opaque as it can be?

> You do need locks

> for making sure the map doesnt change.


A new backend protocol would cause a new map flavor, and a new document

for that map flavor's wire format.


Do we want to get an IETF action every time we need to add a new flavor?

How easy is this process going to be? How easy is it going to be to change.

Sounds like it will be very much harder than just changing out your plugins.

Are we linking several different communities to work in lock

step? Is that good or bad?



Thanks

Gary



The challenge for us is to make the typing and sizing of the opaque map

flexible enough to allow said extension. The fall back, if the new

backend is very much different from the scope of SBC, OSD and NFS,

would be do further extend NFSv4. I hope we do not have to do that.


> I am confused why we gave up on trying to make this agnostic.

> Sorry for the question, but I have this built in alarm, that goes off

> when I see

> any numbers besides 0,1, and N (not 3).


This is intended to be a structure for 1, 2 and 3, with an inductive

step enabling any positive whole number to be induced :-)


> Thanks

> Gary


garth




Yahoo! Groups Links
To visit your group on the web, go to:
http://groups.yahoo.com/group/pnfs-reqs/
To unsubscribe from this group, send an email to:
pnfs-reqs-unsubscribe@yahoogroups.com
Your use of Yahoo! Groups is subject to the Yahoo! Terms of Service.

Yahoo! Groups Sponsor
ADVERTISEMENT
Click Here




Yahoo! Groups Links
To visit your group on the web, go to:
http://groups.yahoo.com/group/pnfs-reqs/

To unsubscribe from this group, send an email to:
pnfs-reqs-unsubscribe@yahoogroups.com

Your use of Yahoo! Groups is subject to the Yahoo! Terms of Service.





Yahoo! Groups Links

To visit your group on the web, go to:
http://groups.yahoo.com/group/pnfs-reqs/

To unsubscribe from this group, send an email to:
pnfs-reqs-unsubscribe@yahoogroups.com

Your use of Yahoo! Groups is subject to the Yahoo! Terms of Service.

From pcorbett@netapp.com Fri Feb 13 06:13:10 2004
Return-Path: <Peter.Corbett@netapp.com>
X-Sender: Peter.Corbett@netapp.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 52136 invoked from network); 13 Feb 2004 14:13:06 -0000
Received: from unknown (66.218.66.172)
by m14.grp.scd.yahoo.com with QMQP; 13 Feb 2004 14:13:06 -0000
Received: from unknown (HELO mx01.netapp.com) (198.95.226.53)
by mta4.grp.scd.yahoo.com with SMTP; 13 Feb 2004 14:13:06 -0000
Received: from frejya.corp.netapp.com (frejya [10.57.157.119])
by mx01.netapp.com (8.12.10/8.12.10/NTAP-1.4) with ESMTP id i1DED5JC022822
for <pnfs-reqs@yahoogroups.com>; Fri, 13 Feb 2004 06:13:05 -0800 (PST)
Received: from svlexc01.hq.netapp.com (svlexc01.corp.netapp.com [10.10.22.171])
by frejya.corp.netapp.com (8.12.9/8.12.9/NTAP-1.5) with ESMTP id i1DED5iH002891
for <pnfs-reqs@yahoogroups.com>; Fri, 13 Feb 2004 06:13:05 -0800 (PST)
content-class: urn:content-classes:message
MIME-Version: 1.0
Content-Type: multipart/related;
type="multipart/alternative";
boundary="----_=_NextPart_001_01C3F23B.797167B2"
X-MimeOLE: Produced By Microsoft Exchange V6.0.6249.0
Date: Fri, 13 Feb 2004 06:12:55 -0800
Message-ID: <C8CF60CFC4D8A74E9945E32CF096548A015BF359@silver.nane.netapp.com>
X-MS-Has-Attach: yes
X-MS-TNEF-Correlator:
Thread-Topic: [pnfs-reqs] concall tomorrow
Thread-Index: AcPxz2tga1ZicUe6RVmgDK9LWOKhBQAa63FA
To: <pnfs-reqs@yahoogroups.com>
X-eGroups-Remote-IP: 198.95.226.53
X-eGroups-From: "Corbett, Peter" <Peter.Corbett@netapp.com>
From: "Corbett, Peter" <pcorbett@netapp.com>
Subject: RE: [pnfs-reqs] concall tomorrow
X-Yahoo-Group-Post: member; u=44152959
X-Yahoo-Profile: pfcorbett2004

ADVERTISEMENT
That's right.  I think we'll do most of the work in the ops group, and the 3 transport mechanism groups will work out the specific details of the descriptors.  There will be some back and forth to make sure we get the ops set right.

    -----Original Message-----
    From: Gary Grider [mailto:ggrider@lanl.gov]
    Sent: Thursday, February 12, 2004 8:19 PM
    To: pnfs-reqs@yahoogroups.com
    Subject: RE: [pnfs-reqs] concall tomorrow


    So I think having a header that has the info that is needed for interop is fine and
    having a payload that is opaque is fine. 

    So explain why we need three different "wire formats"   for these three
    different methods?  Why arent these things just 3 different types within the
    same wire format?  Or is that what is envisioned, and I am just hung up on the
    terminology, and what you are saying is that we need to
    1) define the header for interop
        type, version, length perhaps etc.
    and
    2) define
    a) opaque info for SBC type = 1 version = 1
    b) opaque info for Object type = 2 version = 1
    c) opaque info for NFS back ends type = 3 version = 1

    and if someone wants to write an IETF request to have type = 4 added
    they can write a draft and get it approved and document the opaque info format?

    Thanks
    Gary

    At 08:03 PM 2/12/2004 -0500, Halevy, Benny wrote:

>     >  If we do a separate IETF process for each new format, wont it
>     > be hard to keep up. 
>      
>     I don't see that as necessarily a bad thing...
>      
>     > I agree we need to have a "type" of back end and maybe a version
>     > or something, but what is in the map could be of no concern to the IETF, couldnt it?
>     The IETF is concerned about interoperability.  I believe we should be concerned with standardizing
>     the wire format of the layout maps.  Do we *have* to do that within the IETF?  There
>     could be other options but since you have to refer to some external standard to talk about
>     interoperability it'll be difficult to refer to non-existing standards that are being standardized
>     outside of the IETF.
>      
>     T10-OSD is one such external standard that substantial enough so it can be referred to.
>      
>     > Do we want to get an IETF action every time we need to add a new flavor?
>      
>     Probably, in order to extend the vector of available flavors.  This could be done within
>     NFSv4's minor versioning model.
>
>     > How easy is this process going to be?  How easy is it going to be to change.
>     > Sounds like it will be very much harder than just changing out your plugins.
>      
>     Again, there's a trade-off between versatility and interoperability.
>
>     > Are we linking several different communities to work in lock
>     > step?  Is that good or bad?
>      
>     The hierarchical standard model is intended to allow each flavor to make progress at its own pace.
>
>     Benny
>
>         -----Original Message-----
>         From: Gary Grider [mailto:ggrider@lanl.gov]
>         Sent: Thursday, February 12, 2004 7:44 PM
>         To: pnfs-reqs@yahoogroups.com
>         Subject: Re: [pnfs-reqs] concall tomorrow
>
>         At 04:17 PM 2/12/2004 -0800, Garth Gibson wrote:
>
>>             On Feb 12, 2004, at 3:31 PM, Gary Grider wrote:
>>             >  At 04:10 PM 2/11/2004 -0800, Garth Gibson wrote:
>>             >>  - its time to get back to the original roles of the mailing lists:
>>             >>        - a draft of a requirements doc
>>             >>        - a draft of the operations we suggest for NFSv4 extension
>>             >>        - a draft of the wire format of layout metadata for SBC
>>             >> (FC/SCSI) backends
>>             >>        - a draft of the wire format of layout metadata for OSD
>>             >> backends
>>             >>        - a draft of the wire format of layout metadata for NFS
>>             >> backends
>>             >
>>             >  Ok, I have a big problem with the last three statements.  I know I
>>             > have not been following
>>             >  this stuff as closely as I should have, but why cant the maps be sent
>>             > to from the server
>>             >  in an agnostic way?  Why does the NFS server have to understand any
>>             > of what is in the
>>             >  map?  Why isnt there a plug in on the server side to get the map info
>>             > for the server to
>>             >  send to the client, and why isnt there a plug in on the client side
>>             > to pass the map down to?
>>
>>             The NFS server does not have to understand what is in the map, other
>>             than enough to know what file the layout delegation and map pertain to
>>             for recalling that delegation.
>>
>>             However, for any chance of interoperability, the map formats must be
>>             documented.
>
>
>         agree, but does IETF care about this, its just information that NFS can pass on,
>         seems like if a consortia of T10 folks want a common set of plugins, thats great,
>         if SBC folks want a common plug in, thats great, but why should NFS/IETF care
>         about this?
>
>>               And since each backend server flavor has addressing
>>             characteristics that will be visible in the map, the documented map
>>             formats will be specific to the backend flavors.
>>
>>             The theory we started with was that the base NFSv4 extensions would
>>             describe opaque "maps".  And that we would propose separate internet
>>             drafts for a wire format for each flavor of map. 
>
>
>         Why do we need a different wire format?  Isnt it just a blob of data that is passed
>         through some normal NFS protocol mechanism?  Why does it need a wire format?
>         I agree it needs a published format of the stream of data.  Am I reading more into
>         "wire" than I should.  If we do a separate IETF process for each new format, wont it
>         be hard to keep up.  I agree we need to have a "type" of back end and maybe a version
>         or something, but what is in the map could be of no concern to the IETF, couldnt it?
>
>>              Hence the NFSv4
>>             extensions are backend protocol agnostic, yet the client
>>             implementations can be interoperable.
>>
>>             >  What if the world decides to add another way to do I/O besides Block,
>>             > Object, and NFS?
>>             >  What happens if the Object model evolves, does NFS have to change to
>>             > stay in
>>             >  sync with it?   Shouldnt all this stuff be as opaque as it can be? 
>>             > You do need locks
>>             >  for making sure the map doesnt change.
>>
>>             A new backend protocol would cause a new map flavor, and a new document
>>             for that map flavor's wire format.
>
>
>         Do we want to get an IETF action every time we need to add a new flavor?
>         How easy is this process going to be?  How easy is it going to be to change.
>         Sounds like it will be very much harder than just changing out your plugins.
>         Are we linking several different communities to work in lock
>         step?  Is that good or bad?
>
>
>         Thanks
>         Gary
>
>
>>             The challenge for us is to make the typing and sizing of the opaque map
>>             flexible enough to allow said extension.  The fall back, if the new
>>             backend is very much different from the scope of SBC, OSD and NFS,
>>             would be do further extend NFSv4.  I hope we do not have to do that.
>>
>>             >  I am confused why we gave up on trying to make this agnostic.
>>             >  Sorry for the question, but I have this built in alarm, that goes off
>>             > when I see
>>             >  any numbers besides 0,1, and N  (not 3).
>>
>>             This is intended to be a structure for 1, 2 and 3, with an inductive
>>             step enabling any positive whole number to be induced :-)
>>
>>             >  Thanks
>>             >  Gary
>>
>>             garth
>>
>>             Yahoo! Groups Links 
>
>         * To visit your group on the web, go to:
>         * http://groups.yahoo.com/group/pnfs-reqs/
>         * To unsubscribe from this group, send an email to:
>         * pnfs-reqs-unsubscribe@yahoogroups.com
>         * Your use of Yahoo! Groups is subject to the Yahoo! Terms of Service. 


    Yahoo! Groups Sponsor
    ADVERTISEMENT
    1fd596e.jpgClick Here
    1fd5ab9.jpg

    Yahoo! Groups Links

        * To visit your group on the web, go to:
        * http://groups.yahoo.com/group/pnfs-reqs/
        *
        * To unsubscribe from this group, send an email to:
        * pnfs-reqs-unsubscribe@yahoogroups.com
        *
        * Your use of Yahoo! Groups is subject to the Yahoo! Terms of Service.

From julian_satran@il.ibm.com Mon Feb 16 08:53:48 2004
Return-Path: <Julian_Satran@il.ibm.com>
X-Sender: Julian_Satran@il.ibm.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 83072 invoked from network); 16 Feb 2004 16:53:48 -0000
Received: from unknown (66.218.66.167)
by m16.grp.scd.yahoo.com with QMQP; 16 Feb 2004 16:53:48 -0000
Received: from unknown (HELO mtagate3.uk.ibm.com) (195.212.29.136)
by mta6.grp.scd.yahoo.com with SMTP; 16 Feb 2004 16:53:47 -0000
Received: from d06nrmr1407.portsmouth.uk.ibm.com (d06nrmr1407.portsmouth.uk.ibm.com [9.149.38.185])
by mtagate3.uk.ibm.com (8.12.10/8.12.10) with ESMTP id i1GGrkMf044458
for <pnfs-reqs@yahoogroups.com>; Mon, 16 Feb 2004 16:53:46 GMT
Received: from d12ml102.megacenter.de.ibm.com (d06av02.portsmouth.uk.ibm.com [9.149.37.228])
by d06nrmr1407.portsmouth.uk.ibm.com (8.12.10/NCO/VER6.6) with ESMTP id i1GGrjFK184550
for <pnfs-reqs@yahoogroups.com>; Mon, 16 Feb 2004 16:53:46 GMT
In-Reply-To: <5.2.0.9.2.20040212181154.01621d60@cic-mail.lanl.gov>
To: pnfs-reqs@yahoogroups.com
Cc: pnfs-reqs@yahoogroups.com
MIME-Version: 1.0
X-Mailer: Lotus Notes Release 6.5 September 26, 2003
Message-ID: <OFB9CBC112.7A654A7B-ONC2256E3C.005A9415-C2256E3C.005CCEF1@il.ibm.com>
Date: Mon, 16 Feb 2004 18:55:39 +0200
X-MIMETrack: Serialize by Router on D12ML102/12/M/IBM(Release 6.0.2CF2|July 23, 2003) at
16/02/2004 18:55:41,
Serialize complete at 16/02/2004 18:55:41
Content-Type: text/plain; charset="US-ASCII"
X-eGroups-Remote-IP: 195.212.29.136
From: Julian Satran <julian_satran@il.ibm.com>
Subject: RE: [pnfs-reqs] plugins vs. wire protocol - a false dilemma
X-Yahoo-Group-Post: member; u=64714603
X-Yahoo-Profile: julian_satran

I followed this thread with interest. I agree with Garth that wire format
is required for interoperability - and an opaque format is good only for
communication between entities of the same kind. That is required and
having more than one is not necessarily a bad thing.

However Gary has a good point in that by introducing many different
formats server and client implementers may feel somewhat uneasy (and may
delay implementations).

Security that Benny used as a benchmark has solved this by standardizing
an API (GSS-API) besides the wire protocol (however beware IETF does not
like to standardize APIs - unless presented as a semantic definition for
the endpoint operation).

Perhaps the right thing to do is define both an API for the plug-in Gary
suggested and the wire protocol (that enables interoperability).

And BTW - using a meaningful subject line is part of the mailing list
etiquette - I almost dropped the whole thread knowing that I can't make
the call.

Regards,
Julo

From black_david@emc.com Mon Feb 16 14:43:45 2004
Return-Path: <Black_David@emc.com>
X-Sender: Black_David@emc.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 82548 invoked from network); 16 Feb 2004 22:43:44 -0000
Received: from unknown (66.218.66.167)
by m19.grp.scd.yahoo.com with QMQP; 16 Feb 2004 22:43:44 -0000
Received: from unknown (HELO srexchimc2.eng.emc.com) (168.159.100.11)
by mta6.grp.scd.yahoo.com with SMTP; 16 Feb 2004 22:43:44 -0000
Received: from maho3msx2.corp.emc.com ([128.221.11.32]) by srexchimc2.eng.emc.com with SMTP (Microsoft Exchange Internet Mail Service Version 5.5.2657.72)
id 1ZQJFTF2; Mon, 16 Feb 2004 17:43:43 -0500
Received: by maho3msx2.isus.emc.com with Internet Mail Service (5.5.2653.19)
id <1YVFBR8B>; Mon, 16 Feb 2004 17:43:42 -0500
Message-ID: <B459CE1AFFC52D4688B2A5B842CA35EA7A55B7@corpmx14.us.dg.com>
X-Sybari-Trust: 858f5b34 1d8c424f c070781c 0000013d
To: pnfs-reqs@yahoogroups.com
Date: Mon, 16 Feb 2004 17:43:36 -0500
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2653.19)
Content-Type: text/plain
X-eGroups-Remote-IP: 168.159.100.11
From: black_david@emc.com
Subject: Data formats & Interoperability (was: concall tomorrow)
X-Yahoo-Group-Post: member; u=82420288
X-Yahoo-Profile: dlb237

ADVERTISEMENT
click here
Gary Grider wrote:

So I think having a header that has the info that is needed for
interop is fine and having a payload that is opaque is fine.

So explain why we need three different "wire formats" for these
three
different methods? Why arent these things just 3 different types
within
the same wire format? Or is that what is envisioned, and I am just
hung
up on the terminology, and what you are saying is that we need to

1) define the header for interop type, version, length perhaps etc.
and
2) define
a) opaque info for SBC type = 1 version = 1
b) opaque info for Object type = 2 version = 1
c) opaque info for NFS back ends type = 3 version = 1

and if someone wants to write an IETF request to have type = 4 added
they can write a draft and get it approved and document the opaque
info format?

That's correct, and there are lots of IETF examples of protocols that
are structured in this fashion.

There is one issue that may cause some difficulty - mandatory requirements
for interoperability. IETF requires that two implementations of the same
protocol be capable of interoperating when implementers have made different
choices among optional to implement features. This interoperation can
be dependent on suitable configuration of the implementations.

For example, there are mandatory-to-implement cryptographic algorithm
requirements for protocols like IPsec and TLS to ensure that any two
implementations can interoperate even if they've implemented different
sets of cryptographic algorithms. In that case the "mandatory to implement"
cryptographic algorithms will have been implemented by both, and will result
in interoperation if they are chosen by both sides, although there's no
requirement that they be offered or selected in negotiation.

Requiring any one of the three metadata types Gary lists above to
always be implemented is going to cause problems for some of the
envisioned implementations.

I think the best bet for the pNFS extensions will be to define "none"
(i.e., vanilla NFSv4) as the "mandatory to implement" interoperable mode,
with all pNFS extensions being optional among mutually consenting clients
and servers. This may also settle an earlier issue about whether
metadata-only servers should not be supported - they can't fall back to
NFSv4 as the "mandatory to implement" interoperable mode, and hence
will get us into a tarpit over which metadata type must be "mandatory
to implement" to ensure interoperability. IMHO, the best bet is to
"just say no" - pNFS is an NFSv4 extension, therefore any implementation
of pNFS is required to implement vanilla NFSv4 without any pNFS extensions
in addition to pNFS-extended NFSv4.

Thanks,
--David
----------------------------------------------------
David L. Black, Senior Technologist
EMC Corporation, 176 South St., Hopkinton, MA 01748
+1 (508) 293-7953 FAX: +1 (508) 293-7786
black_david@emc.com Mobile: +1 (978) 394-7754
----------------------------------------------------

From bhalevy@panasas.com Mon Feb 16 15:01:17 2004
Return-Path: <bhalevy@panasas.com>
X-Sender: bhalevy@panasas.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 69170 invoked from network); 16 Feb 2004 23:01:14 -0000
Received: from unknown (66.218.66.218)
by m3.grp.scd.yahoo.com with QMQP; 16 Feb 2004 23:01:14 -0000
Received: from unknown (HELO PIKES.panasas.com) (65.194.124.178)
by mta3.grp.scd.yahoo.com with SMTP; 16 Feb 2004 23:01:14 -0000
Received: by PIKES.panasas.com with Internet Mail Service (5.5.2653.19)
id <SVSYPAKM>; Mon, 16 Feb 2004 18:01:12 -0500
Message-ID: <30489F1321F5C343ACF6872B2CF7942A05D38883@PIKES.panasas.com>
To: "'pnfs-reqs@yahoogroups.com'" <pnfs-reqs@yahoogroups.com>
Date: Mon, 16 Feb 2004 18:01:06 -0500
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2653.19)
Content-Type: text/plain;
charset="iso-8859-1"
X-eGroups-Remote-IP: 65.194.124.178
From: "Halevy, Benny" <bhalevy@panasas.com>
Subject: RE: [pnfs-reqs] Data formats & Interoperability (was: concall tom
orrow)
X-Yahoo-Group-Post: member; u=169276676
X-Yahoo-Profile: benny_halevy

>I think the best bet for the pNFS extensions will be to define "none"
>(i.e., vanilla NFSv4) as the "mandatory to implement"
>interoperable mode,
>with all pNFS extensions being optional among mutually
>consenting clients
>and servers. This may also settle an earlier issue about whether
>metadata-only servers should not be supported - they can't fall back to
>NFSv4 as the "mandatory to implement" interoperable mode, and hence
>will get us into a tarpit over which metadata type must be "mandatory
>to implement" to ensure interoperability. IMHO, the best bet is to
>"just say no" - pNFS is an NFSv4 extension, therefore any
>implementation
>of pNFS is required to implement vanilla NFSv4 without any
>pNFS extensions
>in addition to pNFS-extended NFSv4.

I think that makes a lot of sense and goes well with NFSv4's minor
versioning rules.

Benny

>-----Original Message-----
>From: black_david@emc.com [mailto:black_david@emc.com]
>Sent: Monday, February 16, 2004 5:44 PM
>To: pnfs-reqs@yahoogroups.com
>Subject: [pnfs-reqs] Data formats & Interoperability (was: concall
>tomorrow)
>
>
>Gary Grider wrote:
>
> So I think having a header that has the info that is needed for
> interop is fine and having a payload that is opaque is fine.
>
> So explain why we need three different "wire formats"
>for these
>three
> different methods? Why arent these things just 3
>different types
>within
> the same wire format? Or is that what is envisioned,
>and I am just
>hung
> up on the terminology, and what you are saying is that
>we need to
>
> 1) define the header for interop type, version, length
>perhaps etc.
> and
> 2) define
> a) opaque info for SBC type = 1 version = 1
> b) opaque info for Object type = 2 version = 1
> c) opaque info for NFS back ends type = 3 version = 1
>
> and if someone wants to write an IETF request to have
>type = 4 added
> they can write a draft and get it approved and document
>the opaque
>info format?
>
>That's correct, and there are lots of IETF examples of protocols that
>are structured in this fashion.
>
>There is one issue that may cause some difficulty - mandatory
>requirements
>for interoperability. IETF requires that two implementations
>of the same
>protocol be capable of interoperating when implementers have
>made different
>choices among optional to implement features. This interoperation can
>be dependent on suitable configuration of the implementations.
>
>For example, there are mandatory-to-implement cryptographic algorithm
>requirements for protocols like IPsec and TLS to ensure that any two
>implementations can interoperate even if they've implemented different
>sets of cryptographic algorithms. In that case the "mandatory
>to implement"
>cryptographic algorithms will have been implemented by both,
>and will result
>in interoperation if they are chosen by both sides, although there's no
>requirement that they be offered or selected in negotiation.
>
>Requiring any one of the three metadata types Gary lists above to
>always be implemented is going to cause problems for some of the
>envisioned implementations.
>
>I think the best bet for the pNFS extensions will be to define "none"
>(i.e., vanilla NFSv4) as the "mandatory to implement"
>interoperable mode,
>with all pNFS extensions being optional among mutually
>consenting clients
>and servers. This may also settle an earlier issue about whether
>metadata-only servers should not be supported - they can't fall back to
>NFSv4 as the "mandatory to implement" interoperable mode, and hence
>will get us into a tarpit over which metadata type must be "mandatory
>to implement" to ensure interoperability. IMHO, the best bet is to
>"just say no" - pNFS is an NFSv4 extension, therefore any
>implementation
>of pNFS is required to implement vanilla NFSv4 without any
>pNFS extensions
>in addition to pNFS-extended NFSv4.
>
>Thanks,
>--David
>----------------------------------------------------
>David L. Black, Senior Technologist
>EMC Corporation, 176 South St., Hopkinton, MA 01748
>+1 (508) 293-7953 FAX: +1 (508) 293-7786
>black_david@emc.com Mobile: +1 (978) 394-7754
>----------------------------------------------------
>
>
>
>
>
>
>
>
>
>
>
>
>
>------------------------ Yahoo! Groups Sponsor
>---------------------~-->
>Buy Ink Cartridges or Refill Kits for your HP, Epson, Canon or Lexmark
>Printer at MyInks.com. Free s/h on orders $50 or more to the
>US & Canada.
>http://www.c1tracking.com/l.asp?cid=5511
>http://us.click.yahoo.com/mOAaAA/3exGAA/qnsNAA/W6uqlB/TM
>---------------------------------------------------------------
>------~->
>
>
>Yahoo! Groups Links
>
>
>
>
>

From black_david@emc.com Mon Feb 16 15:04:45 2004
Return-Path: <Black_David@emc.com>
X-Sender: Black_David@emc.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 88331 invoked from network); 16 Feb 2004 23:04:42 -0000
Received: from unknown (66.218.66.172)
by m13.grp.scd.yahoo.com with QMQP; 16 Feb 2004 23:04:42 -0000
Received: from unknown (HELO mercury.eng.emc.com) (168.159.100.12)
by mta4.grp.scd.yahoo.com with SMTP; 16 Feb 2004 23:04:41 -0000
Received: from mxic2.corp.emc.com ([128.221.12.9]) by mercury.eng.emc.com with SMTP (Microsoft Exchange Internet Mail Service Version 5.5.2656.59)
id 1ZQHJRNX; Mon, 16 Feb 2004 18:04:40 -0500
Received: by mxic2.corp.emc.com with Internet Mail Service (5.5.2653.19)
id <1YVBKVS1>; Mon, 16 Feb 2004 18:03:37 -0500
Message-ID: <B459CE1AFFC52D4688B2A5B842CA35EA7A55B8@corpmx14.us.dg.com>
X-Sybari-Trust: e925cc80 1d8c424f 321aa0c1 0000013d
To: pnfs-reqs@yahoogroups.com
Date: Mon, 16 Feb 2004 18:04:33 -0500
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2653.19)
Content-Type: text/plain
X-eGroups-Remote-IP: 168.159.100.12
From: black_david@emc.com
Subject: RE: [pnfs-reqs] plugins vs. wire protocol - a false dilemma
X-Yahoo-Group-Post: member; u=82420288
X-Yahoo-Profile: dlb237

Julian,

The opaque format (and API) approach for interoperability works
when the opaqueness is to things that are above the level of the
protocol involved. For example, NFSv4 has nothing to say about
the contents of the files it provides access to - they're opaque
and rightly so.

The problem we have here is that pNFS is at the functional level
of NFSv4. If pNFS can't be configured to work (e.g., no metadata
type is common to client and server) and NFSv4 is not supported
by both ends, the result is no file access, no matter how the ends
of the connection are configured. Needless to say, this would
be wrong for implementations claiming to meet the requirements
of an IETF spec. As I noted in my previous message, requiring
vanilla NFSv4 without pNFS extensions as the interoperable mode
is probably the best way forward here. I don't think defining
a plug-in API will help with this interoperability issue, although
describing the functional interface between the generic pNFS
client and the storage-specific functionality is probably still
a good thing to do.

Also, attempting to anticipate a further issue from Gary - the
IETF requirement is functionality only, not performance. In
other words, NFSv4 just has to work, it doesn't have to be fast
(e.g., if all the clients in a pNFS system suddenly fall back
on NFSv4 for some reason, it's ok w/IETF if the results are
slow, although Gary might be very unhappy if one of his systems
ever did this).

Thanks,
--David

> I followed this thread with interest. I agree with Garth that wire format
> is required for interoperability - and an opaque format is good only for
> communication between entities of the same kind. That is required and
> having more than one is not necessarily a bad thing.
>
> However Gary has a good point in that by introducing many different
> formats server and client implementers may feel somewhat uneasy (and may
> delay implementations).
>
> Security that Benny used as a benchmark has solved this by standardizing
> an API (GSS-API) besides the wire protocol (however beware IETF does not
> like to standardize APIs - unless presented as a semantic definition for
> the endpoint operation).
>
> Perhaps the right thing to do is define both an API for the plug-in Gary
> suggested and the wire protocol (that enables interoperability).
>
> And BTW - using a meaningful subject line is part of the mailing list
> etiquette - I almost dropped the whole thread knowing that I
> can't make
> the call.
>
> Regards,
> Julo

From garth@panasas.com Fri Feb 20 16:22:40 2004
Return-Path: <garth@panasas.com>
X-Sender: garth@panasas.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 57576 invoked from network); 21 Feb 2004 00:22:36 -0000
Received: from unknown (66.218.66.217)
by m18.grp.scd.yahoo.com with QMQP; 21 Feb 2004 00:22:36 -0000
Received: from unknown (HELO PIKES.panasas.com) (65.194.124.178)
by mta2.grp.scd.yahoo.com with SMTP; 21 Feb 2004 00:22:36 -0000
Received: from [172.17.2.81] ([172.17.2.81]) by PIKES.panasas.com with SMTP (Microsoft Exchange Internet Mail Service Version 5.5.2653.13)
id FK58QLDY; Fri, 20 Feb 2004 19:22:34 -0500
Mime-Version: 1.0 (Apple Message framework v612)
In-Reply-To: <5.2.0.9.2.20040212213826.038a2208@cic-mail.lanl.gov>
References: <5.2.0.9.2.20040212213826.038a2208@cic-mail.lanl.gov>
Content-Type: multipart/mixed; boundary=Apple-Mail-1-231497768
Message-Id: <064537AE-6404-11D8-A7AF-000A95A94F04@panasas.com>
Date: Fri, 20 Feb 2004 19:22:25 -0500
To: pnfs-reqs@yahoogroups.com
X-Mailer: Apple Mail (2.612)
X-eGroups-Remote-IP: 65.194.124.178
From: Garth Gibson <garth@panasas.com>
Subject: Re: [pnfs-reqs] RE: NEPS-REQS: getting started
X-Yahoo-Group-Post: member; u=169457820
X-Yahoo-Profile: garth_a_gibson

Based on feedback from Brent's concall 8 days ago, here is my cut at
Gary's proposal for a short problem statement introduction
presentation.

garth

On Feb 12, 2004, at 11:41 PM, Gary Grider wrote:
> I agree.
>
> I added some parallel stuff.
>
> Thanks
> Gary
>
> At 07:22 AM 2/12/2004 -0800, Corbett, Peter wrote:
>
> The slides mention server bypass twice, but don't talk about parallel
> access. Our problem statement is much more focussed on parallel
> access.
> We are trying to define a standard for parallel access, whether that is
> to nfs based data servers, object servers, virtualized SAN or
> non-virtualized SAN devices. So, I think we need to be clearer when
> talking about bypassing the server that we are really talking about
> direct access to a parallel data store from clustered clients, with a
> shared NFS server acting as a metadata server. To me, that is the key
> point that applies across the entire solution space, whereas direct
> access to devices is a data access technique in part of the solution
> space.
>
> -----Original Message-----
> From: Gary Grider [ mailto:ggrider@lanl.gov <mailto:ggrider@lanl.gov>
> ] Sent: Thursday, February 12, 2004 12:55 AM
> To: pnfs-reqs@yahoogroups.com
> Subject: Re: Fwd: [pnfs-reqs] RE: NEPS-REQS: getting started
>
>
> elevator pitch
>
> Gary
>



Attachment (not stored)
pNFS-intro.ppt
Type: application/vnd.ms-powerpoint

From dnoveck@netapp.com Sun Feb 22 10:22:12 2004
Return-Path: <Dave.Noveck@netapp.com>
X-Sender: Dave.Noveck@netapp.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 79037 invoked from network); 22 Feb 2004 18:22:11 -0000
Received: from unknown (66.218.66.167)
by m6.grp.scd.yahoo.com with QMQP; 22 Feb 2004 18:22:11 -0000
Received: from unknown (HELO mx01.netapp.com) (198.95.226.53)
by mta6.grp.scd.yahoo.com with SMTP; 22 Feb 2004 18:22:11 -0000
Received: from hawk.corp.netapp.com (hawk [10.57.156.122])
by mx01.netapp.com (8.12.10/8.12.10/NTAP-1.4) with ESMTP id i1MIMBJC011267
for <pnfs-reqs@yahoogroups.com>; Sun, 22 Feb 2004 10:22:11 -0800 (PST)
Received: from svlexc01.hq.netapp.com (svlexc01.corp.netapp.com [10.10.22.171])
by hawk.corp.netapp.com (8.12.9/8.12.9/NTAP-1.5) with ESMTP id i1MIMBDU023705
for <pnfs-reqs@yahoogroups.com>; Sun, 22 Feb 2004 10:22:11 -0800 (PST)
content-class: urn:content-classes:message
MIME-Version: 1.0
Content-Type: text/plain;
charset="iso-8859-1"
Content-Transfer-Encoding: quoted-printable
X-MimeOLE: Produced By Microsoft Exchange V6.0.6249.0
Date: Sun, 22 Feb 2004 10:22:02 -0800
Message-ID: <C8CF60CFC4D8A74E9945E32CF096548A6D36B5@silver.nane.netapp.com>
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
Thread-Topic: [pnfs-reqs] RE: NEPS-REQS: getting started
Thread-Index: AcP4ENaVWrKnCLT9S1ivd9w4C4AJJQBXI02Q
To: <pnfs-reqs@yahoogroups.com>
X-eGroups-Remote-IP: 198.95.226.53
X-eGroups-From: "Noveck, Dave" <Dave.Noveck@netapp.com>
From: "Noveck, Dave" <dnoveck@netapp.com>
Subject: RE: [pnfs-reqs] RE: NEPS-REQS: getting started
X-Yahoo-Group-Post: member; u=44152831
X-Yahoo-Profile: davidnoveck

I have some suggestions on slides 5 and 6.

I would drop the line about delegations from this slide. Unless
you come to this from the sorts of discussions we have been having
(and thus aren't the critical part of the audience), this is
really not going to be understandable. One problem that we have
in presenting this is that if we explain the situation, we wind up
having to explain that we think we pretty much know how to do this
already and just need the IETF to bless our choice (I'm exaggerating
but only some), and that isn't likely to go down very well with a
lot of people.

I'd express the last sub-bullet in this section as something like:

NFSv4 minor version model a good way to provide incremental extensions

which doesn't say that we know pretty much what these are (but it
doesn't say we don't :-)

As to the last section of slide 6, I'd revise to be something like
the following, again to reduce the we-know-how-to-do-this tone.

Much interest in exploring how v4 could be extended to solve this

Extension of delegations to provide "layout" information to clients

Clients use layout information to do IO and avoid single-server bottleneck

NFS, SCSI Block, SCSI Object layout formats all discussed

Support for multiple formats looks desirable (and doable).

-----Original Message-----
From: Garth Gibson [mailto:garth@panasas.com]
Sent: Friday, February 20, 2004 7:22 PM
To: pnfs-reqs@yahoogroups.com
Subject: Re: [pnfs-reqs] RE: NEPS-REQS: getting started


Based on feedback from Brent's concall 8 days ago, here is my cut at
Gary's proposal for a short problem statement introduction
presentation.

garth

On Feb 12, 2004, at 11:41 PM, Gary Grider wrote:
> I agree.
>
> I added some parallel stuff.
>
> Thanks
> Gary
>
> At 07:22 AM 2/12/2004 -0800, Corbett, Peter wrote:
>
> The slides mention server bypass twice, but don't talk about parallel
> access. Our problem statement is much more focussed on parallel
> access.
> We are trying to define a standard for parallel access, whether that is
> to nfs based data servers, object servers, virtualized SAN or
> non-virtualized SAN devices. So, I think we need to be clearer when
> talking about bypassing the server that we are really talking about
> direct access to a parallel data store from clustered clients, with a
> shared NFS server acting as a metadata server. To me, that is the key
> point that applies across the entire solution space, whereas direct
> access to devices is a data access technique in part of the solution
> space.
>
> -----Original Message-----
> From: Gary Grider [ mailto:ggrider@lanl.gov <mailto:ggrider@lanl.gov>
> ] Sent: Thursday, February 12, 2004 12:55 AM
> To: pnfs-reqs@yahoogroups.com
> Subject: Re: Fwd: [pnfs-reqs] RE: NEPS-REQS: getting started
>
>
> elevator pitch
>
> Gary
>





Yahoo! Groups Links

From garth@panasas.com Sun Feb 22 15:44:29 2004
Return-Path: <garth@panasas.com>
X-Sender: garth@panasas.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 14930 invoked from network); 22 Feb 2004 23:44:28 -0000
Received: from unknown (66.218.66.166)
by m18.grp.scd.yahoo.com with QMQP; 22 Feb 2004 23:44:28 -0000
Received: from unknown (HELO PIKES.panasas.com) (65.194.124.178)
by mta5.grp.scd.yahoo.com with SMTP; 22 Feb 2004 23:44:27 -0000
Received: from [127.0.0.1] ([172.17.19.3]) by PIKES.panasas.com with SMTP (Microsoft Exchange Internet Mail Service Version 5.5.2653.13)
id FLP29YY1; Sun, 22 Feb 2004 18:44:26 -0500
Mime-Version: 1.0 (Apple Message framework v612)
In-Reply-To: <C8CF60CFC4D8A74E9945E32CF096548A6D36B5@silver.nane.netapp.com>
References: <C8CF60CFC4D8A74E9945E32CF096548A6D36B5@silver.nane.netapp.com>
Content-Type: multipart/mixed; boundary=Apple-Mail-3-402012520
Message-Id: <08FB86C8-6591-11D8-9D79-000A95A94F04@panasas.com>
Date: Sun, 22 Feb 2004 18:44:20 -0500
To: pnfs-reqs@yahoogroups.com
X-Mailer: Apple Mail (2.612)
X-eGroups-Remote-IP: 65.194.124.178
From: Garth Gibson <garth@panasas.com>
Subject: Re: [pnfs-reqs] RE: NEPS-REQS: getting started
X-Yahoo-Group-Post: member; u=169457820
X-Yahoo-Profile: garth_a_gibson

I understand the sensitivity of IETF to having solutions presented
instead of problems.

Here is a revision following the recommendations below, with some word
choices of my own. Specifically, I'm reluctant to say nothing about
how NFSv4 is better for fixing this than NFSv3; that is, the definition
of NFSv4 creates the opportunity for "direct" or "out-of-band" access.

Page 5: the two lines in question,

- NFSv4, relative to NFSv3, has enhanced client side optimizations
- NFSv4 minor extensions may suffice for incremental functionality

Page 6: last section:

Much interest in exploring NFSv4 extensions to meet scalability needs
- Extend NFSv4 �delegations� to provide �layout� information to clients
- Clients use �layout� to directly access storage, avoiding
single-server bottleneck
- NFS, SCSI Block, and SCSI Object �layout� formats all discussed
- Support for multiple �layout� formats desirable (and looks doable)

Dave, how is this?

garth


On Feb 22, 2004, at 1:22 PM, Noveck, Dave wrote:

> I have some suggestions on slides 5 and 6.
>
> I would drop the line about delegations from this slide. Unless
> you come to this from the sorts of discussions we have been having
> (and thus aren't the critical part of the audience), this is
> really not going to be understandable. One problem that we have
> in presenting this is that if we explain the situation, we wind up
> having to explain that we think we pretty much know how to do this
> already and just need the IETF to bless our choice (I'm exaggerating
> but only some), and that isn't likely to go down very well with a
> lot of people.
>
> I'd express the last sub-bullet in this section as something like:
>
> NFSv4 minor version model a good way to provide incremental
> extensions
>
> which doesn't say that we know pretty much what these are (but it
> doesn't say we don't :-)
>
> As to the last section of slide 6, I'd revise to be something like
> the following, again to reduce the we-know-how-to-do-this tone.
>
> Much interest in exploring how v4 could be extended to solve this
>
> Extension of delegations to provide "layout" information to
> clients
>
> Clients use layout information to do IO and avoid
> single-server bottleneck
>
> NFS, SCSI Block, SCSI Object layout formats all discussed
>
> Support for multiple formats looks desirable (and doable).
>
> -----Original Message-----
> From: Garth Gibson [mailto:garth@panasas.com]
> Sent: Friday, February 20, 2004 7:22 PM
> To: pnfs-reqs@yahoogroups.com
> Subject: Re: [pnfs-reqs] RE: NEPS-REQS: getting started
>
>
> Based on feedback from Brent's concall 8 days ago, here is my cut at
> Gary's proposal for a short problem statement introduction
> presentation.
>
> garth
>



Attachment (not stored)
pNFS-intro-2-22.ppt
Type: application/vnd.ms-powerpoint

From dnoveck@netapp.com Sun Feb 22 16:29:36 2004
Return-Path: <Dave.Noveck@netapp.com>
X-Sender: Dave.Noveck@netapp.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 26422 invoked from network); 23 Feb 2004 00:29:34 -0000
Received: from unknown (66.218.66.167)
by m13.grp.scd.yahoo.com with QMQP; 23 Feb 2004 00:29:34 -0000
Received: from unknown (HELO mx01.netapp.com) (198.95.226.53)
by mta6.grp.scd.yahoo.com with SMTP; 23 Feb 2004 00:29:34 -0000
Received: from frejya.corp.netapp.com (frejya [10.57.157.119])
by mx01.netapp.com (8.12.10/8.12.10/NTAP-1.4) with ESMTP id i1N0TTJC016163
for <pnfs-reqs@yahoogroups.com>; Sun, 22 Feb 2004 16:29:29 -0800 (PST)
Received: from svlexc01.hq.netapp.com (svlexc01.corp.netapp.com [10.10.22.171])
by frejya.corp.netapp.com (8.12.9/8.12.9/NTAP-1.5) with ESMTP id i1N0TTB4000373
for <pnfs-reqs@yahoogroups.com>; Sun, 22 Feb 2004 16:29:29 -0800 (PST)
content-class: urn:content-classes:message
MIME-Version: 1.0
Content-Type: text/plain;
charset="Windows-1252"
Content-Transfer-Encoding: quoted-printable
X-MimeOLE: Produced By Microsoft Exchange V6.0.6249.0
Date: Sun, 22 Feb 2004 16:29:18 -0800
Message-ID: <C8CF60CFC4D8A74E9945E32CF096548A6D36B6@silver.nane.netapp.com>
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
Thread-Topic: [pnfs-reqs] RE: NEPS-REQS: getting started
Thread-Index: AcP5ndac4dbytrq3RvKJaomlNrNv9AAA7Y1g
To: <pnfs-reqs@yahoogroups.com>
X-eGroups-Remote-IP: 198.95.226.53
X-eGroups-From: "Noveck, Dave" <Dave.Noveck@netapp.com>
From: "Noveck, Dave" <dnoveck@netapp.com>
Subject: RE: [pnfs-reqs] RE: NEPS-REQS: getting started
X-Yahoo-Group-Post: member; u=44152831
X-Yahoo-Profile: davidnoveck

> I understand the sensitivity of IETF to having solutions presented
> instead of problems.

> Here is a revision following the recommendations below, with some word
> choices of my own. Specifically, I'm reluctant to say nothing about
> how NFSv4 is better for fixing this than NFSv3; that is, the definition
> of NFSv4 creates the opportunity for "direct" or "out-of-band" access

> Page 5: the two lines in question,

> - NFSv4, relative to NFSv3, has enhanced client side optimizations

I can see that the idea that this is just an extension of an already
establiched v4 trend is worth mentioning. My preference would be not
to mention v3 explicitly, but maybe that's one of those things that
we'll never agree on. No big deal. How about:

NFSv4 already provides mechanisms for clients to act autonomously
when circumstances allow it.

> - NFSv4 minor extensions may suffice for incremental functionality

"minor extensions" sounds like you are talking about the size of the
extensions so I would somehow use "minor version" or "minor versioning"
to indicate that v4 already has made provisions for extensions like
this (even without knowing what they would be :-)

> Page 6: last section:
>
> Much interest in exploring NFSv4 extensions to meet scalability needs
> - Extend NFSv4 �delegations� to provide �layout� information to clients

I wouldn't have the quotes around 'delegations'. Those should be
an understood idea in the NFSv4 context.

> - Clients use �layout� to directly access storage, avoiding
> single-server bottleneck
> - NFS, SCSI Block, and SCSI Object �layout� formats all discussed
> - Support for multiple �layout� formats desirable (and looks doable)

> Dave, how is this?

Looks OK to me.

From Thomas.Talpey@netapp.com Sun Feb 22 16:49:04 2004
Return-Path: <Thomas.Talpey@netapp.com>
X-Sender: Thomas.Talpey@netapp.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 86191 invoked from network); 23 Feb 2004 00:49:00 -0000
Received: from unknown (66.218.66.218)
by m20.grp.scd.yahoo.com with QMQP; 23 Feb 2004 00:49:00 -0000
Received: from unknown (HELO mx01.netapp.com) (198.95.226.53)
by mta3.grp.scd.yahoo.com with SMTP; 23 Feb 2004 00:49:00 -0000
Received: from hawk.corp.netapp.com (hawk [10.57.156.122])
by mx01.netapp.com (8.12.10/8.12.10/NTAP-1.4) with ESMTP id i1N0n0JC017966
for <pnfs-reqs@yahoogroups.com>; Sun, 22 Feb 2004 16:49:00 -0800 (PST)
Received: from svlexc01.hq.netapp.com (svlexc01.corp.netapp.com [10.10.22.171])
by hawk.corp.netapp.com (8.12.9/8.12.9/NTAP-1.5) with ESMTP id i1N0n0DU011074
for <pnfs-reqs@yahoogroups.com>; Sun, 22 Feb 2004 16:49:00 -0800 (PST)
Received: from tmt.netapp.com ([10.97.6.32]) by silver.hq.netapp.com with Microsoft SMTPSVC(5.0.2195.5329); Sun, 22 Feb 2004 19:48:51 -0500
MIME-Version: 1.0
Content-Type: multipart/alternative;
boundary="----_=_NextPart_001_01C3F9A6.CDD9DB80"
X-MimeOLE: Produced By Microsoft Exchange V6.0.6249.0
content-class: urn:content-classes:message
Date: Sun, 22 Feb 2004 16:48:40 -0800
Message-ID: <6.0.3.0.0.20040222194310.01b7da00@silver.nane.netapp.com>
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
Thread-Topic: [pnfs-reqs] RE: NEPS-REQS: getting started
Thread-Index: AcP5ps6EfXbzcs3NTDaL2WopU4HhCw==
To: <pnfs-reqs@yahoogroups.com>
X-eGroups-Remote-IP: 198.95.226.53
From: "Talpey, Thomas" <Thomas.Talpey@netapp.com>
Subject: Re: [pnfs-reqs] RE: NEPS-REQS: getting started
X-Yahoo-Group-Post: member; u=44154239
X-Yahoo-Profile: tmtymailu

ADVERTISEMENT
click here

What is "EDA" (slide 3)? Spell it out, whatever it is! :-)

I wouldn't include "FCIP" on slide 5. Does anyone use it,
especially for storage?

I think there should be another slide after slide 2, which
drills down at least a little into the ideas that have been
discussed. It's a good place to establish the context of
the presentation - as it is there isn't really any "proposal".
I'm thinking maybe a single bullet for some or all of the
whitepapers at NEPS?

Tom.

At 06:44 PM 2/22/2004, Garth Gibson wrote:
>I understand the sensitivity of IETF to having solutions presented
>instead of problems.
>
>Here is a revision following the recommendations below, with some word
>choices of my own.  Specifically, I'm reluctant to say nothing about
>how NFSv4 is better for fixing this than NFSv3; that is, the definition
>of NFSv4 creates the opportunity for "direct" or "out-of-band" access.
>
>Page 5: the two lines in question,
>
>- NFSv4, relative to NFSv3, has enhanced client side optimizations
>- NFSv4 minor extensions may suffice for incremental functionality
>
>Page 6: last section:
>
>Much interest in exploring NFSv4 extensions to meet scalability needs
>- Extend NFSv4 �delegations� to provide �layout� information to clients
>- Clients use �layout� to directly access storage, avoiding
>single-server bottleneck
>- NFS, SCSI Block, and SCSI Object �layout� formats all discussed
>- Support for multiple �layout� formats desirable (and looks doable)
>
>Dave, how is this?
>
>garth
>
>
>On Feb 22, 2004, at 1:22 PM, Noveck, Dave wrote:
>
>> I have some suggestions on slides 5 and 6.
>>
>> I would drop the line about delegations from this slide.  Unless
>> you come to this from the sorts of discussions we have been having
>> (and thus aren't the critical part of the audience), this is
>> really not going to be understandable.  One problem that we have
>> in presenting this is that if we explain the situation, we wind up
>> having to explain that we think we pretty much know how to do this
>> already and just need the IETF to bless our choice (I'm exaggerating
>> but only some), and that isn't likely to go down very well with a
>> lot of people.
>>
>> I'd express the last sub-bullet in this section as something like:
>>
>>     NFSv4 minor version model a good way to provide incremental
>> extensions
>>
>> which doesn't say that we know pretty much what these are (but it
>> doesn't say we don't :-)
>>
>> As to the last section of slide 6, I'd revise to be something like
>> the following, again to reduce the we-know-how-to-do-this tone.
>>
>>      Much interest in exploring how v4 could be extended to solve this
>>
>>           Extension of delegations to provide "layout" information to
>> clients
>>
>>           Clients use layout information to do IO and avoid
>> single-server bottleneck
>>
>>           NFS, SCSI Block, SCSI Object layout formats all discussed
>>
>>           Support for multiple formats looks desirable (and doable).
>>
>> -----Original Message-----
>> From: Garth Gibson [mailto:garth@panasas.com]
>> Sent: Friday, February 20, 2004 7:22 PM
>> To: pnfs-reqs@yahoogroups.com
>> Subject: Re: [pnfs-reqs] RE: NEPS-REQS: getting started
>>
>>
>> Based on feedback from Brent's concall 8 days ago, here is my cut at
>> Gary's proposal for a short problem statement introduction
>> presentation.
>>
>> garth
>>
>
>
>
>------------------------ Yahoo! Groups Sponsor ---------------------~-->
>Buy Ink Cartridges or Refill Kits for your HP, Epson, Canon or Lexmark
>Printer at MyInks.com. Free s/h on orders $50 or more to the US & Canada.
>http://www.c1tracking.com/l.asp?cid=5511
>http://us.click.yahoo.com/mOAaAA/3exGAA/qnsNAA/W6uqlB/TM
>---------------------------------------------------------------------~->
>
>
>Yahoo! Groups Links
>
><*> To visit your group on the web, go to:
>     http://groups.yahoo.com/group/pnfs-reqs/
>
><*> To unsubscribe from this group, send an email to:
>     pnfs-reqs-unsubscribe@yahoogroups.com
>
><*> Your use of Yahoo! Groups is subject to:
>     http://docs.yahoo.com/info/terms/
>
>
>


From garth@panasas.com Sun Feb 22 17:52:39 2004
Return-Path: <garth@panasas.com>
X-Sender: garth@panasas.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 88387 invoked from network); 23 Feb 2004 01:52:38 -0000
Received: from unknown (66.218.66.167)
by m16.grp.scd.yahoo.com with QMQP; 23 Feb 2004 01:52:38 -0000
Received: from unknown (HELO PIKES.panasas.com) (65.194.124.178)
by mta6.grp.scd.yahoo.com with SMTP; 23 Feb 2004 01:52:37 -0000
Received: from [127.0.0.1] ([172.17.19.3]) by PIKES.panasas.com with SMTP (Microsoft Exchange Internet Mail Service Version 5.5.2653.13)
id FLP29Z15; Sun, 22 Feb 2004 20:52:35 -0500
Mime-Version: 1.0 (Apple Message framework v612)
In-Reply-To: <6.0.3.0.0.20040222194310.01b7da00@silver.nane.netapp.com>
References: <6.0.3.0.0.20040222194310.01b7da00@silver.nane.netapp.com>
Content-Type: multipart/mixed; boundary=Apple-Mail-5-409702016
Message-Id: <F0475BD1-65A2-11D8-B028-000A95A94F04@panasas.com>
Date: Sun, 22 Feb 2004 20:52:29 -0500
To: pnfs-reqs@yahoogroups.com
X-Mailer: Apple Mail (2.612)
X-eGroups-Remote-IP: 65.194.124.178
From: Garth Gibson <garth@panasas.com>
Subject: Re: [pnfs-reqs] RE: NEPS-REQS: getting started
X-Yahoo-Group-Post: member; u=169457820
X-Yahoo-Profile: garth_a_gibson

Okay, I think I'm in tune with all of Dave Noveck's comments and the
first two of these comments from Tom.

The third comment, calling for more description of the ideas that were
in the NEPS workshop, has me confused. I think Dave has been
encouraging that the proposal have less in the way of what the IETF
should do to solve this problem, and I read this request as suggesting
that we have more in the way of proposals for the solution.

I'm certain we can do either, and I think I'd like to hear a little
more direction from the group.

Do we stay with this mostly solution free problem presentation, or do
we add more about layout delegations and other NEPS proposed ideas?

garth


On Feb 22, 2004, at 7:48 PM, Talpey, Thomas wrote:

> What is "EDA" (slide 3)? Spell it out, whatever it is! :-)
>
> I wouldn't include "FCIP" on slide 5. Does anyone use it,
> especially for storage?
>
> I think there should be another slide after slide 2, which
> drills down at least a little into the ideas that have been
> discussed. It's a good place to establish the context of
> the presentation - as it is there isn't really any "proposal".
> I'm thinking maybe a single bullet for some or all of the
> whitepapers at NEPS?
>
> Tom.
>
> At 06:44 PM 2/22/2004, Garth Gibson wrote:
>> I understand the sensitivity of IETF to having solutions presented
>> instead of problems.
>>
>> Here is a revision following the recommendations below, with some word
>> choices of my own. Specifically, I'm reluctant to say nothing about
>> how NFSv4 is better for fixing this than NFSv3; that is, the
>> definition
>
>> of NFSv4 creates the opportunity for "direct" or "out-of-band" access.
>>
>> Page 5: the two lines in question,
>>
>> - NFSv4, relative to NFSv3, has enhanced client side optimizations
>> - NFSv4 minor extensions may suffice for incremental functionality
>>
>> Page 6: last section:
>>
>> Much interest in exploring NFSv4 extensions to meet scalability needs
>> - Extend NFSv4 "delegations" to provide "layout" information to
>> clients
>
>> - Clients use "layout" to directly access storage, avoiding
>> single-server bottleneck
>> - NFS, SCSI Block, and SCSI Object "layout" formats all discussed
>> - Support for multiple "layout" formats desirable (and looks doable)
>>
>> Dave, how is this?
>>
>> garth
>>
>>
>> On Feb 22, 2004, at 1:22 PM, Noveck, Dave wrote:
>>
>>> I have some suggestions on slides 5 and 6.
>>>
>>> I would drop the line about delegations from this slide. Unless
>>> you come to this from the sorts of discussions we have been having
>>> (and thus aren't the critical part of the audience), this is
>>> really not going to be understandable. One problem that we have
>>> in presenting this is that if we explain the situation, we wind up
>>> having to explain that we think we pretty much know how to do this
>>> already and just need the IETF to bless our choice (I'm exaggerating
>>> but only some), and that isn't likely to go down very well with a
>>> lot of people.
>>>
>>> I'd express the last sub-bullet in this section as something like:
>>>
>>> NFSv4 minor version model a good way to provide incremental
>>> extensions
>>>
>>> which doesn't say that we know pretty much what these are (but it
>>> doesn't say we don't :-)
>>>
>>> As to the last section of slide 6, I'd revise to be something like
>>> the following, again to reduce the we-know-how-to-do-this tone.
>>>
>>> Much interest in exploring how v4 could be extended to solve
>>> this
>>>
>>> Extension of delegations to provide "layout" information
>>> to clients
>>>
>>> Clients use layout information to do IO and avoid
>>> single-server bottleneck
>>>
>>> NFS, SCSI Block, SCSI Object layout formats all discussed
>>>
>>> Support for multiple formats looks desirable (and doable).
>>>
>>> -----Original Message-----
>>> From: Garth Gibson [ mailto:garth@panasas.com
>>> <mailto:garth@panasas.com> ]
>>> Sent: Friday, February 20, 2004 7:22 PM
>>> To: pnfs-reqs@yahoogroups.com
>>> Subject: Re: [pnfs-reqs] RE: NEPS-REQS: getting started
>>>
>>>
>>> Based on feedback from Brent's concall 8 days ago, here is my cut at
>>> Gary's proposal for a short problem statement introduction
>>> presentation.
>>>
>>> garth



Attachment (not stored)
pNFS-intro-2-22.2.ppt
Type: application/vnd.ms-powerpoint

From julian_satran@il.ibm.com Mon Feb 23 08:25:56 2004
Return-Path: <Julian_Satran@il.ibm.com>
X-Sender: Julian_Satran@il.ibm.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 50186 invoked from network); 23 Feb 2004 16:25:42 -0000
Received: from unknown (66.218.66.167)
by m16.grp.scd.yahoo.com with QMQP; 23 Feb 2004 16:25:42 -0000
Received: from unknown (HELO mtagate7.uk.ibm.com) (195.212.29.140)
by mta6.grp.scd.yahoo.com with SMTP; 23 Feb 2004 16:25:41 -0000
Received: from d06nrmr1307.portsmouth.uk.ibm.com (d06nrmr1307.portsmouth.uk.ibm.com [9.149.38.129])
by mtagate7.uk.ibm.com (8.12.10/8.12.10) with ESMTP id i1NGPc4n121918
for <pnfs-reqs@yahoogroups.com>; Mon, 23 Feb 2004 16:25:39 GMT
Received: from d12ml102.megacenter.de.ibm.com (d06av02.portsmouth.uk.ibm.com [9.149.37.228])
by d06nrmr1307.portsmouth.uk.ibm.com (8.12.10/NCO/VER6.6) with ESMTP id i1NGPbtS240322
for <pnfs-reqs@yahoogroups.com>; Mon, 23 Feb 2004 16:25:38 GMT
In-Reply-To: <6.0.3.0.0.20040222194310.01b7da00@silver.nane.netapp.com>
To: pnfs-reqs@yahoogroups.com
Cc: pnfs-reqs@yahoogroups.com
MIME-Version: 1.0
X-Mailer: Lotus Notes Release 6.5 September 26, 2003
Message-ID: <OFA025C17B.CD9D488C-ONC2256E43.0055AAEE-C2256E43.005A3C90@il.ibm.com>
Date: Mon, 23 Feb 2004 18:27:36 +0200
X-MIMETrack: Serialize by Router on D12ML102/12/M/IBM(Release 6.0.2CF2|July 23, 2003) at
23/02/2004 18:27:37,
Serialize complete at 23/02/2004 18:27:37
Content-Type: text/plain; charset="UTF-8"
Content-Transfer-Encoding: base64
X-eGroups-Remote-IP: 195.212.29.140
From: Julian Satran <julian_satran@il.ibm.com>
Subject: Re: [pnfs-reqs] RE: NEPS-REQS: getting started
X-Yahoo-Group-Post: member; u=64714603
X-Yahoo-Profile: julian_satran

"Talpey, Thomas" <Thomas.Talpey@netapp.com> wrote on 23/02/2004 02:48:40:

> What is "EDA" (slide 3)? Spell it out, whatever it is! :-)
> I wouldn't include "FCIP" on slide 5. Does anyone use it,
> especially for storage?

And I would include NFS-RDMA - as an NFS extension example that IETF is
doing.

> I think there should be another slide after slide 2, which
> drills down at least a little into the ideas that have been
> discussed. It's a good place to establish the context of
> the presentation - as it is there isn't really any "proposal".
> I'm thinking maybe a single bullet for some or all of the
> whitepapers at NEPS?
> Tom.
> At 06:44 PM 2/22/2004, Garth Gibson wrote:
> >I understand the sensitivity of IETF to having solutions presented
> >instead of problems.
> >
> >Here is a revision following the recommendations below, with some word
> >choices of my own. Specifically, I'm reluctant to say nothing about
> >how NFSv4 is better for fixing this than NFSv3; that is, the definition

> >of NFSv4 creates the opportunity for "direct" or "out-of-band" access.
> >
> >Page 5: the two lines in question,
> >
> >- NFSv4, relative to NFSv3, has enhanced client side optimizations
> >- NFSv4 minor extensions may suffice for incremental functionality
> >
> >Page 6: last section:
> >
> >Much interest in exploring NFSv4 extensions to meet scalability needs
> >- Extend NFSv4 “delegations” to provide “layout” information to clients

> >- Clients use “layout” to directly access storage, avoiding
> >single-server bottleneck
> >- NFS, SCSI Block, and SCSI Object “layout” formats all discussed
> >- Support for multiple “layout” formats desirable (and looks doable)
> >
> >Dave, how is this?
> >
> >garth
> >
> >
> >On Feb 22, 2004, at 1:22 PM, Noveck, Dave wrote:
> >
> >> I have some suggestions on slides 5 and 6.
> >>
> >> I would drop the line about delegations from this slide. Unless
> >> you come to this from the sorts of discussions we have been having
> >> (and thus aren't the critical part of the audience), this is
> >> really not going to be understandable. One problem that we have
> >> in presenting this is that if we explain the situation, we wind up
> >> having to explain that we think we pretty much know how to do this
> >> already and just need the IETF to bless our choice (I'm exaggerating
> >> but only some), and that isn't likely to go down very well with a
> >> lot of people.
> >>
> >> I'd express the last sub-bullet in this section as something like:
> >>
> >> NFSv4 minor version model a good way to provide incremental
> >> extensions
> >>
> >> which doesn't say that we know pretty much what these are (but it
> >> doesn't say we don't :-)
> >>
> >> As to the last section of slide 6, I'd revise to be something like
> >> the following, again to reduce the we-know-how-to-do-this tone.
> >>
> >> Much interest in exploring how v4 could be extended to solve
this
> >>
> >> Extension of delegations to provide "layout" information to

> >> clients
> >>
> >> Clients use layout information to do IO and avoid
> >> single-server bottleneck
> >>
> >> NFS, SCSI Block, SCSI Object layout formats all discussed
> >>
> >> Support for multiple formats looks desirable (and doable).
> >>
> >> -----Original Message-----
> >> From: Garth Gibson [mailto:garth@panasas.com]
> >> Sent: Friday, February 20, 2004 7:22 PM
> >> To: pnfs-reqs@yahoogroups.com
> >> Subject: Re: [pnfs-reqs] RE: NEPS-REQS: getting started
> >>
> >>
> >> Based on feedback from Brent's concall 8 days ago, here is my cut at
> >> Gary's proposal for a short problem statement introduction
> >> presentation.
> >>
> >> garth
> >>
> >
> >
> >
> >------------------------ Yahoo! Groups Sponsor
---------------------~-->
> >Buy Ink Cartridges or Refill Kits for your HP, Epson, Canon or Lexmark
> >Printer at MyInks.com. Free s/h on orders $50 or more to the US &
Canada.
> >http://www.c1tracking.com/l.asp?cid=5511
> >http://us.click.yahoo.com/mOAaAA/3exGAA/qnsNAA/W6uqlB/TM
>
>---------------------------------------------------------------------~->
> >
> >
> >Yahoo! Groups Links
> >
> >
> >
> >
> >
> >
>
> Yahoo! Groups Sponsor
>
> ADVERTISEMENT
>
> [image removed]
>
>
> Yahoo! Groups Links
> To visit your group on the web, go to:
> http://groups.yahoo.com/group/pnfs-reqs/
>
> To unsubscribe from this group, send an email to:
> pnfs-reqs-unsubscribe@yahoogroups.com
>
> Your use of Yahoo! Groups is subject to the Yahoo! Terms of Service.

From julian_satran@il.ibm.com Mon Feb 23 08:26:01 2004
Return-Path: <Julian_Satran@il.ibm.com>
X-Sender: Julian_Satran@il.ibm.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 50171 invoked from network); 23 Feb 2004 16:25:42 -0000
Received: from unknown (66.218.66.216)
by m16.grp.scd.yahoo.com with QMQP; 23 Feb 2004 16:25:42 -0000
Received: from unknown (HELO mtagate5.uk.ibm.com) (195.212.29.138)
by mta1.grp.scd.yahoo.com with SMTP; 23 Feb 2004 16:25:40 -0000
Received: from d06nrmr1307.portsmouth.uk.ibm.com (d06nrmr1307.portsmouth.uk.ibm.com [9.149.38.129])
by mtagate5.uk.ibm.com (8.12.10/8.12.10) with ESMTP id i1NGPcWH026470
for <pnfs-reqs@yahoogroups.com>; Mon, 23 Feb 2004 16:25:38 GMT
Received: from d12ml102.megacenter.de.ibm.com (d06av02.portsmouth.uk.ibm.com [9.149.37.228])
by d06nrmr1307.portsmouth.uk.ibm.com (8.12.10/NCO/VER6.6) with ESMTP id i1NGPbtT240322
for <pnfs-reqs@yahoogroups.com>; Mon, 23 Feb 2004 16:25:38 GMT
In-Reply-To: <F0475BD1-65A2-11D8-B028-000A95A94F04@panasas.com>
To: pnfs-reqs@yahoogroups.com
Cc: pnfs-reqs@yahoogroups.com
MIME-Version: 1.0
X-Mailer: Lotus Notes Release 6.5 September 26, 2003
Message-ID: <OF1D6D338A.5D06ADD3-ONC2256E43.00563DF8-C2256E43.005A3CA9@il.ibm.com>
Date: Mon, 23 Feb 2004 18:27:36 +0200
X-MIMETrack: Serialize by Router on D12ML102/12/M/IBM(Release 6.0.2CF2|July 23, 2003) at
23/02/2004 18:27:38
Content-Type: multipart/mixed; boundary="=_mixed 00567A40C2256E43_="
X-eGroups-Remote-IP: 195.212.29.138
From: Julian Satran <julian_satran@il.ibm.com>
Subject: Re: [pnfs-reqs] RE: NEPS-REQS: getting started
X-Yahoo-Group-Post: member; u=64714603
X-Yahoo-Profile: julian_satran

The only think fair to say (I think) is that there are initial ideas.
Otherwise the community may not like that you inted to push what they'll
perceive as a done deal.
So it is fair to mention things like delegations - but beyond "solutions
could be based on".

Julo



Garth Gibson <garth@panasas.com>
23/02/2004 03:52
Please respond to
pnfs-reqs


To
pnfs-reqs@yahoogroups.com
cc

Subject
Re: [pnfs-reqs] RE: NEPS-REQS: getting started






Okay, I think I'm in tune with all of Dave Noveck's comments and the
first two of these comments from Tom.

The third comment, calling for more description of the ideas that were
in the NEPS workshop, has me confused. I think Dave has been
encouraging that the proposal have less in the way of what the IETF
should do to solve this problem, and I read this request as suggesting
that we have more in the way of proposals for the solution.

I'm certain we can do either, and I think I'd like to hear a little
more direction from the group.

Do we stay with this mostly solution free problem presentation, or do
we add more about layout delegations and other NEPS proposed ideas?

garth


On Feb 22, 2004, at 7:48 PM, Talpey, Thomas wrote:

> What is "EDA" (slide 3)? Spell it out, whatever it is! :-)
>
> I wouldn't include "FCIP" on slide 5. Does anyone use it,
> especially for storage?
>
> I think there should be another slide after slide 2, which
> drills down at least a little into the ideas that have been
> discussed. It's a good place to establish the context of
> the presentation - as it is there isn't really any "proposal".
> I'm thinking maybe a single bullet for some or all of the
> whitepapers at NEPS?
>
> Tom.
>
> At 06:44 PM 2/22/2004, Garth Gibson wrote:
>> I understand the sensitivity of IETF to having solutions presented
>> instead of problems.
>>
>> Here is a revision following the recommendations below, with some word
>> choices of my own. Specifically, I'm reluctant to say nothing about
>> how NFSv4 is better for fixing this than NFSv3; that is, the
>> definition
>
>> of NFSv4 creates the opportunity for "direct" or "out-of-band" access.
>>
>> Page 5: the two lines in question,
>>
>> - NFSv4, relative to NFSv3, has enhanced client side optimizations
>> - NFSv4 minor extensions may suffice for incremental functionality
>>
>> Page 6: last section:
>>
>> Much interest in exploring NFSv4 extensions to meet scalability needs
>> - Extend NFSv4 "delegations" to provide "layout" information to
>> clients
>
>> - Clients use "layout" to directly access storage, avoiding
>> single-server bottleneck
>> - NFS, SCSI Block, and SCSI Object "layout" formats all discussed
>> - Support for multiple "layout" formats desirable (and looks doable)
>>
>> Dave, how is this?
>>
>> garth
>>
>>
>> On Feb 22, 2004, at 1:22 PM, Noveck, Dave wrote:
>>
>>> I have some suggestions on slides 5 and 6.
>>>
>>> I would drop the line about delegations from this slide. Unless
>>> you come to this from the sorts of discussions we have been having
>>> (and thus aren't the critical part of the audience), this is
>>> really not going to be understandable. One problem that we have
>>> in presenting this is that if we explain the situation, we wind up
>>> having to explain that we think we pretty much know how to do this
>>> already and just need the IETF to bless our choice (I'm exaggerating
>>> but only some), and that isn't likely to go down very well with a
>>> lot of people.
>>>
>>> I'd express the last sub-bullet in this section as something like:
>>>
>>> NFSv4 minor version model a good way to provide incremental
>>> extensions
>>>
>>> which doesn't say that we know pretty much what these are (but it
>>> doesn't say we don't :-)
>>>
>>> As to the last section of slide 6, I'd revise to be something like
>>> the following, again to reduce the we-know-how-to-do-this tone.
>>>
>>> Much interest in exploring how v4 could be extended to solve
>>> this
>>>
>>> Extension of delegations to provide "layout" information
>>> to clients
>>>
>>> Clients use layout information to do IO and avoid
>>> single-server bottleneck
>>>
>>> NFS, SCSI Block, SCSI Object layout formats all discussed
>>>
>>> Support for multiple formats looks desirable (and doable).
>>>
>>> -----Original Message-----
>>> From: Garth Gibson [ mailto:garth@panasas.com
>>> <mailto:garth@panasas.com> ]
>>> Sent: Friday, February 20, 2004 7:22 PM
>>> To: pnfs-reqs@yahoogroups.com
>>> Subject: Re: [pnfs-reqs] RE: NEPS-REQS: getting started
>>>
>>>
>>> Based on feedback from Brent's concall 8 days ago, here is my cut at
>>> Gary's proposal for a short problem statement introduction
>>> presentation.
>>>
>>> garth





Yahoo! Groups Links








Attachment (not stored)
pNFS-intro-2-22.2.ppt
Type: application/octet-stream

From mclarty3@llnl.gov Mon Feb 23 11:15:50 2004
Return-Path: <mclarty3@llnl.gov>
X-Sender: mclarty3@llnl.gov
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 52894 invoked from network); 23 Feb 2004 19:15:47 -0000
Received: from unknown (66.218.66.172)
by m12.grp.scd.yahoo.com with QMQP; 23 Feb 2004 19:15:47 -0000
Received: from unknown (HELO smtp-4.llnl.gov) (128.115.41.84)
by mta4.grp.scd.yahoo.com with SMTP; 23 Feb 2004 19:15:47 -0000
Received: from poptop.llnl.gov (localhost [127.0.0.1])
by smtp-4.llnl.gov (8.12.3p2-20030917/8.12.3/LLNL evision: 1.13 $) with ESMTP id i1NJETNU015799;
Mon, 23 Feb 2004 11:15:46 -0800 (PST)
Received: from POLARBEAR.llnl.gov ([134.9.18.59] verified)
by poptop.llnl.gov (CommuniGate Pro SMTP 4.0.6)
with ESMTP id 36870990; Mon, 23 Feb 2004 11:15:23 -0800
Message-Id: <5.0.0.25.2.20040223110657.027281a0@poptop.llnl.gov>
X-Sender: e002801@poptop.llnl.gov
X-Mailer: QUALCOMM Windows Eudora Version 5.0
Date: Mon, 23 Feb 2004 11:15:22 -0800
To: pnfs-reqs@yahoogroups.com, pnfs-reqs@yahoogroups.com
In-Reply-To: <F0475BD1-65A2-11D8-B028-000A95A94F04@panasas.com>
References: <6.0.3.0.0.20040222194310.01b7da00@silver.nane.netapp.com>
<6.0.3.0.0.20040222194310.01b7da00@silver.nane.netapp.com>
Mime-Version: 1.0
Content-Type: text/plain; charset="us-ascii"; format=flowed
X-eGroups-Remote-IP: 128.115.41.84
From: Tyce McLarty <mclarty3@llnl.gov>
Subject: Re: [pnfs-reqs] RE: NEPS-REQS: getting started
X-Yahoo-Group-Post: member; u=169320772
X-Yahoo-Profile: mclarty3

ADVERTISEMENT
At 08:52 PM 2/22/2004 -0500, Garth Gibson wrote:
>Okay, I think I'm in tune with all of Dave Noveck's comments and the
>first two of these comments from Tom.
>
>The third comment, calling for more description of the ideas that were
>in the NEPS workshop, has me confused. I think Dave has been
>encouraging that the proposal have less in the way of what the IETF
>should do to solve this problem, and I read this request as suggesting
>that we have more in the way of proposals for the solution.
>
>I'm certain we can do either, and I think I'd like to hear a little
>more direction from the group.
>
>Do we stay with this mostly solution free problem presentation, or do
>we add more about layout delegations and other NEPS proposed ideas?

I think it depends on how we intend to use the "presentation". I expect two
scenarios, neither of which involves the IETF:

1. I catch a higher level manager or VIP in the hall for a couple minutes:
He gets slide 2 first. If he shows any interest at all give him the rest of
slide 7. That's all he needs to know and he will almost surely lose
interest if I try to tell him more.

2. Discussion with technical peer: Again start with slide 2 information.
Expect that he will show enough interest to want it all. Have to use some
judgement here.

In case 2, I think being ready with what we are thinking about for a
solution is on target. A technical type will find supporting an problem
areas that we really want to find out about.

You asked for opinions.

Tyce


>garth
>
>
>On Feb 22, 2004, at 7:48 PM, Talpey, Thomas wrote:
>
> > What is "EDA" (slide 3)? Spell it out, whatever it is! :-)
> >
> > I wouldn't include "FCIP" on slide 5. Does anyone use it,
> > especially for storage?
> >
> > I think there should be another slide after slide 2, which
> > drills down at least a little into the ideas that have been
> > discussed. It's a good place to establish the context of
> > the presentation - as it is there isn't really any "proposal".
> > I'm thinking maybe a single bullet for some or all of the
> > whitepapers at NEPS?
> >
> > Tom.
> >
> > At 06:44 PM 2/22/2004, Garth Gibson wrote:
> >> I understand the sensitivity of IETF to having solutions presented
> >> instead of problems.
> >>
> >> Here is a revision following the recommendations below, with some word
> >> choices of my own. Specifically, I'm reluctant to say nothing about
> >> how NFSv4 is better for fixing this than NFSv3; that is, the
> >> definition
> >
> >> of NFSv4 creates the opportunity for "direct" or "out-of-band" access.
> >>
> >> Page 5: the two lines in question,
> >>
> >> - NFSv4, relative to NFSv3, has enhanced client side optimizations
> >> - NFSv4 minor extensions may suffice for incremental functionality
> >>
> >> Page 6: last section:
> >>
> >> Much interest in exploring NFSv4 extensions to meet scalability needs
> >> - Extend NFSv4 "delegations" to provide "layout" information to
> >> clients
> >
> >> - Clients use "layout" to directly access storage, avoiding
> >> single-server bottleneck
> >> - NFS, SCSI Block, and SCSI Object "layout" formats all discussed
> >> - Support for multiple "layout" formats desirable (and looks doable)
> >>
> >> Dave, how is this?
> >>
> >> garth
> >>
> >>
> >> On Feb 22, 2004, at 1:22 PM, Noveck, Dave wrote:
> >>
> >>> I have some suggestions on slides 5 and 6.
> >>>
> >>> I would drop the line about delegations from this slide. Unless
> >>> you come to this from the sorts of discussions we have been having
> >>> (and thus aren't the critical part of the audience), this is
> >>> really not going to be understandable. One problem that we have
> >>> in presenting this is that if we explain the situation, we wind up
> >>> having to explain that we think we pretty much know how to do this
> >>> already and just need the IETF to bless our choice (I'm exaggerating
> >>> but only some), and that isn't likely to go down very well with a
> >>> lot of people.
> >>>
> >>> I'd express the last sub-bullet in this section as something like:
> >>>
> >>> NFSv4 minor version model a good way to provide incremental
> >>> extensions
> >>>
> >>> which doesn't say that we know pretty much what these are (but it
> >>> doesn't say we don't :-)
> >>>
> >>> As to the last section of slide 6, I'd revise to be something like
> >>> the following, again to reduce the we-know-how-to-do-this tone.
> >>>
> >>> Much interest in exploring how v4 could be extended to solve
> >>> this
> >>>
> >>> Extension of delegations to provide "layout" information
> >>> to clients
> >>>
> >>> Clients use layout information to do IO and avoid
> >>> single-server bottleneck
> >>>
> >>> NFS, SCSI Block, SCSI Object layout formats all discussed
> >>>
> >>> Support for multiple formats looks desirable (and doable).
> >>>
> >>> -----Original Message-----
> >>> From: Garth Gibson [ mailto:garth@panasas.com
> >>> <mailto:garth@panasas.com> ]
> >>> Sent: Friday, February 20, 2004 7:22 PM
> >>> To: pnfs-reqs@yahoogroups.com
> >>> Subject: Re: [pnfs-reqs] RE: NEPS-REQS: getting started
> >>>
> >>>
> >>> Based on feedback from Brent's concall 8 days ago, here is my cut at
> >>> Gary's proposal for a short problem statement introduction
> >>> presentation.
> >>>
> >>> garth
>
>
>
>
>
>Yahoo! Groups Links
>
>
>
>
>
>

From Thomas.Talpey@netapp.com Mon Feb 23 12:00:05 2004
Return-Path: <Thomas.Talpey@netapp.com>
X-Sender: Thomas.Talpey@netapp.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 2658 invoked from network); 23 Feb 2004 20:00:02 -0000
Received: from unknown (66.218.66.172)
by m7.grp.scd.yahoo.com with QMQP; 23 Feb 2004 20:00:02 -0000
Received: from unknown (HELO mx01.netapp.com) (198.95.226.53)
by mta4.grp.scd.yahoo.com with SMTP; 23 Feb 2004 20:00:01 -0000
Received: from hawk.corp.netapp.com (hawk [10.57.156.122])
by mx01.netapp.com (8.12.10/8.12.10/NTAP-1.4) with ESMTP id i1NK00JC014888
for <pnfs-reqs@yahoogroups.com>; Mon, 23 Feb 2004 12:00:00 -0800 (PST)
Received: from svlexc01.hq.netapp.com (svlexc01.corp.netapp.com [10.10.22.171])
by hawk.corp.netapp.com (8.12.9/8.12.9/NTAP-1.5) with ESMTP id i1NK00DU004861
for <pnfs-reqs@yahoogroups.com>; Mon, 23 Feb 2004 12:00:00 -0800 (PST)
Received: from tmt.netapp.com ([10.97.6.37]) by silver.hq.netapp.com with Microsoft SMTPSVC(5.0.2195.5329); Mon, 23 Feb 2004 14:59:58 -0500
MIME-Version: 1.0
Content-Type: multipart/alternative;
boundary="----_=_NextPart_001_01C3FA47.9CFDB300"
X-MimeOLE: Produced By Microsoft Exchange V6.0.6249.0
content-class: urn:content-classes:message
Date: Mon, 23 Feb 2004 11:59:51 -0800
Message-ID: <6.0.3.0.0.20040223145154.01c24c68@silver.nane.netapp.com>
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
Thread-Topic: [pnfs-reqs] RE: NEPS-REQS: getting started
Thread-Index: AcP6R504DgBYzoRWQRWi2hKoiYvDNQ==
To: <pnfs-reqs@yahoogroups.com>
X-eGroups-Remote-IP: 198.95.226.53
From: "Talpey, Thomas" <Thomas.Talpey@netapp.com>
Subject: Re: [pnfs-reqs] RE: NEPS-REQS: getting started
X-Yahoo-Group-Post: member; u=44154239
X-Yahoo-Profile: tmtymailu

ADVERTISEMENT
click here

Garth - sorry for the delay in responding. Connectathon calls.

I don't mean to suggest presenting the NEPS ideas as task
items. What I did mean was to use them to set some context,
and show come diverse interests which pNFS is in the process
of bringing together under the NFSv4 tent.

The reason to bring them up early in the talk is because it's
very abstract without them. Most folks are seeing this for the
first time, they're going to want to know roughly what it's all
about. It's an important message to the IETF as well - there
are multiple viewpoints, there is discussion, there is unified
desire to move it to the open IETF forum.

Definitely don't drill down on them - just the basic components
of each. At least, this is my idea - comments?

Tom.
 
At 08:52 PM 2/22/2004, Garth Gibson wrote:
>Okay, I think I'm in tune with all of Dave Noveck's comments and the
>first two of these comments from Tom.
>
>The third comment, calling for more description of the ideas that were
>in the NEPS workshop, has me confused.  I think Dave has been
>encouraging that the proposal have less in the way of what the IETF
>should do to solve this problem, and I read this request as suggesting
>that we have more in the way of proposals for the solution.
>
>I'm certain we can do either, and I think I'd like to hear a little
>more direction from the group.
>
>Do we stay with this mostly solution free problem presentation, or do
>we add more about layout delegations and other NEPS proposed ideas?
>
>garth
>
>
>On Feb 22, 2004, at 7:48 PM, Talpey, Thomas wrote:
>
>> What is "EDA" (slide 3)? Spell it out, whatever it is! :-)
>>
>> I wouldn't include "FCIP" on slide 5. Does anyone use it,
>> especially for storage?
>>
>> I think there should be another slide after slide 2, which
>> drills down at least a little into the ideas that have been
>> discussed. It's a good place to establish the context of
>> the presentation - as it is there isn't really any "proposal".
>> I'm thinking maybe a single bullet for some or all of the
>> whitepapers at NEPS?
>>
>> Tom.
>>
>> At 06:44 PM 2/22/2004, Garth Gibson wrote:
>>> I understand the sensitivity of IETF to having solutions presented
>>> instead of problems.
>>>
>>> Here is a revision following the recommendations below, with some word
>>> choices of my own.  Specifically, I'm reluctant to say nothing about
>>> how NFSv4 is better for fixing this than NFSv3; that is, the
>>> definition
>>
>>> of NFSv4 creates the opportunity for "direct" or "out-of-band" access.
>>>
>>> Page 5: the two lines in question,
>>>
>>> - NFSv4, relative to NFSv3, has enhanced client side optimizations
>>> - NFSv4 minor extensions may suffice for incremental functionality
>>>
>>> Page 6: last section:
>>>
>>> Much interest in exploring NFSv4 extensions to meet scalability needs
>>> - Extend NFSv4 "delegations" to provide "layout" information to
>>> clients
>>
>>> - Clients use "layout" to directly access storage, avoiding
>>> single-server bottleneck
>>> - NFS, SCSI Block, and SCSI Object "layout" formats all discussed
>>> - Support for multiple "layout" formats desirable (and looks doable)
>>>
>>> Dave, how is this?
>>>
>>> garth
>>>
>>>
>>> On Feb 22, 2004, at 1:22 PM, Noveck, Dave wrote:
>>>
>>>> I have some suggestions on slides 5 and 6.
>>>>
>>>> I would drop the line about delegations from this slide.  Unless
>>>> you come to this from the sorts of discussions we have been having
>>>> (and thus aren't the critical part of the audience), this is
>>>> really not going to be understandable.  One problem that we have
>>>> in presenting this is that if we explain the situation, we wind up
>>>> having to explain that we think we pretty much know how to do this
>>>> already and just need the IETF to bless our choice (I'm exaggerating
>>>> but only some), and that isn't likely to go down very well with a
>>>> lot of people.
>>>>
>>>> I'd express the last sub-bullet in this section as something like:
>>>>
>>>>     NFSv4 minor version model a good way to provide incremental
>>>> extensions
>>>>
>>>> which doesn't say that we know pretty much what these are (but it
>>>> doesn't say we don't :-)
>>>>
>>>> As to the last section of slide 6, I'd revise to be something like
>>>> the following, again to reduce the we-know-how-to-do-this tone.
>>>>
>>>>      Much interest in exploring how v4 could be extended to solve
>>>> this
>>>>
>>>>           Extension of delegations to provide "layout" information
>>>> to clients
>>>>
>>>>           Clients use layout information to do IO and avoid
>>>> single-server bottleneck
>>>>
>>>>           NFS, SCSI Block, SCSI Object layout formats all discussed
>>>>
>>>>           Support for multiple formats looks desirable (and doable).
>>>>
>>>> -----Original Message-----
>>>> From: Garth Gibson [ mailto:garth@panasas.com
>>>> <mailto:garth@panasas.com> ]
>>>> Sent: Friday, February 20, 2004 7:22 PM
>>>> To: pnfs-reqs@yahoogroups.com
>>>> Subject: Re: [pnfs-reqs] RE: NEPS-REQS: getting started
>>>>
>>>>
>>>> Based on feedback from Brent's concall 8 days ago, here is my cut at
>>>> Gary's proposal for a short problem statement introduction
>>>> presentation.
>>>>
>>>> garth
>
>
>
>------------------------ Yahoo! Groups Sponsor ---------------------~-->
>Buy Ink Cartridges or Refill Kits for your HP, Epson, Canon or Lexmark
>Printer at MyInks.com. Free s/h on orders $50 or more to the US & Canada.
>http://www.c1tracking.com/l.asp?cid=5511
>http://us.click.yahoo.com/mOAaAA/3exGAA/qnsNAA/W6uqlB/TM
>---------------------------------------------------------------------~->
>
>
>Yahoo! Groups Links
>
><*> To visit your group on the web, go to:
>     http://groups.yahoo.com/group/pnfs-reqs/
>
><*> To unsubscribe from this group, send an email to:
>     pnfs-reqs-unsubscribe@yahoogroups.com
>
><*> Your use of Yahoo! Groups is subject to:
>     http://docs.yahoo.com/info/terms/
>
>
> 

From garth@panasas.com Thu Feb 26 08:07:43 2004
Return-Path: <garth@panasas.com>
X-Sender: garth@panasas.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 48095 invoked from network); 26 Feb 2004 16:07:13 -0000
Received: from unknown (66.218.66.167)
by m14.grp.scd.yahoo.com with QMQP; 26 Feb 2004 16:07:13 -0000
Received: from unknown (HELO PIKES.panasas.com) (65.194.124.178)
by mta6.grp.scd.yahoo.com with SMTP; 26 Feb 2004 16:07:12 -0000
Received: from [127.0.0.1] ([172.17.19.3]) by PIKES.panasas.com with SMTP (Microsoft Exchange Internet Mail Service Version 5.5.2653.13)
id FWFX67WT; Thu, 26 Feb 2004 11:07:11 -0500
Mime-Version: 1.0 (Apple Message framework v612)
Content-Transfer-Encoding: 7bit
Message-Id: <D27EE7B0-6875-11D8-A762-000A95A94F04@panasas.com>
Content-Type: text/plain
To: pnfs-reqs@yahoogroups.com
Date: Thu, 26 Feb 2004 11:07:05 -0500
X-Mailer: Apple Mail (2.612)
X-eGroups-Remote-IP: 65.194.124.178
From: Garth Gibson <garth@panasas.com>
Subject: concall now to finalize problem statement slides for Seoul if possible
X-Yahoo-Group-Post: member; u=169457820
X-Yahoo-Profile: garth_a_gibson


From Thomas.Talpey@netapp.com Thu Feb 26 14:32:10 2004
Return-Path: <Thomas.Talpey@netapp.com>
X-Sender: Thomas.Talpey@netapp.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 10494 invoked from network); 26 Feb 2004 22:32:08 -0000
Received: from unknown (66.218.66.166)
by m16.grp.scd.yahoo.com with QMQP; 26 Feb 2004 22:32:08 -0000
Received: from unknown (HELO mx01.netapp.com) (198.95.226.53)
by mta5.grp.scd.yahoo.com with SMTP; 26 Feb 2004 22:32:08 -0000
Received: from hawk.corp.netapp.com (hawk [10.57.156.122])
by mx01.netapp.com (8.12.10/8.12.10/NTAP-1.4) with ESMTP id i1QMW8JC015683
for <pnfs-reqs@yahoogroups.com>; Thu, 26 Feb 2004 14:32:08 -0800 (PST)
Received: from svlexc01.hq.netapp.com (svlexc01.corp.netapp.com [10.10.22.171])
by hawk.corp.netapp.com (8.12.9/8.12.9/NTAP-1.5) with ESMTP id i1QMVlDY029216
for <pnfs-reqs@yahoogroups.com>; Thu, 26 Feb 2004 14:32:07 -0800 (PST)
Received: from tmt.netapp.com ([10.97.6.35]) by silver.hq.netapp.com with Microsoft SMTPSVC(5.0.2195.5329); Thu, 26 Feb 2004 17:31:43 -0500
MIME-Version: 1.0
Content-Type: multipart/alternative;
boundary="----_=_NextPart_001_01C3FCB8.4F3BB180"
X-MimeOLE: Produced By Microsoft Exchange V6.0.6249.0
content-class: urn:content-classes:message
Date: Thu, 26 Feb 2004 14:31:38 -0800
Message-ID: <6.0.3.0.2.20040226172850.01f42ec0@silver.nane.netapp.com>
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
Thread-Topic: [pnfs-reqs] concall now to finalize problem statement slides for Seoul if possible
Thread-Index: AcP8uE/Rtfws08/2T2CcDBLrAGtqlA==
To: <pnfs-reqs@yahoogroups.com>
X-eGroups-Remote-IP: 198.95.226.53
From: "Talpey, Thomas" <Thomas.Talpey@netapp.com>
Subject: Re: [pnfs-reqs] concall now to finalize problem statement slides for Seoul if possible
X-Yahoo-Group-Post: member; u=44154239
X-Yahoo-Profile: tmtymailu

Garth - sorry I couldn't join the call from the Far East.
Could you please send the minutes and the latest
version so I can comment? I'm ready to deliver the
presentation, in any case.

Tom.

At 11:07 AM 2/26/2004, Garth Gibson wrote:
>
>
>
>
>Yahoo! Groups Links
>
><*> To visit your group on the web, go to:
>     http://groups.yahoo.com/group/pnfs-reqs/
>
><*> To unsubscribe from this group, send an email to:
>     pnfs-reqs-unsubscribe@yahoogroups.com
>
><*> Your use of Yahoo! Groups is subject to:
>     http://docs.yahoo.com/info/terms/
> 

From garth@panasas.com Thu Feb 26 18:36:42 2004
Return-Path: <garth@panasas.com>
X-Sender: garth@panasas.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 83484 invoked from network); 27 Feb 2004 02:36:42 -0000
Received: from unknown (66.218.66.172)
by m10.grp.scd.yahoo.com with QMQP; 27 Feb 2004 02:36:42 -0000
Received: from unknown (HELO PIKES.panasas.com) (65.194.124.178)
by mta4.grp.scd.yahoo.com with SMTP; 27 Feb 2004 02:36:41 -0000
Received: from [127.0.0.1] ([172.17.19.3]) by PIKES.panasas.com with SMTP (Microsoft Exchange Internet Mail Service Version 5.5.2653.13)
id FWFX6066; Thu, 26 Feb 2004 21:36:39 -0500
Mime-Version: 1.0 (Apple Message framework v612)
In-Reply-To: <6.0.3.0.2.20040226172850.01f42ec0@silver.nane.netapp.com>
References: <6.0.3.0.2.20040226172850.01f42ec0@silver.nane.netapp.com>
Content-Type: multipart/mixed; boundary=Apple-Mail-37-757939893
Message-Id: <BE3A6E2E-68CD-11D8-A762-000A95A94F04@panasas.com>
Date: Thu, 26 Feb 2004 21:36:27 -0500
To: pnfs-reqs@yahoogroups.com
X-Mailer: Apple Mail (2.612)
X-eGroups-Remote-IP: 65.194.124.178
From: Garth Gibson <garth@panasas.com>
Subject: Re: [pnfs-reqs] concall now to finalize problem statement slides for Seoul if possible
X-Yahoo-Group-Post: member; u=169457820
X-Yahoo-Profile: garth_a_gibson

Tom and others

In this mornings call Julian Satran, Peter Corbett, Benny Halevy and I
discussed how to give Tom something more concrete to cut the level of
abstraction without giving the appearance that we are providing a
solution and not describing a problem.

Here is a new draft, unchanged except that I added a slide after slide
2 that shows a "Now vs Goal" pair of diagrams of storage, NFS server
and client and says "Now: requested data moves through NFS server" and
"Goal: reply from NFS server enables parallel access to storage
servers."

Tom. Use it or not. Modify it as needed. You are the speaker :-)

garth


On Feb 26, 2004, at 5:31 PM, Talpey, Thomas wrote:
> Garth - sorry I couldn't join the call from the Far East.
> Could you please send the minutes and the latest
> version so I can comment? I'm ready to deliver the
> presentation, in any case.
>
> Tom.




Attachment (not stored)
pNFS-intro-2-26.ppt
Type: application/vnd.ms-powerpoint

From garth@panasas.com Thu Feb 26 18:57:52 2004
Return-Path: <garth@panasas.com>
X-Sender: garth@panasas.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 71470 invoked from network); 27 Feb 2004 02:57:51 -0000
Received: from unknown (66.218.66.166)
by m17.grp.scd.yahoo.com with QMQP; 27 Feb 2004 02:57:51 -0000
Received: from unknown (HELO PIKES.panasas.com) (65.194.124.178)
by mta5.grp.scd.yahoo.com with SMTP; 27 Feb 2004 02:57:51 -0000
Received: from [127.0.0.1] ([172.17.19.3]) by PIKES.panasas.com with SMTP (Microsoft Exchange Internet Mail Service Version 5.5.2653.13)
id FWFX609T; Thu, 26 Feb 2004 21:57:49 -0500
Mime-Version: 1.0 (Apple Message framework v612)
In-Reply-To: <5.0.0.25.2.20040223110657.027281a0@poptop.llnl.gov>
References: <6.0.3.0.0.20040222194310.01b7da00@silver.nane.netapp.com> <6.0.3.0.0.20040222194310.01b7da00@silver.nane.netapp.com> <5.0.0.25.2.20040223110657.027281a0@poptop.llnl.gov>
Content-Type: text/plain; charset=US-ASCII; delsp=yes; format=flowed
Message-Id: <B5B07318-68D0-11D8-A762-000A95A94F04@panasas.com>
Content-Transfer-Encoding: 7bit
Date: Thu, 26 Feb 2004 21:57:41 -0500
To: pnfs-reqs@yahoogroups.com
X-Mailer: Apple Mail (2.612)
X-eGroups-Remote-IP: 65.194.124.178
From: Garth Gibson <garth@panasas.com>
Subject: Re: [pnfs-reqs] RE: NEPS-REQS: getting started
X-Yahoo-Group-Post: member; u=169457820
X-Yahoo-Profile: garth_a_gibson

ADVERTISEMENT
Thanks Tyce. It is good that these slides are useful for internal and
technical conversations as well as the IETF. I hope the additional
slide helps with the technical conversations too.

garth

On Feb 23, 2004, at 2:15 PM, Tyce McLarty wrote:

> At 08:52 PM 2/22/2004 -0500, Garth Gibson wrote:
>> Okay, I think I'm in tune with all of Dave Noveck's comments and the
>> first two of these comments from Tom.
>>
>> The third comment, calling for more description of the ideas that were
>> in the NEPS workshop, has me confused. I think Dave has been
>> encouraging that the proposal have less in the way of what the IETF
>> should do to solve this problem, and I read this request as suggesting
>> that we have more in the way of proposals for the solution.
>>
>> I'm certain we can do either, and I think I'd like to hear a little
>> more direction from the group.
>>
>> Do we stay with this mostly solution free problem presentation, or do
>> we add more about layout delegations and other NEPS proposed ideas?
>
> I think it depends on how we intend to use the "presentation". I
> expect two
> scenarios, neither of which involves the IETF:
>
> 1. I catch a higher level manager or VIP in the hall for a couple
> minutes:
> He gets slide 2 first. If he shows any interest at all give him the
> rest of
> slide 7. That's all he needs to know and he will almost surely lose
> interest if I try to tell him more.
>
> 2. Discussion with technical peer: Again start with slide 2
> information.
> Expect that he will show enough interest to want it all. Have to use
> some
> judgement here.
>
> In case 2, I think being ready with what we are thinking about for a
> solution is on target. A technical type will find supporting an problem
> areas that we really want to find out about.
>
> You asked for opinions.
>
> Tyce
>
>
>> garth
>>
>>
>> On Feb 22, 2004, at 7:48 PM, Talpey, Thomas wrote:
>>
>>> What is "EDA" (slide 3)? Spell it out, whatever it is! :-)
>>>
>>> I wouldn't include "FCIP" on slide 5. Does anyone use it,
>>> especially for storage?
>>>
>>> I think there should be another slide after slide 2, which
>>> drills down at least a little into the ideas that have been
>>> discussed. It's a good place to establish the context of
>>> the presentation - as it is there isn't really any "proposal".
>>> I'm thinking maybe a single bullet for some or all of the
>>> whitepapers at NEPS?
>>>
>>> Tom.
>>>
>>> At 06:44 PM 2/22/2004, Garth Gibson wrote:
>>>> I understand the sensitivity of IETF to having solutions presented
>>>> instead of problems.
>>>>
>>>> Here is a revision following the recommendations below, with some
>>>> word
>>>> choices of my own. Specifically, I'm reluctant to say nothing about
>>>> how NFSv4 is better for fixing this than NFSv3; that is, the
>>>> definition
>>>
>>>> of NFSv4 creates the opportunity for "direct" or "out-of-band"
>>>> access.
>>>>
>>>> Page 5: the two lines in question,
>>>>
>>>> - NFSv4, relative to NFSv3, has enhanced client side optimizations
>>>> - NFSv4 minor extensions may suffice for incremental functionality
>>>>
>>>> Page 6: last section:
>>>>
>>>> Much interest in exploring NFSv4 extensions to meet scalability
>>>> needs
>>>> - Extend NFSv4 "delegations" to provide "layout" information to
>>>> clients
>>>
>>>> - Clients use "layout" to directly access storage, avoiding
>>>> single-server bottleneck
>>>> - NFS, SCSI Block, and SCSI Object "layout" formats all discussed
>>>> - Support for multiple "layout" formats desirable (and looks doable)
>>>>
>>>> Dave, how is this?
>>>>
>>>> garth
>>>>
>>>>
>>>> On Feb 22, 2004, at 1:22 PM, Noveck, Dave wrote:
>>>>
>>>>> I have some suggestions on slides 5 and 6.
>>>>>
>>>>> I would drop the line about delegations from this slide. Unless
>>>>> you come to this from the sorts of discussions we have been having
>>>>> (and thus aren't the critical part of the audience), this is
>>>>> really not going to be understandable. One problem that we have
>>>>> in presenting this is that if we explain the situation, we wind up
>>>>> having to explain that we think we pretty much know how to do this
>>>>> already and just need the IETF to bless our choice (I'm
>>>>> exaggerating
>>>>> but only some), and that isn't likely to go down very well with a
>>>>> lot of people.
>>>>>
>>>>> I'd express the last sub-bullet in this section as something like:
>>>>>
>>>>> NFSv4 minor version model a good way to provide incremental
>>>>> extensions
>>>>>
>>>>> which doesn't say that we know pretty much what these are (but it
>>>>> doesn't say we don't :-)
>>>>>
>>>>> As to the last section of slide 6, I'd revise to be something like
>>>>> the following, again to reduce the we-know-how-to-do-this tone.
>>>>>
>>>>> Much interest in exploring how v4 could be extended to solve
>>>>> this
>>>>>
>>>>> Extension of delegations to provide "layout" information
>>>>> to clients
>>>>>
>>>>> Clients use layout information to do IO and avoid
>>>>> single-server bottleneck
>>>>>
>>>>> NFS, SCSI Block, SCSI Object layout formats all discussed
>>>>>
>>>>> Support for multiple formats looks desirable (and
>>>>> doable).
>>>>>
>>>>> -----Original Message-----
>>>>> From: Garth Gibson [ mailto:garth@panasas.com
>>>>> <mailto:garth@panasas.com> ]
>>>>> Sent: Friday, February 20, 2004 7:22 PM
>>>>> To: pnfs-reqs@yahoogroups.com
>>>>> Subject: Re: [pnfs-reqs] RE: NEPS-REQS: getting started
>>>>>
>>>>>
>>>>> Based on feedback from Brent's concall 8 days ago, here is my cut
>>>>> at
>>>>> Gary's proposal for a short problem statement introduction
>>>>> presentation.
>>>>>
>>>>> garth
>>
>>
>>
>>
>>
>> Yahoo! Groups Links
>>
>>
>>
>>
>>
>>
>
>
>
> ------------------------ Yahoo! Groups Sponsor
> ---------------------~-->
> Buy Ink Cartridges or Refill Kits for your HP, Epson, Canon or Lexmark
> Printer at MyInks.com. Free s/h on orders $50 or more to the US &
> Canada.
> http://www.c1tracking.com/l.asp?cid=5511
> http://us.click.yahoo.com/mOAaAA/3exGAA/qnsNAA/W6uqlB/TM
> ---------------------------------------------------------------------
> ~->
>
>
> Yahoo! Groups Links
>
>
>
>

From Thomas.Talpey@netapp.com Sun Feb 29 20:24:09 2004
Return-Path: <Thomas.Talpey@netapp.com>
X-Sender: Thomas.Talpey@netapp.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 37218 invoked from network); 1 Mar 2004 04:24:08 -0000
Received: from unknown (66.218.66.167)
by m3.grp.scd.yahoo.com with QMQP; 1 Mar 2004 04:24:08 -0000
Received: from unknown (HELO mx01.netapp.com) (198.95.226.53)
by mta6.grp.scd.yahoo.com with SMTP; 1 Mar 2004 04:24:08 -0000
Received: from hawk.corp.netapp.com (hawk [10.57.156.122])
by mx01.netapp.com (8.12.10/8.12.10/NTAP-1.4) with ESMTP id i214O8JC008573
for <pnfs-reqs@yahoogroups.com>; Sun, 29 Feb 2004 20:24:08 -0800 (PST)
Received: from svlexc01.hq.netapp.com (svlexc01.corp.netapp.com [10.10.22.171])
by hawk.corp.netapp.com (8.12.9/8.12.9/NTAP-1.5) with ESMTP id i214O78L027043
for <pnfs-reqs@yahoogroups.com>; Sun, 29 Feb 2004 20:24:07 -0800 (PST)
Received: from tmt.netapp.com ([10.97.6.31]) by silver.hq.netapp.com with Microsoft SMTPSVC(5.0.2195.5329); Sun, 29 Feb 2004 23:23:57 -0500
MIME-Version: 1.0
Content-Type: multipart/alternative;
boundary="----_=_NextPart_001_01C3FF45.03516C80"
X-MimeOLE: Produced By Microsoft Exchange V6.0.6249.0
content-class: urn:content-classes:message
Date: Sun, 29 Feb 2004 20:23:56 -0800
Message-ID: <6.0.3.0.2.20040229231754.01b88ec0@silver.nane.netapp.com>
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
Thread-Topic: [pnfs-reqs] concall now to finalize problem statement slides for Seoul if possible
Thread-Index: AcP/RQO41PpGg+75T2KWjKxeFLsw/g==
To: <pnfs-reqs@yahoogroups.com>
X-eGroups-Remote-IP: 198.95.226.53
From: "Talpey, Thomas" <Thomas.Talpey@netapp.com>
Subject: Re: [pnfs-reqs] concall now to finalize problem statement slides for Seoul if possible
X-Yahoo-Group-Post: member; u=44154239
X-Yahoo-Profile: tmtymailu

I like the new version better. It roots the pNFS solution in reality
pretty well. Kind of the extended version of the elevator pitch.
I might try to split the longer slides. The WG meeting is Thursday
Korean time - Wednesday US.

Tom.

At 09:36 PM 2/26/2004, Garth Gibson wrote:
>Tom and others
>
>In this mornings call Julian Satran, Peter Corbett, Benny Halevy and I
>discussed how to give Tom something more concrete to cut the level of
>abstraction without giving the appearance that we are providing a
>solution and not describing a problem.
>
>Here is a new draft, unchanged except that I added a slide after slide
>2 that shows a "Now vs Goal" pair of diagrams of storage, NFS server
>and client and says "Now: requested data moves through NFS server" and
>"Goal: reply from NFS server enables parallel access to storage
>servers."
>
>Tom. Use it or not.  Modify it as needed.  You are the speaker :-)
>
>garth
>
>
>On Feb 26, 2004, at 5:31 PM, Talpey, Thomas wrote:
>> Garth - sorry I couldn't join the call from the Far East.
>> Could you please send the minutes and the latest
>> version so I can comment? I'm ready to deliver the
>> presentation, in any case.
>>
>> Tom.
>
>
>
>
>
>Yahoo! Groups Links
>
><*> To visit your group on the web, go to:
>     http://groups.yahoo.com/group/pnfs-reqs/
>
><*> To unsubscribe from this group, send an email to:
>     pnfs-reqs-unsubscribe@yahoogroups.com
>
><*> Your use of Yahoo! Groups is subject to:
>     http://docs.yahoo.com/info/terms/
>
>


From garth@panasas.com Wed Mar 03 09:05:37 2004
Return-Path: <garth@panasas.com>
X-Sender: garth@panasas.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 59921 invoked from network); 3 Mar 2004 17:05:33 -0000
Received: from unknown (66.218.66.167)
by m12.grp.scd.yahoo.com with QMQP; 3 Mar 2004 17:05:33 -0000
Received: from unknown (HELO PIKES.panasas.com) (65.194.124.178)
by mta6.grp.scd.yahoo.com with SMTP; 3 Mar 2004 17:05:33 -0000
Received: from [127.0.0.1] ([172.17.19.3]) by PIKES.panasas.com with SMTP (Microsoft Exchange Internet Mail Service Version 5.5.2653.13)
id GFV6TLLQ; Wed, 3 Mar 2004 12:05:09 -0500
Mime-Version: 1.0 (Apple Message framework v612)
Content-Type: text/plain; charset=US-ASCII; format=flowed
Message-Id: <E82C040C-6D34-11D8-A101-000A95A94F04@panasas.com>
Content-Transfer-Encoding: 7bit
Cc: Garth Gibson <garth@panasas.com>
Date: Wed, 3 Mar 2004 12:05:00 -0500
To: pnfs-reqs@yahoogroups.com
X-Mailer: Apple Mail (2.612)
X-eGroups-Remote-IP: 65.194.124.178
From: Garth Gibson <garth@panasas.com>
Subject: concall tomorrow cancelled
X-Yahoo-Group-Post: member; u=169457820
X-Yahoo-Profile: garth_a_gibson

ADVERTISEMENT
click here
Sometime in the next 24 hours our problem statement gets presented to
the IETF NFS TWG. Hey, cool, we delivered on that goal! Next week
lets hear what Tom, Julian and David have to say about the reception.

And lets get back to the core requirements document next week - review
the items that are on it ... ie, things like:

1.0 Minimalism
1.1 Proxying
1.1.0 Legacy proxying
1.1.1 Strict proxying
1.1.2 Functional proxying
1.2 Cache consistency
1.3 Delegation promotion & reacquisition
1.4 Layout delegations
1.5 Concurrent write
1.6 Map revocation
1.7 Separability
1.8 NTFS application semantics

etc.

garth

From andros@citi.umich.edu Wed Mar 03 10:27:04 2004
Return-Path: <andros@citi.umich.edu>
X-Sender: andros@citi.umich.edu
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 38659 invoked from network); 3 Mar 2004 18:27:02 -0000
Received: from unknown (66.218.66.218)
by m8.grp.scd.yahoo.com with QMQP; 3 Mar 2004 18:27:02 -0000
Received: from unknown (HELO citi.umich.edu) (141.211.133.111)
by mta3.grp.scd.yahoo.com with SMTP; 3 Mar 2004 18:27:02 -0000
Received: from citi.umich.edu (citi.umich.edu [141.211.133.111])
by citi.umich.edu (Postfix) with ESMTP
id ECB22207EB; Wed, 3 Mar 2004 13:26:39 -0500 (EST)
X-Mailer: exmh version 2.5 07/13/2001 with version: MH 6.8.3 #74[UCI]
To: pnfs-reqs@yahoogroups.com
Cc: andros@citi.umich.edu
In-reply-to: Your message of "Wed, 03 Mar 2004 12:05:00 EST."
<E82C040C-6D34-11D8-A101-000A95A94F04@panasas.com>
Mime-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Date: Wed, 03 Mar 2004 13:26:39 -0500
Message-Id: <20040303182640.ECB22207EB@citi.umich.edu>
X-eGroups-Remote-IP: 141.211.133.111
From: "William A.(Andy) Adamson" <andros@citi.umich.edu>
Subject: Re: [pnfs-reqs] concall tomorrow cancelled
X-Yahoo-Group-Post: member; u=169434965

hi garth

where can i get a copy of the core requirements document? does it have any
place holder for the 'single OPEN for a group of clients, meta data scaling'
issue i brought up with you and harvey?

-->Andy

> Sometime in the next 24 hours our problem statement gets presented to
> the IETF NFS TWG. Hey, cool, we delivered on that goal! Next week
> lets hear what Tom, Julian and David have to say about the reception.
>
> And lets get back to the core requirements document next week - review
> the items that are on it ... ie, things like:
>
> 1.0 Minimalism
> 1.1 Proxying
> 1.1.0 Legacy proxying
> 1.1.1 Strict proxying
> 1.1.2 Functional proxying
> 1.2 Cache consistency
> 1.3 Delegation promotion & reacquisition
> 1.4 Layout delegations
> 1.5 Concurrent write
> 1.6 Map revocation
> 1.7 Separability
> 1.8 NTFS application semantics
>
> etc.
>
> garth
>
>
>
>
>
> Yahoo! Groups Links
>
>
>
>
>
> 

From garth@panasas.com Wed Mar 03 10:33:59 2004
Return-Path: <garth@panasas.com>
X-Sender: garth@panasas.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 51877 invoked from network); 3 Mar 2004 18:33:58 -0000
Received: from unknown (66.218.66.172)
by m9.grp.scd.yahoo.com with QMQP; 3 Mar 2004 18:33:58 -0000
Received: from unknown (HELO PIKES.panasas.com) (65.194.124.178)
by mta4.grp.scd.yahoo.com with SMTP; 3 Mar 2004 18:33:58 -0000
Received: from [127.0.0.1] ([172.17.19.3]) by PIKES.panasas.com with SMTP (Microsoft Exchange Internet Mail Service Version 5.5.2653.13)
id GFV6TL8V; Wed, 3 Mar 2004 13:33:56 -0500
In-Reply-To: <20040303182640.ECB22207EB@citi.umich.edu>
References: <20040303182640.ECB22207EB@citi.umich.edu>
Mime-Version: 1.0 (Apple Message framework v612)
Content-Type: text/plain; charset=US-ASCII; format=flowed
Message-Id: <4F64D15E-6D41-11D8-A101-000A95A94F04@panasas.com>
Content-Transfer-Encoding: 7bit
Cc: Garth Gibson <garth@panasas.com>
Date: Wed, 3 Mar 2004 13:33:47 -0500
To: Andy Adamson <andros@citi.umich.edu>,
pnfs-reqs@yahoogroups.com
X-Mailer: Apple Mail (2.612)
X-eGroups-Remote-IP: 65.194.124.178
From: Garth Gibson <garth@panasas.com>
Subject: Re: [pnfs-reqs] concall tomorrow cancelled
X-Yahoo-Group-Post: member; u=169457820
X-Yahoo-Profile: garth_a_gibson

Andy,

there is not a core requirements document -- this document is the work
item we need to get back to -- we have items that belong in it that
have been described and discussed in email on this reflector -- lets
add "group open" to that list and discuss it next week

garth


On Mar 3, 2004, at 1:26 PM, William A.(Andy) Adamson wrote:

> hi garth
>
> where can i get a copy of the core requirements document? does it have
> any
> place holder for the 'single OPEN for a group of clients, meta data
> scaling'
> issue i brought up with you and harvey?
>
> -->Andy
>
>> Sometime in the next 24 hours our problem statement gets presented to
>> the IETF NFS TWG. Hey, cool, we delivered on that goal! Next week
>> lets hear what Tom, Julian and David have to say about the reception.
>>
>> And lets get back to the core requirements document next week - review
>> the items that are on it ... ie, things like:
>>
>> 1.0 Minimalism
>> 1.1 Proxying
>> 1.1.0 Legacy proxying
>> 1.1.1 Strict proxying
>> 1.1.2 Functional proxying
>> 1.2 Cache consistency
>> 1.3 Delegation promotion & reacquisition
>> 1.4 Layout delegations
>> 1.5 Concurrent write
>> 1.6 Map revocation
>> 1.7 Separability
>> 1.8 NTFS application semantics
>>
>> etc.
>>
>> garth
>>
>>
>>
>>
>>
>> Yahoo! Groups Links
>>
>>
>>
>>
>>
>>
>
>
>
>
>
> Yahoo! Groups Links
>
>
>
>

From mclarty3@llnl.gov Wed Mar 03 16:30:19 2004
Return-Path: <mclarty3@llnl.gov>
X-Sender: mclarty3@llnl.gov
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 13572 invoked from network); 4 Mar 2004 00:30:16 -0000
Received: from unknown (66.218.66.216)
by m13.grp.scd.yahoo.com with QMQP; 4 Mar 2004 00:30:16 -0000
Received: from unknown (HELO smtp-1.llnl.gov) (128.115.250.81)
by mta1.grp.scd.yahoo.com with SMTP; 4 Mar 2004 00:30:16 -0000
Received: from poptop.llnl.gov (localhost [127.0.0.1])
by smtp-1.llnl.gov (8.12.3p2-20030917/8.12.3/LLNL evision: 1.13 $) with ESMTP id i240U3mm020251
for <pnfs-reqs@yahoogroups.com>; Wed, 3 Mar 2004 16:30:04 -0800 (PST)
Received: from POLARBEAR.llnl.gov ([134.9.18.59] verified)
by poptop.llnl.gov (CommuniGate Pro SMTP 4.0.6)
with ESMTP id 37710231 for pnfs-reqs@yahoogroups.com; Wed, 03 Mar 2004 16:30:03 -0800
Message-Id: <5.0.0.25.2.20040303162808.0276deb0@poptop.llnl.gov>
X-Sender: e002801@poptop.llnl.gov
X-Mailer: QUALCOMM Windows Eudora Version 5.0
Date: Wed, 03 Mar 2004 16:30:03 -0800
To: pnfs-reqs@yahoogroups.com
In-Reply-To: <E82C040C-6D34-11D8-A101-000A95A94F04@panasas.com>
Mime-Version: 1.0
Content-Type: text/plain; charset="us-ascii"; format=flowed
X-eGroups-Remote-IP: 128.115.250.81
From: Tyce McLarty <mclarty3@llnl.gov>
Subject: Re: [pnfs-reqs] concall tomorrow cancelled
X-Yahoo-Group-Post: member; u=169320772
X-Yahoo-Profile: mclarty3

Garth,

Didn't we say something in December about another face-to-face meeting at
the time of the FAST'04 conference? I do not remember seeing anything more
definite. Is that still on?

Thanks,
Tyce

At 12:05 PM 3/3/2004 -0500, you wrote:
>Sometime in the next 24 hours our problem statement gets presented to
>the IETF NFS TWG. Hey, cool, we delivered on that goal! Next week
>lets hear what Tom, Julian and David have to say about the reception.
>
>And lets get back to the core requirements document next week - review
>the items that are on it ... ie, things like:
>
>1.0 Minimalism
>1.1 Proxying
>1.1.0 Legacy proxying
>1.1.1 Strict proxying
>1.1.2 Functional proxying
>1.2 Cache consistency
>1.3 Delegation promotion & reacquisition
>1.4 Layout delegations
>1.5 Concurrent write
>1.6 Map revocation
>1.7 Separability
>1.8 NTFS application semantics
>
>etc.
>
>garth
>
>
>
>
>
>Yahoo! Groups Links
>
>
>
>

From Thomas.Talpey@netapp.com Wed Mar 03 17:16:58 2004
Return-Path: <Thomas.Talpey@netapp.com>
X-Sender: Thomas.Talpey@netapp.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 28052 invoked from network); 4 Mar 2004 01:16:56 -0000
Received: from unknown (66.218.66.172)
by m20.grp.scd.yahoo.com with QMQP; 4 Mar 2004 01:16:56 -0000
Received: from unknown (HELO mx01.netapp.com) (198.95.226.53)
by mta4.grp.scd.yahoo.com with SMTP; 4 Mar 2004 01:16:56 -0000
Received: from hawk.corp.netapp.com (hawk [10.57.156.122])
by mx01.netapp.com (8.12.10/8.12.10/NTAP-1.4) with ESMTP id i241GtJC026219
for <pnfs-reqs@yahoogroups.com>; Wed, 3 Mar 2004 17:16:56 -0800 (PST)
Received: from svlexc01.hq.netapp.com (svlexc01.corp.netapp.com [10.10.22.171])
by hawk.corp.netapp.com (8.12.9/8.12.9/NTAP-1.5) with ESMTP id i241Gt8N013164
for <pnfs-reqs@yahoogroups.com>; Wed, 3 Mar 2004 17:16:55 -0800 (PST)
Received: from tmt.netapp.com ([10.97.6.36]) by silver.hq.netapp.com with Microsoft SMTPSVC(5.0.2195.5329); Wed, 3 Mar 2004 20:16:45 -0500
MIME-Version: 1.0
Content-Type: multipart/alternative;
boundary="----_=_NextPart_001_01C40186.5BC37C80"
X-MimeOLE: Produced By Microsoft Exchange V6.0.6249.0
content-class: urn:content-classes:message
Date: Wed, 3 Mar 2004 17:16:31 -0800
Message-ID: <6.0.3.0.2.20040303201511.01eba6d8@silver.nane.netapp.com>
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
Thread-Topic: [pnfs-reqs] concall tomorrow cancelled
Thread-Index: AcQBhl1nhCX47ptYSnazBJeLzm1jWA==
To: <pnfs-reqs@yahoogroups.com>
X-eGroups-Remote-IP: 198.95.226.53
From: "Talpey, Thomas" <Thomas.Talpey@netapp.com>
Subject: Re: [pnfs-reqs] concall tomorrow cancelled
X-Yahoo-Group-Post: member; u=44154239
X-Yahoo-Profile: tmtymailu

Good point - I've already put this in the slides. Someone
let me know in the next 2.5 hours if that's not the case!

BTW I'll send the updated slides in a second...

Tom.

At 07:30 PM 3/3/2004, Tyce McLarty wrote:
>Garth,
>
>Didn't we say something in December about another face-to-face meeting at
>the time of the FAST'04 conference? I do not remember seeing anything more
>definite. Is that still on?
>
>Thanks,
>Tyce
>
>At 12:05 PM 3/3/2004 -0500, you wrote:
>>Sometime in the next 24 hours our problem statement gets presented to
>>the IETF NFS TWG.  Hey, cool, we delivered on that goal!  Next week
>>lets hear what Tom, Julian and David have to say about the reception.
>>
>>And lets get back to the core requirements document next week - review
>>the items that are on it ... ie, things like:
>>
>>1.0 Minimalism
>>1.1 Proxying
>>1.1.0 Legacy proxying
>>1.1.1 Strict proxying
>>1.1.2 Functional proxying
>>1.2 Cache consistency
>>1.3 Delegation promotion & reacquisition
>>1.4 Layout delegations
>>1.5 Concurrent write
>>1.6 Map revocation
>>1.7 Separability
>>1.8 NTFS application semantics
>>
>>etc.
>>
>>garth
>>
>>
>>
>>
>>
>>Yahoo! Groups Links
>>
>>
>>
>>
>
>
>
>
>Yahoo! Groups Links
>
><*> To visit your group on the web, go to:
>     http://groups.yahoo.com/group/pnfs-reqs/
>
><*> To unsubscribe from this group, send an email to:
>     pnfs-reqs-unsubscribe@yahoogroups.com
>
><*> Your use of Yahoo! Groups is subject to:
>     http://docs.yahoo.com/info/terms/
> 

From peter-yahoo@honeyman.org Wed Mar 03 17:20:44 2004
Return-Path: <peter-yahoo@honeyman.org>
X-Sender: peter-yahoo@honeyman.org
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 3642 invoked from network); 4 Mar 2004 01:20:44 -0000
Received: from unknown (66.218.66.172)
by m10.grp.scd.yahoo.com with QMQP; 4 Mar 2004 01:20:44 -0000
Received: from unknown (HELO n6.grp.scd.yahoo.com) (66.218.66.90)
by mta4.grp.scd.yahoo.com with SMTP; 4 Mar 2004 01:20:43 -0000
Received: from [66.218.67.179] by n6.grp.scd.yahoo.com with NNFMP; 04 Mar 2004 01:20:43 -0000
Date: Thu, 04 Mar 2004 01:20:43 -0000
To: pnfs-reqs@yahoogroups.com
Message-ID: <c2609b+6dvt@eGroups.com>
In-Reply-To: <5.0.0.25.2.20040303162808.0276deb0@poptop.llnl.gov>
User-Agent: eGroups-EW/0.82
MIME-Version: 1.0
Content-Type: text/plain; charset=ISO-8859-1
Content-Length: 281
X-Mailer: Yahoo Groups Message Poster
X-eGroups-Remote-IP: 66.218.66.90
From: "peterhoneyman" <peter-yahoo@honeyman.org>
X-Originating-IP: 68.248.9.51
Subject: Re: concall tomorrow cancelled
X-Yahoo-Group-Post: member; u=117991698
X-Yahoo-Profile: peterhoneyman

yes, we have a room reserved in the conference hotel for the morning
of march 31.

peter

> Didn't we say something in December about another face-to-face
meeting at
> the time of the FAST'04 conference? I do not remember seeing
anything more
> definite. Is that still on?

From garth@panasas.com Wed Mar 03 17:27:22 2004
Return-Path: <garth@panasas.com>
X-Sender: garth@panasas.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 37476 invoked from network); 4 Mar 2004 01:27:22 -0000
Received: from unknown (66.218.66.216)
by m7.grp.scd.yahoo.com with QMQP; 4 Mar 2004 01:27:22 -0000
Received: from unknown (HELO PIKES.panasas.com) (65.194.124.178)
by mta1.grp.scd.yahoo.com with SMTP; 4 Mar 2004 01:27:17 -0000
Received: from [127.0.0.1] ([172.17.19.3]) by PIKES.panasas.com with SMTP (Microsoft Exchange Internet Mail Service Version 5.5.2653.13)
id GFV6T3BW; Wed, 3 Mar 2004 20:27:13 -0500
Mime-Version: 1.0 (Apple Message framework v612)
In-Reply-To: <c2609b+6dvt@eGroups.com>
References: <c2609b+6dvt@eGroups.com>
Content-Type: text/plain; charset=US-ASCII; format=flowed
Message-Id: <0B9C112B-6D7B-11D8-A101-000A95A94F04@panasas.com>
Content-Transfer-Encoding: 7bit
Date: Wed, 3 Mar 2004 20:27:05 -0500
To: pnfs-reqs@yahoogroups.com
X-Mailer: Apple Mail (2.612)
X-eGroups-Remote-IP: 65.194.124.178
From: Garth Gibson <garth@panasas.com>
Subject: Re: [pnfs-reqs] Re: concall tomorrow cancelled
X-Yahoo-Group-Post: member; u=169457820
X-Yahoo-Profile: garth_a_gibson

3/31 (Wed) from 8:30am - 12:00pm in the Dolores room at the Grand
Hyatt, in San Francisco -- same hotel as the NSDI and FAST conferences

thanks to Peter for arranging this

garth


On Mar 3, 2004, at 8:20 PM, peterhoneyman wrote:

> yes, we have a room reserved in the conference hotel for the morning
> of march 31.
>
> peter
>
>> Didn't we say something in December about another face-to-face
>> meeting at
>> the time of the FAST'04 conference? I do not remember seeing anything
>> more
>> definite. Is that still on?
>


From Thomas.Talpey@netapp.com Wed Mar 03 17:49:53 2004
Return-Path: <Thomas.Talpey@netapp.com>
X-Sender: Thomas.Talpey@netapp.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 41657 invoked from network); 4 Mar 2004 01:49:51 -0000
Received: from unknown (66.218.66.166)
by m9.grp.scd.yahoo.com with QMQP; 4 Mar 2004 01:49:51 -0000
Received: from unknown (HELO mx01.netapp.com) (198.95.226.53)
by mta5.grp.scd.yahoo.com with SMTP; 4 Mar 2004 01:49:52 -0000
Received: from hawk.corp.netapp.com (hawk [10.57.156.122])
by mx01.netapp.com (8.12.10/8.12.10/NTAP-1.4) with ESMTP id i241npJC001618
for <pnfs-reqs@yahoogroups.com>; Wed, 3 Mar 2004 17:49:52 -0800 (PST)
Received: from svlexc01.hq.netapp.com (svlexc01.corp.netapp.com [10.10.22.171])
by hawk.corp.netapp.com (8.12.9/8.12.9/NTAP-1.5) with ESMTP id i241np8L023439
for <pnfs-reqs@yahoogroups.com>; Wed, 3 Mar 2004 17:49:51 -0800 (PST)
Received: from tmt.netapp.com ([10.97.6.36]) by silver.hq.netapp.com with Microsoft SMTPSVC(5.0.2195.5329); Wed, 3 Mar 2004 20:49:39 -0500
MIME-Version: 1.0
Content-Type: multipart/mixed;
boundary="----_=_NextPart_001_01C4018A.F45BFB80"
X-MimeOLE: Produced By Microsoft Exchange V6.0.6249.0
content-class: urn:content-classes:message
Date: Wed, 3 Mar 2004 17:48:51 -0800
Message-ID: <6.0.3.0.2.20040303204633.01c62ec0@silver.nane.netapp.com>
X-MS-Has-Attach: yes
X-MS-TNEF-Correlator:
Thread-Topic: IETF-59 presentation updated draft
Thread-Index: AcQBivg0h9qI3oqJRKStEnDSL6Ptzw==
To: <pnfs-reqs@yahoogroups.com>
X-eGroups-Remote-IP: 198.95.226.53
From: "Talpey, Thomas" <Thomas.Talpey@netapp.com>
Subject: IETF-59 presentation updated draft
X-Yahoo-Group-Post: member; u=44154239
X-Yahoo-Profile: tmtymailu

Here's the latest version. The content is largely the same but
I simplified a couple of areas and jazzed up the picture a little.
Also I added a "what" slide to the front by adapting stuff from
the back.

The WG meeting is in two hours so sorry for the late appearance.
I'll check for comments regularly until then.

Tom.


Attachment (not stored)
pNFS-intro-ietf59.ppt
Type: application/octet-stream

From Thomas.Talpey@netapp.com Wed Mar 03 23:12:00 2004
Return-Path: <Thomas.Talpey@netapp.com>
X-Sender: Thomas.Talpey@netapp.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 86040 invoked from network); 4 Mar 2004 07:12:00 -0000
Received: from unknown (66.218.66.217)
by m14.grp.scd.yahoo.com with QMQP; 4 Mar 2004 07:12:00 -0000
Received: from unknown (HELO mx01.netapp.com) (198.95.226.53)
by mta2.grp.scd.yahoo.com with SMTP; 4 Mar 2004 07:11:59 -0000
Received: from hawk.corp.netapp.com (hawk [10.57.156.122])
by mx01.netapp.com (8.12.10/8.12.10/NTAP-1.4) with ESMTP id i247BlJC010083
for <pnfs-reqs@yahoogroups.com>; Wed, 3 Mar 2004 23:11:47 -0800 (PST)
Received: from svlexc01.hq.netapp.com (svlexc01.corp.netapp.com [10.10.22.171])
by hawk.corp.netapp.com (8.12.9/8.12.9/NTAP-1.5) with ESMTP id i247Bl8L013242
for <pnfs-reqs@yahoogroups.com>; Wed, 3 Mar 2004 23:11:47 -0800 (PST)
Received: from tmt.netapp.com ([10.97.6.30]) by silver.hq.netapp.com with Microsoft SMTPSVC(5.0.2195.5329); Thu, 4 Mar 2004 02:11:40 -0500
MIME-Version: 1.0
Content-Type: multipart/mixed;
boundary="----_=_NextPart_001_01C401B7.F0929E00"
X-MimeOLE: Produced By Microsoft Exchange V6.0.6249.0
content-class: urn:content-classes:message
Date: Wed, 3 Mar 2004 23:11:22 -0800
Message-ID: <6.0.3.0.2.20040304020718.01c55ec0@silver.nane.netapp.com>
X-MS-Has-Attach: yes
X-MS-TNEF-Correlator:
Thread-Topic: Source for today's presentation
Thread-Index: AcQBt/Ndrsk+aGmQQCODwZ9eFak9QA==
To: <pnfs-reqs@yahoogroups.com>
X-eGroups-Remote-IP: 198.95.226.53
From: "Talpey, Thomas" <Thomas.Talpey@netapp.com>
Subject: Source for today's presentation
X-Yahoo-Group-Post: member; u=44154239
X-Yahoo-Profile: tmtymailu

Here's the final powerpoint source.

The presentation went well, though the important
message conduit is the minutes/proceedings. I think
I wrote the "elevator pitch" for them. :-)

Tom.


Attachment (not stored)
pNFS-intro-ietf59-final.ppt
Type: application/octet-stream

From julian_satran@il.ibm.com Thu Mar 04 00:56:52 2004
Return-Path: <Julian_Satran@il.ibm.com>
X-Sender: Julian_Satran@il.ibm.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 55119 invoked from network); 4 Mar 2004 08:56:51 -0000
Received: from unknown (66.218.66.166)
by m16.grp.scd.yahoo.com with QMQP; 4 Mar 2004 08:56:51 -0000
Received: from unknown (HELO mtagate1.de.ibm.com) (195.212.29.150)
by mta5.grp.scd.yahoo.com with SMTP; 4 Mar 2004 08:56:50 -0000
Received: from d12relay01.megacenter.de.ibm.com (d12relay01.megacenter.de.ibm.com [9.149.165.180])
by mtagate1.de.ibm.com (8.12.10/8.12.10) with ESMTP id i248tQpS103912;
Thu, 4 Mar 2004 08:55:27 GMT
Received: from d12ml102.megacenter.de.ibm.com (d12av02.megacenter.de.ibm.com [9.149.165.228])
by d12relay01.megacenter.de.ibm.com (8.12.10/NCO/VER6.6) with ESMTP id i248tTOQ289120;
Thu, 4 Mar 2004 09:55:30 +0100
In-Reply-To: <E82C040C-6D34-11D8-A101-000A95A94F04@panasas.com>
To: pnfs-reqs@yahoogroups.com
Cc: Garth Gibson <garth@panasas.com>, pnfs-reqs@yahoogroups.com
MIME-Version: 1.0
X-Mailer: Lotus Notes Release 6.5.1 January 21, 2004
Message-ID: <OFFEC762D0.638D9F8F-ON49256E4D.002F7ABD-49256E4D.00310155@il.ibm.com>
Date: Thu, 4 Mar 2004 17:55:24 +0900
X-MIMETrack: Serialize by Router on D12ML102/12/M/IBM(Release 6.0.2CF2|July 23, 2003) at
04/03/2004 10:55:25,
Serialize complete at 04/03/2004 10:55:25
Content-Type: text/plain; charset="US-ASCII"
X-eGroups-Remote-IP: 195.212.29.150
From: Julian Satran <julian_satran@il.ibm.com>
Subject: Re: [pnfs-reqs] concall tomorrow cancelled
X-Yahoo-Group-Post: member; u=64714603
X-Yahoo-Profile: julian_satran

The room was not very full (and that is an eufemism). I tried to bring in
the area director (Jon Petersen) and told him briefly what it's all about.
He did not come but I met him again after the session and reiterated my
position.
As Jon is trying to close whatever groups he can - we will have to
persuade him that NFSV4 is not yet done and this activity can't be pursued
as a tiny addition that can be reviewed by 2 experts (that is what they do
after closing WGs). David did not show-up. Tom will have all the details.
He chaired the WG for Beepy (who is sick).

We proposed to make pNFS an official work-item of the WG but given the
completely minor presence in the room (other session where in this state
too) this has to be brought to the mailing list and it has to generate
some traffic (which I assume it will).

Jon complained also about the lack of volunteers to write drafts.
I assume he was not referring to the NFS only though.

Regards,
Julo



Garth Gibson <garth@panasas.com>
04/03/2004 02:05
Please respond to
pnfs-reqs


To
pnfs-reqs@yahoogroups.com
cc
Garth Gibson <garth@panasas.com>
Subject
[pnfs-reqs] concall tomorrow cancelled






Sometime in the next 24 hours our problem statement gets presented to
the IETF NFS TWG. Hey, cool, we delivered on that goal! Next week
lets hear what Tom, Julian and David have to say about the reception.

And lets get back to the core requirements document next week - review
the items that are on it ... ie, things like:

1.0 Minimalism
1.1 Proxying
1.1.0 Legacy proxying
1.1.1 Strict proxying
1.1.2 Functional proxying
1.2 Cache consistency
1.3 Delegation promotion & reacquisition
1.4 Layout delegations
1.5 Concurrent write
1.6 Map revocation
1.7 Separability
1.8 NTFS application semantics

etc.

garth





Yahoo! Groups Links

From black_david@emc.com Thu Mar 04 03:00:54 2004
Return-Path: <Black_David@emc.com>
X-Sender: Black_David@emc.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 96985 invoked from network); 4 Mar 2004 11:00:52 -0000
Received: from unknown (66.218.66.172)
by m14.grp.scd.yahoo.com with QMQP; 4 Mar 2004 11:00:52 -0000
Received: from unknown (HELO srexchimc2.eng.emc.com) (168.159.100.11)
by mta4.grp.scd.yahoo.com with SMTP; 4 Mar 2004 11:00:51 -0000
Received: from MAHO3MSX2.corp.emc.com (maho3msx2.isus.emc.com [128.221.11.32]) by srexchimc2.eng.emc.com with SMTP (Microsoft Exchange Internet Mail Service Version 5.5.2657.72)
id GD38HLZM; Thu, 4 Mar 2004 06:00:05 -0500
Received: by maho3msx2.corp.emc.com with Internet Mail Service (5.5.2653.19)
id <GG8QC7SS>; Thu, 4 Mar 2004 06:00:05 -0500
Message-ID: <B459CE1AFFC52D4688B2A5B842CA35EA7A56A3@corpmx14.us.dg.com>
X-Sybari-Trust: 03b6d71b 1d8c424f 578b0cff 0000013d
To: pnfs-reqs@yahoogroups.com
Date: Thu, 4 Mar 2004 06:00:04 -0500
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2653.19)
Content-Type: text/plain
X-eGroups-Remote-IP: 168.159.100.11
From: black_david@emc.com
Subject: Seoul results (was: concall tomorrow cancelled)
X-Yahoo-Group-Post: member; u=82420288
X-Yahoo-Profile: dlb237

Julian writes:

> The room was not very full (and that is an eufemism). I tried to bring in
> the area director (Jon Petersen) and told him briefly what it's all about.

> He did not come but I met him again after the session and reiterated my
> position.
> As Jon is trying to close whatever groups he can - we will have to
> persuade him that NFSV4 is not yet done and this activity can't be pursued

> as a tiny addition that can be reviewed by 2 experts (that is what they do

> after closing WGs). David did not show-up. Tom will have all the details.
> He chaired the WG for Beepy (who is sick).

My lack of attendance was a brain-fault on my part, for which I sincerely
apologize. I talked to Jon both before and after the meeting - Jon's ok
with adding pNFS to the nfsv4 WG charter, even considering the overall
desire to close out long-running WGs like nfsv4. That's good news, but
keep in mind that this is today's answer, and ADs always reserve the right
to change their minds.

> We proposed to make pNFS an official work-item of the WG but given the
> completely minor presence in the room (other session where in this state
> too) this has to be brought to the mailing list and it has to generate
> some traffic (which I assume it will).

Attendance from the US is down across the board here in Seoul. pNFS does
need to be followed up on the mailing list and then the chairs (Beepy and
Spencer) will need to work out the addition with Jon. So far, so good.

Thanks,
--David
----------------------------------------------------
David L. Black, Senior Technologist
EMC Corporation, 176 South St., Hopkinton, MA 01748
+1 (508) 293-7953 FAX: +1 (508) 293-7786
black_david@emc.com Mobile: +1 (978) 394-7754
----------------------------------------------------

From Brian.Pawlowski@netapp.com Thu Mar 04 03:29:54 2004
Return-Path: <beepy@netapp.com>
X-Sender: beepy@netapp.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 87650 invoked from network); 4 Mar 2004 11:29:53 -0000
Received: from unknown (66.218.66.218)
by m5.grp.scd.yahoo.com with QMQP; 4 Mar 2004 11:29:53 -0000
Received: from unknown (HELO mx01.netapp.com) (198.95.226.53)
by mta3.grp.scd.yahoo.com with SMTP; 4 Mar 2004 11:29:53 -0000
Received: from hawk.corp.netapp.com (hawk [10.57.156.122])
by mx01.netapp.com (8.12.10/8.12.10/NTAP-1.4) with ESMTP id i24BTdJC014388
for <pnfs-reqs@yahoogroups.com>; Thu, 4 Mar 2004 03:29:39 -0800 (PST)
Received: from tooting-fe.eng.netapp.com (tooting-fe.eng.netapp.com [10.56.10.118])
by hawk.corp.netapp.com (8.12.9/8.12.9/NTAP-1.5) with ESMTP id i24BTd8L022129
for <pnfs-reqs@yahoogroups.com>; Thu, 4 Mar 2004 03:29:39 -0800 (PST)
Received: (from beepy@localhost)
by tooting-fe.eng.netapp.com (8.11.7p1+Sun/8.11.6) id i24BTcH14038;
Thu, 4 Mar 2004 03:29:38 -0800 (PST)
Message-Id: <200403041129.i24BTcH14038@tooting-fe.eng.netapp.com>
In-Reply-To: <B459CE1AFFC52D4688B2A5B842CA35EA7A56A3@corpmx14.us.dg.com> from "black_david@emc.com" at "Mar 4, 4 06:00:04 am"
To: pnfs-reqs@yahoogroups.com
Date: Thu, 4 Mar 2004 03:29:38 -0800 (PST)
Cc: pnfs-reqs@yahoogroups.com
X-Mailer: ELM [version 2.4ME++ PL40 (25)]
MIME-Version: 1.0
Content-Type: text/plain; charset=US-ASCII
Content-Transfer-Encoding: 7bit
X-eGroups-Remote-IP: 198.95.226.53
X-eGroups-From: Brian Pawlowski <beepy@netapp.com>
From: Brian Pawlowski <Brian.Pawlowski@netapp.com>
Subject: Re: [pnfs-reqs] Seoul results (was: concall tomorrow cancelled)
X-Yahoo-Group-Post: member; u=169504717
X-Yahoo-Profile: brianpawlowski

Most of the people that actually do real work on NFS V4 (implementations
and even spec work) do not attend IETF.

Such is life.

I'm still returning from the dead.

> Julian writes:
>
> > The room was not very full (and that is an eufemism). I tried to bring in
> > the area director (Jon Petersen) and told him briefly what it's all about.
>
> > He did not come but I met him again after the session and reiterated my
> > position.
> > As Jon is trying to close whatever groups he can - we will have to
> > persuade him that NFSV4 is not yet done and this activity can't be pursued
>
> > as a tiny addition that can be reviewed by 2 experts (that is what they do
>
> > after closing WGs). David did not show-up. Tom will have all the details.
> > He chaired the WG for Beepy (who is sick).
>
> My lack of attendance was a brain-fault on my part, for which I sincerely
> apologize. I talked to Jon both before and after the meeting - Jon's ok
> with adding pNFS to the nfsv4 WG charter, even considering the overall
> desire to close out long-running WGs like nfsv4. That's good news, but
> keep in mind that this is today's answer, and ADs always reserve the right
> to change their minds.
>
> > We proposed to make pNFS an official work-item of the WG but given the
> > completely minor presence in the room (other session where in this state
> > too) this has to be brought to the mailing list and it has to generate
> > some traffic (which I assume it will).
>
> Attendance from the US is down across the board here in Seoul. pNFS does
> need to be followed up on the mailing list and then the chairs (Beepy and
> Spencer) will need to work out the addition with Jon. So far, so good.
>
> Thanks,
> --David
> ----------------------------------------------------
> David L. Black, Senior Technologist
> EMC Corporation, 176 South St., Hopkinton, MA 01748
> +1 (508) 293-7953 FAX: +1 (508) 293-7786
> black_david@emc.com Mobile: +1 (978) 394-7754
> ----------------------------------------------------
>
>
>
> Yahoo! Groups Links
>
>
>
>
> 

From Thomas.Talpey@netapp.com Thu Mar 04 03:36:45 2004
Return-Path: <Thomas.Talpey@netapp.com>
X-Sender: Thomas.Talpey@netapp.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 30803 invoked from network); 4 Mar 2004 11:36:44 -0000
Received: from unknown (66.218.66.218)
by m10.grp.scd.yahoo.com with QMQP; 4 Mar 2004 11:36:44 -0000
Received: from unknown (HELO mx01.netapp.com) (198.95.226.53)
by mta3.grp.scd.yahoo.com with SMTP; 4 Mar 2004 11:36:44 -0000
Received: from hawk.corp.netapp.com (hawk [10.57.156.122])
by mx01.netapp.com (8.12.10/8.12.10/NTAP-1.4) with ESMTP id i24BW8JC015149
for <pnfs-reqs@yahoogroups.com>; Thu, 4 Mar 2004 03:32:08 -0800 (PST)
Received: from svlexc01.hq.netapp.com (svlexc01.corp.netapp.com [10.10.22.171])
by hawk.corp.netapp.com (8.12.9/8.12.9/NTAP-1.5) with ESMTP id i24BW88L023248
for <pnfs-reqs@yahoogroups.com>; Thu, 4 Mar 2004 03:32:08 -0800 (PST)
Received: from tmt.netapp.com ([10.58.52.57]) by silver.hq.netapp.com with Microsoft SMTPSVC(5.0.2195.5329); Thu, 4 Mar 2004 06:32:00 -0500
MIME-Version: 1.0
Content-Type: multipart/alternative;
boundary="----_=_NextPart_001_01C401DC.4ED17800"
X-MimeOLE: Produced By Microsoft Exchange V6.0.6249.0
content-class: urn:content-classes:message
Date: Thu, 4 Mar 2004 03:31:42 -0800
Message-ID: <6.0.3.0.2.20040304062612.01b91708@silver.nane.netapp.com>
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
Thread-Topic: Seoul results/status
Thread-Index: AcQB3E+j2XIdnixkSGWV5n7Qy6UwSg==
To: <pnfs-reqs@yahoogroups.com>
X-eGroups-Remote-IP: 198.95.226.53
From: "Talpey, Thomas" <Thomas.Talpey@netapp.com>
Subject: Seoul results/status
X-Yahoo-Group-Post: member; u=44154239
X-Yahoo-Profile: tmtymailu

Julian's message about summed up the meeting itself, the
attendance was disappointing, but not unexpected. The
key thing is that we have put out the message, and the
AD, and the NFS community, are aware of it. I too spoke
with Jon before the meeting, and he did say that he was
unlikely to attend but was open to the message.

I sent the slides in pdf format to the nfsv4 reflector along
with the agenda/proceedings. They're stuck in the moderator
queue due to the size (only 50K) but should appear soon.
So, folks will see them.

I suggest a concall sometime next week to discuss next
steps. Maybe just use our regular Thursday slot? It's
important to start some buzz before the 3/31 BOF.

Tom.


From garth@panasas.com Thu Mar 04 08:06:06 2004
Return-Path: <garth@panasas.com>
X-Sender: garth@panasas.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 90649 invoked from network); 4 Mar 2004 16:06:03 -0000
Received: from unknown (66.218.66.166)
by m2.grp.scd.yahoo.com with QMQP; 4 Mar 2004 16:06:03 -0000
Received: from unknown (HELO PIKES.panasas.com) (65.194.124.178)
by mta5.grp.scd.yahoo.com with SMTP; 4 Mar 2004 16:06:03 -0000
Received: from [127.0.0.1] ([172.17.19.3]) by PIKES.panasas.com with SMTP (Microsoft Exchange Internet Mail Service Version 5.5.2653.13)
id GFV6TQPD; Thu, 4 Mar 2004 11:05:39 -0500
Mime-Version: 1.0 (Apple Message framework v612)
In-Reply-To: <6.0.3.0.2.20040304062612.01b91708@silver.nane.netapp.com>
References: <6.0.3.0.2.20040304062612.01b91708@silver.nane.netapp.com>
Content-Type: text/plain; charset=US-ASCII; format=flowed
Message-Id: <C2A11140-6DF5-11D8-A101-000A95A94F04@panasas.com>
Content-Transfer-Encoding: 7bit
Date: Thu, 4 Mar 2004 11:05:30 -0500
To: pnfs-reqs@yahoogroups.com
X-Mailer: Apple Mail (2.612)
X-eGroups-Remote-IP: 65.194.124.178
From: Garth Gibson <garth@panasas.com>
Subject: Re: [pnfs-reqs] Seoul results/status
X-Yahoo-Group-Post: member; u=169457820
X-Yahoo-Profile: garth_a_gibson

ADVERTISEMENT
Yes, the regular Thurs 8am PST, 11am EST concall can be used for this.

If anyone has lost the dialin numbers, please send me a note requesting
them.

Thanks
garth

On Mar 4, 2004, at 6:31 AM, Talpey, Thomas wrote:

> I suggest a concall sometime next week to discuss next
> steps. Maybe just use our regular Thursday slot? It's
> important to start some buzz before the 3/31 BOF.

From garth@panasas.com Thu Mar 11 00:08:10 2004
Return-Path: <garth@panasas.com>
X-Sender: garth@panasas.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 96609 invoked from network); 11 Mar 2004 08:08:07 -0000
Received: from unknown (66.218.66.172)
by m9.grp.scd.yahoo.com with QMQP; 11 Mar 2004 08:08:07 -0000
Received: from unknown (HELO PIKES.panasas.com) (65.194.124.178)
by mta4.grp.scd.yahoo.com with SMTP; 11 Mar 2004 08:08:08 -0000
Received: from [127.0.0.1] ([172.17.19.3]) by PIKES.panasas.com with SMTP (Microsoft Exchange Internet Mail Service Version 5.5.2653.13)
id GT4QJN26; Thu, 11 Mar 2004 03:08:07 -0500
Mime-Version: 1.0 (Apple Message framework v612)
Content-Type: text/plain; charset=US-ASCII; format=flowed
Message-Id: <343800F0-7333-11D8-BDD5-000A95A94F04@panasas.com>
Content-Transfer-Encoding: 7bit
Cc: Garth Gibson <garth@panasas.com>
Date: Thu, 11 Mar 2004 00:07:56 -0800
To: pnfs-reqs@yahoogroups.com
X-Mailer: Apple Mail (2.612)
X-eGroups-Remote-IP: 65.194.124.178
From: Garth Gibson <garth@panasas.com>
Subject: reminder: concall at 8am PST, 11am EST Thursday (today)
X-Yahoo-Group-Post: member; u=169457820
X-Yahoo-Profile: garth_a_gibson

ADVERTISEMENT
Tentative agenda:
- Seoul debrief
- making plans for FAST BOF
- review requirements items in flight

garth



From andros@citi.umich.edu Wed Mar 17 09:45:57 2004
Return-Path: <andros@citi.umich.edu>
X-Sender: andros@citi.umich.edu
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 59954 invoked from network); 17 Mar 2004 17:43:41 -0000
Received: from unknown (66.218.66.166)
by m13.grp.scd.yahoo.com with QMQP; 17 Mar 2004 17:43:41 -0000
Received: from unknown (HELO citi.umich.edu) (141.211.133.111)
by mta5.grp.scd.yahoo.com with SMTP; 17 Mar 2004 17:43:41 -0000
Received: from citi.umich.edu (citi.umich.edu [141.211.133.111])
by citi.umich.edu (Postfix) with ESMTP
id B805420F71; Wed, 17 Mar 2004 12:43:40 -0500 (EST)
X-Mailer: exmh version 2.5 07/13/2001 with version: MH 6.8.3 #74[UCI]
To: pnfs-reqs@yahoogroups.com
Cc: pnfs-ops@yahoogroups.com, andros@citi.umich.edu
Mime-Version: 1.0
Content-Type: multipart/mixed ;
boundary="==_Exmh_5501136620"
Date: Wed, 17 Mar 2004 12:43:40 -0500
Message-Id: <20040317174340.B805420F71@citi.umich.edu>
X-eGroups-Remote-IP: 141.211.133.111
From: "William A.(Andy) Adamson" <andros@citi.umich.edu>
Subject: pNFS, MPIO, and client group open
X-Yahoo-Group-Post: member; u=169434965

Sorry for the long email :)

At the conclusion of the NEPS conference last November, Brent Welch emailed
his notes as a starting point for a requirements document (attached). I use
his pNFS extention language to describe a pNFS client using a 'normal open'
servicing an open/write/close with direct access, and a large MPIO application
using a proposed 'group open'.

I note that my knowledge of parallel filesystems is growing, so please excuse
any misconceptions, comments welcome...

The architecture i'm picturing is a large cluster with a Parallel File System
(PFS)
consisting of PFS Meta data servers(PFS MD) and PFS NAS/SAN. I know it's only
one of many architectures the pNFS set of extensions is trying to address.

1000's of pNFS clients
10's of pNFSd, one per PFS MD
100's NAS/SAN


'Normal' open
********************
a) pNFS client issues a compound to one pNFSd consisting of:
OPEN with share: Access/Deny
Multiple pNFSds need to resolve share.
DELEG_ASK: Request Byte-range Delegation
Multiple pNFSds need to resolve delegation
READ/WRITE_IND request direct data access
pNFSd queries PFS MD to get location map

b) pNFS client can then issue READ/WRITE directly to the NAS/SAN using the map
returned in READ/WRITE_IND.

c) pNFS client issues a compound to one pNFSd consisting of:
COMMIT_IND:
CLOSE


An MPIO application opens one very large file, shared by 1000's of compute
clients. Each compute client manipulates its portion of the file. The MPIO
layer manages compute clients so that no client shares a byte range of the
file with another.

This MPIO application consists of
- supervisor code running on 1 MPIO supervisor node
- compute code running on 1000's of MPIO compute nodes

This MPIO application has cyclic behavior.
I) Read initial data
II) compute intermediate result
III) wait for other compute nodes to finish computing
IV) all compute nodes write to file (their portion)
V) compute nodes trade 'edge conditions'
VI) goto II (compute).

While the application is not in IV (writing), another application, say the
visualizer, needs READ access to the file in order to crunch it for
visualization. Visualization is needed to tell if the MPIO application
intermediate results are converging on a solution.

If in step IV all the compute nodes open/write/close as described above as the
Normal open, the pNFSds will be doing a lot of metadata processing: resolving
share and delegation state between themselves as well as delivering per
byte-range layout info. The group open is designed to reduce the metadata
processing from 1000's to one.

I mention a couple of new fcntls used by the MPIO layer to communicate pNFS
state from the supervisor node to the compute nodes. Don't worry about that(!).

Do worry about this: is there anything stoping a compute node from using
OPEN/DELEG_ASK/WRITE_IND state obtained by the supervisor node as described
below? If so, are the changes to pNFS to make this work small enough to be
considered at this time?

Group Open
**********
step IV: supervisor OPENs file, all compute clients write file, supervisor
CLOSES file.

specifically:
a) supervisor issues a compound with
OPEN: Access - Both, Deny Both to a pNFSd
- pNFSds need to resolve the share
- is this a normal nfsv4 OPEN? does pNFSd or the PFS need to know
about the other compute clients?
- do we need the concept of a group clientid?
DELEG_ASK: supervisor asks for WRITE delegation which should be
granted given the OPEN Access-Both, Deny-Both share.
- pNFSds need to resolve delegation request
WRITE_IND: supervisor gets whole file layout info

b) supervisor calls
fcntl(fd, GET_GRPOPEN, cookie_buf);
which returns the filehandle,stateid, and layout map from the
supervisor pNFS.

c) the supervisor code passes filehandle, stateid, and layout map to each
compute
node which calls
fcntl(fd, SET_GRPOPEN, cookie_buf);
the pNFS compute node client receives the filehandle, stateid, and layout map.
performs a local open (nothing need go across the wire) stuffing the
filehandle, stateid, and layout map into it's state tree just as if an across
the wire OPEN/DELEG_ASK/WRITE_IND occured.

d) compute clients use SET_GRPOPEN filehandle, stateid and map to directly
write the data to the appropriate NAS/SAN
- what besides the filehandle, stateid, and layout map is needed?
- when done writing, each compute client issues a COMMIT_IND.

e) when compute clients have flushed all data back to the file, supervisor
issues a compound with

CLOSE



Attachment (not stored)
brent_welch_pnfs_ops
Type: text/plain 

From dnoveck@netapp.com Wed Mar 17 12:23:00 2004
Return-Path: <Dave.Noveck@netapp.com>
X-Sender: Dave.Noveck@netapp.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 38995 invoked from network); 17 Mar 2004 20:22:57 -0000
Received: from unknown (66.218.66.217)
by m9.grp.scd.yahoo.com with QMQP; 17 Mar 2004 20:22:57 -0000
Received: from unknown (HELO mx01.netapp.com) (198.95.226.53)
by mta2.grp.scd.yahoo.com with SMTP; 17 Mar 2004 20:22:59 -0000
Received: from frejya.corp.netapp.com (frejya [10.57.157.119])
by mx01.netapp.com (8.12.10/8.12.10/NTAP-1.4) with ESMTP id i2HKMvJC013738;
Wed, 17 Mar 2004 12:22:57 -0800 (PST)
Received: from svlexc01.hq.netapp.com (svlexc01.corp.netapp.com [10.57.156.135])
by frejya.corp.netapp.com (8.12.9/8.12.9/NTAP-1.5) with ESMTP id i2HKMn3G018277;
Wed, 17 Mar 2004 12:22:56 -0800 (PST)
content-class: urn:content-classes:message
Date: Wed, 17 Mar 2004 11:42:55 -0800
MIME-Version: 1.0
Content-Type: text/plain;
charset="US-ASCII"
Content-Transfer-Encoding: quoted-printable
Message-ID: <C8CF60CFC4D8A74E9945E32CF096548A6D36F8@silver.nane.netapp.com>
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
X-MimeOLE: Produced By Microsoft Exchange V6.0.6249.0
Thread-Topic: [pnfs-ops] pNFS, MPIO, and client group open
Thread-Index: AcQMR7Z6ZGojAvNQQaq0OSeLkpqjuAAC6hBw
To: <pnfs-ops@yahoogroups.com>, <pnfs-reqs@yahoogroups.com>
X-eGroups-Remote-IP: 198.95.226.53
X-eGroups-From: "Noveck, Dave" <Dave.Noveck@netapp.com>
From: "Noveck, Dave" <dnoveck@netapp.com>
Subject: RE: [pnfs-ops] pNFS, MPIO, and client group open
X-Yahoo-Group-Post: member; u=44152831
X-Yahoo-Profile: davidnoveck

Andy wrote:
> Do worry about this: is there anything stoping a compute node from
using
> OPEN/DELEG_ASK/WRITE_IND state obtained by the supervisor node as
described
> below?

I'm going to say "No". I know this wasn't the answer that I gave at the

conference call, (and might not be the answer I give at the next
conference
call :-), but listen to my reasoning before you decide I'm crazy.

In order to resolve this issue it is necessary to get all philosophical
and address the question "What is a computer?". I know lots of people
have already hit delete but I hope somebody is still reading.

Suppose I have an application cluster with 1K nodes and I put on my
marketing
hat (Gee, I hope I don't need a marketing jacket and tie, too :-) and
say "This is really a powerful computer with a thousand (maybe two
thousand)
CPU's". Now that's marketing bullshit but it isn't exactly false.
There
are certainly tasks where you want a large number of CPU's sharing
memory
and a DSM arrangement's performance is going to suck. On the other
hand,
there are applications where having a thousand memories is going to be
much
better than trying to provide adequate memory bandwidth from a single
memory
to many many CPU's.

So what's the point? I think the point is that as far as the NFS server
is
concerned, whether the computer that is talking to it is "really" a
computer,
i.e. it has CPU's sharing memory, or is only a computer qua marketing
bullshit,
i.e. a collection of cpu's that don't share memory, that use other
methods to
co-ordinate common activities, doesn't matter. All the server sees are
the
requests made and if the cluster represents itself as a single machine
(i.e.
in V4 does a single SETCLIENTID or in v4.1 maintains many connections
bound to
a single session), it is one. The server doesn't see the cluster's
memory
architecture. It sees an open and then use of that that stateid. The
fact
that it comes over a different IP address doesn't disqualify it. A
server
might have options to check that (as a matter of security) but it isn't
part
of the protocol and we already have clients with multiple IP addresses.
Having
a thousand of them is a difference of degree (and may pose
implementation
issues) but I don't see a real protocol issue.

OK. Now you can decide if I'm crazy.

-----Original Message-----
From: William A.(Andy) Adamson [mailto:andros@citi.umich.edu]
Sent: Wednesday, March 17, 2004 12:44 PM
To: pnfs-reqs@yahoogroups.com
Cc: pnfs-ops@yahoogroups.com; andros@citi.umich.edu
Subject: [pnfs-ops] pNFS, MPIO, and client group open


Sorry for the long email :)

At the conclusion of the NEPS conference last November, Brent Welch
emailed
his notes as a starting point for a requirements document (attached). I
use
his pNFS extention language to describe a pNFS client using a 'normal
open'
servicing an open/write/close with direct access, and a large MPIO
application
using a proposed 'group open'.

I note that my knowledge of parallel filesystems is growing, so please
excuse
any misconceptions, comments welcome...

The architecture i'm picturing is a large cluster with a Parallel File
System
(PFS)
consisting of PFS Meta data servers(PFS MD) and PFS NAS/SAN. I know it's
only
one of many architectures the pNFS set of extensions is trying to
address.

1000's of pNFS clients
10's of pNFSd, one per PFS MD
100's NAS/SAN


'Normal' open
********************
a) pNFS client issues a compound to one pNFSd consisting of:
OPEN with share: Access/Deny
Multiple pNFSds need to resolve share.
DELEG_ASK: Request Byte-range Delegation
Multiple pNFSds need to resolve delegation
READ/WRITE_IND request direct data access
pNFSd queries PFS MD to get location map

b) pNFS client can then issue READ/WRITE directly to the NAS/SAN using
the map
returned in READ/WRITE_IND.

c) pNFS client issues a compound to one pNFSd consisting of:
COMMIT_IND:
CLOSE


An MPIO application opens one very large file, shared by 1000's of
compute
clients. Each compute client manipulates its portion of the file. The
MPIO
layer manages compute clients so that no client shares a byte range of
the
file with another.

This MPIO application consists of
- supervisor code running on 1 MPIO supervisor node
- compute code running on 1000's of MPIO compute nodes

This MPIO application has cyclic behavior.
I) Read initial data
II) compute intermediate result
III) wait for other compute nodes to finish computing
IV) all compute nodes write to file (their portion)
V) compute nodes trade 'edge conditions'
VI) goto II (compute).

While the application is not in IV (writing), another application, say
the
visualizer, needs READ access to the file in order to crunch it for
visualization. Visualization is needed to tell if the MPIO application
intermediate results are converging on a solution.

If in step IV all the compute nodes open/write/close as described above
as the
Normal open, the pNFSds will be doing a lot of metadata processing:
resolving
share and delegation state between themselves as well as delivering per
byte-range layout info. The group open is designed to reduce the
metadata
processing from 1000's to one.

I mention a couple of new fcntls used by the MPIO layer to communicate
pNFS
state from the supervisor node to the compute nodes. Don't worry about
that(!).

Do worry about this: is there anything stoping a compute node from using
OPEN/DELEG_ASK/WRITE_IND state obtained by the supervisor node as
described
below? If so, are the changes to pNFS to make this work small enough to
be
considered at this time?

Group Open
**********
step IV: supervisor OPENs file, all compute clients write file,
supervisor
CLOSES file.

specifically:
a) supervisor issues a compound with
OPEN: Access - Both, Deny Both to a pNFSd
- pNFSds need to resolve the share
- is this a normal nfsv4 OPEN? does pNFSd or the PFS need to
know
about the other compute clients?
- do we need the concept of a group clientid?
DELEG_ASK: supervisor asks for WRITE delegation which should be
granted given the OPEN Access-Both, Deny-Both share.
- pNFSds need to resolve delegation request
WRITE_IND: supervisor gets whole file layout info

b) supervisor calls
fcntl(fd, GET_GRPOPEN, cookie_buf);
which returns the filehandle,stateid, and layout map from the
supervisor pNFS.

c) the supervisor code passes filehandle, stateid, and layout map to
each
compute
node which calls
fcntl(fd, SET_GRPOPEN, cookie_buf);
the pNFS compute node client receives the filehandle, stateid, and
layout map.
performs a local open (nothing need go across the wire) stuffing the
filehandle, stateid, and layout map into it's state tree just as if an
across
the wire OPEN/DELEG_ASK/WRITE_IND occured.

d) compute clients use SET_GRPOPEN filehandle, stateid and map to
directly
write the data to the appropriate NAS/SAN
- what besides the filehandle, stateid, and layout map is
needed?
- when done writing, each compute client issues a COMMIT_IND.

e) when compute clients have flushed all data back to the file,
supervisor
issues a compound with

CLOSE




Yahoo! Groups Links

From bhalevy@panasas.com Wed Mar 17 12:48:55 2004
Return-Path: <bhalevy@panasas.com>
X-Sender: bhalevy@panasas.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 35299 invoked from network); 17 Mar 2004 20:48:53 -0000
Received: from unknown (66.218.66.218)
by m16.grp.scd.yahoo.com with QMQP; 17 Mar 2004 20:48:53 -0000
Received: from unknown (HELO PIKES.panasas.com) (65.194.124.178)
by mta3.grp.scd.yahoo.com with SMTP; 17 Mar 2004 20:48:52 -0000
Received: by PIKES.panasas.com with Internet Mail Service (5.5.2653.19)
id <GT4QKNKG>; Wed, 17 Mar 2004 15:48:51 -0500
Message-ID: <30489F1321F5C343ACF6872B2CF7942A05D38938@PIKES.panasas.com>
To: "'pnfs-reqs@yahoogroups.com'" <pnfs-reqs@yahoogroups.com>,
pnfs-ops@yahoogroups.com
Date: Wed, 17 Mar 2004 15:48:50 -0500
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2653.19)
Content-Type: text/plain;
charset="iso-8859-1"
X-eGroups-Remote-IP: 65.194.124.178
From: "Halevy, Benny" <bhalevy@panasas.com>
Subject: RE: [pnfs-reqs] RE: [pnfs-ops] pNFS, MPIO, and client group open
X-Yahoo-Group-Post: member; u=169276676
X-Yahoo-Profile: benny_halevy

I completely agree with Dave and I certainly don't think he's
crazy.

I perceive this solution as a "clustered" implementation of a
nfsv4 client in which the v4 drivers in the client cluster are
cooperating and propagating state (e.g. file handles, stateids)
among each other.

I believe that the server should not be able to distinguish
such client from a multi-homed client that may have several
ip addresses.

In the nfsv4 sessions world a (clustered) client may open
multiple connections to the server that are associated with
the same session - this will make life for such client even
easier, I hope.

Benny

>-----Original Message-----
>From: Noveck, Dave [mailto:dnoveck@netapp.com]
>Sent: Wednesday, March 17, 2004 2:43 PM
>To: pnfs-ops@yahoogroups.com; pnfs-reqs@yahoogroups.com
>Subject: [pnfs-reqs] RE: [pnfs-ops] pNFS, MPIO, and client group open
>
>
>Andy wrote:
>> Do worry about this: is there anything stoping a compute node from
>using
>> OPEN/DELEG_ASK/WRITE_IND state obtained by the supervisor node as
>described
>> below?
>
>I'm going to say "No". I know this wasn't the answer that I
>gave at the
>
>conference call, (and might not be the answer I give at the next
>conference
>call :-), but listen to my reasoning before you decide I'm crazy.
>
>In order to resolve this issue it is necessary to get all philosophical
>and address the question "What is a computer?". I know lots of people
>have already hit delete but I hope somebody is still reading.
>
>Suppose I have an application cluster with 1K nodes and I put on my
>marketing
>hat (Gee, I hope I don't need a marketing jacket and tie, too :-) and
>say "This is really a powerful computer with a thousand (maybe two
>thousand)
>CPU's". Now that's marketing bullshit but it isn't exactly false.
>There
>are certainly tasks where you want a large number of CPU's sharing
>memory
>and a DSM arrangement's performance is going to suck. On the other
>hand,
>there are applications where having a thousand memories is going to be
>much
>better than trying to provide adequate memory bandwidth from a single
>memory
>to many many CPU's.
>
>So what's the point? I think the point is that as far as the
>NFS server
>is
>concerned, whether the computer that is talking to it is "really" a
>computer,
>i.e. it has CPU's sharing memory, or is only a computer qua marketing
>bullshit,
>i.e. a collection of cpu's that don't share memory, that use other
>methods to
>co-ordinate common activities, doesn't matter. All the server sees are
>the
>requests made and if the cluster represents itself as a single machine
>(i.e.
>in V4 does a single SETCLIENTID or in v4.1 maintains many connections
>bound to
>a single session), it is one. The server doesn't see the cluster's
>memory
>architecture. It sees an open and then use of that that stateid. The
>fact
>that it comes over a different IP address doesn't disqualify it. A
>server
>might have options to check that (as a matter of security) but it isn't
>part
>of the protocol and we already have clients with multiple IP addresses.
>Having
>a thousand of them is a difference of degree (and may pose
>implementation
>issues) but I don't see a real protocol issue.
>
>OK. Now you can decide if I'm crazy.
>
>-----Original Message-----
>From: William A.(Andy) Adamson [mailto:andros@citi.umich.edu]
>Sent: Wednesday, March 17, 2004 12:44 PM
>To: pnfs-reqs@yahoogroups.com
>Cc: pnfs-ops@yahoogroups.com; andros@citi.umich.edu
>Subject: [pnfs-ops] pNFS, MPIO, and client group open
>
>
>Sorry for the long email :)
>
>At the conclusion of the NEPS conference last November, Brent Welch
>emailed
>his notes as a starting point for a requirements document (attached). I
>use
>his pNFS extention language to describe a pNFS client using a 'normal
>open'
>servicing an open/write/close with direct access, and a large MPIO
>application
>using a proposed 'group open'.
>
>I note that my knowledge of parallel filesystems is growing, so please
>excuse
>any misconceptions, comments welcome...
>
>The architecture i'm picturing is a large cluster with a Parallel File
>System
>(PFS)
>consisting of PFS Meta data servers(PFS MD) and PFS NAS/SAN. I
>know it's
>only
>one of many architectures the pNFS set of extensions is trying to
>address.
>
>1000's of pNFS clients
>10's of pNFSd, one per PFS MD
>100's NAS/SAN
>
>
>'Normal' open
>********************
>a) pNFS client issues a compound to one pNFSd consisting of:
>OPEN with share: Access/Deny
> Multiple pNFSds need to resolve share.
>DELEG_ASK: Request Byte-range Delegation
> Multiple pNFSds need to resolve delegation
>READ/WRITE_IND request direct data access
> pNFSd queries PFS MD to get location map
>
>b) pNFS client can then issue READ/WRITE directly to the NAS/SAN using
>the map
>returned in READ/WRITE_IND.
>
>c) pNFS client issues a compound to one pNFSd consisting of:
>COMMIT_IND:
>CLOSE
>
>
>An MPIO application opens one very large file, shared by 1000's of
>compute
>clients. Each compute client manipulates its portion of the file. The
>MPIO
>layer manages compute clients so that no client shares a byte range of
>the
>file with another.
>
>This MPIO application consists of
> - supervisor code running on 1 MPIO supervisor node
> - compute code running on 1000's of MPIO compute nodes
>
>This MPIO application has cyclic behavior.
>I) Read initial data
>II) compute intermediate result
>III) wait for other compute nodes to finish computing
>IV) all compute nodes write to file (their portion)
>V) compute nodes trade 'edge conditions'
>VI) goto II (compute).
>
>While the application is not in IV (writing), another application, say
>the
>visualizer, needs READ access to the file in order to crunch it for
>visualization. Visualization is needed to tell if the MPIO application
>intermediate results are converging on a solution.
>
>If in step IV all the compute nodes open/write/close as described above
>as the
>Normal open, the pNFSds will be doing a lot of metadata processing:
>resolving
>share and delegation state between themselves as well as delivering per
>byte-range layout info. The group open is designed to reduce the
>metadata
>processing from 1000's to one.
>
>I mention a couple of new fcntls used by the MPIO layer to communicate
>pNFS
>state from the supervisor node to the compute nodes. Don't worry about
>that(!).
>
>Do worry about this: is there anything stoping a compute node
>from using
>OPEN/DELEG_ASK/WRITE_IND state obtained by the supervisor node as
>described
>below? If so, are the changes to pNFS to make this work small enough to
>be
>considered at this time?
>
>Group Open
>**********
>step IV: supervisor OPENs file, all compute clients write file,
>supervisor
>CLOSES file.
>
>specifically:
>a) supervisor issues a compound with
>OPEN: Access - Both, Deny Both to a pNFSd
> - pNFSds need to resolve the share
> - is this a normal nfsv4 OPEN? does pNFSd or the PFS need to
>know
>about the other compute clients?
> - do we need the concept of a group clientid?
>DELEG_ASK: supervisor asks for WRITE delegation which should be
> granted given the OPEN Access-Both, Deny-Both share.
> - pNFSds need to resolve delegation request
>WRITE_IND: supervisor gets whole file layout info
>
>b) supervisor calls
> fcntl(fd, GET_GRPOPEN, cookie_buf);
> which returns the filehandle,stateid, and layout map from the
>supervisor pNFS.
>
>c) the supervisor code passes filehandle, stateid, and layout map to
>each
>compute
>node which calls
> fcntl(fd, SET_GRPOPEN, cookie_buf);
>the pNFS compute node client receives the filehandle, stateid, and
>layout map.
>performs a local open (nothing need go across the wire) stuffing the
>filehandle, stateid, and layout map into it's state tree just as if an
>across
>the wire OPEN/DELEG_ASK/WRITE_IND occured.
>
>d) compute clients use SET_GRPOPEN filehandle, stateid and map to
>directly
>write the data to the appropriate NAS/SAN
> - what besides the filehandle, stateid, and layout map is
>needed?
> - when done writing, each compute client issues a COMMIT_IND.
>
>e) when compute clients have flushed all data back to the file,
>supervisor
>issues a compound with
>
>CLOSE
>
>
>
>
>Yahoo! Groups Links
>
>
>
>
>
>
>
>Yahoo! Groups Links
>
>
>
>
>

From garth@panasas.com Wed Mar 17 21:17:49 2004
Return-Path: <garth@panasas.com>
X-Sender: garth@panasas.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 7857 invoked from network); 18 Mar 2004 05:17:45 -0000
Received: from unknown (66.218.66.172)
by m9.grp.scd.yahoo.com with QMQP; 18 Mar 2004 05:17:45 -0000
Received: from unknown (HELO PIKES.panasas.com) (65.194.124.178)
by mta4.grp.scd.yahoo.com with SMTP; 18 Mar 2004 05:17:47 -0000
Received: from [127.0.0.1] ([172.17.19.3]) by PIKES.panasas.com with SMTP (Microsoft Exchange Internet Mail Service Version 5.5.2653.13)
id GT4QKPM4; Thu, 18 Mar 2004 00:17:46 -0500
Mime-Version: 1.0 (Apple Message framework v612)
Content-Transfer-Encoding: 7bit
Message-Id: <90E9DAD9-789B-11D8-ACBD-000A95A94F04@panasas.com>
Content-Type: text/plain; charset=US-ASCII; format=flowed
To: pnfs-reqs@yahoogroups.com
Date: Thu, 18 Mar 2004 00:17:35 -0500
X-Mailer: Apple Mail (2.612)
X-eGroups-Remote-IP: 65.194.124.178
From: Garth Gibson <garth@panasas.com>
Subject: Concall notes from last week's call (Mar 11 2004, 11am EST)
X-Yahoo-Group-Post: member; u=169457820
X-Yahoo-Profile: garth_a_gibson

FAST BOF planning

We have the Dolores room from 9am to 12 noon on Wed 3/31. It will be
set for 50 people, continental breakfast, LCD projector, screen and
microphone. We have requested that this be extended until 2pm, which
is when the FAST conference starts.

The 9am - 12 noon time slot overlaps the last two sessions of the NSDI
conference 9-10:30 and 11-12:30 (agendas attached below). There is a
special registration deal for FAST attendees to go to these Wed morning
sessions for free, so we should expect many/most will be doing just
that. Some of us even.

We plan to accommodate this conflict by moving the wide open (no FAST
registration needed) BOF to the lunch period between NSDI and FAST,
12:30 - 2 pm on Wed.

We will need to arrange lunch food, and *** HELP NEEDED HERE *** find
funds for it.

The contents of this wide open BOF are educational and direction
setting, not technical debates (though they might so become with an
interesting audience). We propose to plan the 90 mins something like
this:

Reprise Seoul IETF pitch: 20-30 mins (Tom would be best, but won't be
there)
Then 3-6 10-min pitches that cover parts of the possible solutions and
show the range of active contributors. This is to generally show
common focus and the intention to make progress, but if they divide up
the proposed multiple backends and the core ops, that would be great
too. So far:
- Peter Corbett, NetApp
- Brent Welch, Panasas
- Peter Honeyman or Andy Adamson, CITI

If time is left, we suggest a panel session of all speakers for general
Q&A.

I should note that I am the keynote speaker for FAST, which will be 9
am on Thurs 4/1. I intend to pitch this activity in a portion of my
talk, so the entire FAST audience (I hope) will catch at least the
highest level pitch.

=================================================

We need an advertisement plan for this BOF. Posters at FAST and NSDI,
probably. Maybe an email from FAST to registered attendees? Email on
the NFSv4 reflector. Email to SNIA NAS working group and SPEC SFS
committees? Other suggestions?

=================================================

We do not plan to give back the Dolores room from 9 to 12:30. We think
that we should use this for a working meeting of the pNFS participants.
Not closed, per se, but announced only on these distribution lists.
Seems like we should not waste an opportunity for face-to-face debate
on NFSv4 extension operations, requirements text and backend metadata
formats.

=================================================

Last NSDI sessions, Wed 3/31

9-10:30 am: Miguel Castro, chair

Session State: Beyond Soft State
Benjamin C. Ling, Emre Kiciman, and Armando Fox, Stanford University

Path-Based Failure and Evolution Management
Mike Y. Chen, University of California, Berkeley; Anthony Accardi,
Tellme; Emre Kiciman, Stanford University; Dave Patterson, University
of California, Berkeley; Armando Fox, Stanford University; Eric Brewer,
University of California, Berkeley

Consistent and Automatic Replica Regeneration
Haifeng Yu, Intel Research Pittsburgh and Carnegie Mellon University;
Amin Vahdat, University of California, San Diego

11-12:30: Jeff Chase, chair

Total Recall: System Support for Automated Availability Management
Ranjita Bhagwan, Kiran Tati, Yu-Chung Cheng, Stefan Savage, and
Geoffrey M. Voelker, University of California, San Diego

TimeLine: A High Performance Archive for a Distributed Object Store
Chuang-Hue Moh and Barbara Liskov, MIT Computer Science and Artificial
Intelligence Laboratory

Explicit Control in the Batch-Aware Distributed File System
John Bent, Douglas Thain, Andrea C. Arpaci-Dusseau, Remzi H.
Arpaci-Dusseau, and Miron Livny, University of Wisconsin, Madison

==========================================

Seoul debrief (Garth's impressions from the reports of others)
- success -- area director Jon Peterson is now aware of the proposal to
extend the charter of NFSv4 WG, and although in general IETF directors
are discouraging long lived working groups, in this case he is
receptive
- next steps are to work with WG chairs to persuade and assist their
deliberation on this proposal -- we don't know how long this step is,
but it is not short



garth

From dhildebz@eecs.umich.edu Fri Mar 19 09:21:25 2004
Return-Path: <dhildebz@eecs.umich.edu>
X-Sender: dhildebz@eecs.umich.edu
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 99385 invoked from network); 19 Mar 2004 17:21:24 -0000
Received: from unknown (66.218.66.217)
by m18.grp.scd.yahoo.com with QMQP; 19 Mar 2004 17:21:24 -0000
Received: from unknown (HELO willow.eecs.umich.edu) (141.213.4.14)
by mta2.grp.scd.yahoo.com with SMTP; 19 Mar 2004 17:21:24 -0000
Received: from willow.eecs.umich.edu (localhost.eecs.umich.edu [127.0.0.1])
by willow.eecs.umich.edu (8.12.11/8.12.11) with ESMTP id i2JHKtdm013621
(version=TLSv1/SSLv3 cipher=DHE-RSA-AES256-SHA bits=256 verify=NO);
Fri, 19 Mar 2004 12:20:56 -0500
Received: from localhost (dhildebz@localhost)
by willow.eecs.umich.edu (8.12.11/8.12.11/Submit) with ESMTP id i2JHKtSS013618;
Fri, 19 Mar 2004 12:20:55 -0500
X-Authentication-Warning: willow.eecs.umich.edu: dhildebz owned process doing -bs
Date: Fri, 19 Mar 2004 12:20:55 -0500 (EST)
To: "'pnfs-reqs@yahoogroups.com'" <pnfs-reqs@yahoogroups.com>
Cc: pnfs-ops@yahoogroups.com
In-Reply-To: <30489F1321F5C343ACF6872B2CF7942A05D38938@PIKES.panasas.com>
Message-ID: <Pine.LNX.4.44.0403191217210.13483-100000@willow.eecs.umich.edu>
MIME-Version: 1.0
Content-Type: TEXT/PLAIN; charset=iso-8859-1
Content-Transfer-Encoding: QUOTED-PRINTABLE
X-eGroups-Remote-IP: 141.213.4.14
From: Dean Hildebrand <dhildebz@eecs.umich.edu>
Subject: RE: [pnfs-reqs] RE: [pnfs-ops] pNFS, MPIO, and client group open
X-Yahoo-Group-Post: member; u=169352062
X-Yahoo-Profile: seattleplus

Hi Benny,
Did you mean,
'In the nfsv4 sessions world a (clustered) client may open
simultaneous connections to servers associated with the same session'
or
'In the nfsv4 sessions world a (clustered) client may open
multiple simultaneous connections to a server that is associated with
the same session'

I'm assuming the first as I'm not even sure what the second one
means...but I do not know a lot of about sessions.
Dean

On Wed, 17 Mar 2004, Halevy, Benny wrote:

> I completely agree with Dave and I certainly don't think he's
> crazy.
>
> I perceive this solution as a "clustered" implementation of a
> nfsv4 client in which the v4 drivers in the client cluster are
> cooperating and propagating state (e.g. file handles, stateids)
> among each other.
>
> I believe that the server should not be able to distinguish
> such client from a multi-homed client that may have several
> ip addresses.
>
> In the nfsv4 sessions world a (clustered) client may open
> multiple connections to the server that are associated with
> the same session - this will make life for such client even
> easier, I hope.
>
> Benny
>
> >-----Original Message-----
> >From: Noveck, Dave [mailto:dnoveck@netapp.com]
> >Sent: Wednesday, March 17, 2004 2:43 PM
> >To: pnfs-ops@yahoogroups.com; pnfs-reqs@yahoogroups.com
> >Subject: [pnfs-reqs] RE: [pnfs-ops] pNFS, MPIO, and client group open
> >
> >
> >Andy wrote:
> >> Do worry about this: is there anything stoping a compute node from
> >using
> >> OPEN/DELEG_ASK/WRITE_IND state obtained by the supervisor node as
> >described
> >> below?
> >
> >I'm going to say "No".  I know this wasn't the answer that I
> >gave at the
> >
> >conference call, (and might not be the answer I give at the next
> >conference
> >call :-), but listen to my reasoning before you decide I'm crazy.
> >
> >In order to resolve this issue it is necessary to get all philosophical
> >and address the question "What is a computer?".  I know lots of people
> >have already hit delete but I hope somebody is still reading.
> >
> >Suppose I have an application cluster with 1K nodes and I put on my
> >marketing
> >hat (Gee, I hope I don't need a marketing jacket and tie, too :-) and
> >say "This is really a powerful computer with a thousand (maybe two
> >thousand)
> >CPU's".  Now that's marketing bullshit but it isn't exactly false.
> >There
> >are certainly tasks where you want a large number of CPU's sharing
> >memory
> >and a DSM arrangement's performance is going to suck.  On the other
> >hand,
> >there are applications where having a thousand memories is going to be
> >much
> >better than trying to provide adequate memory bandwidth from a single
> >memory
> >to many many CPU's.
> >
> >So what's the point?  I think the point is that as far as the
> >NFS server
> >is
> >concerned, whether the computer that is talking to it is "really" a
> >computer,
> >i.e. it has CPU's sharing memory, or is only a computer qua marketing
> >bullshit,
> >i.e. a collection of cpu's that don't share memory, that use other
> >methods to
> >co-ordinate common activities, doesn't matter.  All the server sees are
> >the
> >requests made and if the cluster represents itself as a single machine
> >(i.e.
> >in V4 does a single SETCLIENTID or in v4.1 maintains many connections
> >bound to
> >a single session), it is one.  The server doesn't see the cluster's
> >memory
> >architecture.  It sees an open and then use of that that stateid.  The
> >fact
> >that it comes over a different IP address doesn't disqualify it.  A
> >server
> >might have options to check that (as a matter of security) but it isn't
> >part
> >of the protocol and we already have clients with multiple IP addresses.
> >Having
> >a thousand of them is a difference of degree (and may pose
> >implementation
> >issues) but I don't see a real protocol issue.
> >
> >OK.  Now you can decide if I'm crazy.
> >
> >-----Original Message-----
> >From: William A.(Andy) Adamson [mailto:andros@citi.umich.edu]
> >Sent: Wednesday, March 17, 2004 12:44 PM
> >To: pnfs-reqs@yahoogroups.com
> >Cc: pnfs-ops@yahoogroups.com; andros@citi.umich.edu
> >Subject: [pnfs-ops] pNFS, MPIO, and client group open
> >
> >
> >Sorry for the long email :)
> >
> >At the conclusion of the NEPS conference last November, Brent Welch
> >emailed
> >his notes as a starting point for a requirements document (attached). I
> >use
> >his pNFS extention language to describe a pNFS client using a 'normal
> >open'
> >servicing an open/write/close with direct access, and a large MPIO
> >application
> >using a proposed 'group open'.
> >
> >I note that my knowledge of parallel filesystems is growing, so please
> >excuse
> >any misconceptions, comments welcome...
> >
> >The architecture i'm picturing is a large cluster with a Parallel File
> >System
> >(PFS)
> >consisting of PFS Meta data servers(PFS MD) and PFS NAS/SAN. I
> >know it's
> >only
> >one of many architectures the pNFS set of extensions is trying to
> >address.
> >
> >1000's of pNFS clients
> >10's of pNFSd, one per PFS MD
> >100's NAS/SAN
> >
> >
> >'Normal' open
> >********************
> >a) pNFS client issues a compound to one pNFSd consisting of:
> >OPEN with share:  Access/Deny
> >        Multiple pNFSds need to resolve share.
> >DELEG_ASK: Request Byte-range Delegation
> >        Multiple pNFSds need to resolve delegation
> >READ/WRITE_IND request direct data access
> >        pNFSd queries PFS MD to get location map
> >
> >b) pNFS client can then issue READ/WRITE directly to the NAS/SAN using
> >the map
> >returned in READ/WRITE_IND.
> >
> >c) pNFS client issues a compound to one pNFSd consisting of:
> >COMMIT_IND:
> >CLOSE
> >
> >
> >An MPIO application opens one very large file, shared by 1000's of
> >compute
> >clients. Each compute client manipulates its portion of the file. The
> >MPIO
> >layer manages compute clients so that no client shares a byte range of
> >the
> >file with another.
> >
> >This MPIO application consists of
> > - supervisor code running on 1 MPIO supervisor node
> > - compute code running on 1000's of MPIO compute nodes
> >
> >This MPIO application has cyclic behavior.
> >I) Read initial data
> >II) compute intermediate result
> >III) wait for other compute nodes to finish computing
> >IV) all compute nodes write to file (their portion)
> >V) compute nodes trade 'edge conditions'
> >VI) goto II (compute).
> >
> >While the application is not in IV (writing), another application, say
> >the
> >visualizer, needs READ access to the file in order to crunch it for
> >visualization. Visualization is needed to tell if the MPIO application
> >intermediate results are converging on a solution.
> >
> >If in step IV all the compute nodes open/write/close as described above
> >as the
> >Normal open, the pNFSds will be doing a lot of metadata processing:
> >resolving
> >share and delegation state between themselves as well as delivering per
> >byte-range layout info. The group open is designed to reduce the
> >metadata
> >processing from 1000's to one.
> >
> >I mention a couple of new fcntls used by the MPIO layer to communicate
> >pNFS
> >state from the supervisor node to the compute nodes. Don't worry about
> >that(!).
> >
> >Do worry about this: is there anything stoping a compute node
> >from using
> >OPEN/DELEG_ASK/WRITE_IND state obtained by the supervisor node as
> >described
> >below? If so, are the changes to pNFS to make this work small enough to
> >be
> >considered at this time?
> >
> >Group Open
> >**********
> >step IV: supervisor OPENs file, all compute clients write file,
> >supervisor
> >CLOSES file.
> >
> >specifically:
> >a) supervisor issues a compound with
> >OPEN: Access - Both, Deny Both to a pNFSd
> >        - pNFSds need to resolve the share
> >        - is this a normal nfsv4 OPEN? does pNFSd or the PFS need to
> >know
> >about the other compute clients?
> >        - do we need the concept of a group clientid?
> >DELEG_ASK: supervisor asks for WRITE delegation which should be
> >        granted given the OPEN Access-Both, Deny-Both share.
> >        - pNFSds need to resolve delegation request
> >WRITE_IND: supervisor gets whole file layout info
> >
> >b) supervisor calls
> >        fcntl(fd, GET_GRPOPEN, cookie_buf);
> >        which returns the filehandle,stateid, and layout map from the
> >supervisor pNFS.
> >
> >c) the supervisor code passes filehandle, stateid, and layout map to
> >each
> >compute
> >node which calls
> >        fcntl(fd, SET_GRPOPEN, cookie_buf);
> >the pNFS compute node client receives the filehandle, stateid, and
> >layout map.
> >performs a local open (nothing need go across the wire) stuffing the
> >filehandle, stateid, and layout map into it's state tree just as if an
> >across
> >the wire OPEN/DELEG_ASK/WRITE_IND occured.
> >
> >d) compute clients use SET_GRPOPEN filehandle, stateid and map to
> >directly
> >write the data to the appropriate NAS/SAN
> >        - what besides the filehandle, stateid, and layout map is
> >needed?
> >        - when done writing, each compute client issues a COMMIT_IND.
> >
> >e) when compute clients have flushed all data back to the file,
> >supervisor
> >issues a compound with
> >
> >CLOSE
> >
> >
> >
> >
> >Yahoo! Groups Links
> >
> >
> >
> >
> >
> >
> >
> >Yahoo! Groups Links
> >
> >
> >
> >
> >
>
> Yahoo! Groups Sponsor
> ADVERTISEMENT
> click here
>
> ________________________________________________________________________________
> Yahoo! Groups Links
> * To visit your group on the web, go to:
> http://groups.yahoo.com/group/pnfs-reqs/
>  
> * To unsubscribe from this group, send an email to:
> pnfs-reqs-unsubscribe@yahoogroups.com
>  
> * Your use of Yahoo! Groups is subject to the Yahoo! Terms of Service.
>
> 

From bhalevy@panasas.com Fri Mar 19 11:32:41 2004
Return-Path: <bhalevy@panasas.com>
X-Sender: bhalevy@panasas.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 44665 invoked from network); 19 Mar 2004 19:32:40 -0000
Received: from unknown (66.218.66.167)
by m18.grp.scd.yahoo.com with QMQP; 19 Mar 2004 19:32:40 -0000
Received: from unknown (HELO PIKES.panasas.com) (65.194.124.178)
by mta6.grp.scd.yahoo.com with SMTP; 19 Mar 2004 19:32:39 -0000
Received: by PIKES.panasas.com with Internet Mail Service (5.5.2653.19)
id <H2LG6R35>; Fri, 19 Mar 2004 14:32:35 -0500
Message-ID: <30489F1321F5C343ACF6872B2CF7942A05D38951@PIKES.panasas.com>
To: "'dhildebz@eecs.umich.edu'" <dhildebz@eecs.umich.edu>
Cc: "'pnfs-ops@yahoogroups.com'" <pnfs-ops@yahoogroups.com>,
"'pnfs-reqs@yahoogroups.com'" <pnfs-reqs@yahoogroups.com>
Date: Fri, 19 Mar 2004 14:32:26 -0500
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2653.19)
Content-Type: text/plain;
charset="iso-8859-1"
Content-Transfer-Encoding: quoted-printable
X-eGroups-Remote-IP: 65.194.124.178
From: "Halevy, Benny" <bhalevy@panasas.com>
Subject: RE: [pnfs-reqs] RE: [pnfs-ops] pNFS, MPIO, and client group open
X-Yahoo-Group-Post: member; u=169276676
X-Yahoo-Profile: benny_halevy

ADVERTISEMENT
What I meant was
In the nfsv4 sessions world a (clustered) client may open
simultaneous connections to a server that are associated
with the same session.

Simply put: one session can have multiple connections associated
with it. The proposed NFSv4 session model
(http://www.ietf.org/internet-drafts/draft-talpey-nfsv4-rdma-sess-01.txt)
have another abstraction, a channel, that needs to be thought of too.

My intuition is that for a clustered client, which I think of as
a single logical NFSv4 client, (i.e. all nfsv4 client instances share
the same client id and state), we definitely want all connections to bind
to the same session. It makes a lot of sense in this architecture
to have separate operations channels and back channels for the
client hosts, one or more of each per host. Assuming the
client hosts do not share memory, it will be a burden on them
to manage the per-channel resource management state.

Benny

>-----Original Message-----
>From: Dean Hildebrand [mailto:dhildebz@eecs.umich.edu]
>Sent: Friday, March 19, 2004 12:21 PM
>To: 'pnfs-reqs@yahoogroups.com'
>Cc: pnfs-ops@yahoogroups.com
>Subject: RE: [pnfs-reqs] RE: [pnfs-ops] pNFS, MPIO, and client group
>open
>
>
>Hi Benny,
>Did you mean,
> 'In the nfsv4 sessions world a (clustered) client may open
> simultaneous connections to servers associated with the same session'
>or
> 'In the nfsv4 sessions world a (clustered) client may open
> multiple simultaneous connections to a server that is associated with
> the same session'
>
>I'm assuming the first as I'm not even sure what the second one
>means...but I do not know a lot of about sessions.
>Dean
>
>On Wed, 17 Mar 2004, Halevy, Benny wrote:
>
>> I completely agree with Dave and I certainly don't think he's
>> crazy.
>>
>> I perceive this solution as a "clustered" implementation of a
>> nfsv4 client in which the v4 drivers in the client cluster are
>> cooperating and propagating state (e.g. file handles, stateids)
>> among each other.
>>
>> I believe that the server should not be able to distinguish
>> such client from a multi-homed client that may have several
>> ip addresses.
>>
>> In the nfsv4 sessions world a (clustered) client may open
>> multiple connections to the server that are associated with
>> the same session - this will make life for such client even
>> easier, I hope.
>>
>> Benny
>>
>> >-----Original Message-----
>> >From: Noveck, Dave [mailto:dnoveck@netapp.com]
>> >Sent: Wednesday, March 17, 2004 2:43 PM
>> >To: pnfs-ops@yahoogroups.com; pnfs-reqs@yahoogroups.com
>> >Subject: [pnfs-reqs] RE: [pnfs-ops] pNFS, MPIO, and client
>group open
>> >
>> >
>> >Andy wrote:
>> >> Do worry about this: is there anything stoping a compute node from
>> >using
>> >> OPEN/DELEG_ASK/WRITE_IND state obtained by the supervisor node as
>> >described
>> >> below?
>> >
>> >I'm going to say "No".  I know this wasn't the answer that I
>> >gave at the
>> >
>> >conference call, (and might not be the answer I give at the next
>> >conference
>> >call :-), but listen to my reasoning before you decide I'm crazy.
>> >
>> >In order to resolve this issue it is necessary to get all
>philosophical
>> >and address the question "What is a computer?".  I know
>lots of people
>> >have already hit delete but I hope somebody is still reading.
>> >
>> >Suppose I have an application cluster with 1K nodes and I put on my
>> >marketing
>> >hat (Gee, I hope I don't need a marketing jacket and tie,
>too :-) and
>> >say "This is really a powerful computer with a thousand (maybe two
>> >thousand)
>> >CPU's".  Now that's marketing bullshit but it isn't exactly false.
>> >There
>> >are certainly tasks where you want a large number of CPU's sharing
>> >memory
>> >and a DSM arrangement's performance is going to suck.  On the other
>> >hand,
>> >there are applications where having a thousand memories is
>going to be
>> >much
>> >better than trying to provide adequate memory bandwidth
>from a single
>> >memory
>> >to many many CPU's.
>> >
>> >So what's the point?  I think the point is that as far as the
>> >NFS server
>> >is
>> >concerned, whether the computer that is talking to it is "really" a
>> >computer,
>> >i.e. it has CPU's sharing memory, or is only a computer qua
>marketing
>> >bullshit,
>> >i.e. a collection of cpu's that don't share memory, that use other
>> >methods to
>> >co-ordinate common activities, doesn't matter.  All the
>server sees are
>> >the
>> >requests made and if the cluster represents itself as a
>single machine
>> >(i.e.
>> >in V4 does a single SETCLIENTID or in v4.1 maintains many
>connections
>> >bound to
>> >a single session), it is one.  The server doesn't see the cluster's
>> >memory
>> >architecture.  It sees an open and then use of that that
>stateid.  The
>> >fact
>> >that it comes over a different IP address doesn't disqualify it.  A
>> >server
>> >might have options to check that (as a matter of security)
>but it isn't
>> >part
>> >of the protocol and we already have clients with multiple
>IP addresses.
>> >Having
>> >a thousand of them is a difference of degree (and may pose
>> >implementation
>> >issues) but I don't see a real protocol issue.
>> >
>> >OK.  Now you can decide if I'm crazy.
>> >
>> >-----Original Message-----
>> >From: William A.(Andy) Adamson [mailto:andros@citi.umich.edu]
>> >Sent: Wednesday, March 17, 2004 12:44 PM
>> >To: pnfs-reqs@yahoogroups.com
>> >Cc: pnfs-ops@yahoogroups.com; andros@citi.umich.edu
>> >Subject: [pnfs-ops] pNFS, MPIO, and client group open
>> >
>> >
>> >Sorry for the long email :)
>> >
>> >At the conclusion of the NEPS conference last November, Brent Welch
>> >emailed
>> >his notes as a starting point for a requirements document
>(attached). I
>> >use
>> >his pNFS extention language to describe a pNFS client using
>a 'normal
>> >open'
>> >servicing an open/write/close with direct access, and a large MPIO
>> >application
>> >using a proposed 'group open'.
>> >
>> >I note that my knowledge of parallel filesystems is
>growing, so please
>> >excuse
>> >any misconceptions, comments welcome...
>> >
>> >The architecture i'm picturing is a large cluster with a
>Parallel File
>> >System
>> >(PFS)
>> >consisting of PFS Meta data servers(PFS MD) and PFS NAS/SAN. I
>> >know it's
>> >only
>> >one of many architectures the pNFS set of extensions is trying to
>> >address.
>> >
>> >1000's of pNFS clients
>> >10's of pNFSd, one per PFS MD
>> >100's NAS/SAN
>> >
>> >
>> >'Normal' open
>> >********************
>> >a) pNFS client issues a compound to one pNFSd consisting of:
>> >OPEN with share:  Access/Deny
>> >        Multiple pNFSds need to resolve share.
>> >DELEG_ASK: Request Byte-range Delegation
>> >        Multiple pNFSds need to resolve delegation
>> >READ/WRITE_IND request direct data access
>> >        pNFSd queries PFS MD to get location map
>> >
>> >b) pNFS client can then issue READ/WRITE directly to the
>NAS/SAN using
>> >the map
>> >returned in READ/WRITE_IND.
>> >
>> >c) pNFS client issues a compound to one pNFSd consisting of:
>> >COMMIT_IND:
>> >CLOSE
>> >
>> >
>> >An MPIO application opens one very large file, shared by 1000's of
>> >compute
>> >clients. Each compute client manipulates its portion of the
>file. The
>> >MPIO
>> >layer manages compute clients so that no client shares a
>byte range of
>> >the
>> >file with another.
>> >
>> >This MPIO application consists of
>> > - supervisor code running on 1 MPIO supervisor node
>> > - compute code running on 1000's of MPIO compute nodes
>> >
>> >This MPIO application has cyclic behavior.
>> >I) Read initial data
>> >II) compute intermediate result
>> >III) wait for other compute nodes to finish computing
>> >IV) all compute nodes write to file (their portion)
>> >V) compute nodes trade 'edge conditions'
>> >VI) goto II (compute).
>> >
>> >While the application is not in IV (writing), another
>application, say
>> >the
>> >visualizer, needs READ access to the file in order to crunch it for
>> >visualization. Visualization is needed to tell if the MPIO
>application
>> >intermediate results are converging on a solution.
>> >
>> >If in step IV all the compute nodes open/write/close as
>described above
>> >as the
>> >Normal open, the pNFSds will be doing a lot of metadata processing:
>> >resolving
>> >share and delegation state between themselves as well as
>delivering per
>> >byte-range layout info. The group open is designed to reduce the
>> >metadata
>> >processing from 1000's to one.
>> >
>> >I mention a couple of new fcntls used by the MPIO layer to
>communicate
>> >pNFS
>> >state from the supervisor node to the compute nodes. Don't
>worry about
>> >that(!).
>> >
>> >Do worry about this: is there anything stoping a compute node
>> >from using
>> >OPEN/DELEG_ASK/WRITE_IND state obtained by the supervisor node as
>> >described
>> >below? If so, are the changes to pNFS to make this work
>small enough to
>> >be
>> >considered at this time?
>> >
>> >Group Open
>> >**********
>> >step IV: supervisor OPENs file, all compute clients write file,
>> >supervisor
>> >CLOSES file.
>> >
>> >specifically:
>> >a) supervisor issues a compound with
>> >OPEN: Access - Both, Deny Both to a pNFSd
>> >        - pNFSds need to resolve the share
>> >        - is this a normal nfsv4 OPEN? does pNFSd or the PFS need to
>> >know
>> >about the other compute clients?
>> >        - do we need the concept of a group clientid?
>> >DELEG_ASK: supervisor asks for WRITE delegation which should be
>> >        granted given the OPEN Access-Both, Deny-Both share.
>> >        - pNFSds need to resolve delegation request
>> >WRITE_IND: supervisor gets whole file layout info
>> >
>> >b) supervisor calls
>> >        fcntl(fd, GET_GRPOPEN, cookie_buf);
>> >        which returns the filehandle,stateid, and layout
>map from the
>> >supervisor pNFS.
>> >
>> >c) the supervisor code passes filehandle, stateid, and layout map to
>> >each
>> >compute
>> >node which calls
>> >        fcntl(fd, SET_GRPOPEN, cookie_buf);
>> >the pNFS compute node client receives the filehandle, stateid, and
>> >layout map.
>> >performs a local open (nothing need go across the wire) stuffing the
>> >filehandle, stateid, and layout map into it's state tree
>just as if an
>> >across
>> >the wire OPEN/DELEG_ASK/WRITE_IND occured.
>> >
>> >d) compute clients use SET_GRPOPEN filehandle, stateid and map to
>> >directly
>> >write the data to the appropriate NAS/SAN
>> >        - what besides the filehandle, stateid, and layout map is
>> >needed?
>> >        - when done writing, each compute client issues a
>COMMIT_IND.
>> >
>> >e) when compute clients have flushed all data back to the file,
>> >supervisor
>> >issues a compound with
>> >
>> >CLOSE
>> >
>> >
>> >
>> >
>> >Yahoo! Groups Links
>> >
>> >
>> >
>> >
>> >
>> >
>> >
>> >Yahoo! Groups Links
>> >
>> >
>> >
>> >
>> >
>>
>> Yahoo! Groups Sponsor
>> ADVERTISEMENT
>> click here
>>
>>
>_______________________________________________________________
>_________________
>> Yahoo! Groups Links
>> * To visit your group on the web, go to:
>> http://groups.yahoo.com/group/pnfs-reqs/
>>  
>> * To unsubscribe from this group, send an email to:
>> pnfs-reqs-unsubscribe@yahoogroups.com
>>  
>> * Your use of Yahoo! Groups is subject to the Yahoo! Terms
>of Service.
>>
>>
>
>
>
>------------------------ Yahoo! Groups Sponsor
>---------------------~-->
>Upgrade to 128-bit SSL Security!
>http://us.click.yahoo.com/LPJzrA/yjVHAA/TtwFAA/W6uqlB/TM
>---------------------------------------------------------------
>------~->
>
>
>Yahoo! Groups Links
>
>
>
>
>


From Thomas.Talpey@netapp.com Fri Mar 19 20:00:18 2004
Return-Path: <Thomas.Talpey@netapp.com>
X-Sender: Thomas.Talpey@netapp.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 80343 invoked from network); 20 Mar 2004 04:00:16 -0000
Received: from unknown (66.218.66.167)
by m17.grp.scd.yahoo.com with QMQP; 20 Mar 2004 04:00:16 -0000
Received: from unknown (HELO mx01.netapp.com) (198.95.226.53)
by mta6.grp.scd.yahoo.com with SMTP; 20 Mar 2004 04:00:16 -0000
Received: from hawk.corp.netapp.com (hawk [10.57.156.122])
by mx01.netapp.com (8.12.10/8.12.10/NTAP-1.4) with ESMTP id i2K40FZh029459;
Fri, 19 Mar 2004 20:00:15 -0800 (PST)
Received: from svlexc01.hq.netapp.com (svlexc01.corp.netapp.com [10.57.156.135])
by hawk.corp.netapp.com (8.12.9/8.12.9/NTAP-1.5) with ESMTP id i2K404Tt010051;
Fri, 19 Mar 2004 20:00:15 -0800 (PST)
Received: from tmt.netapp.com ([10.97.6.35]) by silver.hq.netapp.com with Microsoft SMTPSVC(5.0.2195.5329); Fri, 19 Mar 2004 22:59:53 -0500
MIME-Version: 1.0
Content-Type: multipart/alternative;
boundary="----_=_NextPart_001_01C40E2F.CC79C280"
X-MimeOLE: Produced By Microsoft Exchange V6.0.6249.0
content-class: urn:content-classes:message
Date: Fri, 19 Mar 2004 18:09:55 -0800
Message-ID: <6.0.3.0.2.20040319210159.01ec0508@silver.nane.netapp.com>
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
Thread-Topic: [pnfs-ops] pNFS, MPIO, and client group open
Thread-Index: AcQOL8zPrk1+CSVUSeO7GdDagJ+XWg==
To: <pnfs-ops@yahoogroups.com>
Cc: <pnfs-reqs@yahoogroups.com>
X-eGroups-Remote-IP: 198.95.226.53
From: "Talpey, Thomas" <Thomas.Talpey@netapp.com>
Subject: RE: [pnfs-ops] pNFS, MPIO, and client group open
X-Yahoo-Group-Post: member; u=44154239
X-Yahoo-Profile: tmtymailu

I can tell you what the v4/sessions world might say - both.
Currently the sessions proposal explores a single client opening
multiple connections to a given server, then binding them together
by a single session. This allows trunking, failover, etc.

During the call I described that there is another possibility, that
multiple servers can share a session. This would allow the client
to stripe, in a similar way that the one-client-one-server trunks.

In fact, we could also consider multiple clients sharing a session,
but that makes my head hurt at the moment.

Basically, you can think of a session as a mount point, abstracted
to the server. The client would generally create one for each new
mount, and bind both operation and callback channels to it. In the
pNFS case, the server would exchange topology with the client,
which in turn would lead to additional (parallel) pipes to the data
being created and bound by the client.

The important thing is that the whole picture hinges on the scope
of the clientid (or sessionid). When we talk about a server-to-server
protocol to allow standard server pooling, we effectively are making
a way for this scope to be distributed. BTW I think we should defer
that...

<http://www.ietf.org/internet-drafts/draft-talpey-nfsv4-rdma-sess-01.txt>

Tom.

[Do we really need to cc pnfs-reqs on these? Is everyone on both?]


At 12:20 PM 3/19/2004, Dean Hildebrand wrote:
>Hi Benny,
>Did you mean,
> 'In the nfsv4 sessions world a (clustered) client may open
> simultaneous connections to servers associated with the same session'
>or
> 'In the nfsv4 sessions world a (clustered) client may open
> multiple simultaneous connections to a server that is associated with
> the same session'
>
>I'm assuming the first as I'm not even sure what the second one
>means...but I do not know a lot of about sessions.
>Dean
>
>On Wed, 17 Mar 2004, Halevy, Benny wrote:
>
>> I completely agree with Dave and I certainly don't think he's
>> crazy.
>>
>> I perceive this solution as a "clustered" implementation of a
>> nfsv4 client in which the v4 drivers in the client cluster are
>> cooperating and propagating state (e.g. file handles, stateids)
>> among each other.
>>
>> I believe that the server should not be able to distinguish
>> such client from a multi-homed client that may have several
>> ip addresses.
>>
>> In the nfsv4 sessions world a (clustered) client may open
>> multiple connections to the server that are associated with
>> the same session - this will make life for such client even
>> easier, I hope.
>>
>> Benny
>>
>> >-----Original Message-----
>> >From: Noveck, Dave [mailto:dnoveck@netapp.com]
>> >Sent: Wednesday, March 17, 2004 2:43 PM
>> >To: pnfs-ops@yahoogroups.com; pnfs-reqs@yahoogroups.com
>> >Subject: [pnfs-reqs] RE: [pnfs-ops] pNFS, MPIO, and client group open
>> >
>> >
>> >Andy wrote:
>> >> Do worry about this: is there anything stoping a compute node from
>> >using
>> >> OPEN/DELEG_ASK/WRITE_IND state obtained by the supervisor node as
>> >described
>> >> below?
>> >
>> >I'm going to say "No".  I know this wasn't the answer that I
>> >gave at the
>> >
>> >conference call, (and might not be the answer I give at the next
>> >conference
>> >call :-), but listen to my reasoning before you decide I'm crazy.
>> >
>> >In order to resolve this issue it is necessary to get all philosophical
>> >and address the question "What is a computer?".  I know lots of people
>> >have already hit delete but I hope somebody is still reading.
>> >
>> >Suppose I have an application cluster with 1K nodes and I put on my
>> >marketing
>> >hat (Gee, I hope I don't need a marketing jacket and tie, too :-) and
>> >say "This is really a powerful computer with a thousand (maybe two
>> >thousand)
>> >CPU's".  Now that's marketing bullshit but it isn't exactly false.
>> >There
>> >are certainly tasks where you want a large number of CPU's sharing
>> >memory
>> >and a DSM arrangement's performance is going to suck.  On the other
>> >hand,
>> >there are applications where having a thousand memories is going to be
>> >much
>> >better than trying to provide adequate memory bandwidth from a single
>> >memory
>> >to many many CPU's.
>> >
>> >So what's the point?  I think the point is that as far as the
>> >NFS server
>> >is
>> >concerned, whether the computer that is talking to it is "really" a
>> >computer,
>> >i.e. it has CPU's sharing memory, or is only a computer qua marketing
>> >bullshit,
>> >i.e. a collection of cpu's that don't share memory, that use other
>> >methods to
>> >co-ordinate common activities, doesn't matter.  All the server sees are
>> >the
>> >requests made and if the cluster represents itself as a single machine
>> >(i.e.
>> >in V4 does a single SETCLIENTID or in v4.1 maintains many connections
>> >bound to
>> >a single session), it is one.  The server doesn't see the cluster's
>> >memory
>> >architecture.  It sees an open and then use of that that stateid.  The
>> >fact
>> >that it comes over a different IP address doesn't disqualify it.  A
>> >server
>> >might have options to check that (as a matter of security) but it isn't
>> >part
>> >of the protocol and we already have clients with multiple IP addresses.
>> >Having
>> >a thousand of them is a difference of degree (and may pose
>> >implementation
>> >issues) but I don't see a real protocol issue.
>> >
>> >OK.  Now you can decide if I'm crazy.
>> >
>> >-----Original Message-----
>> >From: William A.(Andy) Adamson [mailto:andros@citi.umich.edu]
>> >Sent: Wednesday, March 17, 2004 12:44 PM
>> >To: pnfs-reqs@yahoogroups.com
>> >Cc: pnfs-ops@yahoogroups.com; andros@citi.umich.edu
>> >Subject: [pnfs-ops] pNFS, MPIO, and client group open
>> >
>> >
>> >Sorry for the long email :)
>> >
>> >At the conclusion of the NEPS conference last November, Brent Welch
>> >emailed
>> >his notes as a starting point for a requirements document (attached). I
>> >use
>> >his pNFS extention language to describe a pNFS client using a 'normal
>> >open'
>> >servicing an open/write/close with direct access, and a large MPIO
>> >application
>> >using a proposed 'group open'.
>> >
>> >I note that my knowledge of parallel filesystems is growing, so please
>> >excuse
>> >any misconceptions, comments welcome...
>> >
>> >The architecture i'm picturing is a large cluster with a Parallel File
>> >System
>> >(PFS)
>> >consisting of PFS Meta data servers(PFS MD) and PFS NAS/SAN. I
>> >know it's
>> >only
>> >one of many architectures the pNFS set of extensions is trying to
>> >address.
>> >
>> >1000's of pNFS clients
>> >10's of pNFSd, one per PFS MD
>> >100's NAS/SAN
>> >
>> >
>> >'Normal' open
>> >********************
>> >a) pNFS client issues a compound to one pNFSd consisting of:
>> >OPEN with share:  Access/Deny
>> >        Multiple pNFSds need to resolve share.
>> >DELEG_ASK: Request Byte-range Delegation
>> >        Multiple pNFSds need to resolve delegation
>> >READ/WRITE_IND request direct data access
>> >        pNFSd queries PFS MD to get location map
>> >
>> >b) pNFS client can then issue READ/WRITE directly to the NAS/SAN using
>> >the map
>> >returned in READ/WRITE_IND.
>> >
>> >c) pNFS client issues a compound to one pNFSd consisting of:
>> >COMMIT_IND:
>> >CLOSE
>> >
>> >
>> >An MPIO application opens one very large file, shared by 1000's of
>> >compute
>> >clients. Each compute client manipulates its portion of the file. The
>> >MPIO
>> >layer manages compute clients so that no client shares a byte range of
>> >the
>> >file with another.
>> >
>> >This MPIO application consists of
>> > - supervisor code running on 1 MPIO supervisor node
>> > - compute code running on 1000's of MPIO compute nodes
>> >
>> >This MPIO application has cyclic behavior.
>> >I) Read initial data
>> >II) compute intermediate result
>> >III) wait for other compute nodes to finish computing
>> >IV) all compute nodes write to file (their portion)
>> >V) compute nodes trade 'edge conditions'
>> >VI) goto II (compute).
>> >
>> >While the application is not in IV (writing), another application, say
>> >the
>> >visualizer, needs READ access to the file in order to crunch it for
>> >visualization. Visualization is needed to tell if the MPIO application
>> >intermediate results are converging on a solution.
>> >
>> >If in step IV all the compute nodes open/write/close as described above
>> >as the
>> >Normal open, the pNFSds will be doing a lot of metadata processing:
>> >resolving
>> >share and delegation state between themselves as well as delivering per
>> >byte-range layout info. The group open is designed to reduce the
>> >metadata
>> >processing from 1000's to one.
>> >
>> >I mention a couple of new fcntls used by the MPIO layer to communicate
>> >pNFS
>> >state from the supervisor node to the compute nodes. Don't worry about
>> >that(!).
>> >
>> >Do worry about this: is there anything stoping a compute node
>> >from using
>> >OPEN/DELEG_ASK/WRITE_IND state obtained by the supervisor node as
>> >described
>> >below? If so, are the changes to pNFS to make this work small enough to
>> >be
>> >considered at this time?
>> >
>> >Group Open
>> >**********
>> >step IV: supervisor OPENs file, all compute clients write file,
>> >supervisor
>> >CLOSES file.
>> >
>> >specifically:
>> >a) supervisor issues a compound with
>> >OPEN: Access - Both, Deny Both to a pNFSd
>> >        - pNFSds need to resolve the share
>> >        - is this a normal nfsv4 OPEN? does pNFSd or the PFS need to
>> >know
>> >about the other compute clients?
>> >        - do we need the concept of a group clientid?
>> >DELEG_ASK: supervisor asks for WRITE delegation which should be
>> >        granted given the OPEN Access-Both, Deny-Both share.
>> >        - pNFSds need to resolve delegation request
>> >WRITE_IND: supervisor gets whole file layout info
>> >
>> >b) supervisor calls
>> >        fcntl(fd, GET_GRPOPEN, cookie_buf);
>> >        which returns the filehandle,stateid, and layout map from the
>> >supervisor pNFS.
>> >
>> >c) the supervisor code passes filehandle, stateid, and layout map to
>> >each
>> >compute
>> >node which calls
>> >        fcntl(fd, SET_GRPOPEN, cookie_buf);
>> >the pNFS compute node client receives the filehandle, stateid, and
>> >layout map.
>> >performs a local open (nothing need go across the wire) stuffing the
>> >filehandle, stateid, and layout map into it's state tree just as if an
>> >across
>> >the wire OPEN/DELEG_ASK/WRITE_IND occured.
>> >
>> >d) compute clients use SET_GRPOPEN filehandle, stateid and map to
>> >directly
>> >write the data to the appropriate NAS/SAN
>> >        - what besides the filehandle, stateid, and layout map is
>> >needed?
>> >        - when done writing, each compute client issues a COMMIT_IND.
>> >
>> >e) when compute clients have flushed all data back to the file,
>> >supervisor
>> >issues a compound with
>> >
>> >CLOSE
>> >
>> >
>> >
>> >
>> >Yahoo! Groups Links
>> >
>> >
>> >
>> >
>> >
>> >
>> >
>> >Yahoo! Groups Links
>> >
>> >
>> >
>> >
>> >
>>
>> Yahoo! Groups Sponsor
>> ADVERTISEMENT
>> click here
>>
>>
>____________________________________________________________________________
>____
>> Yahoo! Groups Links
>>  *  To visit your group on the web, go to:
>>     http://groups.yahoo.com/group/pnfs-reqs/
>>     
>>  *  To unsubscribe from this group, send an email to:
>>     pnfs-reqs-unsubscribe@yahoogroups.com
>>     
>>  *  Your use of Yahoo! Groups is subject to the Yahoo! Terms of Service.
>>
>>
>
>
>
>------------------------ Yahoo! Groups Sponsor ---------------------~-->
>Upgrade to 128-bit SSL Security!
>http://us.click.yahoo.com/LPJzrA/yjVHAA/TtwFAA/W6uqlB/TM
>---------------------------------------------------------------------~->
>
>
>Yahoo! Groups Links
>
><*> To visit your group on the web, go to:
>     http://groups.yahoo.com/group/pnfs-reqs/
>
><*> To unsubscribe from this group, send an email to:
>     pnfs-reqs-unsubscribe@yahoogroups.com
>
><*> Your use of Yahoo! Groups is subject to:
>     http://docs.yahoo.com/info/terms/
> 

From julian_satran@il.ibm.com Sun Mar 21 15:10:05 2004
Return-Path: <Julian_Satran@il.ibm.com>
X-Sender: Julian_Satran@il.ibm.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 77571 invoked from network); 21 Mar 2004 23:10:03 -0000
Received: from unknown (66.218.66.167)
by m14.grp.scd.yahoo.com with QMQP; 21 Mar 2004 23:10:03 -0000
Received: from unknown (HELO mtagate3.de.ibm.com) (195.212.29.152)
by mta6.grp.scd.yahoo.com with SMTP; 21 Mar 2004 23:09:55 -0000
Received: from d12relay02.megacenter.de.ibm.com (d12relay02.megacenter.de.ibm.com [9.149.165.196])
by mtagate3.de.ibm.com (8.12.10/8.12.10) with ESMTP id i2LN9ixJ130726;
Sun, 21 Mar 2004 23:09:44 GMT
Received: from d12ml102.megacenter.de.ibm.com (d12av02.megacenter.de.ibm.com [9.149.165.228])
by d12relay02.megacenter.de.ibm.com (8.12.10/NCO/VER6.6) with ESMTP id i2LN9jgh085230;
Mon, 22 Mar 2004 00:09:46 +0100
In-Reply-To: <30489F1321F5C343ACF6872B2CF7942A05D38938@PIKES.panasas.com>
To: pnfs-ops@yahoogroups.com
Cc: pnfs-ops@yahoogroups.com,
"'pnfs-reqs@yahoogroups.com'" <pnfs-reqs@yahoogroups.com>
MIME-Version: 1.0
X-Mailer: Lotus Notes Release 6.5.1 January 21, 2004
Message-ID: <OF5B14E311.0C171A1C-ONC2256E5E.00605204-C2256E5E.007F38C3@il.ibm.com>
Date: Mon, 22 Mar 2004 01:11:48 +0200
X-MIMETrack: Serialize by Router on D12ML102/12/M/IBM(Release 6.0.2CF2|July 23, 2003) at
22/03/2004 01:11:58,
Serialize complete at 22/03/2004 01:11:58
Content-Type: multipart/alternative; boundary="=_alternative 00608DF1C2256E5E_="
X-eGroups-Remote-IP: 195.212.29.152
From: Julian Satran <julian_satran@il.ibm.com>
Subject: RE: [pnfs-reqs] RE: [pnfs-ops] pNFS, MPIO, and client group open
X-Yahoo-Group-Post: member; u=64714603
X-Yahoo-Profile: julian_satran

ADVERTISEMENT
click here

I am sure we all want to be aware of this "twist". It may be more than just putting a label on it - e.g., security protocols must be able to "delegate" credentials and that makes some bindings bad.

Julo


"Halevy, Benny" <bhalevy@panasas.com>

17/03/04 22:48
Please respond to
pnfs-ops

	
To
	"'pnfs-reqs@yahoogroups.com'" <pnfs-reqs@yahoogroups.com>, pnfs-ops@yahoogroups.com
cc
	
Subject
	RE: [pnfs-reqs] RE: [pnfs-ops] pNFS, MPIO, and client group open

	




I completely agree with Dave and I certainly don't think he's
crazy.

I perceive this solution as a "clustered" implementation of a
nfsv4 client in which the v4 drivers in the client cluster are
cooperating and propagating state (e.g. file handles, stateids)
among each other.

I believe that the server should not be able to distinguish
such client from a multi-homed client that may have several
ip addresses.

In the nfsv4 sessions world a (clustered) client may open
multiple connections to the server that are associated with
the same session - this will make life for such client even
easier, I hope.

Benny

>-----Original Message-----
>From: Noveck, Dave [mailto:dnoveck@netapp.com]
>Sent: Wednesday, March 17, 2004 2:43 PM
>To: pnfs-ops@yahoogroups.com; pnfs-reqs@yahoogroups.com
>Subject: [pnfs-reqs] RE: [pnfs-ops] pNFS, MPIO, and client group open
>
>
>Andy wrote:
>> Do worry about this: is there anything stoping a compute node from
>using
>> OPEN/DELEG_ASK/WRITE_IND state obtained by the supervisor node as
>described
>> below?
>
>I'm going to say "No".  I know this wasn't the answer that I
>gave at the
>
>conference call, (and might not be the answer I give at the next
>conference
>call :-), but listen to my reasoning before you decide I'm crazy.
>
>In order to resolve this issue it is necessary to get all philosophical
>and address the question "What is a computer?".  I know lots of people
>have already hit delete but I hope somebody is still reading.
>
>Suppose I have an application cluster with 1K nodes and I put on my
>marketing
>hat (Gee, I hope I don't need a marketing jacket and tie, too :-) and
>say "This is really a powerful computer with a thousand (maybe two
>thousand)
>CPU's".  Now that's marketing bullshit but it isn't exactly false.
>There
>are certainly tasks where you want a large number of CPU's sharing
>memory
>and a DSM arrangement's performance is going to suck.  On the other
>hand,
>there are applications where having a thousand memories is going to be
>much
>better than trying to provide adequate memory bandwidth from a single
>memory
>to many many CPU's.
>
>So what's the point?  I think the point is that as far as the
>NFS server
>is
>concerned, whether the computer that is talking to it is "really" a
>computer,
>i.e. it has CPU's sharing memory, or is only a computer qua marketing
>bullshit,
>i.e. a collection of cpu's that don't share memory, that use other
>methods to
>co-ordinate common activities, doesn't matter.  All the server sees are
>the
>requests made and if the cluster represents itself as a single machine
>(i.e.
>in V4 does a single SETCLIENTID or in v4.1 maintains many connections
>bound to
>a single session), it is one.  The server doesn't see the cluster's
>memory
>architecture.  It sees an open and then use of that that stateid.  The
>fact
>that it comes over a different IP address doesn't disqualify it.  A
>server
>might have options to check that (as a matter of security) but it isn't
>part
>of the protocol and we already have clients with multiple IP addresses.
>Having
>a thousand of them is a difference of degree (and may pose
>implementation
>issues) but I don't see a real protocol issue.
>
>OK.  Now you can decide if I'm crazy.
>
>-----Original Message-----
>From: William A.(Andy) Adamson [mailto:andros@citi.umich.edu]
>Sent: Wednesday, March 17, 2004 12:44 PM
>To: pnfs-reqs@yahoogroups.com
>Cc: pnfs-ops@yahoogroups.com; andros@citi.umich.edu
>Subject: [pnfs-ops] pNFS, MPIO, and client group open
>
>
>Sorry for the long email :)
>
>At the conclusion of the NEPS conference last November, Brent Welch
>emailed
>his notes as a starting point for a requirements document (attached). I
>use
>his pNFS extention language to describe a pNFS client using a 'normal
>open'
>servicing an open/write/close with direct access, and a large MPIO
>application
>using a proposed 'group open'.
>
>I note that my knowledge of parallel filesystems is growing, so please
>excuse
>any misconceptions, comments welcome...
>
>The architecture i'm picturing is a large cluster with a Parallel File
>System
>(PFS)
>consisting of PFS Meta data servers(PFS MD) and PFS NAS/SAN. I
>know it's
>only
>one of many architectures the pNFS set of extensions is trying to
>address.
>
>1000's of pNFS clients
>10's of pNFSd, one per PFS MD
>100's NAS/SAN
>
>
>'Normal' open
>********************
>a) pNFS client issues a compound to one pNFSd consisting of:
>OPEN with share:  Access/Deny
>        Multiple pNFSds need to resolve share.
>DELEG_ASK: Request Byte-range Delegation
>        Multiple pNFSds need to resolve delegation
>READ/WRITE_IND request direct data access
>        pNFSd queries PFS MD to get location map
>
>b) pNFS client can then issue READ/WRITE directly to the NAS/SAN using
>the map
>returned in READ/WRITE_IND.
>
>c) pNFS client issues a compound to one pNFSd consisting of:
>COMMIT_IND:
>CLOSE
>
>
>An MPIO application opens one very large file, shared by 1000's of
>compute
>clients. Each compute client manipulates its portion of the file. The
>MPIO
>layer manages compute clients so that no client shares a byte range of
>the
>file with another.
>
>This MPIO application consists of
> - supervisor code running on 1 MPIO supervisor node
> - compute code running on 1000's of MPIO compute nodes
>
>This MPIO application has cyclic behavior.
>I) Read initial data
>II) compute intermediate result
>III) wait for other compute nodes to finish computing
>IV) all compute nodes write to file (their portion)
>V) compute nodes trade 'edge conditions'
>VI) goto II (compute).
>
>While the application is not in IV (writing), another application, say
>the
>visualizer, needs READ access to the file in order to crunch it for
>visualization. Visualization is needed to tell if the MPIO application
>intermediate results are converging on a solution.
>
>If in step IV all the compute nodes open/write/close as described above
>as the
>Normal open, the pNFSds will be doing a lot of metadata processing:
>resolving
>share and delegation state between themselves as well as delivering per
>byte-range layout info. The group open is designed to reduce the
>metadata
>processing from 1000's to one.
>
>I mention a couple of new fcntls used by the MPIO layer to communicate
>pNFS
>state from the supervisor node to the compute nodes. Don't worry about
>that(!).
>
>Do worry about this: is there anything stoping a compute node
>from using
>OPEN/DELEG_ASK/WRITE_IND state obtained by the supervisor node as
>described
>below? If so, are the changes to pNFS to make this work small enough to
>be
>considered at this time?
>
>Group Open
>**********
>step IV: supervisor OPENs file, all compute clients write file,
>supervisor
>CLOSES file.
>
>specifically:
>a) supervisor issues a compound with
>OPEN: Access - Both, Deny Both to a pNFSd
>        - pNFSds need to resolve the share
>        - is this a normal nfsv4 OPEN? does pNFSd or the PFS need to
>know
>about the other compute clients?
>        - do we need the concept of a group clientid?
>DELEG_ASK: supervisor asks for WRITE delegation which should be
>        granted given the OPEN Access-Both, Deny-Both share.
>        - pNFSds need to resolve delegation request
>WRITE_IND: supervisor gets whole file layout info
>
>b) supervisor calls
>        fcntl(fd, GET_GRPOPEN, cookie_buf);
>        which returns the filehandle,stateid, and layout map from the
>supervisor pNFS.
>
>c) the supervisor code passes filehandle, stateid, and layout map to
>each
>compute
>node which calls
>        fcntl(fd, SET_GRPOPEN, cookie_buf);
>the pNFS compute node client receives the filehandle, stateid, and
>layout map.
>performs a local open (nothing need go across the wire) stuffing the
>filehandle, stateid, and layout map into it's state tree just as if an
>across
>the wire OPEN/DELEG_ASK/WRITE_IND occured.
>
>d) compute clients use SET_GRPOPEN filehandle, stateid and map to
>directly
>write the data to the appropriate NAS/SAN
>        - what besides the filehandle, stateid, and layout map is
>needed?
>        - when done writing, each compute client issues a COMMIT_IND.
>
>e) when compute clients have flushed all data back to the file,
>supervisor
>issues a compound with
>
>CLOSE
>
>
>
>
>Yahoo! Groups Links
>
>
>
>
>
>
>
>Yahoo! Groups Links
>
>
>
>
>


------------------------ Yahoo! Groups Sponsor ---------------------~-->
Upgrade to 128-bit SSL Security!
http://us.click.yahoo.com/LPJzrA/yjVHAA/TtwFAA/W6uqlB/TM
---------------------------------------------------------------------~->


Yahoo! Groups Links

<*> To visit your group on the web, go to:
    http://groups.yahoo.com/group/pnfs-ops/

<*> To unsubscribe from this group, send an email to:
    pnfs-ops-unsubscribe@yahoogroups.com

<*> Your use of Yahoo! Groups is subject to:
    http://docs.yahoo.com/info/terms/

From julian_satran@il.ibm.com Sun Mar 21 15:11:41 2004
Return-Path: <Julian_Satran@il.ibm.com>
X-Sender: Julian_Satran@il.ibm.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 78562 invoked from network); 21 Mar 2004 23:11:41 -0000
Received: from unknown (66.218.66.172)
by m15.grp.scd.yahoo.com with QMQP; 21 Mar 2004 23:11:41 -0000
Received: from unknown (HELO mtagate3.de.ibm.com) (195.212.29.152)
by mta4.grp.scd.yahoo.com with SMTP; 21 Mar 2004 23:11:39 -0000
Received: from d12relay02.megacenter.de.ibm.com (d12relay02.megacenter.de.ibm.com [9.149.165.196])
by mtagate3.de.ibm.com (8.12.10/8.12.10) with ESMTP id i2LN9ixJ057250;
Sun, 21 Mar 2004 23:09:44 GMT
Received: from d12ml102.megacenter.de.ibm.com (d12av02.megacenter.de.ibm.com [9.149.165.228])
by d12relay02.megacenter.de.ibm.com (8.12.10/NCO/VER6.6) with ESMTP id i2LN9jgg085230;
Mon, 22 Mar 2004 00:09:46 +0100
In-Reply-To: <20040317174340.B805420F71@citi.umich.edu>
To: pnfs-ops@yahoogroups.com
Cc: andros@citi.umich.edu, pnfs-ops@yahoogroups.com, pnfs-reqs@yahoogroups.com
MIME-Version: 1.0
X-Mailer: Lotus Notes Release 6.5.1 January 21, 2004
Message-ID: <OFE30965D3.B878ABA5-ONC2256E5E.005F0258-C2256E5E.007F3449@il.ibm.com>
Date: Mon, 22 Mar 2004 01:11:37 +0200
X-MIMETrack: Serialize by Router on D12ML102/12/M/IBM(Release 6.0.2CF2|July 23, 2003) at
22/03/2004 01:11:58
Content-Type: multipart/mixed; boundary="=_mixed 005F4336C2256E5E_="
X-eGroups-Remote-IP: 195.212.29.152
From: Julian Satran <julian_satran@il.ibm.com>
Subject: Re: [pnfs-ops] pNFS, MPIO, and client group open
X-Yahoo-Group-Post: member; u=64714603
X-Yahoo-Profile: julian_satran

Nice description. I would add that having a single node coordinate the MPIO access simplifies also the coordination needed at open/close and fsync (when transitioning between computation phases).

Julo


"William A.(Andy) Adamson" <andros@citi.umich.edu>

17/03/04 19:43
Please respond to
pnfs-ops

	
To
	pnfs-reqs@yahoogroups.com
cc
	pnfs-ops@yahoogroups.com, andros@citi.umich.edu
Subject
	[pnfs-ops] pNFS, MPIO, and client group open

	




Sorry for the long email :)

At the conclusion of the NEPS conference last November, Brent Welch emailed
his notes as a starting point for a requirements document (attached). I use
his pNFS extention language to describe a pNFS client using a 'normal open'
servicing an open/write/close with direct access, and a large MPIO application
using a proposed 'group open'.

I note that my knowledge of parallel filesystems is growing, so please excuse
any misconceptions, comments welcome...

The architecture i'm picturing is a large cluster with a Parallel File System
(PFS)
consisting of PFS Meta data servers(PFS MD) and PFS NAS/SAN. I know it's only
one of many architectures the pNFS set of extensions is trying to address.

1000's of pNFS clients
10's of pNFSd, one per PFS MD
100's NAS/SAN


'Normal' open
********************
a) pNFS client issues a compound to one pNFSd consisting of:
OPEN with share:  Access/Deny
       Multiple pNFSds need to resolve share.
DELEG_ASK: Request Byte-range Delegation
       Multiple pNFSds need to resolve delegation
READ/WRITE_IND request direct data access
       pNFSd queries PFS MD to get location map

b) pNFS client can then issue READ/WRITE directly to the NAS/SAN using the map
returned in READ/WRITE_IND.

c) pNFS client issues a compound to one pNFSd consisting of:
COMMIT_IND:
CLOSE


An MPIO application opens one very large file, shared by 1000's of compute
clients. Each compute client manipulates its portion of the file. The MPIO
layer manages compute clients so that no client shares a byte range of the
file with another.

This MPIO application consists of
- supervisor code running on 1 MPIO supervisor node
- compute code running on 1000's of MPIO compute nodes

This MPIO application has cyclic behavior.
I) Read initial data
II) compute intermediate result
III) wait for other compute nodes to finish computing
IV) all compute nodes write to file (their portion)
V) compute nodes trade 'edge conditions'
VI) goto II (compute).

While the application is not in IV (writing), another application, say the
visualizer, needs READ access to the file in order to crunch it for
visualization. Visualization is needed to tell if the MPIO application
intermediate results are converging on a solution.

If in step IV all the compute nodes open/write/close as described above as the
Normal open, the pNFSds will be doing a lot of metadata processing: resolving
share and delegation state between themselves as well as delivering per
byte-range layout info. The group open is designed to reduce the metadata
processing from 1000's to one.

I mention a couple of new fcntls used by the MPIO layer to communicate pNFS
state from the supervisor node to the compute nodes. Don't worry about that(!).

Do worry about this: is there anything stoping a compute node from using
OPEN/DELEG_ASK/WRITE_IND state obtained by the supervisor node as described
below? If so, are the changes to pNFS to make this work small enough to be
considered at this time?

Group Open
**********
step IV: supervisor OPENs file, all compute clients write file, supervisor
CLOSES file.

specifically:
a) supervisor issues a compound with
OPEN: Access - Both, Deny Both to a pNFSd
       - pNFSds need to resolve the share
       - is this a normal nfsv4 OPEN? does pNFSd or the PFS need to know
about the other compute clients?
       - do we need the concept of a group clientid?
DELEG_ASK: supervisor asks for WRITE delegation which should be
       granted given the OPEN Access-Both, Deny-Both share.
       - pNFSds need to resolve delegation request
WRITE_IND: supervisor gets whole file layout info

b) supervisor calls
       fcntl(fd, GET_GRPOPEN, cookie_buf);
       which returns the filehandle,stateid, and layout map from the
supervisor pNFS.

c) the supervisor code passes filehandle, stateid, and layout map to each
compute
node which calls
       fcntl(fd, SET_GRPOPEN, cookie_buf);
the pNFS compute node client receives the filehandle, stateid, and layout map.
performs a local open (nothing need go across the wire) stuffing the
filehandle, stateid, and layout map into it's state tree just as if an across
the wire OPEN/DELEG_ASK/WRITE_IND occured.

d) compute clients use SET_GRPOPEN filehandle, stateid and map to directly
write the data to the appropriate NAS/SAN
       - what besides the filehandle, stateid, and layout map is needed?
       - when done writing, each compute client issues a COMMIT_IND.

e) when compute clients have flushed all data back to the file, supervisor
issues a compound with

CLOSE




Yahoo! Groups Links

<*> To visit your group on the web, go to:
    http://groups.yahoo.com/group/pnfs-ops/

<*> To unsubscribe from this group, send an email to:
    pnfs-ops-unsubscribe@yahoogroups.com

<*> Your use of Yahoo! Groups is subject to:
    http://docs.yahoo.com/info/terms/




Attachment (not stored)
brent_welch_pnfs_ops
Type: application/octet-stream

From bhalevy@panasas.com Sun Mar 21 17:20:33 2004
Return-Path: <bhalevy@panasas.com>
X-Sender: bhalevy@panasas.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 27471 invoked from network); 22 Mar 2004 01:20:32 -0000
Received: from unknown (66.218.66.167)
by m13.grp.scd.yahoo.com with QMQP; 22 Mar 2004 01:20:32 -0000
Received: from unknown (HELO PIKES.panasas.com) (65.194.124.178)
by mta6.grp.scd.yahoo.com with SMTP; 22 Mar 2004 01:20:32 -0000
Received: from yang ([172.17.19.44]) by PIKES.panasas.com with SMTP (Microsoft Exchange Internet Mail Service Version 5.5.2653.13)
id H2LG662S; Sun, 21 Mar 2004 20:20:29 -0500
To: <julian_satran@il.ibm.com>,
<pnfs-reqs@yahoogroups.com>
Cc: <pnfs-ops@yahoogroups.com>
Date: Sun, 21 Mar 2004 20:20:30 -0500
Message-ID: <LCEAJMHHKPKEPAIDBBEKMEBMCBAA.bhalevy@panasas.com>
MIME-Version: 1.0
Content-Type: text/plain;
charset="us-ascii"
Content-Transfer-Encoding: 7bit
X-Priority: 3 (Normal)
X-MSMail-Priority: Normal
X-Mailer: Microsoft Outlook IMO, Build 9.0.6604 (9.0.2911.0)
In-Reply-To: <OF5B14E311.0C171A1C-ONC2256E5E.00605204-C2256E5E.007F38C3@il.ibm.com>
Importance: Normal
X-MimeOLE: Produced By Microsoft MimeOLE V6.00.2800.1165
X-eGroups-Remote-IP: 65.194.124.178
From: "Benny Halevy" <bhalevy@panasas.com>
Subject: RE: [pnfs-reqs] RE: [pnfs-ops] pNFS, MPIO, and client group open
X-Yahoo-Group-Post: member; u=169276676
X-Yahoo-Profile: benny_halevy

ADVERTISEMENT
click here
Julo,

I agree that the proposed method for parallel access requires
the ability to delegate security information along with layout
delegation. Still, I'm not sure there's a problem...

My assumptions were that block based storage networks are
secured with some form of host to LUN mapping thus capabilities
do not play a role in this game.

For file or object storage "capabilities" cannot be delegated
if they allow access to an object only to a specific client host.

OSD capabilities (at least SNIA
http://www.t10.org/ftp/t10/document.03/03-279r0.pdf)
do not have that limitation as far as I know.
The intention with this regards is well versed there:
"Note this protocol does allow delegation of a credential if
a host transfers both the secret part of the credential as
well as the public capability arguments."

We haven't yet discussed the pNFS/NFS security model in details
but assuming the back-end speaks NFSv4 the front-end metadata
server needs to give the pNFS client a NFSv4 filehandle (which
can theoretically serve as a capability) and then the client
must still authenticate with the back end data server.

Benny

-----Original Message-----
From: Julian Satran [mailto:julian_satran@il.ibm.com]
Sent: Sunday, March 21, 2004 18:12
To: pnfs-ops@yahoogroups.com
Cc: pnfs-ops@yahoogroups.com; 'pnfs-reqs@yahoogroups.com'
Subject: RE: [pnfs-reqs] RE: [pnfs-ops] pNFS, MPIO, and client group open



I am sure we all want to be aware of this "twist". It may be more than just
putting a label on it - e.g., security protocols must be able to "delegate"
credentials and that makes some bindings bad.

Julo


"Halevy, Benny" <bhalevy@panasas.com>
17/03/04 22:48 Please respond to
pnfs-ops

To"'pnfs-reqs@yahoogroups.com'" <pnfs-reqs@yahoogroups.com>,
pnfs-ops@yahoogroups.com
cc
SubjectRE: [pnfs-reqs] RE: [pnfs-ops] pNFS, MPIO, and client group open







I completely agree with Dave and I certainly don't think he's
crazy.

I perceive this solution as a "clustered" implementation of a
nfsv4 client in which the v4 drivers in the client cluster are
cooperating and propagating state (e.g. file handles, stateids)
among each other.

I believe that the server should not be able to distinguish
such client from a multi-homed client that may have several
ip addresses.

In the nfsv4 sessions world a (clustered) client may open
multiple connections to the server that are associated with
the same session - this will make life for such client even
easier, I hope.

Benny

>-----Original Message-----
>From: Noveck, Dave [mailto:dnoveck@netapp.com]
>Sent: Wednesday, March 17, 2004 2:43 PM
>To: pnfs-ops@yahoogroups.com; pnfs-reqs@yahoogroups.com
>Subject: [pnfs-reqs] RE: [pnfs-ops] pNFS, MPIO, and client group open
>
>
>Andy wrote:
>> Do worry about this: is there anything stoping a compute node from
>using
>> OPEN/DELEG_ASK/WRITE_IND state obtained by the supervisor node as
>described
>> below?
>
>I'm going to say "No". I know this wasn't the answer that I
>gave at the
>
>conference call, (and might not be the answer I give at the next
>conference
>call :-), but listen to my reasoning before you decide I'm crazy.
>
>In order to resolve this issue it is necessary to get all philosophical
>and address the question "What is a computer?". I know lots of people
>have already hit delete but I hope somebody is still reading.
>
>Suppose I have an application cluster with 1K nodes and I put on my
>marketing
>hat (Gee, I hope I don't need a marketing jacket and tie, too :-) and
>say "This is really a powerful computer with a thousand (maybe two
>thousand)
>CPU's". Now that's marketing bullshit but it isn't exactly false.
>There
>are certainly tasks where you want a large number of CPU's sharing
>memory
>and a DSM arrangement's performance is going to suck. On the other
>hand,
>there are applications where having a thousand memories is going to be
>much
>better than trying to provide adequate memory bandwidth from a single
>memory
>to many many CPU's.
>
>So what's the point? I think the point is that as far as the
>NFS server
>is
>concerned, whether the computer that is talking to it is "really" a
>computer,
>i.e. it has CPU's sharing memory, or is only a computer qua marketing
>bullshit,
>i.e. a collection of cpu's that don't share memory, that use other
>methods to
>co-ordinate common activities, doesn't matter. All the server sees are
>the
>requests made and if the cluster represents itself as a single machine
>(i.e.
>in V4 does a single SETCLIENTID or in v4.1 maintains many connections
>bound to
>a single session), it is one. The server doesn't see the cluster's
>memory
>architecture. It sees an open and then use of that that stateid. The
>fact
>that it comes over a different IP address doesn't disqualify it. A
>server
>might have options to check that (as a matter of security) but it isn't
>part
>of the protocol and we already have clients with multiple IP addresses.
>Having
>a thousand of them is a difference of degree (and may pose
>implementation
>issues) but I don't see a real protocol issue.
>
>OK. Now you can decide if I'm crazy.
>
>-----Original Message-----
>From: William A.(Andy) Adamson [mailto:andros@citi.umich.edu]
>Sent: Wednesday, March 17, 2004 12:44 PM
>To: pnfs-reqs@yahoogroups.com
>Cc: pnfs-ops@yahoogroups.com; andros@citi.umich.edu
>Subject: [pnfs-ops] pNFS, MPIO, and client group open
>
>
>Sorry for the long email :)
>
>At the conclusion of the NEPS conference last November, Brent Welch
>emailed
>his notes as a starting point for a requirements document (attached). I
>use
>his pNFS extention language to describe a pNFS client using a 'normal
>open'
>servicing an open/write/close with direct access, and a large MPIO
>application
>using a proposed 'group open'.
>
>I note that my knowledge of parallel filesystems is growing, so please
>excuse
>any misconceptions, comments welcome...
>
>The architecture i'm picturing is a large cluster with a Parallel File
>System
>(PFS)
>consisting of PFS Meta data servers(PFS MD) and PFS NAS/SAN. I
>know it's
>only
>one of many architectures the pNFS set of extensions is trying to
>address.
>
>1000's of pNFS clients
>10's of pNFSd, one per PFS MD
>100's NAS/SAN
>
>
>'Normal' open
>********************
>a) pNFS client issues a compound to one pNFSd consisting of:
>OPEN with share: Access/Deny
> Multiple pNFSds need to resolve share.
>DELEG_ASK: Request Byte-range Delegation
> Multiple pNFSds need to resolve delegation
>READ/WRITE_IND request direct data access
> pNFSd queries PFS MD to get location map
>
>b) pNFS client can then issue READ/WRITE directly to the NAS/SAN using
>the map
>returned in READ/WRITE_IND.
>
>c) pNFS client issues a compound to one pNFSd consisting of:
>COMMIT_IND:
>CLOSE
>
>
>An MPIO application opens one very large file, shared by 1000's of
>compute
>clients. Each compute client manipulates its portion of the file. The
>MPIO
>layer manages compute clients so that no client shares a byte range of
>the
>file with another.
>
>This MPIO application consists of
> - supervisor code running on 1 MPIO supervisor node
> - compute code running on 1000's of MPIO compute nodes
>
>This MPIO application has cyclic behavior.
>I) Read initial data
>II) compute intermediate result
>III) wait for other compute nodes to finish computing
>IV) all compute nodes write to file (their portion)
>V) compute nodes trade 'edge conditions'
>VI) goto II (compute).
>
>While the application is not in IV (writing), another application, say
>the
>visualizer, needs READ access to the file in order to crunch it for
>visualization. Visualization is needed to tell if the MPIO application
>intermediate results are converging on a solution.
>
>If in step IV all the compute nodes open/write/close as described above
>as the
>Normal open, the pNFSds will be doing a lot of metadata processing:
>resolving
>share and delegation state between themselves as well as delivering per
>byte-range layout info. The group open is designed to reduce the
>metadata
>processing from 1000's to one.
>
>I mention a couple of new fcntls used by the MPIO layer to communicate
>pNFS
>state from the supervisor node to the compute nodes. Don't worry about
>that(!).
>
>Do worry about this: is there anything stoping a compute node
>from using
>OPEN/DELEG_ASK/WRITE_IND state obtained by the supervisor node as
>described
>below? If so, are the changes to pNFS to make this work small enough to
>be
>considered at this time?
>
>Group Open
>**********
>step IV: supervisor OPENs file, all compute clients write file,
>supervisor
>CLOSES file.
>
>specifically:
>a) supervisor issues a compound with
>OPEN: Access - Both, Deny Both to a pNFSd
> - pNFSds need to resolve the share
> - is this a normal nfsv4 OPEN? does pNFSd or the PFS need to
>know
>about the other compute clients?
> - do we need the concept of a group clientid?
>DELEG_ASK: supervisor asks for WRITE delegation which should be
> granted given the OPEN Access-Both, Deny-Both share.
> - pNFSds need to resolve delegation request
>WRITE_IND: supervisor gets whole file layout info
>
>b) supervisor calls
> fcntl(fd, GET_GRPOPEN, cookie_buf);
> which returns the filehandle,stateid, and layout map from the
>supervisor pNFS.
>
>c) the supervisor code passes filehandle, stateid, and layout map to
>each
>compute
>node which calls
> fcntl(fd, SET_GRPOPEN, cookie_buf);
>the pNFS compute node client receives the filehandle, stateid, and
>layout map.
>performs a local open (nothing need go across the wire) stuffing the
>filehandle, stateid, and layout map into it's state tree just as if an
>across
>the wire OPEN/DELEG_ASK/WRITE_IND occured.
>
>d) compute clients use SET_GRPOPEN filehandle, stateid and map to
>directly
>write the data to the appropriate NAS/SAN
> - what besides the filehandle, stateid, and layout map is
>needed?
> - when done writing, each compute client issues a COMMIT_IND.
>
>e) when compute clients have flushed all data back to the file,
>supervisor
>issues a compound with
>
>CLOSE
>
>
>
>
>Yahoo! Groups Links
>
>
>
>
>
>
>
>Yahoo! Groups Links
>
>
>
>
>




Yahoo! Groups Links









Yahoo! Groups Links

To visit your group on the web, go to:
http://groups.yahoo.com/group/pnfs-reqs/

To unsubscribe from this group, send an email to:
pnfs-reqs-unsubscribe@yahoogroups.com

Your use of Yahoo! Groups is subject to the Yahoo! Terms of Service.

From garth@panasas.com Sun Mar 28 19:58:52 2004
Return-Path: <garth@panasas.com>
X-Sender: garth@panasas.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 1848 invoked from network); 29 Mar 2004 03:58:48 -0000
Received: from unknown (66.218.66.217)
by m9.grp.scd.yahoo.com with QMQP; 29 Mar 2004 03:58:48 -0000
Received: from unknown (HELO PIKES.panasas.com) (65.194.124.178)
by mta2.grp.scd.yahoo.com with SMTP; 29 Mar 2004 03:58:51 -0000
Received: from [127.0.0.1] ([172.17.19.3]) by PIKES.panasas.com with SMTP (Microsoft Exchange Internet Mail Service Version 5.5.2653.13)
id HZGBBXTN; Sun, 28 Mar 2004 22:58:16 -0500
Mime-Version: 1.0 (Apple Message framework v612)
Content-Transfer-Encoding: quoted-printable
Message-Id: <46F74125-8135-11D8-BB3C-000A95A94F04@panasas.com>
Content-Type: text/plain; charset=ISO-8859-1; format=flowed
To: pnfs-sbc@yahoogroups.com,
pnfs-obj@yahoogroups.com,
pnfs-reqs@yahoogroups.com,
pNFS Operations <pnfs-ops@yahoogroups.com>,
pnfs-nfs@yahoogroups.com
Date: Sun, 28 Mar 2004 22:58:03 -0500
X-Mailer: Apple Mail (2.612)
X-eGroups-Remote-IP: 65.194.124.178
From: Garth Gibson <garth@panasas.com>
Subject: FAST04 BOF 3/31 12:30pm: seeking a Parallel NFS (pNFS)
X-Yahoo-Group-Post: member; u=169457820
X-Yahoo-Profile: garth_a_gibson

Announcing a public Birds of a Feather meeting for those
interested in bringing into existence a Parallel NFS (pNFS)
standard for network attached storage.

This BOF is to be held between USENIX' NSDI
(www.usenix.org/events/nsdi04) and FAST (www.usenix.org/events/fast04)
conferences, 12:30pm - 2pm, Wednesday March 31, 2004, in
the Dolores room of the Grand Hyatt hotel in San Francisco.
A simple box lunch for the first 50 attendees will be provided
by Panasas Inc.

Speakers at this BOF will include Peter Corbett, Network Appliance;
David Black, EMC; Julian Satran, IBM; Peter Honeyman, CITI;
Sumanta Chatterjee, Oracle; and Brent Welch, Panasas.

Background materials from the organizers of this BOF can be
found in the proceedings of a recent workshop, NFS Extensions
for Parallel Storage, www.citi.umich.edu/NEPS/agenda.html, held
by the Center for Information Technology Integration at the
University of Michigan. Participants of that workshop
summarized a statement of the problem pNFS might
address in the following recent informational internet draft:

Title: pNFS Problem Statement
Author(s): Garth Gibson, Panasas & CMU, Peter Corbett, Network Appliance
Filename: draft-gibson-pnfs-problem-statement-00.txt
Pages: 12
Date: 2004-2-9
  
This draft considers the problem of limited bandwidth to
NFS servers.  The bandwidth limitation exists because an
NFS server has limited network, CPU, memory and disk
I/O resources.  Yet, access to any one file system through
the NFSv4 protocol requires that a single server be accessed. 
While NFSv4 allows file system migration, it does not provide
a mechanism that supports multiple servers simultaneously
exporting a single writable file system.

This problem has become aggravated in recent years with
the advent of very cheap and easily expanded clusters
of application servers that are also NFS clients.  The
aggregate bandwidth demands of such clustered clients,
typically working on a shared data set preferentially
stored in a single file system, can increase much more
quickly than the bandwidth of any server.  The proposed
solution is to provide for the parallelization of file services,
by enhancing NFSv4 in a minor version.

A URL for this Internet-Draft is:
www.ietf.org/internet-drafts/draft-gibson-pnfs-problem-statement-00.txt


From garth@panasas.com Sun Mar 28 20:03:07 2004
Return-Path: <garth@panasas.com>
X-Sender: garth@panasas.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 33647 invoked from network); 29 Mar 2004 04:03:06 -0000
Received: from unknown (66.218.66.166)
by m16.grp.scd.yahoo.com with QMQP; 29 Mar 2004 04:03:06 -0000
Received: from unknown (HELO PIKES.panasas.com) (65.194.124.178)
by mta5.grp.scd.yahoo.com with SMTP; 29 Mar 2004 04:03:06 -0000
Received: from [127.0.0.1] ([172.17.19.3]) by PIKES.panasas.com with SMTP (Microsoft Exchange Internet Mail Service Version 5.5.2653.13)
id HZGBBX4A; Sun, 28 Mar 2004 23:02:41 -0500
Mime-Version: 1.0 (Apple Message framework v612)
Content-Type: text/plain; charset=US-ASCII; format=flowed
Message-Id: <E46D87AE-8135-11D8-BB3C-000A95A94F04@panasas.com>
Content-Transfer-Encoding: 7bit
Cc: Garth Gibson <garth@panasas.com>
Date: Sun, 28 Mar 2004 23:02:27 -0500
To: pnfs-sbc@yahoogroups.com,
pnfs-obj@yahoogroups.com,
pnfs-reqs@yahoogroups.com,
pNFS Operations <pnfs-ops@yahoogroups.com>,
pnfs-nfs@yahoogroups.com
X-Mailer: Apple Mail (2.612)
X-eGroups-Remote-IP: 65.194.124.178
From: Garth Gibson <garth@panasas.com>
Subject: FACE-TO-FACE pNFS working meeting: 3/31 9am-12:30 Grand Hyatt, Dolores Rm, SF
X-Yahoo-Group-Post: member; u=169457820
X-Yahoo-Profile: garth_a_gibson

Reminder, folks,

Before the upcoming pNFS BOF at FAST Wed Mar 31, 12:30 - 2pm, in the
Dolores room of the Grand Hyatt hotel in San Francisco, the pNFS
community reached by these mailing lists will be meeting for a working
session, 9am - 12:30pm.

Tentative agenda:

- Requirements update and discussion (update coming from Garth)
- Operations update and discussion (update coming from Brent)
- Use cases discussion (initial use cases coming from Andy)

See you there!
garth

From garth@panasas.com Mon Mar 29 12:49:03 2004
Return-Path: <garth@panasas.com>
X-Sender: garth@panasas.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 91946 invoked from network); 29 Mar 2004 20:48:59 -0000
Received: from unknown (66.218.66.216)
by m1.grp.scd.yahoo.com with QMQP; 29 Mar 2004 20:48:59 -0000
Received: from unknown (HELO PIKES.panasas.com) (65.194.124.178)
by mta1.grp.scd.yahoo.com with SMTP; 29 Mar 2004 20:48:59 -0000
Received: from [127.0.0.1] ([172.17.19.3]) by PIKES.panasas.com with SMTP (Microsoft Exchange Internet Mail Service Version 5.5.2653.13)
id HZGBB9M4; Mon, 29 Mar 2004 15:48:56 -0500
Mime-Version: 1.0 (Apple Message framework v612)
In-Reply-To: <E46D87AE-8135-11D8-BB3C-000A95A94F04@panasas.com>
References: <E46D87AE-8135-11D8-BB3C-000A95A94F04@panasas.com>
Content-Type: text/plain; charset=US-ASCII; format=flowed
Message-Id: <7694A65B-81C2-11D8-BB3C-000A95A94F04@panasas.com>
Content-Transfer-Encoding: 7bit
Date: Mon, 29 Mar 2004 15:48:42 -0500
To: pnfs-reqs@yahoogroups.com,
pnfs-obj@yahoogroups.com,
pnfs-sbc@yahoogroups.com,
pNFS Operations <pnfs-ops@yahoogroups.com>,
pnfs-nfs@yahoogroups.com
X-Mailer: Apple Mail (2.612)
X-eGroups-Remote-IP: 65.194.124.178
From: Garth Gibson <garth@panasas.com>
Subject: Re: FACE-TO-FACE pNFS working meeting: 3/31 9am-12:30 Grand Hyatt, Dolores Rm, SF
X-Yahoo-Group-Post: member; u=169457820
X-Yahoo-Profile: garth_a_gibson

ADVERTISEMENT
Word is that we will have a polycom and phone line in the FACE-to-FACE
meeting. I will not post the call in, but I will respond to requests
for the dialin details (once I have them, which is currently not yet).

garth

On Mar 28, 2004, at 11:02 PM, Garth Gibson wrote:

> Reminder, folks,
>
> Before the upcoming pNFS BOF at FAST Wed Mar 31, 12:30 - 2pm, in the
> Dolores room of the Grand Hyatt hotel in San Francisco, the pNFS
> community reached by these mailing lists will be meeting for a working
> session, 9am - 12:30pm.
>
> Tentative agenda:
>
> - Requirements update and discussion (update coming from Garth)
> - Operations update and discussion (update coming from Brent)
> - Use cases discussion (initial use cases coming from Andy)
>
> See you there!
> garth

From bwelch@panasas.com Mon Mar 29 23:41:49 2004
Return-Path: <welch@panasas.com>
X-Sender: welch@panasas.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 61803 invoked from network); 30 Mar 2004 07:41:48 -0000
Received: from unknown (66.218.66.172)
by m11.grp.scd.yahoo.com with QMQP; 30 Mar 2004 07:41:48 -0000
Received: from unknown (HELO medlicott.panasas.com) (63.80.58.202)
by mta4.grp.scd.yahoo.com with SMTP; 30 Mar 2004 07:41:48 -0000
Received: from panasas.com (welch@localhost)
by medlicott.panasas.com (8.11.6/8.11.6) with ESMTP id i2U7ccB14789;
Mon, 29 Mar 2004 23:38:39 -0800
Message-Id: <200403300738.i2U7ccB14789@medlicott.panasas.com>
X-Authentication-Warning: medlicott.panasas.com: welch owned process doing -bs
X-Mailer: exmh version 2.6.3 04/02/2003 with nmh-1.0.4
To: pnfs-obj@yahoogroups.com
Cc: pnfs-reqs@yahoogroups.com, pnfs-sbc@yahoogroups.com,
pNFS Operations <pnfs-ops@yahoogroups.com>, pnfs-nfs@yahoogroups.com
In-reply-to: <7694A65B-81C2-11D8-BB3C-000A95A94F04@panasas.com>
References: <E46D87AE-8135-11D8-BB3C-000A95A94F04@panasas.com>
<7694A65B-81C2-11D8-BB3C-000A95A94F04@panasas.com>
Comments: In-reply-to Garth Gibson <garth@panasas.com>
message dated "Mon, 29 Mar 2004 15:48:42 -0500."
X-URL: http://www.panasas.com/
X-Face: "HxE|?EnC9fVMV8f70H83&{fgLE.|FZ^$>@Q(yb#N,Eh~N]e&]=>
r5~UnRml1:4EglY{9B+
:'wJq$@c_C!l8@<$t,{YUr4K,QJGHSvS~U]H`<+L*x?eGzSk>XH\W:AK\j?@?c1o<k;j'Ei/UL)!*0
ILwSR)J\bc)gjz!rrGQ2#i*f:M:ydhK}jp4dWQW?;0{,#iWrCV$4~%e/3)$1/D
Mime-Version: 1.0
Content-Type: multipart/mixed ;
boundary="==_Exmh_-19518296890"
Date: Mon, 29 Mar 2004 23:38:38 -0800
X-eGroups-Remote-IP: 63.80.58.202
X-eGroups-From: Brent Welch <welch@panasas.com>
From: Brent Welch <bwelch@panasas.com>
Subject: pNFS summary, v2
X-Yahoo-Group-Post: member; u=169551413
X-Yahoo-Profile: brent_welch_1960

ADVERTISEMENT
In preparation for the meeting at FAST I was tasked with updating my
previous workshop summary with the ideas that have been developing on
the mailing lists. I'm attaching what I have. The main caveat is that
these are my words about lots of peoples ideas, so I'm sure I'm not
always conveying them as you may have intended. I'm sure you'll speak
up where I've strayed or if I've left out important items. I made no
attempt to summarize all the arguments that flowed across the list,
but instead I'm giving the reader's digest of what I think we agree on,
and have enumerated the issue areas.

See you Wednesday.

--
Brent Welch
Software Architect, Panasas Inc
Delivering the premier storage system for scalable Linux clusters

www.panasas.com
welch@panasas.com



Attachment (not stored)
pnfs_summary.v2.txt
Type: text/plain 

From black_david@emc.com Tue Mar 30 07:46:53 2004
Return-Path: <Black_David@emc.com>
X-Sender: Black_David@emc.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 41595 invoked from network); 30 Mar 2004 15:46:51 -0000
Received: from unknown (66.218.66.166)
by m18.grp.scd.yahoo.com with QMQP; 30 Mar 2004 15:46:51 -0000
Received: from unknown (HELO MAHO3MSX2.corp.emc.com) (128.221.11.32)
by mta5.grp.scd.yahoo.com with SMTP; 30 Mar 2004 15:46:51 -0000
Received: by maho3msx2.corp.emc.com with Internet Mail Service (5.5.2653.19)
id <H9P6A7Y1>; Tue, 30 Mar 2004 10:46:27 -0500
Message-ID: <B459CE1AFFC52D4688B2A5B842CA35EA7A57F8@corpmx14.us.dg.com>
To: pnfs-reqs@yahoogroups.com, pnfs-obj@yahoogroups.com
Cc: pnfs-sbc@yahoogroups.com, pnfs-ops@yahoogroups.com,
pnfs-nfs@yahoogroups.com
Date: Tue, 30 Mar 2004 10:46:21 -0500
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2653.19)
Content-Type: text/plain
X-eGroups-Remote-IP: 128.221.11.32
From: black_david@emc.com
Subject: RE: [pnfs-reqs] pNFS summary, v2
X-Yahoo-Group-Post: member; u=82420288
X-Yahoo-Profile: dlb237

ADVERTISEMENT
click here
I looked over Brent's summary and the FMP protocol that is used
in HighRoad, and turned up the following items for discussion:

- The exact details of COMMIT_IND (order in which it does things)
matter a lot. That's below the level of Brent's current
summary, and I presume we'll get to it as we flesh out
the design.

- The notification functionality needs more fleshing out. Here's
what FMP provides:
o Recall specific extent delegation(s)
o Downgrade (write to read) specific extent delegation(s)
o Recall all extent delegations for a file handle
o Recall all extent delegations for a filesystem
o Set EOF (This is subtle, consider the case where
client A sets EOF in the middle of an extent
for which client B has a write delegation.)
Again, this is something for further design, but I think
this list is about at the level of Brent's summary.

- Completion callbacks. FMP supports both server queuing of
requests (completion is a notification) and server rejection
of requests. Rejection supports cases where the server
has a notify outstanding to the client when it receives the
client request and wants to force the client to process the
notify; rejection is the right thing to do because the
notification (e.g., recall) may affect whether the client
resubmits the same request. I agree with Brent's view that
the client has to retry if the operation is rejected, however
the ability to support server queuing of conflicting requests
allows the server to provide some liveness/fairness assurances
if the server implementer chooses to do so. In essence a
"Queued" response from the server promises to do the operation,
but not immediately, and frees a client RPC execution context
for things like notification handling.

- Operation ordering. There are some ordering requirements involving
interaction of notifications and operations - for example, if
a client responds to a recall notification and submits an
operation based on having completed the notification, the server
will need to process the notification completion before the
client operation. I strongly favor a "cut on the dotted line"
approach where the ordering requirements for direct data access
are clearly specified as part of the extension (even if they
rely on existing NFS facilities for their realization) so that
it's clear what has to be done to achieve the same functionality
for other distributed filesystem protocols.

Thanks,
--David
----------------------------------------------------
David L. Black, Senior Technologist
EMC Corporation, 176 South St., Hopkinton, MA 01748
+1 (508) 293-7953 FAX: +1 (508) 293-7786
black_david@emc.com Mobile: +1 (978) 394-7754
----------------------------------------------------

> -----Original Message-----
> From: Brent Welch [mailto:bwelch@panasas.com]
> Sent: Tuesday, March 30, 2004 2:39 AM
> To: pnfs-obj@yahoogroups.com
> Cc: pnfs-reqs@yahoogroups.com; pnfs-sbc@yahoogroups.com; pNFS
> Operations; pnfs-nfs@yahoogroups.com
> Subject: [pnfs-reqs] pNFS summary, v2
>
>
> In preparation for the meeting at FAST I was tasked with updating my
> previous workshop summary with the ideas that have been developing on
> the mailing lists. I'm attaching what I have. The main
> caveat is that
> these are my words about lots of peoples ideas, so I'm sure I'm not
> always conveying them as you may have intended. I'm sure you'll speak
> up where I've strayed or if I've left out important items. I made no
> attempt to summarize all the arguments that flowed across the list,
> but instead I'm giving the reader's digest of what I think we
> agree on,
> and have enumerated the issue areas.
>
> See you Wednesday.
>
> --
> Brent Welch
> Software Architect, Panasas Inc
> Delivering the premier storage system for scalable Linux clusters
>
www.panasas.com
welch@panasas.com





Yahoo! Groups Links

From andros@citi.umich.edu Tue Mar 30 14:08:41 2004
Return-Path: <andros@citi.umich.edu>
X-Sender: andros@citi.umich.edu
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 7863 invoked from network); 30 Mar 2004 22:08:29 -0000
Received: from unknown (66.218.66.167)
by m14.grp.scd.yahoo.com with QMQP; 30 Mar 2004 22:08:28 -0000
Received: from unknown (HELO citi.umich.edu) (141.211.133.111)
by mta6.grp.scd.yahoo.com with SMTP; 30 Mar 2004 22:08:27 -0000
Received: from citi.umich.edu (citi.umich.edu [141.211.133.111])
by citi.umich.edu (Postfix) with ESMTP
id 26F9E2084D; Tue, 30 Mar 2004 17:08:27 -0500 (EST)
X-Mailer: exmh version 2.5 07/13/2001 with version: MH 6.8.3 #74[UCI]
To: pnfs-reqs@yahoogroups.com
Cc: pnfs-obj@yahoogroups.com
Mime-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Date: Tue, 30 Mar 2004 17:08:27 -0500
Message-Id: <20040330220827.26F9E2084D@citi.umich.edu>
X-eGroups-Remote-IP: 141.211.133.111
From: "William A.(Andy) Adamson" <andros@citi.umich.edu>
Subject: my 10 minute FAST talk
X-Yahoo-Group-Post: member; u=169434965

hi speakers

i quickly describe an ASCI type cluster that uses a Scaleable Global Parallel
File System (SGPFS), and quickly describe how NFSv2/v3 is currently used. then
move onto using stock NFSv4.0, and where it fails, ending up with pNFS and
how it can succeed.

* getting rid of the NFSD on SGPFS client (a la NFSv2/v3)
* enterprise desktop NFSv4.0 can access proprietary SGPFS data
* high speed parallel data transfer between pNFS clusters.

-->Andy

From andros@citi.umich.edu Tue Mar 30 15:33:34 2004
Return-Path: <andros@citi.umich.edu>
X-Sender: andros@citi.umich.edu
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 17814 invoked from network); 30 Mar 2004 23:33:33 -0000
Received: from unknown (66.218.66.166)
by m19.grp.scd.yahoo.com with QMQP; 30 Mar 2004 23:33:33 -0000
Received: from unknown (HELO citi.umich.edu) (141.211.133.111)
by mta5.grp.scd.yahoo.com with SMTP; 30 Mar 2004 23:33:32 -0000
Received: from citi.umich.edu (citi.umich.edu [141.211.133.111])
by citi.umich.edu (Postfix) with ESMTP
id D90E7207F3; Tue, 30 Mar 2004 18:33:31 -0500 (EST)
X-Mailer: exmh version 2.5 07/13/2001 with version: MH 6.8.3 #74[UCI]
To: pnfs-obj@yahoogroups.com
Cc: pnfs-reqs@yahoogroups.com, pnfs-sbc@yahoogroups.com,
pnfs-ops@yahoogroups.com, pnfs-nfs@yahoogroups.com
Mime-Version: 1.0
Content-Type: multipart/mixed ;
boundary="==_Exmh_-17735067940"
Date: Tue, 30 Mar 2004 18:33:31 -0500
Message-Id: <20040330233331.D90E7207F3@citi.umich.edu>
X-eGroups-Remote-IP: 141.211.133.111
From: "William A.(Andy) Adamson" <andros@citi.umich.edu>
Subject: pNFS use cases for the FAST meeting, first pass!
X-Yahoo-Group-Post: member; u=169434965

In preparation for tomorrows meeting at FAST, i was tasked with beginning a
list of use cases for pNFS. I came up with an initial list which is attached.
I know it's not complete, and i hope the descriptions are meaningful :o

see you tomorrow.

-->Andy




Attachment (not stored)
pnfs_use.txt
Type: text/plain 

From garth@panasas.com Tue Mar 30 23:13:14 2004
Return-Path: <garth@panasas.com>
X-Sender: garth@panasas.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 15375 invoked from network); 31 Mar 2004 07:13:12 -0000
Received: from unknown (66.218.66.172)
by m16.grp.scd.yahoo.com with QMQP; 31 Mar 2004 07:13:12 -0000
Received: from unknown (HELO PIKES.panasas.com) (65.194.124.178)
by mta4.grp.scd.yahoo.com with SMTP; 31 Mar 2004 07:13:11 -0000
Received: from [127.0.0.1] ([172.17.19.3]) by PIKES.panasas.com with SMTP (Microsoft Exchange Internet Mail Service Version 5.5.2653.13)
id HZGBCF03; Wed, 31 Mar 2004 02:12:58 -0500
Mime-Version: 1.0 (Apple Message framework v612)
Content-Transfer-Encoding: quoted-printable
Message-Id: <CC844833-82E2-11D8-BB3C-000A95A94F04@panasas.com>
Content-Type: text/plain; charset=UTF-8; format=flowed
To: pnfs-reqs@yahoogroups.com
Date: Tue, 30 Mar 2004 23:12:41 -0800
X-Mailer: Apple Mail (2.612)
X-eGroups-Remote-IP: 65.194.124.178
From: Garth Gibson <garth@panasas.com>
Subject: Requirements discussions update
X-Yahoo-Group-Post: member; u=169457820
X-Yahoo-Profile: garth_a_gibson

An update on requirements discussions that have taken place on the
pnfs-reqs@yahoogroups.com reflector
(http://groups.yahoo.com/group/pnfs-reqs). These notes were prepared
for the Wed 3/31 9am-12:30 face-to-face meeting before the pNFS BOF at
FAST 2004.

Though the ideas and concerns reported here are drawn from many people,
including but limited to Dave Noveck, Brent Welch, David Black, Andy
Adamson, Craig Everhart, Julian Satran, Tom Talpey, Benny Halevy, Gary
Grider, Tyce McLarty, Dean Hildebrand, Peter Honeyman, Peter Corbett,
the errors and opinions coloring this document are probably mine.

garth gibson

----------------------------------------


Topics:

0.0 Defining Requirements
1.0 Minimalism
1.1 Proxying
1.2 Cache consistency
1.3 Delegation promotion & reacquisition
1.4 Layout delegations
1.5 Concurrent write
1.6 Map revocation
1.7 Separability
1.8 NTFS application semantics
2.0 NFS Append
2.1 separate read & write mappings; the ability to punch a hole
2.2 Extensible backend mappings
2.3 Group operations
2.4 Clustered server implementations
2.5 Client modified layouts

----------------------------------------

[0.0 Defining Requirements]: What is the scope of requirements subgroup
doing and how is it related to the ops subgroup discussions?

I am beginning to see a significant difference between a "problem
statement" document and a "requirements" document. I believe that in a
problem statement we can make a strong case for a set of properties and
applications that are currently underserved in NFSv4, and a direction
that could in one or more steps resolve some or all of the problem.
Alternatively I am coming to see the detailed requirements as a
compendium of the most contentious and impactful issues, how they were
argued and what resolution was accepted. I can see the problem
statement getting done before we have sorted out all the hard problems,
or even run into all of them, so it is a good document for establishing
our interests in the IETF. But I suspect that the requirements
document stays open well into agreement on the specification issues.

For comparison, the first NFSv4 document was called "Design
Considerations" (rfc2624): This document is to cover the "limitations
and deficiencies of NFS version 3". This document will also be used as
a mechanism to focus discussion and avenues of investigation as the
definition of NFS version 4 progresses. Therefore, the contents of
this document cover the general functional/feature areas that are
anticipated for NFS version 4.

I propose that what we have started into in the requirements subgroup
is the problem statement, and that we should be careful to not let it
get bogged down in the longer term requirements resolutions.

----------------------------------------

[1.0 Minimalism]: How much additional functionality do we sacrifice to
limit the changes we seek in NFSv4?

On one hand, some have said that getting to one true file system, with
the high performance and the manageability of federated systems that
might come with out-of-band access, is worth not matching *every*
feature of all existing out-of-band file systems with this first set of
extensions to NFSv4. That we should bite off what we can do quickly,
correctly, with a clear incremental value to NFSv4, and roadmap more
aggressive changes that could bog us down, or introduce so much
complexity that interoperability becomes elusive. And that we should
be mindful of the reception we may get from the IETF NFS working group
if we *appear* to use out-of-band as an excuse to ask for a brace of
changes in other aspects of NFSv4.

On the other hand, the other out-of-band file systems that are
inspiring the evolution of NFSv4 have customers that may not accept any
backward sets in an evolution to NFSv4. This could create the need to
develop, carry and differentiate all the diverse one-off out-of-band
files systems plus a new out-of-band NFSv4. Some think it makes more
sense to go far enough with this first NFSv4 to simplify the
marketplace by making it reasonable for various vendors to
deprecate/end-of-life/begin to wean from their proprietary offering.

While it is certainly conceivable that we could be designing a roadmap
of solutions in detail from the start, communication among standards
bodies is hard enough without the challenge of designing specs for both
with and without a requirement.

This is a central issue in defining the requirements for out-of-band
NFSv4, or at least for defining the scope of the first set of
extensions.

JS: I am afraid that this text makes achieving compliance with
existing out-of-band filesytems sound more complex than it might be. I
see several items that we should strive to keep even in a minimalist
set of requirements:
• attribute set rich enough to enable expressing the attributes of
the major local-filesytems (Unix brands and Windows)
• access control that accommodates the access control mechanisms of
the major local-filesytems and some of the popular distributed
file-systems (AFS?)
• coherency mechanisms that enable vendors to optionally implement
the two major flavor of coherent file access:
◦ completely coherent
◦ close-to-open coherent
None of those seem to me as involving major departures from NFSv4.

----------------------------------------

[1.1 Proxying]: Operations/work that can only be done out-of-band vs
alternative access through the NFSv4 server for all operations/work

On one hand, some suggest that a set of out-of-band clients should not
have to also have a data path through the NFSv4 metadata server. One
reason is that customers may not tolerate the large variability in
performance between out-of-band (when the going is good) and in-band
(when the server chooses not to grant or to take away a delegation)
accesses. Another reason, and I paraphrase someone else here, is that
it is possible to construct out-of-band metadata servers that do not
have access to the data servers except through the clients -- I
encourage the source of this scenario to replace my paraphrasing with a
correct use case, because I find it odd to design for file servers that
do not have access to the data servers.

On the other hand, others have suggested that any access or work that a
client can do out-of-band should be possible with one or more commands
applied to the metadata server's data path. This has been proposed for
coping with recalled delegations, including concurrent writing by
multiple clients; retry after client access errors, provided adequate
idempotency of out-of-band operations; and many alternative
implementations of out-of-band clients, including legacy clients that
use out-of-band never or rarely.

I think this is a topic that should be argued one way or the other in
the requirements document. Use cases and examples in other systems
would be best.

[1.1.0 Legacy proxying]: an NFS-v4.x server must be able to execute the
full NFS-v4.0 or NFS-v4.1 protocol.

JS: Is it legal for a "compliant server" to have serving data disabled
by a local administrative function (the old "must implement but may
use")? Otherwise an organization that wants to discourage use of data
serving through the metadata server has very little it can do to
enforce policy in a way that will not affect other clients (it may do
serve poorly but this still affects other clients).

[1.1.1 Strict proxying]: does an NFS-v4.x server have to be able to
execute exactly the wire packet that an NFS-v4.x client might have sent
to a SBC/OSD/NFS data server?

This captures the notion that a metadata server must also be a
store-and-forward proxy for every data server it manages. It requires
NFS-v4.x servers implement SCSI SBC over FC, if their data servers
implement it; and the same for objects and files.

This only makes sense to me for NFS data servers. And it is not what I
intended in my prior summary, although it is a relevant question. I
would say that pNFS requirements not require Strict Proxying.

[1.1.2 Functional proxying]: a file transformation achievable by an
NFS-v4.x client using a set of data server operations must be a
equivalently achievable using a (probably different) set of NFS-v4.x
server operations

This is the topic I intended to address in the last email. I believe
Dave is arguing that even with metadata servers that do not have access
to their data servers, the vendor of such a metadata server can
construct a proprietary protocol for the metadata server to (strict)
proxy data server accesses through clients that do have data server
access. I am not comfortable making up a counter to this, so I exhort
those that want a metadata server without data server access to speak
up if they disagree.

More on proxying -- suppose that a metadata server is asked to do reads
or writes and it would rather not do this work (because it is busy or
because its connection to storage is not as good as other nodes) -- can
it "refer" the request to another server that is in a better position
(by load or connectivity to storage) to do the work -- like a file
system referral

[1.1.3 Recovery proxying]: a file transformation begun by an NFS-v4.x
client using a set of data server operations, but interrupted before
completion, must be equivalently completable using a (probably
different) set of NFS-v4.x server operations

Some have suggested that having this property will greatly simplify the
amount of spec that is devoted to out-of-band error recovery. Others
have commented that a simple way to achieve this would be to require
that all operations on data servers should be idempotent.


----------------------------------------

[1.2 Cache consistency]: NFSv4 delegations are not about client cache
consistency; does out-of-band access require stronger cache consistency
than NFSv4 provides

NFSv4 cache consistency is a client function, based on testing file
attributes on open and close. While a client holds a delegation, its
users can close and reopen a file without recourse to the server, so
inside a delegation a client cache contents for that file must be valid
and up to date. However, a client cannot mandate getting a delegation
on open, it must immediately (approximately) give up a delegation if it
is recalled and a client has no way to reacquire a delegation on an
open file after that delegation has been recalled. So we must not
confuse delegations with strong cache consistency.

Many of the various proprietary out-of-band file systems have much
stronger client cache consistency, involving more different types and
interactions of cache callbacks. Some of these differences may have
been motivated by desire for differentiation, some by apps underserved
by NFS cache consistency semantics, and some by the long standing
designer belief that stronger semantics are theoretically better.

The question we must resolve, and argue in the requirements document,
is whether out-of-band access only within the NFSv4 cache consistency
and delegations is not sufficient, why and how much more must/should be
added before such a product is valuable.

I think that application use cases should be discussed. And I caution
us that most of us are the converted, coming to NFSv4 from one of these
proprietary file systems, so gaining agreement amongst ourselves easily
is not a good predictor of the challenge of gaining the agreement of
the NFS standards working group.

DB: HighRoad uses the same FMP protocol to provide both NFS-style
close-to-open consistency for NFS clients and the stronger forms of
consistency required by CIFS - as long as the server knows what clients
have which access rights to what blocks, cache consistency strength
comes down to server implementation decisions about what outstanding
access rights conflict with a new request. We've actually built server
prototypes that provide stronger consistency for NFS without change to
either the FMP protocol or clients, but the shipped product only
provides NFS-style consistency for NFS.

JS: I think that if we work towards common structures for mapping and
caching we might end up letting the implementer or user decide about
the consistency level he wants and support all. We certainly can't
afford to ignore those that require consistency beyond the
close-to-open level conventionally associated with NFS especially when
there are distributed or cluster file-systems that got their customers
use it today (GPFS, SAN-FS).

DN: mapping and caching information are distinct pieces of information
in that one can change while the other does not, and if we decide to
treat these two pieces of information as the same, we are going to be
doing some silly things. If I write in place then my data is changing
but the layout isn't. If my caching strategy to deal with multiple
writers is not to cache data (which is for many applications quite
reasonable) and I need the mapping information to access the data
servers directly, then I don't want to have my layout delegations
recalled because the data is changing. Because I am not caching I
don't need or want data delegations but do need layout delegations and
should be allowed to keep them when the layout is not changing.

DN: In many envirnments data and layout delegations will be recalled
together and so it makes sense not to have these so distinct that, for
example, I am doing separate recall messages for each piece of data.
But in other environments, it may make a lot of sense for me to have
one guarantee (the layout won't change) and not the other (the data
won't change).


----------------------------------------

[1.3 Delegation promotion & reacquisition]: must/should NFSv4 offer
mechanisms for clients to possess a delegations more than once per open

Delegations in NFSv4 are new, and came with significant concern about
lots of complexity for not much performance, as they may do as little
as avoid the client waiting for one round trip to the server on open.
So, as described above with respect to cache consistency, the
limitations on delegations can mean great difficulties for clients
having performance requirements calling for out-of-band access mostly,
or exclusively.

DB: Yes, and this is a strong reason for separating "layout"
delegations from the existing "data" delegations, IMHO. Consider a web
or video server that is caching file opens for performance reasons - if
updating the content underneath the server makes it impossible to get
the direct access ("layout") delegations back, the result is that one
has to shut down and restart all the servers after the content update
in order to restore performance. The sysadmin responsible for this
annoying work will want to tar-and-feather the system designers who
made it necessary (that would be us if we get this wrong ...).

So we have begun to propose mechanisms for clients to be more
aggressive about seeking, obtaining, reobtaining after a recall, and
even waiting for a signal that a denied delegation is now available.
This could lead to discussions of transitioning from a write delegation
to a read delegation, rather than no delegation, when a second
delegation is requested.

We all know, or can imagine, plenty of mechanism for this type of logic
-- after all, it is not far from what some systems do for cache
consistency. But all of this comes with complexity, that threat to
interoperability, and chips away at minimalism.

DN: Downgrade in particular needs special attention. If I have a write
delegation, then DELEGRETURN followed by DELEG_ASK (read), means that
the data I have cached is not valid and may need to be fetched again,
whereas a straight downgrade means nobody has ever had a conflicting
delegation, and so allows me to do more. There are similar
considerations for downgrade of a write delegation to a group-write
(aka CW) delegation and downgrade of a group-write to a read
delegation.

----------------------------------------

[1.4 Layout delegations]: can/should layout metadata "ride" on NFSv4
delegations or are new "layout" delegations needed

If the delegations currently provided by NFSv4 are insufficient, for
reasons of cache consistency or the needed to be able to reacquire a
delegation in order to ensure that performance degradations can be
limited, then some are suggesting that rather than proposing to change
the semantics of the current delegations, we add new delegations
tailored to the purpose, so called layout delegations.

This is consistent with the advice we heard Dec 4 that it is much
easier, and more welcomed, to add new things to NFSv4 than to change
what is already there.

Assuming that in response to requirements arguments, we find the
existing NFSv4 delegations insufficient, then I think this topic is an
implementation issue for the NFSv4 operations subgroup. But I for one
would like to err on the side of fewer NFSv4 changes and slightly
weaker semantics, where possible.

I'd summarize a lot of discussion to say that we need new operations
for layout delegations. And many are suggesting that these layout
delegations should be able to cover only portions of a file, and not
imply anything about the data consistency.

----------------------------------------

[1.5 Concurrent write]: write delegations now are held by exactly one
client, if any; should/must NFS support multiple clients holding
concurrent layout delegations

One specifically excluded use case for out-of-band access is concurrent
write, actually concurrent read and write, or write and write, by
different clients. This is normally associated with expensive client
cache consistency algorithms, but for our purposes here, the issue is
managing the ordering, grouping/atomicity, and failure recovery of
changes on data servers, not updating/invalidating the contents of
client caches. It is certainly feasible to address out-of-band
concurrent writing to data servers without addressing client cache
consistency, if we so choose.

I believe three folks with experience with different existing file
systems referred to databases as the use case for needing concurrent
write.

I believe out-of-band concurrent write is an important use case to call
out carefully, because a ambitious implementation of it could lead to a
lot of state-maintaining messaging.

Some have said that, allowing multiple clients to hold the same lock is
a current need in NFSv4, and that a solution to this can provide the
infrastructure for concurrent delegation of layout maps for read and
overwrite (when growing the size of the file is not needed). This
seems like a good operations discussion topic.

DB: I understand the value of this to the self-coordinating HPC
applications, but would like to see this functionality specified
(assuming it is specified) as a cleanly separable option, as I think
the desire to self-coordinate a shared write delegation will be limited
to a small number of application spaces, like HPC. I also note Gary's
comment that it's sufficient for parallel write to work in the
non-overlapping case, which does not require any new concurrent write
delegation as long as each client can hold an exclusive write
delegation for its range.

JS: I agree with Gary that handling efficiently the "good-path" (e.g.,
concurrent writers with non-overlapping regions, or single writer with
readers needing only close-to-open consistency) is essential. To me it
looks as all those could be better handled if we could approach mapping
and caching concurrently.

----------------------------------------

[1.6 Map revocation]: can/must the NFS server be able to revoke a
client's use of a map, and enforce no future use (fence off the map)

NFSv4 delegations allow a broken or malicious client no additional
power to damage the stored file system because state changes must go
through the server. But a delegated layout map that is held and used
by a broken or malicious client after the delegation has been recalled
could damage the stored file system in a way that the server, by not
being on the data path, has no obvious way to protect against.

So there has been a call for the ability for the server to fence out a
client or enforce the revocation of a client's access to a specific
file or filesystem. At first glance all three data server
technologies, blocks, objects and files have some solution (blocks: lun
masking/acls or SAN zoning; objects: capability revocation, key
replacement; files: component file acls, volatile file handles). The
scope and cost of each of these mechanisms maybe dramatically
different.

Some would say that this is going to end up being a differentiating
property of the choice of underlying data server. For example, many
would say that in systems that allow out-of-band block access, the
client machines must be trustworthy to respect the delegation recall
message (and lease timeouts). Others would object to this weakening of
the NFS server integrity.

I also see this as a requirements argument.

DB: I tend to take the former position, as if one cannot fence off
client access, not allowing access to untrustworthy clients becomes a
fallback. In the block world, while mechanisms exist to fence off
access, standard means of invoking them are somewhat immature.

----------------------------------------

[1.7 Separability]: Independence vs co-dependence of layout metadata
access and NFSv4

On one hand, simple "an address per block/object/file" maps could be
represented as an array of NFSv4 attributes, manipulated using existing
NFSv4 attribute accessing commands, so to reduce the amount of change
to NFSv4.

On the other hand, particularly for block maps of large files composed
of extents, simple array indexing may be cumbersome and much bulkier
than necessary.

And also on the other hand, some suggest that it is desirable for the
metadata access protocol to be separate from NFSv4 attribute access, so
that the same metadata access protocol might be reusable under other
file services.

I think this topic would benefit from proposed metadata formats,
particularly the SBC (block) maps.

----------------------------------------

[1.8 NTFS application semantics]: applications coded to NTFS semantics
are different from those coded to POSIX and UNIX semantics

NFS originated as a exported file system, whose semantics were defined
by the underlying local filesystem on the file server. But since that
local filesystem has almost always been UNIX or UNIX like, customers
have come to think of NFS semantics as a well defined thing, not far
from UNIX semantics (but with a customary list of POSIX exceptions).
The semantics NTFS presents to applications using its storage is
different in significant ways.

Some of us see an evolution to better support for clients trying to
support NTFS well to be very desirable. Others see chasing this as
more than the NFS group as a whole is likely to bite off.

This, and any other issues about wire protocol support for important
semantics needed by different application file system interfaces
(middleware exploited API extensions in databases or parallel
programming systems such as MPI-IO) are also requirements topics.

DB: IMHO, this is an orthogonal tarpit we should stay out of. I
strongly believe that trying to extend NFSv4 so it can be just as good
as CIFS for applications coded to Windows APIs should be someone else's
problem.

----------------------------------------

[2.0 NFS Append]: is append semantics part of this effort, or separable?

Folks think it is interesting. But it can be taken to NFSv4 directly,
and not necessarily as a part of the pNFS extensions.

DN: I am a big fan of this, but my experience is that it can be
controversial. I'm not sure I understand why but there are some people
who really don't like it. I think it may have to do with the fact that
some people are uncomfortable with the idea that you can do a write
(and append is a write) and have no way to valdily reflect that write
your buffer cache.

----------------------------------------

[2.1 separate read & write mappings; the ability to punch a hole]

SAN.FS extents come with both read and write extent mappings and block
usage bitmaps. The separate read and write mappings allow for clients
to participate in copy-on- write functionality - IIRC, Craig has
described this.

[2.1.1]: Should protocol include support for client participation in
copy-on-write?

A motivation for the separate arrays of block usage bits appears to be
allowing clients to turn file data into holes (e.g., AIX fclear system
call).

[2.1.2]: Is the ability to turn valid data into a file "hole" (e.g.,
AIX fclear) at the client important to support?

FMP does not support separate read mappings or usage bitmaps, and hence
is not capable of involving clients in copy-on-write or allowing a
client to turn valid data into a file "hole".

DN: If we do it, should be an NFSv4 server operation too because the
space recovery benefits are not unique to a block backend.

There was also some confusion between holes in the data and holes in
the layout map. Writing into a hole in the data changes the data, so
any other client mapping that same region of the file sees or does not
see the change according to the cache consistency mechanism employed.
But if one client has a layout map with holes then data can be written
into these holes without recalling the map because the client cannot
assume or see anything about the missing part of the map.

That is, are maps delegated in part or only as a whole? I think I
heard strong support for delegation of map ranges as well as maps for
whole files.

----------------------------------------

[2.2 Extensible backend mappings.]

The precedent for our proposed multiple backends in the one IETF
protocol is the GSS security framework and extensible flavors.

One backend may be required, though I think the required backend is the
NFSv4 metadata server that has to be able to do data access for a
legacy client.

----------------------------------------

[2.3 Group operations]

A cluster can be seen as one computer, so we may need to explore group
operations; that is, a clientid may cover all the CPUs in a "cluster
computer."

This brings up a client with many fault domains, which is not generally
a problem in NFS today. Will we need a NFSv4 metadata filer to deliver
a callback to an alternative or failover address in some circumstances?

DN: Suppose I have an application cluster with 1K nodes and I say "This
is really a powerful computer with a thousand (maybe two thousand)
CPU's". Now that's marketing bullshit but it isn't exactly false. I
think the point is that as far as the NFS server is concerned, whether
the computer that is talking to it is "really" a computer, i.e. it has
CPU's sharing memory, or is only a computer qua marketing bullshit,
i.e. a collection of cpu's that don't share memory, that use other
methods to co-ordinate common activities, doesn't matter. All the
server sees are the requests made and if the cluster represents itself
as a single machine (i.e. in V4 does a single SETCLIENTID or in v4.1
maintains many connections bound to a single session), it is one. The
server doesn't see the cluster's memory architecture. It sees an open
and then use of that stateid. The fact that it comes over a different
IP address doesn't disqualify it. A server might have options to check
that (as a matter of security) but it isn't part of the protocol and we
already have clients with multiple IP addresses. Having a thousand of
them is a difference of degree (and may pose implementation issues) but
I don't see a real protocol issue.

BH: This is a "clustered" implementation of a nfsv4 client in which the
v4 drivers in the client cluster are cooperating and propagating state
(e.g. file handles, stateids) among each other. I believe that the
server should not be able to distinguish such client from a multi-homed
client that may have several ip addresses. In the nfsv4 sessions world
a (clustered) client may open multiple connections to the server that
are associated with the same session - this will make life for such
client even easier, I hope.

----------------------------------------

[2.4 Clustered server implementations.]

Talking about clustered clients brought up the issue of clustered
servers. We do not think the server-to-server protocols needed for
implementing clustered servers should be a part of the client protocols
we are discussing herein. This might change if client protocols have
to understand anything other than filesystem migration and server
failover, as they do now.

That is, pNFS extensions are not necessarily part of a solution to a
standard clustered filer protocol.

----------------------------------------

[2.5 Client modified layouts]

BH: A write layout delegation (which I'm not proposing) could be a
delegation to modify the *layout*, it is theoretically possible to give
a single client such exclusive access to the file layout but I think
this is going one step too far from where we are right now and can be
problematic with respect to interoperability.

DN: I agree. This would add problems. Unless there is a big payoff,
I'd stay away from it.

----------------------------------------

From Brian.Pawlowski@netapp.com Wed Mar 31 09:02:28 2004
Return-Path: <beepy@netapp.com>
X-Sender: beepy@netapp.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 70593 invoked from network); 31 Mar 2004 17:02:24 -0000
Received: from unknown (66.218.66.216)
by m11.grp.scd.yahoo.com with QMQP; 31 Mar 2004 17:02:24 -0000
Received: from unknown (HELO mx01.netapp.com) (198.95.226.53)
by mta1.grp.scd.yahoo.com with SMTP; 31 Mar 2004 17:02:24 -0000
Received: from frejya.corp.netapp.com (frejya [10.57.157.119])
by mx01.netapp.com (8.12.10/8.12.10/NTAP-1.4) with ESMTP id i2VGwYZh010906
for <pnfs-reqs@yahoogroups.com>; Wed, 31 Mar 2004 08:58:34 -0800 (PST)
Received: from tooting-fe.eng.netapp.com (tooting-fe.eng.netapp.com [10.56.10.118])
by frejya.corp.netapp.com (8.12.9/8.12.9/NTAP-1.5) with ESMTP id i2VGwYbc010248
for <pnfs-reqs@yahoogroups.com>; Wed, 31 Mar 2004 08:58:34 -0800 (PST)
Received: (from beepy@localhost)
by tooting-fe.eng.netapp.com (8.11.7p1+Sun/8.11.6) id i2VGwXO28367
for pnfs-reqs@yahoogroups.com; Wed, 31 Mar 2004 08:58:33 -0800 (PST)
Message-Id: <200403311658.i2VGwXO28367@tooting-fe.eng.netapp.com>
In-Reply-To: <E46D87AE-8135-11D8-BB3C-000A95A94F04@panasas.com> from Garth Gibson at "Mar 28, 4 11:02:27 pm"
To: pnfs-reqs@yahoogroups.com
Date: Wed, 31 Mar 2004 08:58:33 -0800 (PST)
X-Mailer: ELM [version 2.4ME++ PL40 (25)]
MIME-Version: 1.0
Content-Type: text/plain; charset=US-ASCII
Content-Transfer-Encoding: 7bit
X-eGroups-Remote-IP: 198.95.226.53
X-eGroups-From: Brian Pawlowski <beepy@netapp.com>
From: Brian Pawlowski <Brian.Pawlowski@netapp.com>
Subject: Re: [pnfs-reqs] FACE-TO-FACE pNFS working meeting: 3/31 9am-12:30 Grand Hyatt, Dolores Rm, SF
X-Yahoo-Group-Post: member; u=169504717
X-Yahoo-Profile: brianpawlowski

ADVERTISEMENT
I'm a little behind - will be there shortly.

> Reminder, folks,
>
> Before the upcoming pNFS BOF at FAST Wed Mar 31, 12:30 - 2pm, in the
> Dolores room of the Grand Hyatt hotel in San Francisco, the pNFS
> community reached by these mailing lists will be meeting for a working
> session, 9am - 12:30pm.
>
> Tentative agenda:
>
> - Requirements update and discussion (update coming from Garth)
> - Operations update and discussion (update coming from Brent)
> - Use cases discussion (initial use cases coming from Andy)
>
> See you there!
> garth
>
>
>
>
>
> Yahoo! Groups Links
>
>
>
>
>


From dhildebz@eecs.umich.edu Wed Mar 31 11:22:28 2004
Return-Path: <dhildebz@eecs.umich.edu>
X-Sender: dhildebz@eecs.umich.edu
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 90049 invoked from network); 31 Mar 2004 18:08:40 -0000
Received: from unknown (66.218.66.218)
by m14.grp.scd.yahoo.com with QMQP; 31 Mar 2004 18:08:39 -0000
Received: from unknown (HELO smtp.eecs.umich.edu) (141.213.4.43)
by mta3.grp.scd.yahoo.com with SMTP; 31 Mar 2004 18:08:39 -0000
Received: from oemcomputer (da001d1735.stl-mo.osd.concentric.net [66.236.102.199])
(authenticated bits=0)
by smtp.eecs.umich.edu (8.12.11/8.12.11) with ESMTP id i2VI7X3e003579
(version=TLSv1/SSLv3 cipher=RC4-MD5 bits=128 verify=NO);
Wed, 31 Mar 2004 13:07:36 -0500
Message-ID: <002101c4174a$bc39a650$06396a83@oemcomputer>
To: <pnfs-ops@yahoogroups.com>, <pnfs-reqs@yahoogroups.com>
Date: Wed, 31 Mar 2004 13:05:11 -0500
MIME-Version: 1.0
Content-Type: text/plain;
charset="iso-8859-1"
Content-Transfer-Encoding: 7bit
X-Priority: 3
X-MSMail-Priority: Normal
X-Mailer: Microsoft Outlook Express 6.00.2800.1158
X-MimeOLE: Produced By Microsoft MimeOLE V6.00.2800.1165
X-Spam-Status: No -- Hits: -4.901 Required: 5
X-Spam-Summary: BAYES_00
X-Scanned-By: MIMEDefang 2.40
X-eGroups-Remote-IP: 141.213.4.43
From: "Dean Hildebrand" <dhildebz@eecs.umich.edu>
Subject: QoS papers
X-Yahoo-Group-Post: member; u=169352062
X-Yahoo-Profile: seattleplus

Here are 2 papers on QoS, I'm sure there are many others.
http://www.usenix.org/events/fast03/tech/lumb.html
http://www.almaden.ibm.com/StorageSystems/autonomic_storage/clockwork/index.shtml

Dean



From garth@panasas.com Wed Mar 31 15:46:21 2004
Return-Path: <garth@panasas.com>
X-Sender: garth@panasas.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 26118 invoked from network); 31 Mar 2004 23:46:14 -0000
Received: from unknown (66.218.66.172)
by m19.grp.scd.yahoo.com with QMQP; 31 Mar 2004 23:46:14 -0000
Received: from unknown (HELO PIKES.panasas.com) (65.194.124.178)
by mta4.grp.scd.yahoo.com with SMTP; 31 Mar 2004 23:46:14 -0000
Received: from [127.0.0.1] ([172.17.19.3]) by PIKES.panasas.com with SMTP (Microsoft Exchange Internet Mail Service Version 5.5.2653.13)
id HZGBCJAD; Wed, 31 Mar 2004 18:46:07 -0500
Mime-Version: 1.0 (Apple Message framework v612)
Content-Transfer-Encoding: 7bit
Message-Id: <8A31B552-836D-11D8-BB3C-000A95A94F04@panasas.com>
Content-Type: text/plain; charset=US-ASCII; format=flowed
To: pnfs-reqs@yahoogroups.com
Date: Wed, 31 Mar 2004 15:45:50 -0800
X-Mailer: Apple Mail (2.612)
X-eGroups-Remote-IP: 65.194.124.178
From: Garth Gibson <garth@panasas.com>
Subject: FAST BOF was a crowded success
X-Yahoo-Group-Post: member; u=169457820
X-Yahoo-Profile: garth_a_gibson

ADVERTISEMENT
click here
Thanks to all participants, today's pNFS BOF packed the room. We asked
for space for 50, thinking that would be many more than would show up.
Instead we had more than 85 people in the room, nearly all who stayed
for the full 90 mins.

Our speakers, as usual, were informative and enthusiastic. Most of the
questions had to do with our commitment to supporting NFSv4 completely,
that the granularity of layout delegation was in fact smaller than the
filesystem, and that the semantics of file attributes like mtime and
EOF might be vague as specific times while the file is being changed
external to the metadata server. Our Oracle guest speaker, Sumanta
Chatterjee, stirred up the room while said "good start and while you
are at it, please look at full user-level IO, async IO, list IO,
exposed layouts, batched interrupts, lower system CPU usage."

With respect to our broad goal of spreading the message, raising the
buzz and seeking additional participants, the first two look really
well met. The third will be measurable on these mailing lists.

garth

From Brian.Pawlowski@netapp.com Wed Mar 31 17:03:48 2004
Return-Path: <beepy@netapp.com>
X-Sender: beepy@netapp.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 51103 invoked from network); 1 Apr 2004 01:03:46 -0000
Received: from unknown (66.218.66.167)
by m1.grp.scd.yahoo.com with QMQP; 1 Apr 2004 01:03:46 -0000
Received: from unknown (HELO mx01.netapp.com) (198.95.226.53)
by mta6.grp.scd.yahoo.com with SMTP; 1 Apr 2004 01:03:45 -0000
Received: from hawk.corp.netapp.com (hawk [10.57.156.122])
by mx01.netapp.com (8.12.10/8.12.10/NTAP-1.4) with ESMTP id i3113jZh002905
for <pnfs-reqs@yahoogroups.com>; Wed, 31 Mar 2004 17:03:45 -0800 (PST)
Received: from tooting-fe.eng.netapp.com (tooting-fe.eng.netapp.com [10.56.10.118])
by hawk.corp.netapp.com (8.12.9/8.12.9/NTAP-1.5) with ESMTP id i3113jTR012799
for <pnfs-reqs@yahoogroups.com>; Wed, 31 Mar 2004 17:03:45 -0800 (PST)
Received: (from beepy@localhost)
by tooting-fe.eng.netapp.com (8.11.7p1+Sun/8.11.6) id i3113iw20019;
Wed, 31 Mar 2004 17:03:44 -0800 (PST)
Message-Id: <200404010103.i3113iw20019@tooting-fe.eng.netapp.com>
In-Reply-To: <8A31B552-836D-11D8-BB3C-000A95A94F04@panasas.com> from Garth Gibson at "Mar 31, 4 03:45:50 pm"
To: pnfs-reqs@yahoogroups.com
Date: Wed, 31 Mar 2004 17:03:44 -0800 (PST)
Cc: pnfs-reqs@yahoogroups.com
X-Mailer: ELM [version 2.4ME++ PL40 (25)]
MIME-Version: 1.0
Content-Type: text/plain; charset=US-ASCII
Content-Transfer-Encoding: 7bit
X-eGroups-Remote-IP: 198.95.226.53
X-eGroups-From: Brian Pawlowski <beepy@netapp.com>
From: Brian Pawlowski <Brian.Pawlowski@netapp.com>
Subject: Re: [pnfs-reqs] FAST BOF was a crowded success
X-Yahoo-Group-Post: member; u=169504717
X-Yahoo-Profile: brianpawlowski

I counted up to 100 and then more people came in.

> Thanks to all participants, today's pNFS BOF packed the room. We asked
> for space for 50, thinking that would be many more than would show up.
> Instead we had more than 85 people in the room, nearly all who stayed
> for the full 90 mins.
>
> Our speakers, as usual, were informative and enthusiastic. Most of the
> questions had to do with our commitment to supporting NFSv4 completely,
> that the granularity of layout delegation was in fact smaller than the
> filesystem, and that the semantics of file attributes like mtime and
> EOF might be vague as specific times while the file is being changed
> external to the metadata server. Our Oracle guest speaker, Sumanta
> Chatterjee, stirred up the room while said "good start and while you
> are at it, please look at full user-level IO, async IO, list IO,
> exposed layouts, batched interrupts, lower system CPU usage."
>
> With respect to our broad goal of spreading the message, raising the
> buzz and seeking additional participants, the first two look really
> well met. The third will be measurable on these mailing lists.
>
> garth
>
>
>
>
>
> Yahoo! Groups Links
>
>
>
>
> 

From garth@panasas.com Sat Apr 03 20:31:03 2004
Return-Path: <garth@panasas.com>
X-Sender: garth@panasas.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 56257 invoked from network); 4 Apr 2004 04:31:00 -0000
Received: from unknown (66.218.66.218)
by m14.grp.scd.yahoo.com with QMQP; 4 Apr 2004 04:31:00 -0000
Received: from unknown (HELO PIKES.panasas.com) (65.194.124.178)
by mta3.grp.scd.yahoo.com with SMTP; 4 Apr 2004 04:31:00 -0000
Received: from [127.0.0.1] ([172.17.19.3]) by PIKES.panasas.com with SMTP (Microsoft Exchange Internet Mail Service Version 5.5.2653.13)
id 2B56KCPR; Sat, 3 Apr 2004 23:30:56 -0500
Mime-Version: 1.0 (Apple Message framework v612)
Content-Transfer-Encoding: 7bit
Message-Id: <D4E6D390-85F0-11D8-A34F-000A95A94F04@panasas.com>
Content-Type: text/plain; charset=US-ASCII; format=flowed
To: pnfs-reqs@yahoogroups.com
Date: Sat, 3 Apr 2004 20:30:41 -0800
X-Mailer: Apple Mail (2.612)
X-eGroups-Remote-IP: 65.194.124.178
From: Garth Gibson <garth@panasas.com>
Subject: planning for our next face-to-face
X-Yahoo-Group-Post: member; u=169457820
X-Yahoo-Profile: garth_a_gibson

We talked about our next face to face being in Ann Arbor during the
week of the NFSv4 bake-a-thon. in the week of June 7-11. David
mentioned the conflict with T11 in Chicago, but thought it might be
workable.

Andy, Peter -- what are the best days in that week -- I think we are
looking for a one day meeting. I personally prefer the 7th or 8th.

Also, I encourage us all to post our notes from the face-to-face and
BOF.

thanks
garth

From pcorbett@netapp.com Sun Apr 04 10:20:22 2004
Return-Path: <Peter.Corbett@netapp.com>
X-Sender: Peter.Corbett@netapp.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 61451 invoked from network); 4 Apr 2004 17:20:21 -0000
Received: from unknown (66.218.66.217)
by m6.grp.scd.yahoo.com with QMQP; 4 Apr 2004 17:20:21 -0000
Received: from unknown (HELO mx01.netapp.com) (198.95.226.53)
by mta2.grp.scd.yahoo.com with SMTP; 4 Apr 2004 17:20:21 -0000
Received: from hawk.corp.netapp.com (hawk [10.57.156.122])
by mx01.netapp.com (8.12.10/8.12.10/NTAP-1.4) with ESMTP id i34HKGZh006528
for <pnfs-reqs@yahoogroups.com>; Sun, 4 Apr 2004 10:20:16 -0700 (PDT)
Received: from svlexc01.hq.netapp.com (svlexc01.corp.netapp.com [10.57.156.135])
by hawk.corp.netapp.com (8.12.9/8.12.9/NTAP-1.5) with ESMTP id i34HKGTR001543
for <pnfs-reqs@yahoogroups.com>; Sun, 4 Apr 2004 10:20:16 -0700 (PDT)
content-class: urn:content-classes:message
MIME-Version: 1.0
Content-Type: text/plain;
charset="us-ascii"
Content-Transfer-Encoding: quoted-printable
X-MimeOLE: Produced By Microsoft Exchange V6.0.6249.0
Date: Sun, 4 Apr 2004 10:20:08 -0700
Message-ID: <C8CF60CFC4D8A74E9945E32CF096548A016F51BF@silver.nane.netapp.com>
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
Thread-Topic: [pnfs-reqs] planning for our next face-to-face
Thread-Index: AcQZ/bxMb4ySfse8T2CP52qiBqibXgAa03bg
To: <pnfs-reqs@yahoogroups.com>
X-eGroups-Remote-IP: 198.95.226.53
X-eGroups-From: "Corbett, Peter" <Peter.Corbett@netapp.com>
From: "Corbett, Peter" <pcorbett@netapp.com>
Subject: RE: [pnfs-reqs] planning for our next face-to-face
X-Yahoo-Group-Post: member; u=44152959
X-Yahoo-Profile: pfcorbett2004

I think any of those days is fine with me.

-----Original Message-----
From: Garth Gibson [mailto:garth@panasas.com]
Sent: Saturday, April 03, 2004 8:31 PM
To: pnfs-reqs@yahoogroups.com
Subject: [pnfs-reqs] planning for our next face-to-face

We talked about our next face to face being in Ann Arbor during the
week of the NFSv4 bake-a-thon. in the week of June 7-11. David
mentioned the conflict with T11 in Chicago, but thought it might be
workable.

Andy, Peter -- what are the best days in that week -- I think we are
looking for a one day meeting. I personally prefer the 7th or 8th.

Also, I encourage us all to post our notes from the face-to-face and
BOF.

thanks
garth




Yahoo! Groups Links

From black_david@emc.com Mon Apr 05 07:55:42 2004
Return-Path: <Black_David@emc.com>
X-Sender: Black_David@emc.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 21349 invoked from network); 5 Apr 2004 14:55:40 -0000
Received: from unknown (66.218.66.166)
by m14.grp.scd.yahoo.com with QMQP; 5 Apr 2004 14:55:40 -0000
Received: from unknown (HELO MAHO3MSX2.corp.emc.com) (128.221.11.32)
by mta5.grp.scd.yahoo.com with SMTP; 5 Apr 2004 14:55:39 -0000
Received: by maho3msx2.corp.emc.com with Internet Mail Service (5.5.2653.19)
id <H9P6L5MA>; Mon, 5 Apr 2004 10:55:14 -0400
Message-ID: <B459CE1AFFC52D4688B2A5B842CA35EA7A582C@corpmx14.us.dg.com>
To: pnfs-reqs@yahoogroups.com
Date: Mon, 5 Apr 2004 10:55:03 -0400
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2653.19)
Content-Type: text/plain
X-eGroups-Remote-IP: 128.221.11.32
From: black_david@emc.com
Subject: RE: [pnfs-reqs] planning for our next face-to-face
X-Yahoo-Group-Post: member; u=82420288
X-Yahoo-Profile: dlb237

T11 will be finalizing the meeting map for June during this
week's meetings. Right now, Monday (June 7th) looks good, but
please wait until T11 finalizes their schedule before committing
to Monday. I'll send email later this week when I know for
certain.

Thanks,
--David

> -----Original Message-----
> From: Garth Gibson [mailto:garth@panasas.com]
> Sent: Saturday, April 03, 2004 11:31 PM
> To: pnfs-reqs@yahoogroups.com
> Subject: [pnfs-reqs] planning for our next face-to-face
>
>
> We talked about our next face to face being in Ann Arbor during the
> week of the NFSv4 bake-a-thon. in the week of June 7-11. David
> mentioned the conflict with T11 in Chicago, but thought it might be
> workable.
>
> Andy, Peter -- what are the best days in that week -- I think we are
> looking for a one day meeting. I personally prefer the 7th or 8th.
>
> Also, I encourage us all to post our notes from the face-to-face and
> BOF.
>
> thanks
> garth
>
>
>
>
> Yahoo! Groups Links
>
>
>
>
> 

From andros@citi.umich.edu Mon Apr 05 07:57:33 2004
Return-Path: <andros@citi.umich.edu>
X-Sender: andros@citi.umich.edu
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 48032 invoked from network); 5 Apr 2004 14:19:30 -0000
Received: from unknown (66.218.66.172)
by m10.grp.scd.yahoo.com with QMQP; 5 Apr 2004 14:19:30 -0000
Received: from unknown (HELO citi.umich.edu) (141.211.133.111)
by mta4.grp.scd.yahoo.com with SMTP; 5 Apr 2004 14:19:30 -0000
Received: from citi.umich.edu (citi.umich.edu [141.211.133.111])
by citi.umich.edu (Postfix) with ESMTP
id A5C26207D0; Mon, 5 Apr 2004 10:18:58 -0400 (EDT)
X-Mailer: exmh version 2.5 07/13/2001 with version: MH 6.8.3 #74[UCI]
To: pnfs-reqs@yahoogroups.com
Cc: andros@citi.umich.edu
In-reply-to: Your message of "Sat, 03 Apr 2004 20:30:41 PST."
<D4E6D390-85F0-11D8-A34F-000A95A94F04@panasas.com>
Mime-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Date: Mon, 05 Apr 2004 10:18:58 -0400
Message-Id: <20040405141858.A5C26207D0@citi.umich.edu>
X-eGroups-Remote-IP: 141.211.133.111
From: "William A.(Andy) Adamson" <andros@citi.umich.edu>
Subject: Re: [pnfs-reqs] planning for our next face-to-face
X-Yahoo-Group-Post: member; u=169434965

we'll figure out a schedule asap. there has also been talk of an interim IETF
NFSv4 working group meeting as we had last year....

-->Andy

> We talked about our next face to face being in Ann Arbor during the
> week of the NFSv4 bake-a-thon. in the week of June 7-11. David
> mentioned the conflict with T11 in Chicago, but thought it might be
> workable.
>
> Andy, Peter -- what are the best days in that week -- I think we are
> looking for a one day meeting. I personally prefer the 7th or 8th.
>
> Also, I encourage us all to post our notes from the face-to-face and
> BOF.
>
> thanks
> garth
> 

From bwelch@panasas.com Mon Apr 05 11:45:12 2004
Return-Path: <welch@panasas.com>
X-Sender: welch@panasas.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 22840 invoked from network); 5 Apr 2004 18:45:10 -0000
Received: from unknown (66.218.66.166)
by m19.grp.scd.yahoo.com with QMQP; 5 Apr 2004 18:45:10 -0000
Received: from unknown (HELO medlicott.panasas.com) (63.80.58.202)
by mta5.grp.scd.yahoo.com with SMTP; 5 Apr 2004 18:45:10 -0000
Received: from panasas.com (welch@localhost)
by medlicott.panasas.com (8.11.6/8.11.6) with ESMTP id i35IjAg22920;
Mon, 5 Apr 2004 11:45:10 -0700
Message-Id: <200404051845.i35IjAg22920@medlicott.panasas.com>
X-Authentication-Warning: medlicott.panasas.com: welch owned process doing -bs
X-Mailer: exmh version 2.6.3 04/02/2003 with nmh-1.0.4
To: pnfs-reqs@yahoogroups.com, pnfs-ops@yahoogroups.com
Cc: welch@panasas.com
X-URL: http://www.panasas.com/
X-Face: "HxE|?EnC9fVMV8f70H83&{fgLE.|FZ^$>@Q(yb#N,Eh~N]e&]=>
r5~UnRml1:4EglY{9B+
:'wJq$@c_C!l8@<$t,{YUr4K,QJGHSvS~U]H`<+L*x?eGzSk>XH\W:AK\j?@?c1o<k;j'Ei/UL)!*0
ILwSR)J\bc)gjz!rrGQ2#i*f:M:ydhK}jp4dWQW?;0{,#iWrCV$4~%e/3)$1/D
Mime-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Date: Mon, 05 Apr 2004 11:45:10 -0700
X-eGroups-Remote-IP: 63.80.58.202
X-eGroups-From: Brent Welch <welch@panasas.com>
From: Brent Welch <bwelch@panasas.com>
Subject: Notes from FAST pNFS ops discussion
X-Yahoo-Group-Post: member; u=169551413
X-Yahoo-Profile: brent_welch_1960

I thought we had a pretty good ops discussion at the FAST pnfs BOF.
Here are my notes - if anyone has additions, that would be great.
Andy mentioned to me that having a general writeup that gives the
complete picture would be great to have. I'm willing to update the
doc I've been sending around, and will take any/all feedback on it.

*This is a long email* If you have substantial comments on a particular
topic, I suggest you launch a new thread with the appropriate subject.

Notes from March 31, 2004

1. Write coordination

The most significant item from my point of view was the issue of write
coordination. Suppose the metadata server gives out a LAYOUT delegation so
the client can do writes to the file. How closely does the metadata
server monitor the write activity by the client? It turns out that
servers will have a range of desires.

In the block world, the metadata server will make block allocations ahead of
time to allow the writes to proceed. It wants to bound the time that
allocation is outstanding. To do that, it would like the client to release
the write layout and communicate back to the metadata server precisely what
blocks were used by the write. The new EOF position is also important to
keep current.

At the other extreme might be a load-balancing file server. The metadata
server may not care about the write lifetime once it has told the client
where the data for the file lives. It may want to let the client keep
the delegation even during times of file sharing, just so long as the
layout (i.e., the location of the data fork) doesn't change.

However, there are reasons that a file- or object-based system will want to
keep closer track of client writes. These are quota management,
complex aggregation schemes, and (controversially) buffer cache issues.

For quota, suppose that files for a quota domain (called a "qtree" in
NetApp, or a "volume" in other systems) are spread out over multiple
data servers. In this case, the metadata server can act as a central
place to coordinate quota by giving each data server an escrow of quota
to manage. As writes consume quota, escrows need to be adjusted. One
way to accomplish this is to monitor client write activity. This is
very similar to the block allocation problem described above, but it
isn't about the details of where free blocks are, just about the accounting
of used blocks and who gets charged for them.

For complex aggregation schemes like file mirroring or client-driven
raid, which we use in our object-based system, the metadata server
wants to monitor client write activity to handle error recovery. If
the client bombs out in the middle of a write operation that needs to
be coordinated across multiple data servers, then the metadata server
needs to pick up the pieces and put the file into a consistent state.
The longer the client holds the write Layout delegation, the more
clients and more files can be active, and the more state the metadata
server has to track. By bounding the time that the client is allowed
to use the Layout delegation for writing, the metadata server can
bound its state requirements and crash recovery responsibilities.

Even a simple write can lead to consistency issues if the client does
I/O in a block-oriented way. If it accepts a 5 byte write, then its
cache will typically want to read the whole filesystem block that contains
those bytes and apply the write to that block. Later it will write
back the whole block. Even if applications do large writes, they often
do large writes that are not multiples of the blocksize, so the same
issues crop up. I don't think we resolved this dicussion, with several
voices claiming this was a cache consistency issue and unrelated to
layouts. But, see point 8 below about working out these details.

Conclusion: these issues motivate the need for
* Separate read and write layout delegations
* Relatively prompt release of write delegations for those servers
that want to keep a closer eye on their clients. They can give out
short lease times (e.g., 10 minutes as opposed to 8 hours) for the
write layout delegations

2. What are the Ops called

The doc that I floated has READ_IND, WRITE_IND, and COMMIT_IND.
But we have also used DELEG_ASK and DELEG_RETURN. The write/commit
terminology reflects the model where the server is paying close attention
to write patterns by clients. Because that doesn't necessarily apply,
the DELEG_ASK and DELEG_RETURN op names have been suggested. I think
we may need separate READ_DELEG_ASK and WRITE_DELEG_ASK because different
arguments have been suggested, although we may be able to unify them.
I heard a request for
Two ranges in READ_DELEG_ASK, a minimum and maximum size. David - does
this apply to both read and write?
There is also the issue that WRITE_DELEG_RETURN will want to return
updated layout information so it can communicate to block servers what
blocks have been used. I suppose it is possible to model that as a kind
of attribute - we need to communicate the EOF and mtime attribute at
WRITE_DELEG_RETURN anyway. However, in other cases we've decided that
attributes don't work so well for layout info, so I'd be reluctant to
make them into an attribute in this case. So, I'd advocate that
WRITE_DELEG_RETURN is a different op than READ_DELEG_RETURN, and that
it has an explicit layout_prime return parameter to reflect what it did.

Conclusion: I'm coming down in favor of 4 ops: {READ,WRITE}_DELEG_{ASK,RETUR
N}

3. Resource discovery

There was a little discussion of the DEVICE_LIST and DEVICE_INFO ops.
The main point was that the device info needs to be typed so that
clients know if they have the correct datapath driver to use that device.

It was also brought up that servers may be able to export different
kinds of layouts (e.g., both files and objects, or multiple flavors
of layout foo) so clients may want to negotiate with the servers about
what kind of delegation to return. This implies a type in the *_DELEG_ASK
operations.

4. Replicas and complex aggregation types

The whole notion of complex aggregation types that require coordinated
updates to multiple data forks is somewhat controversial. I'm going to
start a separate thread on this topic.

5. Clustered Metadata servers

We discussed issue of clustering the metadata server to partition its load.
If possible, we'd like to keep this outside the scope of pNFS. The main
issue that crops up is that a client may contact a metadata server for
a file and need to be told to use a different metadata server. This can
be made to fit in with the current NFS redirection behavior as long as
the FSID for the file changes. This implies a particular metadata
partitioning along mount point boundaries. In practice this can work
out just fine, although some metadata servers may have more elaborate
schemes that allow very fine grain partitioning of metadata ownership.

6. Security Models

We discussed the security models. One viewpoint is that we could try
and make all the servers implement a similar model (i.e., the "best" one).
However, there was resistance to this. The alternative is to let
the servers provide the security model they want. In particular, these
servers already have security models and they are all different. So
the path of least resistance is let servers use their existing security
models even if they have limitations.

7. Scope of client damange

A question came up about "how much damage can a client do". It depends
on the data service. For files and blocks, the layout is specific to
particular data objects, so the most a client can do is mess up the
file it was given access to. This includes botching the updates involved
in a complex aggregation type like mirroring or client-driven raid. In
the block world, clients typically have more granular access, so they
could damage a volume or LUN. But, this level of vulnerability is
accepted in the SAN filesystems that exist today.

8. Interaction of Data Delegations, Layout Delegations, and Locking

We didn't close on this issue, and I think our homework should be to
explain in detail what the op sequence will be for a few basic scenarios.
These will illustrate the basic cases for how to use the layout
delegations.

-- Brent Welch
Software Architect,Panasas Inc
Delivering the premier storage system for scalable Linuxclusters

www.panasas.com
welch@panasas.com

From black_david@emc.com Tue Apr 06 12:48:54 2004
Return-Path: <Black_David@emc.com>
X-Sender: Black_David@emc.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 54554 invoked from network); 6 Apr 2004 19:16:42 -0000
Received: from unknown (66.218.66.216)
by m8.grp.scd.yahoo.com with QMQP; 6 Apr 2004 19:16:42 -0000
Received: from unknown (HELO mxic2.corp.emc.com) (128.221.12.9)
by mta1.grp.scd.yahoo.com with SMTP; 6 Apr 2004 19:16:38 -0000
Received: by mxic2.corp.emc.com with Internet Mail Service (5.5.2653.19)
id <H9PR8S8Z>; Tue, 6 Apr 2004 15:00:55 -0400
Message-ID: <B459CE1AFFC52D4688B2A5B842CA35EA053D5931@corpmx14.us.dg.com>
To: pnfs-reqs@yahoogroups.com, pnfs-ops@yahoogroups.com
Date: Tue, 6 Apr 2004 15:00:51 -0400
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2653.19)
Content-Type: text/plain
X-eGroups-Remote-IP: 128.221.12.9
From: black_david@emc.com
Subject: RE: [pnfs-reqs] Notes from FAST pNFS ops discussion
X-Yahoo-Group-Post: member; u=82420288
X-Yahoo-Profile: dlb237

Some comments:

> Conclusion: these issues motivate the need for
> * Separate read and write layout delegations
> * Relatively prompt release of write delegations for those servers
> that want to keep a closer eye on their clients. They can give out
> short lease times (e.g., 10 minutes as opposed to 8 hours) for the
> write layout delegations

Leases could use some more thought. HighRoad uses a client-wide "lease",
in the form of a periodic heartbeat. In essence, the client has to remain
in constant communication with the server - it's required to do something
once every so often, and issues a no-op if it has nothing else to do in the
necessary time period. This accomplishes the major benefit of leases
(clean recovery of resources from a dead client), without having to track
a time per lease. Notifications are always needed, as leases aren't
enough when there's active contention.

> 2. What are the Ops called
>
> The doc that I floated has READ_IND, WRITE_IND, and COMMIT_IND.
> But we have also used DELEG_ASK and DELEG_RETURN. The write/commit
> terminology reflects the model where the server is paying close attention
> to write patterns by clients. Because that doesn't necessarily apply,
> the DELEG_ASK and DELEG_RETURN op names have been suggested. I think
> we may need separate READ_DELEG_ASK and WRITE_DELEG_ASK because different
> arguments have been suggested, although we may be able to unify them.

I think a name that does not involve any part of the word "delegation"
will help avoid confusion with the existing data delegations (which behave
very differently). MAP and LAYOUT are possibilities.

> I heard a request for
> Two ranges in READ_DELEG_ASK, a minimum and maximum size. David - does
> this apply to both read and write?

Yes. The "minimum" is an instruction from the client to the server to
reject or queue the operation until it can satisfy at least the minimum.
The "maximum" is an optimization hint.

> There is also the issue that WRITE_DELEG_RETURN will want to return
> updated layout information so it can communicate to block servers what
> blocks have been used. I suppose it is possible to model that as a kind
> of attribute - we need to communicate the EOF and mtime attribute at
> WRITE_DELEG_RETURN anyway. However, in other cases we've decided that
> attributes don't work so well for layout info, so I'd be reluctant to
> make them into an attribute in this case. So, I'd advocate that
> WRITE_DELEG_RETURN is a different op than READ_DELEG_RETURN, and that
> it has an explicit layout_prime return parameter to reflect
> what it did.

It's sufficient to flag returned extent ranges as:
- Commit. Only needed for writable extents that contained no valid data.
When writable extents are obtained, the client is told which
ones have invalid data and will need to be committed (this moves
zero-fill responsibility to the client from the metadata server).
This may be block-only, as object and file systems will handle
zero-fill in the data server and hence never need to issue
invalid data extents.
- Release. Give back the read or write rights in an extent.
It's also necessary to be able to set access or modify times - passing
a couple of flags to tell the server to do this avoids any dependence
on clients having the same time as servers - and to be able to set the
EOF as part of the operation.

> Conclusion: I'm coming down in favor of 4 ops:
> {READ,WRITE}_DELEG_{ASK,RETURN}

I'd like to see "DELEG" changed to something like "MAP", "LAYOUT", or
"EXTENT". HighRoad uses 3 ops (common return op for read and write),
but 4 is probably cleaner (e.g., can't get confused and try to set
modify time when returning a read extent, but do need "no change",
"set access" and "set access and modify" options when returning a
write extent).

> 3. Resource discovery
>
> There was a little discussion of the DEVICE_LIST and DEVICE_INFO ops.
> The main point was that the device info needs to be typed so that
> clients know if they have the correct datapath driver to use
> that device.

And the identifiers within each type need to be sufficiently global/
unambiguous to avoid possible confusion.

> It was also brought up that servers may be able to export different
> kinds of layouts (e.g., both files and objects, or multiple flavors
> of layout foo) so clients may want to negotiate with the servers about
> what kind of delegation to return. This implies a type in
> the *_DELEG_ASK operations.

Ok.

Thanks,
--David
----------------------------------------------------
David L. Black, Senior Technologist
EMC Corporation, 176 South St., Hopkinton, MA 01748
+1 (508) 293-7953 FAX: +1 (508) 293-7786
black_david@emc.com Mobile: +1 (978) 394-7754
----------------------------------------------------

From black_david@emc.com Thu Apr 08 22:31:07 2004
Return-Path: <Black_David@emc.com>
X-Sender: Black_David@emc.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 81972 invoked from network); 8 Apr 2004 22:37:49 -0000
Received: from unknown (66.218.66.167)
by m18.grp.scd.yahoo.com with QMQP; 8 Apr 2004 22:37:49 -0000
Received: from unknown (HELO mxic2.corp.emc.com) (128.221.12.9)
by mta6.grp.scd.yahoo.com with SMTP; 8 Apr 2004 22:37:49 -0000
Received: by mxic2.corp.emc.com with Internet Mail Service (5.5.2653.19)
id <H9PSAS6V>; Thu, 8 Apr 2004 18:37:48 -0400
Message-ID: <B459CE1AFFC52D4688B2A5B842CA35EA7A5844@corpmx14.us.dg.com>
To: pnfs-reqs@yahoogroups.com
Date: Thu, 8 Apr 2004 18:36:27 -0400
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2653.19)
Content-Type: text/plain
X-eGroups-Remote-IP: 128.221.12.9
From: black_david@emc.com
Subject: RE: [pnfs-reqs] planning for our next face-to-face
X-Yahoo-Group-Post: member; u=82420288
X-Yahoo-Profile: dlb237

> We talked about our next face to face being in Ann Arbor during the
> week of the NFSv4 bake-a-thon. in the week of June 7-11. David
> mentioned the conflict with T11 in Chicago, but thought it might be
> workable.

At least for me, I appear to have T11 meetings that I should attend
on Monday, Tuesday, and Wednesday. I think I can skip the meetings
on Thursday, so that should make Thursday and Friday of that week
possible.

Thanks,
--David
----------------------------------------------------
David L. Black, Senior Technologist
EMC Corporation, 176 South St., Hopkinton, MA 01748
+1 (508) 293-7953 FAX: +1 (508) 293-7786
black_david@emc.com Mobile: +1 (978) 394-7754
----------------------------------------------------


From pcorbett@netapp.com Mon Apr 12 07:00:22 2004
Return-Path: <Peter.Corbett@netapp.com>
X-Sender: Peter.Corbett@netapp.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 76617 invoked from network); 12 Apr 2004 14:00:21 -0000
Received: from unknown (66.218.66.216)
by m21.grp.scd.yahoo.com with QMQP; 12 Apr 2004 14:00:21 -0000
Received: from unknown (HELO mx01.netapp.com) (198.95.226.53)
by mta1.grp.scd.yahoo.com with SMTP; 12 Apr 2004 14:00:21 -0000
Received: from frejya.corp.netapp.com (frejya [10.57.157.119])
by mx01.netapp.com (8.12.10/8.12.10/NTAP-1.4) with ESMTP id i3CDuZZh024024
for <pnfs-reqs@yahoogroups.com>; Mon, 12 Apr 2004 06:56:35 -0700 (PDT)
Received: from svlexc01.hq.netapp.com (svlexc01.corp.netapp.com [10.57.156.135])
by frejya.corp.netapp.com (8.12.9/8.12.9/NTAP-1.5) with ESMTP id i3CDuZXv015719
for <pnfs-reqs@yahoogroups.com>; Mon, 12 Apr 2004 06:56:35 -0700 (PDT)
content-class: urn:content-classes:message
MIME-Version: 1.0
Content-Type: text/plain;
charset="us-ascii"
Content-Transfer-Encoding: quoted-printable
X-MimeOLE: Produced By Microsoft Exchange V6.0.6249.0
Date: Mon, 12 Apr 2004 06:56:30 -0700
Message-ID: <C8CF60CFC4D8A74E9945E32CF096548A016F52AD@silver.nane.netapp.com>
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
Thread-Topic: [pnfs-reqs] planning for our next face-to-face
Thread-Index: AcQd9DUhjtB2awVVQz2lgK0bK5G29ACoaVMQ
To: <pnfs-reqs@yahoogroups.com>
X-eGroups-Remote-IP: 198.95.226.53
X-eGroups-From: "Corbett, Peter" <Peter.Corbett@netapp.com>
From: "Corbett, Peter" <pcorbett@netapp.com>
Subject: RE: [pnfs-reqs] planning for our next face-to-face
X-Yahoo-Group-Post: member; u=44152959
X-Yahoo-Profile: pfcorbett2004

That doesn't work for me, unfortunately. I have prior plans for the
11th that I can't change, and I have to be home on the 10th.

-----Original Message-----
From: black_david@emc.com [mailto:black_david@emc.com]
Sent: Thursday, April 08, 2004 6:36 PM
To: pnfs-reqs@yahoogroups.com
Subject: RE: [pnfs-reqs] planning for our next face-to-face

> We talked about our next face to face being in Ann Arbor during the
> week of the NFSv4 bake-a-thon. in the week of June 7-11. David
> mentioned the conflict with T11 in Chicago, but thought it might be
> workable.

At least for me, I appear to have T11 meetings that I should attend
on Monday, Tuesday, and Wednesday. I think I can skip the meetings
on Thursday, so that should make Thursday and Friday of that week
possible.

Thanks,
--David
----------------------------------------------------
David L. Black, Senior Technologist
EMC Corporation, 176 South St., Hopkinton, MA 01748
+1 (508) 293-7953 FAX: +1 (508) 293-7786
black_david@emc.com Mobile: +1 (978) 394-7754
----------------------------------------------------




Yahoo! Groups Links

From garth@panasas.com Mon Apr 12 09:34:59 2004
Return-Path: <garth@panasas.com>
X-Sender: garth@panasas.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 18553 invoked from network); 12 Apr 2004 16:34:58 -0000
Received: from unknown (66.218.66.166)
by m24.grp.scd.yahoo.com with QMQP; 12 Apr 2004 16:34:58 -0000
Received: from unknown (HELO PIKES.panasas.com) (65.194.124.178)
by mta5.grp.scd.yahoo.com with SMTP; 12 Apr 2004 16:34:58 -0000
Received: from [127.0.0.1] ([172.17.19.3]) by PIKES.panasas.com with SMTP (Microsoft Exchange Internet Mail Service Version 5.5.2653.13)
id 2B56K6VQ; Mon, 12 Apr 2004 12:34:57 -0400
Mime-Version: 1.0 (Apple Message framework v612)
In-Reply-To: <C8CF60CFC4D8A74E9945E32CF096548A016F52AD@silver.nane.netapp.com>
References: <C8CF60CFC4D8A74E9945E32CF096548A016F52AD@silver.nane.netapp.com>
Content-Type: text/plain; charset=US-ASCII; format=flowed
Message-Id: <4C08A133-8C9F-11D8-8ED7-000A95A94F04@panasas.com>
Content-Transfer-Encoding: 7bit
Date: Mon, 12 Apr 2004 12:34:41 -0400
To: pnfs-reqs@yahoogroups.com
X-Mailer: Apple Mail (2.612)
X-eGroups-Remote-IP: 65.194.124.178
From: Garth Gibson <garth@panasas.com>
Subject: Re: [pnfs-reqs] planning for our next face-to-face
X-Yahoo-Group-Post: member; u=169457820
X-Yahoo-Profile: garth_a_gibson

I am also committed to be in CA on June 10, so Thurs and Fri don't work
for me.

David, any chance of a 1/2 day in Ann Arbor on Mon or Tues?

garth

On Apr 12, 2004, at 9:56 AM, Corbett, Peter wrote:
> That doesn't work for me, unfortunately. I have prior plans for the
> 11th that I can't change, and I have to be home on the 10th.
>
> -----Original Message-----
> From: black_david@emc.com [mailto:black_david@emc.com]
> Sent: Thursday, April 08, 2004 6:36 PM
> To: pnfs-reqs@yahoogroups.com
> Subject: RE: [pnfs-reqs] planning for our next face-to-face
>
>> We talked about our next face to face being in Ann Arbor during the
>> week of the NFSv4 bake-a-thon. in the week of June 7-11. David
>> mentioned the conflict with T11 in Chicago, but thought it might be
>> workable.
>
> At least for me, I appear to have T11 meetings that I should attend
> on Monday, Tuesday, and Wednesday. I think I can skip the meetings
> on Thursday, so that should make Thursday and Friday of that week
> possible.
>
> Thanks,
> --David
> ----------------------------------------------------
> David L. Black, Senior Technologist
> EMC Corporation, 176 South St., Hopkinton, MA 01748
> +1 (508) 293-7953 FAX: +1 (508) 293-7786
> black_david@emc.com Mobile: +1 (978) 394-7754
> ----------------------------------------------------

From black_david@emc.com Mon Apr 12 10:14:32 2004
Return-Path: <Black_David@emc.com>
X-Sender: Black_David@emc.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 31021 invoked from network); 12 Apr 2004 17:14:31 -0000
Received: from unknown (66.218.66.166)
by m25.grp.scd.yahoo.com with QMQP; 12 Apr 2004 17:14:31 -0000
Received: from unknown (HELO MAHO3MSX2.corp.emc.com) (128.221.11.32)
by mta5.grp.scd.yahoo.com with SMTP; 12 Apr 2004 17:14:31 -0000
Received: by maho3msx2.corp.emc.com with Internet Mail Service (5.5.2653.19)
id <H9P64CF2>; Mon, 12 Apr 2004 13:14:30 -0400
Message-ID: <B459CE1AFFC52D4688B2A5B842CA35EA7A584F@corpmx14.us.dg.com>
To: pnfs-reqs@yahoogroups.com
Date: Mon, 12 Apr 2004 13:14:28 -0400
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2653.19)
Content-Type: text/plain
X-eGroups-Remote-IP: 128.221.11.32
From: black_david@emc.com
Subject: RE: [pnfs-reqs] planning for our next face-to-face
X-Yahoo-Group-Post: member; u=82420288
X-Yahoo-Profile: dlb237

> David, any chance of a 1/2 day in Ann Arbor on Mon or Tues?

No, it takes about 4 hours by car, plane, or train between
Ann Arbor and Chicago, so half a day of meetings in either
place takes out the entire day. I could call into a meeting
Monday AM or Tuesday AM, I think ...

On the other hand according to what's currently on T11's
web site, Monday is workable for me (the web site doesn't
match the schedule I thought I saw last week - I don't know
why). I suspect we need to hold off about a week
on scheduling this, as the meeting info determined last
week may not have made it to T11's web site yet (they're
having web problems).

Sorry,
--David

> -----Original Message-----
> From: Garth Gibson [mailto:garth@panasas.com]
> Sent: Monday, April 12, 2004 12:35 PM
> To: pnfs-reqs@yahoogroups.com
> Subject: Re: [pnfs-reqs] planning for our next face-to-face
>
>
> I am also committed to be in CA on June 10, so Thurs and Fri
> don't work
> for me.
>
> David, any chance of a 1/2 day in Ann Arbor on Mon or Tues?
>
> garth
>
> On Apr 12, 2004, at 9:56 AM, Corbett, Peter wrote:
> > That doesn't work for me, unfortunately. I have prior plans for the
> > 11th that I can't change, and I have to be home on the 10th.
> >
> > -----Original Message-----
> > From: black_david@emc.com [mailto:black_david@emc.com]
> > Sent: Thursday, April 08, 2004 6:36 PM
> > To: pnfs-reqs@yahoogroups.com
> > Subject: RE: [pnfs-reqs] planning for our next face-to-face
> >
> >> We talked about our next face to face being in Ann Arbor during the
> >> week of the NFSv4 bake-a-thon. in the week of June 7-11. David
> >> mentioned the conflict with T11 in Chicago, but thought it might be
> >> workable.
> >
> > At least for me, I appear to have T11 meetings that I should attend
> > on Monday, Tuesday, and Wednesday. I think I can skip the meetings
> > on Thursday, so that should make Thursday and Friday of that week
> > possible.
> >
> > Thanks,
> > --David
> > ----------------------------------------------------
> > David L. Black, Senior Technologist
> > EMC Corporation, 176 South St., Hopkinton, MA 01748
> > +1 (508) 293-7953 FAX: +1 (508) 293-7786
> > black_david@emc.com Mobile: +1 (978) 394-7754
> > ----------------------------------------------------
>
>
>
> ------------------------ Yahoo! Groups Sponsor
> ---------------------~-->
> Buy Ink Cartridges or Refill Kits for your HP, Epson, Canon or Lexmark
> Printer at MyInks.com. Free s/h on orders $50 or more to the
> US & Canada.
> http://www.c1tracking.com/l.asp?cid=5511
> http://us.click.yahoo.com/mOAaAA/3exGAA/qnsNAA/W6uqlB/TM
> --------------------------------------------------------------
> -------~->
>
>
> Yahoo! Groups Links
>
>
>
>
> 

From bhalevy@panasas.com Thu Apr 15 15:33:34 2004
Return-Path: <bhalevy@panasas.com>
X-Sender: bhalevy@panasas.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 80378 invoked from network); 15 Apr 2004 22:33:33 -0000
Received: from unknown (66.218.66.166)
by m22.grp.scd.yahoo.com with QMQP; 15 Apr 2004 22:33:33 -0000
Received: from unknown (HELO PIKES.panasas.com) (65.194.124.178)
by mta5.grp.scd.yahoo.com with SMTP; 15 Apr 2004 22:33:33 -0000
Received: by PIKES.panasas.com with Internet Mail Service (5.5.2653.19)
id <2B56LNNM>; Thu, 15 Apr 2004 18:33:32 -0400
Message-ID: <30489F1321F5C343ACF6872B2CF7942A05D389CF@PIKES.panasas.com>
To: "'pnfs-ops@yahoogroups.com'" <pnfs-ops@yahoogroups.com>,
pnfs-reqs@yahoogroups.com
Date: Thu, 15 Apr 2004 18:33:27 -0400
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2653.19)
Content-Type: text/plain;
charset="iso-8859-1"
X-eGroups-Remote-IP: 65.194.124.178
From: "Halevy, Benny" <bhalevy@panasas.com>
Subject: RE: [pnfs-ops] RE: [pnfs-reqs] Notes from FAST pNFS ops discussio
n
X-Yahoo-Group-Post: member; u=169276676
X-Yahoo-Profile: benny_halevy

my comments below...

>-----Original Message-----
>From: black_david@emc.com [mailto:black_david@emc.com]
>Sent: Tuesday, April 06, 2004 3:01 PM
>To: pnfs-reqs@yahoogroups.com; pnfs-ops@yahoogroups.com
>Subject: [pnfs-ops] RE: [pnfs-reqs] Notes from FAST pNFS ops discussion
>
>
>Some comments:
>
>> Conclusion: these issues motivate the need for
>> * Separate read and write layout delegations
>> * Relatively prompt release of write delegations for those servers
>> that want to keep a closer eye on their clients. They can give out
>> short lease times (e.g., 10 minutes as opposed to 8 hours) for the
>> write layout delegations
>
>Leases could use some more thought. HighRoad uses a
>client-wide "lease",
>in the form of a periodic heartbeat. In essence, the client
>has to remain
>in constant communication with the server - it's required to
>do something
>once every so often, and issues a no-op if it has nothing else
>to do in the
>necessary time period. This accomplishes the major benefit of leases
>(clean recovery of resources from a dead client), without
>having to track
>a time per lease. Notifications are always needed, as leases aren't
>enough when there's active contention.
>
>> 2. What are the Ops called
>>
>> The doc that I floated has READ_IND, WRITE_IND, and COMMIT_IND.
>> But we have also used DELEG_ASK and DELEG_RETURN. The write/commit
>> terminology reflects the model where the server is paying
>close attention
>> to write patterns by clients. Because that doesn't
>necessarily apply,
>> the DELEG_ASK and DELEG_RETURN op names have been suggested. I think
>> we may need separate READ_DELEG_ASK and WRITE_DELEG_ASK
>because different
>> arguments have been suggested, although we may be able to unify them.
>
>I think a name that does not involve any part of the word "delegation"
>will help avoid confusion with the existing data delegations
>(which behave
>very differently). MAP and LAYOUT are possibilities.

I agree. I prefer LAYOUT since it's slightly more general.

>
>> I heard a request for
>> Two ranges in READ_DELEG_ASK, a minimum and maximum size.
>David - does
>> this apply to both read and write?
>
>Yes. The "minimum" is an instruction from the client to the server to
>reject or queue the operation until it can satisfy at least
>the minimum.
>The "maximum" is an optimization hint.
>
>> There is also the issue that WRITE_DELEG_RETURN will want to return
>> updated layout information so it can communicate to block
>servers what
>> blocks have been used. I suppose it is possible to model
>that as a kind
>> of attribute - we need to communicate the EOF and mtime attribute at
>> WRITE_DELEG_RETURN anyway. However, in other cases we've
>decided that
>> attributes don't work so well for layout info, so I'd be reluctant to
>> make them into an attribute in this case. So, I'd advocate that
>> WRITE_DELEG_RETURN is a different op than READ_DELEG_RETURN, and that
>> it has an explicit layout_prime return parameter to reflect
>> what it did.
>
>It's sufficient to flag returned extent ranges as:
>- Commit. Only needed for writable extents that contained no
>valid data.
> When writable extents are obtained, the client is told which
> ones have invalid data and will need to be committed (this moves
> zero-fill responsibility to the client from the
>metadata server).
> This may be block-only, as object and file systems will handle
> zero-fill in the data server and hence never need to issue
> invalid data extents.
>- Release. Give back the read or write rights in an extent.
>It's also necessary to be able to set access or modify times - passing
>a couple of flags to tell the server to do this avoids any dependence
>on clients having the same time as servers - and to be able to set the
>EOF as part of the operation.
>
>> Conclusion: I'm coming down in favor of 4 ops:
>> {READ,WRITE}_DELEG_{ASK,RETURN}
>
>I'd like to see "DELEG" changed to something like "MAP", "LAYOUT", or
>"EXTENT". HighRoad uses 3 ops (common return op for read and write),
>but 4 is probably cleaner (e.g., can't get confused and try to set
>modify time when returning a read extent, but do need "no change",
>"set access" and "set access and modify" options when returning a
>write extent).

Would you always want to return the layout when you commit or
release extents or will you want to keep it in some cases?
If you want to keep it then calling the operation *_RETURN is
confusing...

how about:

LAYOUT_GET:
Takes type (READ/WRITE) and type specific parameters in a discriminated
union.

LAYOUT_FLUSH: [analogous to EMC's FMP_Flush]
Takes type specific parameters such as new EOF, extents to commit/release,
etc. The client may continue to hold the layout after LAYOUT_FLUSH.

LAYOUT_RETURN:
Return the layout. The layout and everything associated with it are
invalid to the client at this point. This will be called when the layout
is being called back or voluntarily by the client.

CB_LAYOUT_RECALL:
Similar to CB_RECALL for data delegations. The client is
instructed to quiesce its activity and to return the layout via
(LAYOUT_FLUSH +) LAYOUT_RETURN.

>
>> 3. Resource discovery
>>
>> There was a little discussion of the DEVICE_LIST and DEVICE_INFO ops.
>> The main point was that the device info needs to be typed so that
>> clients know if they have the correct datapath driver to use
>> that device.
>
>And the identifiers within each type need to be sufficiently global/
>unambiguous to avoid possible confusion.
>
>> It was also brought up that servers may be able to export different
>> kinds of layouts (e.g., both files and objects, or multiple flavors
>> of layout foo) so clients may want to negotiate with the
>servers about
>> what kind of delegation to return. This implies a type in
>> the *_DELEG_ASK operations.
>
>Ok.
>
>Thanks,
>--David
>----------------------------------------------------
>David L. Black, Senior Technologist
>EMC Corporation, 176 South St., Hopkinton, MA 01748
>+1 (508) 293-7953 FAX: +1 (508) 293-7786
>black_david@emc.com Mobile: +1 (978) 394-7754
>----------------------------------------------------
>
>
>
>Yahoo! Groups Links
>
>
>
>
>

From black_david@emc.com Thu Apr 15 15:54:53 2004
Return-Path: <Black_David@emc.com>
X-Sender: Black_David@emc.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 27622 invoked from network); 15 Apr 2004 22:54:52 -0000
Received: from unknown (66.218.66.217)
by m24.grp.scd.yahoo.com with QMQP; 15 Apr 2004 22:54:52 -0000
Received: from unknown (HELO mxic2.corp.emc.com) (128.221.12.9)
by mta2.grp.scd.yahoo.com with SMTP; 15 Apr 2004 22:54:52 -0000
Received: by mxic2.corp.emc.com with Internet Mail Service (5.5.2653.19)
id <2006516F>; Thu, 15 Apr 2004 18:54:49 -0400
Message-ID: <B459CE1AFFC52D4688B2A5B842CA35EA7A58AE@corpmx14.us.dg.com>
To: pnfs-reqs@yahoogroups.com, pnfs-ops@yahoogroups.com
Date: Thu, 15 Apr 2004 18:54:48 -0400
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2653.19)
Content-Type: text/plain
X-eGroups-Remote-IP: 128.221.12.9
From: black_david@emc.com
Subject: RE: [pnfs-ops] RE: [pnfs-reqs] Notes from FAST pNFS ops discussio
n
X-Yahoo-Group-Post: member; u=82420288
X-Yahoo-Profile: dlb237

> Would you always want to return the layout when you commit or
> release extents or will you want to keep it in some cases?
> If you want to keep it then calling the operation *_RETURN is
> confusing...

Yes, there are times when one wants to keep the layout. A client
calling fsync() is a particularly obvious example ... I think
the idea of LAYOUT_{GET,FLUSH,RETURN} ops along with the callback
looks ok, although FLUSH and RETURN will have many parameters in
common (FMP_Flush has a flag saying whether to Release/RETURN or
not).

Thanks,
--David

> -----Original Message-----
> From: Halevy, Benny [mailto:bhalevy@panasas.com]
> Sent: Thursday, April 15, 2004 6:33 PM
> To: 'pnfs-ops@yahoogroups.com'; pnfs-reqs@yahoogroups.com
> Subject: RE: [pnfs-ops] RE: [pnfs-reqs] Notes from FAST pNFS
> ops discussio n
>
>
> my comments below...
>
> >-----Original Message-----
> >From: black_david@emc.com [mailto:black_david@emc.com]
> >Sent: Tuesday, April 06, 2004 3:01 PM
> >To: pnfs-reqs@yahoogroups.com; pnfs-ops@yahoogroups.com
> >Subject: [pnfs-ops] RE: [pnfs-reqs] Notes from FAST pNFS ops
> discussion
> >
> >
> >Some comments:
> >
> >> Conclusion: these issues motivate the need for
> >> * Separate read and write layout delegations
> >> * Relatively prompt release of write delegations for those servers
> >> that want to keep a closer eye on their clients. They
> can give out
> >> short lease times (e.g., 10 minutes as opposed to 8
> hours) for the
> >> write layout delegations
> >
> >Leases could use some more thought. HighRoad uses a
> >client-wide "lease",
> >in the form of a periodic heartbeat. In essence, the client
> >has to remain
> >in constant communication with the server - it's required to
> >do something
> >once every so often, and issues a no-op if it has nothing else
> >to do in the
> >necessary time period. This accomplishes the major benefit of leases
> >(clean recovery of resources from a dead client), without
> >having to track
> >a time per lease. Notifications are always needed, as leases aren't
> >enough when there's active contention.
> >
> >> 2. What are the Ops called
> >>
> >> The doc that I floated has READ_IND, WRITE_IND, and COMMIT_IND.
> >> But we have also used DELEG_ASK and DELEG_RETURN. The write/commit
> >> terminology reflects the model where the server is paying
> >close attention
> >> to write patterns by clients. Because that doesn't
> >necessarily apply,
> >> the DELEG_ASK and DELEG_RETURN op names have been
> suggested. I think
> >> we may need separate READ_DELEG_ASK and WRITE_DELEG_ASK
> >because different
> >> arguments have been suggested, although we may be able to
> unify them.
> >
> >I think a name that does not involve any part of the word
> "delegation"
> >will help avoid confusion with the existing data delegations
> >(which behave
> >very differently). MAP and LAYOUT are possibilities.
>
> I agree. I prefer LAYOUT since it's slightly more general.
>
> >
> >> I heard a request for
> >> Two ranges in READ_DELEG_ASK, a minimum and maximum size.
> >David - does
> >> this apply to both read and write?
> >
> >Yes. The "minimum" is an instruction from the client to the
> server to
> >reject or queue the operation until it can satisfy at least
> >the minimum.
> >The "maximum" is an optimization hint.
> >
> >> There is also the issue that WRITE_DELEG_RETURN will want
> to return
> >> updated layout information so it can communicate to block
> >servers what
> >> blocks have been used. I suppose it is possible to model
> >that as a kind
> >> of attribute - we need to communicate the EOF and mtime
> attribute at
> >> WRITE_DELEG_RETURN anyway. However, in other cases we've
> >decided that
> >> attributes don't work so well for layout info, so I'd be
> reluctant to
> >> make them into an attribute in this case. So, I'd advocate that
> >> WRITE_DELEG_RETURN is a different op than
> READ_DELEG_RETURN, and that
> >> it has an explicit layout_prime return parameter to reflect
> >> what it did.
> >
> >It's sufficient to flag returned extent ranges as:
> >- Commit. Only needed for writable extents that contained no
> >valid data.
> > When writable extents are obtained, the client is told which
> > ones have invalid data and will need to be committed (this moves
> > zero-fill responsibility to the client from the
> >metadata server).
> > This may be block-only, as object and file systems will handle
> > zero-fill in the data server and hence never need to issue
> > invalid data extents.
> >- Release. Give back the read or write rights in an extent.
> >It's also necessary to be able to set access or modify times
> - passing
> >a couple of flags to tell the server to do this avoids any dependence
> >on clients having the same time as servers - and to be able
> to set the
> >EOF as part of the operation.
> >
> >> Conclusion: I'm coming down in favor of 4 ops:
> >> {READ,WRITE}_DELEG_{ASK,RETURN}
> >
> >I'd like to see "DELEG" changed to something like "MAP", "LAYOUT", or
> >"EXTENT". HighRoad uses 3 ops (common return op for read and write),
> >but 4 is probably cleaner (e.g., can't get confused and try to set
> >modify time when returning a read extent, but do need "no change",
> >"set access" and "set access and modify" options when returning a
> >write extent).
>
> Would you always want to return the layout when you commit or
> release extents or will you want to keep it in some cases?
> If you want to keep it then calling the operation *_RETURN is
> confusing...
>
> how about:
>
> LAYOUT_GET:
> Takes type (READ/WRITE) and type specific parameters in a
> discriminated
> union.
>
> LAYOUT_FLUSH: [analogous to EMC's FMP_Flush]
> Takes type specific parameters such as new EOF, extents to
> commit/release,
> etc. The client may continue to hold the layout after LAYOUT_FLUSH.
>
> LAYOUT_RETURN:
> Return the layout. The layout and everything associated with it are
> invalid to the client at this point. This will be called
> when the layout
> is being called back or voluntarily by the client.
>
> CB_LAYOUT_RECALL:
> Similar to CB_RECALL for data delegations. The client is
> instructed to quiesce its activity and to return the layout via
> (LAYOUT_FLUSH +) LAYOUT_RETURN.
>
> >
> >> 3. Resource discovery
> >>
> >> There was a little discussion of the DEVICE_LIST and
> DEVICE_INFO ops.
> >> The main point was that the device info needs to be typed so that
> >> clients know if they have the correct datapath driver to use
> >> that device.
> >
> >And the identifiers within each type need to be sufficiently global/
> >unambiguous to avoid possible confusion.
> >
> >> It was also brought up that servers may be able to export different
> >> kinds of layouts (e.g., both files and objects, or multiple flavors
> >> of layout foo) so clients may want to negotiate with the
> >servers about
> >> what kind of delegation to return. This implies a type in
> >> the *_DELEG_ASK operations.
> >
> >Ok.
> >
> >Thanks,
> >--David
> >----------------------------------------------------
> >David L. Black, Senior Technologist
> >EMC Corporation, 176 South St., Hopkinton, MA 01748
> >+1 (508) 293-7953 FAX: +1 (508) 293-7786
> >black_david@emc.com Mobile: +1 (978) 394-7754
> >----------------------------------------------------
> >
> >
> >
> >Yahoo! Groups Links
> >
> >
> >
> >
> >
>
>
> ------------------------ Yahoo! Groups Sponsor
> ---------------------~-->
> Buy Ink Cartridges or Refill Kits for your HP, Epson, Canon or Lexmark
> Printer at MyInks.com. Free s/h on orders $50 or more to the
> US & Canada.
> http://www.c1tracking.com/l.asp?cid=5511
> http://us.click.yahoo.com/mOAaAA/3exGAA/qnsNAA/W6uqlB/TM
> --------------------------------------------------------------
> -------~->
>
>
> Yahoo! Groups Links
>
>
>
>
> 

From garth@panasas.com Thu Apr 15 21:37:53 2004
Return-Path: <garth@panasas.com>
X-Sender: garth@panasas.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 68904 invoked from network); 16 Apr 2004 04:37:52 -0000
Received: from unknown (66.218.66.167)
by m21.grp.scd.yahoo.com with QMQP; 16 Apr 2004 04:37:52 -0000
Received: from unknown (HELO PIKES.panasas.com) (65.194.124.178)
by mta6.grp.scd.yahoo.com with SMTP; 16 Apr 2004 04:37:52 -0000
Received: from [127.0.0.1] ([172.17.19.3]) by PIKES.panasas.com with SMTP (Microsoft Exchange Internet Mail Service Version 5.5.2653.13)
id 2B56LN0K; Fri, 16 Apr 2004 00:37:51 -0400
Mime-Version: 1.0 (Apple Message framework v612)
Content-Transfer-Encoding: 7bit
Message-Id: <D0FD8818-8F5F-11D8-8644-000A95A94F04@panasas.com>
Content-Type: text/plain; charset=US-ASCII; format=flowed
To: pnfs-reqs@yahoogroups.com
Date: Fri, 16 Apr 2004 00:37:49 -0400
X-Mailer: Apple Mail (2.612)
X-eGroups-Remote-IP: 65.194.124.178
From: Garth Gibson <garth@panasas.com>
Subject: Fwd: logistics status
X-Yahoo-Group-Post: member; u=169457820
X-Yahoo-Profile: garth_a_gibson

Trying again -- last night's email did not make it into Yahoo somehow.

Begin forwarded message:

> From: Garth Gibson <garth@panasas.com>
> Date: April 15, 2004 1:33:46 AM EDT
> To: pnfs-reqs@yahoogroups.com
> Cc: Garth Gibson <garth@panasas.com>
> Subject: logistics status
>
> Folks,
>
> 1) there is no pNFS logistics concall tomorrow at 11am EST (I am on a
> MSST04 panel at that time)
>
> 2) currently, we have 5 mailing lists and are not using most of them
> -- lets retreat to using just one mailing list for now -- just
> pnfs-reqs@yahoogroups.com -- when discussions of different backend
> metadata formats get going we can reconsider this
>
> 3) weekly concalls, which have always focused on getting
> administrative things done, should be dedicated to this -- we have
> been and should do all technical discussions on the mailing list (or
> at face-to-face meetings) -- anyone who wants to be part of the weekly
> administrative concall, when it happens, should send me email
> identifying yourself and requesting dial-in details
>
> garth
>
> ------------------
>
> We have lots of new members, so I thought I'd reprise a little of how
> we interact (so far anyway).
>
> We have had two face-to-face meetings to date: Dec 4 03 and Mar 31 04
> (FAST).
>
> We are planning our next face-to-face for U. Michigan (CITI) during
> the week of June 7-11, which is when and where the NFSv4 Bake-a-thon
> is being held. Tentative target, not yet confirmed or arranged, is
> June 7. This will give us a very good chance to interact with the
> broader NFSv4 community. There may be an interim IETF NFSv4 working
> group meeting at the same place in the same week (hopefully on the
> following day).
>
> The following face-to-face is tentatively planned for the IETF meeting
> in San Diego the week of Aug 2-6.
>
> In general we are always working on multiple sub-tasks.
>
> Administratively, we have collaborated on a problem statement IETF
> draft that was submitted to the most recent Seoul IETF meeting and a
> slide deck that was given at Seoul and again at FAST. This was done
> to begin the negotiation with the IETF hierarchy. Next steps in these
> administrative processes would be to generate a draft of a
> requirements document and share it with the NFSv4 community. And to
> work with the NFSv4 community to charter pNFS into an IETF process,
> specifically NFSv4. In order for the NFSv4 community to have a good
> chance to see the draft of the requirements document, we should it put
> out in May. Because this seems to be a pain for most people to
> contribute to, I've signed up to push/beg/pull it into some useful
> shape.
>
> Technically, we don't much like the administrative work. We like to
> discuss semantics of possible NFSv4 extension commands. Brent Welch
> has sent out summary docs with some of our thinking twice so far, and
> will probably do so again before too long. Anybody wanting to beat
> him to it is welcome! This is the meat of our work, and every bit of
> progress on it is great. Especially where it helps us pin down
> requirements. I'm guessing that the timeline on posting for wider
> comments on a draft of this will be late summer, possibly with a
> revision of the requirements document. It would be great if it
> happens faster.
>
> garth

From bhalevy@panasas.com Fri Apr 16 04:28:00 2004
Return-Path: <bhalevy@panasas.com>
X-Sender: bhalevy@panasas.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 97092 invoked from network); 16 Apr 2004 11:27:59 -0000
Received: from unknown (66.218.66.167)
by m25.grp.scd.yahoo.com with QMQP; 16 Apr 2004 11:27:59 -0000
Received: from unknown (HELO PIKES.panasas.com) (65.194.124.178)
by mta6.grp.scd.yahoo.com with SMTP; 16 Apr 2004 11:27:59 -0000
Received: by PIKES.panasas.com with Internet Mail Service (5.5.2653.19)
id <2B56L3NC>; Fri, 16 Apr 2004 07:27:31 -0400
Message-ID: <30489F1321F5C343ACF6872B2CF7942A05D389D0@PIKES.panasas.com>
To: "'pnfs-reqs@yahoogroups.com'" <pnfs-reqs@yahoogroups.com>,
pnfs-ops@yahoogroups.com
Date: Fri, 16 Apr 2004 07:27:29 -0400
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2653.19)
Content-Type: text/plain;
charset="iso-8859-1"
X-eGroups-Remote-IP: 65.194.124.178
From: "Halevy, Benny" <bhalevy@panasas.com>
Subject: RE: [pnfs-ops] RE: [pnfs-reqs] Notes from FAST pNFS ops discussio
n
X-Yahoo-Group-Post: member; u=169276676
X-Yahoo-Profile: benny_halevy

> although FLUSH and RETURN will have many parameters in
> common (FMP_Flush has a flag saying whether to Release/RETURN or
> not).

This is what I thought the first time I looked at it too.
In second thought, in NFSv4 you'd just COMPOUND them - passing the
common parameters via FLUSH, followed by a thin RETURN op.

Benny

> -----Original Message-----
> From: black_david@emc.com [mailto:black_david@emc.com]
> Sent: Thursday, April 15, 2004 5:55 PM
> To: pnfs-reqs@yahoogroups.com; pnfs-ops@yahoogroups.com
> Subject: RE: [pnfs-ops] RE: [pnfs-reqs] Notes from FAST pNFS ops
> discussio n
>
>
> > Would you always want to return the layout when you commit or
> > release extents or will you want to keep it in some cases?
> > If you want to keep it then calling the operation *_RETURN is
> > confusing...
>
> Yes, there are times when one wants to keep the layout. A client
> calling fsync() is a particularly obvious example ... I think
> the idea of LAYOUT_{GET,FLUSH,RETURN} ops along with the callback
> looks ok, although FLUSH and RETURN will have many parameters in
> common (FMP_Flush has a flag saying whether to Release/RETURN or
> not).
>
> Thanks,
> --David
>
> > -----Original Message-----
> > From: Halevy, Benny [mailto:bhalevy@panasas.com]
> > Sent: Thursday, April 15, 2004 6:33 PM
> > To: 'pnfs-ops@yahoogroups.com'; pnfs-reqs@yahoogroups.com
> > Subject: RE: [pnfs-ops] RE: [pnfs-reqs] Notes from FAST pNFS
> > ops discussio n
> >
> >
> > my comments below...
> >
> > >-----Original Message-----
> > >From: black_david@emc.com [mailto:black_david@emc.com]
> > >Sent: Tuesday, April 06, 2004 3:01 PM
> > >To: pnfs-reqs@yahoogroups.com; pnfs-ops@yahoogroups.com
> > >Subject: [pnfs-ops] RE: [pnfs-reqs] Notes from FAST pNFS ops
> > discussion
> > >
> > >
> > >Some comments:
> > >
> > >> Conclusion: these issues motivate the need for
> > >> * Separate read and write layout delegations
> > >> * Relatively prompt release of write delegations for
> those servers
> > >> that want to keep a closer eye on their clients. They
> > can give out
> > >> short lease times (e.g., 10 minutes as opposed to 8
> > hours) for the
> > >> write layout delegations
> > >
> > >Leases could use some more thought. HighRoad uses a
> > >client-wide "lease",
> > >in the form of a periodic heartbeat. In essence, the client
> > >has to remain
> > >in constant communication with the server - it's required to
> > >do something
> > >once every so often, and issues a no-op if it has nothing else
> > >to do in the
> > >necessary time period. This accomplishes the major
> benefit of leases
> > >(clean recovery of resources from a dead client), without
> > >having to track
> > >a time per lease. Notifications are always needed, as
> leases aren't
> > >enough when there's active contention.
> > >
> > >> 2. What are the Ops called
> > >>
> > >> The doc that I floated has READ_IND, WRITE_IND, and COMMIT_IND.
> > >> But we have also used DELEG_ASK and DELEG_RETURN. The
> write/commit
> > >> terminology reflects the model where the server is paying
> > >close attention
> > >> to write patterns by clients. Because that doesn't
> > >necessarily apply,
> > >> the DELEG_ASK and DELEG_RETURN op names have been
> > suggested. I think
> > >> we may need separate READ_DELEG_ASK and WRITE_DELEG_ASK
> > >because different
> > >> arguments have been suggested, although we may be able to
> > unify them.
> > >
> > >I think a name that does not involve any part of the word
> > "delegation"
> > >will help avoid confusion with the existing data delegations
> > >(which behave
> > >very differently). MAP and LAYOUT are possibilities.
> >
> > I agree. I prefer LAYOUT since it's slightly more general.
> >
> > >
> > >> I heard a request for
> > >> Two ranges in READ_DELEG_ASK, a minimum and maximum size.
> > >David - does
> > >> this apply to both read and write?
> > >
> > >Yes. The "minimum" is an instruction from the client to the
> > server to
> > >reject or queue the operation until it can satisfy at least
> > >the minimum.
> > >The "maximum" is an optimization hint.
> > >
> > >> There is also the issue that WRITE_DELEG_RETURN will want
> > to return
> > >> updated layout information so it can communicate to block
> > >servers what
> > >> blocks have been used. I suppose it is possible to model
> > >that as a kind
> > >> of attribute - we need to communicate the EOF and mtime
> > attribute at
> > >> WRITE_DELEG_RETURN anyway. However, in other cases we've
> > >decided that
> > >> attributes don't work so well for layout info, so I'd be
> > reluctant to
> > >> make them into an attribute in this case. So, I'd advocate that
> > >> WRITE_DELEG_RETURN is a different op than
> > READ_DELEG_RETURN, and that
> > >> it has an explicit layout_prime return parameter to reflect
> > >> what it did.
> > >
> > >It's sufficient to flag returned extent ranges as:
> > >- Commit. Only needed for writable extents that contained no
> > >valid data.
> > > When writable extents are obtained, the client is told which
> > > ones have invalid data and will need to be committed (this moves
> > > zero-fill responsibility to the client from the
> > >metadata server).
> > > This may be block-only, as object and file systems will handle
> > > zero-fill in the data server and hence never need to issue
> > > invalid data extents.
> > >- Release. Give back the read or write rights in an extent.
> > >It's also necessary to be able to set access or modify times
> > - passing
> > >a couple of flags to tell the server to do this avoids any
> dependence
> > >on clients having the same time as servers - and to be able
> > to set the
> > >EOF as part of the operation.
> > >
> > >> Conclusion: I'm coming down in favor of 4 ops:
> > >> {READ,WRITE}_DELEG_{ASK,RETURN}
> > >
> > >I'd like to see "DELEG" changed to something like "MAP",
> "LAYOUT", or
> > >"EXTENT". HighRoad uses 3 ops (common return op for read
> and write),
> > >but 4 is probably cleaner (e.g., can't get confused and try to set
> > >modify time when returning a read extent, but do need "no change",
> > >"set access" and "set access and modify" options when returning a
> > >write extent).
> >
> > Would you always want to return the layout when you commit or
> > release extents or will you want to keep it in some cases?
> > If you want to keep it then calling the operation *_RETURN is
> > confusing...
> >
> > how about:
> >
> > LAYOUT_GET:
> > Takes type (READ/WRITE) and type specific parameters in a
> > discriminated
> > union.
> >
> > LAYOUT_FLUSH: [analogous to EMC's FMP_Flush]
> > Takes type specific parameters such as new EOF, extents to
> > commit/release,
> > etc. The client may continue to hold the layout after
> LAYOUT_FLUSH.
> >
> > LAYOUT_RETURN:
> > Return the layout. The layout and everything associated
> with it are
> > invalid to the client at this point. This will be called
> > when the layout
> > is being called back or voluntarily by the client.
> >
> > CB_LAYOUT_RECALL:
> > Similar to CB_RECALL for data delegations. The client is
> > instructed to quiesce its activity and to return the layout via
> > (LAYOUT_FLUSH +) LAYOUT_RETURN.
> >
> > >
> > >> 3. Resource discovery
> > >>
> > >> There was a little discussion of the DEVICE_LIST and
> > DEVICE_INFO ops.
> > >> The main point was that the device info needs to be typed so that
> > >> clients know if they have the correct datapath driver to use
> > >> that device.
> > >
> > >And the identifiers within each type need to be
> sufficiently global/
> > >unambiguous to avoid possible confusion.
> > >
> > >> It was also brought up that servers may be able to
> export different
> > >> kinds of layouts (e.g., both files and objects, or
> multiple flavors
> > >> of layout foo) so clients may want to negotiate with the
> > >servers about
> > >> what kind of delegation to return. This implies a type in
> > >> the *_DELEG_ASK operations.
> > >
> > >Ok.
> > >
> > >Thanks,
> > >--David
> > >----------------------------------------------------
> > >David L. Black, Senior Technologist
> > >EMC Corporation, 176 South St., Hopkinton, MA 01748
> > >+1 (508) 293-7953 FAX: +1 (508) 293-7786
> > >black_david@emc.com Mobile: +1 (978) 394-7754
> > >----------------------------------------------------
> > >
> > >
> > >
> > >Yahoo! Groups Links
> > >
> > >
> > >
> > >
> > >
> >
> >
> > ------------------------ Yahoo! Groups Sponsor
> > ---------------------~-->
> > Buy Ink Cartridges or Refill Kits for your HP, Epson, Canon
> or Lexmark
> > Printer at MyInks.com. Free s/h on orders $50 or more to the
> > US & Canada.
> > http://www.c1tracking.com/l.asp?cid=5511
> > http://us.click.yahoo.com/mOAaAA/3exGAA/qnsNAA/W6uqlB/TM
> > --------------------------------------------------------------
> > -------~->
> >
> >
> > Yahoo! Groups Links
> >
> >
> >
> >
> >
>
>
> ------------------------ Yahoo! Groups Sponsor
> ---------------------~-->
> Buy Ink Cartridges or Refill Kits for your HP, Epson, Canon or Lexmark
> Printer at MyInks.com. Free s/h on orders $50 or more to the
> US & Canada.
> http://www.c1tracking.com/l.asp?cid=5511
> http://us.click.yahoo.com/mOAaAA/3exGAA/qnsNAA/W6uqlB/TM
> --------------------------------------------------------------
> -------~->
>
>
> Yahoo! Groups Links
>
>
>
>
> 

From andros@citi.umich.edu Fri Apr 16 08:43:25 2004
Return-Path: <andros@citi.umich.edu>
X-Sender: andros@citi.umich.edu
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 44338 invoked from network); 16 Apr 2004 15:43:24 -0000
Received: from unknown (66.218.66.172)
by m2.grp.scd.yahoo.com with QMQP; 16 Apr 2004 15:43:24 -0000
Received: from unknown (HELO citi.umich.edu) (141.211.133.111)
by mta4.grp.scd.yahoo.com with SMTP; 16 Apr 2004 15:43:24 -0000
Received: from citi.umich.edu (citi.umich.edu [141.211.133.111])
by citi.umich.edu (Postfix) with ESMTP
id CE6B820824; Fri, 16 Apr 2004 11:43:23 -0400 (EDT)
X-Mailer: exmh version 2.5 07/13/2001 with version: MH 6.8.3 #74[UCI]
To: pnfs-reqs@yahoogroups.com
Cc: andros@citi.umich.edu
In-reply-to: Your message of "Fri, 16 Apr 2004 00:37:49 EDT."
<D0FD8818-8F5F-11D8-8644-000A95A94F04@panasas.com>
Mime-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Date: Fri, 16 Apr 2004 11:43:23 -0400
Message-Id: <20040416154323.CE6B820824@citi.umich.edu>
X-eGroups-Remote-IP: 141.211.133.111
From: "William A.(Andy) Adamson" <andros@citi.umich.edu>
Subject: Re: [pnfs-reqs] Fwd: logistics status
X-Yahoo-Group-Post: member; u=169434965

hello

we do need to choose a day for both the face-to-face and the interim IETF
working group meeting - i need to reserve the appropriate room(s) at the
university. i also need an approximate head count. who makes the decision!?

-->Andy


garth@panasas.com said:
> We are planning our next face-to-face for U. Michigan (CITI) during
> the week of June 7-11, which is when and where the NFSv4 Bake-a-thon
> is being held. Tentative target, not yet confirmed or arranged, is
> June 7. This will give us a very good chance to interact with the
> broader NFSv4 community. There may be an interim IETF NFSv4 working
> group meeting at the same place in the same week (hopefully on the
> following day).


From dnoveck@netapp.com Fri Apr 16 09:20:51 2004
Return-Path: <Dave.Noveck@netapp.com>
X-Sender: Dave.Noveck@netapp.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 89990 invoked from network); 16 Apr 2004 16:20:50 -0000
Received: from unknown (66.218.66.167)
by m2.grp.scd.yahoo.com with QMQP; 16 Apr 2004 16:20:50 -0000
Received: from unknown (HELO mx01.netapp.com) (198.95.226.53)
by mta6.grp.scd.yahoo.com with SMTP; 16 Apr 2004 16:20:50 -0000
Received: from frejya.corp.netapp.com (frejya [10.57.157.119])
by mx01.netapp.com (8.12.10/8.12.10/NTAP-1.4) with ESMTP id i3GGKnZh005983
for <pnfs-reqs@yahoogroups.com>; Fri, 16 Apr 2004 09:20:49 -0700 (PDT)
Received: from svlexc01.hq.netapp.com (svlexc01.corp.netapp.com [10.57.156.135])
by frejya.corp.netapp.com (8.12.9/8.12.9/NTAP-1.5) with ESMTP id i3GGKnNm026049
for <pnfs-reqs@yahoogroups.com>; Fri, 16 Apr 2004 09:20:49 -0700 (PDT)
content-class: urn:content-classes:message
MIME-Version: 1.0
Content-Type: text/plain;
charset="iso-8859-1"
Content-Transfer-Encoding: quoted-printable
X-MimeOLE: Produced By Microsoft Exchange V6.0.6249.0
Date: Fri, 16 Apr 2004 09:20:45 -0700
Message-ID: <C8CF60CFC4D8A74E9945E32CF096548AB80D2D@silver.nane.netapp.com>
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
Thread-Topic: [pnfs-reqs] Fwd: logistics status
Thread-Index: AcQjyZdlAHU10p9NRGWHkBOwGXB0mQAAMXZw
To: <pnfs-reqs@yahoogroups.com>
X-eGroups-Remote-IP: 198.95.226.53
X-eGroups-From: "Noveck, Dave" <Dave.Noveck@netapp.com>
From: "Noveck, Dave" <dnoveck@netapp.com>
Subject: RE: [pnfs-reqs] Fwd: logistics status
X-Yahoo-Group-Post: member; u=44152831
X-Yahoo-Profile: davidnoveck

> who makes the decision!?

I think that's obvious. You do.

As far as the face-to-face goes, I think what I've heard is that
originally David Black was going to be in Chicago attending
T11 meetings Mon-Wed but that Peter Corbett but could not make
either Thursday or Friday, leaving us in a bind. David then said
that the schedule might have changed and that Monday might be OK,
but I haven't seen any confirmation of that. I think he also said
he could probably call in Monday AM, even if the schedule didn't
change. So maybe Monday AM, is the best bet if you have to decide
now.

One other possibility to consider is late Sunday afternoon or early
evening. People going to the bakeathon would probably be coming
Sunday anyway and would just have to arrive a bit earlier for the
face-to-face. It seems like this would accommodate what I know of
David's schedule, even if the T11 meeting is Monday PM in chicago.
Comments on the feasibility of this, anyone?

As far as the interim working group meeting there are fewer constraints
in that I assume that the attendance of all the pnfs players is not
required. We only need a few to bring the message to the v4 developers,
and I assume we will certainly have those in any case. To minimize
the strain for people not taking part in the bakeathon, I would assume
that we would make it in the same half of the week as the face-to-face.
So maybe Tuesday if the face-to-face is Monday or even Monday if the
face-to-face is Sunday.

-----Original Message-----
From: William A.(Andy) Adamson [mailto:andros@citi.umich.edu]
Sent: Friday, April 16, 2004 11:43 AM
To: pnfs-reqs@yahoogroups.com
Cc: andros@citi.umich.edu
Subject: Re: [pnfs-reqs] Fwd: logistics status


hello

we do need to choose a day for both the face-to-face and the interim IETF
working group meeting - i need to reserve the appropriate room(s) at the
university. i also need an approximate head count. who makes the decision!?

-->Andy


garth@panasas.com said:
> We are planning our next face-to-face for U. Michigan (CITI) during
> the week of June 7-11, which is when and where the NFSv4 Bake-a-thon
> is being held. Tentative target, not yet confirmed or arranged, is
> June 7. This will give us a very good chance to interact with the
> broader NFSv4 community. There may be an interim IETF NFSv4 working
> group meeting at the same place in the same week (hopefully on the
> following day).






Yahoo! Groups Links

From ggrider@lanl.gov Fri Apr 16 14:29:00 2004
Return-Path: <ggrider@lanl.gov>
X-Sender: ggrider@lanl.gov
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 24379 invoked from network); 16 Apr 2004 21:29:00 -0000
Received: from unknown (66.218.66.216)
by m22.grp.scd.yahoo.com with QMQP; 16 Apr 2004 21:29:00 -0000
Received: from unknown (HELO mailwasher-b.lanl.gov) (192.16.0.25)
by mta1.grp.scd.yahoo.com with SMTP; 16 Apr 2004 21:28:59 -0000
Received: from mailrelay3.lanl.gov (localhost.localdomain [127.0.0.1])
by mailwasher-b.lanl.gov (8.12.10/8.12.10/(ccn-5)) with ESMTP id i3GLSmik029265
for <pnfs-reqs@yahoogroups.com>; Fri, 16 Apr 2004 15:28:48 -0600
Received: from cic-mail.lanl.gov (localhost.localdomain [127.0.0.1])
by mailrelay3.lanl.gov (8.12.10/8.12.10/(ccn-5)) with ESMTP id i3GLSmQ8024396
for <pnfs-reqs@yahoogroups.com>; Fri, 16 Apr 2004 15:28:48 -0600
Received: from cthulu.lanl.gov (vpn-client-138.lanl.gov [128.165.253.138])
by cic-mail.lanl.gov (8.12.10/8.12.10/(ccn-5)) with ESMTP id i3GLSkia005828;
Fri, 16 Apr 2004 15:28:47 -0600
Message-Id: <5.2.0.9.2.20040416152807.02d28ca8@cic-mail.lanl.gov>
X-Sender: ggrider@cic-mail.lanl.gov
X-Mailer: QUALCOMM Windows Eudora Version 5.2.0.9
Date: Fri, 16 Apr 2004 15:28:45 -0600
To: pnfs-reqs@yahoogroups.com, <pnfs-reqs@yahoogroups.com>
In-Reply-To: <C8CF60CFC4D8A74E9945E32CF096548AB80D2D@silver.nane.netapp.
com>
Mime-Version: 1.0
Content-Type: multipart/alternative;
boundary="=====================_1041117==.ALT"
X-Scanned-By: MIMEDefang 2.35
X-eGroups-Remote-IP: 192.16.0.25
From: Gary Grider <ggrider@lanl.gov>
Subject: RE: [pnfs-reqs] Fwd: logistics status
X-Yahoo-Group-Post: member; u=169341185
X-Yahoo-Profile: ggriderpnfs

ADVERTISEMENT
I would like to be able to call in if I cant make it in person, so can I request that
a conf call in be part of the festivities?

Thanks
Gary Grider
LANL

At 09:20 AM 4/16/2004 -0700, Noveck, Dave wrote:

> > who makes the decision!?
>
> I think that's obvious.  You do.
>
> As far as the face-to-face goes, I think what I've heard is that
> originally David Black was going to be in Chicago attending
> T11 meetings Mon-Wed but that Peter Corbett but could not make
> either Thursday or Friday, leaving us in a bind.  David then said
> that the schedule might have changed and that Monday might be OK,
> but I haven't seen any confirmation of that.  I think he also said
> he could probably call in Monday AM, even if the schedule didn't
> change.  So maybe Monday AM, is the best bet if you have to decide
> now.
>
> One other possibility to consider is late Sunday afternoon or early
> evening.  People going to the bakeathon would probably be coming
> Sunday anyway and would just have to arrive a bit earlier for the
> face-to-face.  It seems like this would accommodate what I know of
> David's schedule, even if the T11 meeting is Monday PM in chicago.
> Comments on the feasibility of this, anyone?
>
> As far as the interim working group meeting there are fewer constraints
> in that I assume that the attendance of all the pnfs players is not
> required.  We only need a few to bring the message to the v4 developers,
> and I assume we will certainly have those in any case.  To minimize
> the strain for people not taking part in the bakeathon, I would assume
> that we would make it in the same half of the week as the face-to-face.
> So maybe Tuesday if the face-to-face is Monday or even Monday if the
> face-to-face is Sunday.
>
> -----Original Message-----
> From: William A.(Andy) Adamson [mailto:andros@citi.umich.edu]
> Sent: Friday, April 16, 2004 11:43 AM
> To: pnfs-reqs@yahoogroups.com
> Cc: andros@citi.umich.edu
> Subject: Re: [pnfs-reqs] Fwd: logistics status
>
>
> hello
>
> we do need to choose a day for both the face-to-face and the interim IETF
> working group meeting - i need to reserve the appropriate room(s) at the
> university. i also need an approximate head count. who makes the decision!?
>
> -->Andy
>
>
> garth@panasas.com said:
> > We are planning our next face-to-face for U. Michigan (CITI) during
> > the week of June 7-11, which is when and where the NFSv4 Bake-a-thon
> > is being held.  Tentative target, not yet confirmed or arranged, is
> > June 7.  This will give us a very good chance to interact with the
> > broader NFSv4 community.  There may be an interim IETF NFSv4 working
> > group meeting at the same place in the same week (hopefully on the
> > following day).
>
>
>
>
>
>
> Yahoo! Groups Links
>
>
>
>
>
>
> Yahoo! Groups Links
>
>     * To visit your group on the web, go to:
>     * http://groups.yahoo.com/group/pnfs-reqs/
>     *  
>     * To unsubscribe from this group, send an email to:
>     * pnfs-reqs-unsubscribe@yahoogroups.com
>     *  
>     * Your use of Yahoo! Groups is subject to the Yahoo! Terms of Service. 

From black_david@emc.com Sun Apr 18 09:15:50 2004
Return-Path: <Black_David@emc.com>
X-Sender: Black_David@emc.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 63530 invoked from network); 18 Apr 2004 16:15:49 -0000
Received: from unknown (66.218.66.172)
by m20.grp.scd.yahoo.com with QMQP; 18 Apr 2004 16:15:49 -0000
Received: from unknown (HELO mxic2.corp.emc.com) (128.221.12.9)
by mta4.grp.scd.yahoo.com with SMTP; 18 Apr 2004 16:15:49 -0000
Received: by mxic2.corp.emc.com with Internet Mail Service (5.5.2653.19)
id <JDZDAQ5M>; Sun, 18 Apr 2004 12:15:48 -0400
Message-ID: <B459CE1AFFC52D4688B2A5B842CA35EA7A58BB@corpmx14.us.dg.com>
To: pnfs-reqs@yahoogroups.com
Date: Sun, 18 Apr 2004 12:15:47 -0400
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2653.19)
Content-Type: text/plain
X-eGroups-Remote-IP: 128.221.12.9
From: black_david@emc.com
Subject: RE: [pnfs-reqs] Fwd: logistics status
X-Yahoo-Group-Post: member; u=82420288
X-Yahoo-Profile: dlb237

> As far as the face-to-face goes, I think what I've heard is that
> originally David Black was going to be in Chicago attending
> T11 meetings Mon-Wed but that Peter Corbett but could not make
> either Thursday or Friday, leaving us in a bind. David then said
> that the schedule might have changed and that Monday might be OK,
> but I haven't seen any confirmation of that. I think he also said
> he could probably call in Monday AM, even if the schedule didn't
> change. So maybe Monday AM, is the best bet if you have to decide
> now.

T11 now has the correct info on their web site. I need to attend
T11 meetings in Chicago (Central time):
Mon (6/7): 1p-6p
Tue (6/8): 1p-9p
Wed (6/9): 9a-12n, 3p-6p
Transportation realities make it impossible to spend half a day in
Ann Arbor and the other half of the same day in Chicago, so a Mon or
Tue AM pNFS meeting with dial-in looks like the best bet. I could be
in Ann Arbor all day Thu or Fri.

Thanks,
--David

From garth@panasas.com Mon Apr 19 09:30:07 2004
Return-Path: <garth@panasas.com>
X-Sender: garth@panasas.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 69066 invoked from network); 19 Apr 2004 16:30:06 -0000
Received: from unknown (66.218.66.167)
by m25.grp.scd.yahoo.com with QMQP; 19 Apr 2004 16:30:06 -0000
Received: from unknown (HELO PIKES.panasas.com) (65.194.124.178)
by mta6.grp.scd.yahoo.com with SMTP; 19 Apr 2004 16:30:06 -0000
Received: from [127.0.0.1] ([172.17.19.3]) by PIKES.panasas.com with SMTP (Microsoft Exchange Internet Mail Service Version 5.5.2653.13)
id 2B56LWFG; Mon, 19 Apr 2004 12:29:41 -0400
Mime-Version: 1.0 (Apple Message framework v612)
In-Reply-To: <B459CE1AFFC52D4688B2A5B842CA35EA7A58BB@corpmx14.us.dg.com>
References: <B459CE1AFFC52D4688B2A5B842CA35EA7A58BB@corpmx14.us.dg.com>
Content-Type: text/plain; charset=US-ASCII; delsp=yes; format=flowed
Message-Id: <C132FCE0-921E-11D8-8644-000A95A94F04@panasas.com>
Content-Transfer-Encoding: 7bit
Date: Mon, 19 Apr 2004 09:29:39 -0700
To: pnfs-reqs@yahoogroups.com
X-Mailer: Apple Mail (2.612)
X-eGroups-Remote-IP: 65.194.124.178
From: Garth Gibson <garth@panasas.com>
Subject: Re: [pnfs-reqs] Fwd: logistics status
X-Yahoo-Group-Post: member; u=169457820
X-Yahoo-Profile: garth_a_gibson

I think Peter Corbett and I cannot be in Ann Arbor at all on Thursday
and Friday.

What about others? Lets find a day where the largest number can attend
in person.

Pending the outcome of this poll, Andy & Peter -- who does a Monday
pNFS face-to-face meeting look at your end?

garth

On Apr 18, 2004, at 9:15 AM, black_david@emc.com wrote:

>> As far as the face-to-face goes, I think what I've heard is that
>> originally David Black was going to be in Chicago attending
>> T11 meetings Mon-Wed but that Peter Corbett but could not make
>> either Thursday or Friday, leaving us in a bind. David then said
>> that the schedule might have changed and that Monday might be OK,
>> but I haven't seen any confirmation of that. I think he also said
>> he could probably call in Monday AM, even if the schedule didn't
>> change. So maybe Monday AM, is the best bet if you have to decide
>> now.
>
> T11 now has the correct info on their web site. I need to attend
> T11 meetings in Chicago (Central time):
> Mon (6/7): 1p-6p
> Tue (6/8): 1p-9p
> Wed (6/9): 9a-12n, 3p-6p
> Transportation realities make it impossible to spend half a day in
> Ann Arbor and the other half of the same day in Chicago, so a Mon or
> Tue AM pNFS meeting with dial-in looks like the best bet. I could be
> in Ann Arbor all day Thu or Fri.
>
> Thanks,
> --David
>
>
> ------------------------ Yahoo! Groups Sponsor
> ---------------------~-->
> Buy Ink Cartridges or Refill Kits for your HP, Epson, Canon or Lexmark
> Printer at MyInks.com. Free s/h on orders $50 or more to the US &
> Canada.
> http://www.c1tracking.com/l.asp?cid=5511
> http://us.click.yahoo.com/mOAaAA/3exGAA/qnsNAA/W6uqlB/TM
> ---------------------------------------------------------------------
> ~->
>
>
> Yahoo! Groups Links
>
>
>
>


From Thomas.Talpey@netapp.com Mon Apr 19 09:43:55 2004
Return-Path: <Thomas.Talpey@netapp.com>
X-Sender: Thomas.Talpey@netapp.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 44753 invoked from network); 19 Apr 2004 16:43:54 -0000
Received: from unknown (66.218.66.172)
by m14.grp.scd.yahoo.com with QMQP; 19 Apr 2004 16:43:54 -0000
Received: from unknown (HELO mx01.netapp.com) (198.95.226.53)
by mta4.grp.scd.yahoo.com with SMTP; 19 Apr 2004 16:43:54 -0000
Received: from frejya.corp.netapp.com (frejya [10.57.157.119])
by mx01.netapp.com (8.12.10/8.12.10/NTAP-1.4) with ESMTP id i3JGhfZh005594
for <pnfs-reqs@yahoogroups.com>; Mon, 19 Apr 2004 09:43:42 -0700 (PDT)
Received: from svlexc01.hq.netapp.com (svlexc01.corp.netapp.com [10.57.156.135])
by frejya.corp.netapp.com (8.12.9/8.12.9/NTAP-1.5) with ESMTP id i3JGhZNs011106
for <pnfs-reqs@yahoogroups.com>; Mon, 19 Apr 2004 09:43:41 -0700 (PDT)
Received: from tmt.netapp.com ([10.97.6.30]) by silver.hq.netapp.com with Microsoft SMTPSVC(5.0.2195.5329); Mon, 19 Apr 2004 12:42:28 -0400
MIME-Version: 1.0
Content-Type: multipart/alternative;
boundary="----_=_NextPart_001_01C4262D.4CF93200"
X-MimeOLE: Produced By Microsoft Exchange V6.0.6249.0
content-class: urn:content-classes:message
Date: Mon, 19 Apr 2004 09:41:46 -0700
Message-ID: <6.1.0.6.2.20040419124044.01e5f640@silver.nane.netapp.com>
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
Thread-Topic: [pnfs-reqs] Fwd: logistics status
Thread-Index: AcQmLU1VkIalZaUGRIqJjO9o9vK7iQ==
To: <pnfs-reqs@yahoogroups.com>
X-eGroups-Remote-IP: 198.95.226.53
From: "Talpey, Thomas" <Thomas.Talpey@netapp.com>
Subject: Re: [pnfs-reqs] Fwd: logistics status
X-Yahoo-Group-Post: member; u=44154239
X-Yahoo-Profile: tmtymailu

Right now, I plan to attend the v4 event Mon-Thurs and cannot
attend on Fri.

Tom.

At 12:29 PM 4/19/2004, Garth Gibson wrote:
>I think Peter Corbett and I cannot be in Ann Arbor at all on Thursday 
>and Friday.
>
>What about others?  Lets find a day where the largest number can attend 
>in person.
>
>Pending the outcome of this poll, Andy & Peter -- who does a Monday 
>pNFS face-to-face meeting look at your end?
>
>garth
>
>On Apr 18, 2004, at 9:15 AM, black_david@emc.com wrote:
>
>>> As far as the face-to-face goes, I think what I've heard is that
>>> originally David Black was going to be in Chicago attending
>>> T11 meetings Mon-Wed but that Peter Corbett but could not make
>>> either Thursday or Friday, leaving us in a bind.  David then said
>>> that the schedule might have changed and that Monday might be OK,
>>> but I haven't seen any confirmation of that.  I think he also said
>>> he could probably call in Monday AM, even if the schedule didn't
>>> change.  So maybe Monday AM, is the best bet if you have to decide
>>> now.
>>
>> T11 now has the correct info on their web site.  I need to attend
>> T11 meetings in Chicago (Central time):
>>      Mon (6/7): 1p-6p
>>      Tue (6/8): 1p-9p
>>      Wed (6/9): 9a-12n, 3p-6p
>> Transportation realities make it impossible to spend half a day in
>> Ann Arbor and the other half of the same day in Chicago, so a Mon or
>> Tue AM pNFS meeting with dial-in looks like the best bet.  I could be
>> in Ann Arbor all day Thu or Fri.
>>
>> Thanks,
>> --David
>>
>>
>> ------------------------ Yahoo! Groups Sponsor 
>> ---------------------~-->
>> Buy Ink Cartridges or Refill Kits for your HP, Epson, Canon or Lexmark
>> Printer at MyInks.com.  Free s/h on orders $50 or more to the US & 
>> Canada.
>> http://www.c1tracking.com/l.asp?cid=5511
>> http://us.click.yahoo.com/mOAaAA/3exGAA/qnsNAA/W6uqlB/TM
>> ---------------------------------------------------------------------
>> ~->
>>
>>
>> Yahoo! Groups Links
>>
>>
>>
>>
>
>
>
>------------------------ Yahoo! Groups Sponsor ---------------------~-->
>Buy Ink Cartridges or Refill Kits for your HP, Epson, Canon or Lexmark
>Printer at MyInks.com.  Free s/h on orders $50 or more to the US & Canada.
>http://www.c1tracking.com/l.asp?cid=5511
>http://us.click.yahoo.com/mOAaAA/3exGAA/qnsNAA/W6uqlB/TM
>---------------------------------------------------------------------~->
>
>
>Yahoo! Groups Links
>
><*> To visit your group on the web, go to:
>     http://groups.yahoo.com/group/pnfs-reqs/
>
><*> To unsubscribe from this group, send an email to:
>     pnfs-reqs-unsubscribe@yahoogroups.com
>
><*> Your use of Yahoo! Groups is subject to:
>     http://docs.yahoo.com/info/terms/
> 

From dnoveck@netapp.com Mon Apr 19 10:25:04 2004
Return-Path: <Dave.Noveck@netapp.com>
X-Sender: Dave.Noveck@netapp.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 61496 invoked from network); 19 Apr 2004 17:25:03 -0000
Received: from unknown (66.218.66.216)
by m24.grp.scd.yahoo.com with QMQP; 19 Apr 2004 17:25:03 -0000
Received: from unknown (HELO mx01.netapp.com) (198.95.226.53)
by mta1.grp.scd.yahoo.com with SMTP; 19 Apr 2004 17:25:02 -0000
Received: from hawk.corp.netapp.com (hawk [10.57.156.122])
by mx01.netapp.com (8.12.10/8.12.10/NTAP-1.4) with ESMTP id i3JHIVZh014915
for <pnfs-reqs@yahoogroups.com>; Mon, 19 Apr 2004 10:18:31 -0700 (PDT)
Received: from svlexc01.hq.netapp.com (svlexc01.corp.netapp.com [10.57.156.135])
by hawk.corp.netapp.com (8.12.9/8.12.9/NTAP-1.5) with ESMTP id i3JHIStS026583
for <pnfs-reqs@yahoogroups.com>; Mon, 19 Apr 2004 10:18:31 -0700 (PDT)
content-class: urn:content-classes:message
MIME-Version: 1.0
Content-Type: text/plain;
charset="us-ascii"
Content-Transfer-Encoding: quoted-printable
X-MimeOLE: Produced By Microsoft Exchange V6.0.6249.0
Date: Mon, 19 Apr 2004 10:18:10 -0700
Message-ID: <C8CF60CFC4D8A74E9945E32CF096548AB80D36@silver.nane.netapp.com>
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
Thread-Topic: [pnfs-reqs] Fwd: logistics status
Thread-Index: AcQmLIt4OogaRwgIQhCl9cy8A5eDmwABXc+w
To: <pnfs-reqs@yahoogroups.com>
X-eGroups-Remote-IP: 198.95.226.53
X-eGroups-From: "Noveck, Dave" <Dave.Noveck@netapp.com>
From: "Noveck, Dave" <dnoveck@netapp.com>
Subject: RE: [pnfs-reqs] Fwd: logistics status
X-Yahoo-Group-Post: member; u=44152831
X-Yahoo-Profile: davidnoveck

I'm OK with any day. However, Friday PM is something that I'd like to
stay away from as it could complicate my travel plans.

-----Original Message-----
From: Garth Gibson [mailto:garth@panasas.com]
Sent: Monday, April 19, 2004 12:30 PM
To: pnfs-reqs@yahoogroups.com
Subject: Re: [pnfs-reqs] Fwd: logistics status


I think Peter Corbett and I cannot be in Ann Arbor at all on Thursday
and Friday.

What about others? Lets find a day where the largest number can attend

in person.

Pending the outcome of this poll, Andy & Peter -- who does a Monday
pNFS face-to-face meeting look at your end?

garth

On Apr 18, 2004, at 9:15 AM, black_david@emc.com wrote:

>> As far as the face-to-face goes, I think what I've heard is that
>> originally David Black was going to be in Chicago attending
>> T11 meetings Mon-Wed but that Peter Corbett but could not make
>> either Thursday or Friday, leaving us in a bind. David then said
>> that the schedule might have changed and that Monday might be OK,
>> but I haven't seen any confirmation of that. I think he also said
>> he could probably call in Monday AM, even if the schedule didn't
>> change. So maybe Monday AM, is the best bet if you have to decide
>> now.
>
> T11 now has the correct info on their web site. I need to attend
> T11 meetings in Chicago (Central time):
> Mon (6/7): 1p-6p
> Tue (6/8): 1p-9p
> Wed (6/9): 9a-12n, 3p-6p
> Transportation realities make it impossible to spend half a day in
> Ann Arbor and the other half of the same day in Chicago, so a Mon or
> Tue AM pNFS meeting with dial-in looks like the best bet. I could be
> in Ann Arbor all day Thu or Fri.
>
> Thanks,
> --David
>
>
> ------------------------ Yahoo! Groups Sponsor
> ---------------------~-->
> Buy Ink Cartridges or Refill Kits for your HP, Epson, Canon or Lexmark
> Printer at MyInks.com. Free s/h on orders $50 or more to the US &
> Canada.
> http://www.c1tracking.com/l.asp?cid=5511
> http://us.click.yahoo.com/mOAaAA/3exGAA/qnsNAA/W6uqlB/TM
> ---------------------------------------------------------------------
> ~->
>
>
> Yahoo! Groups Links
>
>
>
>





Yahoo! Groups Links



From pcorbett@netapp.com Mon Apr 19 11:22:29 2004
Return-Path: <Peter.Corbett@netapp.com>
X-Sender: Peter.Corbett@netapp.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 25679 invoked from network); 19 Apr 2004 18:22:28 -0000
Received: from unknown (66.218.66.167)
by m25.grp.scd.yahoo.com with QMQP; 19 Apr 2004 18:22:28 -0000
Received: from unknown (HELO mx01.netapp.com) (198.95.226.53)
by mta6.grp.scd.yahoo.com with SMTP; 19 Apr 2004 18:22:28 -0000
Received: from hawk.corp.netapp.com (hawk [10.57.156.122])
by mx01.netapp.com (8.12.10/8.12.10/NTAP-1.4) with ESMTP id i3JIMSZh000317
for <pnfs-reqs@yahoogroups.com>; Mon, 19 Apr 2004 11:22:28 -0700 (PDT)
Received: from svlexc01.hq.netapp.com (svlexc01.corp.netapp.com [10.57.156.135])
by hawk.corp.netapp.com (8.12.9/8.12.9/NTAP-1.5) with ESMTP id i3JIMCtA023696
for <pnfs-reqs@yahoogroups.com>; Mon, 19 Apr 2004 11:22:27 -0700 (PDT)
content-class: urn:content-classes:message
MIME-Version: 1.0
Content-Type: text/plain;
charset="us-ascii"
Content-Transfer-Encoding: quoted-printable
X-MimeOLE: Produced By Microsoft Exchange V6.0.6249.0
Date: Mon, 19 Apr 2004 11:22:20 -0700
Message-ID: <C8CF60CFC4D8A74E9945E32CF096548A016F53A5@silver.nane.netapp.com>
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
Thread-Topic: [pnfs-reqs] Fwd: logistics status
Thread-Index: AcQmLIzbxmZ+T7b+TeqmvqUm+2STOwADqsKw
To: <pnfs-reqs@yahoogroups.com>
X-eGroups-Remote-IP: 198.95.226.53
X-eGroups-From: "Corbett, Peter" <Peter.Corbett@netapp.com>
From: "Corbett, Peter" <pcorbett@netapp.com>
Subject: RE: [pnfs-reqs] Fwd: logistics status
X-Yahoo-Group-Post: member; u=44152959
X-Yahoo-Profile: pfcorbett2004

Monday is fine with me.

-----Original Message-----
From: Garth Gibson [mailto:garth@panasas.com]
Sent: Monday, April 19, 2004 12:30 PM
To: pnfs-reqs@yahoogroups.com
Subject: Re: [pnfs-reqs] Fwd: logistics status

I think Peter Corbett and I cannot be in Ann Arbor at all on Thursday
and Friday.

What about others? Lets find a day where the largest number can attend

in person.

Pending the outcome of this poll, Andy & Peter -- who does a Monday
pNFS face-to-face meeting look at your end?

garth

On Apr 18, 2004, at 9:15 AM, black_david@emc.com wrote:

>> As far as the face-to-face goes, I think what I've heard is that
>> originally David Black was going to be in Chicago attending
>> T11 meetings Mon-Wed but that Peter Corbett but could not make
>> either Thursday or Friday, leaving us in a bind. David then said
>> that the schedule might have changed and that Monday might be OK,
>> but I haven't seen any confirmation of that. I think he also said
>> he could probably call in Monday AM, even if the schedule didn't
>> change. So maybe Monday AM, is the best bet if you have to decide
>> now.
>
> T11 now has the correct info on their web site. I need to attend
> T11 meetings in Chicago (Central time):
> Mon (6/7): 1p-6p
> Tue (6/8): 1p-9p
> Wed (6/9): 9a-12n, 3p-6p
> Transportation realities make it impossible to spend half a day in
> Ann Arbor and the other half of the same day in Chicago, so a Mon or
> Tue AM pNFS meeting with dial-in looks like the best bet. I could be
> in Ann Arbor all day Thu or Fri.
>
> Thanks,
> --David
>
>
> ------------------------ Yahoo! Groups Sponsor
> ---------------------~-->
> Buy Ink Cartridges or Refill Kits for your HP, Epson, Canon or Lexmark
> Printer at MyInks.com. Free s/h on orders $50 or more to the US &
> Canada.
> http://www.c1tracking.com/l.asp?cid=5511
> http://us.click.yahoo.com/mOAaAA/3exGAA/qnsNAA/W6uqlB/TM
> ---------------------------------------------------------------------
> ~->
>
>
> Yahoo! Groups Links
>
>
>
>





Yahoo! Groups Links

From mclarty3@llnl.gov Mon Apr 19 11:23:15 2004
Return-Path: <mclarty3@llnl.gov>
X-Sender: mclarty3@llnl.gov
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 42596 invoked from network); 19 Apr 2004 18:23:14 -0000
Received: from unknown (66.218.66.167)
by m18.grp.scd.yahoo.com with QMQP; 19 Apr 2004 18:23:14 -0000
Received: from unknown (HELO smtp-4.llnl.gov) (128.115.41.84)
by mta6.grp.scd.yahoo.com with SMTP; 19 Apr 2004 18:23:14 -0000
Received: from poptop.llnl.gov ([127.0.0.1])
by smtp-4.llnl.gov (8.12.3/8.12.3/LLNL evision: 1.14 $) with ESMTP id i3JINDv1026748
for <pnfs-reqs@yahoogroups.com>; Mon, 19 Apr 2004 11:23:13 -0700 (PDT)
Received: from POLARBEAR.llnl.gov ([134.9.18.59] verified)
by poptop.llnl.gov (CommuniGate Pro SMTP 4.0.6)
with ESMTP id 41238041 for pnfs-reqs@yahoogroups.com; Mon, 19 Apr 2004 11:23:12 -0700
Message-Id: <5.0.0.25.2.20040419112120.02f67d60@poptop.llnl.gov>
X-Sender: e002801@poptop.llnl.gov
X-Mailer: QUALCOMM Windows Eudora Version 5.0
Date: Mon, 19 Apr 2004 11:23:12 -0700
To: pnfs-reqs@yahoogroups.com
In-Reply-To: <C132FCE0-921E-11D8-8644-000A95A94F04@panasas.com>
References: <B459CE1AFFC52D4688B2A5B842CA35EA7A58BB@corpmx14.us.dg.com>
<B459CE1AFFC52D4688B2A5B842CA35EA7A58BB@corpmx14.us.dg.com>
Mime-Version: 1.0
Content-Type: text/plain; charset="us-ascii"; format=flowed
X-Scanned-By: MIMEDefang 2.39
X-eGroups-Remote-IP: 128.115.41.84
From: Tyce McLarty <mclarty3@llnl.gov>
Subject: Re: [pnfs-reqs] Fwd: logistics status
X-Yahoo-Group-Post: member; u=169320772
X-Yahoo-Profile: mclarty3

Any day that week works for Bill Loewe and I from LLNL.

Tyce

At 09:29 AM 4/19/2004 -0700, you wrote:
>I think Peter Corbett and I cannot be in Ann Arbor at all on Thursday
>and Friday.
>
>What about others? Lets find a day where the largest number can attend
>in person.
>
>Pending the outcome of this poll, Andy & Peter -- who does a Monday
>pNFS face-to-face meeting look at your end?
>
>garth
>
>On Apr 18, 2004, at 9:15 AM, black_david@emc.com wrote:
>
> >> As far as the face-to-face goes, I think what I've heard is that
> >> originally David Black was going to be in Chicago attending
> >> T11 meetings Mon-Wed but that Peter Corbett but could not make
> >> either Thursday or Friday, leaving us in a bind. David then said
> >> that the schedule might have changed and that Monday might be OK,
> >> but I haven't seen any confirmation of that. I think he also said
> >> he could probably call in Monday AM, even if the schedule didn't
> >> change. So maybe Monday AM, is the best bet if you have to decide
> >> now.
> >
> > T11 now has the correct info on their web site. I need to attend
> > T11 meetings in Chicago (Central time):
> > Mon (6/7): 1p-6p
> > Tue (6/8): 1p-9p
> > Wed (6/9): 9a-12n, 3p-6p
> > Transportation realities make it impossible to spend half a day in
> > Ann Arbor and the other half of the same day in Chicago, so a Mon or
> > Tue AM pNFS meeting with dial-in looks like the best bet. I could be
> > in Ann Arbor all day Thu or Fri.
> >
> > Thanks,
> > --David
> >
> >
> > ------------------------ Yahoo! Groups Sponsor
> > ---------------------~-->
> > Buy Ink Cartridges or Refill Kits for your HP, Epson, Canon or Lexmark
> > Printer at MyInks.com. Free s/h on orders $50 or more to the US &
> > Canada.
> > http://www.c1tracking.com/l.asp?cid=5511
> > http://us.click.yahoo.com/mOAaAA/3exGAA/qnsNAA/W6uqlB/TM
> > ---------------------------------------------------------------------
> > ~->
> >
> >
> > Yahoo! Groups Links
> >
> >
> >
> >
>
>
>
>
>
>Yahoo! Groups Links
>
>
>
>

From andros@citi.umich.edu Mon Apr 19 13:17:08 2004
Return-Path: <andros@citi.umich.edu>
X-Sender: andros@citi.umich.edu
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 27216 invoked from network); 19 Apr 2004 20:17:07 -0000
Received: from unknown (66.218.66.172)
by m13.grp.scd.yahoo.com with QMQP; 19 Apr 2004 20:17:07 -0000
Received: from unknown (HELO citi.umich.edu) (141.211.133.111)
by mta4.grp.scd.yahoo.com with SMTP; 19 Apr 2004 20:17:07 -0000
Received: from citi.umich.edu (citi.umich.edu [141.211.133.111])
by citi.umich.edu (Postfix) with ESMTP
id 02CB0207F1; Mon, 19 Apr 2004 16:15:37 -0400 (EDT)
X-Mailer: exmh version 2.5 07/13/2001 with version: MH 6.8.3 #74[UCI]
To: pnfs-reqs@yahoogroups.com
Cc: andros@citi.umich.edu
In-reply-to: Your message of "Mon, 19 Apr 2004 10:18:10 PDT."
<C8CF60CFC4D8A74E9945E32CF096548AB80D36@silver.nane.netapp.com>
Mime-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Date: Mon, 19 Apr 2004 16:15:36 -0400
Message-Id: <20040419201537.02CB0207F1@citi.umich.edu>
X-eGroups-Remote-IP: 141.211.133.111
From: "William A.(Andy) Adamson" <andros@citi.umich.edu>
Subject: Re: [pnfs-reqs] Fwd: logistics status
X-Yahoo-Group-Post: member; u=169434965

monday is spent setting up the bakeathon. i think it would be best if we had
the pNFS meeting on tuesday june 8th, with the interim IETF meeting on
wednesday.

i propose tuesday.

-->Andy



garth@panasas.com said:
> I think Peter Corbett and I cannot be in Ann Arbor at all on Thursday
> and Friday.

> What about others? Lets find a day where the largest number can
> attend in person.

> Pending the outcome of this poll, Andy & Peter -- who does a Monday
> pNFS face-to-face meeting look at your end?

> garth 

From spencer.shepler@sun.com Mon Apr 19 14:14:43 2004
Return-Path: <spencer.shepler@sun.com>
X-Sender: spencer.shepler@sun.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 45912 invoked from network); 19 Apr 2004 21:14:42 -0000
Received: from unknown (66.218.66.167)
by m21.grp.scd.yahoo.com with QMQP; 19 Apr 2004 21:14:42 -0000
Received: from unknown (HELO nwkea-mail-2.sun.com) (192.18.42.14)
by mta6.grp.scd.yahoo.com with SMTP; 19 Apr 2004 21:14:42 -0000
Received: from engmail1mpk.Eng.Sun.COM ([129.146.11.21])
by nwkea-mail-2.sun.com (8.12.10/8.12.9) with ESMTP id i3JLEghO022846
for <pnfs-reqs@yahoogroups.com>; Mon, 19 Apr 2004 14:14:42 -0700 (PDT)
Received: from [129.153.128.28] (this.Central.Sun.COM [129.153.128.28])
by engmail1mpk.Eng.Sun.COM (8.12.10+Sun/8.12.10/ENSMAIL,v2.2) with ESMTP id i3JLEfjW014250
for <pnfs-reqs@yahoogroups.com>; Mon, 19 Apr 2004 14:14:42 -0700 (PDT)
Mime-Version: 1.0 (Apple Message framework v613)
In-Reply-To: <20040419201537.02CB0207F1@citi.umich.edu>
References: <20040419201537.02CB0207F1@citi.umich.edu>
Content-Type: text/plain; charset=US-ASCII; format=flowed
Message-Id: <FB51F06E-9246-11D8-96FB-000A95DBCB70@sun.com>
Content-Transfer-Encoding: 7bit
Date: Mon, 19 Apr 2004 16:17:36 -0500
To: pnfs-reqs@yahoogroups.com
X-Mailer: Apple Mail (2.613)
X-eGroups-Remote-IP: 192.18.42.14
From: Spencer Shepler <spencer.shepler@sun.com>
Subject: Re: [pnfs-reqs] Fwd: logistics status
X-Yahoo-Group-Post: member; u=129247794
X-Yahoo-Profile: s_shepler

ADVERTISEMENT

this is fine with me with respect to the IETF interim meeting

On Apr 19, 2004, at 3:15 PM, William A.(Andy) Adamson wrote:

> monday is spent setting up the bakeathon. i think it would be best if
> we had
> the pNFS meeting on tuesday june 8th, with the interim IETF meeting on
> wednesday.
>
> i propose tuesday.
>
> -->Andy
>
>
>
> garth@panasas.com said:
>> I think Peter Corbett and I cannot be in Ann Arbor at all on Thursday
>> and Friday.
>
>> What about others? Lets find a day where the largest number can
>> attend in person.
>
>> Pending the outcome of this poll, Andy & Peter -- who does a Monday
>> pNFS face-to-face meeting look at your end?
>
>> garth
>
>
>
>
>
>
> Yahoo! Groups Links
>
>
>
>
>

From bwelch@panasas.com Mon Apr 19 18:40:20 2004
Return-Path: <welch@panasas.com>
X-Sender: welch@panasas.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 94824 invoked from network); 20 Apr 2004 01:40:19 -0000
Received: from unknown (66.218.66.167)
by m15.grp.scd.yahoo.com with QMQP; 20 Apr 2004 01:40:19 -0000
Received: from unknown (HELO medlicott.panasas.com) (63.80.58.202)
by mta6.grp.scd.yahoo.com with SMTP; 20 Apr 2004 01:40:19 -0000
Received: from panasas.com (welch@localhost)
by medlicott.panasas.com (8.11.6/8.11.6) with ESMTP id i3K1eI927227
for <pnfs-reqs@yahoogroups.com>; Mon, 19 Apr 2004 18:40:18 -0700
Message-Id: <200404200140.i3K1eI927227@medlicott.panasas.com>
X-Authentication-Warning: medlicott.panasas.com: welch owned process doing -bs
X-Mailer: exmh version 2.6.4 04/07/2004 with nmh-1.0.4
To: pnfs-reqs@yahoogroups.com
In-reply-to: <C132FCE0-921E-11D8-8644-000A95A94F04@panasas.com>
References: <B459CE1AFFC52D4688B2A5B842CA35EA7A58BB@corpmx14.us.dg.com>
<C132FCE0-921E-11D8-8644-000A95A94F04@panasas.com>
Comments: In-reply-to Garth Gibson <garth@panasas.com>
message dated "Mon, 19 Apr 2004 09:29:39 -0700."
X-URL: http://www.panasas.com/
X-Face: "HxE|?EnC9fVMV8f70H83&{fgLE.|FZ^$>@Q(yb#N,Eh~N]e&]=>
r5~UnRml1:4EglY{9B+
:'wJq$@c_C!l8@<$t,{YUr4K,QJGHSvS~U]H`<+L*x?eGzSk>XH\W:AK\j?@?c1o<k;j'Ei/UL)!*0
ILwSR)J\bc)gjz!rrGQ2#i*f:M:ydhK}jp4dWQW?;0{,#iWrCV$4~%e/3)$1/D
Mime-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Date: Mon, 19 Apr 2004 18:40:18 -0700
X-eGroups-Remote-IP: 63.80.58.202
X-eGroups-From: Brent Welch <welch@panasas.com>
From: Brent Welch <bwelch@panasas.com>
Subject: Re: [pnfs-reqs] Fwd: logistics status
X-Yahoo-Group-Post: member; u=169551413
X-Yahoo-Profile: brent_welch_1960

Any day is OK with me.

>>>Garth Gibson said:
> I think Peter Corbett and I cannot be in Ann Arbor at all on Thursday
> and Friday.
>
> What about others? Lets find a day where the largest number can attend
> in person.
>
> Pending the outcome of this poll, Andy & Peter -- who does a Monday
> pNFS face-to-face meeting look at your end?
>
> garth
>
> On Apr 18, 2004, at 9:15 AM, black_david@emc.com wrote:
>
> >> As far as the face-to-face goes, I think what I've heard is that
> >> originally David Black was going to be in Chicago attending
> >> T11 meetings Mon-Wed but that Peter Corbett but could not make
> >> either Thursday or Friday, leaving us in a bind. David then said
> >> that the schedule might have changed and that Monday might be OK,
> >> but I haven't seen any confirmation of that. I think he also said
> >> he could probably call in Monday AM, even if the schedule didn't
> >> change. So maybe Monday AM, is the best bet if you have to decide
> >> now.
> >
> > T11 now has the correct info on their web site. I need to attend
> > T11 meetings in Chicago (Central time):
> > Mon (6/7): 1p-6p
> > Tue (6/8): 1p-9p
> > Wed (6/9): 9a-12n, 3p-6p
> > Transportation realities make it impossible to spend half a day in
> > Ann Arbor and the other half of the same day in Chicago, so a Mon or
> > Tue AM pNFS meeting with dial-in looks like the best bet. I could be
> > in Ann Arbor all day Thu or Fri.
> >
> > Thanks,
> > --David
> >
> >
> > ------------------------ Yahoo! Groups Sponsor
> > ---------------------~-->
> > Buy Ink Cartridges or Refill Kits for your HP, Epson, Canon or Lexmark
> > Printer at MyInks.com. Free s/h on orders $50 or more to the US &
> > Canada.
> > http://www.c1tracking.com/l.asp?cid=5511
> > http://us.click.yahoo.com/mOAaAA/3exGAA/qnsNAA/W6uqlB/TM
> > ---------------------------------------------------------------------
> > ~->
> >
> >
> > Yahoo! Groups Links
> >
> >
> >
> >
>
>
>
>
>
> Yahoo! Groups Links
>
>
>
>
>

--
Brent Welch
Software Architect, Panasas Inc
Delivering the premier storage system for scalable Linux clusters

www.panasas.com
welch@panasas.com

From garth@panasas.com Tue Apr 20 13:30:18 2004
Return-Path: <garth@panasas.com>
X-Sender: garth@panasas.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 72646 invoked from network); 20 Apr 2004 20:30:17 -0000
Received: from unknown (66.218.66.218)
by m11.grp.scd.yahoo.com with QMQP; 20 Apr 2004 20:30:17 -0000
Received: from unknown (HELO PIKES.panasas.com) (65.194.124.178)
by mta3.grp.scd.yahoo.com with SMTP; 20 Apr 2004 20:30:17 -0000
Received: from [127.0.0.1] ([172.17.19.3]) by PIKES.panasas.com with SMTP (Microsoft Exchange Internet Mail Service Version 5.5.2653.13)
id 2B56L9D4; Tue, 20 Apr 2004 16:30:15 -0400
Mime-Version: 1.0 (Apple Message framework v612)
Content-Transfer-Encoding: 7bit
Message-Id: <86588ED1-9309-11D8-912C-000A95A94F04@panasas.com>
Content-Type: text/plain; charset=US-ASCII; format=flowed
To: pnfs-reqs@yahoogroups.com
Date: Tue, 20 Apr 2004 13:30:12 -0700
X-Mailer: Apple Mail (2.612)
X-eGroups-Remote-IP: 65.194.124.178
From: Garth Gibson <garth@panasas.com>
Subject: FYI
X-Yahoo-Group-Post: member; u=169457820
X-Yahoo-Profile: garth_a_gibson

Call for Papers and Participation

WORKSHOP ON SCALABLE FILE SYSTEMS AND STORAGE TECHNOLOGIES

in conjunction with

THE 17TH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED
COMPUTING SYSTEMS (PDCS 2004)

San Francisco, CA
September 15-17, 2004

http://multimedia.ece.uic.edu/pdcs2004/PDCS_WS_CFP.pdf

Computational simulations generate vast amounts of data that require
distributed storage facilities to effectively store, retrieve, and share
the information among collaborating applications. Performance of such
simulations is impacted by the performance of the storage subsystem.
Storage technologies are also evolving drastically in order to keep up
with the ever-increasing demand for improved data access. Along with
complex simulations and associated storage come new challenges. The goal
of this workshop is to encourage innovation by addressing the complex
issues that arise in data storage associated with very large-scale
distributed applications. This one-day workshop promotes the exchange
of
novel ideas, information and developments among universities, industry
and
federal laboratories. Topics of interest of this workshop include (but
are
not limited to) the illustration of advances in the following areas:

* Parallel File Systems :
* Storage Technology and protocols
* Storage issues in Grid computing environments
* Storage issues associated with large-scale distributed applications
* I/O Performance Analysis;
* Parallel I/O support for databases.

There will be several invited 30-minutes lectures as well as contributed
20-30 minutes talks. Accepted abstracts would be published in the
conference proceedings. Links to individual presenters materials will be
posted at http://ardra.hpcl.cis.uab.edu/sfast04/

To submit papers, send your manuscript (4 page extended abstract)
electronically (PS or PDF format, compressed) or by mail (3 paper
copies)
to the following address:

Vijay Velusamy
High Performance Computing Laboratory
Department of Computer and Information Sciences
113A Campbell Hall, 1300 University Boulevard
Birmingham, AL 35294-1170
e-mail: vijay@hpcl.cis.uab.edu


IMPORTANT DATES
==================
Submission deadline : June 4, 2004
Acceptance Notifications : July 5, 2004
Camera-ready prints : July 28, 2004


Invited Speakers:
* Garth Gibson (Carnegie Mellon University, Panasas, Inc.)
* Gary Grider (Los Alamos National Laboratories)
* Tyce Mclarty (Lawrence Livermore National Laboratories)
* Thomas H. Cormen (Dartmouth College)
* Robert Ross (Argonne National Laboratories)

Steering Committee:
* Anthony Skjellum (University of Alabama at Birmingham)
* Vijay Velusamy (University of Alabama at Birmingham)
* Kumaran Rajaram (Verari Systems, Inc.)

From garth@panasas.com Wed Apr 21 10:57:26 2004
Return-Path: <garth@panasas.com>
X-Sender: garth@panasas.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 21847 invoked from network); 21 Apr 2004 17:57:24 -0000
Received: from unknown (66.218.66.167)
by m24.grp.scd.yahoo.com with QMQP; 21 Apr 2004 17:57:24 -0000
Received: from unknown (HELO PIKES.panasas.com) (65.194.124.178)
by mta6.grp.scd.yahoo.com with SMTP; 21 Apr 2004 17:57:24 -0000
Received: from [127.0.0.1] ([172.17.19.3]) by PIKES.panasas.com with SMTP (Microsoft Exchange Internet Mail Service Version 5.5.2653.13)
id 2B56MD5F; Wed, 21 Apr 2004 13:56:37 -0400
Mime-Version: 1.0 (Apple Message framework v612)
In-Reply-To: <20040419201537.02CB0207F1@citi.umich.edu>
References: <20040419201537.02CB0207F1@citi.umich.edu>
Content-Type: text/plain; charset=US-ASCII; format=flowed
Message-Id: <3B147EB5-93BD-11D8-912C-000A95A94F04@panasas.com>
Content-Transfer-Encoding: 7bit
Date: Wed, 21 Apr 2004 10:56:35 -0700
To: pnfs-reqs@yahoogroups.com
X-Mailer: Apple Mail (2.612)
X-eGroups-Remote-IP: 65.194.124.178
From: Garth Gibson <garth@panasas.com>
Subject: Re: [pnfs-reqs] Fwd: logistics status
X-Yahoo-Group-Post: member; u=169457820
X-Yahoo-Profile: garth_a_gibson

Lets decide this at the logistics concall tomorrow morning 11 am EST.

We have a proposal of Tues June 8 for the pNFS face-to-face at
CITI/UMich in Ann Arbor.

Anyone wanting to join the concall tomorrow who does not have the
dialin, please send me direct email.

Thanks
garth

On Apr 19, 2004, at 1:15 PM, William A.(Andy) Adamson wrote:

> monday is spent setting up the bakeathon. i think it would be best if
> we had
> the pNFS meeting on tuesday june 8th, with the interim IETF meeting on
> wednesday.
>
> i propose tuesday.
>
> -->Andy
>
>
>
> garth@panasas.com said:
>> I think Peter Corbett and I cannot be in Ann Arbor at all on Thursday
>> and Friday.
>
>> What about others? Lets find a day where the largest number can
>> attend in person.
>
>> Pending the outcome of this poll, Andy & Peter -- who does a Monday
>> pNFS face-to-face meeting look at your end?
>
>> garth
>
>
>
>
>
>
> Yahoo! Groups Links
>
>
>
>

From spencer.shepler@sun.com Wed Apr 21 12:28:39 2004
Return-Path: <spencer.shepler@sun.com>
X-Sender: spencer.shepler@sun.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 61926 invoked from network); 21 Apr 2004 19:28:38 -0000
Received: from unknown (66.218.66.167)
by m3.grp.scd.yahoo.com with QMQP; 21 Apr 2004 19:28:38 -0000
Received: from unknown (HELO brmea-mail-3.sun.com) (192.18.98.34)
by mta6.grp.scd.yahoo.com with SMTP; 21 Apr 2004 19:28:38 -0000
Received: from engmail2sun.Eng.Sun.COM ([129.144.134.19])
by brmea-mail-3.sun.com (8.12.10/8.12.9) with ESMTP id i3LJSH4U012158
for <pnfs-reqs@yahoogroups.com>; Wed, 21 Apr 2004 13:28:17 -0600 (MDT)
Received: from [129.153.128.28] (this.Central.Sun.COM [129.153.128.28])
by engmail2sun.Eng.Sun.COM (8.12.10+Sun/8.12.10/ENSMAIL,v2.2) with ESMTP id i3LJSHgr015435
for <pnfs-reqs@yahoogroups.com>; Wed, 21 Apr 2004 12:28:17 -0700 (PDT)
Mime-Version: 1.0 (Apple Message framework v613)
In-Reply-To: <3B147EB5-93BD-11D8-912C-000A95A94F04@panasas.com>
References: <20040419201537.02CB0207F1@citi.umich.edu> <3B147EB5-93BD-11D8-912C-000A95A94F04@panasas.com>
Content-Type: text/plain; charset=US-ASCII; delsp=yes; format=flowed
Message-Id: <73D2D8F0-93CA-11D8-AF89-000A95DBCB70@sun.com>
Content-Transfer-Encoding: 7bit
Date: Wed, 21 Apr 2004 14:31:14 -0500
To: pnfs-reqs@yahoogroups.com
X-Mailer: Apple Mail (2.613)
X-eGroups-Remote-IP: 192.18.98.34
From: Spencer Shepler <spencer.shepler@sun.com>
Subject: Re: [pnfs-reqs] Fwd: logistics status
X-Yahoo-Group-Post: member; u=129247794
X-Yahoo-Profile: s_shepler

ADVERTISEMENT

I have misplaced the concall info, Garth.

On Apr 21, 2004, at 12:56 PM, Garth Gibson wrote:

> Lets decide this at the logistics concall tomorrow morning 11 am EST.
>
> We have a proposal of Tues June 8 for the pNFS face-to-face at
> CITI/UMich in Ann Arbor.
>
> Anyone wanting to join the concall tomorrow who does not have the
> dialin, please send me direct email.
>
> Thanks
> garth
>
> On Apr 19, 2004, at 1:15 PM, William A.(Andy) Adamson wrote:
>
>> monday is spent setting up the bakeathon. i think it would be best if
>> we had
>> the pNFS meeting on tuesday june 8th, with the interim IETF meeting on
>> wednesday.
>>
>> i propose tuesday.
>>
>> -->Andy
>>
>>
>>
>> garth@panasas.com said:
>>> I think Peter Corbett and I cannot be in Ann Arbor at all on Thursday
>>> and Friday.
>>
>>> What about others? Lets find a day where the largest number can
>>> attend in person.
>>
>>> Pending the outcome of this poll, Andy & Peter -- who does a Monday
>>> pNFS face-to-face meeting look at your end?
>>
>>> garth
>>
>>
>>
>>
>>
>>
>> Yahoo! Groups Links
>>
>>
>>
>>
>
>
>
> ------------------------ Yahoo! Groups Sponsor
> ---------------------~-->
> Buy Ink Cartridges or Refill Kits for your HP, Epson, Canon or Lexmark
> Printer at MyInks.com. Free s/h on orders $50 or more to the US &
> Canada.
> http://www.c1tracking.com/l.asp?cid=5511
> http://us.click.yahoo.com/mOAaAA/3exGAA/qnsNAA/W6uqlB/TM
> ---------------------------------------------------------------------
> ~->
>
>
> Yahoo! Groups Links
>
>
>
>
>

From andros@citi.umich.edu Mon May 03 10:58:35 2004
Return-Path: <andros@citi.umich.edu>
X-Sender: andros@citi.umich.edu
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 40734 invoked from network); 3 May 2004 17:58:34 -0000
Received: from unknown (66.218.66.172)
by m24.grp.scd.yahoo.com with QMQP; 3 May 2004 17:58:34 -0000
Received: from unknown (HELO citi.umich.edu) (141.211.133.111)
by mta4.grp.scd.yahoo.com with SMTP; 3 May 2004 17:58:34 -0000
Received: from citi.umich.edu (citi.umich.edu [141.211.133.111])
by citi.umich.edu (Postfix) with ESMTP
id 1ACC8207C4; Mon, 3 May 2004 13:58:25 -0400 (EDT)
X-Mailer: exmh version 2.5 07/13/2001 with version: MH 6.8.3 #74[UCI]
To: pnfs-reqs@yahoogroups.com
Cc: pnfs-ops@yahoogroups.com, nfsv4-wg@citi.umich.edu
Mime-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Date: Mon, 03 May 2004 13:58:25 -0400
Message-Id: <20040503175825.1ACC8207C4@citi.umich.edu>
X-eGroups-Remote-IP: 141.211.133.111
From: "William A.(Andy) Adamson" <andros@citi.umich.edu>
Subject: Announcement of the pNFS face-tp-face meeting at CITI, June 8th.
X-Yahoo-Group-Post: member; u=169434965

CITI University of Michigan is pleased to host the next pNFS face-to-faces
meeting Tuesday June 8th 9:00am - 5:00pm. This meeting is scheduled during the
10th NFSv4 Bakeathon also held at CITI June 7-11, and one day prior to the
Interim NFSv4 IETF working group meeting, held at the Universtiy of Michigan
June 9th.

The meeting will be held in a conference room in the Argus building, which
also houses CITI. A teleconferencing call line will be provided for phone
participation. An agenda will be forth coming.

See
http://www.citi.umich.edu/projects/nfsv4/citi_bakeathon/10th_nfsv4_bakeathon.ht
ml for hotel and travel information. Bakeathon regisitration is not necessary
for pNFS meeting participation.

-->Andy Adamson

From ggrider@lanl.gov Mon May 03 15:23:46 2004
Return-Path: <ggrider@lanl.gov>
X-Sender: ggrider@lanl.gov
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 44877 invoked from network); 3 May 2004 22:23:45 -0000
Received: from unknown (66.218.66.172)
by m10.grp.scd.yahoo.com with QMQP; 3 May 2004 22:23:45 -0000
Received: from unknown (HELO mailwasher-b.lanl.gov) (192.16.0.25)
by mta4.grp.scd.yahoo.com with SMTP; 3 May 2004 22:23:45 -0000
Received: from mailrelay3.lanl.gov (localhost.localdomain [127.0.0.1])
by mailwasher-b.lanl.gov (8.12.10/8.12.10/(ccn-5)) with ESMTP id i43MNiik000407
for <pnfs-reqs@yahoogroups.com>; Mon, 3 May 2004 16:23:44 -0600
Received: from cic-mail.lanl.gov (localhost.localdomain [127.0.0.1])
by mailrelay3.lanl.gov (8.12.10/8.12.10/(ccn-5)) with ESMTP id i43MNiQ8018776
for <pnfs-reqs@yahoogroups.com>; Mon, 3 May 2004 16:23:44 -0600
Received: from cthulu.lanl.gov (cthulu.lanl.gov [128.165.115.129])
by cic-mail.lanl.gov (8.12.10/8.12.10/(ccn-5)) with ESMTP id i43MNhia004641
for <pnfs-reqs@yahoogroups.com>; Mon, 3 May 2004 16:23:43 -0600
Message-Id: <5.2.0.9.2.20040503162329.0328f0f8@cic-mail.lanl.gov>
X-Sender: ggrider@cic-mail.lanl.gov
X-Mailer: QUALCOMM Windows Eudora Version 5.2.0.9
Date: Mon, 03 May 2004 16:23:43 -0600
To: pnfs-reqs@yahoogroups.com
In-Reply-To: <20040503175825.1ACC8207C4@citi.umich.edu>
Mime-Version: 1.0
Content-Type: multipart/related;
type="multipart/alternative";
boundary="=====================_29822873==.REL"
X-Scanned-By: MIMEDefang 2.35
X-eGroups-Remote-IP: 192.16.0.25
From: Gary Grider <ggrider@lanl.gov>
Subject: Re: [pnfs-reqs] Announcement of the pNFS face-tp-face meeting
at CITI, June 8th.
X-Yahoo-Group-Post: member; u=169341185
X-Yahoo-Profile: ggriderpnfs

Will there be dial in?

Thanks
Gary

At 01:58 PM 5/3/2004 -0400, you wrote:

> CITI University of Michigan is pleased to host the next pNFS face-to-faces
> meeting Tuesday June 8th 9:00am - 5:00pm. This meeting is scheduled during the
> 10th NFSv4 Bakeathon also held at CITI June 7-11, and one day prior to the
> Interim NFSv4 IETF working group meeting, held at the Universtiy of Michigan
> June 9th.
>
> The meeting will be held in a conference room in the Argus building, which
> also houses CITI. A teleconferencing call line will be provided for phone
> participation. An agenda will be forth coming.
>
> See
> http://www.citi.umich.edu/projects/nfsv4/citi_bakeathon/10th_nfsv4_bakeathon.ht
> ml for hotel and travel information. Bakeathon regisitration is not necessary
> for pNFS meeting participation.
>
> -->Andy Adamson
>
>
> Yahoo! Groups Sponsor
> ADVERTISEMENT
> 1c707da.jpg 
> 1c70834.jpg
>
> Yahoo! Groups Links
>
>     * To visit your group on the web, go to:
>     * http://groups.yahoo.com/group/pnfs-reqs/
>     *  
>     * To unsubscribe from this group, send an email to:
>     * pnfs-reqs-unsubscribe@yahoogroups.com
>     *  
>     * Your use of Yahoo! Groups is subject to the Yahoo! Terms of Service. 



From andros@citi.umich.edu Mon May 03 15:50:39 2004
Return-Path: <andros@citi.umich.edu>
X-Sender: andros@citi.umich.edu
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 58410 invoked from network); 3 May 2004 22:50:38 -0000
Received: from unknown (66.218.66.172)
by m19.grp.scd.yahoo.com with QMQP; 3 May 2004 22:50:38 -0000
Received: from unknown (HELO citi.umich.edu) (141.211.133.111)
by mta4.grp.scd.yahoo.com with SMTP; 3 May 2004 22:50:37 -0000
Received: from citi.umich.edu (citi.umich.edu [141.211.133.111])
by citi.umich.edu (Postfix) with ESMTP
id 50184207F8; Mon, 3 May 2004 18:50:37 -0400 (EDT)
X-Mailer: exmh version 2.5 07/13/2001 with version: MH 6.8.3 #74[UCI]
To: pnfs-reqs@yahoogroups.com
Cc: andros@citi.umich.edu
In-reply-to: Your message of "Mon, 03 May 2004 16:23:43 MDT."
<5.2.0.9.2.20040503162329.0328f0f8@cic-mail.lanl.gov>
Mime-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Date: Mon, 03 May 2004 18:50:37 -0400
Message-Id: <20040503225037.50184207F8@citi.umich.edu>
X-eGroups-Remote-IP: 141.211.133.111
From: "William A.(Andy) Adamson" <andros@citi.umich.edu>
Subject: Re: [pnfs-reqs] Announcement of the pNFS face-tp-face meeting at
CITI, June 8th.
X-Yahoo-Group-Post: member; u=169434965

ADVERTISEMENT
click here
hi gary

yes, there will be dial-in.

-->Andy
>
> Will there be dial in?
>
> Thanks
> Gary
>
> At 01:58 PM 5/3/2004 -0400, you wrote:
> >CITI University of Michigan is pleased to host the next pNFS face-to-faces
> >meeting Tuesday June 8th 9:00am - 5:00pm. This meeting is scheduled during
> >the
> >10th NFSv4 Bakeathon also held at CITI June 7-11, and one day prior to the
> >Interim NFSv4 IETF working group meeting, held at the Universtiy of Michigan
> >June 9th.
> >
> >The meeting will be held in a conference room in the Argus building, which
> >also houses CITI. A teleconferencing call line will be provided for phone
> >participation. An agenda will be forth coming.
> >
> >See
> ><http://www.citi.umich.edu/projects/nfsv4/citi_bakeathon/10th_nfsv4_bakeathon.ht>http://www.citi.umich.edu/projects/nfsv4/citi_bakeathon/10th_nfsv4_bakeathon.ht
> >ml for hotel and travel information. Bakeathon regisitration is not necessary
> >for pNFS meeting participation.
> >
> >-->Andy Adamson
> >
> >
> >Yahoo! Groups Sponsor
> >ADVERTISEMENT
> ><http://rd.yahoo.com/SIG=129rtnm0m/M=295196.4901138.6050264.3001176/D=groups/S=1705701014:HM/EXP=1083693516/A=1874365/R=2/id=noscript/SIG=118tuuldn/*http://companion.yahoo.com/?.cpdl=srch>1c707da.jpg
> >
> >1c70834.jpg
> >
> >
> >----------
> >Yahoo! Groups Links
> > * To visit your group on the web, go to:
> > *
> > <http://groups.yahoo.com/group/pnfs-reqs/>http://groups.yahoo.com/group/pnfs-reqs/
> >
> > *
> > * To unsubscribe from this group, send an email to:
> > *
> > <mailto:pnfs-reqs-unsubscribe@yahoogroups.com?subject=Unsubscribe>pnfs-reqs-unsubscribe@yahoogroups.com
> >
> > *
> > * Your use of Yahoo! Groups is subject to the
> > <http://docs.yahoo.com/info/terms/>Yahoo! Terms of Service.
>
> --=====================_29822883==.ALT
> Content-Type: text/html; charset=US-ASCII
> Content-Transfer-Encoding: 7bit
>
> <html>
> <body>
>
>
> Will there be dial in?<br><br>
> Thanks<br>
> Gary<br><br>
> At 01:58 PM 5/3/2004 -0400, you wrote:<br>
> <blockquote type=cite class=cite cite><tt>CITI University of Michigan is
> pleased to host the next pNFS face-to-faces <br>
> meeting Tuesday June 8th 9:00am - 5:00pm. This meeting is scheduled
> during the <br>
> 10th NFSv4 Bakeathon also held at CITI June 7-11, and one day prior to
> the <br>
> Interim NFSv4 IETF working group meeting, held at the Universtiy of
> Michigan <br>
> June 9th.<br><br>
> The meeting will be held in a conference room in the Argus building,
> which <br>
> also houses CITI. A teleconferencing call line will be provided for phone
> <br>
> participation. An agenda will be forth coming.<br><br>
> See<br>
> <a href="http://www.citi.umich.edu/projects/nfsv4/citi_bakeathon/10th_nfsv4_bakeathon.ht">http://www.citi.umich.edu/projects/nfsv4/citi_bakeathon/10th_nfsv4_bakeathon.ht</a><br>
> ml for hotel and travel information. Bakeathon regisitration is not
> necessary <br>
> for pNFS meeting participation.<br><br>
> -->Andy Adamson<br><br>
> </tt><br>
> <font size=2 color="#003399"><b>Yahoo! Groups Sponsor</b></font> <br>
> <font face="arial" size=1>ADVERTISEMENT</font><br>
> <a href="http://rd.yahoo.com/SIG=129rtnm0m/M=295196.4901138.6050264.3001176/D=groups/S=1705701014:HM/EXP=1083693516/A=1874365/R=2/id=noscript/SIG=118tuuldn/*http://companion.yahoo.com/?.cpdl=srch"><img src="cid:5.2.0.9.2.20040503162329.0328f0f8@cic-mail.lanl.gov.0" width=300 height=250 alt="1c707da.jpg"></a>�<br>
> <img src="cid:5.2.0.9.2.20040503162329.0328f0f8@cic-mail.lanl.gov.1" width=1 height=1 alt="1c70834.jpg"><br><br>
> <hr>
> <tt>Yahoo! Groups Links
> <ul>
> <li>To visit your group on the web, go to:
> <li><a href="http://groups.yahoo.com/group/pnfs-reqs/">http://groups.yahoo.com/group/pnfs-reqs/</a>
> <li>�
> <li>To unsubscribe from this group, send an email to:
> <li><a href="mailto:pnfs-reqs-unsubscribe@yahoogroups.com?subject=Unsubscribe">pnfs-reqs-unsubscribe@yahoogroups.com</a>
> <li>�
> <li>Your use of Yahoo! Groups is subject to the
> <a href="http://docs.yahoo.com/info/terms/">Yahoo! Terms of Service</a>.
> </ul></tt></blockquote><tt>
>
> <br>
>
> <!-- |**|begin egp html banner|**| -->
>
> <table border=0 cellspacing=0 cellpadding=2>
> <tr bgcolor=#FFFFCC>
> <td align=center><font size="-1" color=#003399><b>Yahoo! Groups Sponsor</b></font></td>
> </tr>
> <tr bgcolor=#FFFFFF>
> <td align=center width=470><table border=0 cellpadding=0 cellspacing=0> <tr> <td align=center><font face=arial size=-2>ADVERTISEMENT</font><br><a href="http://rd.yahoo.com/SIG=129ar4a7j/M=295196.4901138.6052515.3001176/D=groups/S=1705701014:HM/EXP=1083709427/A=2128215/R=0/SIG=10se96mf6/*http://companion.yahoo.com" alt=""><img src="http://us.a1.yimg.com/us.yimg.com/a/ya/yahoo_companion/lrec_companion_043004.gif" alt="click here" width="300" height="250" border="0"></a></td></tr></table> </td>
> </tr>
> <tr><td><img alt="" width=1 height=1 src="http://us.adserver.yahoo.com/l?M=295196.4901138.6052515.3001176/D=groups/S=:HM/A=2128215/rand=306103613"></td></tr>
> </table>
>
> <!-- |**|end egp html banner|**| -->
>
>
>
> <!-- |**|begin egp html banner|**| -->
>
> <br>
> <tt><hr width="500">
> <b>Yahoo! Groups Links</b><br>
> <ul>
> <li>To visit your group on the web, go to:<br><a href="http://groups.yahoo.com/group/pnfs-reqs/">http://groups.yahoo.com/group/pnfs-reqs/</a><br>�
> <li>To unsubscribe from this group, send an email to:<br><a href="mailto:pnfs-reqs-unsubscribe@yahoogroups.com?subject=Unsubscribe">pnfs-reqs-unsubscribe@yahoogroups.com</a><br>�
> <li>Your use of Yahoo! Groups is subject to the <a href="http://docs.yahoo.com/info/terms/">Yahoo! Terms of Service</a>.
> </ul>
> </tt>
> 
>
> <!-- |**|end egp html banner|**| -->
>
>
> </tt></body>
> </html>
>
> --=====================_29822883==.ALT--
>
> --=====================_29822873==.REL
> Content-Type: image/jpeg; name="1c707da.jpg";
> x-mac-type="4A504547"; x-mac-creator="4A565752"
> Content-ID: <5.2.0.9.2.20040503162329.0328f0f8@cic-mail.lanl.gov.0>
> Content-Transfer-Encoding: base64
> Content-Disposition: inline; filename="1c707da.jpg"
>
> /9j/4AAQSkZJRgABAQAAAQABAAD/2wBDAAEBAQEBAQEBAQEBAQEBAQEBAQEBAQEBAQEBAQEBAQEB
> AQEBAQEBAQEBAQEBAQEBAQEBAQEBAQEBAQEBAQEBAQH/2wBDAQEBAQEBAQEBAQEBAQEBAQEBAQEB
> AQEBAQEBAQEBAQEBAQEBAQEBAQEBAQEBAQEBAQEBAQEBAQEBAQEBAQEBAQH/wAARCAD6ASwDASIA
> AhEBAxEB/8QAHwAAAQUBAQEBAQEAAAAAAAAAAAECAwQFBgcICQoL/8QAtRAAAgEDAwIEAwUFBAQA
> AAF9AQIDAAQRBRIhMUEGE1FhByJxFDKBkaEII0KxwRVS0fAkM2JyggkKFhcYGRolJicoKSo0NTY3
> ODk6Q0RFRkdISUpTVFVWV1hZWmNkZWZnaGlqc3R1dnd4eXqDhIWGh4iJipKTlJWWl5iZmqKjpKWm
> p6ipqrKztLW2t7i5usLDxMXGx8jJytLT1NXW19jZ2uHi4+Tl5ufo6erx8vP09fb3+Pn6/8QAHwEA
> AwEBAQEBAQEBAQAAAAAAAAECAwQFBgcICQoL/8QAtREAAgECBAQDBAcFBAQAAQJ3AAECAxEEBSEx
> BhJBUQdhcRMiMoEIFEKRobHBCSMzUvAVYnLRChYkNOEl8RcYGRomJygpKjU2Nzg5OkNERUZHSElK
> U1RVVldYWVpjZGVmZ2hpanN0dXZ3eHl6goOEhYaHiImKkpOUlZaXmJmaoqOkpaanqKmqsrO0tba3
> uLm6wsPExcbHyMnK0tPU1dbX2Nna4uPk5ebn6Onq8vP09fb3+Pn6/9oADAMBAAIRAxEAPwD/AD/6
> KKKACiiigAooooAKKKKACiiigAooooAKKKKACiiigAooooAKKKKACiiigAooooAKKKKACiiigAoo
> ooAKKKKACiiigAooooAKKKKACiiigAooooAKKKKACiiigAooooAKKKKACiiigAooooAKKKKACiii
> gAooooAKKKKACiiigAooooAKKKKACiiigAooooAKKKKACiiigAooooAKKKKACiiigAooooAKKKKA
> CiiigAooooAKKKKACiiigAooooAKKKKACiiigAooooAKKKKACiiigAooooAKKKKACiiigAooooAK
> KKKACiiigAooooAKKKKACiiigAooooAKKKKACiiigAooooAKKKKACiiigAooooAKKKKACiiigAoo
> ooAKKKKACiiigAooooAKKKKACiiigAooooAKKKKACiiigAooooAKKKKACiiigAooooAKKKKACiii
> gAooooAKKKKACiiigAooooAKKKKACiiigAooooAKKKKACiiigAooooAKKKKACiiigAooooAKKKKA
> CiiigAooooAKKKKACiiigAooooAKKKKACiiigAooooAKKKKACiiigAooooAKKKKACiiigAooooAK
> KKKACiiigAooooAKKKKACiiigAooooAKKKKACiiigAooooAKKKKACiiigAooooAKKKKACiiigAoo
> ooAKKKKACiiigAooooAKKKKACiiigAooooAKKKKACiiigAooooAKKKKACiiigAooooAKKKKACiii
> gAooooAKKKKACiiigAooooAKKKKACiiigAooooAKKKKACiiigAooooAKKKKACiiigAooooAKKKKA
> CiiigAooooAKKKKACiiigAooooAKKKKACiiigAooooAKKKKACiiigAooooAKKKKACiiigAooooAK
> KKKACiiigAooooAKKKKACiiigAooooAKKKKACiiigAooooAKKKKACiiigAooooAKKKKACiiigAoo
> ooAKKKKACiiigAooooAKKKKACiiigAooooAKKKKACiiigAooooAKKKKACiiigAooooAKKKKACiii
> gAooooAKKKKACiiigAooooAKKKKACiiigAooooAKKKKACiiigAooooAKKKKACiiigAooooAKKKKA
> CiiigAooooAKKKKACiiigAooooAKKKKACiiigAooooAKKKKACiiigAooooAKKKKACiiigAooooAK
> KKKACiiigAooooAKKKKACiiigAooooAKKKKACiiigAooooAKKKKACiiigAooooAKKKKACiiigAoo
> ooAKKKKACiiigAooooAKKKKACiiigAooooAKKKKACiiigAooooAKKKKACiiigAooooAKKKKACiii
> gAooooAKKKKACiiigAooooAKKKKACiiigAooooAKKKKACiiigAooooAKKKKACiiigAooooAKKKKA
> CiiigAooooAKKKKACiiigAooooAKKKKACiiigAooooAKKKKACiiigAooooAKKKKACiiigAooooAK
> KKKACiiigAooooAKKKKACiiigAooooAKKKKACiiigAooooAKKKKACiiigAooooAKKKKACiiigAoo
> ooAKKKKACiiigAooooAKKKKAP//Z
> --=====================_29822873==.REL
> Content-Type: image/jpeg; name="1c70834.jpg";
> x-mac-type="4A504547"; x-mac-creator="4A565752"
> Content-ID: <5.2.0.9.2.20040503162329.0328f0f8@cic-mail.lanl.gov.1>
> Content-Transfer-Encoding: base64
> Content-Disposition: inline; filename="1c70834.jpg"
>
> /9j/4AAQSkZJRgABAQAAAQABAAD/2wBDAAEBAQEBAQEBAQEBAQEBAQEBAQEBAQEBAQEBAQEBAQEB
> AQEBAQEBAQEBAQEBAQEBAQEBAQEBAQEBAQEBAQEBAQH/2wBDAQEBAQEBAQEBAQEBAQEBAQEBAQEB
> AQEBAQEBAQEBAQEBAQEBAQEBAQEBAQEBAQEBAQEBAQEBAQEBAQEBAQEBAQH/wAARCAABAAEDASIA
> AhEBAxEB/8QAHwAAAQUBAQEBAQEAAAAAAAAAAAECAwQFBgcICQoL/8QAtRAAAgEDAwIEAwUFBAQA
> AAF9AQIDAAQRBRIhMUEGE1FhByJxFDKBkaEII0KxwRVS0fAkM2JyggkKFhcYGRolJicoKSo0NTY3
> ODk6Q0RFRkdISUpTVFVWV1hZWmNkZWZnaGlqc3R1dnd4eXqDhIWGh4iJipKTlJWWl5iZmqKjpKWm
> p6ipqrKztLW2t7i5usLDxMXGx8jJytLT1NXW19jZ2uHi4+Tl5ufo6erx8vP09fb3+Pn6/8QAHwEA
> AwEBAQEBAQEBAQAAAAAAAAECAwQFBgcICQoL/8QAtREAAgECBAQDBAcFBAQAAQJ3AAECAxEEBSEx
> BhJBUQdhcRMiMoEIFEKRobHBCSMzUvAVYnLRChYkNOEl8RcYGRomJygpKjU2Nzg5OkNERUZHSElK
> U1RVVldYWVpjZGVmZ2hpanN0dXZ3eHl6goOEhYaHiImKkpOUlZaXmJmaoqOkpaanqKmqsrO0tba3
> uLm6wsPExcbHyMnK0tPU1dbX2Nna4uPk5ebn6Onq8vP09fb3+Pn6/9oADAMBAAIRAxEAPwD/AD/6
> KKKAP//Z
> --=====================_29822873==.REL--
>
> 

From garth@panasas.com Thu May 13 11:02:15 2004
Return-Path: <garth@panasas.com>
X-Sender: garth@panasas.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 32521 invoked from network); 13 May 2004 18:02:13 -0000
Received: from unknown (66.218.66.166)
by m19.grp.scd.yahoo.com with QMQP; 13 May 2004 18:02:13 -0000
Received: from unknown (HELO PIKES.panasas.com) (65.194.124.178)
by mta5.grp.scd.yahoo.com with SMTP; 13 May 2004 18:02:13 -0000
Received: from [127.0.0.1] ([172.17.19.3]) by PIKES.panasas.com with SMTP (Microsoft Exchange Internet Mail Service Version 5.5.2653.13)
id 2B563QFQ; Thu, 13 May 2004 14:02:10 -0400
Mime-Version: 1.0 (Apple Message framework v613)
In-Reply-To: <20040503175825.1ACC8207C4@citi.umich.edu>
References: <20040503175825.1ACC8207C4@citi.umich.edu>
Content-Type: text/plain; charset=US-ASCII; delsp=yes; format=flowed
Message-Id: <A6D1B004-A507-11D8-9E89-000A95A94F04@panasas.com>
Content-Transfer-Encoding: 7bit
Date: Thu, 13 May 2004 14:02:08 -0400
To: pnfs-reqs@yahoogroups.com
X-Mailer: Apple Mail (2.613)
X-eGroups-Remote-IP: 65.194.124.178
From: Garth Gibson <garth@panasas.com>
Subject: Re: [pnfs-reqs] Announcement of the pNFS face-tp-face meeting at CITI, June 8th.
X-Yahoo-Group-Post: member; u=169457820
X-Yahoo-Profile: garth_a_gibson

Consider this a call for agenda items for the June 8 meeting.

In the absence of contributions, I'd suggest that we spend the morning
on NFSv4 extension operations, start the afternoon on the requirements
document and return to operations if time allows.

garth


On May 3, 2004, at 1:58 PM, William A.(Andy) Adamson wrote:
> CITI University of Michigan is pleased to host the next pNFS
> face-to-faces
> meeting Tuesday June 8th 9:00am - 5:00pm. This meeting is scheduled
> during the
> 10th NFSv4 Bakeathon also held at CITI June 7-11, and one day prior to
> the
> Interim NFSv4 IETF working group meeting, held at the Universtiy of
> Michigan
> June 9th.
>
> The meeting will be held in a conference room in the Argus building,
> which
> also houses CITI. A teleconferencing call line will be provided for
> phone
> participation. An agenda will be forth coming.
>
> See
> http://www.citi.umich.edu/projects/nfsv4/citi_bakeathon/
> 10th_nfsv4_bakeathon.ht
> ml for hotel and travel information. Bakeathon regisitration is not
> necessary
> for pNFS meeting participation.
>
> -->Andy Adamson

From andros@citi.umich.edu Thu May 13 12:07:24 2004
Return-Path: <andros@citi.umich.edu>
X-Sender: andros@citi.umich.edu
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 90315 invoked from network); 13 May 2004 19:07:23 -0000
Received: from unknown (66.218.66.167)
by m22.grp.scd.yahoo.com with QMQP; 13 May 2004 19:07:23 -0000
Received: from unknown (HELO citi.umich.edu) (141.211.133.111)
by mta6.grp.scd.yahoo.com with SMTP; 13 May 2004 19:07:23 -0000
Received: from citi.umich.edu (citi.umich.edu [141.211.133.111])
by citi.umich.edu (Postfix) with ESMTP
id A4562207E3; Thu, 13 May 2004 15:07:04 -0400 (EDT)
X-Mailer: exmh version 2.5 07/13/2001 with version: MH 6.8.3 #74[UCI]
To: pnfs-reqs@yahoogroups.com
Cc: andros@citi.umich.edu
In-reply-to: Your message of "Thu, 13 May 2004 14:02:08 EDT."
<A6D1B004-A507-11D8-9E89-000A95A94F04@panasas.com>
Mime-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Date: Thu, 13 May 2004 15:07:04 -0400
Message-Id: <20040513190704.A4562207E3@citi.umich.edu>
X-eGroups-Remote-IP: 141.211.133.111
From: "William A.(Andy) Adamson" <andros@citi.umich.edu>
Subject: Re: [pnfs-reqs] Announcement of the pNFS face-tp-face meeting at
CITI, June 8th.
X-Yahoo-Group-Post: member; u=169434965

ok. i'm proposal writing until friday, and will be able to turn my attention
towards the face-to-face meeting next week...

-->Andy

> Consider this a call for agenda items for the June 8 meeting.
>
> In the absence of contributions, I'd suggest that we spend the morning
> on NFSv4 extension operations, start the afternoon on the requirements
> document and return to operations if time allows.
>
> garth
>
>
> On May 3, 2004, at 1:58 PM, William A.(Andy) Adamson wrote:
> > CITI University of Michigan is pleased to host the next pNFS
> > face-to-faces
> > meeting Tuesday June 8th 9:00am - 5:00pm. This meeting is scheduled
> > during the
> > 10th NFSv4 Bakeathon also held at CITI June 7-11, and one day prior to
> > the
> > Interim NFSv4 IETF working group meeting, held at the Universtiy of
> > Michigan
> > June 9th.
> >
> > The meeting will be held in a conference room in the Argus building,
> > which
> > also houses CITI. A teleconferencing call line will be provided for
> > phone
> > participation. An agenda will be forth coming.
> >
> > See
> > http://www.citi.umich.edu/projects/nfsv4/citi_bakeathon/
> > 10th_nfsv4_bakeathon.ht
> > ml for hotel and travel information. Bakeathon regisitration is not
> > necessary
> > for pNFS meeting participation.
> >
> > -->Andy Adamson
>
>
>
>
>
> Yahoo! Groups Links
>
>
>
>
>
> 

From andros@citi.umich.edu Thu Jun 03 10:09:42 2004
Return-Path: <andros@citi.umich.edu>
X-Sender: andros@citi.umich.edu
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 24201 invoked from network); 3 Jun 2004 17:09:41 -0000
Received: from unknown (66.218.66.167)
by m22.grp.scd.yahoo.com with QMQP; 3 Jun 2004 17:09:41 -0000
Received: from unknown (HELO citi.umich.edu) (141.211.133.111)
by mta6.grp.scd.yahoo.com with SMTP; 3 Jun 2004 17:09:41 -0000
Received: from citi.umich.edu (citi.umich.edu [141.211.133.111])
by citi.umich.edu (Postfix) with ESMTP
id 3395B207F5; Thu, 3 Jun 2004 13:07:33 -0400 (EDT)
X-Mailer: exmh version 2.5 07/13/2001 with version: MH 6.8.3 #74[UCI]
To: pnfs-reqs@yahoogroups.com
Cc: pnfs-ops@yahoogroups.com, nfsv4-wg@citi.umich.edu
Mime-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Date: Thu, 03 Jun 2004 13:07:33 -0400
Message-Id: <20040603170733.3395B207F5@citi.umich.edu>
X-eGroups-Remote-IP: 141.211.133.111
From: "William A.(Andy) Adamson" <andros@citi.umich.edu>
Subject: Announcement and Agenda for the pNFS face-tp-face meeting at CITI,
June 8th.
X-Yahoo-Group-Post: member; u=169434965

CITI is pleased to host the next pNFS face-to-face meeting Tuesday June 8th
9:00am - 5:00pm. This meeting is scheduled during the 10th NFSv4 Bakeathon
also held at CITI June 7-11, and one day prior to the Interim NFSv4 IETF
working group meeting, held at the Universtiy of Michigan June 9th.

See http://www.citi.umich.edu/projects/nfsv4/citi_bakeathon/10th_nfsv4_bakeatho
n.ht
ml for hotel and travel information. Bakeathon regisitration is not necessary
for pNFS meeting participation. pNFS protocol design meeting at CITI, Tuesday
June 8.

Location
--------
Conference room, 2202 Argus Bldg, one floor below CITI

see http://www.citi.umich.edu/location.html for directions

Agenda
------
9:00 - 10:30 use cases describe with high level pseudo operations
10:30 - 11:00 break
11:00 - 12:30 operation design part 1
12:30 - 2:00 lunch
2:00 - 3:30 operation design part 2
3:30 - 5:00 create protocol requirements document

Dial-in
-------

734-615-5502 no id# required

we have a polycom conference phone with 2 remotes. note that this is a design
meeting so there will be a bunch of scribbling on the white boards...


-->Andy

From bwelch@panasas.com Thu Jun 03 21:29:13 2004
Return-Path: <bwelch@panasas.com>
X-Sender: bwelch@panasas.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 43292 invoked from network); 4 Jun 2004 04:29:12 -0000
Received: from unknown (66.218.66.218)
by m23.grp.scd.yahoo.com with QMQP; 4 Jun 2004 04:29:12 -0000
Received: from unknown (HELO n27.grp.scd.yahoo.com) (66.218.66.83)
by mta3.grp.scd.yahoo.com with SMTP; 4 Jun 2004 04:29:12 -0000
Received: from [66.218.67.183] by n27.grp.scd.yahoo.com with NNFMP; 04 Jun 2004 04:29:11 -0000
Date: Fri, 04 Jun 2004 04:29:11 -0000
To: pnfs-reqs@yahoogroups.com
Message-ID: <c9otqn+vo45@eGroups.com>
User-Agent: eGroups-EW/0.82
MIME-Version: 1.0
Content-Type: text/plain; charset=ISO-8859-1
Content-Length: 168
X-Mailer: Yahoo Groups Message Poster
X-eGroups-Remote-IP: 66.218.66.83
From: "Brent B. Welch" <bwelch@panasas.com>
X-Originating-IP: 63.80.58.202
Subject: draft ops document in prep for June 8 meeting
X-Yahoo-Group-Post: member; u=169551413
X-Yahoo-Profile: brent_welch_1960

ADVERTISEMENT
I have uploaded an ops document to the yahoo site so you can
review it before the June 8 meeting. If you cannot get to it,
drop me an email and I can send you a copy.

From dnoveck@netapp.com Fri Jun 04 13:21:44 2004
Return-Path: <Dave.Noveck@netapp.com>
X-Sender: Dave.Noveck@netapp.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 53896 invoked from network); 4 Jun 2004 20:21:43 -0000
Received: from unknown (66.218.66.172)
by m22.grp.scd.yahoo.com with QMQP; 4 Jun 2004 20:21:43 -0000
Received: from unknown (HELO mx01.netapp.com) (198.95.226.53)
by mta4.grp.scd.yahoo.com with SMTP; 4 Jun 2004 20:21:43 -0000
Received: from frejya.corp.netapp.com (frejya [10.57.157.119])
by mx01.netapp.com (8.12.10/8.12.10/NTAP-1.4) with ESMTP id i54KKoMj029634
for <pnfs-reqs@yahoogroups.com>; Fri, 4 Jun 2004 13:20:50 -0700 (PDT)
Received: from svlexc01.hq.netapp.com (svlexc01.corp.netapp.com [10.57.156.135])
by frejya.corp.netapp.com (8.12.9/8.12.9/NTAP-1.5) with ESMTP id i54KKoIC012108
for <pnfs-reqs@yahoogroups.com>; Fri, 4 Jun 2004 13:20:50 -0700 (PDT)
content-class: urn:content-classes:message
MIME-Version: 1.0
Content-Type: text/plain;
charset="iso-8859-1"
Content-Transfer-Encoding: quoted-printable
X-MimeOLE: Produced By Microsoft Exchange V6.0.6249.0
Date: Fri, 4 Jun 2004 13:20:45 -0700
Message-ID: <C8CF60CFC4D8A74E9945E32CF096548AB80E4C@silver.nane.netapp.com>
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
Thread-Topic: [pnfs-reqs] draft ops document in prep for June 8 meeting
Thread-Index: AcRJ7KK86F6stPm+Q/uYgJfAUU8D7wAhDoRg
To: <pnfs-reqs@yahoogroups.com>
X-eGroups-Remote-IP: 198.95.226.53
X-eGroups-From: "Noveck, Dave" <Dave.Noveck@netapp.com>
From: "Noveck, Dave" <dnoveck@netapp.com>
Subject: RE: [pnfs-reqs] draft ops document in prep for June 8 meeting
X-Yahoo-Group-Post: member; u=44152831
X-Yahoo-Profile: davidnoveck

Thanks. Below are a miscelleneous brag bag of my comments on going
though your document.

The key feature of the protocol extension is the ability for clients
> to perform read and write operations that go directly from the client
> to the storage system without funneling through the file server. Of
> course, the file server must coordinate the client I/O so that the
> file system retains its integrity.


suggested rewording:

The key feature of the protocol extension is the ability for clients
to perform read and write operations that go directly from the client
to individual storage system elements without funneling all such access
through a single file server. Of course, the file server must coordinate
the client I/O so that the file system retains its integrity.

> The delegation abstraction is extended so that clients can obtain and
> manage file layouts allowing direct I/O and servers can recall these
> layouts as needed.

suggested rewording:

The delegation abstraction is extended so that clients can obtain and
manage file layouts allowing parallel I/O and servers can recall these
layouts as needed.

> The pNFS extension to NFSv4 takes the form of new operations that
> return data location information. The layout information is
> protected by layout delegations.

It refers here to both "data location information" and "layout
information". Are these intended to be the same?

One worthwhile distinction that might be worth making is between
that which tells you where to go go access certain locations within
the file (i.e. what IP address the server is) and what addresses
the data has withn that server (e.g. the file handle or object ID
or block addresses).

> Metadata Information about a file, like its name, owner, where it
> stored, and so forth. This information is managed by the
> File server (sometimes called the metadata manager). In
> some storage protocols, metadata such as block pointers
> and indirect blocks may be hidden below the storage
> protocol and not managed directly by the metadata
> manager.

Not clear what you are intending to convey by "not managed
directly by the metadata manager"? There is this block information
which the clients gets from the server (the layout) and he
uses it to read and write the file. How is it "not managed
directly by the metadata manager"?

> Data Server (Also, "Storage device") this is a server that just
> controls the file's data, but leaves other metadata
> management up to the file server (or metadata manager).

This too make things more obscure. The data server is
used to read and write file data as approriate and allowed by the
layout. End of story. For example a SAN device doesn't leave
storage of ACL's up to the file server, it simply has no concept
of ACL's at all.


> Storage Protocol This is the protocol between the client and the
> data server used to access the file data. There are
> three primary types: file protocols (like NFSv4 or
> NFSv3),

like --> such as

> object protocols (OSD), and block protocols
> (SCSI-block commands, or "SB"). These protocols are in
> turn layered over transport protocols such as RPC/TCP/IP
> or iSCSI/TCP/IP or FC/SCSI.

> Aggregation schemes can describe layouts like simple one-to-one
> mapping, concatenation, mirroring, and other RAID arrangements. A
> general aggregation scheme allows nested maps so that more complex
> layouts can be compactly described.

I can see mirroring in the readonly case. Mirroring for write and
raid seem to me like biting off too much. I guess it is OK to say
that we may define aggregation schemes for experimental use, but
if it is proposed that we try to make a protocol which deals with
multiple mirrored writes being done then there is a ton of work
to figure out what you do when because of failures or bad clients
the same thing is not written to each mirror. If we have to have
solve those problems to get a pNFS extension, it isn't going to
happen for wuite a long time.

> The metadata server is in control of the layout for a file, but the
> client can provide hints to the server when a file is opened or
> created about preferred layout parameters. The pNFS extension
> introduces a LAYOUT attribute that the client can query at anytime,
> and can set with a compound SETATTR after OPEN or CREATE to provide a
> hint to the server for new files.

The layout is going to contain information that won't be part of a
reasonable hint (e.g. file handles, block addressess). I think there
will probably be separate LAYOUT_HINT attribute that you can specify
when creating the file.

> Each storage device has a type. There are major classifications of
> storage types, but we anticipate local variations. There is a tight
> coupling between the device type and the storage protocol type, but
> not necessarily a one-to-one mapping. You might have a file-based
> storage protocol, but use NFSv4 and NFSv3 storage devices.

I don't understand. This just seems confusing. You really have a
disinction between protocol class, e.g. file protocols, and protocols
themselves, e.g. NFSv4.0, nfsv4.1, nfsv3. I don't see why we are
making this into a distinction between protocols and storage devices.
It is very common and reasonable for a storage device (i.e. file
server) which supports nfsv4 to also support nfsv3 and nfsv2. Why
would you have separate storage devices for each protocol?

> pnfs_stortype LAYOUT
>
> The storage sub-system type of the object's filesystem. This
> determines the storage protocol used to access the file. It is
> assumed the client contains a module that supports this storage
> protocol. This module has to interpret the map information returned
> by LAYOUTGET and issue I/O commands to the storage devices.

May be an incorrect assumption. I think there needs to be some way
for the client to indicate what forms of LAYOUT he is pepared to
deal with.

> enum layoutget_type4 {
> LAYOUTGET_READ = 1,
> LAYOUTGET_WRITE = 2
> };

Not clear about this. What if I'm both reading and writing a file,
as many people do?

> struct LAYOUTGET4args {
> /* CURRENT_FH: file */
> layoutget_type4 type;

Seems to me there should be some way of asking for a particular
form of layout that the client knows it can handle (e.g. file,
object, block).

> stateid4 open_stateid;

Unclear what the function of this is. You say "Stateid
represents the current state of an open file" but how is this
related to layout you get. What happens to the layout if the
file is closed? May the layout be used to access the data on
behalf of other open files and users (which seems quite reasonable
in many cases of file access).

> offset4 offset;
> length4 length;
> bool notify;
> };

> IMPLEMENTATION
>
> Typically, LAYOUTGET will be called as part of a compound RPC after
> an OPEN operation and results in the client having location
> information for the file.

In a COMPOUND, how do you propagate the open_stateid from the
OPEN to the LAYOUTGET?

> If the OPEN does not return a delegation
> for the file then the client should not use the layout given to it
> for direct I/O but rather call the legacy READ and WRITE server
> operations.

Why not? I undertstand there may be specific sub-case that need
this restriction (the partial-blocks issue for SAN) but there many
cases where this would be a gratuitous hobbling of pnfs.

If I have multiple clients that can open the file for write and
thus I cannot get a delegation, then presumably I have set up
my application to adapt that and thus the "legacy READ and WRITE
server operations. If the server gives me layout information so
that I can spread my IO out, all the better.

> Yet, it can hold on to the returned layout as long as it
> is not recalled so that if a delegation is obtained later with
> DELEGASK the client can then use the layout for direct I/O.

> ISSUES

> Do we want to add a pnfs_storetype to the arguments so clients can
> ask for a particular kind of layout (e.g., file vs. object)?

Yes.

> Do we have to worry about races when this call doesn't get the layout
> but asks the server to notify later? Shouldn't we pass some token to
> the server that it passes back to us to avoid races?

I'm not worried about it but in the case I'm concerned with primarily
(i.e. file) the layout is going to be available.

> The client is expected to use a SETATTR operation in a compound right
> after flushing the delegation in order to set the access and modify
> times of the file.

In many cases (certainly in the case of a file protocol) it would be
easy to update the attributes as the reads and writes are done. I
think that at least we should have the option of the LAYOUT telling the
client that this is not needed.

> 10. Usage Scenarios
>
> What we need here is a description of common open, close, read, write
> interactions and how those work with layout delegations.
>
> 10.1 Basic Read Scenario
>
> Client does an OPEN to get an open stateID and open delegation.

But he may not get an open delegation and that's fine as far as
I'm concerned.

> Client does a LAYOUTGET for a range of the file, gets back a map and
> layout delegation stateid.
> Client uses the storage protocol and the map to access the file.
> Client returns the layout delegation with LAYOUTRETURN

Why? Has someone recalled it?

> Client returns open delegation with DELEGRETURN

Again why?

> Client closes stateID and open delegation with CLOSE.

CLOSE just closes the open file and doesn't do anything about
the delegation. If you retain the delegation, there is really
no reason to do the close. You can defer that until the
delegations is recalled.

> 10.2 Read with existing writers
>
> Client does an OPEN to get an open stateID and open delegation
> The file is open for writing elsewhere by different clients and so no
> open delegation is returned.
> A LAYOUTGET would return an error because there is no open
> delegation.

Elsewhere you say otherwise:

> Typically, LAYOUTGET will be called as part of a compound RPC after
> an OPEN operation and results in the client having location
> information for the file. If the OPEN does not return a delegation
> for the file then the client should not use the layout given to it
> for direct I/O but rather call the legacy READ and WRITE server
> operations. Yet, it can hold on to the returned layout as long as it
> is not recalled so that if a delegation is obtained later with
> DELEGASK the client can then use the layout for direct I/O.

> Client uses normal READ to get data.

Again, don't see why.

> Client closes open stateID with CLOSE

> 10.3 Read with later conflict
>
> ClientA does an OPEN to get an open stateID and open delegation.
> ClientA does a LAYOUTGET for a range of the file, gets back a map and
> layout delegation stateid.
> ClientA uses the storage protocol to access the file data.
> ClientB opens the file for WRITE
> File server issues CB_RECALL to ClientA
> ClientA issues DELEGRETURN
> File server issues CB_LAYOUTRECALL to ClientA

Why? He may do this if he feels the existence of the writer is
incompatible with the layout but in a lot cases it will not be.
This should not be done gratuitously.

> ClientA issues LAYOUTRETURN

> 10.4 Read with existing writers and subsequent callback
>
> ClientA does an OPEN to get an open stateID.
> The file is open for writing elsewhere by different clients (clientB)
> so no open delegation is returned.
> ClientA does a DELEGASK with the notify flag set to get notified when
> a delegation may be available.
> ClientB closes its use of the file with CLOSE.
> Server makes CB_DELEGAVAILABLE to ClientA.
> ClientA retries its DELEGASK to get the delegation, then
> ClientA does a LAYOUTGET using the open delegation.
> (proceed as with other read scenarios)

Again, where the layout may compatibly be held by multiple clients
then the client should be allowed to use it. Leave it up to the server

> 10.6 Large Write Case
>
> Client does an OPEN to get an open stateID and open delegation.
> (loop)
> Client does a LAYOUTGET for a range of the file, gets back a map and
> layout delegation stateid.
> Client does WRITEs to the file using the storage protocol.
> Client fills up the range covered by the layout delegation.
> Client releases layout with LAYOUTFLUSH, communicating about new EOF
> position.
> Client does SETATTR to update timestamps
> (end loop)

One interesting point is that for many forms of layout, the layout
corresponding to a very large range will be very short. You might
have 64 objects or files, each with a 1 MB strie size and carry that
pattern for a file that is PB's or EB's. For blocks, you do have to
be prepared to do the loop thing, but in many cases it won't be needed.

> Client does a DELEGRETURN

Why?

> Client does a CLOSE

> 10.7 Create with special layout
>
> Client does a CREATE and a SETATTR that specifies a particular layout
> type.
> Client gets back an open stateID and open delegation.
> (etc)

You would do a OPEN and specify initial attributes.

> 11.3 Replicated Map

> The file data is replicated on N data servers. The map consists of N
> <deviceID, objectID> tuples. When data is written using this map, it
> should be written to N objects in parallel. When data is read, any
> component object can be used.

> [This map type is controversial because it highlights the issues with
> error recovery. Those issues get interesting with any scheme that
> employs redundancy.]

I'd suggest we have enough "interesting" issues without looking for
trouble.

> 12. Issues

> Storage Protocol Negotiation
>
> Clients may want to negotiate with the metadata server about their
> preferred storage protocol, and to find out what storage protocols
> the server offers. The server could OPTIONALLY provide a negotiation
> operation where the client supplies a list of storage types, and the
> server responds with a subset of that list which it supports. Its
> devices might not support any of the storage protocols the client
> knows. The client could also specify a storage type on the LAYOUTGET
> operation so the server could restrict its results appropriately.

Yup.

> State Ids
>
> We need to have a discussion of all the NFSv4 state Ids and how the
> layout stateID is maintained along with them in a typical server
> implementation.
>
> Open state ID, associated with open owner
> Lock state ID, associated with lock owner
> Delegation state ID, returned with open delegation
> Layout state ID, returned with layout delegation

Not sure what there is to discuss here.

> Crash recovery
>
> Need to recover layout delegations in the same way as open
> delegations.

I haven't thought about this too much, but I'd start with the
clients doing the normal reclaim opens we already have, asking
for any delegations and layouts they need and go from there.

> There is a third component to crash now, which is the data service.
> We assume there is an independent recovery protocol with the storage
> devices. Do we want to formalize any part of the storage protocol
> recovery?

I think the issue is whether we want to formalize the protocol between
the metadata server and the data servers and if we do we have to addres
the recovery aspects of that, but if we don't (and so far noboody has
proposed that we do that) then we don't.

Am I missing something here?



-----Original Message-----
From: Brent B. Welch [mailto:bwelch@panasas.com]
Sent: Friday, June 04, 2004 12:29 AM
To: pnfs-reqs@yahoogroups.com
Subject: [pnfs-reqs] draft ops document in prep for June 8 meeting


I have uploaded an ops document to the yahoo site so you can
review it before the June 8 meeting. If you cannot get to it,
drop me an email and I can send you a copy.





Yahoo! Groups Links

From andros@citi.umich.edu Mon Jun 07 09:47:54 2004
Return-Path: <andros@citi.umich.edu>
X-Sender: andros@citi.umich.edu
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 29548 invoked from network); 7 Jun 2004 16:47:53 -0000
Received: from unknown (66.218.66.172)
by m22.grp.scd.yahoo.com with QMQP; 7 Jun 2004 16:47:53 -0000
Received: from unknown (HELO citi.umich.edu) (141.211.133.111)
by mta4.grp.scd.yahoo.com with SMTP; 7 Jun 2004 16:47:53 -0000
Received: from citi.umich.edu (citi.umich.edu [141.211.133.111])
by citi.umich.edu (Postfix) with ESMTP
id 13428207E5; Mon, 7 Jun 2004 12:47:03 -0400 (EDT)
X-Mailer: exmh version 2.5 07/13/2001 with version: MH 6.8.3 #74[UCI]
To: pnfs-reqs@yahoogroups.com
Cc: andros@citi.umich.edu
In-reply-to: Your message of "Fri, 04 Jun 2004 04:29:11 -0000."
<c9otqn+vo45@eGroups.com>
Mime-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Date: Mon, 07 Jun 2004 12:47:03 -0400
Message-Id: <20040607164703.13428207E5@citi.umich.edu>
X-eGroups-Remote-IP: 141.211.133.111
From: "William A.(Andy) Adamson" <andros@citi.umich.edu>
Subject: Re: [pnfs-reqs] draft ops document in prep for June 8 meeting
X-Yahoo-Group-Post: member; u=169434965

hi brent

yeah, i have yet to figure out how to get docs off the yahoo group service. ;(

please send me the ops doc

thanks!
-->Andy

bwelch@panasas.com said:
> I have uploaded an ops document to the yahoo site so you can review it
> before the June 8 meeting. If you cannot get to it, drop me an email
> and I can send you a copy. 

From dnoveck@netapp.com Mon Jun 07 09:57:25 2004
Return-Path: <Dave.Noveck@netapp.com>
X-Sender: Dave.Noveck@netapp.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 74292 invoked from network); 7 Jun 2004 16:57:25 -0000
Received: from unknown (66.218.66.166)
by m25.grp.scd.yahoo.com with QMQP; 7 Jun 2004 16:57:25 -0000
Received: from unknown (HELO mx01.netapp.com) (198.95.226.53)
by mta5.grp.scd.yahoo.com with SMTP; 7 Jun 2004 16:57:25 -0000
Received: from frejya.corp.netapp.com (frejya [10.57.157.119])
by mx01.netapp.com (8.12.10/8.12.10/NTAP-1.4) with ESMTP id i57GvG8C015739
for <pnfs-reqs@yahoogroups.com>; Mon, 7 Jun 2004 09:57:17 -0700 (PDT)
Received: from svlexc01.hq.netapp.com (svlexc01.corp.netapp.com [10.57.156.135])
by frejya.corp.netapp.com (8.12.9/8.12.9/NTAP-1.5) with ESMTP id i57GvGIC014142
for <pnfs-reqs@yahoogroups.com>; Mon, 7 Jun 2004 09:57:16 -0700 (PDT)
content-class: urn:content-classes:message
MIME-Version: 1.0
Content-Type: text/plain;
charset="us-ascii"
Content-Transfer-Encoding: quoted-printable
X-MimeOLE: Produced By Microsoft Exchange V6.0.6249.0
Date: Mon, 7 Jun 2004 09:57:13 -0700
Message-ID: <C8CF60CFC4D8A74E9945E32CF096548AB80E53@silver.nane.netapp.com>
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
Thread-Topic: [pnfs-reqs] draft ops document in prep for June 8 meeting
Thread-Index: AcRMry+g190L/TS6TCSAn/YcGkpJtQAAPWfA
To: <pnfs-reqs@yahoogroups.com>
X-eGroups-Remote-IP: 198.95.226.53
X-eGroups-From: "Noveck, Dave" <Dave.Noveck@netapp.com>
From: "Noveck, Dave" <dnoveck@netapp.com>
Subject: RE: [pnfs-reqs] draft ops document in prep for June 8 meeting
X-Yahoo-Group-Post: member; u=44152831
X-Yahoo-Profile: davidnoveck

ADVERTISEMENT
I had trouble with this too. They've made it so easy it is almost
impossible to figure out :-)

When you get to the group page for pnfs-reqs, there is a link "files"
and if you click that you can actually see the thing.

-----Original Message-----
From: William A.(Andy) Adamson [mailto:andros@citi.umich.edu]
Sent: Monday, June 07, 2004 12:47 PM
To: pnfs-reqs@yahoogroups.com
Cc: andros@citi.umich.edu
Subject: Re: [pnfs-reqs] draft ops document in prep for June 8 meeting


hi brent

yeah, i have yet to figure out how to get docs off the yahoo group
service. ;(

please send me the ops doc

thanks!
-->Andy

bwelch@panasas.com said:
> I have uploaded an ops document to the yahoo site so you can review it
> before the June 8 meeting. If you cannot get to it, drop me an email
> and I can send you a copy.







Yahoo! Groups Links

From bwelch@panasas.com Mon Jun 07 10:30:25 2004
Return-Path: <welch@panasas.com>
X-Sender: welch@panasas.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 4247 invoked from network); 7 Jun 2004 17:30:22 -0000
Received: from unknown (66.218.66.172)
by m19.grp.scd.yahoo.com with QMQP; 7 Jun 2004 17:30:22 -0000
Received: from unknown (HELO medlicott.panasas.com) (63.80.58.202)
by mta4.grp.scd.yahoo.com with SMTP; 7 Jun 2004 17:30:22 -0000
Received: from panasas.com (welch@localhost)
by medlicott.panasas.com (8.11.6/8.11.6) with ESMTP id i57HUAJ21188
for <pnfs-reqs@yahoogroups.com>; Mon, 7 Jun 2004 10:30:10 -0700
Message-Id: <200406071730.i57HUAJ21188@medlicott.panasas.com>
X-Authentication-Warning: medlicott.panasas.com: welch owned process doing -bs
X-Mailer: exmh version 2.6.4 04/07/2004 with nmh-1.0.4
To: pnfs-reqs@yahoogroups.com
In-reply-to: <C8CF60CFC4D8A74E9945E32CF096548AB80E53@silver.nane.netapp.com>
References: <C8CF60CFC4D8A74E9945E32CF096548AB80E53@silver.nane.netapp.com>
Comments: In-reply-to "Noveck, Dave" <dnoveck@netapp.com>
message dated "Mon, 07 Jun 2004 09:57:13 -0700."
X-URL: http://www.panasas.com/
X-Face: "HxE|?EnC9fVMV8f70H83&{fgLE.|FZ^$>@Q(yb#N,Eh~N]e&]=>
r5~UnRml1:4EglY{9B+
:'wJq$@c_C!l8@<$t,{YUr4K,QJGHSvS~U]H`<+L*x?eGzSk>XH\W:AK\j?@?c1o<k;j'Ei/UL)!*0
ILwSR)J\bc)gjz!rrGQ2#i*f:M:ydhK}jp4dWQW?;0{,#iWrCV$4~%e/3)$1/D
Mime-Version: 1.0
Content-Type: multipart/mixed ;
boundary="==_Exmh_17803490480"
Date: Mon, 07 Jun 2004 10:30:10 -0700
X-eGroups-Remote-IP: 63.80.58.202
X-eGroups-From: Brent Welch <welch@panasas.com>
From: Brent Welch <bwelch@panasas.com>
Subject: Re: [pnfs-reqs] draft ops document in prep for June 8 meeting
X-Yahoo-Group-Post: member; u=169551413
X-Yahoo-Profile: brent_welch_1960

ADVERTISEMENT
OK - here is an attached copy. See you tomorrow.

>>>"Noveck, Dave" said:
> I had trouble with this too. They've made it so easy it is almost
> impossible to figure out :-)
>
> When you get to the group page for pnfs-reqs, there is a link "files"
> and if you click that you can actually see the thing.
>
> -----Original Message-----
> From: William A.(Andy) Adamson [mailto:andros@citi.umich.edu]
> Sent: Monday, June 07, 2004 12:47 PM
> To: pnfs-reqs@yahoogroups.com
> Cc: andros@citi.umich.edu
> Subject: Re: [pnfs-reqs] draft ops document in prep for June 8 meeting
>
>
> hi brent
>
> yeah, i have yet to figure out how to get docs off the yahoo group
> service. ;(
>
> please send me the ops doc
>
> thanks!
> -->Andy

--
Brent Welch
Software Architect, Panasas Inc
Delivering the premier storage system for scalable Linux clusters

www.panasas.com
welch@panasas.com



Attachment (not stored)
pnfs_6_4.txt
Type: text/plain 

From pnfs-reqs@yahoogroups.com Tue Jun 08 06:34:54 2004
Return-Path: <notify@yahoogroups.com>
Received: (qmail 1187 invoked from network); 8 Jun 2004 13:34:53 -0000
Received: from unknown (66.218.66.167)
by m22.grp.scd.yahoo.com with QMQP; 8 Jun 2004 13:34:53 -0000
Received: from unknown (HELO n21.grp.scd.yahoo.com) (66.218.66.77)
by mta6.grp.scd.yahoo.com with SMTP; 8 Jun 2004 13:34:53 -0000
X-eGroups-Return: notify@yahoogroups.com
Received: from [66.218.67.147] by n21.grp.scd.yahoo.com with NNFMP; 08 Jun 2004 13:33:51 -0000
Received: (qmail 88581 invoked by uid 65534); 8 Jun 2004 13:33:51 -0000
Date: 8 Jun 2004 13:33:51 -0000
Message-ID: <1086701631.797.88578.w27@yahoogroups.com>
X-eGroups-Application: files
X-Yahoo-Group-Post: system
From: pnfs-reqs@yahoogroups.com
To: pnfs-reqs@yahoogroups.com
Subject: New file uploaded to pnfs-reqs
MIME-Version: 1.0
Content-Type: text/plain
Content-Transfer-Encoding: 7bit
X-eGroups-Remote-IP: 66.218.66.77

Hello,

This email message is a notification to let you know that
a file has been uploaded to the Files area of the pnfs-reqs
group.

File : /pNFS Use Cases.ppt
Uploaded by : benny_halevy <bhalevy@panasas.com>
Description : pNFS Use Cases for 2004-06-08 discussion

You can access this file at the URL

http://groups.yahoo.com/group/pnfs-reqs/files/pNFS%20Use%20Cases.ppt

To learn more about file sharing for your group, please visit

http://help.yahoo.com/help/us/groups/files

Regards,

benny_halevy <bhalevy@panasas.com>

From pnfs-reqs@yahoogroups.com Tue Jun 08 13:25:00 2004
Return-Path: <notify@yahoogroups.com>
Received: (qmail 36837 invoked from network); 8 Jun 2004 20:24:59 -0000
Received: from unknown (66.218.66.166)
by m21.grp.scd.yahoo.com with QMQP; 8 Jun 2004 20:24:59 -0000
Received: from unknown (HELO n34.grp.scd.yahoo.com) (66.218.66.102)
by mta5.grp.scd.yahoo.com with SMTP; 8 Jun 2004 20:24:59 -0000
X-eGroups-Return: notify@yahoogroups.com
Received: from [66.218.67.152] by n34.grp.scd.yahoo.com with NNFMP; 08 Jun 2004 20:24:56 -0000
Received: (qmail 15820 invoked by uid 65534); 8 Jun 2004 20:24:56 -0000
Date: 8 Jun 2004 20:24:56 -0000
Message-ID: <1086726296.891.15816.w31@yahoogroups.com>
X-eGroups-Application: files
X-Yahoo-Group-Post: system
From: pnfs-reqs@yahoogroups.com
To: pnfs-reqs@yahoogroups.com
Subject: New file uploaded to pnfs-reqs
MIME-Version: 1.0
Content-Type: text/plain
Content-Transfer-Encoding: 7bit
X-eGroups-Remote-IP: 66.218.66.102

ADVERTISEMENT

Hello,

This email message is a notification to let you know that
a file has been uploaded to the Files area of the pnfs-reqs
group.

File : /pNFS-June7-reqs-slides.ppt
Uploaded by : brent_welch_1960 <bwelch@panasas.com>
Description : requirements discussion June 7

You can access this file at the URL

http://groups.yahoo.com/group/pnfs-reqs/files/pNFS-June7-reqs-slides.ppt

To learn more about file sharing for your group, please visit

http://help.yahoo.com/help/us/groups/files

Regards,

brent_welch_1960 <bwelch@panasas.com>

From dnoveck@netapp.com Tue Jun 08 13:48:51 2004
Return-Path: <Dave.Noveck@netapp.com>
X-Sender: Dave.Noveck@netapp.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 92671 invoked from network); 8 Jun 2004 20:48:50 -0000
Received: from unknown (66.218.66.166)
by m24.grp.scd.yahoo.com with QMQP; 8 Jun 2004 20:48:50 -0000
Received: from unknown (HELO mx01.netapp.com) (198.95.226.53)
by mta5.grp.scd.yahoo.com with SMTP; 8 Jun 2004 20:48:49 -0000
Received: from frejya.corp.netapp.com (frejya [10.57.157.119])
by mx01.netapp.com (8.12.10/8.12.10/NTAP-1.4) with ESMTP id i58Kml8C027243
for <pnfs-reqs@yahoogroups.com>; Tue, 8 Jun 2004 13:48:47 -0700 (PDT)
Received: from svlexc01.hq.netapp.com (svlexc01.corp.netapp.com [10.57.156.135])
by frejya.corp.netapp.com (8.12.9/8.12.9/NTAP-1.5) with ESMTP id i58KmkIC027745
for <pnfs-reqs@yahoogroups.com>; Tue, 8 Jun 2004 13:48:47 -0700 (PDT)
content-class: urn:content-classes:message
MIME-Version: 1.0
Content-Type: text/plain;
charset="iso-8859-1"
Content-Transfer-Encoding: quoted-printable
X-MimeOLE: Produced By Microsoft Exchange V6.0.6249.0
Date: Tue, 8 Jun 2004 13:48:42 -0700
Message-ID: <C8CF60CFC4D8A74E9945E32CF096548A01C8EE05@silver.nane.netapp.com>
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
Thread-Topic: [pnfs-reqs] New file uploaded to pnfs-reqs
Thread-Index: AcRNlq//cTCsmXTSRHCV7HCDPenUhwAAOtNg
To: <pnfs-reqs@yahoogroups.com>
X-eGroups-Remote-IP: 198.95.226.53
X-eGroups-From: "Noveck, Dave" <Dave.Noveck@netapp.com>
From: "Noveck, Dave" <dnoveck@netapp.com>
Subject: RE: [pnfs-reqs] New file uploaded to pnfs-reqs
X-Yahoo-Group-Post: member; u=44152831
X-Yahoo-Profile: davidnoveck

Some typos.

Slide 2:

Principal goals: minor extension of NFSv4
Principal goals: scalable bandwidth & capacity

Slide 3:

Orthogonal and complementary to RDMA

Slide 4:

Referrals are intended for other purposes; it should not be necessary for referrals to be used to get high performance

Slide 9 (this seems wrong but I don't know how to fix it):

Must not require changes in existing internet infrastructure include NFSv4, with the exception of extensions to NFSv4

Slide 10:

Files and filesystems should be manageable through the NFSv4 server w/o necessity for direct client access to storage; although storage management may be used on storage in accordance with normal procedures under NFSv4

Slide 11:

Scope of client trust (what can it get to with a layout) should be well documented and as constrainted to layout delegation as is possible cost-effectively

Slide 14:

Orthogonal and complementary to transport improvements (RDMA)

NFS extensions for control & consistency of metadata, not meaning


-----Original Message-----
From: pnfs-reqs@yahoogroups.com [mailto:pnfs-reqs@yahoogroups.com]
Sent: Tuesday, June 08, 2004 4:25 PM
To: pnfs-reqs@yahoogroups.com
Subject: [pnfs-reqs] New file uploaded to pnfs-reqs



Hello,

This email message is a notification to let you know that
a file has been uploaded to the Files area of the pnfs-reqs
group.

File : /pNFS-June7-reqs-slides.ppt
Uploaded by : brent_welch_1960 <bwelch@panasas.com>
Description : requirements discussion June 7

You can access this file at the URL

http://groups.yahoo.com/group/pnfs-reqs/files/pNFS-June7-reqs-slides.ppt

To learn more about file sharing for your group, please visit

http://help.yahoo.com/help/us/groups/files

Regards,

brent_welch_1960 <bwelch@panasas.com>








Yahoo! Groups Links

From bwelch@panasas.com Fri Jun 11 17:18:27 2004
Return-Path: <welch@panasas.com>
X-Sender: welch@panasas.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 59406 invoked from network); 12 Jun 2004 00:18:26 -0000
Received: from unknown (66.218.66.172)
by m22.grp.scd.yahoo.com with QMQP; 12 Jun 2004 00:18:26 -0000
Received: from unknown (HELO medlicott.panasas.com) (63.80.58.202)
by mta4.grp.scd.yahoo.com with SMTP; 12 Jun 2004 00:18:26 -0000
Received: from panasas.com (welch@localhost)
by medlicott.panasas.com (8.11.6/8.11.6) with ESMTP id i5C0IQO19920
for <pnfs-reqs@yahoogroups.com>; Fri, 11 Jun 2004 17:18:26 -0700
Message-Id: <200406120018.i5C0IQO19920@medlicott.panasas.com>
X-Authentication-Warning: medlicott.panasas.com: welch owned process doing -bs
X-Mailer: exmh version 2.6.4 04/07/2004 with nmh-1.0.4
To: pnfs-reqs@yahoogroups.com
In-reply-to: <c9otqn+vo45@eGroups.com>
References: <c9otqn+vo45@eGroups.com>
Comments: In-reply-to "Brent B. Welch" <bwelch@panasas.com>
message dated "Fri, 04 Jun 2004 04:29:11 -0000."
X-URL: http://www.panasas.com/
X-Face: "HxE|?EnC9fVMV8f70H83&{fgLE.|FZ^$>@Q(yb#N,Eh~N]e&]=>
r5~UnRml1:4EglY{9B+
:'wJq$@c_C!l8@<$t,{YUr4K,QJGHSvS~U]H`<+L*x?eGzSk>XH\W:AK\j?@?c1o<k;j'Ei/UL)!*0
ILwSR)J\bc)gjz!rrGQ2#i*f:M:ydhK}jp4dWQW?;0{,#iWrCV$4~%e/3)$1/D
Mime-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Date: Fri, 11 Jun 2004 17:18:26 -0700
X-eGroups-Remote-IP: 63.80.58.202
X-eGroups-From: Brent Welch <welch@panasas.com>
From: Brent Welch <bwelch@panasas.com>
Subject: Re: [pnfs-reqs] draft ops document in prep for June 8 meeting
X-Yahoo-Group-Post: member; u=169551413
X-Yahoo-Profile: brent_welch_1960

FYI - I promised an updated ops document that reflects what we
talked about on Tuesday. I have a draft completed but need to
review it a bit more before circulating it.

--
Brent Welch
Software Architect, Panasas Inc
Delivering the premier storage system for scalable Linux clusters

www.panasas.com
welch@panasas.com

From andros@citi.umich.edu Mon Jun 14 08:27:33 2004
Return-Path: <andros@citi.umich.edu>
X-Sender: andros@citi.umich.edu
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 97373 invoked from network); 14 Jun 2004 15:27:32 -0000
Received: from unknown (66.218.66.216)
by m20.grp.scd.yahoo.com with QMQP; 14 Jun 2004 15:27:32 -0000
Received: from unknown (HELO citi.umich.edu) (141.211.133.111)
by mta1.grp.scd.yahoo.com with SMTP; 14 Jun 2004 15:27:32 -0000
Received: from citi.umich.edu (citi.umich.edu [141.211.133.111])
by citi.umich.edu (Postfix) with ESMTP
id 4F0C020844; Mon, 14 Jun 2004 11:27:12 -0400 (EDT)
X-Mailer: exmh version 2.5 07/13/2001 with version: MH 6.8.3 #74[UCI]
To: pnfs-reqs@yahoogroups.com
Cc: bwickman@umich.edu
Mime-Version: 1.0
Content-Type: multipart/mixed ;
boundary="==_Exmh_1223277690"
Date: Mon, 14 Jun 2004 11:27:12 -0400
Message-Id: <20040614152712.4F0C020844@citi.umich.edu>
X-eGroups-Remote-IP: 141.211.133.111
From: "William A.(Andy) Adamson" <andros@citi.umich.edu>
Subject: Scribes notes from the pNFS meeting at CITI
X-Yahoo-Group-Post: member; u=169434965

Thanks to brian for taking notes.

-->Andy




Attachment (not stored)
minutes-6-8-2004.pNFS
Type: text/plain 

From bwelch@panasas.com Thu Jun 17 18:15:55 2004
Return-Path: <welch@panasas.com>
X-Sender: welch@panasas.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 37691 invoked from network); 18 Jun 2004 01:15:54 -0000
Received: from unknown (66.218.66.218)
by m3.grp.scd.yahoo.com with QMQP; 18 Jun 2004 01:15:54 -0000
Received: from unknown (HELO medlicott.panasas.com) (63.80.58.202)
by mta3.grp.scd.yahoo.com with SMTP; 18 Jun 2004 01:15:54 -0000
Received: from panasas.com (welch@localhost)
by medlicott.panasas.com (8.11.6/8.11.6) with ESMTP id i5I1Fq118244
for <pnfs-reqs@yahoogroups.com>; Thu, 17 Jun 2004 18:15:52 -0700
Message-Id: <200406180115.i5I1Fq118244@medlicott.panasas.com>
X-Authentication-Warning: medlicott.panasas.com: welch owned process doing -bs
X-Mailer: exmh version 2.6.4 06/14/2004 with nmh-1.0.4
To: pnfs-reqs@yahoogroups.com
X-URL: http://www.panasas.com/
X-Face: "HxE|?EnC9fVMV8f70H83&{fgLE.|FZ^$>@Q(yb#N,Eh~N]e&]=>
r5~UnRml1:4EglY{9B+
:'wJq$@c_C!l8@<$t,{YUr4K,QJGHSvS~U]H`<+L*x?eGzSk>XH\W:AK\j?@?c1o<k;j'Ei/UL)!*0
ILwSR)J\bc)gjz!rrGQ2#i*f:M:ydhK}jp4dWQW?;0{,#iWrCV$4~%e/3)$1/D
Mime-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Date: Thu, 17 Jun 2004 18:15:52 -0700
X-eGroups-Remote-IP: 63.80.58.202
X-eGroups-From: Brent Welch <welch@panasas.com>
From: Brent Welch <bwelch@panasas.com>
Subject: summary of June face-to-face
X-Yahoo-Group-Post: member; u=169551413
X-Yahoo-Profile: brent_welch_1960

ADVERTISEMENT
Here is a summary of where I think we are. I've been updating the
draft document, but I think it'll be more clear to chart out the
decisions we (may have) made and review some of the issues we
(may have) closed. I'm sure you won't hesitate to chime in if
I've got something wrong. The scribe's notes were a great help.
I will see what sort of discussion ensues from this and then
send out an updated version of the draft.

Terminology

We went over some standardized terminology first, and I think it
will be helpful if we stick with these terms:

"Layout" implies "layout delegation" but we try not to actually
use the d-word when talking about layouts to avoid confusion with
the existing delegation mechanism in NFSv4. When a client gets a
layout, it also gets a promise from the service to recall that
layout when it must no longer use it. (More on this later)

"Storage Device" is what the client uses to get the data
when bypassing the NFS server. Previous terms this should
replace include "Data Server" and "Device" and "Storage Server".

"Storage Protocol" is the network protocol the client uses
when talking to the Storage Device.

"Storage Management Protocol" is the protocol between the
NFSv4 server and the Storage Device, which is not specified
in our protocol spec, but a private channel used for management
purposes.

Layouts and Data Delegations

The most central issue we discussed was decoupling layouts from
the existing data delegations. While this makes some nervous,
I think it makes a good deal of sense because the existing delegations
are about reducing open/close traffic and allowing clients to give
out byte range locks. By keeping layout information orthogonal
and independent from data delegations things are simpler. Most
arguments about trying to leverage data delegations are really
concerns about client cache consistency during concurrent access.
However, for our target application (high bandwidth access to a
file from multiple storage devices), the issues of cache consistency
are orthogonal.

Server Control over Conflicting Layouts

That said, there are still cases where a server may want to enforce
strict access control to clients that are using layouts, and so the
server can still recall layouts as necessary to support that. We
discussed the notion of a "shared" vs. "exclusive" layout and letting
the client hint to the server about what kind of layout it wants.
(We talked at length about this today during the conference call.)

I think this is the gist of the current proposal:
Client does open to get a state ID.
LAYOUTGET has an open_stateID argument, plus a
"sharing_token_id" argument. (that's a bad name for it,
what did we use this morning?) There are three possibilities
for the sharing token, strict, don't care, and a group identifier.
The idea is that if you pass in strict (e.g. 0x1) then the server
does not issue conflicting/overlapping layouts to different clients.
If you pass in don't care (e.g., 0x0) then the server can do
whatever it wants. If you pass in some other value, then it
represents the client's identifier for a cooperating set of
applications that don't mind getting overlapping layout information
because they are asserting they provide an external synchronization
mechanism. Finally, in any case a client *must* honor a layout
recall and stop using it - it must blindly follow directions.
In particular, if a client asked for a "dont' care" but the
server wanted to be strict, it still could.

Layout Commit

We also decided to rename LAYOUTFLUSH to LAYOUTCOMMIT to better reflect
its meaning. This operation pushes new EOF information to the server
and ensures that other clients will see new data that has been written,
especially data past the end of file. There is still LAYOUTRETURN
that gives back the layout information. One way to think about this
is that the client will make a LAYOUTCOMMIT whenever it would make
a COMMIT. But COMMIT doesn't apply, of course, because the client
has not been writing data to the NFS server, but directly to storage.

Callbacks and pending callbacks

The draft-00 had a notion of a pending callback and a notification
that a layout was available. We think we can live without this.
Instead, the server just gives out layouts and recalls them. If
a client does not get a layout, then it retries later, or it uses
regular READ/WRITE operations through the server instead.

When the server recalls a layout the client may have to complete
some I/O requests (storage writes). We have not yet decided if
the client responds to the recall immediately, then does its writes,
and finishes up with LAYOUTRETURN, or it does the I/O and the
LAYOUTRETURN before replying to the CB_LAYOUTRECALL

Storage Protocol Negotiation

The client needs to be able to ask for particular kinds of layouts
that involve particular storage protocols. The LAYOUTGET operation
will include a storage protocol type, and the server will fail to
return a layout if it cannot give it that type.

We discussed variations
on this scheme where the client supplies a list of possibilities and
the server returns the first one it supports. We had a lengthy
discussion about whether it is possible to do this negotiation for
the whole filesystem first, but rejected that approach
because different files might be available through different storage
protocols. I believe we postulated an attribute on the file that
identified the "preferred" storage protocol that a client could
use to access that file.

Relation to share mode reservations

Open-time share mode locks (i.e., share reservations) came up
in the discussion, but I'm pretty sure they are completely orthogonal
to layouts.

Relation to byte range locks

Much discussion about this, see above. Advisory locks are orthogonal
to layout information. *However*, if a server is implementing
mandatory locks, then it really must recall layouts that would allow
conflicting I/O. Ignoring mandatory locks, advisory byte range locks
are a nice complement to layouts because they provide a way for
clients that have overlapping layout information to synchronize
their access. Because there is no "lock recall", clients that
synchronize this way must be prompt about releasing write locks.

Interactions with legacy reads and writes

We only touched on this, but to me the best way to think about this
is that the NFS server can be thought of as a peer to the clients.
It must be using the same sort of layout information to access
the storage (unless it happens to be the storage device), so we
can apply the same rules about conflicting layouts to the server.
Each WRITE, for example, is as if the NFS server is doing a
LAYOUTGET for that file. Depending on what sort of consistency
the server and its clients want affects whether or not that WRITE
causes the server to recall layouts. We need to discuss this more.

Error recovery (server or client crash)

If the server crashes, then the clients need to get their layouts
after it restarts. This needs to be described in the draft doc.

GETDEVINFO

I think we can chuck all the GETDEVINFO and GETDEVLIST from
the protocol. That was about mapping from a compact device ID to
the full addressing information.
This is private to the storage protocol. There
may be some additional message traffic between the client and
the NFS server, but for now we can assume that layouts specify
storage devices with something like
struct pnfs_devaddr4 {
string r_netid<>; /* network ID */
string r_addr<>; /* universal address */
}
I know you can put IP addr/port information here, and we should
be able to put universal SCSI device addresses in there, too.

DELEGASK / DELEGRETURN

We think this is an orthogonal NFSv4 extension (dubbed
"c-NFS" for cache-coherent NFS) that would all a more sophisticated
cache consistency implementation. This is likely to be
controversial and we think we can get most of the value prop
of pNFS without this.

Summary

I think this boils it down to LAYOUTGET, LAYOUTCOMMIT, and
LAYOUTRETURN, with some negotiation about preferred storage
protocols and the kind of sharing the client and server expect.

--
Brent Welch
Software Architect, Panasas Inc
Delivering the premier storage system for scalable Linux clusters

www.panasas.com
welch@panasas.com

From dhildebz@eecs.umich.edu Fri Jun 18 08:46:58 2004
Return-Path: <dhildebz@eecs.umich.edu>
X-Sender: dhildebz@eecs.umich.edu
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 35376 invoked from network); 18 Jun 2004 15:46:57 -0000
Received: from unknown (66.218.66.172)
by m24.grp.scd.yahoo.com with QMQP; 18 Jun 2004 15:46:57 -0000
Received: from unknown (HELO willow.eecs.umich.edu) (141.213.4.14)
by mta4.grp.scd.yahoo.com with SMTP; 18 Jun 2004 15:46:56 -0000
Received: from willow.eecs.umich.edu (localhost.eecs.umich.edu [127.0.0.1])
by willow.eecs.umich.edu (8.12.11/8.12.11) with ESMTP id i5IFks2E002573
(version=TLSv1/SSLv3 cipher=DHE-RSA-AES256-SHA bits=256 verify=NO)
for <pnfs-reqs@yahoogroups.com>; Fri, 18 Jun 2004 11:46:56 -0400
Received: from localhost (dhildebz@localhost)
by willow.eecs.umich.edu (8.12.11/8.12.11/Submit) with ESMTP id i5IFkshS002570
for <pnfs-reqs@yahoogroups.com>; Fri, 18 Jun 2004 11:46:54 -0400
X-Authentication-Warning: willow.eecs.umich.edu: dhildebz owned process doing -bs
Date: Fri, 18 Jun 2004 11:46:54 -0400 (EDT)
To: pnfs-reqs@yahoogroups.com
In-Reply-To: <200406180115.i5I1Fq118244@medlicott.panasas.com>
Message-ID: <Pine.LNX.4.58.0406181115150.1687@willow.eecs.umich.edu>
References: <200406180115.i5I1Fq118244@medlicott.panasas.com>
MIME-Version: 1.0
Content-Type: TEXT/PLAIN; charset=iso-8859-1
Content-Transfer-Encoding: QUOTED-PRINTABLE
X-eGroups-Remote-IP: 141.213.4.14
From: Dean Hildebrand <dhildebz@eecs.umich.edu>
Subject: Re: [pnfs-reqs] summary of June face-to-face
X-Yahoo-Group-Post: member; u=169352062
X-Yahoo-Profile: seattleplus

This is great, thanks to Brian and Brent.

> Callbacks and pending callbacks
>
> The draft-00 had a notion of a pending callback and a notification
> that a layout was available.  We think we can live without this.
> Instead, the server just gives out layouts and recalls them.  If
> a client does not get a layout, then it retries later, or it uses
> regular READ/WRITE operations through the server instead.
Just to understand this a little better, are we essentially adding
additional functionality to existing parallel file systems with the
ability for the client to write to the NFS server? Is it true that in
existing parallel file systems, if a client requests a conflicting byte
range (assuming the client cares about conflicts) it is essentially halted
until the conflict is resolved?

> Storage Protocol Negotiation
>
> The client needs to be able to ask for particular kinds of layouts
> that involve particular storage protocols.  The LAYOUTGET operation
> will include a storage protocol type, and the server will fail to
> return a layout if it cannot give it that type.
>
> We discussed variations
> on this scheme where the client supplies a list of possibilities and
> the server returns the first one it supports.  We had a lengthy
> discussion about whether it is possible to do this negotiation for
> the whole filesystem first, but rejected that approach
> because different files might be available through different storage
> protocols.   I believe we postulated an attribute on the file that
> identified the "preferred" storage protocol that a client could
> use to access that file.
I'm wondering if this is unecessary. If a file exists then the preferred
type of file system for access must be available (otherwise who is storing
the file?). Using the attribute idea, the client requests the attribute
and if it has a I/O module to support that type of access it can use
parallel I/O, otherwise it must read/write through the NFS server. If the
attribute doesn't exist or is NULL or something, parallel I/O does not
exist for this file.


> Relation to share mode reservations
> Relation to byte range locks
> Much discussion about this, see above.  Advisory locks are orthogonal
> to layout information.  *However*, if a server is implementing
> mandatory locks, then it really must recall layouts that would allow
> conflicting I/O. 
I think the only real requirement here is that clients do not end up with
conflicting layouts. If client A and client B do not have conflicting
locks BUT they have conflicting layouts, then the server should be able to
only recall the conflicting bits of the layouts. I'm wondering how much
we let implementors/applications shoot themselves in the foot. Do we need
a requirement that servers cannot return layouts that conflict with
outstanding locks? My thoughts are that we should tune the protocol so
clients have the greatest chance of being able to use parallel I/O and not
accidentally end up writing through the NFS server b/c their ops were out
of order or soemthing.

Dean

From black_david@emc.com Fri Jun 18 17:10:01 2004
Return-Path: <Black_David@emc.com>
X-Sender: Black_David@emc.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 2019 invoked from network); 19 Jun 2004 00:09:59 -0000
Received: from unknown (66.218.66.172)
by m11.grp.scd.yahoo.com with QMQP; 19 Jun 2004 00:09:59 -0000
Received: from unknown (HELO mailhub.lss.emc.com) (168.159.2.31)
by mta4.grp.scd.yahoo.com with SMTP; 19 Jun 2004 00:09:59 -0000
Received: from MAHO3MSX2.corp.emc.com (maho3msx2.isus.emc.com [128.221.11.32])
by mailhub.lss.emc.com (Switch-2.2.8/Switch-2.2.0) with ESMTP id i5J09vB24376
for <pnfs-reqs@yahoogroups.com>; Fri, 18 Jun 2004 20:09:57 -0400 (EDT)
Received: by maho3msx2.corp.emc.com with Internet Mail Service (5.5.2653.19)
id <M6BMT7VA>; Fri, 18 Jun 2004 20:09:57 -0400
Message-ID: <B459CE1AFFC52D4688B2A5B842CA35EA7A5BA2@corpmx14.corp.emc.com>
To: pnfs-reqs@yahoogroups.com
Date: Fri, 18 Jun 2004 20:09:56 -0400
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2653.19)
Content-Type: text/plain
X-PMX-Version: 4.6.0.97784, Antispam-Core: 4.6.0.97340, Antispam-Data: 2004.6.18.104407
X-eGroups-Remote-IP: 168.159.2.31
From: black_david@emc.com
Subject: RE: [pnfs-reqs] summary of June face-to-face
X-Yahoo-Group-Post: member; u=82420288
X-Yahoo-Profile: dlb237

Brent,

> Server Control over Conflicting Layouts
>
> That said, there are still cases where a server may want to enforce
> strict access control to clients that are using layouts, and so the
> server can still recall layouts as necessary to support that. We
> discussed the notion of a "shared" vs. "exclusive" layout and letting
> the client hint to the server about what kind of layout it wants.
> (We talked at length about this today during the conference call.)

I don't like the "shared" vs. "exclusive" terminology, because this
seems to be placing responsibility for a consistency decision on the
client. I think about this in terms of the client asking the server
for the consistency policy it wants. In this context, the "shared"
case client is asking that accesses from components of a specific
application (identified by some sort of ID) not be considered to be
in conflict by the server. This is mostly about terminology, but
I think we really should be viewing this in terms of the results
the client wants to see.

> I think this is the gist of the current proposal:

[... snip ...]

The proposal's ok, except that this really ought to be done at
coarser granularity than each LAYOUT operation. A mount op solely
for pNFS is an attractive way to address this and protocol
negotiation problem.

> Callbacks and pending callbacks
>
> The draft-00 had a notion of a pending callback and a notification
> that a layout was available. We think we can live without this.
> Instead, the server just gives out layouts and recalls them. If
> a client does not get a layout, then it retries later, or it uses
> regular READ/WRITE operations through the server instead.

This is at risk of breakage under heavy contention, as it consumes
RPC contexts if the server wants to try to do liveness among clients.
The ability to queue a layout request and issue a callback when
the server has completed it (callback gives the requested layout
to the client) helps.

> When the server recalls a layout the client may have to complete
> some I/O requests (storage writes). We have not yet decided if
> the client responds to the recall immediately, then does its writes,
> and finishes up with LAYOUTRETURN, or it does the I/O and the
> LAYOUTRETURN before replying to the CB_LAYOUTRECALL

If the LAYOUTRECALL happens when the client is done and implies the
return of the layouts that the server wanted recalled, it saves a
LAYOUTRETURN op and a round trip if a read layout range is being
recalled.

> Storage Protocol Negotiation
>
> The client needs to be able to ask for particular kinds of layouts
> that involve particular storage protocols. The LAYOUTGET operation
> will include a storage protocol type, and the server will fail to
> return a layout if it cannot give it that type.

I'd be much happier if this were at a coarser granularity than each
layout operation (e.g., at open time, or add a mount operation
solely for pNFS).

> We discussed variations
> on this scheme where the client supplies a list of possibilities and
> the server returns the first one it supports. We had a lengthy
> discussion about whether it is possible to do this negotiation for
> the whole filesystem first, but rejected that approach
> because different files might be available through different storage
> protocols. I believe we postulated an attribute on the file that
> identified the "preferred" storage protocol that a client could
> use to access that file.

This is much better done at some coarser filesystem granularity.

> GETDEVINFO
>
> I think we can chuck all the GETDEVINFO and GETDEVLIST from
> the protocol. That was about mapping from a compact device ID to
> the full addressing information. This is private to the storage
> protocol. There
> may be some additional message traffic between the client and
> the NFS server, but for now we can assume that layouts specify
> storage devices with something like
> struct pnfs_devaddr4 {
> string r_netid<>; /* network ID */
> string r_addr<>; /* universal address */
> }
> I know you can put IP addr/port information here, and we should
> be able to put universal SCSI device addresses in there, too.

That won't fly in the block world. We need to pass some
potentially large disk signature info, and again, mount time is
a convenient time to do that. The worst problem is that
there's no such thing as a universal SCSI device address for
parallel SCSI or private Fibre Channel loops. Beyond that,
using device addressing is going to run into multipathing
concerns (what you think is one SCSI device actually has
multiple addresses, whose use may vary over time) - it's
better not to have to try to cope with this. In general,
it's significantly more robust and reliable to just use
disk signatures.

Thanks,
--David
----------------------------------------------------
David L. Black, Senior Technologist
EMC Corporation, 176 South St., Hopkinton, MA 01748
+1 (508) 293-7953 FAX: +1 (508) 293-7786
black_david@emc.com Mobile: +1 (978) 394-7754
----------------------------------------------------

From garth@panasas.com Fri Jun 18 20:26:44 2004
Return-Path: <garth@panasas.com>
X-Sender: garth@panasas.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 9841 invoked from network); 19 Jun 2004 03:26:43 -0000
Received: from unknown (66.218.66.218)
by m21.grp.scd.yahoo.com with QMQP; 19 Jun 2004 03:26:43 -0000
Received: from unknown (HELO PIKES.panasas.com) (65.194.124.178)
by mta3.grp.scd.yahoo.com with SMTP; 19 Jun 2004 03:26:43 -0000
Received: from [127.0.0.1] ([172.17.19.3]) by PIKES.panasas.com with SMTP (Microsoft Exchange Internet Mail Service Version 5.5.2653.13)
id 2B56SAHC; Fri, 18 Jun 2004 23:26:41 -0400
Mime-Version: 1.0 (Apple Message framework v618)
In-Reply-To: <Pine.LNX.4.58.0406181115150.1687@willow.eecs.umich.edu>
References: <200406180115.i5I1Fq118244@medlicott.panasas.com> <Pine.LNX.4.58.0406181115150.1687@willow.eecs.umich.edu>
Content-Type: text/plain; charset=ISO-8859-1; format=flowed
Message-Id: <5A871606-C185-11D8-AC74-000A95A94F04@panasas.com>
Content-Transfer-Encoding: quoted-printable
Date: Fri, 18 Jun 2004 17:12:30 -0700
To: pnfs-reqs@yahoogroups.com
X-Mailer: Apple Mail (2.618)
X-eGroups-Remote-IP: 65.194.124.178
From: Garth Gibson <garth@panasas.com>
Subject: Re: [pnfs-reqs] summary of June face-to-face
X-Yahoo-Group-Post: member; u=169457820
X-Yahoo-Profile: garth_a_gibson

On Jun 18, 2004, at 8:46 AM, Dean Hildebrand wrote:
> This is great, thanks to Brian and Brent.
>
>> Callbacks and pending callbacks
>>
>> The draft-00 had a notion of a pending callback and a notification
>> that a layout was available.  We think we can live without this.
>> Instead, the server just gives out layouts and recalls them.  If
>> a client does not get a layout, then it retries later, or it uses
>> regular READ/WRITE operations through the server instead.
> Just to understand this a little better, are we essentially adding
> additional functionality to existing parallel file systems with the
> ability for the client to write to the NFS server? Is it true that in

I'd say we are adding functionality to NFSv4 to give it the data
parallelism possible in many of today's parallel file systems.

> existing parallel file systems, if a client requests a conflicting byte
> range (assuming the client cares about conflicts) it is essentially
> halted
> until the conflict is resolved?

Different parallel file systems do this differently.

>
>> Storage Protocol Negotiation
>>
>> The client needs to be able to ask for particular kinds of layouts
>> that involve particular storage protocols.  The LAYOUTGET operation
>> will include a storage protocol type, and the server will fail to
>> return a layout if it cannot give it that type.
>>
>> We discussed variations
>> on this scheme where the client supplies a list of possibilities and
>> the server returns the first one it supports.  We had a lengthy
>> discussion about whether it is possible to do this negotiation for
>> the whole filesystem first, but rejected that approach
>> because different files might be available through different storage
>> protocols.   I believe we postulated an attribute on the file that
>> identified the "preferred" storage protocol that a client could
>> use to access that file.
> I'm wondering if this is unecessary. If a file exists then the
> preferred
> type of file system for access must be available (otherwise who is
> storing
> the file?). Using the attribute idea, the client requests the
> attribute
> and if it has a I/O module to support that type of access it can use
> parallel I/O, otherwise it must read/write through the NFS server. If
> the
> attribute doesn't exist or is NULL or something, parallel I/O does not
> exist for this file.

I agree that it is most likely that a specific file can only be read
directly using one storage protocol. But it is certainly possible for
two different files under the control of the same NFSv4 server to be
directly available through different protocols (for example, an NFSv4
server with pNFS extensions exporting two different local file systems
that are different types of parallel file systems.

What I'd like to avoid is a round trip on each open for the client to
determine what type of layout to request. I'd like this to be known or
mostly guessable before the open. Of course, many clients will support
only one choice, so that is the one to use :-) But it seems not
unlikely that a client could support SBC on iSCSI, NFS striping and OSD
on iSCSI using the same GE NIC.

>> Relation to share mode reservations
>> Relation to byte range locks
>> Much discussion about this, see above.  Advisory locks are orthogonal
>> to layout information.  *However*, if a server is implementing
>> mandatory locks, then it really must recall layouts that would allow
>> conflicting I/O. 
> I think the only real requirement here is that clients do not end up
> with
> conflicting layouts. If client A and client B do not have conflicting
> locks BUT they have conflicting layouts, then the server should be
> able to
> only recall the conflicting bits of the layouts. I'm wondering how
> much
> we let implementors/applications shoot themselves in the foot. Do we

I like your choice of words, because I think we let implementors shoot
themselves in the foot, simplistically resolve all conflicts with "use
the NFSv4 server" or extensively optimize performance for aggressive
parallel applications.

> need
> a requirement that servers cannot return layouts that conflict with
> outstanding locks? My thoughts are that we should tune the protocol so
> clients have the greatest chance of being able to use parallel I/O and
> not
> accidentally end up writing through the NFS server b/c their ops were
> out
> of order or soemthing.

I agree, but I'd say it differently. I think we want to work hard to
make it so that implementors can do this, but also work hard to allow
other implementors to get some of the benefit with much simpler
implementations. That is, we try not to use the wire protocol to force
a specific implementation (ie., limit the implementation creativity),
or to force the base cost of providing an implementation to be too
high.

From ggrider@lanl.gov Fri Jun 18 21:24:41 2004
Return-Path: <ggrider@lanl.gov>
X-Sender: ggrider@lanl.gov
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 24343 invoked from network); 19 Jun 2004 04:24:40 -0000
Received: from unknown (66.218.66.217)
by m24.grp.scd.yahoo.com with QMQP; 19 Jun 2004 04:24:40 -0000
Received: from unknown (HELO mailwasher-b.lanl.gov) (192.16.0.25)
by mta2.grp.scd.yahoo.com with SMTP; 19 Jun 2004 04:24:39 -0000
Received: from mailrelay1.lanl.gov (localhost.localdomain [127.0.0.1])
by mailwasher-b.lanl.gov (8.12.10/8.12.10/(ccn-5)) with ESMTP id i5J4OZCp030233
for <pnfs-reqs@yahoogroups.com>; Fri, 18 Jun 2004 22:24:35 -0600
Received: from cic-mail.lanl.gov (localhost.localdomain [127.0.0.1])
by mailrelay1.lanl.gov (8.12.10/8.12.10/(ccn-5)) with ESMTP id i5J4OZ3V022563
for <pnfs-reqs@yahoogroups.com>; Fri, 18 Jun 2004 22:24:35 -0600
Received: from cthulu.lanl.gov (vpn-client-148.lanl.gov [128.165.253.148])
by cic-mail.lanl.gov (8.12.10/8.12.10/(ccn-5)) with ESMTP id i5J4OUia028628;
Fri, 18 Jun 2004 22:24:31 -0600
Message-Id: <5.2.0.9.2.20040618222113.040751c8@cic-mail.lanl.gov>
X-Sender: ggrider@cic-mail.lanl.gov
X-Mailer: QUALCOMM Windows Eudora Version 5.2.0.9
Date: Fri, 18 Jun 2004 22:24:31 -0600
To: pnfs-reqs@yahoogroups.com, pnfs-reqs@yahoogroups.com
In-Reply-To: <5A871606-C185-11D8-AC74-000A95A94F04@panasas.com>
References: <Pine.LNX.4.58.0406181115150.1687@willow.eecs.umich.edu>
<200406180115.i5I1Fq118244@medlicott.panasas.com>
<Pine.LNX.4.58.0406181115150.1687@willow.eecs.umich.edu>
Mime-Version: 1.0
Content-Type: multipart/related;
type="multipart/alternative";
boundary="=====================_4936237==.REL"
X-Scanned-By: MIMEDefang 2.35
X-eGroups-Remote-IP: 192.16.0.25
From: Gary Grider <ggrider@lanl.gov>
Subject: Re: [pnfs-reqs] summary of June face-to-face
X-Yahoo-Group-Post: member; u=169341185
X-Yahoo-Profile: ggriderpnfs

At 05:12 PM 6/18/2004 -0700, Garth Gibson wrote:

> On Jun 18, 2004, at 8:46 AM, Dean Hildebrand wrote:
> > This is great, thanks to Brian and Brent.
> >
> >> Callbacks and pending callbacks
> >>
> >> The draft-00 had a notion of a pending callback and a notification
> >> that a layout was available.  We think we can live without this.
> >> Instead, the server just gives out layouts and recalls them.  If
> >> a client does not get a layout, then it retries later, or it uses
> >> regular READ/WRITE operations through the server instead.
> > Just to understand this a little better, are we essentially adding
> > additional functionality to existing parallel file systems with the
> > ability for the client to write to the NFS server?  Is it true that in
>
> I'd say we are adding functionality to NFSv4 to give it the data
> parallelism possible in many of today's parallel file systems.


I would agree, and I have a few of them


> > existing parallel file systems, if a client requests a conflicting byte
> > range (assuming the client cares about conflicts) it is essentially
> > halted
> > until the conflict is resolved?
>
> Different parallel file systems do this differently.
>
> >
> >> Storage Protocol Negotiation
> >>
> >> The client needs to be able to ask for particular kinds of layouts
> >> that involve particular storage protocols.  The LAYOUTGET operation
> >> will include a storage protocol type, and the server will fail to
> >> return a layout if it cannot give it that type.
> >>
> >> We discussed variations
> >> on this scheme where the client supplies a list of possibilities and
> >> the server returns the first one it supports.  We had a lengthy
> >> discussion about whether it is possible to do this negotiation for
> >> the whole filesystem first, but rejected that approach
> >> because different files might be available through different storage
> >> protocols.   I believe we postulated an attribute on the file that
> >> identified the "preferred" storage protocol that a client could
> >> use to access that file.
> > I'm wondering if this is unecessary.  If a file exists then the
> > preferred
> > type of file system for access must be available (otherwise who is
> > storing
> > the file?).  Using the attribute idea, the client requests the
> > attribute
> > and if it has a I/O module to support that type of access it can use
> > parallel I/O, otherwise it must read/write through the NFS server.  If
> > the
> > attribute doesn't exist or is NULL or something, parallel I/O does not
> > exist for this file.
>
> I agree that it is most likely that a specific file can only be read
> directly using one storage protocol.  But it is certainly possible for
> two different files under the control of the same NFSv4 server to be
> directly available through different protocols (for example, an NFSv4
> server with pNFS extensions exporting two different local file systems
> that are different types of parallel file systems.
>
> What I'd like to avoid is a round trip on each open for the client to
> determine what type of layout to request.  I'd like this to be known or
> mostly guessable before the open.  Of course, many clients will support
> only one choice, so that is the one to use :-)  But it seems not
> unlikely that a client could support SBC on iSCSI, NFS striping and OSD
> on iSCSI using the same GE NIC.
>
> >> Relation to share mode reservations
> >> Relation to byte range locks
> >> Much discussion about this, see above.  Advisory locks are orthogonal
> >> to layout information.  *However*, if a server is implementing
> >> mandatory locks, then it really must recall layouts that would allow
> >> conflicting I/O.
> > I think the only real requirement here is that clients do not end up
> > with
> > conflicting layouts.  If client A and client B do not have conflicting
> > locks BUT they have conflicting layouts, then the server should be
> > able to
> > only recall the conflicting bits of the layouts.  I'm wondering how
> > much
> > we let implementors/applications shoot themselves in the foot.  Do we
>
> I like your choice of words, because I think we let implementors shoot
> themselves in the foot, simplistically resolve all conflicts with "use
> the NFSv4 server" or extensively optimize performance for aggressive
> parallel applications.
>
> > need
> > a requirement that servers cannot return layouts that conflict with
> > outstanding locks?  My thoughts are that we should tune the protocol so
> > clients have the greatest chance of being able to use parallel I/O and
> > not
> > accidentally end up writing through the NFS server b/c their ops were
> > out
> > of order or soemthing.
>
> I agree, but I'd say it differently.  I think we want to work hard to
> make it so that implementors can do this, but also work hard to allow
> other implementors to get some of the benefit with much simpler
> implementations.  That is, we try not to use the wire protocol to force
> a specific implementation (ie., limit the implementation creativity),
> or to force the base cost of providing an implementation to be too
> high.


Agree with this strongly.

Thanks
Gary


> Yahoo! Groups Sponsor
> ADVERTISEMENT
> 4b5093.jpg
> 4b51e7.jpg
>
> Yahoo! Groups Links
>
>     * To visit your group on the web, go to:
>     * http://groups.yahoo.com/group/pnfs-reqs/
>     *  
>     * To unsubscribe from this group, send an email to:
>     * pnfs-reqs-unsubscribe@yahoogroups.com
>     *  
>     * Your use of Yahoo! Groups is subject to the Yahoo! Terms of Service. 

From dnoveck@netapp.com Sat Jun 19 07:11:36 2004
Return-Path: <Dave.Noveck@netapp.com>
X-Sender: Dave.Noveck@netapp.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 68855 invoked from network); 19 Jun 2004 14:11:35 -0000
Received: from unknown (66.218.66.172)
by m7.grp.scd.yahoo.com with QMQP; 19 Jun 2004 14:11:35 -0000
Received: from unknown (HELO mx01.netapp.com) (198.95.226.53)
by mta4.grp.scd.yahoo.com with SMTP; 19 Jun 2004 14:11:35 -0000
Received: from hawk.corp.netapp.com (hawk [10.57.156.122])
by mx01.netapp.com (8.12.10/8.12.10/NTAP-1.4) with ESMTP id i5JEBZkX010076
for <pnfs-reqs@yahoogroups.com>; Sat, 19 Jun 2004 07:11:35 -0700 (PDT)
Received: from svlexc01.hq.netapp.com (svlexc01.corp.netapp.com [10.57.156.135])
by hawk.corp.netapp.com (8.12.9/8.12.9/NTAP-1.5) with ESMTP id i5JEBZcu027647
for <pnfs-reqs@yahoogroups.com>; Sat, 19 Jun 2004 07:11:35 -0700 (PDT)
content-class: urn:content-classes:message
MIME-Version: 1.0
Content-Type: text/plain;
charset="us-ascii"
Content-Transfer-Encoding: quoted-printable
X-MimeOLE: Produced By Microsoft Exchange V6.0.6249.0
Date: Sat, 19 Jun 2004 07:11:31 -0700
Message-ID: <C8CF60CFC4D8A74E9945E32CF096548A021421D6@silver.nane.netapp.com>
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
Thread-Topic: [pnfs-reqs] summary of June face-to-face
Thread-Index: AcRU0dAkHSQOMgi/QMqFXhpt3UKnlQAf6uhQ
To: <pnfs-reqs@yahoogroups.com>
X-eGroups-Remote-IP: 198.95.226.53
X-eGroups-From: "Noveck, Dave" <Dave.Noveck@netapp.com>
From: "Noveck, Dave" <dnoveck@netapp.com>
Subject: RE: [pnfs-reqs] summary of June face-to-face
X-Yahoo-Group-Post: member; u=44152831
X-Yahoo-Profile: davidnoveck

> We discussed variations
> on this scheme where the client supplies a list of possibilities and
> the server returns the first one it supports. We had a lengthy
> discussion about whether it is possible to do this negotiation for
> the whole filesystem first, but rejected that approach
> because different files might be available through different storage
> protocols.

I wasn't there for the discussion but I have a lot of trouble
imagining circumstances where we have a single file system such
that different files are available through different sets of
storage protocols. And if there are a few, how much is this
worth? This is the kind of thing that really adds to
implementation (and especialy testing) complexity. I'd vote
for keeping things simple(r) by saying fs's have to be uniform
in this regard.

With regard to Garth's recent comments, "What I'd like to avoid
is a round trip on each open for the client to determine what
type of layout to request. I'd like this to be known or mostly
guessable before the open", I think we're OK. If the client
has opened a file on the same fs, no matter what we do, it will
be an awfully good guess that the layout types for every file on
that fs are the same. It'll be pretty close to a sure thing.
But why make the client deal with the complexity of the case
where it isn't the correct guess? There's enough needed
complexity in this endeavour that we shouldn't add to it
gratuitously.


> Much discussion about this, see above. Advisory locks are orthogonal
> to layout information. *However*, if a server is implementing
> mandatory locks, then it really must recall layouts that would allow
> conflicting I/O.

Note that if the server provides another way (e.g. through its
storage management protocol) to prevent the mandatory locks from
being violated, it doesn't have to recall the layouts since,
under those conditions, the layout would not allow conflicting
I/O.

> If the server crashes, then the clients need to get their layouts
> after it restarts. This needs to be described in the draft doc.

There's a lot of stuff in this area. When clients reboot, the
server has to recognize that they're layouts are no longer held.
We need to explan the case where the client dies an never comes
back up and where the client keeps renewing its lease but is
asked to return its layout but never does do.

> DELEGASK / DELEGRETURN
>
> I think this is an orthogonal NFSv4 extension (dubbed
> "c-NFS" for cache-coherent NFS) that would all a more sophisticated
> cache consistency implementation. This is likely to be
> controversial and we think we can get most of the value prop
> of pNFS without this.

Although I think you are right that it will be controversial,
my feeling is that the controversy will not be quite as sharp
if it is proposed as an optional feature, which it would have
to be anyway for a minor version.

I agree that this is orthogonal to pnfs and needs to be pursued
separately. Those who really want this and want to use it with
pnfs can pursue it in their own I-D and depending on how things
work out, it might wind up the same minor version.

> I think this boils it down to LAYOUTGET, LAYOUTCOMMIT, and
> LAYOUTRETURN, with some negotiation about preferred storage
> protocols and the kind of sharing the client and server expect.

Sounds like it's almost ready to ship :-)


From ggrider@lanl.gov Sat Jun 19 14:58:30 2004
Return-Path: <ggrider@lanl.gov>
X-Sender: ggrider@lanl.gov
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 846 invoked from network); 19 Jun 2004 21:58:28 -0000
Received: from unknown (66.218.66.218)
by m20.grp.scd.yahoo.com with QMQP; 19 Jun 2004 21:58:28 -0000
Received: from unknown (HELO mailwasher-b.lanl.gov) (192.16.0.25)
by mta3.grp.scd.yahoo.com with SMTP; 19 Jun 2004 21:58:28 -0000
Received: from mailrelay2.lanl.gov (localhost.localdomain [127.0.0.1])
by mailwasher-b.lanl.gov (8.12.10/8.12.10/(ccn-5)) with ESMTP id i5JLueCp014531
for <pnfs-reqs@yahoogroups.com>; Sat, 19 Jun 2004 15:56:40 -0600
Received: from cic-mail.lanl.gov (localhost.localdomain [127.0.0.1])
by mailrelay2.lanl.gov (8.12.10/8.12.10/(ccn-5)) with ESMTP id i5JLueVU020956
for <pnfs-reqs@yahoogroups.com>; Sat, 19 Jun 2004 15:56:40 -0600
Received: from cthulu.lanl.gov (vpn-client-150.lanl.gov [128.165.253.150])
by cic-mail.lanl.gov (8.12.10/8.12.10/(ccn-5)) with ESMTP id i5JLucia018317;
Sat, 19 Jun 2004 15:56:38 -0600
Message-Id: <5.2.0.9.2.20040619155417.0370c548@cic-mail.lanl.gov>
X-Sender: ggrider@cic-mail.lanl.gov
X-Mailer: QUALCOMM Windows Eudora Version 5.2.0.9
Date: Sat, 19 Jun 2004 15:56:39 -0600
To: pnfs-reqs@yahoogroups.com, <pnfs-reqs@yahoogroups.com>
In-Reply-To: <C8CF60CFC4D8A74E9945E32CF096548A021421D6@silver.nane.netap
p.com>
Mime-Version: 1.0
Content-Type: multipart/related;
type="multipart/alternative";
boundary="=====================_971937==.REL"
X-Scanned-By: MIMEDefang 2.35
X-eGroups-Remote-IP: 192.16.0.25
From: Gary Grider <ggrider@lanl.gov>
Subject: RE: [pnfs-reqs] summary of June face-to-face
X-Yahoo-Group-Post: member; u=169341185
X-Yahoo-Profile: ggriderpnfs

ADVERTISEMENT
At 07:11 AM 6/19/2004 -0700, Noveck, Dave wrote:

> > We discussed variations
> > on this scheme where the client supplies a list of possibilities and
> > the server returns the first one it supports.  We had a lengthy
> > discussion about whether it is possible to do this negotiation for
> > the whole filesystem first, but rejected that approach
> > because different files might be available through different storage
> > protocols.  
>
> I wasn't there for the discussion but I have a lot of trouble
> imagining circumstances where we have a single file system such
> that different files are available through different sets of
> storage protocols. 


I assume you mean that for the most part most files should be available via 3 methods
1) native file system
2) pNFS via one of the three pNFS methods
3) regular NFS

but it will be unlikely that the files would be available under more than one pNFS method.

Gary


>  And if there are a few, how much is this
> worth?  This is the kind of thing that really adds to
> implementation (and especialy testing) complexity.  I'd vote
> for keeping things simple(r) by saying fs's have to be uniform
> in this regard.
>
> With regard to Garth's recent comments, "What I'd like to avoid
> is a round trip on each open for the client to determine what
> type of layout to request.  I'd like this to be known or mostly
> guessable before the open", I think we're OK.  If the client
> has opened a file on the same fs, no matter what we do, it will
> be an awfully good guess that the layout types for every file on
> that fs are the same.  It'll be pretty close to a sure thing.
> But why make the client deal with the complexity of the case
> where it isn't the correct guess?  There's enough needed
> complexity in this endeavour that we shouldn't add to it
> gratuitously.
>    
>
> > Much discussion about this, see above.  Advisory locks are orthogonal
> > to layout information.  *However*, if a server is implementing
> > mandatory locks, then it really must recall layouts that would allow
> > conflicting I/O. 
>
> Note that if the server provides another way (e.g. through its
> storage management protocol) to prevent the mandatory locks from
> being violated, it doesn't have to recall the layouts since,
> under those conditions, the layout would not allow conflicting
> I/O.
>
> > If the server crashes, then the clients need to get their layouts
> > after it restarts.  This needs to be described in the draft doc.
>
> There's a lot of stuff in this area.  When clients reboot, the
> server has to recognize that they're layouts are no longer held. 
> We need to explan the case where the client dies an never comes
> back up and where the client keeps renewing its lease but is
> asked to return its layout but never does do.
>
> > DELEGASK / DELEGRETURN
> >
> > I think this is an orthogonal NFSv4 extension (dubbed
> > "c-NFS" for cache-coherent NFS) that would all a more sophisticated
> > cache consistency implementation.  This is likely to be
> > controversial and we think we can get most of the value prop
> > of pNFS without this.
>
> Although I think you are right that it will be controversial,
> my feeling is that the controversy will not be quite as sharp
> if it is proposed as an optional feature, which it would have
> to be anyway for a minor version.
>
> I agree that this is orthogonal to pnfs and needs to be pursued
> separately.  Those who really want this and want to use it with
> pnfs can pursue it in their own I-D and depending on how things
> work out, it might wind up the same minor version.
>
> > I think this boils it down to LAYOUTGET, LAYOUTCOMMIT, and
> > LAYOUTRETURN, with some negotiation about preferred storage
> > protocols and the kind of sharing the client and server expect.
>
> Sounds like it's almost ready to ship :-)
>
> Yahoo! Groups Sponsor
> ADVERTISEMENT
> ed3bb.jpg
> ed45b.jpg
>
> Yahoo! Groups Links
>
>     * To visit your group on the web, go to:
>     * http://groups.yahoo.com/group/pnfs-reqs/
>     *  
>     * To unsubscribe from this group, send an email to:
>     * pnfs-reqs-unsubscribe@yahoogroups.com
>     *  
>     * Your use of Yahoo! Groups is subject to the Yahoo! Terms of Service. 

From dnoveck@netapp.com Sat Jun 19 17:47:05 2004
Return-Path: <Dave.Noveck@netapp.com>
X-Sender: Dave.Noveck@netapp.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 60649 invoked from network); 20 Jun 2004 00:47:03 -0000
Received: from unknown (66.218.66.172)
by m7.grp.scd.yahoo.com with QMQP; 20 Jun 2004 00:47:03 -0000
Received: from unknown (HELO mx01.netapp.com) (198.95.226.53)
by mta4.grp.scd.yahoo.com with SMTP; 20 Jun 2004 00:47:02 -0000
Received: from hawk.corp.netapp.com (hawk [10.57.156.122])
by mx01.netapp.com (8.12.10/8.12.10/NTAP-1.4) with ESMTP id i5K0l2kX020783
for <pnfs-reqs@yahoogroups.com>; Sat, 19 Jun 2004 17:47:02 -0700 (PDT)
Received: from svlexc01.hq.netapp.com (svlexc01.corp.netapp.com [10.57.156.135])
by hawk.corp.netapp.com (8.12.9/8.12.9/NTAP-1.5) with ESMTP id i5K0l2cu005079
for <pnfs-reqs@yahoogroups.com>; Sat, 19 Jun 2004 17:47:02 -0700 (PDT)
content-class: urn:content-classes:message
MIME-Version: 1.0
Content-Type: multipart/related;
type="multipart/alternative";
boundary="----_=_NextPart_001_01C45660.120B7EA3"
X-MimeOLE: Produced By Microsoft Exchange V6.0.6249.0
Date: Sat, 19 Jun 2004 17:46:49 -0700
Message-ID: <C8CF60CFC4D8A74E9945E32CF096548A021421D9@silver.nane.netapp.com>
X-MS-Has-Attach: yes
X-MS-TNEF-Correlator:
Thread-Topic: [pnfs-reqs] summary of June face-to-face
Thread-Index: AcRWSJB0jNbQW15VTTKgIo4WtTSTMwAFM+7Q
To: <pnfs-reqs@yahoogroups.com>
X-eGroups-Remote-IP: 198.95.226.53
X-eGroups-From: "Noveck, Dave" <Dave.Noveck@netapp.com>
From: "Noveck, Dave" <dnoveck@netapp.com>
Subject: RE: [pnfs-reqs] summary of June face-to-face
X-Yahoo-Group-Post: member; u=44152831
X-Yahoo-Profile: davidnoveck

I assume you mean that for the most part most files should be available via 3 methods
1) native file system
 
Maybe, maybe not.  On appliance servers such as our system there is nothing that really
corresponds to local file system access as it's generally thought of.

2) pNFS via one of the three pNFS methods
 
One or more than one.  I think I was the first one to mention the possibility that servers
could make files available via more than one of these.  Whether that turns out to be
common will depend on how implementations evolve.  If there are sets of clients that
each support disjoint sets of methods (files and blocks) then it may worth it for a
server to accommodate both.

3) regular NFS
 
Certainly.

but it will be unlikely that the files would be available under more than one pNFS method.
 
If you had to make a bet (even money) about a given server selected at random, guessing
one pNFS method would be where the smart money would go.  But more than one
shouldn't be considered a real long shot.  It depends on the clients.
 
My point was (and is) that whatever choice a server makes on this issue should be considered
as a per-fs choice, and not a per-file choice.  I can't imagine a server making a different choice
for different files on the same fs and the protocol should not complicate itself to allow that
possibility. 

    -----Original Message-----
    From: Gary Grider [mailto:ggrider@lanl.gov]
    Sent: Saturday, June 19, 2004 5:57 PM
    To: pnfs-reqs@yahoogroups.com; pnfs-reqs@yahoogroups.com
    Subject: RE: [pnfs-reqs] summary of June face-to-face

    At 07:11 AM 6/19/2004 -0700, Noveck, Dave wrote:

>     > We discussed variations
>     > on this scheme where the client supplies a list of possibilities and
>     > the server returns the first one it supports.  We had a lengthy
>     > discussion about whether it is possible to do this negotiation for
>     > the whole filesystem first, but rejected that approach
>     > because different files might be available through different storage
>     > protocols.  
>
>     I wasn't there for the discussion but I have a lot of trouble
>     imagining circumstances where we have a single file system such
>     that different files are available through different sets of
>     storage protocols. 


    I assume you mean that for the most part most files should be available via 3 methods
    1) native file system
    2) pNFS via one of the three pNFS methods
    3) regular NFS

    but it will be unlikely that the files would be available under more than one pNFS method.

    Gary


>      And if there are a few, how much is this
>     worth?  This is the kind of thing that really adds to
>     implementation (and especialy testing) complexity.  I'd vote
>     for keeping things simple(r) by saying fs's have to be uniform
>     in this regard.
>
>     With regard to Garth's recent comments, "What I'd like to avoid
>     is a round trip on each open for the client to determine what
>     type of layout to request.  I'd like this to be known or mostly
>     guessable before the open", I think we're OK.  If the client
>     has opened a file on the same fs, no matter what we do, it will
>     be an awfully good guess that the layout types for every file on
>     that fs are the same.  It'll be pretty close to a sure thing.
>     But why make the client deal with the complexity of the case
>     where it isn't the correct guess?  There's enough needed
>     complexity in this endeavour that we shouldn't add to it
>     gratuitously.
>        
>
>     > Much discussion about this, see above.  Advisory locks are orthogonal
>     > to layout information.  *However*, if a server is implementing
>     > mandatory locks, then it really must recall layouts that would allow
>     > conflicting I/O. 
>
>     Note that if the server provides another way (e.g. through its
>     storage management protocol) to prevent the mandatory locks from
>     being violated, it doesn't have to recall the layouts since,
>     under those conditions, the layout would not allow conflicting
>     I/O.
>
>     > If the server crashes, then the clients need to get their layouts
>     > after it restarts.  This needs to be described in the draft doc.
>
>     There's a lot of stuff in this area.  When clients reboot, the
>     server has to recognize that they're layouts are no longer held. 
>     We need to explan the case where the client dies an never comes
>     back up and where the client keeps renewing its lease but is
>     asked to return its layout but never does do.
>
>     > DELEGASK / DELEGRETURN
>     >
>     > I think this is an orthogonal NFSv4 extension (dubbed
>     > "c-NFS" for cache-coherent NFS) that would all a more sophisticated
>     > cache consistency implementation.  This is likely to be
>     > controversial and we think we can get most of the value prop
>     > of pNFS without this.
>
>     Although I think you are right that it will be controversial,
>     my feeling is that the controversy will not be quite as sharp
>     if it is proposed as an optional feature, which it would have
>     to be anyway for a minor version.
>
>     I agree that this is orthogonal to pnfs and needs to be pursued
>     separately.  Those who really want this and want to use it with
>     pnfs can pursue it in their own I-D and depending on how things
>     work out, it might wind up the same minor version.
>
>     > I think this boils it down to LAYOUTGET, LAYOUTCOMMIT, and
>     > LAYOUTRETURN, with some negotiation about preferred storage
>     > protocols and the kind of sharing the client and server expect.
>
>     Sounds like it's almost ready to ship :-)
>
>     Yahoo! Groups Sponsor
>     ADVERTISEMENT
>     ed3bb.jpg
>     ed45b.jpg
>
>     Yahoo! Groups Links
>
>         * To visit your group on the web, go to:
>         * http://groups.yahoo.com/group/pnfs-reqs/
>        *
>         * To unsubscribe from this group, send an email to:
>         * pnfs-reqs-unsubscribe@yahoogroups.com
>        *
>         * Your use of Yahoo! Groups is subject to the Yahoo! Terms of Service. 

From garth@panasas.com Sat Jun 19 21:36:14 2004
Return-Path: <garth@panasas.com>
X-Sender: garth@panasas.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 66720 invoked from network); 20 Jun 2004 04:36:13 -0000
Received: from unknown (66.218.66.166)
by m24.grp.scd.yahoo.com with QMQP; 20 Jun 2004 04:36:13 -0000
Received: from unknown (HELO PIKES.panasas.com) (65.194.124.178)
by mta5.grp.scd.yahoo.com with SMTP; 20 Jun 2004 04:36:13 -0000
Received: from [127.0.0.1] ([172.17.19.3]) by PIKES.panasas.com with SMTP (Microsoft Exchange Internet Mail Service Version 5.5.2653.13)
id 2B56SCL7; Sun, 20 Jun 2004 00:36:11 -0400
Mime-Version: 1.0 (Apple Message framework v618)
In-Reply-To: <C8CF60CFC4D8A74E9945E32CF096548A021421D9@silver.nane.netapp.com>
References: <C8CF60CFC4D8A74E9945E32CF096548A021421D9@silver.nane.netapp.com>
Content-Type: text/plain; charset=WINDOWS-1252; format=flowed
Message-Id: <5A10EE7C-C273-11D8-AC74-000A95A94F04@panasas.com>
Content-Transfer-Encoding: quoted-printable
Date: Sun, 20 Jun 2004 00:36:09 -0400
To: pnfs-reqs@yahoogroups.com
X-Mailer: Apple Mail (2.618)
X-eGroups-Remote-IP: 65.194.124.178
From: Garth Gibson <garth@panasas.com>
Subject: Re: [pnfs-reqs] summary of June face-to-face
X-Yahoo-Group-Post: member; u=169457820
X-Yahoo-Profile: garth_a_gibson

I agree that typing a filesystem (as defined by files sharing the same
fsid) as supporting the same pNFS storage protocol(s) is reasonable and
effective. This does not prohibit a single NFSv4 server from managing
multiple different pNFS storage protocols in different exported
filesystems, nor does it prohibit a single client from being capable of
directly accessing files of different storage protocols.

We still need a method of discovering the pNFS storage protocol(s) for
a given filesystem. An NFS attribute on all member files will do,
though I'd like the client implementation to be able to simply inspect
the attributes of the root of the filesystem to learn the appropriate
storage protocols.

garth


On Jun 19, 2004, at 8:46 PM, Noveck, Dave wrote:

> I assume you mean that for the most part most files should be
> available via 3 methods
> 1) native file system
>  
> Maybe, maybe not.  On appliance servers such as our system there is
> nothing that really
> corresponds to local file system access as it's generally thought of.
>
>
> 2) pNFS via one of the three pNFS methods
>  
> One or more than one.  I think I was the first one to mention the
> possibility that servers
> could make files available via more than one of these.  Whether that
> turns out to be
> common will depend on how implementations evolve.  If there are sets
> of clients that
> each support disjoint sets of methods (files and blocks) then it may
> worth it for a
> server to accommodate both.
>
>
> 3) regular NFS
>  
> Certainly.
>
>
> but it will be unlikely that the files would be available under more
> than one pNFS method.
>  
> If you had to make a bet (even money) about a given server selected at
> random, guessing
> one pNFS method would be where the smart money would go.  But more
> than one
> shouldn't be considered a real long shot.  It depends on the clients.
>  
> My point was (and is) that whatever choice a server makes on this
> issue should be considered
> as a per-fs choice, and not a per-file choice.  I can't imagine a
> server making a different choice
> for different files on the same fs and the protocol should not
> complicate itself to allow that
> possibility. 
>
>
> -----Original Message-----
> From: Gary Grider [mailto:ggrider@lanl.gov]
> Sent: Saturday, June 19, 2004 5:57 PM
> To: pnfs-reqs@yahoogroups.com; pnfs-reqs@yahoogroups.com
> Subject: RE: [pnfs-reqs] summary of June face-to-face
>
> At 07:11 AM 6/19/2004 -0700, Noveck, Dave wrote:
>
> > We discussed variations
> > on this scheme where the client supplies a list of possibilities and
> > the server returns the first one it supports.  We had a lengthy
> > discussion about whether it is possible to do this negotiation for
> > the whole filesystem first, but rejected that approach
> > because different files might be available through different storage
> > protocols.  
>
> I wasn't there for the discussion but I have a lot of trouble
> imagining circumstances where we have a single file system such
> that different files are available through different sets of
> storage protocols.
>
> I assume you mean that for the most part most files should be
> available via 3 methods
> 1) native file system
> 2) pNFS via one of the three pNFS methods
> 3) regular NFS
>
> but it will be unlikely that the files would be available under more
> than one pNFS method.
>
> Gary
>
>
>
>  And if there are a few, how much is this
> worth?  This is the kind of thing that really adds to
> implementation (and especialy testing) complexity.  I'd vote
> for keeping things simple(r) by saying fs's have to be uniform
> in this regard.
>
> With regard to Garth's recent comments, "What I'd like to avoid
> is a round trip on each open for the client to determine what
> type of layout to request.  I'd like this to be known or mostly
> guessable before the open", I think we're OK.  If the client
> has opened a file on the same fs, no matter what we do, it will
> be an awfully good guess that the layout types for every file on
> that fs are the same.  It'll be pretty close to a sure thing.
> But why make the client deal with the complexity of the case
> where it isn't the correct guess?  There's enough needed
> complexity in this endeavour that we shouldn't add to it
> gratuitously.
>    
>
> > Much discussion about this, see above.  Advisory locks are
> orthogonal
> > to layout information.  *However*, if a server is implementing
> > mandatory locks, then it really must recall layouts that would allow
> > conflicting I/O. 
>
> Note that if the server provides another way (e.g. through its
> storage management protocol) to prevent the mandatory locks from
> being violated, it doesn't have to recall the layouts since,
> under those conditions, the layout would not allow conflicting
> I/O.
>
> > If the server crashes, then the clients need to get their layouts
> > after it restarts.  This needs to be described in the draft doc.
>
> There's a lot of stuff in this area.  When clients reboot, the
> server has to recognize that they're layouts are no longer held. 
> We need to explan the case where the client dies an never comes
> back up and where the client keeps renewing its lease but is
> asked to return its layout but never does do.
>
> > DELEGASK / DELEGRETURN
> >
> > I think this is an orthogonal NFSv4 extension (dubbed
> > "c-NFS" for cache-coherent NFS) that would all a more sophisticated
> > cache consistency implementation.  This is likely to be
> > controversial and we think we can get most of the value prop
> > of pNFS without this.
>
> Although I think you are right that it will be controversial,
> my feeling is that the controversy will not be quite as sharp
> if it is proposed as an optional feature, which it would have
> to be anyway for a minor version.
>
> I agree that this is orthogonal to pnfs and needs to be pursued
> separately.  Those who really want this and want to use it with
> pnfs can pursue it in their own I-D and depending on how things
> work out, it might wind up the same minor version.
>
> > I think this boils it down to LAYOUTGET, LAYOUTCOMMIT, and
> > LAYOUTRETURN, with some negotiation about preferred storage
> > protocols and the kind of sharing the client and server expect.
>
> Sounds like it's almost ready to ship :-)
>
> Yahoo! Groups Sponsor
> ADVERTISEMENT
> <image.tiff>
> <image.tiff>
>
>
> Yahoo! Groups Links
> � To visit your group on the web, go to:
> � http://groups.yahoo.com/group/pnfs-reqs/
> � � To unsubscribe from this group, send an email to:
> � pnfs-reqs-unsubscribe@yahoogroups.com
> � � Your use of Yahoo! Groups is subject to the Yahoo! Terms of
> Service.
>
>
>
>
> Yahoo! Groups Sponsor
>
> ADVERTISEMENT
> <image.tiff>
> <image.tiff>
>
> Yahoo! Groups Links
>
> � To visit your group on the web, go to:
> http://groups.yahoo.com/group/pnfs-reqs/
>  
> � To unsubscribe from this group, send an email to:
> pnfs-reqs-unsubscribe@yahoogroups.com
>  
> � Your use of Yahoo! Groups is subject to the Yahoo! Terms of
> Service.
>
>


From bwelch@panasas.com Sun Jun 20 21:01:48 2004
Return-Path: <welch@panasas.com>
X-Sender: welch@panasas.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 14264 invoked from network); 21 Jun 2004 04:01:47 -0000
Received: from unknown (66.218.66.216)
by m2.grp.scd.yahoo.com with QMQP; 21 Jun 2004 04:01:47 -0000
Received: from unknown (HELO medlicott.panasas.com) (63.80.58.202)
by mta1.grp.scd.yahoo.com with SMTP; 21 Jun 2004 04:01:47 -0000
Received: from panasas.com (welch@localhost)
by medlicott.panasas.com (8.11.6/8.11.6) with ESMTP id i5L41kx18199
for <pnfs-reqs@yahoogroups.com>; Sun, 20 Jun 2004 21:01:46 -0700
Message-Id: <200406210401.i5L41kx18199@medlicott.panasas.com>
X-Authentication-Warning: medlicott.panasas.com: welch owned process doing -bs
X-Mailer: exmh version 2.6.4 06/14/2004 with nmh-1.0.4
X-Exmh-Isig-CompType: repl
X-Exmh-Isig-Folder: pnfs
To: pnfs-reqs@yahoogroups.com
In-reply-to: <Pine.LNX.4.58.0406181115150.1687@willow.eecs.umich.edu>
References: <200406180115.i5I1Fq118244@medlicott.panasas.com>
<Pine.LNX.4.58.0406181115150.1687@willow.eecs.umich.edu>
Comments: In-reply-to Dean Hildebrand <dhildebz@eecs.umich.edu>
message dated "Fri, 18 Jun 2004 11:46:54 -0400."
X-URL: http://www.panasas.com/
X-Face: "HxE|?EnC9fVMV8f70H83&{fgLE.|FZ^$>@Q(yb#N,Eh~N]e&]=>
r5~UnRml1:4EglY{9B+
:'wJq$@c_C!l8@<$t,{YUr4K,QJGHSvS~U]H`<+L*x?eGzSk>XH\W:AK\j?@?c1o<k;j'Ei/UL)!*0
ILwSR)J\bc)gjz!rrGQ2#i*f:M:ydhK}jp4dWQW?;0{,#iWrCV$4~%e/3)$1/D
Mime-Version: 1.0
Content-Type: text/plain
Date: Sun, 20 Jun 2004 21:01:46 -0700
X-eGroups-Remote-IP: 63.80.58.202
X-eGroups-From: Brent Welch <welch@panasas.com>
From: Brent Welch <bwelch@panasas.com>
Subject: Re: [pnfs-reqs] summary of June face-to-face
X-Yahoo-Group-Post: member; u=169551413
X-Yahoo-Profile: brent_welch_1960

>>>Dean Hildebrand said:
> This is great, thanks to Brian and Brent.
>
> > Callbacks and pending callbacks
> >
> > The draft-00 had a notion of a pending callback and a notification
> > that a layout was available. We think we can live without this.
> > Instead, the server just gives out layouts and recalls them. If
> > a client does not get a layout, then it retries later, or it uses
> > regular READ/WRITE operations through the server instead.
> Just to understand this a little better, are we essentially adding
> additional functionality to existing parallel file systems with the
> ability for the client to write to the NFS server? Is it true that in
> existing parallel file systems, if a client requests a conflicting byte
> range (assuming the client cares about conflicts) it is essentially
> halted
> until the conflict is resolved?

That's a two-part question. For the first, what about existing parallel
filesystems, we are postulating a new NFS interface to them. They'll have
to export the NFSv4 interface somehow, including "legacy" READ/WRITE
as well as the new LAYOUTGET et. al.

For the second, if clients request conflicting layouts the server
serializes them somehow. Some servers export async interfaces so a
client can essentially block waiting for the layout. Other servers
return a failure and force the client to retry.
It is up to the server implementation.

> > Storage Protocol Negotiation
> >
> > The client needs to be able to ask for particular kinds of layouts
> > that involve particular storage protocols. The LAYOUTGET operation
> > will include a storage protocol type, and the server will fail to
> > return a layout if it cannot give it that type.
> >
> > We discussed variations
> > on this scheme where the client supplies a list of possibilities and
> > the server returns the first one it supports. We had a lengthy
> > discussion about whether it is possible to do this negotiation for
> > the whole filesystem first, but rejected that approach
> > because different files might be available through different storage
> > protocols. I believe we postulated an attribute on the file
> that
> > identified the "preferred" storage protocol that a client could
> > use to access that file.
> I'm wondering if this is unecessary. If a file exists then the
> preferred
> type of file system for access must be available (otherwise who is
> storing
> the file?). Using the attribute idea, the client requests the
> attribute
> and if it has a I/O module to support that type of access it can use
> parallel I/O, otherwise it must read/write through the NFS server. If
> the
> attribute doesn't exist or is NULL or something, parallel I/O does not
> exist for this file.

The point is that there could be multiple storage protocols that can
be used to access a file (e.g., both object and file. Ultimately there
may be many storage protocols). Also, you don't necessarily want to
force the client to do a getattr to get that attribute
before it requests the layout.

> > Relation to share mode reservations
> > Relation to byte range locks
> > Much discussion about this, see above. Advisory locks are
> orthogonal
> > to layout information. *However*, if a server is implementing
> > mandatory locks, then it really must recall layouts that would allow
> > conflicting I/O.
> I think the only real requirement here is that clients do not end up
> with
> conflicting layouts. If client A and client B do not have conflicting
> locks BUT they have conflicting layouts, then the server should be able
> to
> only recall the conflicting bits of the layouts. I'm wondering how
> much
> we let implementors/applications shoot themselves in the foot. Do we
> need
> a requirement that servers cannot return layouts that conflict with
> outstanding locks? My thoughts are that we should tune the protocol so
> clients have the greatest chance of being able to use parallel I/O and
> not
> accidentally end up writing through the NFS server b/c their ops were
> out
> of order or soemthing.

Here the main point is that if you can ignore mandatory locking, then
we can keep the layouts independent of locking (and data delegations).
If the clients are using advisory locks, then we assume all clients are
participating in the locking protocol and we can ignore what layouts
they have. Hidden in your question is an interesting point, however,
which is the ability of the server to recall conflicting ranges of
the layout. We don't have anything like that in the protocol right
now. The nearest thing we have is that when a client asks for a
write (or read) layout the server may return the layout for a smaller
range. For now a recall is for the complete layout.

--
Brent Welch
Software Architect, Panasas Inc
Delivering the premier storage system for scalable Linux clusters

www.panasas.com
welch@panasas.com

From dhildebz@eecs.umich.edu Mon Jun 21 09:31:34 2004
Return-Path: <dhildebz@eecs.umich.edu>
X-Sender: dhildebz@eecs.umich.edu
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 32096 invoked from network); 21 Jun 2004 16:31:32 -0000
Received: from unknown (66.218.66.217)
by m24.grp.scd.yahoo.com with QMQP; 21 Jun 2004 16:31:32 -0000
Received: from unknown (HELO willow.eecs.umich.edu) (141.213.4.14)
by mta2.grp.scd.yahoo.com with SMTP; 21 Jun 2004 16:31:32 -0000
Received: from willow.eecs.umich.edu (localhost.eecs.umich.edu [127.0.0.1])
by willow.eecs.umich.edu (8.12.11/8.12.11) with ESMTP id i5LGVR5H005731
(version=TLSv1/SSLv3 cipher=DHE-RSA-AES256-SHA bits=256 verify=NO)
for <pnfs-reqs@yahoogroups.com>; Mon, 21 Jun 2004 12:31:28 -0400
Received: from localhost (dhildebz@localhost)
by willow.eecs.umich.edu (8.12.11/8.12.11/Submit) with ESMTP id i5LGVRIi005728
for <pnfs-reqs@yahoogroups.com>; Mon, 21 Jun 2004 12:31:27 -0400
X-Authentication-Warning: willow.eecs.umich.edu: dhildebz owned process doing -bs
Date: Mon, 21 Jun 2004 12:31:27 -0400 (EDT)
To: pnfs-reqs@yahoogroups.com
In-Reply-To: <200406210401.i5L41kx18199@medlicott.panasas.com>
Message-ID: <Pine.LNX.4.58.0406211140280.4629@willow.eecs.umich.edu>
References: <200406180115.i5I1Fq118244@medlicott.panasas.com>
<Pine.LNX.4.58.0406181115150.1687@willow.eecs.umich.edu>
<200406210401.i5L41kx18199@medlicott.panasas.com>
MIME-Version: 1.0
Content-Type: TEXT/PLAIN; charset=iso-8859-1
Content-Transfer-Encoding: QUOTED-PRINTABLE
X-eGroups-Remote-IP: 141.213.4.14
From: Dean Hildebrand <dhildebz@eecs.umich.edu>
Subject: Re: [pnfs-reqs] summary of June face-to-face
X-Yahoo-Group-Post: member; u=169352062
X-Yahoo-Profile: seattleplus

<snip>
> The point is that there could be multiple storage protocols that can
> be used to access a file (e.g., both object and file.  Ultimately there
> may be many storage protocols).  Also, you don't necessarily want to
> force the client to do a getattr to get that attribute
> before it requests the layout.
As Garth mentioned, 1 supported parallel storage protocol per fsid seems
reasonable to me. The NFSv4 protocol already does a GETATTR on a file,
so there isn't any additional overhead. There would be an assumption that
the protocol does not change on the fly (which would be amazing sysadmin
magic anyways) and that the protocol returned on a GETATTR is valid during
the current incarnation of the server.

As we know, a single mount point on the client does not correspond to a
single fsid as directories down the tree may have different fsid's. I'm
not sure if an NFSv4 client can determine when this occurs, so simply
adding the attribute to the GETATTR which proceeds an OPEN will catch the
change in protocol (all done in the same compound of course).

I also propose we change LAYOUTGET to GETLAYOUT to correspond with
GETATTR.

<snip>
> Here the main point is that if you can ignore mandatory locking, then
> we can keep the layouts independent of locking (and data delegations).
> If the clients are using advisory locks, then we assume all clients are
> participating in the locking protocol and we can ignore what layouts
> they have. Hidden in your question is an interesting point, however,
> which is the ability of the server to recall conflicting ranges of
> the layout.  We don't have anything like that in the protocol right
> now.  The nearest thing we have is that when a client asks for a
> write (or read) layout the server may return the layout for a smaller
> range.  For now a recall is for the complete layout.

I didn't think it was hidden, but out in front in a polite way.:) I guess
a client, after having its layout recalled, can request a new smaller
layout. It seems a little more efficient to simply recall the conflicting
parts. Since the server may return larger layouts than requested, this
seems like a common need. (Isn't this similar to Lustre and many others'
lock management scheme?)

This discussion seems to bring up an issue: We will support a group id
with layouts but not with locks. Wouldn't it be better (protocol wise) to
support a group lock command than create a back door method (or have
both)? Is it important to match consistency abilities (exclusive, group,
etc) between locks and layouts or do we see layouts as a whole new way of
achieving consistency?

Dean Hildebrand
University of Michigan


From dnoveck@netapp.com Mon Jun 21 10:59:50 2004
Return-Path: <Dave.Noveck@netapp.com>
X-Sender: Dave.Noveck@netapp.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 26513 invoked from network); 21 Jun 2004 17:59:49 -0000
Received: from unknown (66.218.66.172)
by m20.grp.scd.yahoo.com with QMQP; 21 Jun 2004 17:59:49 -0000
Received: from unknown (HELO mx01.netapp.com) (198.95.226.53)
by mta4.grp.scd.yahoo.com with SMTP; 21 Jun 2004 17:59:49 -0000
Received: from frejya.corp.netapp.com (frejya [10.57.157.119])
by mx01.netapp.com (8.12.10/8.12.10/NTAP-1.4) with ESMTP id i5LHxmkX022089
for <pnfs-reqs@yahoogroups.com>; Mon, 21 Jun 2004 10:59:48 -0700 (PDT)
Received: from svlexc01.hq.netapp.com (svlexc01.corp.netapp.com [10.57.156.135])
by frejya.corp.netapp.com (8.12.9/8.12.9/NTAP-1.5) with ESMTP id i5LHxmmb024772
for <pnfs-reqs@yahoogroups.com>; Mon, 21 Jun 2004 10:59:48 -0700 (PDT)
content-class: urn:content-classes:message
MIME-Version: 1.0
Content-Type: text/plain;
charset="iso-8859-1"
Content-Transfer-Encoding: quoted-printable
X-MimeOLE: Produced By Microsoft Exchange V6.0.6249.0
Date: Mon, 21 Jun 2004 10:59:42 -0700
Message-ID: <C8CF60CFC4D8A74E9945E32CF096548A01C8EE34@silver.nane.netapp.com>
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
Thread-Topic: [pnfs-reqs] summary of June face-to-face
Thread-Index: AcRXrTj2HGbeWnAVS0GvoVwhs08LzwABOqHw
To: <pnfs-reqs@yahoogroups.com>
X-eGroups-Remote-IP: 198.95.226.53
X-eGroups-From: "Noveck, Dave" <Dave.Noveck@netapp.com>
From: "Noveck, Dave" <dnoveck@netapp.com>
Subject: RE: [pnfs-reqs] summary of June face-to-face
X-Yahoo-Group-Post: member; u=44152831
X-Yahoo-Profile: davidnoveck

Dean Hildebrand wrote:
> <snip>
> > The point is that there could be multiple storage protocols that can
> > be used to access a file (e.g., both object and file.  Ultimately there
> > may be many storage protocols).  Also, you don't necessarily want to
> > force the client to do a getattr to get that attribute
> > before it requests the layout.
> As Garth mentioned, 1 supported parallel storage protocol per fsid seems
> reasonable to me.

Huh? Garth's words were "I agree that typing a filesystem (as defined
by files sharing the same fsid) as supporting the same pNFS storage
protocol(s) is reasonable and effective." I interpret "protocol(s)"
to indicate "one or more than one", not "1 supported storage protocol
per fsid".

> The NFSv4 protocol already does a GETATTR on a file,
> so there isn't any additional overhead.

If you're going to do this on a per-fsid basis (which I thought
you said you were OK with) then you don't need to do the GETATTR
to determine the protocol. If you do this on a per-file basic
then you might have additional overhead. If you know the protocol
in advance, then you can do a LAYOUTGET compounded to the OPEN
(and the GETATTR) whereas if this is a per-file option then you
have to wait for GETATTR to return before issuing the LAYOUTGET.

> There would be an assumption that
> the protocol does not change on the fly (which would be amazing
> sysadmin magic anyways)

I don't know what you mean by an "assumption" here. Either
the server is allowed to do this within the protocol or it
isn't. If it is allowed, then in coding and testing we have
to assume that it will happen or else everybody's code will
fail in interesting ways when it does.

> and that the protocol returned on a GETATTR is valid during
> the current incarnation of the server.

The problem here is in determining "the current incarnation".
If we do a GETATTR before we have done a SETCLIENTID, which is
legal, then I, the client, have no way to determine with which
incarnation the GETATTR is associated.

So here's my best guess about how to deal with this issue:

If a server changes the set of support protocols, it is
under no obligation to notify clients of additional
supported protocols. Clients may if they choose,
periodically interrogate the supported protocol attribute
for an fs, if they think that they might usefully respond
to a change.

If a server makes a previously supported protocol non-
supported, it should respond to LAYOUTGET with a
distinctive error code that indicate that the layout
type is no longer supported. This will allow clients
to do a GETATTR to determine the new list of support
protocols for that fs.

A server which has layouts outstanding and then removes
the protocol associated with the layout from the list
of supported protocols, may, if it wishes, continue to
support the existing layouts while refusing to create
any new ones (and it must so refuse once that protocol
is no longer in the list). It may also recall the
layouts for the no longer supported protocols. In that
latter case, a client would presumably try to reget a
layout of the previous type and thereby find out that
that protocol is no longer supported and then do a GETATTR
to get the new current list.


> As we know, a single mount point on the client does not correspond to a
> single fsid as directories down the tree may have different fsid's. I'm
> not sure if an NFSv4 client can determine when this occurs, so simply
> adding the attribute to the GETATTR which proceeds an OPEN will catch the
> change in protocol (all done in the same compound of course).

If he can do a GETATTR to see when the protocol changes, he can do a
GETATTR to see when the fsid change. If the protocol doesn't change,
he still wants to know when he fsid changes (fileid uniqueness domains,
fs_locations values, etc.).

> I also propose we change LAYOUTGET to GETLAYOUT to correspond with
> GETATTR.

There is also DELEGRETURN so we can't match everything we'd like to.

Why don't we flip a coin to decide this at the next ftf meeting.

From bwelch@panasas.com Thu Jun 24 08:17:02 2004
Return-Path: <welch@panasas.com>
X-Sender: welch@panasas.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 20490 invoked from network); 24 Jun 2004 15:17:01 -0000
Received: from unknown (66.218.66.172)
by m20.grp.scd.yahoo.com with QMQP; 24 Jun 2004 15:17:01 -0000
Received: from unknown (HELO medlicott.panasas.com) (63.80.58.202)
by mta4.grp.scd.yahoo.com with SMTP; 24 Jun 2004 15:17:01 -0000
Received: from panasas.com (welch@localhost)
by medlicott.panasas.com (8.11.6/8.11.6) with ESMTP id i5OFAfh09615
for <pnfs-reqs@yahoogroups.com>; Thu, 24 Jun 2004 08:10:41 -0700
Message-Id: <200406241510.i5OFAfh09615@medlicott.panasas.com>
X-Authentication-Warning: medlicott.panasas.com: welch owned process doing -bs
To: pnfs-reqs@yahoogroups.com
In-reply-to: <Pine.LNX.4.58.0406211140280.4629@willow.eecs.umich.edu>
References: <200406180115.i5I1Fq118244@medlicott.panasas.com> <Pine.LNX.4.58.0406181115150.1687@willow.eecs.umich.edu> <200406210401.i5L41kx18199@medlicott.panasas.com> <Pine.LNX.4.58.0406211140280.4629@willow.eecs.umich.edu>
Comments: In-reply-to Dean Hildebrand <dhildebz@eecs.umich.edu>
message dated "Mon, 21 Jun 2004 12:31:27 -0400."
X-URL: http://www.panasas.com/
X-Face: "HxE|?EnC9fVMV8f70H83&{fgLE.|FZ^$>@Q(yb#N,Eh~N]e&]=>r5~UnRml1:4EglY{9B+
:'wJq$@c_C!l8@<$t,{YUr4K,QJGHSvS~U]H`<+L*x?eGzSk>XH\W:AK\j?@?c1o<k;j'Ei/UL)!*0
ILwSR)J\bc)gjz!rrGQ2#i*f:M:ydhK}jp4dWQW?;0{,#iWrCV$4~%e/3)$1/D
MIME-Version: 1.0
Content-Type: text/plain; charset="us-ascii"
Content-ID: <9613.1088089841.1@panasas.com>
Date: Thu, 24 Jun 2004 08:10:41 -0700
X-eGroups-Remote-IP: 63.80.58.202
X-eGroups-From: Brent Welch <welch@panasas.com>
From: Brent Welch <bwelch@panasas.com>
Subject: Re: [pnfs-reqs] summary of June face-to-face
X-Yahoo-Group-Post: member; u=169551413
X-Yahoo-Profile: brent_welch_1960

>>>Dean Hildebrand said:
> <snip>
> > The point is that there could be multiple storage protocols that can
> > be used to access a file (e.g., both object and file. Ultimately
> there
> > may be many storage protocols). Also, you don't necessarily want
> to
> > force the client to do a getattr to get that attribute
> > before it requests the layout.
> As Garth mentioned, 1 supported parallel storage protocol per fsid
> seems
> reasonable to me. The NFSv4 protocol already does a GETATTR on a file,
> so there isn't any additional overhead. There would be an assumption
> that
> the protocol does not change on the fly (which would be amazing
> sysadmin
> magic anyways) and that the protocol returned on a GETATTR is valid
> during
> the current incarnation of the server.
>
> As we know, a single mount point on the client does not correspond to a
> single fsid as directories down the tree may have different fsid's.
> I'm
> not sure if an NFSv4 client can determine when this occurs, so simply
> adding the attribute to the GETATTR which proceeds an OPEN will catch
> the
> change in protocol (all done in the same compound of course).

That's fine - I don't feel super strongly about this.

> I also propose we change LAYOUTGET to GETLAYOUT to correspond with
> GETATTR.

Hmm - I'm inclined to keep LAYOUT as the prefix for all the ops in this extension.

> <snip>
> > Here the main point is that if you can ignore mandatory locking, then
> > we can keep the layouts independent of locking (and data
> delegations).
> > If the clients are using advisory locks, then we assume all clients
> are
> > participating in the locking protocol and we can ignore what layouts
> > they have. Hidden in your question is an interesting point, however,
> > which is the ability of the server to recall conflicting ranges of
> > the layout. We don't have anything like that in the protocol right
> > now. The nearest thing we have is that when a client asks for a
> > write (or read) layout the server may return the layout for a smaller
> > range. For now a recall is for the complete layout.
>
> I didn't think it was hidden, but out in front in a polite way.:) I
> guess
> a client, after having its layout recalled, can request a new smaller
> layout. It seems a little more efficient to simply recall the
> conflicting
> parts. Since the server may return larger layouts than requested, this
> seems like a common need. (Isn't this similar to Lustre and many
> others'
> lock management scheme?)

Let's not confuse layouts with locking. I think the server will be much
simpler if it doesn't have to try and recall parts of a layout.

> This discussion seems to bring up an issue: We will support a group id
> with layouts but not with locks. Wouldn't it be better (protocol wise)
> to
> support a group lock command than create a back door method (or have
> both)? Is it important to match consistency abilities (exclusive,
> group,
> etc) between locks and layouts or do we see layouts as a whole new way
> of
> achieving consistency?

No - let's not confuse layouts with locking. I think it will be better
all around to keep them independent.

> Dean Hildebrand
> University of Michigan

--
Brent Welch
Software Architect, Panasas Inc
Delivering the premier storage system for scalable Linux clusters

www.panasas.com
welch@panasas.com


From bwelch@panasas.com Thu Jun 24 08:25:07 2004
Return-Path: <welch@panasas.com>
X-Sender: welch@panasas.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 94963 invoked from network); 24 Jun 2004 15:25:05 -0000
Received: from unknown (66.218.66.166)
by m1.grp.scd.yahoo.com with QMQP; 24 Jun 2004 15:25:05 -0000
Received: from unknown (HELO medlicott.panasas.com) (63.80.58.202)
by mta5.grp.scd.yahoo.com with SMTP; 24 Jun 2004 15:25:04 -0000
Received: from panasas.com (welch@localhost)
by medlicott.panasas.com (8.11.6/8.11.6) with ESMTP id i5OFLjM09728
for <pnfs-reqs@yahoogroups.com>; Thu, 24 Jun 2004 08:21:45 -0700
Message-Id: <200406241521.i5OFLjM09728@medlicott.panasas.com>
X-Authentication-Warning: medlicott.panasas.com: welch owned process doing -bs
To: pnfs-reqs@yahoogroups.com
In-reply-to: <C8CF60CFC4D8A74E9945E32CF096548A01C8EE34@silver.nane.netapp.com>
References: <C8CF60CFC4D8A74E9945E32CF096548A01C8EE34@silver.nane.netapp.com>
Comments: In-reply-to "Noveck, Dave" <dnoveck@netapp.com>
message dated "Mon, 21 Jun 2004 10:59:42 -0700."
X-URL: http://www.panasas.com/
X-Face: "HxE|?EnC9fVMV8f70H83&{fgLE.|FZ^$>@Q(yb#N,Eh~N]e&]=>r5~UnRml1:4EglY{9B+
:'wJq$@c_C!l8@<$t,{YUr4K,QJGHSvS~U]H`<+L*x?eGzSk>XH\W:AK\j?@?c1o<k;j'Ei/UL)!*0
ILwSR)J\bc)gjz!rrGQ2#i*f:M:ydhK}jp4dWQW?;0{,#iWrCV$4~%e/3)$1/D
MIME-Version: 1.0
Content-Type: text/plain; charset="us-ascii"
Content-ID: <9726.1088090505.1@panasas.com>
Date: Thu, 24 Jun 2004 08:21:45 -0700
X-eGroups-Remote-IP: 63.80.58.202
X-eGroups-From: Brent Welch <welch@panasas.com>
From: Brent Welch <bwelch@panasas.com>
Subject: Re: [pnfs-reqs] summary of June face-to-face
X-Yahoo-Group-Post: member; u=169551413
X-Yahoo-Profile: brent_welch_1960

ADVERTISEMENT
click here
>>>"Noveck, Dave" said:
> So here's my best guess about how to deal with this issue:
>
> If a server changes the set of support protocols, it is
> under no obligation to notify clients of additional
> supported protocols. Clients may if they choose,
> periodically interrogate the supported protocol attribute
> for an fs, if they think that they might usefully respond
> to a change.
>
> If a server makes a previously supported protocol non-
> supported, it should respond to LAYOUTGET with a
> distinctive error code that indicate that the layout
> type is no longer supported. This will allow clients
> to do a GETATTR to determine the new list of support
> protocols for that fs.

Is there precident for an attribute that has a list of values?
I'm OK with that, but I had assumed that would be awkward.

> A server which has layouts outstanding and then removes
> the protocol associated with the layout from the list
> of supported protocols, may, if it wishes, continue to
> support the existing layouts while refusing to create
> any new ones (and it must so refuse once that protocol
> is no longer in the list). It may also recall the
> layouts for the no longer supported protocols. In that
> latter case, a client would presumably try to reget a
> layout of the previous type and thereby find out that
> that protocol is no longer supported and then do a GETATTR
> to get the new current list.

--
Brent Welch
Software Architect, Panasas Inc
Delivering the premier storage system for scalable Linux clusters

www.panasas.com
welch@panasas.com

From dnoveck@netapp.com Thu Jun 24 09:00:20 2004
Return-Path: <Dave.Noveck@netapp.com>
X-Sender: Dave.Noveck@netapp.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 80319 invoked from network); 24 Jun 2004 16:00:18 -0000
Received: from unknown (66.218.66.218)
by m17.grp.scd.yahoo.com with QMQP; 24 Jun 2004 16:00:18 -0000
Received: from unknown (HELO mx01.netapp.com) (198.95.226.53)
by mta3.grp.scd.yahoo.com with SMTP; 24 Jun 2004 16:00:18 -0000
Received: from hawk.corp.netapp.com (hawk [10.57.156.122])
by mx01.netapp.com (8.12.10/8.12.10/NTAP-1.4) with ESMTP id i5OFwVkX007203
for <pnfs-reqs@yahoogroups.com>; Thu, 24 Jun 2004 08:58:31 -0700 (PDT)
Received: from svlexc01.hq.netapp.com (svlexc01.corp.netapp.com [10.57.156.135])
by hawk.corp.netapp.com (8.12.9/8.12.9/NTAP-1.5) with ESMTP id i5OFwV2I006082
for <pnfs-reqs@yahoogroups.com>; Thu, 24 Jun 2004 08:58:31 -0700 (PDT)
content-class: urn:content-classes:message
MIME-Version: 1.0
Content-Type: text/plain;
charset="us-ascii"
Content-Transfer-Encoding: quoted-printable
X-MimeOLE: Produced By Microsoft Exchange V6.0.6249.0
Date: Thu, 24 Jun 2004 08:58:28 -0700
Message-ID: <C8CF60CFC4D8A74E9945E32CF096548A021421F3@silver.nane.netapp.com>
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
Thread-Topic: [pnfs-reqs] summary of June face-to-face
Thread-Index: AcRZ/3BOLLKHv0UtT+6sfxgSx7W5xgABIWWQ
To: <pnfs-reqs@yahoogroups.com>
X-eGroups-Remote-IP: 198.95.226.53
X-eGroups-From: "Noveck, Dave" <Dave.Noveck@netapp.com>
From: "Noveck, Dave" <dnoveck@netapp.com>
Subject: RE: [pnfs-reqs] summary of June face-to-face
X-Yahoo-Group-Post: member; u=44152831
X-Yahoo-Profile: davidnoveck

ADVERTISEMENT
Brent Welch wrote:
> > If a server makes a previously supported protocol non-
> > supported, it should respond to LAYOUTGET with a
> > distinctive error code that indicate that the layout
> > type is no longer supported. This will allow clients
> > to do a GETATTR to determine the new list of support
> > protocols for that fs.
>
> Is there precident for an attribute that has a list of values?
> I'm OK with that, but I had assumed that would be awkward.

Not in the strict sense of XDR lists as are used in READDIR
responses. However, I was using "list" in a more generic
sense for any variable-length set and there is plenty of
precedent for that. The fs_locations attribute is one
example:

fs_location4

struct fs_location4 {
utf8str_cis server<>;
pathname4 rootpath;
};


fs_locations4

struct fs_locations4 {
pathname4 fs_root;
fs_location4 locations<>;
};

ACL's are another attribute that includes multiple levels
of variable-length arays (i.e. an array of ace's with each
ace containing a variable-length string to specify the
user or group whose access is being allowed or denied (or
audited or made the subject of an alarm)).

I think that within the v4 attribute model you are pretty
free to use any form of attribute that you can represent in
XDR, although if the format is too complicated, you will get
some pushback from people.

From dhildebz@eecs.umich.edu Thu Jun 24 09:16:58 2004
Return-Path: <dhildebz@eecs.umich.edu>
X-Sender: dhildebz@eecs.umich.edu
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 18074 invoked from network); 24 Jun 2004 16:16:57 -0000
Received: from unknown (66.218.66.172)
by m3.grp.scd.yahoo.com with QMQP; 24 Jun 2004 16:16:57 -0000
Received: from unknown (HELO willow.eecs.umich.edu) (141.213.4.14)
by mta4.grp.scd.yahoo.com with SMTP; 24 Jun 2004 16:16:57 -0000
Received: from willow.eecs.umich.edu (localhost.eecs.umich.edu [127.0.0.1])
by willow.eecs.umich.edu (8.12.11/8.12.11) with ESMTP id i5OGFt6X008871
(version=TLSv1/SSLv3 cipher=DHE-RSA-AES256-SHA bits=256 verify=NO)
for <pnfs-reqs@yahoogroups.com>; Thu, 24 Jun 2004 12:15:55 -0400
Received: from localhost (dhildebz@localhost)
by willow.eecs.umich.edu (8.12.11/8.12.11/Submit) with ESMTP id i5OGFtmQ008868
for <pnfs-reqs@yahoogroups.com>; Thu, 24 Jun 2004 12:15:55 -0400
X-Authentication-Warning: willow.eecs.umich.edu: dhildebz owned process doing -bs
Date: Thu, 24 Jun 2004 12:15:55 -0400 (EDT)
To: pnfs-reqs@yahoogroups.com
In-Reply-To: <C8CF60CFC4D8A74E9945E32CF096548A021421F3@silver.nane.netapp.com>
Message-ID: <Pine.LNX.4.58.0406241211540.29375@willow.eecs.umich.edu>
References: <C8CF60CFC4D8A74E9945E32CF096548A021421F3@silver.nane.netapp.com>
MIME-Version: 1.0
Content-Type: TEXT/PLAIN; charset=iso-8859-1
Content-Transfer-Encoding: QUOTED-PRINTABLE
X-eGroups-Remote-IP: 141.213.4.14
From: Dean Hildebrand <dhildebz@eecs.umich.edu>
Subject: RE: [pnfs-reqs] summary of June face-to-face
X-Yahoo-Group-Post: member; u=169352062
X-Yahoo-Profile: seattleplus

I would still like someone to explain to me how the NFS server, which is
talking to a single file system through an OS's 'VFS' layer, can support
multiple protocols for a single file at the same time. Each fsid or node
exported via exportfs can have a different file system type, but for a
single file? Is this something that a specialized implementation of the
NFS server, like Netapp's, is looking to support?

Dean

On Thu, 24 Jun 2004, Noveck, Dave wrote:

> Brent Welch wrote:
> > >     If a server makes a previously supported protocol non-
> > >     supported, it should respond to LAYOUTGET with a
> > >     distinctive error code that indicate that the layout
> > >     type is no longer supported.  This will allow clients
> > >     to do a GETATTR to determine the new list of support
> > >     protocols for that fs.
> >
> > Is there precident for an attribute that has a list of values?
> > I'm OK with that, but I had assumed that would be awkward.
>
> Not in the strict sense of XDR lists as are used in READDIR
> responses.  However, I was using "list" in a more generic
> sense for any variable-length set and there is plenty of
> precedent for that.  The fs_locations attribute is one
> example:
>
>    fs_location4
>
>                   struct fs_location4 {
>                           utf8str_cis    server<>;
>                           pathname4     rootpath;
>                   };
>
>
>    fs_locations4
>
>                   struct fs_locations4 {
>                           pathname4     fs_root;
>                           fs_location4  locations<>;
>                   };
>
> ACL's are another attribute that includes multiple levels
> of variable-length arays (i.e. an array of ace's with each
> ace containing a variable-length string to specify the
> user or group whose access is being allowed or denied (or
> audited or made the subject of an alarm)).
>
> I think that within the v4 attribute model you are pretty
> free to use any form of attribute that you can represent in
> XDR, although if the format is too complicated, you will get
> some pushback from people.
>
>
> Yahoo! Groups Sponsor
> ADVERTISEMENT
> click here
> [rand=437384642]
>
> ________________________________________________________________________________
> Yahoo! Groups Links
> * To visit your group on the web, go to:
> http://groups.yahoo.com/group/pnfs-reqs/
>  
> * To unsubscribe from this group, send an email to:
> pnfs-reqs-unsubscribe@yahoogroups.com
>  
> * Your use of Yahoo! Groups is subject to the Yahoo! Terms of Service.
>
>

From dnoveck@netapp.com Thu Jun 24 10:06:25 2004
Return-Path: <Dave.Noveck@netapp.com>
X-Sender: Dave.Noveck@netapp.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 52498 invoked from network); 24 Jun 2004 17:06:24 -0000
Received: from unknown (66.218.66.167)
by m11.grp.scd.yahoo.com with QMQP; 24 Jun 2004 17:06:24 -0000
Received: from unknown (HELO mx01.netapp.com) (198.95.226.53)
by mta6.grp.scd.yahoo.com with SMTP; 24 Jun 2004 17:06:24 -0000
Received: from hawk.corp.netapp.com (hawk [10.57.156.122])
by mx01.netapp.com (8.12.10/8.12.10/NTAP-1.4) with ESMTP id i5OH6MkX016066
for <pnfs-reqs@yahoogroups.com>; Thu, 24 Jun 2004 10:06:22 -0700 (PDT)
Received: from svlexc01.hq.netapp.com (svlexc01.corp.netapp.com [10.57.156.135])
by hawk.corp.netapp.com (8.12.9/8.12.9/NTAP-1.5) with ESMTP id i5OH6M2I027910
for <pnfs-reqs@yahoogroups.com>; Thu, 24 Jun 2004 10:06:22 -0700 (PDT)
content-class: urn:content-classes:message
MIME-Version: 1.0
Content-Type: text/plain;
charset="us-ascii"
Content-Transfer-Encoding: quoted-printable
X-MimeOLE: Produced By Microsoft Exchange V6.0.6249.0
Date: Thu, 24 Jun 2004 10:06:17 -0700
Message-ID: <C8CF60CFC4D8A74E9945E32CF096548A01C8EE3F@silver.nane.netapp.com>
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
Thread-Topic: [pnfs-reqs] summary of June face-to-face
Thread-Index: AcRaBq4vxpkrbW5KQJ6x7FjEVNyOwgAAWiPw
To: <pnfs-reqs@yahoogroups.com>
X-eGroups-Remote-IP: 198.95.226.53
X-eGroups-From: "Noveck, Dave" <Dave.Noveck@netapp.com>
From: "Noveck, Dave" <dnoveck@netapp.com>
Subject: RE: [pnfs-reqs] summary of June face-to-face
X-Yahoo-Group-Post: member; u=44152831
X-Yahoo-Profile: davidnoveck

Let's me answer things a bit out of order.

> Is this something that a specialized implementation of the
> NFS server, like Netapp's, is looking to support?

Speaking personally, I'd prefer to implement a single storage
protocol because doing that is the minimum amount of work,
and I kind of like doing the minimum amount of work.

However, I don't make those kind of decisions. Ultimately,
the number of storage protocols we would support depends on
what customers want. However, if we do wind up supporting
more than one single protocol, I can't imagine us requiring
that each fs only be accessible via a single storage protocol.
That's just a configuration nightmare. If you support a
set of storage protocols, then I think you might as well
allow any of them to be used on any fs.

Now let's go to the "how" question.

> I would still like someone to explain to me how the NFS server,
> which is talking to a single file system through an OS's 'VFS'
> layer, can support multiple protocols for a single file at the
> same time.

I don't think there is anything involved in doing this
which is fundamentally incompatible with the sort of
architecture you are talking about. Of course, as my
artful language suggests, you're probably going to wind
up changing your VFS interface in various ways. That's
just something you have to accept when you do these sorts
of things. Otherwise life could get unbearably dull.

> Each fsid or node exported via exportfs can have a
> different file system type, but for a single file?

I think you are misunderstanding the relationship of
the protocol by which access to an fs is exported and
the VFS file system type. Let us suppose that I
have an NFS server which is acting as the storage
server and providing the file-style access on behalf
of a pnfs metadata server on another machine. Let's
just pick an fs type for the underlying fs. Let's
say it is UFS. It doesn't matter. File-style storage
access, in this case NFS access, can be provided at the
same time that the UFS files within that store are
accessed by local application, if there are any. In
this context the NFS server is just an application that
is accessing those files. Now suppose I build a server
for another storage protocol, let's say objects. It
would receive CDB's instead of RPC requests but its
relationship to the UFS file system through VFS would
be exactly the same as that of the NFS server. It
would do VFS_READ and VFS_WRITE calls. If I'm
implementing a blocks storage protocol and I choose
to present in pnfs virtualized block addresses tied
to the actual VFS file objects, then things are similar
in the blocks case as well. If I want to export the
actual physical block address within the UFS file
system then things get nasty and you probably have to
build things a different way, but the point is that
the many storage protocols may implemented on top of
VFS access to a single fs.


-----Original Message-----
From: Dean Hildebrand [mailto:dhildebz@eecs.umich.edu]
Sent: Thursday, June 24, 2004 12:16 PM
To: pnfs-reqs@yahoogroups.com
Subject: RE: [pnfs-reqs] summary of June face-to-face


I would still like someone to explain to me how the NFS server, which is
talking to a single file system through an OS's 'VFS' layer, can support
multiple protocols for a single file at the same time. Each fsid or
node
exported via exportfs can have a different file system type, but for a
single file? Is this something that a specialized implementation of the
NFS server, like Netapp's, is looking to support?

From dhildebz@eecs.umich.edu Thu Jun 24 10:27:10 2004
Return-Path: <dhildebz@eecs.umich.edu>
X-Sender: dhildebz@eecs.umich.edu
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 85834 invoked from network); 24 Jun 2004 17:27:09 -0000
Received: from unknown (66.218.66.166)
by m25.grp.scd.yahoo.com with QMQP; 24 Jun 2004 17:27:09 -0000
Received: from unknown (HELO willow.eecs.umich.edu) (141.213.4.14)
by mta5.grp.scd.yahoo.com with SMTP; 24 Jun 2004 17:27:09 -0000
Received: from willow.eecs.umich.edu (localhost.eecs.umich.edu [127.0.0.1])
by willow.eecs.umich.edu (8.12.11/8.12.11) with ESMTP id i5OHR7fu009548
(version=TLSv1/SSLv3 cipher=DHE-RSA-AES256-SHA bits=256 verify=NO)
for <pnfs-reqs@yahoogroups.com>; Thu, 24 Jun 2004 13:27:08 -0400
Received: from localhost (dhildebz@localhost)
by willow.eecs.umich.edu (8.12.11/8.12.11/Submit) with ESMTP id i5OHR7H0009545
for <pnfs-reqs@yahoogroups.com>; Thu, 24 Jun 2004 13:27:07 -0400
X-Authentication-Warning: willow.eecs.umich.edu: dhildebz owned process doing -bs
Date: Thu, 24 Jun 2004 13:27:07 -0400 (EDT)
To: pnfs-reqs@yahoogroups.com
In-Reply-To: <C8CF60CFC4D8A74E9945E32CF096548A01C8EE3F@silver.nane.netapp.com>
Message-ID: <Pine.LNX.4.58.0406241317210.29375@willow.eecs.umich.edu>
References: <C8CF60CFC4D8A74E9945E32CF096548A01C8EE3F@silver.nane.netapp.com>
MIME-Version: 1.0
Content-Type: TEXT/PLAIN; charset=iso-8859-1
Content-Transfer-Encoding: QUOTED-PRINTABLE
X-eGroups-Remote-IP: 141.213.4.14
From: Dean Hildebrand <dhildebz@eecs.umich.edu>
Subject: RE: [pnfs-reqs] summary of June face-to-face
X-Yahoo-Group-Post: member; u=169352062
X-Yahoo-Profile: seattleplus

Dave, thanks for the detailed explanation, but I think you missed my
point (or maybe it is just your 'artful language' :) )

I understand how VFS type layers work, but the question is, "How does a
single NFSv4 server export multiple protocols?" You can have multiple
NFSv4 servers each exporting a different protocol, but it seems to me that
a NFSv4 server is bound to the metadata server of the file sytem is
exports. If this is true, then a client accessing a single NFSv4 server
should only be told about the protocol of the metadata server the NFSv4
server is exporting. If a client wants to use a different parallel I/O
protocol, then it would have to mount a different NFSv4 server exporting
a different file protocol.

How you would have a single file exported via 2 parallel file systems is a
whole other issue.
Dean

On Thu, 24 Jun 2004, Noveck, Dave wrote:

> Let's me answer things a bit out of order.
>
> > Is this something that a specialized implementation of the
> > NFS server, like Netapp's, is looking to support?
>
> Speaking personally, I'd prefer to implement a single storage
> protocol because doing that is the minimum amount of work,
> and I kind of like doing the minimum amount of work.
>
> However, I don't make those kind of decisions.  Ultimately,
> the number of storage protocols we would support depends on
> what customers want.  However, if we do wind up supporting
> more than one single protocol, I can't imagine us requiring
> that each fs only be accessible via a single storage protocol.
> That's just a configuration nightmare.  If you support a
> set of storage protocols, then I think you might as well
> allow any of them to be used on any fs.
>
> Now let's go to the "how" question.
>
> > I would still like someone to explain to me how the NFS server,
> > which is talking to a single file system through an OS's 'VFS'
> > layer, can support multiple protocols for a single file at the
> > same time.
>
> I don't think there is anything involved in doing this
> which is fundamentally incompatible with the sort of
> architecture you are talking about.  Of course, as my
> artful language suggests, you're probably going to wind
> up changing your VFS interface in various ways.  That's
> just something you have to accept when you do these sorts
> of things.  Otherwise life could get unbearably dull.
>
> > Each fsid or node exported via exportfs can have a
> > different file system type, but for a single file?
>
> I think you are misunderstanding the relationship of
> the protocol by which access to an fs is exported and
> the VFS file system type.  Let us suppose that I
> have an NFS server which is acting as the storage
> server and providing the file-style access on behalf
> of a pnfs metadata server on another machine.  Let's
> just pick an fs type for the underlying fs.  Let's
> say it is UFS.  It doesn't matter.  File-style storage
> access, in this case NFS access, can be provided at the
> same time that the UFS files within that store are
> accessed by local application, if there are any.  In
> this context the NFS server is just an application that
> is accessing those files.  Now suppose I build a server
> for another storage protocol, let's say objects.  It
> would receive CDB's instead of RPC requests but its
> relationship to the UFS file system through VFS would
> be exactly the same as that of the NFS server.  It
> would do VFS_READ and VFS_WRITE calls.  If I'm
> implementing a blocks storage protocol and I choose
> to present in pnfs virtualized block addresses tied
> to the actual VFS file objects, then things are similar
> in the blocks case as well.  If I want to export the
> actual physical block address within the UFS file
> system then things get nasty and you probably have to
> build things a different way, but the point is that
> the many storage protocols may implemented on top of
> VFS access to a single fs.
>
>
> -----Original Message-----
> From: Dean Hildebrand [mailto:dhildebz@eecs.umich.edu]
> Sent: Thursday, June 24, 2004 12:16 PM
> To: pnfs-reqs@yahoogroups.com
> Subject: RE: [pnfs-reqs] summary of June face-to-face
>
>
> I would still like someone to explain to me how the NFS server, which is
> talking to a single file system through an OS's 'VFS' layer, can support
> multiple protocols for a single file at the same time.  Each fsid or
> node
> exported via exportfs can have a different file system type, but for a
> single file?  Is this something that a specialized implementation of the
> NFS server, like Netapp's, is looking to support?
>
>
> Yahoo! Groups Sponsor
> ADVERTISEMENT
> click here
> [rand=706282213]
>
> ________________________________________________________________________________
> Yahoo! Groups Links
> * To visit your group on the web, go to:
> http://groups.yahoo.com/group/pnfs-reqs/
>  
> * To unsubscribe from this group, send an email to:
> pnfs-reqs-unsubscribe@yahoogroups.com
>  
> * Your use of Yahoo! Groups is subject to the Yahoo! Terms of Service.
>
>

From dnoveck@netapp.com Thu Jun 24 10:58:24 2004
Return-Path: <Dave.Noveck@netapp.com>
X-Sender: Dave.Noveck@netapp.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 99212 invoked from network); 24 Jun 2004 17:58:22 -0000
Received: from unknown (66.218.66.218)
by m3.grp.scd.yahoo.com with QMQP; 24 Jun 2004 17:58:22 -0000
Received: from unknown (HELO mx01.netapp.com) (198.95.226.53)
by mta3.grp.scd.yahoo.com with SMTP; 24 Jun 2004 17:58:22 -0000
Received: from frejya.corp.netapp.com (frejya [10.57.157.119])
by mx01.netapp.com (8.12.10/8.12.10/NTAP-1.4) with ESMTP id i5OHsZkX022424
for <pnfs-reqs@yahoogroups.com>; Thu, 24 Jun 2004 10:54:35 -0700 (PDT)
Received: from svlexc01.hq.netapp.com (svlexc01.corp.netapp.com [10.57.156.135])
by frejya.corp.netapp.com (8.12.9/8.12.9/NTAP-1.5) with ESMTP id i5OHsZI1010194
for <pnfs-reqs@yahoogroups.com>; Thu, 24 Jun 2004 10:54:35 -0700 (PDT)
content-class: urn:content-classes:message
MIME-Version: 1.0
Content-Type: text/plain;
charset="us-ascii"
Content-Transfer-Encoding: quoted-printable
X-MimeOLE: Produced By Microsoft Exchange V6.0.6249.0
Date: Thu, 24 Jun 2004 10:54:30 -0700
Message-ID: <C8CF60CFC4D8A74E9945E32CF096548A01C8EE40@silver.nane.netapp.com>
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
Thread-Topic: [pnfs-reqs] summary of June face-to-face
Thread-Index: AcRaEHzDsOrEr8L5QpuiZ2uzlPLkpAAAdDNg
To: <pnfs-reqs@yahoogroups.com>
X-eGroups-Remote-IP: 198.95.226.53
X-eGroups-From: "Noveck, Dave" <Dave.Noveck@netapp.com>
From: "Noveck, Dave" <dnoveck@netapp.com>
Subject: RE: [pnfs-reqs] summary of June face-to-face
X-Yahoo-Group-Post: member; u=44152831
X-Yahoo-Profile: davidnoveck

Dean Hildebrand wrote:
> Dave, thanks for the detailed explanation, but I think you missed my
> point (or maybe it is just your 'artful language' :) )

OK. Let's try again.

> I understand how VFS type layers work, but the question is, "How does
a
> single NFSv4 server export multiple protocols?" You can have multiple
> NFSv4 servers each exporting a different protocol, but it seems to me
that
> a NFSv4 server is bound to the metadata server of the file sytem is
> exports.

I'm having trouble understanding you. You have an NFSv4.x server.
It stores its metadata somewhere, typically in a file system local
to it. It exports access to file that are represented by that
metadata and store in a number of related servers which connected
by a storage managemnt protocol, which is not defined by pnfs.

I don't undertstand what "the metadata server of the file system is
(it?) exports" is.

> If this is true, then a client accessing a single NFSv4 server
> should only be told about the protocol of the metadata server the
NFSv4
> server is exporting.

The protocol of the metadata server would seem to be pnfs, whether
you are exportng the file varaint, the object variant, or the blocks
variant or some combination for the backend.

> If a client wants to use a different parallel I/O
> protocol, then it would have to mount a different NFSv4 server
exporting
> a different file protocol.

What do mean by "a different parallel I/O protocol"? If you mean
a different variant (blocks/objects/files), then I don't see why
you need a different NFSv4 server.

> How you would have a single file exported via 2 parallel file systems
is a
> whole other issue.


From garth@panasas.com Thu Jun 24 11:02:41 2004
Return-Path: <garth@panasas.com>
X-Sender: garth@panasas.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 32119 invoked from network); 24 Jun 2004 18:02:41 -0000
Received: from unknown (66.218.66.167)
by m25.grp.scd.yahoo.com with QMQP; 24 Jun 2004 18:02:41 -0000
Received: from unknown (HELO PIKES.panasas.com) (65.194.124.178)
by mta6.grp.scd.yahoo.com with SMTP; 24 Jun 2004 18:02:40 -0000
Received: from [127.0.0.1] ([172.17.19.3]) by PIKES.panasas.com with SMTP (Microsoft Exchange Internet Mail Service Version 5.5.2653.13)
id 2B56SQ35; Thu, 24 Jun 2004 14:01:53 -0400
Mime-Version: 1.0 (Apple Message framework v618)
In-Reply-To: <Pine.LNX.4.58.0406241317210.29375@willow.eecs.umich.edu>
References: <C8CF60CFC4D8A74E9945E32CF096548A01C8EE3F@silver.nane.netapp.com> <Pine.LNX.4.58.0406241317210.29375@willow.eecs.umich.edu>
Content-Type: text/plain; charset=ISO-8859-1; delsp=yes; format=flowed
Message-Id: <910D8241-C608-11D8-AC74-000A95A94F04@panasas.com>
Content-Transfer-Encoding: quoted-printable
Date: Thu, 24 Jun 2004 14:01:50 -0400
To: pnfs-reqs@yahoogroups.com
X-Mailer: Apple Mail (2.618)
X-eGroups-Remote-IP: 65.194.124.178
From: Garth Gibson <garth@panasas.com>
Subject: Re: [pnfs-reqs] summary of June face-to-face
X-Yahoo-Group-Post: member; u=169457820
X-Yahoo-Profile: garth_a_gibson

I think there are two questions:
- is it possible
- is it a common case for our design
for a single file or single filesystem to be accessible from clients
using two different storage protocols.

I am certain it is possible, but I am not certain it is a common case.
That is, I am not sure I can generate a compelling use case for us to
make it important to worry a lot about this case.

For example, if a data server supports iSCSI and NFS command protocols,
a "parallel file system" could spread a file over multiple data servers
in a way that allows it to be seen as parallel block file or a set of
files striped over NFS. This is certainly complex, and I can't think
of a good reason for it, but it is possible.

Dean, perhaps you are thinking of implementations where the backend
parallel file systems are only the ones that already exist, and you are
suggesting that a file would never be managed by two different backend
parallel file services. I would agree. But a backend could be built
that did this. Or a middle layer could be built that put NFS frontend
in each disk array, used a block-based backend for each NFS server to
privately access the blocks on its disk array, and the middle layer
could over striped files. Perhaps because a file that is usually
accessed through a data center SAN is now needed by a cluster in
another building and someone believes that it is better to use NFS/TCP
to handle the extra layers of routers.

Anyway, possible I think it is. Important to initial implementations,
unlikely. Useful for the standard to leave possible, maybe, if there
are little or no ramifications in the standard draft.

garth

On Jun 24, 2004, at 1:27 PM, Dean Hildebrand wrote:

> Dave, thanks for the detailed explanation, but I think you missed my
> point (or maybe it is just your 'artful language' :) )
>
> I understand how VFS type layers work, but the question is, "How does a
> single NFSv4 server export multiple protocols?" You can have multiple
> NFSv4 servers each exporting a different protocol, but it seems to me
> that
> a NFSv4 server is bound to the metadata server of the file sytem is
> exports. If this is true, then a client accessing a single NFSv4
> server
> should only be told about the protocol of the metadata server the NFSv4
> server is exporting. If a client wants to use a different parallel I/O
> protocol, then it would have to mount a different NFSv4 server
> exporting
> a different file protocol.
>
> How you would have a single file exported via 2 parallel file systems
> is a
> whole other issue.
> Dean
>
> On Thu, 24 Jun 2004, Noveck, Dave wrote:
>
>> Let's me answer things a bit out of order.
>>
>>> Is this something that a specialized implementation of the
>>> NFS server, like Netapp's, is looking to support?
>>
>> Speaking personally, I'd prefer to implement a single storage
>> protocol because doing that is the minimum amount of work,
>> and I kind of like doing the minimum amount of work.
>>
>> However, I don't make those kind of decisions.  Ultimately,
>> the number of storage protocols we would support depends on
>> what customers want.  However, if we do wind up supporting
>> more than one single protocol, I can't imagine us requiring
>> that each fs only be accessible via a single storage protocol.
>> That's just a configuration nightmare.  If you support a
>> set of storage protocols, then I think you might as well
>> allow any of them to be used on any fs.
>>
>> Now let's go to the "how" question.
>>
>>> I would still like someone to explain to me how the NFS server,
>>> which is talking to a single file system through an OS's 'VFS'
>>> layer, can support multiple protocols for a single file at the
>>> same time.
>>
>> I don't think there is anything involved in doing this
>> which is fundamentally incompatible with the sort of
>> architecture you are talking about.  Of course, as my
>> artful language suggests, you're probably going to wind
>> up changing your VFS interface in various ways.  That's
>> just something you have to accept when you do these sorts
>> of things.  Otherwise life could get unbearably dull.
>>
>>> Each fsid or node exported via exportfs can have a
>>> different file system type, but for a single file?
>>
>> I think you are misunderstanding the relationship of
>> the protocol by which access to an fs is exported and
>> the VFS file system type.  Let us suppose that I
>> have an NFS server which is acting as the storage
>> server and providing the file-style access on behalf
>> of a pnfs metadata server on another machine.  Let's
>> just pick an fs type for the underlying fs.  Let's
>> say it is UFS.  It doesn't matter.  File-style storage
>> access, in this case NFS access, can be provided at the
>> same time that the UFS files within that store are
>> accessed by local application, if there are any.  In
>> this context the NFS server is just an application that
>> is accessing those files.  Now suppose I build a server
>> for another storage protocol, let's say objects.  It
>> would receive CDB's instead of RPC requests but its
>> relationship to the UFS file system through VFS would
>> be exactly the same as that of the NFS server.  It
>> would do VFS_READ and VFS_WRITE calls.  If I'm
>> implementing a blocks storage protocol and I choose
>> to present in pnfs virtualized block addresses tied
>> to the actual VFS file objects, then things are similar
>> in the blocks case as well.  If I want to export the
>> actual physical block address within the UFS file
>> system then things get nasty and you probably have to
>> build things a different way, but the point is that
>> the many storage protocols may implemented on top of
>> VFS access to a single fs.
>>
>>
>> -----Original Message-----
>> From: Dean Hildebrand [mailto:dhildebz@eecs.umich.edu]
>> Sent: Thursday, June 24, 2004 12:16 PM
>> To: pnfs-reqs@yahoogroups.com
>> Subject: RE: [pnfs-reqs] summary of June face-to-face
>>
>>
>> I would still like someone to explain to me how the NFS server, which
>> is
>> talking to a single file system through an OS's 'VFS' layer, can
>> support
>> multiple protocols for a single file at the same time.  Each fsid or
>> node
>> exported via exportfs can have a different file system type, but for a
>> single file?  Is this something that a specialized implementation of
>> the
>> NFS server, like Netapp's, is looking to support?
>>
>>
>> Yahoo! Groups Sponsor
>> ADVERTISEMENT
>> click here
>> [rand=706282213]
>>
>> ______________________________________________________________________
>> __________
>> Yahoo! Groups Links
>> * To visit your group on the web, go to:
>> http://groups.yahoo.com/group/pnfs-reqs/
>>  
>> * To unsubscribe from this group, send an email to:
>> pnfs-reqs-unsubscribe@yahoogroups.com
>>  
>> * Your use of Yahoo! Groups is subject to the Yahoo! Terms of
>> Service.
>>
>>
>
>
>
>
> Yahoo! Groups Links
>
>
>
>
>

From dnoveck@netapp.com Thu Jun 24 11:19:30 2004
Return-Path: <Dave.Noveck@netapp.com>
X-Sender: Dave.Noveck@netapp.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 81840 invoked from network); 24 Jun 2004 18:19:29 -0000
Received: from unknown (66.218.66.216)
by m23.grp.scd.yahoo.com with QMQP; 24 Jun 2004 18:19:29 -0000
Received: from unknown (HELO mx01.netapp.com) (198.95.226.53)
by mta1.grp.scd.yahoo.com with SMTP; 24 Jun 2004 18:19:29 -0000
Received: from hawk.corp.netapp.com (hawk [10.57.156.122])
by mx01.netapp.com (8.12.10/8.12.10/NTAP-1.4) with ESMTP id i5OIJRkX026210
for <pnfs-reqs@yahoogroups.com>; Thu, 24 Jun 2004 11:19:27 -0700 (PDT)
Received: from svlexc01.hq.netapp.com (svlexc01.corp.netapp.com [10.57.156.135])
by hawk.corp.netapp.com (8.12.9/8.12.9/NTAP-1.5) with ESMTP id i5OIJQ2I025075
for <pnfs-reqs@yahoogroups.com>; Thu, 24 Jun 2004 11:19:27 -0700 (PDT)
content-class: urn:content-classes:message
MIME-Version: 1.0
Content-Type: text/plain;
charset="iso-8859-1"
Content-Transfer-Encoding: quoted-printable
X-MimeOLE: Produced By Microsoft Exchange V6.0.6249.0
Date: Thu, 24 Jun 2004 11:19:25 -0700
Message-ID: <C8CF60CFC4D8A74E9945E32CF096548A01C8EE41@silver.nane.netapp.com>
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
Thread-Topic: [pnfs-reqs] summary of June face-to-face
Thread-Index: AcRaFXLQrtx3gw4CRgemwEZuPbUYrwAAWwEA
To: <pnfs-reqs@yahoogroups.com>
X-eGroups-Remote-IP: 198.95.226.53
X-eGroups-From: "Noveck, Dave" <Dave.Noveck@netapp.com>
From: "Noveck, Dave" <dnoveck@netapp.com>
Subject: RE: [pnfs-reqs] summary of June face-to-face
X-Yahoo-Group-Post: member; u=44152831
X-Yahoo-Profile: davidnoveck

ADVERTISEMENT
click here
The use case depends on how clients evolve. If all
all clients are sensible, and implement at least the
protocol that is the best then everything is OK
(the only problem is that we all disagree on what
is the best :-).

The problem will be if clients are like Garth, and
David, and me and disagree. If there are a
significant set of cleints that only implement
files (because it is the best) and another set
that only implements objects (because they
mistakenly think it is the best) and a third
set that only implements blocks (because they
mistakely think it is the best), then we are going
to want to have servers that support both the correct
and misguided servers. Everybody can permute the
above to reflect their own version of reality.

-----Original Message-----
From: Garth Gibson [mailto:garth@panasas.com]
Sent: Thursday, June 24, 2004 2:02 PM
To: pnfs-reqs@yahoogroups.com
Subject: Re: [pnfs-reqs] summary of June face-to-face


I think there are two questions:
- is it possible
- is it a common case for our design
for a single file or single filesystem to be accessible from clients
using two different storage protocols.

I am certain it is possible, but I am not certain it is a common case.
That is, I am not sure I can generate a compelling use case for us to
make it important to worry a lot about this case.

For example, if a data server supports iSCSI and NFS command protocols,
a "parallel file system" could spread a file over multiple data servers
in a way that allows it to be seen as parallel block file or a set of
files striped over NFS. This is certainly complex, and I can't think
of a good reason for it, but it is possible.

Dean, perhaps you are thinking of implementations where the backend
parallel file systems are only the ones that already exist, and you are
suggesting that a file would never be managed by two different backend
parallel file services. I would agree. But a backend could be built
that did this. Or a middle layer could be built that put NFS frontend
in each disk array, used a block-based backend for each NFS server to
privately access the blocks on its disk array, and the middle layer
could over striped files. Perhaps because a file that is usually
accessed through a data center SAN is now needed by a cluster in
another building and someone believes that it is better to use NFS/TCP
to handle the extra layers of routers.

Anyway, possible I think it is. Important to initial implementations,
unlikely. Useful for the standard to leave possible, maybe, if there
are little or no ramifications in the standard draft.

garth

On Jun 24, 2004, at 1:27 PM, Dean Hildebrand wrote:

> Dave, thanks for the detailed explanation, but I think you missed my
> point (or maybe it is just your 'artful language' :) )
>
> I understand how VFS type layers work, but the question is, "How does a
> single NFSv4 server export multiple protocols?" You can have multiple
> NFSv4 servers each exporting a different protocol, but it seems to me
> that
> a NFSv4 server is bound to the metadata server of the file sytem is
> exports. If this is true, then a client accessing a single NFSv4
> server
> should only be told about the protocol of the metadata server the NFSv4
> server is exporting. If a client wants to use a different parallel I/O
> protocol, then it would have to mount a different NFSv4 server
> exporting
> a different file protocol.
>
> How you would have a single file exported via 2 parallel file systems
> is a
> whole other issue.
> Dean
>
> On Thu, 24 Jun 2004, Noveck, Dave wrote:
>
>> Let's me answer things a bit out of order.
>>
>>> Is this something that a specialized implementation of the
>>> NFS server, like Netapp's, is looking to support?
>>
>> Speaking personally, I'd prefer to implement a single storage
>> protocol because doing that is the minimum amount of work,
>> and I kind of like doing the minimum amount of work.
>>
>> However, I don't make those kind of decisions.  Ultimately,
>> the number of storage protocols we would support depends on
>> what customers want.  However, if we do wind up supporting
>> more than one single protocol, I can't imagine us requiring
>> that each fs only be accessible via a single storage protocol.
>> That's just a configuration nightmare.  If you support a
>> set of storage protocols, then I think you might as well
>> allow any of them to be used on any fs.
>>
>> Now let's go to the "how" question.
>>
>>> I would still like someone to explain to me how the NFS server,
>>> which is talking to a single file system through an OS's 'VFS'
>>> layer, can support multiple protocols for a single file at the
>>> same time.
>>
>> I don't think there is anything involved in doing this
>> which is fundamentally incompatible with the sort of
>> architecture you are talking about.  Of course, as my
>> artful language suggests, you're probably going to wind
>> up changing your VFS interface in various ways.  That's
>> just something you have to accept when you do these sorts
>> of things.  Otherwise life could get unbearably dull.
>>
>>> Each fsid or node exported via exportfs can have a
>>> different file system type, but for a single file?
>>
>> I think you are misunderstanding the relationship of
>> the protocol by which access to an fs is exported and
>> the VFS file system type.  Let us suppose that I
>> have an NFS server which is acting as the storage
>> server and providing the file-style access on behalf
>> of a pnfs metadata server on another machine.  Let's
>> just pick an fs type for the underlying fs.  Let's
>> say it is UFS.  It doesn't matter.  File-style storage
>> access, in this case NFS access, can be provided at the
>> same time that the UFS files within that store are
>> accessed by local application, if there are any.  In
>> this context the NFS server is just an application that
>> is accessing those files.  Now suppose I build a server
>> for another storage protocol, let's say objects.  It
>> would receive CDB's instead of RPC requests but its
>> relationship to the UFS file system through VFS would
>> be exactly the same as that of the NFS server.  It
>> would do VFS_READ and VFS_WRITE calls.  If I'm
>> implementing a blocks storage protocol and I choose
>> to present in pnfs virtualized block addresses tied
>> to the actual VFS file objects, then things are similar
>> in the blocks case as well.  If I want to export the
>> actual physical block address within the UFS file
>> system then things get nasty and you probably have to
>> build things a different way, but the point is that
>> the many storage protocols may implemented on top of
>> VFS access to a single fs.
>>
>>
>> -----Original Message-----
>> From: Dean Hildebrand [mailto:dhildebz@eecs.umich.edu]
>> Sent: Thursday, June 24, 2004 12:16 PM
>> To: pnfs-reqs@yahoogroups.com
>> Subject: RE: [pnfs-reqs] summary of June face-to-face
>>
>>
>> I would still like someone to explain to me how the NFS server, which
>> is
>> talking to a single file system through an OS's 'VFS' layer, can
>> support
>> multiple protocols for a single file at the same time.  Each fsid or
>> node
>> exported via exportfs can have a different file system type, but for a
>> single file?  Is this something that a specialized implementation of
>> the
>> NFS server, like Netapp's, is looking to support?
>>
>>
>> Yahoo! Groups Sponsor
>> ADVERTISEMENT
>> click here
>> [rand=706282213]
>>
>> ______________________________________________________________________
>> __________
>> Yahoo! Groups Links
>> * To visit your group on the web, go to:
>> http://groups.yahoo.com/group/pnfs-reqs/
>>  
>> * To unsubscribe from this group, send an email to:
>> pnfs-reqs-unsubscribe@yahoogroups.com
>>  
>> * Your use of Yahoo! Groups is subject to the Yahoo! Terms of
>> Service.
>>
>>
>
>
>
>
> Yahoo! Groups Links
>
>
>
>
>





Yahoo! Groups Links

From dhildebz@eecs.umich.edu Thu Jun 24 11:26:19 2004
Return-Path: <dhildebz@eecs.umich.edu>
X-Sender: dhildebz@eecs.umich.edu
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 24311 invoked from network); 24 Jun 2004 18:26:19 -0000
Received: from unknown (66.218.66.216)
by m25.grp.scd.yahoo.com with QMQP; 24 Jun 2004 18:26:19 -0000
Received: from unknown (HELO willow.eecs.umich.edu) (141.213.4.14)
by mta1.grp.scd.yahoo.com with SMTP; 24 Jun 2004 18:26:19 -0000
Received: from willow.eecs.umich.edu (localhost.eecs.umich.edu [127.0.0.1])
by willow.eecs.umich.edu (8.12.11/8.12.11) with ESMTP id i5OIQGWT010379
(version=TLSv1/SSLv3 cipher=DHE-RSA-AES256-SHA bits=256 verify=NO)
for <pnfs-reqs@yahoogroups.com>; Thu, 24 Jun 2004 14:26:16 -0400
Received: from localhost (dhildebz@localhost)
by willow.eecs.umich.edu (8.12.11/8.12.11/Submit) with ESMTP id i5OIQGHr010376
for <pnfs-reqs@yahoogroups.com>; Thu, 24 Jun 2004 14:26:16 -0400
X-Authentication-Warning: willow.eecs.umich.edu: dhildebz owned process doing -bs
Date: Thu, 24 Jun 2004 14:26:16 -0400 (EDT)
To: pnfs-reqs@yahoogroups.com
In-Reply-To: <910D8241-C608-11D8-AC74-000A95A94F04@panasas.com>
Message-ID: <Pine.LNX.4.58.0406241416080.29375@willow.eecs.umich.edu>
References: <C8CF60CFC4D8A74E9945E32CF096548A01C8EE3F@silver.nane.netapp.com>
<Pine.LNX.4.58.0406241317210.29375@willow.eecs.umich.edu>
<910D8241-C608-11D8-AC74-000A95A94F04@panasas.com>
MIME-Version: 1.0
Content-Type: TEXT/PLAIN; charset=iso-8859-1
Content-Transfer-Encoding: QUOTED-PRINTABLE
X-eGroups-Remote-IP: 141.213.4.14
From: Dean Hildebrand <dhildebz@eecs.umich.edu>
Subject: Re: [pnfs-reqs] summary of June face-to-face
X-Yahoo-Group-Post: member; u=169352062
X-Yahoo-Profile: seattleplus

ADVERTISEMENT
Thanks Garth, I think that cleared it up.

Does anyone know of an existing file system that exports more than 1
parallel interface?
Dean

On Thu, 24 Jun 2004, Garth Gibson wrote:

> I think there are two questions:
> - is it possible
> - is it a common case for our design
> for a single file or single filesystem to be accessible from clients 
> using two different storage protocols.
>
> I am certain it is possible, but I am not certain it is a common case.  
> That is, I am not sure I can generate a compelling use case for us to 
> make it important to worry a lot about this case.
>
> For example, if a data server supports iSCSI and NFS command protocols, 
> a "parallel file system" could spread a file over multiple data servers 
> in a way that allows it to be seen as parallel block file or a set of 
> files striped over NFS.  This is certainly complex, and I can't think 
> of a good reason for it, but it is possible.
>
> Dean, perhaps you are thinking of implementations where the backend 
> parallel file systems are only the ones that already exist, and you are 
> suggesting that a file would never be managed by two different backend 
> parallel file services.  I would agree.  But a backend could be built 
> that did this.  Or a middle layer could be built that put NFS frontend 
> in each disk array, used a block-based backend for each NFS server to 
> privately access the blocks on its disk array, and the middle layer 
> could over striped files.  Perhaps because a file that is usually 
> accessed through a data center SAN is now needed by a cluster in 
> another building and someone believes that it is better to use NFS/TCP 
> to handle the extra layers of routers.
>
> Anyway, possible I think it is.  Important to initial implementations, 
> unlikely.  Useful for the standard to leave possible, maybe, if there 
> are little or no ramifications in the standard draft.
>
> garth
>
> On Jun 24, 2004, at 1:27 PM, Dean Hildebrand wrote:
>
> > Dave, thanks for the detailed explanation, but I think you missed my
> > point (or maybe it is just your 'artful language' :) )
> >
> > I understand how VFS type layers work, but the question is, "How does a
> > single NFSv4 server export multiple protocols?"  You can have multiple
> > NFSv4 servers each exporting a different protocol, but it seems to me 
> > that
> > a NFSv4 server is bound to the metadata server of the file sytem is
> > exports.  If this is true, then a client accessing a single NFSv4 
> > server
> > should only be told about the protocol of the metadata server the NFSv4
> > server is exporting.  If a client wants to use a different parallel I/O
> > protocol, then it would have to mount a different NFSv4 server 
> > exporting
> > a different file protocol.
> >
> > How you would have a single file exported via 2 parallel file systems 
> > is a
> > whole other issue.
> > Dean
> >
> > On Thu, 24 Jun 2004, Noveck, Dave wrote:
> >
> >> Let's me answer things a bit out of order.
> >>
> >>> Is this something that a specialized implementation of the
> >>> NFS server, like Netapp's, is looking to support?
> >>
> >> Speaking personally, I'd prefer to implement a single storage
> >> protocol because doing that is the minimum amount of work,
> >> and I kind of like doing the minimum amount of work.
> >>
> >> However, I don't make those kind of decisions.  Ultimately,
> >> the number of storage protocols we would support depends on
> >> what customers want.  However, if we do wind up supporting
> >> more than one single protocol, I can't imagine us requiring
> >> that each fs only be accessible via a single storage protocol.
> >> That's just a configuration nightmare.  If you support a
> >> set of storage protocols, then I think you might as well
> >> allow any of them to be used on any fs.
> >>
> >> Now let's go to the "how" question.
> >>
> >>> I would still like someone to explain to me how the NFS server,
> >>> which is talking to a single file system through an OS's 'VFS'
> >>> layer, can support multiple protocols for a single file at the
> >>> same time.
> >>
> >> I don't think there is anything involved in doing this
> >> which is fundamentally incompatible with the sort of
> >> architecture you are talking about.  Of course, as my
> >> artful language suggests, you're probably going to wind
> >> up changing your VFS interface in various ways.  That's
> >> just something you have to accept when you do these sorts
> >> of things.  Otherwise life could get unbearably dull.
> >>
> >>> Each fsid or node exported via exportfs can have a
> >>> different file system type, but for a single file?
> >>
> >> I think you are misunderstanding the relationship of
> >> the protocol by which access to an fs is exported and
> >> the VFS file system type.  Let us suppose that I
> >> have an NFS server which is acting as the storage
> >> server and providing the file-style access on behalf
> >> of a pnfs metadata server on another machine.  Let's
> >> just pick an fs type for the underlying fs.  Let's
> >> say it is UFS.  It doesn't matter.  File-style storage
> >> access, in this case NFS access, can be provided at the
> >> same time that the UFS files within that store are
> >> accessed by local application, if there are any.  In
> >> this context the NFS server is just an application that
> >> is accessing those files.  Now suppose I build a server
> >> for another storage protocol, let's say objects.  It
> >> would receive CDB's instead of RPC requests but its
> >> relationship to the UFS file system through VFS would
> >> be exactly the same as that of the NFS server.  It
> >> would do VFS_READ and VFS_WRITE calls.  If I'm
> >> implementing a blocks storage protocol and I choose
> >> to present in pnfs virtualized block addresses tied
> >> to the actual VFS file objects, then things are similar
> >> in the blocks case as well.  If I want to export the
> >> actual physical block address within the UFS file
> >> system then things get nasty and you probably have to
> >> build things a different way, but the point is that
> >> the many storage protocols may implemented on top of
> >> VFS access to a single fs.
> >>
> >>
> >> -----Original Message-----
> >> From: Dean Hildebrand [mailto:dhildebz@eecs.umich.edu]
> >> Sent: Thursday, June 24, 2004 12:16 PM
> >> To: pnfs-reqs@yahoogroups.com
> >> Subject: RE: [pnfs-reqs] summary of June face-to-face
> >>
> >>
> >> I would still like someone to explain to me how the NFS server, which 
> >> is
> >> talking to a single file system through an OS's 'VFS' layer, can 
> >> support
> >> multiple protocols for a single file at the same time.  Each fsid or
> >> node
> >> exported via exportfs can have a different file system type, but for a
> >> single file?  Is this something that a specialized implementation of 
> >> the
> >> NFS server, like Netapp's, is looking to support?
> >>
> >>
> >> Yahoo! Groups Sponsor
> >> ADVERTISEMENT
> >> click here
> >> [rand=706282213]
> >>
> >> ______________________________________________________________________
> >> __________
> >> Yahoo! Groups Links
> >>  *  To visit your group on the web, go to:
> >>     http://groups.yahoo.com/group/pnfs-reqs/
> >>      
> >>  *  To unsubscribe from this group, send an email to:
> >>     pnfs-reqs-unsubscribe@yahoogroups.com
> >>      
> >>  *  Your use of Yahoo! Groups is subject to the Yahoo! Terms of 
> >> Service.
> >>
> >>
> >
> >
> >
> >
> > Yahoo! Groups Links
> >
> >
> >
> >
> >
>
>
> Yahoo! Groups Sponsor
> ADVERTISEMENT
> click here
> [rand=317263051]
>
> ________________________________________________________________________________
> Yahoo! Groups Links
> * To visit your group on the web, go to:
> http://groups.yahoo.com/group/pnfs-reqs/
>  
> * To unsubscribe from this group, send an email to:
> pnfs-reqs-unsubscribe@yahoogroups.com
>  
> * Your use of Yahoo! Groups is subject to the Yahoo! Terms of Service.
>
>


From julian_satran@il.ibm.com Sun Jun 27 01:46:34 2004
Return-Path: <Julian_Satran@il.ibm.com>
X-Sender: Julian_Satran@il.ibm.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 49967 invoked from network); 27 Jun 2004 08:46:32 -0000
Received: from unknown (66.218.66.172)
by m20.grp.scd.yahoo.com with QMQP; 27 Jun 2004 08:46:32 -0000
Received: from unknown (HELO mtagate3.de.ibm.com) (195.212.29.152)
by mta4.grp.scd.yahoo.com with SMTP; 27 Jun 2004 08:46:31 -0000
Received: from d12nrmr1607.megacenter.de.ibm.com (d12nrmr1607.megacenter.de.ibm.com [9.149.167.49])
by mtagate3.de.ibm.com (8.12.10/8.12.10) with ESMTP id i5R8kUGm073740
for <pnfs-reqs@yahoogroups.com>; Sun, 27 Jun 2004 08:46:30 GMT
Received: from d12ml102.megacenter.de.ibm.com (d12av02.megacenter.de.ibm.com [9.149.165.228])
by d12nrmr1607.megacenter.de.ibm.com (8.12.10/NCO/VER6.6) with ESMTP id i5R8kTMm153378
for <pnfs-reqs@yahoogroups.com>; Sun, 27 Jun 2004 10:46:30 +0200
In-Reply-To: <Pine.LNX.4.58.0406241416080.29375@willow.eecs.umich.edu>
To: pnfs-reqs@yahoogroups.com
Cc: pnfs-reqs@yahoogroups.com
MIME-Version: 1.0
X-Mailer: Lotus Notes Release 6.5.1 January 21, 2004
Message-ID: <OF537C9E0F.612A90A7-ONC2256EC0.0020023B-C2256EC0.003033C4@il.ibm.com>
Date: Sun, 27 Jun 2004 11:46:28 +0300
X-MIMETrack: Serialize by Router on D12ML102/12/M/IBM(Release 6.0.2CF2HF259 | March 11, 2004) at
27/06/2004 11:46:29,
Serialize complete at 27/06/2004 11:46:29
Content-Type: multipart/alternative; boundary="=_alternative 0020E1A1C2256EC0_="
X-eGroups-Remote-IP: 195.212.29.152
From: Julian Satran <julian_satran@il.ibm.com>
Subject: Re: [pnfs-reqs] summary of June face-to-face
X-Yahoo-Group-Post: member; u=64714603
X-Yahoo-Profile: julian_satran

Definitely - Lustre OST can (in theory) export OST objects (their flavor) or files - as they use the native fs (that can be exported through NFS too).
But however hard I try I can't come up with a good reason one may want to do it - unless we want to guarantee that any client can access any piece of data through the client's favorite or only protocol and still support pNFS.  

I would add also that there is probably no big deal in supporting both object and file but definitely a lot of (unwanted) complexity in supporting object/file and block,

And we did not talk about security yet - that IMHO is unmanageable if we admit block and file/object access to the same file data.

Julo




Dean Hildebrand <dhildebz@eecs.umich.edu>

24/06/04 21:26
Please respond to
pnfs-reqs

	
To
	pnfs-reqs@yahoogroups.com
cc
	
Subject
	Re: [pnfs-reqs] summary of June face-to-face

	




Thanks Garth, I think that cleared it up.

Does anyone know of an existing file system that exports more than 1
parallel interface?
Dean

On Thu, 24 Jun 2004, Garth Gibson wrote:

> I think there are two questions:
> - is it possible
> - is it a common case for our design
> for a single file or single filesystem to be accessible from clients 
> using two different storage protocols.
>
> I am certain it is possible, but I am not certain it is a common case.  
> That is, I am not sure I can generate a compelling use case for us to 
> make it important to worry a lot about this case.
>
> For example, if a data server supports iSCSI and NFS command protocols, 
> a "parallel file system" could spread a file over multiple data servers 
> in a way that allows it to be seen as parallel block file or a set of 
> files striped over NFS.  This is certainly complex, and I can't think 
> of a good reason for it, but it is possible.
>
> Dean, perhaps you are thinking of implementations where the backend 
> parallel file systems are only the ones that already exist, and you are 
> suggesting that a file would never be managed by two different backend 
> parallel file services.  I would agree.  But a backend could be built 
> that did this.  Or a middle layer could be built that put NFS frontend 
> in each disk array, used a block-based backend for each NFS server to 
> privately access the blocks on its disk array, and the middle layer 
> could over striped files.  Perhaps because a file that is usually 
> accessed through a data center SAN is now needed by a cluster in 
> another building and someone believes that it is better to use NFS/TCP 
> to handle the extra layers of routers.
>
> Anyway, possible I think it is.  Important to initial implementations, 
> unlikely.  Useful for the standard to leave possible, maybe, if there 
> are little or no ramifications in the standard draft.
>
> garth
>
> On Jun 24, 2004, at 1:27 PM, Dean Hildebrand wrote:
>
> > Dave, thanks for the detailed explanation, but I think you missed my
> > point (or maybe it is just your 'artful language' :) )
> >
> > I understand how VFS type layers work, but the question is, "How does a
> > single NFSv4 server export multiple protocols?"  You can have multiple
> > NFSv4 servers each exporting a different protocol, but it seems to me 
> > that
> > a NFSv4 server is bound to the metadata server of the file sytem is
> > exports.  If this is true, then a client accessing a single NFSv4 
> > server
> > should only be told about the protocol of the metadata server the NFSv4
> > server is exporting.  If a client wants to use a different parallel I/O
> > protocol, then it would have to mount a different NFSv4 server 
> > exporting
> > a different file protocol.
> >
> > How you would have a single file exported via 2 parallel file systems 
> > is a
> > whole other issue.
> > Dean
> >
> > On Thu, 24 Jun 2004, Noveck, Dave wrote:
> >
> >> Let's me answer things a bit out of order.
> >>
> >>> Is this something that a specialized implementation of the
> >>> NFS server, like Netapp's, is looking to support?
> >>
> >> Speaking personally, I'd prefer to implement a single storage
> >> protocol because doing that is the minimum amount of work,
> >> and I kind of like doing the minimum amount of work.
> >>
> >> However, I don't make those kind of decisions.  Ultimately,
> >> the number of storage protocols we would support depends on
> >> what customers want.  However, if we do wind up supporting
> >> more than one single protocol, I can't imagine us requiring
> >> that each fs only be accessible via a single storage protocol.
> >> That's just a configuration nightmare.  If you support a
> >> set of storage protocols, then I think you might as well
> >> allow any of them to be used on any fs.
> >>
> >> Now let's go to the "how" question.
> >>
> >>> I would still like someone to explain to me how the NFS server,
> >>> which is talking to a single file system through an OS's 'VFS'
> >>> layer, can support multiple protocols for a single file at the
> >>> same time.
> >>
> >> I don't think there is anything involved in doing this
> >> which is fundamentally incompatible with the sort of
> >> architecture you are talking about.  Of course, as my
> >> artful language suggests, you're probably going to wind
> >> up changing your VFS interface in various ways.  That's
> >> just something you have to accept when you do these sorts
> >> of things.  Otherwise life could get unbearably dull.
> >>
> >>> Each fsid or node exported via exportfs can have a
> >>> different file system type, but for a single file?
> >>
> >> I think you are misunderstanding the relationship of
> >> the protocol by which access to an fs is exported and
> >> the VFS file system type.  Let us suppose that I
> >> have an NFS server which is acting as the storage
> >> server and providing the file-style access on behalf
> >> of a pnfs metadata server on another machine.  Let's
> >> just pick an fs type for the underlying fs.  Let's
> >> say it is UFS.  It doesn't matter.  File-style storage
> >> access, in this case NFS access, can be provided at the
> >> same time that the UFS files within that store are
> >> accessed by local application, if there are any.  In
> >> this context the NFS server is just an application that
> >> is accessing those files.  Now suppose I build a server
> >> for another storage protocol, let's say objects.  It
> >> would receive CDB's instead of RPC requests but its
> >> relationship to the UFS file system through VFS would
> >> be exactly the same as that of the NFS server.  It
> >> would do VFS_READ and VFS_WRITE calls.  If I'm
> >> implementing a blocks storage protocol and I choose
> >> to present in pnfs virtualized block addresses tied
> >> to the actual VFS file objects, then things are similar
> >> in the blocks case as well.  If I want to export the
> >> actual physical block address within the UFS file
> >> system then things get nasty and you probably have to
> >> build things a different way, but the point is that
> >> the many storage protocols may implemented on top of
> >> VFS access to a single fs.
> >>
> >>
> >> -----Original Message-----
> >> From: Dean Hildebrand [mailto:dhildebz@eecs.umich.edu]
> >> Sent: Thursday, June 24, 2004 12:16 PM
> >> To: pnfs-reqs@yahoogroups.com
> >> Subject: RE: [pnfs-reqs] summary of June face-to-face
> >>
> >>
> >> I would still like someone to explain to me how the NFS server, which 
> >> is
> >> talking to a single file system through an OS's 'VFS' layer, can 
> >> support
> >> multiple protocols for a single file at the same time.  Each fsid or
> >> node
> >> exported via exportfs can have a different file system type, but for a
> >> single file?  Is this something that a specialized implementation of 
> >> the
> >> NFS server, like Netapp's, is looking to support?
> >>
> >>
> >> Yahoo! Groups Sponsor
> >> ADVERTISEMENT
> >> click here
> >> [rand=706282213]
> >>
> >> ______________________________________________________________________
> >> __________
> >> Yahoo! Groups Links
> >>  *  To visit your group on the web, go to:
> >>     http://groups.yahoo.com/group/pnfs-reqs/
> >>      
> >>  *  To unsubscribe from this group, send an email to:
> >>     pnfs-reqs-unsubscribe@yahoogroups.com
> >>      
> >>  *  Your use of Yahoo! Groups is subject to the Yahoo! Terms of 
> >> Service.
> >>
> >>
> >
> >
> >
> >
> > Yahoo! Groups Links
> >
> >
> >
> >
> >
>
>
> Yahoo! Groups Sponsor
> ADVERTISEMENT
> click here
> [rand=317263051]
>
> ________________________________________________________________________________
> Yahoo! Groups Links
>  *  To visit your group on the web, go to:
>     http://groups.yahoo.com/group/pnfs-reqs/
>      
>  *  To unsubscribe from this group, send an email to:
>     pnfs-reqs-unsubscribe@yahoogroups.com
>      
>  *  Your use of Yahoo! Groups is subject to the Yahoo! Terms of Service.
>
>


------------------------ Yahoo! Groups Sponsor --------------------~-->
Yahoo! Domains - Claim yours for only $14.70
http://us.click.yahoo.com/Z1wmxD/DREIAA/yQLSAA/W6uqlB/TM
--------------------------------------------------------------------~->


Yahoo! Groups Links

<*> To visit your group on the web, go to:
    http://groups.yahoo.com/group/pnfs-reqs/

<*> To unsubscribe from this group, send an email to:
    pnfs-reqs-unsubscribe@yahoogroups.com

<*> Your use of Yahoo! Groups is subject to:
    http://docs.yahoo.com/info/terms/

From andros@citi.umich.edu Mon Jun 28 15:42:30 2004
Return-Path: <andros@citi.umich.edu>
X-Sender: andros@citi.umich.edu
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 15708 invoked from network); 28 Jun 2004 22:42:29 -0000
Received: from unknown (66.218.66.167)
by m16.grp.scd.yahoo.com with QMQP; 28 Jun 2004 22:42:29 -0000
Received: from unknown (HELO citi.umich.edu) (141.211.133.111)
by mta6.grp.scd.yahoo.com with SMTP; 28 Jun 2004 22:42:28 -0000
Received: from citi.umich.edu (citi.umich.edu [141.211.133.111])
by citi.umich.edu (Postfix) with ESMTP
id 390531301E; Mon, 28 Jun 2004 18:42:28 -0400 (EDT)
X-Mailer: exmh version 2.5 07/13/2001 with version: MH 6.8.3 #74[UCI]
To: pnfs-reqs@yahoogroups.com
Cc: andros@citi.umich.edu
In-reply-to: Your message of "Sun, 27 Jun 2004 11:46:28 +0300."
<OF537C9E0F.612A90A7-ONC2256EC0.0020023B-C2256EC0.003033C4@il.ibm.com>
Mime-Version: 1.0
Content-Type: text/plain; charset=iso-8859-1
Content-Transfer-Encoding: quoted-printable
Date: Mon, 28 Jun 2004 18:42:28 -0400
Message-Id: <20040628224228.390531301E@citi.umich.edu>
X-eGroups-Remote-IP: 141.211.133.111
From: "William A.(Andy) Adamson" <andros@citi.umich.edu>
Subject: Re: [pnfs-reqs] summary of June face-to-face
X-Yahoo-Group-Post: member; u=169434965

julian_satran@il.ibm.com said:
> Definitely - Lustre OST can (in theory) export OST objects (their
> flavor) or files - as they use the native fs (that can be exported
> through NFS too). But however hard I try I can't come up with a good
> reason one may want to do it - unless we want to guarantee that any
> client can access any piece of data through the client's favorite or
> only protocol and still support pNFS.  


i could see OSD inside a cluster, and files for WAN exporting of cluster data.

> I would add also that there is probably no big deal in supporting both
> object and file but definitely a lot of (unwanted) complexity in
> supporting object/file and block,

> And we did not talk about security yet - that IMHO is unmanageable if
> we admit block and file/object access to the same file data.

yes, the security discussion should get started asap!

> Julo 

From garth@panasas.com Thu Jul 01 10:25:40 2004
Return-Path: <garth@panasas.com>
X-Sender: garth@panasas.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 8670 invoked from network); 1 Jul 2004 17:25:39 -0000
Received: from unknown (66.218.66.172)
by m23.grp.scd.yahoo.com with QMQP; 1 Jul 2004 17:25:39 -0000
Received: from unknown (HELO PIKES.panasas.com) (65.194.124.178)
by mta4.grp.scd.yahoo.com with SMTP; 1 Jul 2004 17:25:39 -0000
Received: from [127.0.0.1] ([172.17.19.3]) by PIKES.panasas.com with SMTP (Microsoft Exchange Internet Mail Service Version 5.5.2653.13)
id 2B56TFDH; Thu, 1 Jul 2004 13:25:15 -0400
Mime-Version: 1.0 (Apple Message framework v618)
Content-Transfer-Encoding: 7bit
Message-Id: <9D5D27D6-CB83-11D8-A956-000A95A94F04@panasas.com>
Content-Type: text/plain; charset=US-ASCII; format=flowed
To: pnfs-reqs@yahoogroups.com
Date: Thu, 1 Jul 2004 13:25:14 -0400
X-Mailer: Apple Mail (2.618)
X-eGroups-Remote-IP: 65.194.124.178
From: Garth Gibson <garth@panasas.com>
Subject: planning future meetings
X-Yahoo-Group-Post: member; u=169457820
X-Yahoo-Profile: garth_a_gibson

Folks,

We've gotten behind on meeting plans.

The next IETF NFSv4 meeting is the morning of Tues Aug 3 in San Diego.
See the IETF agenda at http://www.ietf.org/meetings/agenda_60.txt

Hopefully pNFS will get some time on that agenda, but it will be a
small amount at most. What I'd like to propose is that we meet f-2-f
on Aug 2 or 3 (my preference is the afternoon on Aug 3) in San Diego.
I doubt we'll get a room in the IETF hotel (Sheraton San Diego Hotel
and Marina), so I'm also looking for proposals for where to hold the
meeting.

Since timing is tight on this, please reply soon.

And while you are thinking about your schedule, lets talk about the
meeting after. The next IETF is Nov 7-12 in Washington DC (which
overlaps SC04 in Pittsburgh). Again we should target a piece of the
not-very-long NFSv4 meeting and decide if we want to also have a f-2-f.

I'm happy to propose a day long interim meeting at CMU in Pittsburgh in
September, before the cut off dates to submit for the IETF meeting in
Nov.

Opinions? Date suggestions? Alternatives?

garth


From Thomas.Talpey@netapp.com Thu Jul 01 10:39:17 2004
Return-Path: <Thomas.Talpey@netapp.com>
X-Sender: Thomas.Talpey@netapp.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 86323 invoked from network); 1 Jul 2004 17:39:16 -0000
Received: from unknown (66.218.66.167)
by m24.grp.scd.yahoo.com with QMQP; 1 Jul 2004 17:39:16 -0000
Received: from unknown (HELO mx01.netapp.com) (198.95.226.53)
by mta6.grp.scd.yahoo.com with SMTP; 1 Jul 2004 17:39:16 -0000
Received: from hawk.corp.netapp.com (hawk [10.57.156.122])
by mx01.netapp.com (8.12.10/8.12.10/NTAP-1.4) with ESMTP id i61HdFkX008337
for <pnfs-reqs@yahoogroups.com>; Thu, 1 Jul 2004 10:39:15 -0700 (PDT)
Received: from svlexc01.hq.netapp.com (svlexc01.corp.netapp.com [10.57.156.135])
by hawk.corp.netapp.com (8.12.9/8.12.9/NTAP-1.5) with ESMTP id i61HdFI6011218
for <pnfs-reqs@yahoogroups.com>; Thu, 1 Jul 2004 10:39:15 -0700 (PDT)
Received: from tmt.netapp.com ([10.97.6.37]) by silver.hq.netapp.com with Microsoft SMTPSVC(5.0.2195.5329); Thu, 1 Jul 2004 13:39:09 -0400
MIME-Version: 1.0
Content-Type: multipart/alternative;
boundary="----_=_NextPart_001_01C45F92.50485C80"
X-MimeOLE: Produced By Microsoft Exchange V6.0.6249.0
content-class: urn:content-classes:message
Date: Thu, 1 Jul 2004 10:39:07 -0700
Message-ID: <6.1.2.0.2.20040701133553.034c6ec0@silver.nane.netapp.com>
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
Thread-Topic: [pnfs-reqs] planning future meetings
Thread-Index: AcRfklCgSsNlJf+5TcihZOsoILI4Mw==
To: <pnfs-reqs@yahoogroups.com>
X-eGroups-Remote-IP: 198.95.226.53
From: "Talpey, Thomas" <Thomas.Talpey@netapp.com>
Subject: Re: [pnfs-reqs] planning future meetings
X-Yahoo-Group-Post: member; u=44154239
X-Yahoo-Profile: tmtymailu

At 01:25 PM 7/1/2004, Garth Gibson wrote:
>The next IETF NFSv4 meeting is the morning of Tues Aug 3 in San Diego. 
>See the IETF agenda at http://www.ietf.org/meetings/agenda_60.txt

***DRAFT*** agenda. Don't count on this date until it's "final"!

>Hopefully pNFS will get some time on that agenda, but it will be a
>small amount at most.  What I'd like to propose is that we meet f-2-f
>on Aug 2 or 3 (my preference is the afternoon on Aug 3) in San Diego.

I would be very reluctant to commit to this without knowing what
conflicts it introduces with IETF business. There are several other
working groups that I attend. Also, I believe this may a bad precedent
to set for pNFS w.r.t. the IETF. That said, a nonconflicting f2f would
be good.

Tom.  

From ggrider@lanl.gov Thu Jul 01 17:30:27 2004
Return-Path: <ggrider@lanl.gov>
X-Sender: ggrider@lanl.gov
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 12610 invoked from network); 2 Jul 2004 00:30:26 -0000
Received: from unknown (66.218.66.172)
by m22.grp.scd.yahoo.com with QMQP; 2 Jul 2004 00:30:26 -0000
Received: from unknown (HELO mailwasher-b.lanl.gov) (192.16.0.25)
by mta4.grp.scd.yahoo.com with SMTP; 2 Jul 2004 00:30:26 -0000
Received: from mailrelay3.lanl.gov (localhost.localdomain [127.0.0.1])
by mailwasher-b.lanl.gov (8.12.10/8.12.10/(ccn-5)) with ESMTP id i620TNCp007407
for <pnfs-reqs@yahoogroups.com>; Thu, 1 Jul 2004 18:29:23 -0600
Received: from cic-mail.lanl.gov (localhost.localdomain [127.0.0.1])
by mailrelay3.lanl.gov (8.12.10/8.12.10/(ccn-5)) with ESMTP id i620TM24008505
for <pnfs-reqs@yahoogroups.com>; Thu, 1 Jul 2004 18:29:22 -0600
Received: from cthulu.lanl.gov (vpn-client-153.lanl.gov [128.165.253.153])
by cic-mail.lanl.gov (8.12.11/8.12.11/(ccn-5)) with ESMTP id i620TKas027778
for <pnfs-reqs@yahoogroups.com>; Thu, 1 Jul 2004 18:29:21 -0600
Message-Id: <5.2.0.9.2.20040701182847.03623f30@cic-mail.lanl.gov>
X-Sender: ggrider@cic-mail.lanl.gov
X-Mailer: QUALCOMM Windows Eudora Version 5.2.0.9
Date: Thu, 01 Jul 2004 18:29:24 -0600
To: pnfs-reqs@yahoogroups.com
In-Reply-To: <9D5D27D6-CB83-11D8-A956-000A95A94F04@panasas.com>
Mime-Version: 1.0
Content-Type: multipart/related;
type="multipart/alternative";
boundary="=====================_1525173==.REL"
X-Scanned-By: MIMEDefang 2.35
X-eGroups-Remote-IP: 192.16.0.25
From: Gary Grider <ggrider@lanl.gov>
Subject: Re: [pnfs-reqs] planning future meetings
X-Yahoo-Group-Post: member; u=169341185
X-Yahoo-Profile: ggriderpnfs

I prefer the 3rd or 4th, not the 2nd.

Thanks
Gary

At 01:25 PM 7/1/2004 -0400, Garth Gibson wrote:

> Folks,
>
> We've gotten behind on meeting plans.
>
> The next IETF NFSv4 meeting is the morning of Tues Aug 3 in San Diego. 
> See the IETF agenda at http://www.ietf.org/meetings/agenda_60.txt
>
> Hopefully pNFS will get some time on that agenda, but it will be a
> small amount at most.  What I'd like to propose is that we meet f-2-f
> on Aug 2 or 3 (my preference is the afternoon on Aug 3) in San Diego. 
> I doubt we'll get a room in the IETF hotel (Sheraton San Diego Hotel
> and Marina), so I'm also looking for proposals for where to hold the
> meeting.
>
> Since timing is tight on this, please reply soon.
>
> And while you are thinking about your schedule, lets talk about the
> meeting after.  The next IETF is Nov 7-12 in Washington DC (which
> overlaps SC04 in Pittsburgh).  Again we should target a piece of the
> not-very-long NFSv4 meeting and decide if we want to also have a f-2-f.
>
> I'm happy to propose a day long interim meeting at CMU in Pittsburgh in
> September, before the cut off dates to submit for the IETF meeting in
> Nov.
>
> Opinions?  Date suggestions?  Alternatives?
>
> garth
>
>
> Yahoo! Groups Sponsor
> ADVERTISEMENT
> 1744c4.jpg
> 174532.jpg
>
> Yahoo! Groups Links
>
>     * To visit your group on the web, go to:
>     * http://groups.yahoo.com/group/pnfs-reqs/
>     *  
>     * To unsubscribe from this group, send an email to:
>     * pnfs-reqs-unsubscribe@yahoogroups.com
>     *  
>     * Your use of Yahoo! Groups is subject to the Yahoo! Terms of Service. 

From julian_satran@il.ibm.com Thu Jul 01 23:42:12 2004
Return-Path: <Julian_Satran@il.ibm.com>
X-Sender: Julian_Satran@il.ibm.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 34945 invoked from network); 2 Jul 2004 06:42:11 -0000
Received: from unknown (66.218.66.217)
by m10.grp.scd.yahoo.com with QMQP; 2 Jul 2004 06:42:11 -0000
Received: from unknown (HELO mtagate4.de.ibm.com) (195.212.29.153)
by mta2.grp.scd.yahoo.com with SMTP; 2 Jul 2004 06:42:10 -0000
Received: from d12nrmr1507.megacenter.de.ibm.com (d12nrmr1507.megacenter.de.ibm.com [9.149.167.1])
by mtagate4.de.ibm.com (8.12.10/8.12.10) with ESMTP id i626fkFA112586
for <pnfs-reqs@yahoogroups.com>; Fri, 2 Jul 2004 06:41:46 GMT
Received: from d12ml102.megacenter.de.ibm.com (d12av02.megacenter.de.ibm.com [9.149.165.228])
by d12nrmr1507.megacenter.de.ibm.com (8.12.10/NCO/VER6.6) with ESMTP id i626fjCD260396
for <pnfs-reqs@yahoogroups.com>; Fri, 2 Jul 2004 08:41:46 +0200
In-Reply-To: <9D5D27D6-CB83-11D8-A956-000A95A94F04@panasas.com>
To: pnfs-reqs@yahoogroups.com
Cc: pnfs-reqs@yahoogroups.com
MIME-Version: 1.0
X-Mailer: Lotus Notes Release 6.5.1 January 21, 2004
Message-ID: <OFAE55ECBE.98EC3D32-ONC2256EC5.001C845A-C2256EC5.0024C769@il.ibm.com>
Date: Fri, 2 Jul 2004 09:41:43 +0300
X-MIMETrack: Serialize by Router on D12ML102/12/M/IBM(Release 6.0.2CF2HF259 | March 11, 2004) at
02/07/2004 09:41:46,
Serialize complete at 02/07/2004 09:41:46
Content-Type: multipart/alternative; boundary="=_alternative 001D1239C2256EC5_="
X-eGroups-Remote-IP: 195.212.29.153
From: Julian Satran <julian_satran@il.ibm.com>
Subject: Re: [pnfs-reqs] planning future meetings
X-Yahoo-Group-Post: member; u=64714603
X-Yahoo-Profile: julian_satran

ADVERTISEMENT

The IETF agenda is not final (I don't see a requested IPS session yet) but except for this August 3 (afternoon) looks fine.
As for September - for me the last dates are second week before the SNIA-OSD F2F in California. After that is are the Jewish fall holidays that will make only several days towards the end feasible.

Julo


Garth Gibson <garth@panasas.com>

01/07/04 20:25
Please respond to
pnfs-reqs

	
To
	pnfs-reqs@yahoogroups.com
cc
	
Subject
	[pnfs-reqs] planning future meetings

	




Folks,

We've gotten behind on meeting plans.

The next IETF NFSv4 meeting is the morning of Tues Aug 3 in San Diego.  
See the IETF agenda at http://www.ietf.org/meetings/agenda_60.txt

Hopefully pNFS will get some time on that agenda, but it will be a
small amount at most.  What I'd like to propose is that we meet f-2-f
on Aug 2 or 3 (my preference is the afternoon on Aug 3) in San Diego.  
I doubt we'll get a room in the IETF hotel (Sheraton San Diego Hotel
and Marina), so I'm also looking for proposals for where to hold the
meeting.

Since timing is tight on this, please reply soon.

And while you are thinking about your schedule, lets talk about the
meeting after.  The next IETF is Nov 7-12 in Washington DC (which
overlaps SC04 in Pittsburgh).  Again we should target a piece of the
not-very-long NFSv4 meeting and decide if we want to also have a f-2-f.

I'm happy to propose a day long interim meeting at CMU in Pittsburgh in
September, before the cut off dates to submit for the IETF meeting in
Nov.

Opinions?  Date suggestions?  Alternatives?

garth



------------------------ Yahoo! Groups Sponsor --------------------~-->
Yahoo! Domains - Claim yours for only $14.70
http://us.click.yahoo.com/Z1wmxD/DREIAA/yQLSAA/W6uqlB/TM
--------------------------------------------------------------------~->


Yahoo! Groups Links

<*> To visit your group on the web, go to:
    http://groups.yahoo.com/group/pnfs-reqs/

<*> To unsubscribe from this group, send an email to:
    pnfs-reqs-unsubscribe@yahoogroups.com

<*> Your use of Yahoo! Groups is subject to:
    http://docs.yahoo.com/info/terms/

From black_david@emc.com Mon Jul 05 10:40:55 2004
Return-Path: <Black_David@emc.com>
X-Sender: Black_David@emc.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 13220 invoked from network); 5 Jul 2004 17:40:53 -0000
Received: from unknown (66.218.66.217)
by m6.grp.scd.yahoo.com with QMQP; 5 Jul 2004 17:40:53 -0000
Received: from unknown (HELO mailhub.lss.emc.com) (168.159.2.31)
by mta2.grp.scd.yahoo.com with SMTP; 5 Jul 2004 17:40:53 -0000
Received: from mxic2.corp.emc.com (mxic2.corp.emc.com [128.221.12.9])
by mailhub.lss.emc.com (Switch-2.2.8/Switch-2.2.0) with ESMTP id i65HeOV10919
for <pnfs-reqs@yahoogroups.com>; Mon, 5 Jul 2004 13:40:25 -0400 (EDT)
Received: by mxic2.corp.emc.com with Internet Mail Service (5.5.2653.19)
id <MZC23F0V>; Mon, 5 Jul 2004 13:40:24 -0400
Message-ID: <B459CE1AFFC52D4688B2A5B842CA35EA7A5C0A@corpmx14.corp.emc.com>
To: pnfs-reqs@yahoogroups.com
Date: Mon, 5 Jul 2004 13:40:16 -0400
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2653.19)
Content-Type: multipart/alternative;
boundary="----_=_NextPart_001_01C462B7.224E492A"
X-PMX-Version: 4.6.0.97784, Antispam-Core: 4.6.0.97340, Antispam-Data: 2004.7.4.105843
X-eGroups-Remote-IP: 168.159.2.31
From: black_david@emc.com
Subject: RE: [pnfs-reqs] planning future meetings
X-Yahoo-Group-Post: member; u=82420288
X-Yahoo-Profile: dlb237

ADVERTISEMENT
IPS is combined with IMSS on Monday evening.  Tom's comment
that this agenda is subject to change is important - that
combined session might move elsewhere in the day on Monday
... and we won't know final meeting slots for anything until
about 2 weeks prior to the meeting.  One exception is that
the combined IPS/IMSS will not move from Monday, as these
two WGs will meet that day in order to minimize a conflict
with T11 meetings the same week in Colorado.
 
Also, once pNFS is a formal IETF effort, this sort of use
of private face-to-face meetings and concalls to work on the
draft(s) starts to become problematic - Julian may remember
some of what had to be done in the early days of IP Storage
to ensure that the drafts were worked on in a sufficiently
public fashion.  Private meetings among a small author team
are ok (getting together at the bar at IETF meetings is a
long-standing tradition), but any requirement to attend
phone calls or meetings other than official WG interim
meetings in order to have input into drafts under
development is definitely not ok.
 
One interesting opportunity is that I have *not* seen any
announcement of a social event, so a dinner/evening working
session of some form on Tuesday (assuming the NFSv4 WG
meeting stays on Tuesday) sounds promising.  I'd stay away
from Tuesday afternoon because the fact that it's broken
into 4 one-hour sessions increases the possibility of
an important conflict.
 
Thanks,
--David

    -----Original Message-----
    From: Julian Satran [mailto:julian_satran@il.ibm.com]
    Sent: Friday, July 02, 2004 2:42 AM
    To: pnfs-reqs@yahoogroups.com
    Cc: pnfs-reqs@yahoogroups.com
    Subject: Re: [pnfs-reqs] planning future meetings


    The IETF agenda is not final (I don't see a requested IPS session yet) but except for this August 3 (afternoon) looks fine.
    As for September - for me the last dates are second week before the SNIA-OSD F2F in California. After that is are the Jewish fall holidays that will make only several days towards the end feasible.

    Julo


    Garth Gibson <garth@panasas.com>

    01/07/04 20:25
    Please respond to
    pnfs-reqs

    	
    To
    	pnfs-reqs@yahoogroups.com
    cc
    	
    Subject
    	[pnfs-reqs] planning future meetings

    	




    Folks,

    We've gotten behind on meeting plans.

    The next IETF NFSv4 meeting is the morning of Tues Aug 3 in San Diego.  
    See the IETF agenda at http://www.ietf.org/meetings/agenda_60.txt

    Hopefully pNFS will get some time on that agenda, but it will be a
    small amount at most.  What I'd like to propose is that we meet f-2-f
    on Aug 2 or 3 (my preference is the afternoon on Aug 3) in San Diego.  
    I doubt we'll get a room in the IETF hotel (Sheraton San Diego Hotel
    and Marina), so I'm also looking for proposals for where to hold the
    meeting.

    Since timing is tight on this, please reply soon.

    And while you are thinking about your schedule, lets talk about the
    meeting after.  The next IETF is Nov 7-12 in Washington DC (which
    overlaps SC04 in Pittsburgh).  Again we should target a piece of the
    not-very-long NFSv4 meeting and decide if we want to also have a f-2-f.

    I'm happy to propose a day long interim meeting at CMU in Pittsburgh in
    September, before the cut off dates to submit for the IETF meeting in
    Nov.

    Opinions?  Date suggestions?  Alternatives?

    garth



    ------------------------ Yahoo! Groups Sponsor --------------------~-->
    Yahoo! Domains - Claim yours for only $14.70
    http://us.click.yahoo.com/Z1wmxD/DREIAA/yQLSAA/W6uqlB/TM
    --------------------------------------------------------------------~->


    Yahoo! Groups Links

    <*> To visit your group on the web, go to:
        http://groups.yahoo.com/group/pnfs-reqs/

    <*> To unsubscribe from this group, send an email to:
        pnfs-reqs-unsubscribe@yahoogroups.com

    <*> Your use of Yahoo! Groups is subject to:
        http://docs.yahoo.com/info/terms/

From garth@panasas.com Thu Jul 08 09:13:44 2004
Return-Path: <garth@panasas.com>
X-Sender: garth@panasas.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 24409 invoked from network); 8 Jul 2004 16:13:43 -0000
Received: from unknown (66.218.66.167)
by m22.grp.scd.yahoo.com with QMQP; 8 Jul 2004 16:13:43 -0000
Received: from unknown (HELO PIKES.panasas.com) (65.194.124.178)
by mta6.grp.scd.yahoo.com with SMTP; 8 Jul 2004 16:13:43 -0000
Received: from [127.0.0.1] ([172.17.19.3]) by PIKES.panasas.com with SMTP (Microsoft Exchange Internet Mail Service Version 5.5.2653.13)
id 2B56T478; Thu, 8 Jul 2004 12:13:41 -0400
Mime-Version: 1.0 (Apple Message framework v618)
In-Reply-To: <9D5D27D6-CB83-11D8-A956-000A95A94F04@panasas.com>
References: <9D5D27D6-CB83-11D8-A956-000A95A94F04@panasas.com>
Content-Type: text/plain; charset=US-ASCII; format=flowed
Message-Id: <C65A00D9-D0F9-11D8-B0CA-000A95A94F04@panasas.com>
Content-Transfer-Encoding: 7bit
Date: Thu, 8 Jul 2004 12:13:40 -0400
To: pnfs-reqs@yahoogroups.com
X-Mailer: Apple Mail (2.618)
X-eGroups-Remote-IP: 65.194.124.178
From: Garth Gibson <garth@panasas.com>
Subject: Re: planning future meetings
X-Yahoo-Group-Post: member; u=169457820
X-Yahoo-Profile: garth_a_gibson

Here is what I think is on the table for comments/interest from this
community:

Tues Aug 3, IETF 60, San Diego:
- IETF NFSv4 meeting is in the morning
- in the afternoon, and evening if appropriate, interested and
available pNFS participants gather in the hotel lobby bar. the goal is
to get 4-6 hours of meeting time to justify those of us traveling to
San Diego that will not otherwise attend the IETF meeting. we might try
to arrange a room in a local chinese restaurant, for example, but we do
not expect any meeting rooms will be available at the IETF hotel and we
do not want to create an officially conflicting meeting.
- IETF schedule is not final and may change as late as 2 weeks before
the meeting, so people's Tues afternoon may change. this is in part
why we are suggesting that the meeting may go into the evening.
staying the night is not a bad thing for most of us as the RDDP meeting
is likely to be first thing on Wed morning anyway.

Would folks please speak up on this list if they are going to attend
this meeting.

Thurs Sept 30, Pennsylvania:
- an informal face-to-face in the Pittsburgh area, at either CMU or
Nemacolin (for those who collaborate with or support CMU's PDL, this is
the day after the PDL retreat)
- this might be the last informal pNFS meeting done outside the NFSv4
TWG schedule as we hope to soon convince the NFSv4 group to take us in

week of Nov 8, IETF 61, Washington DC
- similar to the Aug 3 meeting, a meeting in the bar or in a restaurant
during the IETF meeting week

garth


On Jul 1, 2004, at 1:25 PM, Garth Gibson wrote:

> Folks,
>
> We've gotten behind on meeting plans.
>
> The next IETF NFSv4 meeting is the morning of Tues Aug 3 in San Diego.
> See the IETF agenda at http://www.ietf.org/meetings/agenda_60.txt
>
> Hopefully pNFS will get some time on that agenda, but it will be a
> small amount at most. What I'd like to propose is that we meet f-2-f
> on Aug 2 or 3 (my preference is the afternoon on Aug 3) in San Diego.
> I doubt we'll get a room in the IETF hotel (Sheraton San Diego Hotel
> and Marina), so I'm also looking for proposals for where to hold the
> meeting.
>
> Since timing is tight on this, please reply soon.
>
> And while you are thinking about your schedule, lets talk about the
> meeting after. The next IETF is Nov 7-12 in Washington DC (which
> overlaps SC04 in Pittsburgh). Again we should target a piece of the
> not-very-long NFSv4 meeting and decide if we want to also have a f-2-f.
>
> I'm happy to propose a day long interim meeting at CMU in Pittsburgh in
> September, before the cut off dates to submit for the IETF meeting in
> Nov.
>
> Opinions? Date suggestions? Alternatives?
>
> garth
>
>
>
>
>
> Yahoo! Groups Links
>
>
>
>

From Thomas.Talpey@netapp.com Thu Jul 08 13:36:06 2004
Return-Path: <Thomas.Talpey@netapp.com>
X-Sender: Thomas.Talpey@netapp.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 55047 invoked from network); 8 Jul 2004 20:36:05 -0000
Received: from unknown (66.218.66.172)
by m21.grp.scd.yahoo.com with QMQP; 8 Jul 2004 20:36:05 -0000
Received: from unknown (HELO mx01.netapp.com) (198.95.226.53)
by mta4.grp.scd.yahoo.com with SMTP; 8 Jul 2004 20:36:05 -0000
Received: from hawk.corp.netapp.com (hawk [10.57.156.122])
by mx01.netapp.com (8.12.10/8.12.10/NTAP-1.4) with ESMTP id i68Ka3kX019101
for <pnfs-reqs@yahoogroups.com>; Thu, 8 Jul 2004 13:36:04 -0700 (PDT)
Received: from svlexc01.hq.netapp.com (svlexc01.corp.netapp.com [10.57.156.135])
by hawk.corp.netapp.com (8.12.9/8.12.9/NTAP-1.5) with ESMTP id i68KZe05014138
for <pnfs-reqs@yahoogroups.com>; Thu, 8 Jul 2004 13:36:03 -0700 (PDT)
Received: from tmt.netapp.com ([10.97.6.31]) by silver.hq.netapp.com with Microsoft SMTPSVC(5.0.2195.5329); Thu, 8 Jul 2004 16:35:48 -0400
MIME-Version: 1.0
Content-Type: multipart/alternative;
boundary="----_=_NextPart_001_01C4652B.26ABAA00"
X-MimeOLE: Produced By Microsoft Exchange V6.0.6249.0
content-class: urn:content-classes:message
Date: Thu, 8 Jul 2004 13:35:24 -0700
Message-ID: <6.1.2.0.2.20040708163437.01db5508@silver.nane.netapp.com>
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
Thread-Topic: [pnfs-reqs] Re: planning future meetings
Thread-Index: AcRlKyb0ymyRZseOTi2GvdUG/hxnRA==
To: <pnfs-reqs@yahoogroups.com>
X-eGroups-Remote-IP: 198.95.226.53
From: "Talpey, Thomas" <Thomas.Talpey@netapp.com>
Subject: Re: [pnfs-reqs] Re: planning future meetings
X-Yahoo-Group-Post: member; u=44154239
X-Yahoo-Profile: tmtymailu

At 12:13 PM 7/8/2004, Garth Gibson wrote:
>Would folks please speak up on this list if they are going to attend
>this meeting.

I'll be attending IETF from Sunday->Thursday and will attend
the nonconflicting parts of the pNFS meeting.

Tom. 

From black_david@emc.com Thu Jul 08 13:40:33 2004
Return-Path: <Black_David@emc.com>
X-Sender: Black_David@emc.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 74018 invoked from network); 8 Jul 2004 20:40:33 -0000
Received: from unknown (66.218.66.216)
by m17.grp.scd.yahoo.com with QMQP; 8 Jul 2004 20:40:33 -0000
Received: from unknown (HELO mailhub.lss.emc.com) (168.159.2.31)
by mta1.grp.scd.yahoo.com with SMTP; 8 Jul 2004 20:40:32 -0000
Received: from MAHO3MSX2.corp.emc.com (maho3msx2.isus.emc.com [128.221.11.32])
by mailhub.lss.emc.com (Switch-3.1.6/Switch-2.2.0) with ESMTP id i68KeT6D023513
for <pnfs-reqs@yahoogroups.com>; Thu, 8 Jul 2004 16:40:29 -0400 (EDT)
Received: by maho3msx2.corp.emc.com with Internet Mail Service (5.5.2653.19)
id <M6BNJWJZ>; Thu, 8 Jul 2004 16:40:28 -0400
Message-ID: <B459CE1AFFC52D4688B2A5B842CA35EA7A5C44@corpmx14.corp.emc.com>
To: pnfs-reqs@yahoogroups.com
Date: Thu, 8 Jul 2004 16:40:25 -0400
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2653.19)
Content-Type: multipart/alternative;
boundary="----_=_NextPart_001_01C4652B.CC03D425"
X-PMX-Version: 4.6.0.97784, Antispam-Core: 4.6.0.97340, Antispam-Data: 2004.7.8.106429
X-eGroups-Remote-IP: 168.159.2.31
From: black_david@emc.com
Subject: RE: [pnfs-reqs] Re: planning future meetings
X-Yahoo-Group-Post: member; u=82420288
X-Yahoo-Profile: dlb237

Ditto.  --David

    -----Original Message-----
    From: Talpey, Thomas [mailto:Thomas.Talpey@netapp.com]
    Sent: Thursday, July 08, 2004 4:35 PM
    To: pnfs-reqs@yahoogroups.com
    Subject: Re: [pnfs-reqs] Re: planning future meetings

    At 12:13 PM 7/8/2004, Garth Gibson wrote:
    >Would folks please speak up on this list if they are going to attend
    >this meeting.

    I'll be attending IETF from Sunday->Thursday and will attend
    the nonconflicting parts of the pNFS meeting.

    Tom. 

From pcorbett@netapp.com Thu Jul 08 13:42:42 2004
Return-Path: <Peter.Corbett@netapp.com>
X-Sender: Peter.Corbett@netapp.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 82421 invoked from network); 8 Jul 2004 20:42:42 -0000
Received: from unknown (66.218.66.172)
by m21.grp.scd.yahoo.com with QMQP; 8 Jul 2004 20:42:42 -0000
Received: from unknown (HELO mx01.netapp.com) (198.95.226.53)
by mta4.grp.scd.yahoo.com with SMTP; 8 Jul 2004 20:42:42 -0000
Received: from hawk.corp.netapp.com (hawk [10.57.156.122])
by mx01.netapp.com (8.12.10/8.12.10/NTAP-1.4) with ESMTP id i68KggkX019829
for <pnfs-reqs@yahoogroups.com>; Thu, 8 Jul 2004 13:42:42 -0700 (PDT)
Received: from svlexc01.hq.netapp.com (svlexc01.corp.netapp.com [10.57.156.135])
by hawk.corp.netapp.com (8.12.9/8.12.9/NTAP-1.5) with ESMTP id i68Kgfxv016932
for <pnfs-reqs@yahoogroups.com>; Thu, 8 Jul 2004 13:42:41 -0700 (PDT)
content-class: urn:content-classes:message
MIME-Version: 1.0
Content-Type: multipart/alternative;
boundary="----_=_NextPart_001_01C4652C.1B75ECDD"
X-MimeOLE: Produced By Microsoft Exchange V6.0.6249.0
Date: Thu, 8 Jul 2004 13:42:38 -0700
Message-ID: <C8CF60CFC4D8A74E9945E32CF096548A020161FD@silver.nane.netapp.com>
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
Thread-Topic: [pnfs-reqs] Re: planning future meetings
Thread-Index: AcRlKyb0ymyRZseOTi2GvdUG/hxnRAAAOcdA
To: <pnfs-reqs@yahoogroups.com>
X-eGroups-Remote-IP: 198.95.226.53
X-eGroups-From: "Corbett, Peter" <Peter.Corbett@netapp.com>
From: "Corbett, Peter" <pcorbett@netapp.com>
Subject: RE: [pnfs-reqs] Re: planning future meetings
X-Yahoo-Group-Post: member; u=44152959
X-Yahoo-Profile: pfcorbett2004

I�m planning to attend Tues and Wed.

 

From: Talpey, Thomas
Sent: Thursday, July 08, 2004 4:35 PM
To: pnfs-reqs@yahoogroups.com
Subject: Re: [pnfs-reqs] Re: planning future meetings

 

At 12:13 PM 7/8/2004, Garth Gibson wrote:
>Would folks please speak up on this list if they are going to attend
>this meeting.

I'll be attending IETF from Sunday->Thursday and will attend
the nonconflicting parts of the pNFS meeting.

Tom.

From dnoveck@netapp.com Fri Jul 09 06:21:15 2004
Return-Path: <Dave.Noveck@netapp.com>
X-Sender: Dave.Noveck@netapp.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 39778 invoked from network); 9 Jul 2004 13:21:12 -0000
Received: from unknown (66.218.66.216)
by m3.grp.scd.yahoo.com with QMQP; 9 Jul 2004 13:21:12 -0000
Received: from unknown (HELO mx01.netapp.com) (198.95.226.53)
by mta1.grp.scd.yahoo.com with SMTP; 9 Jul 2004 13:21:12 -0000
Received: from frejya.corp.netapp.com (frejya [10.57.157.119])
by mx01.netapp.com (8.12.10/8.12.10/NTAP-1.4) with ESMTP id i69DDVkX019499
for <pnfs-reqs@yahoogroups.com>; Fri, 9 Jul 2004 06:13:31 -0700 (PDT)
Received: from svlexc01.hq.netapp.com (svlexc01.corp.netapp.com [10.57.156.135])
by frejya.corp.netapp.com (8.12.9/8.12.9/NTAP-1.5) with ESMTP id i69DDVJW002477
for <pnfs-reqs@yahoogroups.com>; Fri, 9 Jul 2004 06:13:31 -0700 (PDT)
content-class: urn:content-classes:message
MIME-Version: 1.0
Content-Type: multipart/alternative;
boundary="----_=_NextPart_001_01C465B6.85DD1712"
X-MimeOLE: Produced By Microsoft Exchange V6.0.6249.0
Date: Fri, 9 Jul 2004 06:13:27 -0700
Message-ID: <C8CF60CFC4D8A74E9945E32CF096548A0214225D@silver.nane.netapp.com>
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
Thread-Topic: [pnfs-reqs] Re: planning future meetings
Thread-Index: AcRlKyb0ymyRZseOTi2GvdUG/hxnRAAAOcdAACKUhnA=
To: <pnfs-reqs@yahoogroups.com>
X-eGroups-Remote-IP: 198.95.226.53
X-eGroups-From: "Noveck, Dave" <Dave.Noveck@netapp.com>
From: "Noveck, Dave" <dnoveck@netapp.com>
Subject: RE: [pnfs-reqs] Re: planning future meetings
X-Yahoo-Group-Post: member; u=44152831
X-Yahoo-Profile: davidnoveck

I'm probably going to attend Tues and Wed, but that's subject to change
if the nfsv4 working group moves.

    -----Original Message-----
    From: Corbett, Peter
    Sent: Thursday, July 08, 2004 4:43 PM
    To: pnfs-reqs@yahoogroups.com
    Subject: RE: [pnfs-reqs] Re: planning future meetings

    I�m planning to attend Tues and Wed.

     

    From: Talpey, Thomas
    Sent: Thursday, July 08, 2004 4:35 PM
    To: pnfs-reqs@yahoogroups.com
    Subject: Re: [pnfs-reqs] Re: planning future meetings

     

    At 12:13 PM 7/8/2004, Garth Gibson wrote:
>Would folks please speak up on this list if they are going to attend
>this meeting.

    I'll be attending IETF from Sunday->Thursday and will attend
    the nonconflicting parts of the pNFS meeting.

    Tom.

From julian_satran@il.ibm.com Sun Jul 11 19:07:53 2004
Return-Path: <Julian_Satran@il.ibm.com>
X-Sender: Julian_Satran@il.ibm.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 51797 invoked from network); 12 Jul 2004 02:07:51 -0000
Received: from unknown (66.218.66.216)
by m11.grp.scd.yahoo.com with QMQP; 12 Jul 2004 02:07:51 -0000
Received: from unknown (HELO mtagate4.de.ibm.com) (195.212.29.153)
by mta1.grp.scd.yahoo.com with SMTP; 12 Jul 2004 02:07:50 -0000
Received: from d12nrmr1507.megacenter.de.ibm.com (d12nrmr1507.megacenter.de.ibm.com [9.149.167.1])
by mtagate4.de.ibm.com (8.12.10/8.12.10) with ESMTP id i6C27jFA117364
for <pnfs-reqs@yahoogroups.com>; Mon, 12 Jul 2004 02:07:45 GMT
Received: from d12ml102.megacenter.de.ibm.com (d12av02.megacenter.de.ibm.com [9.149.165.228])
by d12nrmr1507.megacenter.de.ibm.com (8.12.10/NCO/VER6.6) with ESMTP id i6C27ivM255956
for <pnfs-reqs@yahoogroups.com>; Mon, 12 Jul 2004 04:07:45 +0200
In-Reply-To: <C65A00D9-D0F9-11D8-B0CA-000A95A94F04@panasas.com>
To: pnfs-reqs@yahoogroups.com
Cc: pnfs-reqs@yahoogroups.com
MIME-Version: 1.0
X-Mailer: Lotus Notes Release 6.5.1 January 21, 2004
Message-ID: <OF86EDF194.8B8BCF59-ONC2256ECE.002F767B-C2256ECF.000BB151@il.ibm.com>
Date: Mon, 12 Jul 2004 05:07:42 +0300
X-MIMETrack: Serialize by Router on D12ML102/12/M/IBM(Release 6.0.2CF2HF259 | March 11, 2004) at
12/07/2004 05:07:44,
Serialize complete at 12/07/2004 05:07:44
Content-Type: multipart/alternative; boundary="=_alternative 002FC628C2256ECE_="
X-eGroups-Remote-IP: 195.212.29.153
From: Julian Satran <julian_satran@il.ibm.com>
Subject: Re: [pnfs-reqs] Re: planning future meetings
X-Yahoo-Group-Post: member; u=64714603
X-Yahoo-Profile: julian_satran

ADVERTISEMENT
click here

I will participate in the August and November meeting. For the September - depends on other travel arrangements.

Regards,
Julo


Garth Gibson <garth@panasas.com>

08/07/04 19:13
Please respond to
pnfs-reqs

	
To
	pnfs-reqs@yahoogroups.com
cc
	
Subject
	[pnfs-reqs] Re: planning future meetings

	




Here is what I think is on the table for comments/interest from this
community:

Tues Aug 3, IETF 60, San Diego:
- IETF NFSv4 meeting is in the morning
- in the afternoon, and evening if appropriate, interested and
available pNFS participants gather in the hotel lobby bar.  the goal is
to get 4-6 hours of meeting time to justify those of us traveling to
San Diego that will not otherwise attend the IETF meeting. we might try
to arrange a room in a local chinese restaurant, for example, but we do
not expect any meeting rooms will be available at the IETF hotel and we
do not want to create an officially conflicting meeting.
- IETF schedule is not final and may change as late as 2 weeks before
the meeting, so people's Tues afternoon may change.  this is in part
why we are suggesting that the meeting may go into the evening.  
staying the night is not a bad thing for most of us as the RDDP meeting
is likely to be first thing on Wed morning anyway.

Would folks please speak up on this list if they are going to attend
this meeting.

Thurs Sept 30, Pennsylvania:
- an informal face-to-face in the Pittsburgh area, at either CMU or
Nemacolin (for those who collaborate with or support CMU's PDL, this is
the day after the PDL retreat)
- this might be the last informal pNFS meeting done outside the NFSv4
TWG schedule as we hope to soon convince the NFSv4 group to take us in

week of Nov 8, IETF 61, Washington DC
- similar to the Aug 3 meeting, a meeting in the bar or in a restaurant
during the IETF meeting week

garth


On Jul 1, 2004, at 1:25 PM, Garth Gibson wrote:

> Folks,
>
> We've gotten behind on meeting plans.
>
> The next IETF NFSv4 meeting is the morning of Tues Aug 3 in San Diego.
> See the IETF agenda at http://www.ietf.org/meetings/agenda_60.txt
>
> Hopefully pNFS will get some time on that agenda, but it will be a
> small amount at most.  What I'd like to propose is that we meet f-2-f
> on Aug 2 or 3 (my preference is the afternoon on Aug 3) in San Diego.
> I doubt we'll get a room in the IETF hotel (Sheraton San Diego Hotel
> and Marina), so I'm also looking for proposals for where to hold the
> meeting.
>
> Since timing is tight on this, please reply soon.
>
> And while you are thinking about your schedule, lets talk about the
> meeting after.  The next IETF is Nov 7-12 in Washington DC (which
> overlaps SC04 in Pittsburgh).  Again we should target a piece of the
> not-very-long NFSv4 meeting and decide if we want to also have a f-2-f.
>
> I'm happy to propose a day long interim meeting at CMU in Pittsburgh in
> September, before the cut off dates to submit for the IETF meeting in
> Nov.
>
> Opinions?  Date suggestions?  Alternatives?
>
> garth
>
>
>
>
>
> Yahoo! Groups Links
>
>
>
>



------------------------ Yahoo! Groups Sponsor --------------------~-->
Yahoo! Domains - Claim yours for only $14.70
http://us.click.yahoo.com/Z1wmxD/DREIAA/yQLSAA/W6uqlB/TM
--------------------------------------------------------------------~->


Yahoo! Groups Links

<*> To visit your group on the web, go to:
   http://groups.yahoo.com/group/pnfs-reqs/

<*> To unsubscribe from this group, send an email to:
   pnfs-reqs-unsubscribe@yahoogroups.com

<*> Your use of Yahoo! Groups is subject to:
   http://docs.yahoo.com/info/terms/

From ggrider@lanl.gov Sat Jul 17 18:34:14 2004
Return-Path: <ggrider@lanl.gov>
X-Sender: ggrider@lanl.gov
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 42452 invoked from network); 18 Jul 2004 01:34:13 -0000
Received: from unknown (66.218.66.216)
by m4.grp.scd.yahoo.com with QMQP; 18 Jul 2004 01:34:13 -0000
Received: from unknown (HELO mailwasher-b.lanl.gov) (192.16.0.25)
by mta1.grp.scd.yahoo.com with SMTP; 18 Jul 2004 01:34:13 -0000
Received: from mailrelay2.lanl.gov (localhost.localdomain [127.0.0.1])
by mailwasher-b.lanl.gov (8.12.10/8.12.10/(ccn-5)) with ESMTP id i6I1YCCp029568
for <pnfs-reqs@yahoogroups.com>; Sat, 17 Jul 2004 19:34:12 -0600
Received: from cic-mail.lanl.gov (localhost.localdomain [127.0.0.1])
by mailrelay2.lanl.gov (8.12.10/8.12.10/(ccn-5)) with ESMTP id i6I1YCVU000942
for <pnfs-reqs@yahoogroups.com>; Sat, 17 Jul 2004 19:34:12 -0600
Received: from cthulu.lanl.gov (vpn-client-148.lanl.gov [128.165.253.148])
by cic-mail.lanl.gov (8.12.11/8.12.11/(ccn-5)) with ESMTP id i6I1YAsi004686
for <pnfs-reqs@yahoogroups.com>; Sat, 17 Jul 2004 19:34:10 -0600
Message-Id: <5.2.0.9.2.20040717193400.017d8c80@cic-mail.lanl.gov>
X-Sender: ggrider@cic-mail.lanl.gov
X-Mailer: QUALCOMM Windows Eudora Version 5.2.0.9
Date: Sat, 17 Jul 2004 19:34:21 -0600
To: pnfs-reqs@yahoogroups.com
In-Reply-To: <6.1.2.0.2.20040708163437.01db5508@silver.nane.netapp.com>
Mime-Version: 1.0
Content-Type: multipart/related;
type="multipart/alternative";
boundary="=====================_12255131==.REL"
X-Scanned-By: MIMEDefang 2.35
X-eGroups-Remote-IP: 192.16.0.25
From: Gary Grider <ggrider@lanl.gov>
Subject: Re: [pnfs-reqs] Re: planning future meetings
X-Yahoo-Group-Post: member; u=169341185
X-Yahoo-Profile: ggriderpnfs

ADVERTISEMENT
I plan to come to the pNFS meeting at this point.

Thanks
Gary

At 01:35 PM 7/8/2004 -0700, you wrote:

> At 12:13 PM 7/8/2004, Garth Gibson wrote:
> >Would folks please speak up on this list if they are going to attend
> >this meeting.
>
> I'll be attending IETF from Sunday->Thursday and will attend
> the nonconflicting parts of the pNFS meeting.
>
> Tom.
>
> Yahoo! Groups Sponsor
> ADVERTISEMENT
> bafdf7.jpg
> baff2d.jpg
>
> Yahoo! Groups Links
>
>     * To visit your group on the web, go to:
>     * http://groups.yahoo.com/group/pnfs-reqs/
>     *  
>     * To unsubscribe from this group, send an email to:
>     * pnfs-reqs-unsubscribe@yahoogroups.com
>     *  
>     * Your use of Yahoo! Groups is subject to the Yahoo! Terms of Service. 

From garth@panasas.com Mon Jul 19 06:13:16 2004
Return-Path: <garth@panasas.com>
X-Sender: garth@panasas.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 10786 invoked from network); 19 Jul 2004 13:13:14 -0000
Received: from unknown (66.218.66.167)
by m3.grp.scd.yahoo.com with QMQP; 19 Jul 2004 13:13:14 -0000
Received: from unknown (HELO PIKES.panasas.com) (65.194.124.178)
by mta6.grp.scd.yahoo.com with SMTP; 19 Jul 2004 13:13:14 -0000
Received: from [127.0.0.1] ([172.17.19.3]) by PIKES.panasas.com with SMTP (Microsoft Exchange Internet Mail Service Version 5.5.2653.13)
id 2B564K6X; Mon, 19 Jul 2004 09:13:11 -0400
Mime-Version: 1.0 (Apple Message framework v618)
Content-Transfer-Encoding: 7bit
Message-Id: <61271004-D985-11D8-BCEB-000A95A94F04@panasas.com>
Content-Type: text/plain; charset=US-ASCII; format=flowed
To: pnfs-reqs@yahoogroups.com
Date: Mon, 19 Jul 2004 09:13:09 -0400
X-Mailer: Apple Mail (2.618)
X-eGroups-Remote-IP: 65.194.124.178
From: Garth Gibson <garth@panasas.com>
Subject: Aug 3 f2f meeting reminder
X-Yahoo-Group-Post: member; u=169457820
X-Yahoo-Profile: garth_a_gibson

ADVERTISEMENT
Begin forwarded message:
> From: Garth Gibson <garth@panasas.com>
> Date: July 8, 2004 12:13:40 PM EDT
> To: pnfs-reqs@yahoogroups.com
> Subject: [pnfs-reqs] Re: planning future meetings
> Reply-To: pnfs-reqs@yahoogroups.com
>
> Here is what I think is on the table for comments/interest from this
> community:
>
> Tues Aug 3, IETF 60, San Diego:
> - IETF NFSv4 meeting is in the morning
> - in the afternoon, and evening if appropriate, interested and
> available pNFS participants gather in the hotel lobby bar. the goal is
> to get 4-6 hours of meeting time to justify those of us traveling to
> San Diego that will not otherwise attend the IETF meeting. we might try
> to arrange a room in a local chinese restaurant, for example, but we do
> not expect any meeting rooms will be available at the IETF hotel and we
> do not want to create an officially conflicting meeting.
> - IETF schedule is not final and may change as late as 2 weeks before
> the meeting, so people's Tues afternoon may change. this is in part
> why we are suggesting that the meeting may go into the evening.
> staying the night is not a bad thing for most of us as the RDDP meeting
> is likely to be first thing on Wed morning anyway.
>
> Would folks please speak up on this list if they are going to attend
> this meeting.
>
> Thurs Sept 30, Pennsylvania:
> - an informal face-to-face in the Pittsburgh area, at either CMU or
> Nemacolin (for those who collaborate with or support CMU's PDL, this is
> the day after the PDL retreat)
> - this might be the last informal pNFS meeting done outside the NFSv4
> TWG schedule as we hope to soon convince the NFSv4 group to take us in
>
> week of Nov 8, IETF 61, Washington DC
> - similar to the Aug 3 meeting, a meeting in the bar or in a restaurant
> during the IETF meeting week
>
> garth
>
>
> On Jul 1, 2004, at 1:25 PM, Garth Gibson wrote:
>
>> Folks,
>>
>> We've gotten behind on meeting plans.
>>
>> The next IETF NFSv4 meeting is the morning of Tues Aug 3 in San Diego.
>> See the IETF agenda at http://www.ietf.org/meetings/agenda_60.txt
>>
>> Hopefully pNFS will get some time on that agenda, but it will be a
>> small amount at most. What I'd like to propose is that we meet f-2-f
>> on Aug 2 or 3 (my preference is the afternoon on Aug 3) in San Diego.
>> I doubt we'll get a room in the IETF hotel (Sheraton San Diego Hotel
>> and Marina), so I'm also looking for proposals for where to hold the
>> meeting.
>>
>> Since timing is tight on this, please reply soon.
>>
>> And while you are thinking about your schedule, lets talk about the
>> meeting after. The next IETF is Nov 7-12 in Washington DC (which
>> overlaps SC04 in Pittsburgh). Again we should target a piece of the
>> not-very-long NFSv4 meeting and decide if we want to also have a
>> f-2-f.
>>
>> I'm happy to propose a day long interim meeting at CMU in Pittsburgh
>> in
>> September, before the cut off dates to submit for the IETF meeting in
>> Nov.
>>
>> Opinions? Date suggestions? Alternatives?
>>
>> garth


From julian_satran@il.ibm.com Mon Jul 19 07:30:40 2004
Return-Path: <Julian_Satran@il.ibm.com>
X-Sender: Julian_Satran@il.ibm.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 69155 invoked from network); 19 Jul 2004 14:30:39 -0000
Received: from unknown (66.218.66.167)
by m20.grp.scd.yahoo.com with QMQP; 19 Jul 2004 14:30:39 -0000
Received: from unknown (HELO mtagate1.de.ibm.com) (195.212.29.150)
by mta6.grp.scd.yahoo.com with SMTP; 19 Jul 2004 14:30:38 -0000
Received: from d12nrmr1507.megacenter.de.ibm.com (d12nrmr1507.megacenter.de.ibm.com [9.149.167.1])
by mtagate1.de.ibm.com (8.12.10/8.12.10) with ESMTP id i6JEUXGP100072
for <pnfs-reqs@yahoogroups.com>; Mon, 19 Jul 2004 14:30:33 GMT
Received: from d12ml102.megacenter.de.ibm.com (d12av02.megacenter.de.ibm.com [9.149.165.228])
by d12nrmr1507.megacenter.de.ibm.com (8.12.10/NCO/VER6.6) with ESMTP id i6JEUWxi227234
for <pnfs-reqs@yahoogroups.com>; Mon, 19 Jul 2004 16:30:33 +0200
In-Reply-To: <61271004-D985-11D8-BCEB-000A95A94F04@panasas.com>
To: pnfs-reqs@yahoogroups.com
MIME-Version: 1.0
X-Mailer: Lotus Notes Release 6.5.1 January 21, 2004
Message-ID: <OFC78C96A0.C28DE96D-ONC2256ED6.004C5B80-C2256ED6.004FB258@il.ibm.com>
Date: Mon, 19 Jul 2004 17:30:31 +0300
X-MIMETrack: Serialize by Router on D12ML102/12/M/IBM(Release 6.5.1| March 5, 2004) at
19/07/2004 17:30:33,
Serialize complete at 19/07/2004 17:30:33
Content-Type: multipart/alternative; boundary="=_alternative 004C95C1C2256ED6_="
X-eGroups-Remote-IP: 195.212.29.150
From: Julian Satran <julian_satran@il.ibm.com>
Subject: Re: [pnfs-reqs] Aug 3 f2f meeting reminder
X-Yahoo-Group-Post: member; u=64714603
X-Yahoo-Profile: julian_satran

September 30 is out for me (most probably) but I will be available during the Nov 8 week.

Julo


Garth Gibson <garth@panasas.com>

19/07/04 16:13
Please respond to
pnfs-reqs

	
To
	pnfs-reqs@yahoogroups.com
cc
	
Subject
	[pnfs-reqs] Aug 3 f2f meeting reminder

	




Begin forwarded message:
> From: Garth Gibson <garth@panasas.com>
> Date: July 8, 2004 12:13:40 PM EDT
> To: pnfs-reqs@yahoogroups.com
> Subject: [pnfs-reqs] Re: planning future meetings
> Reply-To: pnfs-reqs@yahoogroups.com
>
> Here is what I think is on the table for comments/interest from this
> community:
>
> Tues Aug 3, IETF 60, San Diego:
> - IETF NFSv4 meeting is in the morning
> - in the afternoon, and evening if appropriate, interested and
> available pNFS participants gather in the hotel lobby bar.  the goal is
> to get 4-6 hours of meeting time to justify those of us traveling to
> San Diego that will not otherwise attend the IETF meeting. we might try
> to arrange a room in a local chinese restaurant, for example, but we do
> not expect any meeting rooms will be available at the IETF hotel and we
> do not want to create an officially conflicting meeting.
> - IETF schedule is not final and may change as late as 2 weeks before
> the meeting, so people's Tues afternoon may change.  this is in part
> why we are suggesting that the meeting may go into the evening.
> staying the night is not a bad thing for most of us as the RDDP meeting
> is likely to be first thing on Wed morning anyway.
>
> Would folks please speak up on this list if they are going to attend
> this meeting.
>
> Thurs Sept 30, Pennsylvania:
> - an informal face-to-face in the Pittsburgh area, at either CMU or
> Nemacolin (for those who collaborate with or support CMU's PDL, this is
> the day after the PDL retreat)
> - this might be the last informal pNFS meeting done outside the NFSv4
> TWG schedule as we hope to soon convince the NFSv4 group to take us in
>
> week of Nov 8, IETF 61, Washington DC
> - similar to the Aug 3 meeting, a meeting in the bar or in a restaurant
> during the IETF meeting week
>
> garth
>
>
> On Jul 1, 2004, at 1:25 PM, Garth Gibson wrote:
>
>> Folks,
>>
>> We've gotten behind on meeting plans.
>>
>> The next IETF NFSv4 meeting is the morning of Tues Aug 3 in San Diego.
>> See the IETF agenda at http://www.ietf.org/meetings/agenda_60.txt
>>
>> Hopefully pNFS will get some time on that agenda, but it will be a
>> small amount at most.  What I'd like to propose is that we meet f-2-f
>> on Aug 2 or 3 (my preference is the afternoon on Aug 3) in San Diego.
>> I doubt we'll get a room in the IETF hotel (Sheraton San Diego Hotel
>> and Marina), so I'm also looking for proposals for where to hold the
>> meeting.
>>
>> Since timing is tight on this, please reply soon.
>>
>> And while you are thinking about your schedule, lets talk about the
>> meeting after.  The next IETF is Nov 7-12 in Washington DC (which
>> overlaps SC04 in Pittsburgh).  Again we should target a piece of the
>> not-very-long NFSv4 meeting and decide if we want to also have a
>> f-2-f.
>>
>> I'm happy to propose a day long interim meeting at CMU in Pittsburgh
>> in
>> September, before the cut off dates to submit for the IETF meeting in
>> Nov.
>>
>> Opinions?  Date suggestions?  Alternatives?
>>
>> garth



------------------------ Yahoo! Groups Sponsor --------------------~-->
Yahoo! Domains - Claim yours for only $14.70
http://us.click.yahoo.com/Z1wmxD/DREIAA/yQLSAA/W6uqlB/TM
--------------------------------------------------------------------~->


Yahoo! Groups Links

<*> To visit your group on the web, go to:
   http://groups.yahoo.com/group/pnfs-reqs/

<*> To unsubscribe from this group, send an email to:
   pnfs-reqs-unsubscribe@yahoogroups.com

<*> Your use of Yahoo! Groups is subject to:
   http://docs.yahoo.com/info/terms/

From bwelch@panasas.com Wed Jul 21 23:21:49 2004
Return-Path: <welch@panasas.com>
X-Sender: welch@panasas.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 84471 invoked from network); 22 Jul 2004 06:21:48 -0000
Received: from unknown (66.218.66.166)
by m22.grp.scd.yahoo.com with QMQP; 22 Jul 2004 06:21:48 -0000
Received: from unknown (HELO medlicott.panasas.com) (63.80.58.202)
by mta5.grp.scd.yahoo.com with SMTP; 22 Jul 2004 06:21:48 -0000
Received: from panasas.com (welch@localhost)
by medlicott.panasas.com (8.11.6/8.11.6) with ESMTP id i6M6LlR06070
for <pnfs-reqs@yahoogroups.com>; Wed, 21 Jul 2004 23:21:47 -0700
Message-Id: <200407220621.i6M6LlR06070@medlicott.panasas.com>
X-Authentication-Warning: medlicott.panasas.com: welch owned process doing -bs
To: pnfs-reqs@yahoogroups.com
In-reply-to: <C65A00D9-D0F9-11D8-B0CA-000A95A94F04@panasas.com>
References: <9D5D27D6-CB83-11D8-A956-000A95A94F04@panasas.com> <C65A00D9-D0F9-11D8-B0CA-000A95A94F04@panasas.com>
Comments: In-reply-to Garth Gibson <garth@panasas.com>
message dated "Thu, 08 Jul 2004 12:13:40 -0400."
X-URL: http://www.panasas.com/
X-Face: "HxE|?EnC9fVMV8f70H83&{fgLE.|FZ^$>@Q(yb#N,Eh~N]e&]=>r5~UnRml1:4EglY{9B+
:'wJq$@c_C!l8@<$t,{YUr4K,QJGHSvS~U]H`<+L*x?eGzSk>XH\W:AK\j?@?c1o<k;j'Ei/UL)!*0
ILwSR)J\bc)gjz!rrGQ2#i*f:M:ydhK}jp4dWQW?;0{,#iWrCV$4~%e/3)$1/D
MIME-Version: 1.0
Content-Type: text/plain; charset="us-ascii"
Content-ID: <6068.1090477307.1@panasas.com>
Date: Wed, 21 Jul 2004 23:21:47 -0700
X-eGroups-Remote-IP: 63.80.58.202
X-eGroups-From: Brent Welch <welch@panasas.com>
From: Brent Welch <bwelch@panasas.com>
Subject: Re: [pnfs-reqs] Re: planning future meetings
X-Yahoo-Group-Post: member; u=169551413
X-Yahoo-Profile: brent_welch_1960

I will be there Tuesday morning, probably returning Wednesday AM.

>>>Garth Gibson said:
> Here is what I think is on the table for comments/interest from this
> community:
>
> Tues Aug 3, IETF 60, San Diego:
> - IETF NFSv4 meeting is in the morning
> - in the afternoon, and evening if appropriate, interested and
> available pNFS participants gather in the hotel lobby bar. the goal is
> to get 4-6 hours of meeting time to justify those of us traveling to
> San Diego that will not otherwise attend the IETF meeting. we might try
> to arrange a room in a local chinese restaurant, for example, but we do
> not expect any meeting rooms will be available at the IETF hotel and we
> do not want to create an officially conflicting meeting.
> - IETF schedule is not final and may change as late as 2 weeks before
> the meeting, so people's Tues afternoon may change. this is in part
> why we are suggesting that the meeting may go into the evening.
> staying the night is not a bad thing for most of us as the RDDP meeting
> is likely to be first thing on Wed morning anyway.
>
> Would folks please speak up on this list if they are going to attend
> this meeting.
>
> Thurs Sept 30, Pennsylvania:
> - an informal face-to-face in the Pittsburgh area, at either CMU or
> Nemacolin (for those who collaborate with or support CMU's PDL, this is
> the day after the PDL retreat)
> - this might be the last informal pNFS meeting done outside the NFSv4
> TWG schedule as we hope to soon convince the NFSv4 group to take us in
>
> week of Nov 8, IETF 61, Washington DC
> - similar to the Aug 3 meeting, a meeting in the bar or in a restaurant
> during the IETF meeting week
>
> garth
>
>
> On Jul 1, 2004, at 1:25 PM, Garth Gibson wrote:
>
> > Folks,
> >
> > We've gotten behind on meeting plans.
> >
> > The next IETF NFSv4 meeting is the morning of Tues Aug 3 in San Diego.
> > See the IETF agenda at http://www.ietf.org/meetings/agenda_60.txt
> >
> > Hopefully pNFS will get some time on that agenda, but it will be a
> > small amount at most. What I'd like to propose is that we meet f-2-f
> > on Aug 2 or 3 (my preference is the afternoon on Aug 3) in San Diego.
> > I doubt we'll get a room in the IETF hotel (Sheraton San Diego Hotel
> > and Marina), so I'm also looking for proposals for where to hold the
> > meeting.
> >
> > Since timing is tight on this, please reply soon.
> >
> > And while you are thinking about your schedule, lets talk about the
> > meeting after. The next IETF is Nov 7-12 in Washington DC (which
> > overlaps SC04 in Pittsburgh). Again we should target a piece of the
> > not-very-long NFSv4 meeting and decide if we want to also have a f-2-f.
> >
> > I'm happy to propose a day long interim meeting at CMU in Pittsburgh in
> > September, before the cut off dates to submit for the IETF meeting in
> > Nov.
> >
> > Opinions? Date suggestions? Alternatives?
> >
> > garth
> >
> >
> >
> >
> >
> > Yahoo! Groups Links
> >
> >
> >
> >
>
>
>
>
>
> Yahoo! Groups Links
>
>
>
>
>

--
Brent Welch
Software Architect, Panasas Inc
Delivering the premier storage system for scalable Linux clusters

www.panasas.com
welch@panasas.com

From mclarty3@llnl.gov Thu Jul 22 08:16:08 2004
Return-Path: <mclarty3@llnl.gov>
X-Sender: mclarty3@llnl.gov
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 98258 invoked from network); 22 Jul 2004 15:16:07 -0000
Received: from unknown (66.218.66.217)
by m21.grp.scd.yahoo.com with QMQP; 22 Jul 2004 15:16:07 -0000
Received: from unknown (HELO smtp-4.llnl.gov) (128.115.41.84)
by mta2.grp.scd.yahoo.com with SMTP; 22 Jul 2004 15:16:06 -0000
Received: from mailfe-1.llnl.gov (localhost [127.0.0.1])
by smtp-4.llnl.gov (8.12.3p2-20030917/8.12.3/LLNL evision: 1.15 $) with ESMTP id i6MFG3If028453
for <pnfs-reqs@yahoogroups.com>; Thu, 22 Jul 2004 08:16:03 -0700 (PDT)
Received: from POLARBEAR.llnl.gov ([134.9.18.59] verified)
by mailfe-1.llnl.gov (CommuniGate Pro SMTP 4.1.8)
with ESMTP id 4115786 for pnfs-reqs@yahoogroups.com; Thu, 22 Jul 2004 08:16:03 -0700
Message-Id: <5.0.0.25.2.20040722081450.026a3830@poptop.llnl.gov>
X-Sender: e002801@poptop.llnl.gov
X-Mailer: QUALCOMM Windows Eudora Version 5.0
Date: Thu, 22 Jul 2004 08:16:03 -0700
To: pnfs-reqs@yahoogroups.com
In-Reply-To: <C65A00D9-D0F9-11D8-B0CA-000A95A94F04@panasas.com>
References: <9D5D27D6-CB83-11D8-A956-000A95A94F04@panasas.com>
<9D5D27D6-CB83-11D8-A956-000A95A94F04@panasas.com>
Mime-Version: 1.0
Content-Type: text/plain; charset="us-ascii"; format=flowed
X-eGroups-Remote-IP: 128.115.41.84
From: Tyce McLarty <mclarty3@llnl.gov>
Subject: Re: [pnfs-reqs] Re: planning future meetings
X-Yahoo-Group-Post: member; u=169320772
X-Yahoo-Profile: mclarty3

Bill Loewe & I will be there Tuesday for the pNFS meetings departing Wed. AM.

Tyce

At 09:13 AM 7/8/2004, you wrote:
>Here is what I think is on the table for comments/interest from this
>community:
>
>Tues Aug 3, IETF 60, San Diego:
>- IETF NFSv4 meeting is in the morning
>- in the afternoon, and evening if appropriate, interested and
>available pNFS participants gather in the hotel lobby bar. the goal is
>to get 4-6 hours of meeting time to justify those of us traveling to
>San Diego that will not otherwise attend the IETF meeting. we might try
>to arrange a room in a local chinese restaurant, for example, but we do
>not expect any meeting rooms will be available at the IETF hotel and we
>do not want to create an officially conflicting meeting.
>- IETF schedule is not final and may change as late as 2 weeks before
>the meeting, so people's Tues afternoon may change. this is in part
>why we are suggesting that the meeting may go into the evening.
>staying the night is not a bad thing for most of us as the RDDP meeting
>is likely to be first thing on Wed morning anyway.
>
>Would folks please speak up on this list if they are going to attend
>this meeting.
>
>Thurs Sept 30, Pennsylvania:
>- an informal face-to-face in the Pittsburgh area, at either CMU or
>Nemacolin (for those who collaborate with or support CMU's PDL, this is
>the day after the PDL retreat)
>- this might be the last informal pNFS meeting done outside the NFSv4
>TWG schedule as we hope to soon convince the NFSv4 group to take us in
>
>week of Nov 8, IETF 61, Washington DC
>- similar to the Aug 3 meeting, a meeting in the bar or in a restaurant
>during the IETF meeting week
>
>garth
>
>
>On Jul 1, 2004, at 1:25 PM, Garth Gibson wrote:
>
> > Folks,
> >
> > We've gotten behind on meeting plans.
> >
> > The next IETF NFSv4 meeting is the morning of Tues Aug 3 in San Diego.
> > See the IETF agenda at http://www.ietf.org/meetings/agenda_60.txt
> >
> > Hopefully pNFS will get some time on that agenda, but it will be a
> > small amount at most. What I'd like to propose is that we meet f-2-f
> > on Aug 2 or 3 (my preference is the afternoon on Aug 3) in San Diego.
> > I doubt we'll get a room in the IETF hotel (Sheraton San Diego Hotel
> > and Marina), so I'm also looking for proposals for where to hold the
> > meeting.
> >
> > Since timing is tight on this, please reply soon.
> >
> > And while you are thinking about your schedule, lets talk about the
> > meeting after. The next IETF is Nov 7-12 in Washington DC (which
> > overlaps SC04 in Pittsburgh). Again we should target a piece of the
> > not-very-long NFSv4 meeting and decide if we want to also have a f-2-f.
> >
> > I'm happy to propose a day long interim meeting at CMU in Pittsburgh in
> > September, before the cut off dates to submit for the IETF meeting in
> > Nov.
> >
> > Opinions? Date suggestions? Alternatives?
> >
> > garth
> >
> >
> >
> >
> >
> > Yahoo! Groups Links
> >
> >
> >
> >
>
>
>
>
>
>Yahoo! Groups Links
>
>
>
>


From pcorbett@netapp.com Thu Jul 22 08:35:17 2004
Return-Path: <Peter.Corbett@netapp.com>
X-Sender: Peter.Corbett@netapp.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 68053 invoked from network); 22 Jul 2004 15:35:16 -0000
Received: from unknown (66.218.66.167)
by m23.grp.scd.yahoo.com with QMQP; 22 Jul 2004 15:35:16 -0000
Received: from unknown (HELO mx01.netapp.com) (198.95.226.53)
by mta6.grp.scd.yahoo.com with SMTP; 22 Jul 2004 15:35:16 -0000
Received: from frejya.corp.netapp.com (frejya [10.57.157.119])
by mx01.netapp.com (8.12.10/8.12.10/NTAP-1.4) with ESMTP id i6MFZEFC028941
for <pnfs-reqs@yahoogroups.com>; Thu, 22 Jul 2004 08:35:14 -0700 (PDT)
Received: from svlexc01.hq.netapp.com (svlexc01.corp.netapp.com [10.57.156.135])
by frejya.corp.netapp.com (8.12.9/8.12.9/NTAP-1.5) with ESMTP id i6MFZE53026866
for <pnfs-reqs@yahoogroups.com>; Thu, 22 Jul 2004 08:35:14 -0700 (PDT)
content-class: urn:content-classes:message
MIME-Version: 1.0
Content-Type: text/plain;
charset="us-ascii"
Content-Transfer-Encoding: quoted-printable
X-MimeOLE: Produced By Microsoft Exchange V6.0.6249.0
Date: Thu, 22 Jul 2004 08:35:11 -0700
Message-ID: <C8CF60CFC4D8A74E9945E32CF096548A0201636C@silver.nane.netapp.com>
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
Thread-Topic: [pnfs-reqs] Re: planning future meetings
Thread-Index: AcRwAS1aDSl3xAtOTG2PRf8jsEXfvQAAD6dA
To: <pnfs-reqs@yahoogroups.com>
X-eGroups-Remote-IP: 198.95.226.53
X-eGroups-From: "Corbett, Peter" <Peter.Corbett@netapp.com>
From: "Corbett, Peter" <pcorbett@netapp.com>
Subject: RE: [pnfs-reqs] Re: planning future meetings
X-Yahoo-Group-Post: member; u=44152959
X-Yahoo-Profile: pfcorbett2004

ADVERTISEMENT
I'll be there all day Tuesday, arriving Monday afternoon.

-----Original Message-----
From: Tyce McLarty [mailto:mclarty3@llnl.gov]
Sent: Thursday, July 22, 2004 11:16 AM
To: pnfs-reqs@yahoogroups.com
Subject: Re: [pnfs-reqs] Re: planning future meetings

Bill Loewe & I will be there Tuesday for the pNFS meetings departing
Wed. AM.

Tyce

At 09:13 AM 7/8/2004, you wrote:
>Here is what I think is on the table for comments/interest from this
>community:
>
>Tues Aug 3, IETF 60, San Diego:
>- IETF NFSv4 meeting is in the morning
>- in the afternoon, and evening if appropriate, interested and
>available pNFS participants gather in the hotel lobby bar. the goal is
>to get 4-6 hours of meeting time to justify those of us traveling to
>San Diego that will not otherwise attend the IETF meeting. we might try
>to arrange a room in a local chinese restaurant, for example, but we do
>not expect any meeting rooms will be available at the IETF hotel and we
>do not want to create an officially conflicting meeting.
>- IETF schedule is not final and may change as late as 2 weeks before
>the meeting, so people's Tues afternoon may change. this is in part
>why we are suggesting that the meeting may go into the evening.
>staying the night is not a bad thing for most of us as the RDDP meeting
>is likely to be first thing on Wed morning anyway.
>
>Would folks please speak up on this list if they are going to attend
>this meeting.
>
>Thurs Sept 30, Pennsylvania:
>- an informal face-to-face in the Pittsburgh area, at either CMU or
>Nemacolin (for those who collaborate with or support CMU's PDL, this is
>the day after the PDL retreat)
>- this might be the last informal pNFS meeting done outside the NFSv4
>TWG schedule as we hope to soon convince the NFSv4 group to take us in
>
>week of Nov 8, IETF 61, Washington DC
>- similar to the Aug 3 meeting, a meeting in the bar or in a restaurant
>during the IETF meeting week
>
>garth
>
>
>On Jul 1, 2004, at 1:25 PM, Garth Gibson wrote:
>
> > Folks,
> >
> > We've gotten behind on meeting plans.
> >
> > The next IETF NFSv4 meeting is the morning of Tues Aug 3 in San
Diego.
> > See the IETF agenda at http://www.ietf.org/meetings/agenda_60.txt
> >
> > Hopefully pNFS will get some time on that agenda, but it will be a
> > small amount at most. What I'd like to propose is that we meet
f-2-f
> > on Aug 2 or 3 (my preference is the afternoon on Aug 3) in San
Diego.
> > I doubt we'll get a room in the IETF hotel (Sheraton San Diego Hotel
> > and Marina), so I'm also looking for proposals for where to hold the
> > meeting.
> >
> > Since timing is tight on this, please reply soon.
> >
> > And while you are thinking about your schedule, lets talk about the
> > meeting after. The next IETF is Nov 7-12 in Washington DC (which
> > overlaps SC04 in Pittsburgh). Again we should target a piece of the
> > not-very-long NFSv4 meeting and decide if we want to also have a
f-2-f.
> >
> > I'm happy to propose a day long interim meeting at CMU in Pittsburgh
in
> > September, before the cut off dates to submit for the IETF meeting
in
> > Nov.
> >
> > Opinions? Date suggestions? Alternatives?
> >
> > garth
> >
> >
> >
> >
> >
> > Yahoo! Groups Links
> >
> >
> >
> >
>
>
>
>
>
>Yahoo! Groups Links
>
>
>
>





Yahoo! Groups Links


From fan@rainfinity.com Thu Jul 22 11:22:22 2004
Return-Path: <fan@rainfinity.com>
X-Sender: fan@rainfinity.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 19242 invoked from network); 22 Jul 2004 18:22:20 -0000
Received: from unknown (66.218.66.167)
by m19.grp.scd.yahoo.com with QMQP; 22 Jul 2004 18:22:20 -0000
Received: from unknown (HELO mail1.rainfinity.com) (128.242.125.75)
by mta6.grp.scd.yahoo.com with SMTP; 22 Jul 2004 18:22:20 -0000
Received: from localhost (localhost.rainfinity.com [127.0.0.1])
by mail1.rainfinity.com (Postfix) with ESMTP
id 8911F1040DE; Thu, 22 Jul 2004 11:19:11 -0700 (PDT)
Received: from mail1.rainfinity.com ([127.0.0.1])
by localhost (mail1.rainfinity.com [127.0.0.1]) (amavisd-new, port 10024)
with ESMTP id 10781-11; Thu, 22 Jul 2004 11:19:10 -0700 (PDT)
Received: from [192.168.0.47] (hq.rainfinity.com [128.242.125.65])
by mail1.rainfinity.com (Postfix) with ESMTP
id 85F7D1040A1; Thu, 22 Jul 2004 11:19:10 -0700 (PDT)
Message-ID: <4100051C.30906@rainfinity.com>
Date: Thu, 22 Jul 2004 11:19:08 -0700
User-Agent: Mozilla Thunderbird 0.6 (Windows/20040502)
X-Accept-Language: en-us, en
MIME-Version: 1.0
To: pnfs-reqs@yahoogroups.com
References: <C8CF60CFC4D8A74E9945E32CF096548A0201636C@silver.nane.netapp.com>
In-Reply-To: <C8CF60CFC4D8A74E9945E32CF096548A0201636C@silver.nane.netapp.com>
Content-Type: multipart/alternative;
boundary="------------060808000902090804050909"
X-Virus-Scanned: by amavisd-new at rainfinity.com
X-eGroups-Remote-IP: 128.242.125.75
From: Chenggong Charles Fan <fan@rainfinity.com>
Subject: Re: [pnfs-reqs] Re: planning future meetings
X-Yahoo-Group-Post: member; u=37364696
X-Yahoo-Profile: fanrainfinity

ADVERTISEMENT
I'll join the Aug 3 meeting as well.

Charles

Corbett, Peter wrote:

>I'll be there all day Tuesday, arriving Monday afternoon.
>-----Original Message-----
>From: Tyce McLarty [mailto:mclarty3@llnl.gov] Sent: Thursday, July 22, 2004 11:16 AM
>To: pnfs-reqs@yahoogroups.com
>Subject: Re: [pnfs-reqs] Re: planning future meetings
>Bill Loewe & I will be there Tuesday for the pNFS meetings departing
>Wed. AM.
>Tyce
>At 09:13 AM 7/8/2004, you wrote:
>
>>Here is what I think is on the table for comments/interest from this
>>community:
>>Tues Aug 3, IETF 60, San Diego:
>>- IETF NFSv4 meeting is in the morning
>>- in the afternoon, and evening if appropriate, interested and
>>available pNFS participants gather in the hotel lobby bar. the goal is
>>to get 4-6 hours of meeting time to justify those of us traveling to
>>San Diego that will not otherwise attend the IETF meeting. we might try
>>to arrange a room in a local chinese restaurant, for example, but we do
>>not expect any meeting rooms will be available at the IETF hotel and we
>>do not want to create an officially conflicting meeting.
>>- IETF schedule is not final and may change as late as 2 weeks before
>>the meeting, so people's Tues afternoon may change. this is in part
>>why we are suggesting that the meeting may go into the evening.
>>staying the night is not a bad thing for most of us as the RDDP meeting
>>is likely to be first thing on Wed morning anyway.
>>Would folks please speak up on this list if they are going to attend
>>this meeting.
>>Thurs Sept 30, Pennsylvania:
>>- an informal face-to-face in the Pittsburgh area, at either CMU or
>>Nemacolin (for those who collaborate with or support CMU's PDL, this is
>>the day after the PDL retreat)
>>- this might be the last informal pNFS meeting done outside the NFSv4
>>TWG schedule as we hope to soon convince the NFSv4 group to take us in
>>week of Nov 8, IETF 61, Washington DC
>>- similar to the Aug 3 meeting, a meeting in the bar or in a restaurant
>>during the IETF meeting week
>>garth
>>On Jul 1, 2004, at 1:25 PM, Garth Gibson wrote:
>>
>>>Folks,
>>>We've gotten behind on meeting plans.
>>>The next IETF NFSv4 meeting is the morning of Tues Aug 3 in San
>>>
>Diego.
>
>>>See the IETF agenda at http://www.ietf.org/meetings/agenda_60.txt
>>>Hopefully pNFS will get some time on that agenda, but it will be a
>>>small amount at most. What I'd like to propose is that we meet
>>>
>f-2-f
>
>>>on Aug 2 or 3 (my preference is the afternoon on Aug 3) in San
>>>
>Diego.
>
>>>I doubt we'll get a room in the IETF hotel (Sheraton San Diego Hotel
>>>and Marina), so I'm also looking for proposals for where to hold the
>>>meeting.
>>>Since timing is tight on this, please reply soon.
>>>And while you are thinking about your schedule, lets talk about the
>>>meeting after. The next IETF is Nov 7-12 in Washington DC (which
>>>overlaps SC04 in Pittsburgh). Again we should target a piece of the
>>>not-very-long NFSv4 meeting and decide if we want to also have a
>>>
>f-2-f.
>
>>>I'm happy to propose a day long interim meeting at CMU in Pittsburgh
>>>
>in
>
>>>September, before the cut off dates to submit for the IETF meeting
>>>
>in
>
>>>Nov.
>>>Opinions? Date suggestions? Alternatives?
>>>garth
>>>Yahoo! Groups Links
>>>
>>Yahoo! Groups Links
>>
>
>Yahoo! Groups Links
>------------------------ Yahoo! Groups Sponsor --------------------~--> Make a clean sweep of pop-up ads. Yahoo! Companion Toolbar.
>Now with Pop-Up Blocker. Get it for free!
>http://us.click.yahoo.com/L5YrjA/eSIIAA/yQLSAA/W6uqlB/TM
>--------------------------------------------------------------------~-> Yahoo! Groups Links
><*> To visit your group on the web, go to:
>http://groups.yahoo.com/group/pnfs-reqs/
><*> To unsubscribe from this group, send an email to:
>pnfs-reqs-unsubscribe@yahoogroups.com
><*> Your use of Yahoo! Groups is subject to:
>http://docs.yahoo.com/info/terms/
>


From bwelch@panasas.com Thu Jul 29 19:22:57 2004
Return-Path: <welch@panasas.com>
X-Sender: welch@panasas.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 26820 invoked from network); 30 Jul 2004 02:22:56 -0000
Received: from unknown (66.218.66.166)
by m22.grp.scd.yahoo.com with QMQP; 30 Jul 2004 02:22:56 -0000
Received: from unknown (HELO medlicott.panasas.com) (63.80.58.202)
by mta5.grp.scd.yahoo.com with SMTP; 30 Jul 2004 02:22:55 -0000
Received: from panasas.com (welch@localhost)
by medlicott.panasas.com (8.11.6/8.11.6) with ESMTP id i6U2Msr27922
for <pnfs-reqs@yahoogroups.com>; Thu, 29 Jul 2004 19:22:54 -0700
Message-Id: <200407300222.i6U2Msr27922@medlicott.panasas.com>
X-Authentication-Warning: medlicott.panasas.com: welch owned process doing -bs
X-Mailer: exmh version 2.7.1 07/28/2004 with nmh-1.0.4
To: pnfs-reqs@yahoogroups.com
X-URL: http://www.panasas.com/
X-Face: "HxE|?EnC9fVMV8f70H83&{fgLE.|FZ^$>@Q(yb#N,Eh~N]e&]=>
r5~UnRml1:4EglY{9B+
:'wJq$@c_C!l8@<$t,{YUr4K,QJGHSvS~U]H`<+L*x?eGzSk>XH\W:AK\j?@?c1o<k;j'Ei/UL)!*0
ILwSR)J\bc)gjz!rrGQ2#i*f:M:ydhK}jp4dWQW?;0{,#iWrCV$4~%e/3)$1/D
Mime-Version: 1.0
Content-Type: multipart/mixed ;
boundary="==_Exmh_-21130399060"
Date: Thu, 29 Jul 2004 19:22:54 -0700
X-eGroups-Remote-IP: 63.80.58.202
X-eGroups-From: Brent Welch <welch@panasas.com>
From: Brent Welch <bwelch@panasas.com>
Subject: pnfs ops, draft of June 8
X-Yahoo-Group-Post: member; u=169551413
X-Yahoo-Profile: brent_welch_1960

Here is an update to my ops draft based on the June 7 meeting.
I apologize for not sending it out promptly, but here 'tis.
I do not include the discussion of why things look as they do now,
but you will notice a slimmer set of ops. Much slimmer. I could
upload this to the yahoo groups web site, but most of you may find
simple inline text easier to deal with :-)

The ops boil down to:
7.1 LAYOUTGET - Get Layout Information 9
7.2 LAYOUTCOMMIT - Commit writes made using a layout 12
7.3 LAYOUTRETURN - Release Layout Information 14
8.1 CB_LAYOUTRECALL 14

Food for thought next Tuesday - see you then.
If you have feedback before then, I'll have time Monday evening
to go through this.
--
Brent Welch
Software Architect, Panasas Inc
Delivering the premier storage system for scalable Linux clusters

www.panasas.com
welch@panasas.com





Attachment (not stored)
pnfs_6_8.txt
Type: text/plain 

From garth@panasas.com Sun Aug 01 15:41:08 2004
Return-Path: <garth@panasas.com>
X-Sender: garth@panasas.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 23287 invoked from network); 1 Aug 2004 22:41:07 -0000
Received: from unknown (66.218.66.167)
by m16.grp.scd.yahoo.com with QMQP; 1 Aug 2004 22:41:07 -0000
Received: from unknown (HELO PIKES.panasas.com) (65.194.124.178)
by mta6.grp.scd.yahoo.com with SMTP; 1 Aug 2004 22:41:06 -0000
Received: from [127.0.0.1] ([172.17.19.3]) by PIKES.panasas.com with SMTP (Microsoft Exchange Internet Mail Service Version 5.5.2653.13)
id 2B56VJ01; Sun, 1 Aug 2004 18:41:05 -0400
Mime-Version: 1.0 (Apple Message framework v618)
Content-Type: text/plain; charset=US-ASCII; format=flowed
Message-Id: <D9C01BC4-E40B-11D8-89F0-000A95A94F04@panasas.com>
Content-Transfer-Encoding: 7bit
Cc: Spencer Shepler <spencer.shepler@sun.com>,
Brian Pawlowski <Brian.Pawlowski@netapp.com>
Date: Sun, 1 Aug 2004 15:40:55 -0700
To: pnfs-reqs@yahoogroups.com
X-Mailer: Apple Mail (2.618)
X-eGroups-Remote-IP: 65.194.124.178
From: Garth Gibson <garth@panasas.com>
Subject: Aug 3 pNFS f-2-f reminder
X-Yahoo-Group-Post: member; u=169457820
X-Yahoo-Profile: garth_a_gibson

Folks,

I've heard from about 11 people that intend to gather during the
afternoon of Tues Aug 3 to discuss pNFS extensions to NFSv4. The
nominal plan is to "gather in the hotel lobby bar in the afternoon".
Maybe some details are needed? :-)

The NFSv4 IETF meeting is Tues Aug 3 9-11:30 am in the Marina II
meeting room of East Tower (first building on the only road into the
"Island") of the Sheraton Harbor Island
(www.ietf.org/meetings/hotels_60.html, "across the street" from the
airport). Some of us will be in that meeting, but not all, and IETF
registration onsite is over $600, so some pNFS attendees will probably
not be registered.

The hotel lobby bar is Quinn's -- it is behind reception on the right
as you pass reception. It is not a big place, so depending on how many
IETF attendees prefer a bar to a meeting room, we might have a problem.

I thought we should at least have a good starting point, so I made a
reservation for 10 people (oops, I'll have to fix that) at 1pm in the
Harbor's Edge restaurant -- in the lobby of the East Tower behind
reception on the left and a little around the corner. Perhaps we
should start at noon, but I thought I'd leave a little time for the
NFSv4 attendees to "debrief" after the NFSv4 meeting before starting
this one.

Hopefully enough of us can hold off eating lunch until 1pm so that the
restaurant treats us nicely. Once we think we have exhausted our
welcome there, we can drift over to Quinn's and drink ourselves into
technical agreement on pNFS.

Attendees I expect at this time: Tom Talpay, Peter Corbett, Dave
Noveck, David Black, Julian Satran, Lee Ward, Tyce McLarty, Bill Loewe,
Charles Fan, Brent Welch, Garth Gibson.

Please let us know if you're coming and are not on this list.

Thanks!
garth

Begin forwarded message:

> From: Garth Gibson <garth@panasas.com>
> Date: July 8, 2004 9:13:40 AM PDT
> To: pnfs-reqs@yahoogroups.com
> Subject: [pnfs-reqs] Re: planning future meetings
> Reply-To: pnfs-reqs@yahoogroups.com
>
> Here is what I think is on the table for comments/interest from this
> community:
>
> Tues Aug 3, IETF 60, San Diego:
> - IETF NFSv4 meeting is in the morning
> - in the afternoon, and evening if appropriate, interested and
> available pNFS participants gather in the hotel lobby bar. the goal is
> to get 4-6 hours of meeting time to justify those of us traveling to
> San Diego that will not otherwise attend the IETF meeting. we might try
> to arrange a room in a local chinese restaurant, for example, but we do
> not expect any meeting rooms will be available at the IETF hotel and we
> do not want to create an officially conflicting meeting.
> - IETF schedule is not final and may change as late as 2 weeks before
> the meeting, so people's Tues afternoon may change. this is in part
> why we are suggesting that the meeting may go into the evening.
> staying the night is not a bad thing for most of us as the RDDP meeting
> is likely to be first thing on Wed morning anyway.
>
> Would folks please speak up on this list if they are going to attend
> this meeting.
>
> Thurs Sept 30, Pennsylvania:
> - an informal face-to-face in the Pittsburgh area, at either CMU or
> Nemacolin (for those who collaborate with or support CMU's PDL, this is
> the day after the PDL retreat)
> - this might be the last informal pNFS meeting done outside the NFSv4
> TWG schedule as we hope to soon convince the NFSv4 group to take us in
>
> week of Nov 8, IETF 61, Washington DC
> - similar to the Aug 3 meeting, a meeting in the bar or in a restaurant
> during the IETF meeting week
>
> garth
>
>
> On Jul 1, 2004, at 1:25 PM, Garth Gibson wrote:
>
>> Folks,
>>
>> We've gotten behind on meeting plans.
>>
>> The next IETF NFSv4 meeting is the morning of Tues Aug 3 in San Diego.
>> See the IETF agenda at http://www.ietf.org/meetings/agenda_60.txt
>>
>> Hopefully pNFS will get some time on that agenda, but it will be a
>> small amount at most. What I'd like to propose is that we meet f-2-f
>> on Aug 2 or 3 (my preference is the afternoon on Aug 3) in San Diego.
>> I doubt we'll get a room in the IETF hotel (Sheraton San Diego Hotel
>> and Marina), so I'm also looking for proposals for where to hold the
>> meeting.
>>
>> Since timing is tight on this, please reply soon.
>>
>> And while you are thinking about your schedule, lets talk about the
>> meeting after. The next IETF is Nov 7-12 in Washington DC (which
>> overlaps SC04 in Pittsburgh). Again we should target a piece of the
>> not-very-long NFSv4 meeting and decide if we want to also have a
>> f-2-f.
>>
>> I'm happy to propose a day long interim meeting at CMU in Pittsburgh
>> in
>> September, before the cut off dates to submit for the IETF meeting in
>> Nov.
>>
>> Opinions? Date suggestions? Alternatives?
>>
>> garth
>>
>>
>>
>>
>>
>> Yahoo! Groups Links
>>
>>
>>
>>
>
>
>
>
>
> Yahoo! Groups Links
>
>
>
>

From garth@panasas.com Sun Aug 01 16:59:19 2004
Return-Path: <garth@panasas.com>
X-Sender: garth@panasas.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 81161 invoked from network); 1 Aug 2004 23:59:18 -0000
Received: from unknown (66.218.66.167)
by m3.grp.scd.yahoo.com with QMQP; 1 Aug 2004 23:59:18 -0000
Received: from unknown (HELO PIKES.panasas.com) (65.194.124.178)
by mta6.grp.scd.yahoo.com with SMTP; 1 Aug 2004 23:59:18 -0000
Received: from [127.0.0.1] ([172.17.19.3]) by PIKES.panasas.com with SMTP (Microsoft Exchange Internet Mail Service Version 5.5.2653.13)
id 2B56VKAW; Sun, 1 Aug 2004 19:59:16 -0400
Mime-Version: 1.0 (Apple Message framework v618)
In-Reply-To: <D9C01BC4-E40B-11D8-89F0-000A95A94F04@panasas.com>
References: <D9C01BC4-E40B-11D8-89F0-000A95A94F04@panasas.com>
Content-Type: text/plain; charset=US-ASCII; format=flowed
Message-Id: <C926E13C-E416-11D8-89F0-000A95A94F04@panasas.com>
Content-Transfer-Encoding: 7bit
Date: Sun, 1 Aug 2004 16:59:12 -0700
To: pnfs-reqs@yahoogroups.com
X-Mailer: Apple Mail (2.618)
X-eGroups-Remote-IP: 65.194.124.178
From: Garth Gibson <garth@panasas.com>
Subject: Re: [pnfs-reqs] Aug 3 pNFS f-2-f reminder
X-Yahoo-Group-Post: member; u=169457820
X-Yahoo-Profile: garth_a_gibson

I'm thinking I should book a dinner for us too. One recommendation was
"The Boathouse", a one mile walk along the harbor from the East Tower,
'seafood and steak, $$$". I like the idea of a 20 min walk after an
afternoon in the bar. And the hotel shuttle will drive anyone who
wants to skip the walk.

Speak up if you'd like to encourage me to keep it in the one building
(dinner back at the same place as lunch) or in the next tower (5 minute
walk, "mediterranean bistro, $$").

garth

On Aug 1, 2004, at 3:40 PM, Garth Gibson wrote:
> Folks,
>
> I've heard from about 11 people that intend to gather during the
> afternoon of Tues Aug 3 to discuss pNFS extensions to NFSv4. The
> nominal plan is to "gather in the hotel lobby bar in the afternoon".
> Maybe some details are needed? :-)
>
> The NFSv4 IETF meeting is Tues Aug 3 9-11:30 am in the Marina II
> meeting room of East Tower (first building on the only road into the
> "Island") of the Sheraton Harbor Island
> (www.ietf.org/meetings/hotels_60.html, "across the street" from the
> airport). Some of us will be in that meeting, but not all, and IETF
> registration onsite is over $600, so some pNFS attendees will probably
> not be registered.
>
> The hotel lobby bar is Quinn's -- it is behind reception on the right
> as you pass reception. It is not a big place, so depending on how many
> IETF attendees prefer a bar to a meeting room, we might have a problem.
>
> I thought we should at least have a good starting point, so I made a
> reservation for 10 people (oops, I'll have to fix that) at 1pm in the
> Harbor's Edge restaurant -- in the lobby of the East Tower behind
> reception on the left and a little around the corner. Perhaps we
> should start at noon, but I thought I'd leave a little time for the
> NFSv4 attendees to "debrief" after the NFSv4 meeting before starting
> this one.
>
> Hopefully enough of us can hold off eating lunch until 1pm so that the
> restaurant treats us nicely. Once we think we have exhausted our
> welcome there, we can drift over to Quinn's and drink ourselves into
> technical agreement on pNFS.
>
> Attendees I expect at this time: Tom Talpay, Peter Corbett, Dave
> Noveck, David Black, Julian Satran, Lee Ward, Tyce McLarty, Bill Loewe,
> Charles Fan, Brent Welch, Garth Gibson.
>
> Please let us know if you're coming and are not on this list.
>
> Thanks!
> garth
>
> Begin forwarded message:
>
>> From: Garth Gibson <garth@panasas.com>
>> Date: July 8, 2004 9:13:40 AM PDT
>> To: pnfs-reqs@yahoogroups.com
>> Subject: [pnfs-reqs] Re: planning future meetings
>> Reply-To: pnfs-reqs@yahoogroups.com
>>
>> Here is what I think is on the table for comments/interest from this
>> community:
>>
>> Tues Aug 3, IETF 60, San Diego:
>> - IETF NFSv4 meeting is in the morning
>> - in the afternoon, and evening if appropriate, interested and
>> available pNFS participants gather in the hotel lobby bar. the goal
>> is
>> to get 4-6 hours of meeting time to justify those of us traveling to
>> San Diego that will not otherwise attend the IETF meeting. we might
>> try
>> to arrange a room in a local chinese restaurant, for example, but we
>> do
>> not expect any meeting rooms will be available at the IETF hotel and
>> we
>> do not want to create an officially conflicting meeting.
>> - IETF schedule is not final and may change as late as 2 weeks before
>> the meeting, so people's Tues afternoon may change. this is in part
>> why we are suggesting that the meeting may go into the evening.
>> staying the night is not a bad thing for most of us as the RDDP
>> meeting
>> is likely to be first thing on Wed morning anyway.
>>
>> Would folks please speak up on this list if they are going to attend
>> this meeting.
>>
>> Thurs Sept 30, Pennsylvania:
>> - an informal face-to-face in the Pittsburgh area, at either CMU or
>> Nemacolin (for those who collaborate with or support CMU's PDL, this
>> is
>> the day after the PDL retreat)
>> - this might be the last informal pNFS meeting done outside the NFSv4
>> TWG schedule as we hope to soon convince the NFSv4 group to take us in
>>
>> week of Nov 8, IETF 61, Washington DC
>> - similar to the Aug 3 meeting, a meeting in the bar or in a
>> restaurant
>> during the IETF meeting week
>>
>> garth
>>
>>
>> On Jul 1, 2004, at 1:25 PM, Garth Gibson wrote:
>>
>>> Folks,
>>>
>>> We've gotten behind on meeting plans.
>>>
>>> The next IETF NFSv4 meeting is the morning of Tues Aug 3 in San
>>> Diego.
>>> See the IETF agenda at http://www.ietf.org/meetings/agenda_60.txt
>>>
>>> Hopefully pNFS will get some time on that agenda, but it will be a
>>> small amount at most. What I'd like to propose is that we meet f-2-f
>>> on Aug 2 or 3 (my preference is the afternoon on Aug 3) in San Diego.
>>> I doubt we'll get a room in the IETF hotel (Sheraton San Diego Hotel
>>> and Marina), so I'm also looking for proposals for where to hold the
>>> meeting.
>>>
>>> Since timing is tight on this, please reply soon.
>>>
>>> And while you are thinking about your schedule, lets talk about the
>>> meeting after. The next IETF is Nov 7-12 in Washington DC (which
>>> overlaps SC04 in Pittsburgh). Again we should target a piece of the
>>> not-very-long NFSv4 meeting and decide if we want to also have a
>>> f-2-f.
>>>
>>> I'm happy to propose a day long interim meeting at CMU in Pittsburgh
>>> in
>>> September, before the cut off dates to submit for the IETF meeting in
>>> Nov.
>>>
>>> Opinions? Date suggestions? Alternatives?
>>>
>>> garth
>>>
>>>
>>>
>>>
>>>
>>> Yahoo! Groups Links
>>>
>>>
>>>
>>>
>>
>>
>>
>>
>>
>> Yahoo! Groups Links
>>
>>
>>
>>
>
>
>
>
>
> Yahoo! Groups Links
>
>
>
>

From Brian.Pawlowski@netapp.com Sun Aug 01 17:02:54 2004
Return-Path: <beepy@netapp.com>
X-Sender: beepy@netapp.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 34035 invoked from network); 2 Aug 2004 00:02:53 -0000
Received: from unknown (66.218.66.172)
by m5.grp.scd.yahoo.com with QMQP; 2 Aug 2004 00:02:53 -0000
Received: from unknown (HELO mx01.netapp.com) (198.95.226.53)
by mta4.grp.scd.yahoo.com with SMTP; 2 Aug 2004 00:02:53 -0000
Received: from frejya.corp.netapp.com (frejya [10.57.157.119])
by mx01.netapp.com (8.12.10/8.12.10/NTAP-1.4) with ESMTP id i7202pFC001794;
Sun, 1 Aug 2004 17:02:51 -0700 (PDT)
Received: from tooting.eng.netapp.com (tooting-fe.eng.netapp.com [10.56.10.118])
by frejya.corp.netapp.com (8.12.9/8.12.9/NTAP-1.5) with ESMTP id i7202pgC000817;
Sun, 1 Aug 2004 17:02:51 -0700 (PDT)
Received: (from beepy@localhost)
by tooting.eng.netapp.com (8.11.7p1+Sun/8.11.6) id i7202oT16617;
Sun, 1 Aug 2004 17:02:50 -0700 (PDT)
Message-Id: <200408020002.i7202oT16617@tooting.eng.netapp.com>
In-Reply-To: <C926E13C-E416-11D8-89F0-000A95A94F04@panasas.com> from Garth Gibson at "Aug 1, 4 04:59:12 pm"
To: pnfs-reqs@yahoogroups.com
Date: Sun, 1 Aug 2004 17:02:50 -0700 (PDT)
Cc: trond.myklebust@fys.uio.no (Trond Myklebust)
X-Mailer: ELM [version 2.4ME++ PL40 (25)]
MIME-Version: 1.0
Content-Type: text/plain; charset=US-ASCII
Content-Transfer-Encoding: 7bit
X-eGroups-Remote-IP: 198.95.226.53
X-eGroups-From: Brian Pawlowski <beepy@netapp.com>
From: Brian Pawlowski <Brian.Pawlowski@netapp.com>
Subject: Re: [pnfs-reqs] Aug 3 pNFS f-2-f reminder
X-Yahoo-Group-Post: member; u=169504717
X-Yahoo-Profile: brianpawlowski

I believe a 20 minute walk is fine with us!

Right Trond?

> I'm thinking I should book a dinner for us too. One recommendation was
> "The Boathouse", a one mile walk along the harbor from the East Tower,
> 'seafood and steak, $$$". I like the idea of a 20 min walk after an
> afternoon in the bar. And the hotel shuttle will drive anyone who
> wants to skip the walk.
>
> Speak up if you'd like to encourage me to keep it in the one building
> (dinner back at the same place as lunch) or in the next tower (5 minute
> walk, "mediterranean bistro, $$").
>
> garth
>
> On Aug 1, 2004, at 3:40 PM, Garth Gibson wrote:
> > Folks,
> >
> > I've heard from about 11 people that intend to gather during the
> > afternoon of Tues Aug 3 to discuss pNFS extensions to NFSv4. The
> > nominal plan is to "gather in the hotel lobby bar in the afternoon".
> > Maybe some details are needed? :-)
> >
> > The NFSv4 IETF meeting is Tues Aug 3 9-11:30 am in the Marina II
> > meeting room of East Tower (first building on the only road into the
> > "Island") of the Sheraton Harbor Island
> > (www.ietf.org/meetings/hotels_60.html, "across the street" from the
> > airport). Some of us will be in that meeting, but not all, and IETF
> > registration onsite is over $600, so some pNFS attendees will probably
> > not be registered.
> >
> > The hotel lobby bar is Quinn's -- it is behind reception on the right
> > as you pass reception. It is not a big place, so depending on how many
> > IETF attendees prefer a bar to a meeting room, we might have a problem.
> >
> > I thought we should at least have a good starting point, so I made a
> > reservation for 10 people (oops, I'll have to fix that) at 1pm in the
> > Harbor's Edge restaurant -- in the lobby of the East Tower behind
> > reception on the left and a little around the corner. Perhaps we
> > should start at noon, but I thought I'd leave a little time for the
> > NFSv4 attendees to "debrief" after the NFSv4 meeting before starting
> > this one.
> >
> > Hopefully enough of us can hold off eating lunch until 1pm so that the
> > restaurant treats us nicely. Once we think we have exhausted our
> > welcome there, we can drift over to Quinn's and drink ourselves into
> > technical agreement on pNFS.
> >
> > Attendees I expect at this time: Tom Talpay, Peter Corbett, Dave
> > Noveck, David Black, Julian Satran, Lee Ward, Tyce McLarty, Bill Loewe,
> > Charles Fan, Brent Welch, Garth Gibson.
> >
> > Please let us know if you're coming and are not on this list.
> >
> > Thanks!
> > garth
> >
> > Begin forwarded message:
> >
> >> From: Garth Gibson <garth@panasas.com>
> >> Date: July 8, 2004 9:13:40 AM PDT
> >> To: pnfs-reqs@yahoogroups.com
> >> Subject: [pnfs-reqs] Re: planning future meetings
> >> Reply-To: pnfs-reqs@yahoogroups.com
> >>
> >> Here is what I think is on the table for comments/interest from this
> >> community:
> >>
> >> Tues Aug 3, IETF 60, San Diego:
> >> - IETF NFSv4 meeting is in the morning
> >> - in the afternoon, and evening if appropriate, interested and
> >> available pNFS participants gather in the hotel lobby bar. the goal
> >> is
> >> to get 4-6 hours of meeting time to justify those of us traveling to
> >> San Diego that will not otherwise attend the IETF meeting. we might
> >> try
> >> to arrange a room in a local chinese restaurant, for example, but we
> >> do
> >> not expect any meeting rooms will be available at the IETF hotel and
> >> we
> >> do not want to create an officially conflicting meeting.
> >> - IETF schedule is not final and may change as late as 2 weeks before
> >> the meeting, so people's Tues afternoon may change. this is in part
> >> why we are suggesting that the meeting may go into the evening.
> >> staying the night is not a bad thing for most of us as the RDDP
> >> meeting
> >> is likely to be first thing on Wed morning anyway.
> >>
> >> Would folks please speak up on this list if they are going to attend
> >> this meeting.
> >>
> >> Thurs Sept 30, Pennsylvania:
> >> - an informal face-to-face in the Pittsburgh area, at either CMU or
> >> Nemacolin (for those who collaborate with or support CMU's PDL, this
> >> is
> >> the day after the PDL retreat)
> >> - this might be the last informal pNFS meeting done outside the NFSv4
> >> TWG schedule as we hope to soon convince the NFSv4 group to take us in
> >>
> >> week of Nov 8, IETF 61, Washington DC
> >> - similar to the Aug 3 meeting, a meeting in the bar or in a
> >> restaurant
> >> during the IETF meeting week
> >>
> >> garth
> >>
> >>
> >> On Jul 1, 2004, at 1:25 PM, Garth Gibson wrote:
> >>
> >>> Folks,
> >>>
> >>> We've gotten behind on meeting plans.
> >>>
> >>> The next IETF NFSv4 meeting is the morning of Tues Aug 3 in San
> >>> Diego.
> >>> See the IETF agenda at http://www.ietf.org/meetings/agenda_60.txt
> >>>
> >>> Hopefully pNFS will get some time on that agenda, but it will be a
> >>> small amount at most. What I'd like to propose is that we meet f-2-f
> >>> on Aug 2 or 3 (my preference is the afternoon on Aug 3) in San Diego.
> >>> I doubt we'll get a room in the IETF hotel (Sheraton San Diego Hotel
> >>> and Marina), so I'm also looking for proposals for where to hold the
> >>> meeting.
> >>>
> >>> Since timing is tight on this, please reply soon.
> >>>
> >>> And while you are thinking about your schedule, lets talk about the
> >>> meeting after. The next IETF is Nov 7-12 in Washington DC (which
> >>> overlaps SC04 in Pittsburgh). Again we should target a piece of the
> >>> not-very-long NFSv4 meeting and decide if we want to also have a
> >>> f-2-f.
> >>>
> >>> I'm happy to propose a day long interim meeting at CMU in Pittsburgh
> >>> in
> >>> September, before the cut off dates to submit for the IETF meeting in
> >>> Nov.
> >>>
> >>> Opinions? Date suggestions? Alternatives?
> >>>
> >>> garth
> >>>
> >>>
> >>>
> >>>
> >>>
> >>> Yahoo! Groups Links
> >>>
> >>>
> >>>
> >>>
> >>
> >>
> >>
> >>
> >>
> >> Yahoo! Groups Links
> >>
> >>
> >>
> >>
> >
> >
> >
> >
> >
> > Yahoo! Groups Links
> >
> >
> >
> >
>
>
>
>
>
> Yahoo! Groups Links
>
>
>
>
> 

From trond.myklebust@fys.uio.no Sun Aug 01 17:07:12 2004
Return-Path: <trondmy@trondhjem.org>
X-Sender: trondmy@trondhjem.org
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 31342 invoked from network); 2 Aug 2004 00:07:12 -0000
Received: from unknown (66.218.66.172)
by m16.grp.scd.yahoo.com with QMQP; 2 Aug 2004 00:07:12 -0000
Received: from unknown (HELO lade.trondhjem.org) (207.214.87.84)
by mta4.grp.scd.yahoo.com with SMTP; 2 Aug 2004 00:07:02 -0000
Received: from trondmy by lade.trondhjem.org with local (Exim 4.34)
id 1BrQLw-0001V6-Tx; Sun, 01 Aug 2004 17:06:57 -0700
To: Brian Pawlowski <beepy@netapp.com>
Cc: pnfs-reqs@yahoogroups.com
In-Reply-To: <200408020002.i7202oT16617@tooting.eng.netapp.com>
References: <200408020002.i7202oT16617@tooting.eng.netapp.com>
Content-Type: text/plain; charset=iso-8859-1
Content-Transfer-Encoding: quoted-printable
Message-Id: <1091405216.4730.4.camel@lade.trondhjem.org>
Mime-Version: 1.0
X-Mailer: Ximian Evolution 1.4.6
Date: Sun, 01 Aug 2004 17:06:56 -0700
Sender: Trond Myklebust <trondmy@trondhjem.org>
X-eGroups-Remote-IP: 207.214.87.84
From: Trond Myklebust <trond.myklebust@fys.uio.no>
Subject: Re: [pnfs-reqs] Aug 3 pNFS f-2-f reminder
X-Yahoo-Group-Post: member; u=194000208
X-Yahoo-Profile: trondmy

ADVERTISEMENT
click here
P� su , 01/08/2004 klokka 17:02, skreiv Brian Pawlowski:
> I believe a 20 minute walk is fine with us!
>
> Right Trond?

Afternoon in bar + walk + food is a winning combination as far as I'm
concerned...

Trond

> > I'm thinking I should book a dinner for us too. One recommendation was
> > "The Boathouse", a one mile walk along the harbor from the East Tower,
> > 'seafood and steak, $$$". I like the idea of a 20 min walk after an
> > afternoon in the bar. And the hotel shuttle will drive anyone who
> > wants to skip the walk.
> >
> > Speak up if you'd like to encourage me to keep it in the one building
> > (dinner back at the same place as lunch) or in the next tower (5 minute
> > walk, "mediterranean bistro, $$").
> >
> > garth
> >
> > On Aug 1, 2004, at 3:40 PM, Garth Gibson wrote:
> > > Folks,
> > >
> > > I've heard from about 11 people that intend to gather during the
> > > afternoon of Tues Aug 3 to discuss pNFS extensions to NFSv4. The
> > > nominal plan is to "gather in the hotel lobby bar in the afternoon".
> > > Maybe some details are needed? :-)
> > >
> > > The NFSv4 IETF meeting is Tues Aug 3 9-11:30 am in the Marina II
> > > meeting room of East Tower (first building on the only road into the
> > > "Island") of the Sheraton Harbor Island
> > > (www.ietf.org/meetings/hotels_60.html, "across the street" from the
> > > airport). Some of us will be in that meeting, but not all, and IETF
> > > registration onsite is over $600, so some pNFS attendees will probably
> > > not be registered.
> > >
> > > The hotel lobby bar is Quinn's -- it is behind reception on the right
> > > as you pass reception. It is not a big place, so depending on how many
> > > IETF attendees prefer a bar to a meeting room, we might have a problem.
> > >
> > > I thought we should at least have a good starting point, so I made a
> > > reservation for 10 people (oops, I'll have to fix that) at 1pm in the
> > > Harbor's Edge restaurant -- in the lobby of the East Tower behind
> > > reception on the left and a little around the corner. Perhaps we
> > > should start at noon, but I thought I'd leave a little time for the
> > > NFSv4 attendees to "debrief" after the NFSv4 meeting before starting
> > > this one.
> > >
> > > Hopefully enough of us can hold off eating lunch until 1pm so that the
> > > restaurant treats us nicely. Once we think we have exhausted our
> > > welcome there, we can drift over to Quinn's and drink ourselves into
> > > technical agreement on pNFS.
> > >
> > > Attendees I expect at this time: Tom Talpay, Peter Corbett, Dave
> > > Noveck, David Black, Julian Satran, Lee Ward, Tyce McLarty, Bill Loewe,
> > > Charles Fan, Brent Welch, Garth Gibson.
> > >
> > > Please let us know if you're coming and are not on this list.
> > >
> > > Thanks!
> > > garth
> > >
> > > Begin forwarded message:
> > >
> > >> From: Garth Gibson <garth@panasas.com>
> > >> Date: July 8, 2004 9:13:40 AM PDT
> > >> To: pnfs-reqs@yahoogroups.com
> > >> Subject: [pnfs-reqs] Re: planning future meetings
> > >> Reply-To: pnfs-reqs@yahoogroups.com
> > >>
> > >> Here is what I think is on the table for comments/interest from this
> > >> community:
> > >>
> > >> Tues Aug 3, IETF 60, San Diego:
> > >> - IETF NFSv4 meeting is in the morning
> > >> - in the afternoon, and evening if appropriate, interested and
> > >> available pNFS participants gather in the hotel lobby bar. the goal
> > >> is
> > >> to get 4-6 hours of meeting time to justify those of us traveling to
> > >> San Diego that will not otherwise attend the IETF meeting. we might
> > >> try
> > >> to arrange a room in a local chinese restaurant, for example, but we
> > >> do
> > >> not expect any meeting rooms will be available at the IETF hotel and
> > >> we
> > >> do not want to create an officially conflicting meeting.
> > >> - IETF schedule is not final and may change as late as 2 weeks before
> > >> the meeting, so people's Tues afternoon may change. this is in part
> > >> why we are suggesting that the meeting may go into the evening.
> > >> staying the night is not a bad thing for most of us as the RDDP
> > >> meeting
> > >> is likely to be first thing on Wed morning anyway.
> > >>
> > >> Would folks please speak up on this list if they are going to attend
> > >> this meeting.
> > >>
> > >> Thurs Sept 30, Pennsylvania:
> > >> - an informal face-to-face in the Pittsburgh area, at either CMU or
> > >> Nemacolin (for those who collaborate with or support CMU's PDL, this
> > >> is
> > >> the day after the PDL retreat)
> > >> - this might be the last informal pNFS meeting done outside the NFSv4
> > >> TWG schedule as we hope to soon convince the NFSv4 group to take us in
> > >>
> > >> week of Nov 8, IETF 61, Washington DC
> > >> - similar to the Aug 3 meeting, a meeting in the bar or in a
> > >> restaurant
> > >> during the IETF meeting week
> > >>
> > >> garth
> > >>
> > >>
> > >> On Jul 1, 2004, at 1:25 PM, Garth Gibson wrote:
> > >>
> > >>> Folks,
> > >>>
> > >>> We've gotten behind on meeting plans.
> > >>>
> > >>> The next IETF NFSv4 meeting is the morning of Tues Aug 3 in San
> > >>> Diego.
> > >>> See the IETF agenda at http://www.ietf.org/meetings/agenda_60.txt
> > >>>
> > >>> Hopefully pNFS will get some time on that agenda, but it will be a
> > >>> small amount at most. What I'd like to propose is that we meet f-2-f
> > >>> on Aug 2 or 3 (my preference is the afternoon on Aug 3) in San Diego.
> > >>> I doubt we'll get a room in the IETF hotel (Sheraton San Diego Hotel
> > >>> and Marina), so I'm also looking for proposals for where to hold the
> > >>> meeting.
> > >>>
> > >>> Since timing is tight on this, please reply soon.
> > >>>
> > >>> And while you are thinking about your schedule, lets talk about the
> > >>> meeting after. The next IETF is Nov 7-12 in Washington DC (which
> > >>> overlaps SC04 in Pittsburgh). Again we should target a piece of the
> > >>> not-very-long NFSv4 meeting and decide if we want to also have a
> > >>> f-2-f.
> > >>>
> > >>> I'm happy to propose a day long interim meeting at CMU in Pittsburgh
> > >>> in
> > >>> September, before the cut off dates to submit for the IETF meeting in
> > >>> Nov.
> > >>>
> > >>> Opinions? Date suggestions? Alternatives?
> > >>>
> > >>> garth
> > >>>
> > >>>
> > >>>
> > >>>
> > >>>
> > >>> Yahoo! Groups Links
> > >>>
> > >>>
> > >>>
> > >>>
> > >>
> > >>
> > >>
> > >>
> > >>
> > >> Yahoo! Groups Links
> > >>
> > >>
> > >>
> > >>
> > >
> > >
> > >
> > >
> > >
> > > Yahoo! Groups Links
> > >
> > >
> > >
> > >
> >
> >
> >
> >
> >
> > Yahoo! Groups Links
> >
> >
> >
> >
> >
> 

From garth@panasas.com Sun Aug 01 17:17:28 2004
Return-Path: <garth@panasas.com>
X-Sender: garth@panasas.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 70380 invoked from network); 2 Aug 2004 00:17:28 -0000
Received: from unknown (66.218.66.218)
by m17.grp.scd.yahoo.com with QMQP; 2 Aug 2004 00:17:28 -0000
Received: from unknown (HELO PIKES.panasas.com) (65.194.124.178)
by mta3.grp.scd.yahoo.com with SMTP; 2 Aug 2004 00:17:28 -0000
Received: from [127.0.0.1] ([172.17.19.3]) by PIKES.panasas.com with SMTP (Microsoft Exchange Internet Mail Service Version 5.5.2653.13)
id 2B56VKB2; Sun, 1 Aug 2004 20:17:25 -0400
In-Reply-To: <1091405216.4730.4.camel@lade.trondhjem.org>
References: <200408020002.i7202oT16617@tooting.eng.netapp.com> <1091405216.4730.4.camel@lade.trondhjem.org>
Mime-Version: 1.0 (Apple Message framework v618)
Content-Type: text/plain; charset=ISO-8859-1; format=flowed
Message-Id: <50183701-E419-11D8-89F0-000A95A94F04@panasas.com>
Content-Transfer-Encoding: quoted-printable
Cc: Brian Pawlowski <beepy@netapp.com>,
Trond Myklebust <trond.myklebust@fys.uio.no>
Date: Sun, 1 Aug 2004 17:17:17 -0700
To: pnfs-reqs@yahoogroups.com
X-Mailer: Apple Mail (2.618)
X-eGroups-Remote-IP: 65.194.124.178
From: Garth Gibson <garth@panasas.com>
Subject: Re: [pnfs-reqs] Aug 3 pNFS f-2-f reminder
X-Yahoo-Group-Post: member; u=169457820
X-Yahoo-Profile: garth_a_gibson

ADVERTISEMENT
Great! Rapid response and 2 new attendees swings it for me. I've
booked a table for 15 at 6pm at The Boathouse, 2040 Harbor Island Dr,
291-8010, in my name.

garth gibson

On Aug 1, 2004, at 5:06 PM, Trond Myklebust wrote:

> P� su , 01/08/2004 klokka 17:02, skreiv Brian Pawlowski:
>> I believe a 20 minute walk is fine with us!
>>
>> Right Trond?
>
> Afternoon in bar + walk + food is a winning combination as far as I'm
> concerned...
>
> Trond
>
>>> I'm thinking I should book a dinner for us too. One recommendation
>>> was
>>> "The Boathouse", a one mile walk along the harbor from the East
>>> Tower,
>>> 'seafood and steak, $$$". I like the idea of a 20 min walk after an
>>> afternoon in the bar. And the hotel shuttle will drive anyone who
>>> wants to skip the walk.
>>>
>>> Speak up if you'd like to encourage me to keep it in the one building
>>> (dinner back at the same place as lunch) or in the next tower (5
>>> minute
>>> walk, "mediterranean bistro, $$").
>>>
>>> garth
>>>
>>> On Aug 1, 2004, at 3:40 PM, Garth Gibson wrote:
>>>> Folks,
>>>>
>>>> I've heard from about 11 people that intend to gather during the
>>>> afternoon of Tues Aug 3 to discuss pNFS extensions to NFSv4. The
>>>> nominal plan is to "gather in the hotel lobby bar in the afternoon".
>>>> Maybe some details are needed? :-)
>>>>
>>>> The NFSv4 IETF meeting is Tues Aug 3 9-11:30 am in the Marina II
>>>> meeting room of East Tower (first building on the only road into the
>>>> "Island") of the Sheraton Harbor Island
>>>> (www.ietf.org/meetings/hotels_60.html, "across the street" from the
>>>> airport). Some of us will be in that meeting, but not all, and IETF
>>>> registration onsite is over $600, so some pNFS attendees will
>>>> probably
>>>> not be registered.
>>>>
>>>> The hotel lobby bar is Quinn's -- it is behind reception on the
>>>> right
>>>> as you pass reception. It is not a big place, so depending on how
>>>> many
>>>> IETF attendees prefer a bar to a meeting room, we might have a
>>>> problem.
>>>>
>>>> I thought we should at least have a good starting point, so I made a
>>>> reservation for 10 people (oops, I'll have to fix that) at 1pm in
>>>> the
>>>> Harbor's Edge restaurant -- in the lobby of the East Tower behind
>>>> reception on the left and a little around the corner. Perhaps we
>>>> should start at noon, but I thought I'd leave a little time for the
>>>> NFSv4 attendees to "debrief" after the NFSv4 meeting before starting
>>>> this one.
>>>>
>>>> Hopefully enough of us can hold off eating lunch until 1pm so that
>>>> the
>>>> restaurant treats us nicely. Once we think we have exhausted our
>>>> welcome there, we can drift over to Quinn's and drink ourselves into
>>>> technical agreement on pNFS.
>>>>
>>>> Attendees I expect at this time: Tom Talpay, Peter Corbett, Dave
>>>> Noveck, David Black, Julian Satran, Lee Ward, Tyce McLarty, Bill
>>>> Loewe,
>>>> Charles Fan, Brent Welch, Garth Gibson.
>>>>
>>>> Please let us know if you're coming and are not on this list.
>>>>
>>>> Thanks!
>>>> garth
>>>>
>>>> Begin forwarded message:
>>>>
>>>>> From: Garth Gibson <garth@panasas.com>
>>>>> Date: July 8, 2004 9:13:40 AM PDT
>>>>> To: pnfs-reqs@yahoogroups.com
>>>>> Subject: [pnfs-reqs] Re: planning future meetings
>>>>> Reply-To: pnfs-reqs@yahoogroups.com
>>>>>
>>>>> Here is what I think is on the table for comments/interest from
>>>>> this
>>>>> community:
>>>>>
>>>>> Tues Aug 3, IETF 60, San Diego:
>>>>> - IETF NFSv4 meeting is in the morning
>>>>> - in the afternoon, and evening if appropriate, interested and
>>>>> available pNFS participants gather in the hotel lobby bar. the
>>>>> goal
>>>>> is
>>>>> to get 4-6 hours of meeting time to justify those of us traveling
>>>>> to
>>>>> San Diego that will not otherwise attend the IETF meeting. we might
>>>>> try
>>>>> to arrange a room in a local chinese restaurant, for example, but
>>>>> we
>>>>> do
>>>>> not expect any meeting rooms will be available at the IETF hotel
>>>>> and
>>>>> we
>>>>> do not want to create an officially conflicting meeting.
>>>>> - IETF schedule is not final and may change as late as 2 weeks
>>>>> before
>>>>> the meeting, so people's Tues afternoon may change. this is in
>>>>> part
>>>>> why we are suggesting that the meeting may go into the evening.
>>>>> staying the night is not a bad thing for most of us as the RDDP
>>>>> meeting
>>>>> is likely to be first thing on Wed morning anyway.
>>>>>
>>>>> Would folks please speak up on this list if they are going to
>>>>> attend
>>>>> this meeting.
>>>>>
>>>>> Thurs Sept 30, Pennsylvania:
>>>>> - an informal face-to-face in the Pittsburgh area, at either CMU or
>>>>> Nemacolin (for those who collaborate with or support CMU's PDL,
>>>>> this
>>>>> is
>>>>> the day after the PDL retreat)
>>>>> - this might be the last informal pNFS meeting done outside the
>>>>> NFSv4
>>>>> TWG schedule as we hope to soon convince the NFSv4 group to take
>>>>> us in
>>>>>
>>>>> week of Nov 8, IETF 61, Washington DC
>>>>> - similar to the Aug 3 meeting, a meeting in the bar or in a
>>>>> restaurant
>>>>> during the IETF meeting week
>>>>>
>>>>> garth
>>>>>
>>>>>
>>>>> On Jul 1, 2004, at 1:25 PM, Garth Gibson wrote:
>>>>>
>>>>>> Folks,
>>>>>>
>>>>>> We've gotten behind on meeting plans.
>>>>>>
>>>>>> The next IETF NFSv4 meeting is the morning of Tues Aug 3 in San
>>>>>> Diego.
>>>>>> See the IETF agenda at http://www.ietf.org/meetings/agenda_60.txt
>>>>>>
>>>>>> Hopefully pNFS will get some time on that agenda, but it will be a
>>>>>> small amount at most. What I'd like to propose is that we meet
>>>>>> f-2-f
>>>>>> on Aug 2 or 3 (my preference is the afternoon on Aug 3) in San
>>>>>> Diego.
>>>>>> I doubt we'll get a room in the IETF hotel (Sheraton San Diego
>>>>>> Hotel
>>>>>> and Marina), so I'm also looking for proposals for where to hold
>>>>>> the
>>>>>> meeting.
>>>>>>
>>>>>> Since timing is tight on this, please reply soon.
>>>>>>
>>>>>> And while you are thinking about your schedule, lets talk about
>>>>>> the
>>>>>> meeting after. The next IETF is Nov 7-12 in Washington DC (which
>>>>>> overlaps SC04 in Pittsburgh). Again we should target a piece of
>>>>>> the
>>>>>> not-very-long NFSv4 meeting and decide if we want to also have a
>>>>>> f-2-f.
>>>>>>
>>>>>> I'm happy to propose a day long interim meeting at CMU in
>>>>>> Pittsburgh
>>>>>> in
>>>>>> September, before the cut off dates to submit for the IETF
>>>>>> meeting in
>>>>>> Nov.
>>>>>>
>>>>>> Opinions? Date suggestions? Alternatives?
>>>>>>
>>>>>> garth
>>>>>>
>>>>>>
>>>>>>
>>>>>>
>>>>>>
>>>>>> Yahoo! Groups Links
>>>>>>
>>>>>>
>>>>>>
>>>>>>
>>>>>
>>>>>
>>>>>
>>>>>
>>>>>
>>>>> Yahoo! Groups Links
>>>>>
>>>>>
>>>>>
>>>>>
>>>>
>>>>
>>>>
>>>>
>>>>
>>>> Yahoo! Groups Links
>>>>
>>>>
>>>>
>>>>
>>>
>>>
>>>
>>>
>>>
>>> Yahoo! Groups Links
>>>
>>>
>>>
>>>
>>>
>>
>
>
>
>
> Yahoo! Groups Links
>
>
>
>

From black_david@emc.com Sun Aug 01 18:35:57 2004
Return-Path: <Black_David@emc.com>
X-Sender: Black_David@emc.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 26370 invoked from network); 2 Aug 2004 01:35:56 -0000
Received: from unknown (66.218.66.216)
by m22.grp.scd.yahoo.com with QMQP; 2 Aug 2004 01:35:56 -0000
Received: from unknown (HELO mailhub.lss.emc.com) (168.159.2.31)
by mta1.grp.scd.yahoo.com with SMTP; 2 Aug 2004 01:35:56 -0000
Received: from MAHO3MSX2.corp.emc.com (maho3msx2.isus.emc.com [128.221.11.32])
by mailhub.lss.emc.com (Switch-3.1.6/Switch-3.1.6) with ESMTP id i721ZkDg020606
for <pnfs-reqs@yahoogroups.com>; Sun, 1 Aug 2004 21:35:47 -0400 (EDT)
Received: by maho3msx2.isus.emc.com with Internet Mail Service (5.5.2653.19)
id <PNMAD55Z>; Sun, 1 Aug 2004 21:35:48 -0400
Message-ID: <B459CE1AFFC52D4688B2A5B842CA35EA7A5D78@corpmx14.corp.emc.com>
To: pnfs-reqs@yahoogroups.com
Date: Sun, 1 Aug 2004 21:35:48 -0400
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2653.19)
Content-Type: text/plain;
charset="iso-8859-1"
Content-Transfer-Encoding: quoted-printable
X-PMX-Version: 4.6.0.97784, Antispam-Core: 4.6.0.97340, Antispam-Data: 2004.8.1.109244
X-PerlMx-Spam: SPAM=17%, Report='CASHCASHCASH 1.599, ONLY_COST 0.297, __IMS_MSGID 0, __HAS_MSGID 0, __SANE_MSGID 0, NO_REAL_NAME 0.000, __TO_MALFORMED_2 0, SUBJECT_MONTH 0, SUBJECT_MONTH_2 0, __MIME_VERSION 0, __ANY_IMS_MUA 0, EXCHANGE_SERVER 0, __HAS_X_MAILER 0, __IMS_MUA 0, __EVITE_CTYPE 0, __CTYPE_CHARSET_QUOTED 0, __CT_TEXT_PLAIN 0, __CT 0, __CTE 0, EMC_FROM_0 -0, TO_BE_REMOVED_REPLY 0.000, __THREE_DOLLARS 0, QUOTED_EMAIL_TEXT 0, __MIME_TEXT_ONLY 0, __TLG_EMC_ENVFROM_0 0'
X-eGroups-Remote-IP: 168.159.2.31
From: black_david@emc.com
Subject: RE: [pnfs-reqs] Aug 3 pNFS f-2-f reminder
X-Yahoo-Group-Post: member; u=82420288
X-Yahoo-Profile: dlb237

ADVERTISEMENT
Needs to be ~1 hour later, please. There are a couple of
5-6pm IETF sessions on Tue that I may need/want to attend.

Thanks,
--David

> -----Original Message-----
> From: Garth Gibson [mailto:garth@panasas.com]
> Sent: Sunday, August 01, 2004 8:17 PM
> To: pnfs-reqs@yahoogroups.com
> Cc: Brian Pawlowski; Trond Myklebust
> Subject: Re: [pnfs-reqs] Aug 3 pNFS f-2-f reminder
>
>
> Great! Rapid response and 2 new attendees swings it for me. I've
> booked a table for 15 at 6pm at The Boathouse, 2040 Harbor Island Dr,
> 291-8010, in my name.
>
> garth gibson
>
> On Aug 1, 2004, at 5:06 PM, Trond Myklebust wrote:
>
> > P� su , 01/08/2004 klokka 17:02, skreiv Brian Pawlowski:
> >> I believe a 20 minute walk is fine with us!
> >>
> >> Right Trond?
> >
> > Afternoon in bar + walk + food is a winning combination as
> far as I'm
> > concerned...
> >
> > Trond
> >
> >>> I'm thinking I should book a dinner for us too. One
> recommendation
> >>> was
> >>> "The Boathouse", a one mile walk along the harbor from the East
> >>> Tower,
> >>> 'seafood and steak, $$$". I like the idea of a 20 min
> walk after an
> >>> afternoon in the bar. And the hotel shuttle will drive anyone who
> >>> wants to skip the walk.
> >>>
> >>> Speak up if you'd like to encourage me to keep it in the
> one building
> >>> (dinner back at the same place as lunch) or in the next tower (5
> >>> minute
> >>> walk, "mediterranean bistro, $$").
> >>>
> >>> garth
> >>>
> >>> On Aug 1, 2004, at 3:40 PM, Garth Gibson wrote:
> >>>> Folks,
> >>>>
> >>>> I've heard from about 11 people that intend to gather during the
> >>>> afternoon of Tues Aug 3 to discuss pNFS extensions to NFSv4. The
> >>>> nominal plan is to "gather in the hotel lobby bar in the
> afternoon".
> >>>> Maybe some details are needed? :-)
> >>>>
> >>>> The NFSv4 IETF meeting is Tues Aug 3 9-11:30 am in the Marina II
> >>>> meeting room of East Tower (first building on the only
> road into the
> >>>> "Island") of the Sheraton Harbor Island
> >>>> (www.ietf.org/meetings/hotels_60.html, "across the
> street" from the
> >>>> airport). Some of us will be in that meeting, but not
> all, and IETF
> >>>> registration onsite is over $600, so some pNFS attendees will
> >>>> probably
> >>>> not be registered.
> >>>>
> >>>> The hotel lobby bar is Quinn's -- it is behind reception on the
> >>>> right
> >>>> as you pass reception. It is not a big place, so
> depending on how
> >>>> many
> >>>> IETF attendees prefer a bar to a meeting room, we might have a
> >>>> problem.
> >>>>
> >>>> I thought we should at least have a good starting point,
> so I made a
> >>>> reservation for 10 people (oops, I'll have to fix that)
> at 1pm in
> >>>> the
> >>>> Harbor's Edge restaurant -- in the lobby of the East Tower behind
> >>>> reception on the left and a little around the corner. Perhaps we
> >>>> should start at noon, but I thought I'd leave a little
> time for the
> >>>> NFSv4 attendees to "debrief" after the NFSv4 meeting
> before starting
> >>>> this one.
> >>>>
> >>>> Hopefully enough of us can hold off eating lunch until
> 1pm so that
> >>>> the
> >>>> restaurant treats us nicely. Once we think we have exhausted our
> >>>> welcome there, we can drift over to Quinn's and drink
> ourselves into
> >>>> technical agreement on pNFS.
> >>>>
> >>>> Attendees I expect at this time: Tom Talpay, Peter Corbett, Dave
> >>>> Noveck, David Black, Julian Satran, Lee Ward, Tyce McLarty, Bill
> >>>> Loewe,
> >>>> Charles Fan, Brent Welch, Garth Gibson.
> >>>>
> >>>> Please let us know if you're coming and are not on this list.
> >>>>
> >>>> Thanks!
> >>>> garth
> >>>>
> >>>> Begin forwarded message:
> >>>>
> >>>>> From: Garth Gibson <garth@panasas.com>
> >>>>> Date: July 8, 2004 9:13:40 AM PDT
> >>>>> To: pnfs-reqs@yahoogroups.com
> >>>>> Subject: [pnfs-reqs] Re: planning future meetings
> >>>>> Reply-To: pnfs-reqs@yahoogroups.com
> >>>>>
> >>>>> Here is what I think is on the table for comments/interest from
> >>>>> this
> >>>>> community:
> >>>>>
> >>>>> Tues Aug 3, IETF 60, San Diego:
> >>>>> - IETF NFSv4 meeting is in the morning
> >>>>> - in the afternoon, and evening if appropriate, interested and
> >>>>> available pNFS participants gather in the hotel lobby bar. the
> >>>>> goal
> >>>>> is
> >>>>> to get 4-6 hours of meeting time to justify those of us
> traveling
> >>>>> to
> >>>>> San Diego that will not otherwise attend the IETF
> meeting. we might
> >>>>> try
> >>>>> to arrange a room in a local chinese restaurant, for
> example, but
> >>>>> we
> >>>>> do
> >>>>> not expect any meeting rooms will be available at the
> IETF hotel
> >>>>> and
> >>>>> we
> >>>>> do not want to create an officially conflicting meeting.
> >>>>> - IETF schedule is not final and may change as late as 2 weeks
> >>>>> before
> >>>>> the meeting, so people's Tues afternoon may change. this is in
> >>>>> part
> >>>>> why we are suggesting that the meeting may go into the evening.
> >>>>> staying the night is not a bad thing for most of us as the RDDP
> >>>>> meeting
> >>>>> is likely to be first thing on Wed morning anyway.
> >>>>>
> >>>>> Would folks please speak up on this list if they are going to
> >>>>> attend
> >>>>> this meeting.
> >>>>>
> >>>>> Thurs Sept 30, Pennsylvania:
> >>>>> - an informal face-to-face in the Pittsburgh area, at
> either CMU or
> >>>>> Nemacolin (for those who collaborate with or support CMU's PDL,
> >>>>> this
> >>>>> is
> >>>>> the day after the PDL retreat)
> >>>>> - this might be the last informal pNFS meeting done outside the
> >>>>> NFSv4
> >>>>> TWG schedule as we hope to soon convince the NFSv4
> group to take
> >>>>> us in
> >>>>>
> >>>>> week of Nov 8, IETF 61, Washington DC
> >>>>> - similar to the Aug 3 meeting, a meeting in the bar or in a
> >>>>> restaurant
> >>>>> during the IETF meeting week
> >>>>>
> >>>>> garth
> >>>>>
> >>>>>
> >>>>> On Jul 1, 2004, at 1:25 PM, Garth Gibson wrote:
> >>>>>
> >>>>>> Folks,
> >>>>>>
> >>>>>> We've gotten behind on meeting plans.
> >>>>>>
> >>>>>> The next IETF NFSv4 meeting is the morning of Tues Aug 3 in San
> >>>>>> Diego.
> >>>>>> See the IETF agenda at
> http://www.ietf.org/meetings/agenda_60.txt
> >>>>>>
> >>>>>> Hopefully pNFS will get some time on that agenda, but
> it will be a
> >>>>>> small amount at most. What I'd like to propose is
> that we meet
> >>>>>> f-2-f
> >>>>>> on Aug 2 or 3 (my preference is the afternoon on Aug 3) in San
> >>>>>> Diego.
> >>>>>> I doubt we'll get a room in the IETF hotel (Sheraton San Diego
> >>>>>> Hotel
> >>>>>> and Marina), so I'm also looking for proposals for
> where to hold
> >>>>>> the
> >>>>>> meeting.
> >>>>>>
> >>>>>> Since timing is tight on this, please reply soon.
> >>>>>>
> >>>>>> And while you are thinking about your schedule, lets
> talk about
> >>>>>> the
> >>>>>> meeting after. The next IETF is Nov 7-12 in
> Washington DC (which
> >>>>>> overlaps SC04 in Pittsburgh). Again we should target
> a piece of
> >>>>>> the
> >>>>>> not-very-long NFSv4 meeting and decide if we want to
> also have a
> >>>>>> f-2-f.
> >>>>>>
> >>>>>> I'm happy to propose a day long interim meeting at CMU in
> >>>>>> Pittsburgh
> >>>>>> in
> >>>>>> September, before the cut off dates to submit for the IETF
> >>>>>> meeting in
> >>>>>> Nov.
> >>>>>>
> >>>>>> Opinions? Date suggestions? Alternatives?
> >>>>>>
> >>>>>> garth
> >>>>>>
> >>>>>>
> >>>>>>
> >>>>>>
> >>>>>>
> >>>>>> Yahoo! Groups Links
> >>>>>>
> >>>>>>
> >>>>>>
> >>>>>>
> >>>>>
> >>>>>
> >>>>>
> >>>>>
> >>>>>
> >>>>> Yahoo! Groups Links
> >>>>>
> >>>>>
> >>>>>
> >>>>>
> >>>>
> >>>>
> >>>>
> >>>>
> >>>>
> >>>> Yahoo! Groups Links
> >>>>
> >>>>
> >>>>
> >>>>
> >>>
> >>>
> >>>
> >>>
> >>>
> >>> Yahoo! Groups Links
> >>>
> >>>
> >>>
> >>>
> >>>
> >>
> >
> >
> >
> >
> > Yahoo! Groups Links
> >
> >
> >
> >
>
>
>
> ------------------------ Yahoo! Groups Sponsor
> --------------------~-->
> Yahoo! Domains - Claim yours for only $14.70
> http://us.click.yahoo.com/Z1wmxD/DREIAA/yQLSAA/W6uqlB/TM
> --------------------------------------------------------------
> ------~->
>
>
> Yahoo! Groups Links
>
>
>
>
> 

From Brian.Pawlowski@netapp.com Sun Aug 01 18:52:33 2004
Return-Path: <beepy@netapp.com>
X-Sender: beepy@netapp.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 62751 invoked from network); 2 Aug 2004 01:52:33 -0000
Received: from unknown (66.218.66.216)
by m4.grp.scd.yahoo.com with QMQP; 2 Aug 2004 01:52:33 -0000
Received: from unknown (HELO mx01.netapp.com) (198.95.226.53)
by mta1.grp.scd.yahoo.com with SMTP; 2 Aug 2004 01:52:33 -0000
Received: from hawk.corp.netapp.com (hawk [10.57.156.122])
by mx01.netapp.com (8.12.10/8.12.10/NTAP-1.4) with ESMTP id i721niFC008496;
Sun, 1 Aug 2004 18:49:44 -0700 (PDT)
Received: from tooting.eng.netapp.com (tooting-fe.eng.netapp.com [10.56.10.118])
by hawk.corp.netapp.com (8.12.9/8.12.9/NTAP-1.5) with ESMTP id i721nhkE011897;
Sun, 1 Aug 2004 18:49:43 -0700 (PDT)
Received: (from beepy@localhost)
by tooting.eng.netapp.com (8.11.7p1+Sun/8.11.6) id i721nhq28990;
Sun, 1 Aug 2004 18:49:43 -0700 (PDT)
Message-Id: <200408020149.i721nhq28990@tooting.eng.netapp.com>
In-Reply-To: <1091405216.4730.4.camel@lade.trondhjem.org> from Trond Myklebust at "Aug 1, 4 05:06:56 pm"
To: trond.myklebust@fys.uio.no (Trond Myklebust)
Date: Sun, 1 Aug 2004 18:49:43 -0700 (PDT)
Cc: beepy@netapp.com, pnfs-reqs@yahoogroups.com
X-Mailer: ELM [version 2.4ME++ PL40 (25)]
MIME-Version: 1.0
Content-Type: text/plain; charset=ISO-8859-1
Content-Transfer-Encoding: quoted-printable
X-eGroups-Remote-IP: 198.95.226.53
X-eGroups-From: Brian Pawlowski <beepy@netapp.com>
From: Brian Pawlowski <Brian.Pawlowski@netapp.com>
Subject: Re: SPAM: Re: [pnfs-reqs] Aug 3 pNFS f-2-f reminder
X-Yahoo-Group-Post: member; u=169504717
X-Yahoo-Profile: brianpawlowski

Ah - again the reply to is the whole alias.

Our secret is out...

> P� su , 01/08/2004 klokka 17:02, skreiv Brian Pawlowski:
> > I believe a 20 minute walk is fine with us!
> >
> > Right Trond?
>
> Afternoon in bar + walk + food is a winning combination as far as I'm
> concerned...
>
> Trond
>
> > > I'm thinking I should book a dinner for us too. One recommendation was
> > > "The Boathouse", a one mile walk along the harbor from the East Tower,
> > > 'seafood and steak, $$$". I like the idea of a 20 min walk after an
> > > afternoon in the bar. And the hotel shuttle will drive anyone who
> > > wants to skip the walk.
> > >
> > > Speak up if you'd like to encourage me to keep it in the one building
> > > (dinner back at the same place as lunch) or in the next tower (5 minute
> > > walk, "mediterranean bistro, $$").
> > >
> > > garth
> > >
> > > On Aug 1, 2004, at 3:40 PM, Garth Gibson wrote:
> > > > Folks,
> > > >
> > > > I've heard from about 11 people that intend to gather during the
> > > > afternoon of Tues Aug 3 to discuss pNFS extensions to NFSv4. The
> > > > nominal plan is to "gather in the hotel lobby bar in the afternoon".
> > > > Maybe some details are needed? :-)
> > > >
> > > > The NFSv4 IETF meeting is Tues Aug 3 9-11:30 am in the Marina II
> > > > meeting room of East Tower (first building on the only road into the
> > > > "Island") of the Sheraton Harbor Island
> > > > (www.ietf.org/meetings/hotels_60.html, "across the street" from the
> > > > airport). Some of us will be in that meeting, but not all, and IETF
> > > > registration onsite is over $600, so some pNFS attendees will probably
> > > > not be registered.
> > > >
> > > > The hotel lobby bar is Quinn's -- it is behind reception on the right
> > > > as you pass reception. It is not a big place, so depending on how many
> > > > IETF attendees prefer a bar to a meeting room, we might have a problem.
> > > >
> > > > I thought we should at least have a good starting point, so I made a
> > > > reservation for 10 people (oops, I'll have to fix that) at 1pm in the
> > > > Harbor's Edge restaurant -- in the lobby of the East Tower behind
> > > > reception on the left and a little around the corner. Perhaps we
> > > > should start at noon, but I thought I'd leave a little time for the
> > > > NFSv4 attendees to "debrief" after the NFSv4 meeting before starting
> > > > this one.
> > > >
> > > > Hopefully enough of us can hold off eating lunch until 1pm so that the
> > > > restaurant treats us nicely. Once we think we have exhausted our
> > > > welcome there, we can drift over to Quinn's and drink ourselves into
> > > > technical agreement on pNFS.
> > > >
> > > > Attendees I expect at this time: Tom Talpay, Peter Corbett, Dave
> > > > Noveck, David Black, Julian Satran, Lee Ward, Tyce McLarty, Bill Loewe,
> > > > Charles Fan, Brent Welch, Garth Gibson.
> > > >
> > > > Please let us know if you're coming and are not on this list.
> > > >
> > > > Thanks!
> > > > garth
> > > >
> > > > Begin forwarded message:
> > > >
> > > >> From: Garth Gibson <garth@panasas.com>
> > > >> Date: July 8, 2004 9:13:40 AM PDT
> > > >> To: pnfs-reqs@yahoogroups.com
> > > >> Subject: [pnfs-reqs] Re: planning future meetings
> > > >> Reply-To: pnfs-reqs@yahoogroups.com
> > > >>
> > > >> Here is what I think is on the table for comments/interest from this
> > > >> community:
> > > >>
> > > >> Tues Aug 3, IETF 60, San Diego:
> > > >> - IETF NFSv4 meeting is in the morning
> > > >> - in the afternoon, and evening if appropriate, interested and
> > > >> available pNFS participants gather in the hotel lobby bar. the goal
> > > >> is
> > > >> to get 4-6 hours of meeting time to justify those of us traveling to
> > > >> San Diego that will not otherwise attend the IETF meeting. we might
> > > >> try
> > > >> to arrange a room in a local chinese restaurant, for example, but we
> > > >> do
> > > >> not expect any meeting rooms will be available at the IETF hotel and
> > > >> we
> > > >> do not want to create an officially conflicting meeting.
> > > >> - IETF schedule is not final and may change as late as 2 weeks before
> > > >> the meeting, so people's Tues afternoon may change. this is in part
> > > >> why we are suggesting that the meeting may go into the evening.
> > > >> staying the night is not a bad thing for most of us as the RDDP
> > > >> meeting
> > > >> is likely to be first thing on Wed morning anyway.
> > > >>
> > > >> Would folks please speak up on this list if they are going to attend
> > > >> this meeting.
> > > >>
> > > >> Thurs Sept 30, Pennsylvania:
> > > >> - an informal face-to-face in the Pittsburgh area, at either CMU or
> > > >> Nemacolin (for those who collaborate with or support CMU's PDL, this
> > > >> is
> > > >> the day after the PDL retreat)
> > > >> - this might be the last informal pNFS meeting done outside the NFSv4
> > > >> TWG schedule as we hope to soon convince the NFSv4 group to take us in
> > > >>
> > > >> week of Nov 8, IETF 61, Washington DC
> > > >> - similar to the Aug 3 meeting, a meeting in the bar or in a
> > > >> restaurant
> > > >> during the IETF meeting week
> > > >>
> > > >> garth
> > > >>
> > > >>
> > > >> On Jul 1, 2004, at 1:25 PM, Garth Gibson wrote:
> > > >>
> > > >>> Folks,
> > > >>>
> > > >>> We've gotten behind on meeting plans.
> > > >>>
> > > >>> The next IETF NFSv4 meeting is the morning of Tues Aug 3 in San
> > > >>> Diego.
> > > >>> See the IETF agenda at http://www.ietf.org/meetings/agenda_60.txt
> > > >>>
> > > >>> Hopefully pNFS will get some time on that agenda, but it will be a
> > > >>> small amount at most. What I'd like to propose is that we meet f-2-f
> > > >>> on Aug 2 or 3 (my preference is the afternoon on Aug 3) in San Diego.
> > > >>> I doubt we'll get a room in the IETF hotel (Sheraton San Diego Hotel
> > > >>> and Marina), so I'm also looking for proposals for where to hold the
> > > >>> meeting.
> > > >>>
> > > >>> Since timing is tight on this, please reply soon.
> > > >>>
> > > >>> And while you are thinking about your schedule, lets talk about the
> > > >>> meeting after. The next IETF is Nov 7-12 in Washington DC (which
> > > >>> overlaps SC04 in Pittsburgh). Again we should target a piece of the
> > > >>> not-very-long NFSv4 meeting and decide if we want to also have a
> > > >>> f-2-f.
> > > >>>
> > > >>> I'm happy to propose a day long interim meeting at CMU in Pittsburgh
> > > >>> in
> > > >>> September, before the cut off dates to submit for the IETF meeting in
> > > >>> Nov.
> > > >>>
> > > >>> Opinions? Date suggestions? Alternatives?
> > > >>>
> > > >>> garth
> > > >>>
> > > >>>
> > > >>>
> > > >>>
> > > >>>
> > > >>> Yahoo! Groups Links
> > > >>>
> > > >>>
> > > >>>
> > > >>>
> > > >>
> > > >>
> > > >>
> > > >>
> > > >>
> > > >> Yahoo! Groups Links
> > > >>
> > > >>
> > > >>
> > > >>
> > > >
> > > >
> > > >
> > > >
> > > >
> > > > Yahoo! Groups Links
> > > >
> > > >
> > > >
> > > >
> > >
> > >
> > >
> > >
> > >
> > > Yahoo! Groups Links
> > >
> > >
> > >
> > >
> > >
> >
> 

From garth@panasas.com Sun Aug 01 19:06:34 2004
Return-Path: <garth@panasas.com>
X-Sender: garth@panasas.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 9370 invoked from network); 2 Aug 2004 02:06:33 -0000
Received: from unknown (66.218.66.217)
by m20.grp.scd.yahoo.com with QMQP; 2 Aug 2004 02:06:33 -0000
Received: from unknown (HELO PIKES.panasas.com) (65.194.124.178)
by mta2.grp.scd.yahoo.com with SMTP; 2 Aug 2004 02:06:27 -0000
Received: from [127.0.0.1] ([172.17.19.3]) by PIKES.panasas.com with SMTP (Microsoft Exchange Internet Mail Service Version 5.5.2653.13)
id 2B56VKD1; Sun, 1 Aug 2004 22:06:20 -0400
In-Reply-To: <B459CE1AFFC52D4688B2A5B842CA35EA7A5D78@corpmx14.corp.emc.com>
References: <B459CE1AFFC52D4688B2A5B842CA35EA7A5D78@corpmx14.corp.emc.com>
Mime-Version: 1.0 (Apple Message framework v618)
Content-Type: text/plain; charset=ISO-8859-1; format=flowed
Message-Id: <88CFC258-E428-11D8-89F0-000A95A94F04@panasas.com>
Content-Transfer-Encoding: quoted-printable
Cc: Garth Gibson <garth@panasas.com>
Date: Sun, 1 Aug 2004 19:06:15 -0700
To: pnfs-reqs@yahoogroups.com
X-Mailer: Apple Mail (2.618)
X-eGroups-Remote-IP: 65.194.124.178
From: Garth Gibson <garth@panasas.com>
Subject: Re: [pnfs-reqs] Aug 3 pNFS f-2-f reminder
X-Yahoo-Group-Post: member; u=169457820
X-Yahoo-Profile: garth_a_gibson

Sounds like a good reason for adjusting our plans. I rescheduled to
7pm.

So the schedule currently looks like:

1pm - (3pm?) : Harbor's Edge Restaurant (working lunch)
then - 6:15ish : Quinn's bar
6:30 - 7 : walk to The Boathouse
7pm - 9ish : The Boathouse for (working/unwinding) dinner

Folks, it would help if I could get a better head count for at least
lunch and dinner.

My current list of 13 people joining us for both is:

Tom Talpay
Peter Corbett
Dave Noveck
David Black
Julian Satran
Lee Ward
Tyce McLarty
Bill Loewe
Charles Fan
Brent Welch
Garth Gibson
Trond Myklebust
Brian Pawlowski

Please let me know if a name needs to be added, or if anyone will be
skipping lunch or dinner.

Thanks
garth


On Aug 1, 2004, at 6:35 PM, black_david@emc.com wrote:

> Needs to be ~1 hour later, please. There are a couple of
> 5-6pm IETF sessions on Tue that I may need/want to attend.
>
> Thanks,
> --David
>
>> -----Original Message-----
>> From: Garth Gibson [mailto:garth@panasas.com]
>> Sent: Sunday, August 01, 2004 8:17 PM
>> To: pnfs-reqs@yahoogroups.com
>> Cc: Brian Pawlowski; Trond Myklebust
>> Subject: Re: [pnfs-reqs] Aug 3 pNFS f-2-f reminder
>>
>>
>> Great! Rapid response and 2 new attendees swings it for me. I've
>> booked a table for 15 at 6pm at The Boathouse, 2040 Harbor Island Dr,
>> 291-8010, in my name.
>>
>> garth gibson
>>
>> On Aug 1, 2004, at 5:06 PM, Trond Myklebust wrote:
>>
>>> P� su , 01/08/2004 klokka 17:02, skreiv Brian Pawlowski:
>>>> I believe a 20 minute walk is fine with us!
>>>>
>>>> Right Trond?
>>>
>>> Afternoon in bar + walk + food is a winning combination as
>> far as I'm
>>> concerned...
>>>
>>> Trond
>>>
>>>>> I'm thinking I should book a dinner for us too. One
>> recommendation
>>>>> was
>>>>> "The Boathouse", a one mile walk along the harbor from the East
>>>>> Tower,
>>>>> 'seafood and steak, $$$". I like the idea of a 20 min
>> walk after an
>>>>> afternoon in the bar. And the hotel shuttle will drive anyone who
>>>>> wants to skip the walk.
>>>>>
>>>>> Speak up if you'd like to encourage me to keep it in the
>> one building
>>>>> (dinner back at the same place as lunch) or in the next tower (5
>>>>> minute
>>>>> walk, "mediterranean bistro, $$").
>>>>>
>>>>> garth
>>>>>
>>>>> On Aug 1, 2004, at 3:40 PM, Garth Gibson wrote:
>>>>>> Folks,
>>>>>>
>>>>>> I've heard from about 11 people that intend to gather during the
>>>>>> afternoon of Tues Aug 3 to discuss pNFS extensions to NFSv4. The
>>>>>> nominal plan is to "gather in the hotel lobby bar in the
>> afternoon".
>>>>>> Maybe some details are needed? :-)
>>>>>>
>>>>>> The NFSv4 IETF meeting is Tues Aug 3 9-11:30 am in the Marina II
>>>>>> meeting room of East Tower (first building on the only
>> road into the
>>>>>> "Island") of the Sheraton Harbor Island
>>>>>> (www.ietf.org/meetings/hotels_60.html, "across the
>> street" from the
>>>>>> airport). Some of us will be in that meeting, but not
>> all, and IETF
>>>>>> registration onsite is over $600, so some pNFS attendees will
>>>>>> probably
>>>>>> not be registered.
>>>>>>
>>>>>> The hotel lobby bar is Quinn's -- it is behind reception on the
>>>>>> right
>>>>>> as you pass reception. It is not a big place, so
>> depending on how
>>>>>> many
>>>>>> IETF attendees prefer a bar to a meeting room, we might have a
>>>>>> problem.
>>>>>>
>>>>>> I thought we should at least have a good starting point,
>> so I made a
>>>>>> reservation for 10 people (oops, I'll have to fix that)
>> at 1pm in
>>>>>> the
>>>>>> Harbor's Edge restaurant -- in the lobby of the East Tower behind
>>>>>> reception on the left and a little around the corner. Perhaps we
>>>>>> should start at noon, but I thought I'd leave a little
>> time for the
>>>>>> NFSv4 attendees to "debrief" after the NFSv4 meeting
>> before starting
>>>>>> this one.
>>>>>>
>>>>>> Hopefully enough of us can hold off eating lunch until
>> 1pm so that
>>>>>> the
>>>>>> restaurant treats us nicely. Once we think we have exhausted our
>>>>>> welcome there, we can drift over to Quinn's and drink
>> ourselves into
>>>>>> technical agreement on pNFS.
>>>>>>
>>>>>> Attendees I expect at this time: Tom Talpay, Peter Corbett, Dave
>>>>>> Noveck, David Black, Julian Satran, Lee Ward, Tyce McLarty, Bill
>>>>>> Loewe,
>>>>>> Charles Fan, Brent Welch, Garth Gibson.
>>>>>>
>>>>>> Please let us know if you're coming and are not on this list.
>>>>>>
>>>>>> Thanks!
>>>>>> garth
>>>>>>
>>>>>> Begin forwarded message:
>>>>>>
>>>>>>> From: Garth Gibson <garth@panasas.com>
>>>>>>> Date: July 8, 2004 9:13:40 AM PDT
>>>>>>> To: pnfs-reqs@yahoogroups.com
>>>>>>> Subject: [pnfs-reqs] Re: planning future meetings
>>>>>>> Reply-To: pnfs-reqs@yahoogroups.com
>>>>>>>
>>>>>>> Here is what I think is on the table for comments/interest from
>>>>>>> this
>>>>>>> community:
>>>>>>>
>>>>>>> Tues Aug 3, IETF 60, San Diego:
>>>>>>> - IETF NFSv4 meeting is in the morning
>>>>>>> - in the afternoon, and evening if appropriate, interested and
>>>>>>> available pNFS participants gather in the hotel lobby bar. the
>>>>>>> goal
>>>>>>> is
>>>>>>> to get 4-6 hours of meeting time to justify those of us
>> traveling
>>>>>>> to
>>>>>>> San Diego that will not otherwise attend the IETF
>> meeting. we might
>>>>>>> try
>>>>>>> to arrange a room in a local chinese restaurant, for
>> example, but
>>>>>>> we
>>>>>>> do
>>>>>>> not expect any meeting rooms will be available at the
>> IETF hotel
>>>>>>> and
>>>>>>> we
>>>>>>> do not want to create an officially conflicting meeting.
>>>>>>> - IETF schedule is not final and may change as late as 2 weeks
>>>>>>> before
>>>>>>> the meeting, so people's Tues afternoon may change. this is in
>>>>>>> part
>>>>>>> why we are suggesting that the meeting may go into the evening.
>>>>>>> staying the night is not a bad thing for most of us as the RDDP
>>>>>>> meeting
>>>>>>> is likely to be first thing on Wed morning anyway.
>>>>>>>
>>>>>>> Would folks please speak up on this list if they are going to
>>>>>>> attend
>>>>>>> this meeting.
>>>>>>>
>>>>>>> Thurs Sept 30, Pennsylvania:
>>>>>>> - an informal face-to-face in the Pittsburgh area, at
>> either CMU or
>>>>>>> Nemacolin (for those who collaborate with or support CMU's PDL,
>>>>>>> this
>>>>>>> is
>>>>>>> the day after the PDL retreat)
>>>>>>> - this might be the last informal pNFS meeting done outside the
>>>>>>> NFSv4
>>>>>>> TWG schedule as we hope to soon convince the NFSv4
>> group to take
>>>>>>> us in
>>>>>>>
>>>>>>> week of Nov 8, IETF 61, Washington DC
>>>>>>> - similar to the Aug 3 meeting, a meeting in the bar or in a
>>>>>>> restaurant
>>>>>>> during the IETF meeting week
>>>>>>>
>>>>>>> garth
>>>>>>>
>>>>>>>
>>>>>>> On Jul 1, 2004, at 1:25 PM, Garth Gibson wrote:
>>>>>>>
>>>>>>>> Folks,
>>>>>>>>
>>>>>>>> We've gotten behind on meeting plans.
>>>>>>>>
>>>>>>>> The next IETF NFSv4 meeting is the morning of Tues Aug 3 in San
>>>>>>>> Diego.
>>>>>>>> See the IETF agenda at
>> http://www.ietf.org/meetings/agenda_60.txt
>>>>>>>>
>>>>>>>> Hopefully pNFS will get some time on that agenda, but
>> it will be a
>>>>>>>> small amount at most. What I'd like to propose is
>> that we meet
>>>>>>>> f-2-f
>>>>>>>> on Aug 2 or 3 (my preference is the afternoon on Aug 3) in San
>>>>>>>> Diego.
>>>>>>>> I doubt we'll get a room in the IETF hotel (Sheraton San Diego
>>>>>>>> Hotel
>>>>>>>> and Marina), so I'm also looking for proposals for
>> where to hold
>>>>>>>> the
>>>>>>>> meeting.
>>>>>>>>
>>>>>>>> Since timing is tight on this, please reply soon.
>>>>>>>>
>>>>>>>> And while you are thinking about your schedule, lets
>> talk about
>>>>>>>> the
>>>>>>>> meeting after. The next IETF is Nov 7-12 in
>> Washington DC (which
>>>>>>>> overlaps SC04 in Pittsburgh). Again we should target
>> a piece of
>>>>>>>> the
>>>>>>>> not-very-long NFSv4 meeting and decide if we want to
>> also have a
>>>>>>>> f-2-f.
>>>>>>>>
>>>>>>>> I'm happy to propose a day long interim meeting at CMU in
>>>>>>>> Pittsburgh
>>>>>>>> in
>>>>>>>> September, before the cut off dates to submit for the IETF
>>>>>>>> meeting in
>>>>>>>> Nov.
>>>>>>>>
>>>>>>>> Opinions? Date suggestions? Alternatives?
>>>>>>>>
>>>>>>>> garth
>>>>>>>>
>>>>>>>>
>>>>>>>>
>>>>>>>>
>>>>>>>>
>>>>>>>> Yahoo! Groups Links
>>>>>>>>
>>>>>>>>
>>>>>>>>
>>>>>>>>
>>>>>>>
>>>>>>>
>>>>>>>
>>>>>>>
>>>>>>>
>>>>>>> Yahoo! Groups Links
>>>>>>>
>>>>>>>
>>>>>>>
>>>>>>>
>>>>>>
>>>>>>
>>>>>>
>>>>>>
>>>>>>
>>>>>> Yahoo! Groups Links
>>>>>>
>>>>>>
>>>>>>
>>>>>>
>>>>>
>>>>>
>>>>>
>>>>>
>>>>>
>>>>> Yahoo! Groups Links
>>>>>
>>>>>
>>>>>
>>>>>
>>>>>
>>>>
>>>
>>>
>>>
>>>
>>> Yahoo! Groups Links
>>>
>>>
>>>
>>>
>>
>>
>>
>> ------------------------ Yahoo! Groups Sponsor
>> --------------------~-->
>> Yahoo! Domains - Claim yours for only $14.70
>> http://us.click.yahoo.com/Z1wmxD/DREIAA/yQLSAA/W6uqlB/TM
>> --------------------------------------------------------------
>> ------~->
>>
>>
>> Yahoo! Groups Links
>>
>>
>>
>>
>>
>
>
>
>
> Yahoo! Groups Links
>
>
>
>

From mclarty3@llnl.gov Mon Aug 02 15:14:33 2004
Return-Path: <mclarty3@llnl.gov>
X-Sender: mclarty3@llnl.gov
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 15314 invoked from network); 2 Aug 2004 22:14:32 -0000
Received: from unknown (66.218.66.172)
by m25.grp.scd.yahoo.com with QMQP; 2 Aug 2004 22:14:32 -0000
Received: from unknown (HELO smtp-2.llnl.gov) (128.115.250.82)
by mta4.grp.scd.yahoo.com with SMTP; 2 Aug 2004 22:14:31 -0000
Received: from mailfe-1.llnl.gov (localhost [127.0.0.1])
by smtp-2.llnl.gov (8.12.3p2-20030917/8.12.3/LLNL evision: 1.15 $) with ESMTP id i72MEUUw004543
for <pnfs-reqs@yahoogroups.com>; Mon, 2 Aug 2004 15:14:31 -0700 (PDT)
Received: from POLARBEAR.llnl.gov ([134.9.18.59] verified)
by mailfe-1.llnl.gov (CommuniGate Pro SMTP 4.1.8)
with ESMTP id 4788222 for pnfs-reqs@yahoogroups.com; Mon, 02 Aug 2004 15:14:30 -0700
Message-Id: <5.0.0.25.2.20040802145754.027006f8@poptop.llnl.gov>
X-Sender: e002801@poptop.llnl.gov
X-Mailer: QUALCOMM Windows Eudora Version 5.0
Date: Mon, 02 Aug 2004 15:14:30 -0700
To: pnfs-reqs@yahoogroups.com
In-Reply-To: <200407300222.i6U2Msr27922@medlicott.panasas.com>
Mime-Version: 1.0
Content-Type: text/plain; charset="us-ascii"; format=flowed
X-eGroups-Remote-IP: 128.115.250.82
From: Tyce McLarty <mclarty3@llnl.gov>
Subject: Re: [pnfs-reqs] pnfs ops, draft of June 8
X-Yahoo-Group-Post: member; u=169320772
X-Yahoo-Profile: mclarty3

Brent,

This is my first serious reading of the ops document, so I apologize for
replowing ground that you have covered before. I have not gotten to the
end, but I have a couple of basic questions that will impact much of what
follows.

The defintion of Layout says Layout = "map" which I did not think was a
good idea. Later I found it is not quite true either.

In Section 2 paragraph 1: It seems like it would be nice symmetry if the
block device layout had an aggregation map as well as OBD and file. If we
call the "block number and block count" part of the layout an aggregation
map, then all three look much the same. The layouts all have a device-id, a
logical-id of some sort, and an aggregation map - symmetry. Must be a good
reason not to do this because it seems too simple.

In Section 2 paragraph 3: where it gets into the details of aggragation
maps, layout used is where it would not make sense to mean LAYOUT, so we
need some other descriptive phrase, like "data distributions". We need to
be careful about using one of our defined terms in a generic way to avoid
confusion. Also the mention of nesting sounds very powerful, but needs to
be more specific about whether it is just maps or can LAYOUTs be nested as
well.

Regards,
Tyce

At 07:22 PM 7/29/2004, you wrote:
>Here is an update to my ops draft based on the June 7 meeting.
>I apologize for not sending it out promptly, but here 'tis.
>I do not include the discussion of why things look as they do now,
>but you will notice a slimmer set of ops. Much slimmer. I could
>upload this to the yahoo groups web site, but most of you may find
>simple inline text easier to deal with :-)
>
>The ops boil down to:
>7.1 LAYOUTGET - Get Layout Information 9
>7.2 LAYOUTCOMMIT - Commit writes made using a layout 12
>7.3 LAYOUTRETURN - Release Layout Information 14
>8.1 CB_LAYOUTRECALL 14
>
>Food for thought next Tuesday - see you then.
>If you have feedback before then, I'll have time Monday evening
>to go through this.
>--
>Brent Welch
>Software Architect, Panasas Inc
>Delivering the premier storage system for scalable Linux clusters
>
>www.panasas.com
>welch@panasas.com
>
>
>
>
>
>
>
>Yahoo! Groups Links
>
>
>
>

From dhildebz@eecs.umich.edu Mon Aug 02 17:16:43 2004
Return-Path: <dhildebz@eecs.umich.edu>
X-Sender: dhildebz@eecs.umich.edu
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 49060 invoked from network); 3 Aug 2004 00:16:43 -0000
Received: from unknown (66.218.66.166)
by m24.grp.scd.yahoo.com with QMQP; 3 Aug 2004 00:16:43 -0000
Received: from unknown (HELO smtp.eecs.umich.edu) (141.213.4.43)
by mta5.grp.scd.yahoo.com with SMTP; 3 Aug 2004 00:16:42 -0000
Received: from oemcomputer (pcp09767352pcs.albqrq01.nm.comcast.net [69.240.68.165])
(authenticated bits=0)
by smtp.eecs.umich.edu (8.13.0/8.13.0) with ESMTP id i730Gbee026982
(version=TLSv1/SSLv3 cipher=RC4-MD5 bits=128 verify=NO)
for <pnfs-reqs@yahoogroups.com>; Mon, 2 Aug 2004 20:16:38 -0400
Message-ID: <010f01c478ee$a0a0d160$a544f045@oemcomputer>
To: <pnfs-reqs@yahoogroups.com>
References: <200407300222.i6U2Msr27922@medlicott.panasas.com>
Date: Mon, 2 Aug 2004 18:12:49 -0600
MIME-Version: 1.0
Content-Type: multipart/alternative; boundary="----=_NextPart_000_010C_01C478BC.51C4E720"
X-Priority: 3
X-MSMail-Priority: Normal
X-Mailer: Microsoft Outlook Express 6.00.2800.1437
X-MimeOLE: Produced By Microsoft MimeOLE V6.00.2800.1441
X-Spam-Status: No -- Hits: -9.396 Required: 5
X-Spam-Summary: BAYES_00,HTML_50_60,HTML_MESSAGE,HTML_TAG_EXIST_TBODY,MAILTO_WITH_SUBJ
X-Scanned-By: MIMEDefang 2.42
X-eGroups-Remote-IP: 141.213.4.43
From: "Dean Hildebrand" <dhildebz@eecs.umich.edu>
Subject: Re: [pnfs-reqs] pnfs ops, draft of June 8
X-Yahoo-Group-Post: member; u=169352062
X-Yahoo-Profile: seattleplus

ADVERTISEMENT
Since I can't make the pNFS meeting, here are my comments about the new ops document. Thanks Brent.

This has been batted around a lot, but the document refers to specific storage types, limiting its scope. Section 4 talks about enumerating layout types. I think that is great as background information, but I believe the layout should be opaque, not opaque per layout type. It seems unecessary to bless any layouts.
Meaning
pnfs_layout4
union pnfs_layout4 switch (pnfs_layouttype4 type) {
default:
opaque layout_data<>;
};

would become

pnfs_layout4 {
default:
opaque layout_data<>;
};

Having pnfs_layouttype4 is ok as long as it is not interpreted by NFS.

Also, here is LAYOUTGET4args
struct LAYOUTGET4args {
/* CURRENT_FH: file */
pnfs_stortype4 storage_type;
layoutget_iomode4 iomode;
layoutget_sharemode4 sharemode;
offset4 offset;
length4 length;
};

I think we will need to add some type of cookie identifiers like READDIR to handle large layout maps. I need to spend some more time thinking about this, but there will be a size limitation on the layout blob. We could even create a new mount option like rsize/wsize or something.

Also, if LAYOUTGET is to return a stateid, you need to send a clientid in the request
struct layout_owner4 {
clientid4 clientid;
opaque owner<NFS4_OPAQUE_LIMIT>;
};

Also in Section 3, where it talks about security information. Are people thinking that the security information would be passed back in the opaque layout or would it somehow fit into the existing NFS security infrastructure? I find the file layout blob is turning into a more general data access information blob.

I have another question about Section 9.2
>>>>
9.2 Multiple Reads to a File

Client does an OPEN to get a file handle.
Client does a LAYOUTGET for a range of the file, gets back a layout.
Client uses the storage protocol and the layout to access the file.
Client closes stateID and with CLOSE.

Client does an OPEN to get a file handle.
Client finds cached layout associated with file handle.
Client uses the storage protocol and the layout to access the file.
Client closes stateID and with CLOSE.

A bit more interesting as we've saved the LAYOUTGET operation, but we
are still doing server round-trips.
>>>>>
There seems to be an issue of when the layout blob cache expires. When does the LAYOUTRETURN occur? I guess this fits into the layout delegation conversations. Since the first client closes the file, I doubt any layout information will be around for client 2 as the close removes all state.


In section 9.3
>>>
9.3 Multiple Reads to a File with Delegations

Client does an OPEN to get a file handle and an open delegation.
Client does a LAYOUTGET for a range of the file, gets back a layout.
Client uses the storage protocol and the layout to access the file.
Application does a close(), but client keeps state under the delegation.
(time passes)
Application does another open(), which client handles under the delegation.
Client finds cached layout associated with file handle.
Client uses the storage protocol and the layout to access the file.
(pattern continues until open delegation and/or layout is recalled)

This illustrates the efficiency of combining open delegations and layouts
to eliminate interactions with the file server altogether.
>>>
If we are talking open delegations, then this example can't happen because delegations are OPEN to CLOSE. This could work if client 1 doesn't close the file.

Dean

----- Original Message -----
From: Brent Welch
To: pnfs-reqs@yahoogroups.com
Sent: Thursday, July 29, 2004 8:22 PM
Subject: [pnfs-reqs] pnfs ops, draft of June 8


Here is an update to my ops draft based on the June 7 meeting.
I apologize for not sending it out promptly, but here 'tis.
I do not include the discussion of why things look as they do now,
but you will notice a slimmer set of ops. Much slimmer. I could
upload this to the yahoo groups web site, but most of you may find
simple inline text easier to deal with :-)

The ops boil down to:
7.1 LAYOUTGET - Get Layout Information 9
7.2 LAYOUTCOMMIT - Commit writes made using a layout 12
7.3 LAYOUTRETURN - Release Layout Information 14
8.1 CB_LAYOUTRECALL 14

Food for thought next Tuesday - see you then.
If you have feedback before then, I'll have time Monday evening
to go through this.
--
Brent Welch
Software Architect, Panasas Inc
Delivering the premier storage system for scalable Linux clusters

www.panasas.com
welch@panasas.com




Yahoo! Groups Sponsor
ADVERTISEMENT





------------------------------------------------------------------------------
Yahoo! Groups Links

a.. To visit your group on the web, go to:
http://groups.yahoo.com/group/pnfs-reqs/

b.. To unsubscribe from this group, send an email to:
pnfs-reqs-unsubscribe@yahoogroups.com

c.. Your use of Yahoo! Groups is subject to the Yahoo! Terms of Service.


From dhildebz@eecs.umich.edu Tue Aug 03 11:12:54 2004
Return-Path: <dhildebz@eecs.umich.edu>
X-Sender: dhildebz@eecs.umich.edu
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 49939 invoked from network); 3 Aug 2004 18:12:53 -0000
Received: from unknown (66.218.66.172)
by m21.grp.scd.yahoo.com with QMQP; 3 Aug 2004 18:12:53 -0000
Received: from unknown (HELO willow.eecs.umich.edu) (141.213.4.14)
by mta4.grp.scd.yahoo.com with SMTP; 3 Aug 2004 18:12:53 -0000
Received: from willow.eecs.umich.edu (localhost.eecs.umich.edu [127.0.0.1])
by willow.eecs.umich.edu (8.12.11/8.12.11) with ESMTP id i73ICpKw022514
(version=TLSv1/SSLv3 cipher=DHE-RSA-AES256-SHA bits=256 verify=NO)
for <pnfs-reqs@yahoogroups.com>; Tue, 3 Aug 2004 14:12:51 -0400
Received: from localhost (dhildebz@localhost)
by willow.eecs.umich.edu (8.12.11/8.12.11/Submit) with ESMTP id i73ICpRa022511
for <pnfs-reqs@yahoogroups.com>; Tue, 3 Aug 2004 14:12:51 -0400
Date: Tue, 3 Aug 2004 14:12:51 -0400 (EDT)
To: pnfs-reqs@yahoogroups.com
In-Reply-To: <200407300222.i6U2Msr27922@medlicott.panasas.com>
Message-ID: <Pine.LNX.4.58.0408031410280.21744@willow.eecs.umich.edu>
References: <200407300222.i6U2Msr27922@medlicott.panasas.com>
MIME-Version: 1.0
Content-Type: TEXT/PLAIN; charset=US-ASCII
X-eGroups-Remote-IP: 141.213.4.14
From: Dean Hildebrand <dhildebz@eecs.umich.edu>
Subject: Re: [pnfs-reqs] pnfs ops, draft of June 8
X-Yahoo-Group-Post: member; u=169352062
X-Yahoo-Profile: seattleplus

Reading the document again, I see the following info under LAYOUTGET:
>>
IMPLEMENTATION

Typically, LAYOUTGET will be called as part of a compound RPC after an
OPEN operation and results in the client having location information
for the file. The client specifies a storage type that limits what kind
of layout the server will return. This prevents servers from issuing
layouts that are unusable by the client.
>>

A LAYOUTGET cannot be part of the same compound RPC as OPEN since it
requires an offset and length. We have to pay the extra round trip price
unless a map for the entire file is requested (maybe on O_CREAT).
Dean


On Thu, 29 Jul 2004, Brent Welch wrote:

> Here is an update to my ops draft based on the June 7 meeting.
> I apologize for not sending it out promptly, but here 'tis.
> I do not include the discussion of why things look as they do now,
> but you will notice a slimmer set of ops. Much slimmer. I could
> upload this to the yahoo groups web site, but most of you may find
> simple inline text easier to deal with :-)
>
> The ops boil down to:
> 7.1 LAYOUTGET - Get Layout Information 9
> 7.2 LAYOUTCOMMIT - Commit writes made using a layout 12
> 7.3 LAYOUTRETURN - Release Layout Information 14
> 8.1 CB_LAYOUTRECALL 14
>
> Food for thought next Tuesday - see you then.
> If you have feedback before then, I'll have time Monday evening
> to go through this.
> --
> Brent Welch
> Software Architect, Panasas Inc
> Delivering the premier storage system for scalable Linux clusters
>
> www.panasas.com
> welch@panasas.com
>
>
>
>
> Yahoo! Groups Sponsor
> ADVERTISEMENT
> click here
> [rand=631492116]
>
> ________________________________________________________________________________
> Yahoo! Groups Links
> * To visit your group on the web, go to:
> http://groups.yahoo.com/group/pnfs-reqs/
>
> * To unsubscribe from this group, send an email to:
> pnfs-reqs-unsubscribe@yahoogroups.com
>
> * Your use of Yahoo! Groups is subject to the Yahoo! Terms of Service.
>
>

From bwelch@panasas.com Fri Aug 06 18:39:28 2004
Return-Path: <welch@panasas.com>
X-Sender: welch@panasas.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 16176 invoked from network); 7 Aug 2004 01:39:27 -0000
Received: from unknown (66.218.66.218)
by m24.grp.scd.yahoo.com with QMQP; 7 Aug 2004 01:39:27 -0000
Received: from unknown (HELO medlicott.panasas.com) (63.80.58.202)
by mta3.grp.scd.yahoo.com with SMTP; 7 Aug 2004 01:39:27 -0000
Received: from panasas.com (welch@localhost)
by medlicott.panasas.com (8.11.6/8.11.6) with ESMTP id i771dRY29224
for <pnfs-reqs@yahoogroups.com>; Fri, 6 Aug 2004 18:39:27 -0700
Message-Id: <200408070139.i771dRY29224@medlicott.panasas.com>
X-Authentication-Warning: medlicott.panasas.com: welch owned process doing -bs
X-Mailer: exmh version 2.7.1 07/28/2004 with nmh-1.0.4
To: pnfs-reqs@yahoogroups.com
In-reply-to: <Pine.LNX.4.58.0408031410280.21744@willow.eecs.umich.edu>
References: <200407300222.i6U2Msr27922@medlicott.panasas.com>
<Pine.LNX.4.58.0408031410280.21744@willow.eecs.umich.edu>
Comments: In-reply-to Dean Hildebrand <dhildebz@eecs.umich.edu>
message dated "Tue, 03 Aug 2004 14:12:51 -0400."
X-URL: http://www.panasas.com/
X-Face: "HxE|?EnC9fVMV8f70H83&{fgLE.|FZ^$>@Q(yb#N,Eh~N]e&]=>
r5~UnRml1:4EglY{9B+
:'wJq$@c_C!l8@<$t,{YUr4K,QJGHSvS~U]H`<+L*x?eGzSk>XH\W:AK\j?@?c1o<k;j'Ei/UL)!*0
ILwSR)J\bc)gjz!rrGQ2#i*f:M:ydhK}jp4dWQW?;0{,#iWrCV$4~%e/3)$1/D
Mime-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Date: Fri, 06 Aug 2004 18:39:27 -0700
X-eGroups-Remote-IP: 63.80.58.202
X-eGroups-From: Brent Welch <welch@panasas.com>
From: Brent Welch <bwelch@panasas.com>
Subject: Re: [pnfs-reqs] pnfs ops, draft of June 8
X-Yahoo-Group-Post: member; u=169551413
X-Yahoo-Profile: brent_welch_1960

Clients can ask for layout information with any offset and length
when they first access a file. It never hurts to ask for
too much as the server would simply return a layout for less if the
file wasn't that big. Similarly, for newly created files
the client can ask for a write-layout for the first chunk of the file
they expect to write.

>>>Dean Hildebrand said:
> Reading the document again, I see the following info under LAYOUTGET:
> >>
> IMPLEMENTATION
>
> Typically, LAYOUTGET will be called as part of a compound RPC after an
> OPEN operation and results in the client having location information
> for the file. The client specifies a storage type that limits what kind
> of layout the server will return. This prevents servers from issuing
> layouts that are unusable by the client.
> >>
>
> A LAYOUTGET cannot be part of the same compound RPC as OPEN since it
> requires an offset and length. We have to pay the extra round trip price
> unless a map for the entire file is requested (maybe on O_CREAT).
> Dean
>
>
> On Thu, 29 Jul 2004, Brent Welch wrote:
>
> > Here is an update to my ops draft based on the June 7 meeting.
> > I apologize for not sending it out promptly, but here 'tis.
> > I do not include the discussion of why things look as they do now,
> > but you will notice a slimmer set of ops. Much slimmer. I could
> > upload this to the yahoo groups web site, but most of you may find
> > simple inline text easier to deal with :-)
> >
> > The ops boil down to:
> > 7.1 LAYOUTGET - Get Layout Information 9
> > 7.2 LAYOUTCOMMIT - Commit writes made using a layout 12
> > 7.3 LAYOUTRETURN - Release Layout Information 14
> > 8.1 CB_LAYOUTRECALL 14
> >
> > Food for thought next Tuesday - see you then.
> > If you have feedback before then, I'll have time Monday evening
> > to go through this.
> > --
> > Brent Welch
> > Software Architect, Panasas Inc
> > Delivering the premier storage system for scalable Linux clusters
> >
> > www.panasas.com
> > welch@panasas.com
> >
> >
> >
> >
> > Yahoo! Groups Sponsor
> > ADVERTISEMENT
> > click here
> > [rand=631492116]
> >
> > _______________________________________________________________________
___
______
> > Yahoo! Groups Links
> > * To visit your group on the web, go to:
> > http://groups.yahoo.com/group/pnfs-reqs/
> >
> > * To unsubscribe from this group, send an email to:
> > pnfs-reqs-unsubscribe@yahoogroups.com
> >
> > * Your use of Yahoo! Groups is subject to the Yahoo! Terms of
Service.
> >
> >
>
>
>
>
> Yahoo! Groups Links
>
>
>
>
>

--
Brent Welch
Software Architect, Panasas Inc
Delivering the premier storage system for scalable Linux clusters

www.panasas.com
welch@panasas.com


From julian_satran@il.ibm.com Sun Aug 08 10:07:15 2004
Return-Path: <Julian_Satran@il.ibm.com>
X-Sender: Julian_Satran@il.ibm.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 48917 invoked from network); 8 Aug 2004 17:07:14 -0000
Received: from unknown (66.218.66.218)
by m21.grp.scd.yahoo.com with QMQP; 8 Aug 2004 17:07:14 -0000
Received: from unknown (HELO mtagate2.de.ibm.com) (195.212.29.151)
by mta3.grp.scd.yahoo.com with SMTP; 8 Aug 2004 17:07:13 -0000
Received: from d12nrmr1507.megacenter.de.ibm.com (d12nrmr1507.megacenter.de.ibm.com [9.149.167.1])
by mtagate2.de.ibm.com (8.12.10/8.12.10) with ESMTP id i78H5rgB151088
for <pnfs-reqs@yahoogroups.com>; Sun, 8 Aug 2004 17:05:53 GMT
Received: from d12ml102.megacenter.de.ibm.com (d12av02.megacenter.de.ibm.com [9.149.165.228])
by d12nrmr1507.megacenter.de.ibm.com (8.12.10/NCO/VER6.6) with ESMTP id i78H5qVA213216
for <pnfs-reqs@yahoogroups.com>; Sun, 8 Aug 2004 19:05:52 +0200
In-Reply-To: <Pine.LNX.4.58.0408031410280.21744@willow.eecs.umich.edu>
To: pnfs-reqs@yahoogroups.com
MIME-Version: 1.0
X-Mailer: Lotus Notes Release 6.5.2 June 01, 2004
Message-ID: <OFABE73B1C.E89B8280-ON88256EEA.0041C1C7-88256EEA.005DE63F@il.ibm.com>
Date: Sun, 8 Aug 2004 20:05:44 +0300
X-MIMETrack: Serialize by Router on D12ML102/12/M/IBM(Release 6.5.1| March 5, 2004) at
08/08/2004 20:05:51,
Serialize complete at 08/08/2004 20:05:51
Content-Type: multipart/alternative; boundary="=_alternative 004328C588256EEA_="
X-eGroups-Remote-IP: 195.212.29.151
From: Julian Satran <julian_satran@il.ibm.com>
Subject: Re: [pnfs-reqs] pnfs ops, draft of June 8
X-Yahoo-Group-Post: member; u=64714603
X-Yahoo-Profile: julian_satran

 Following this thread and the F2F discussion it became clear that for the block mode the model you are considering for allocation is having the matadata do specific allocation per file "withing the confines" of a layout. Although this model keeps the optimisation of the layout with the metadata server it is hardly the only one possible and arguably scalable (e.g., creating temporary and/or small files involves always an interaction with the MDS). An alternative mode that could be acceptable and is used in the block mode is to make allocations "global - per FS and client" and delegate the exact allocation to the client. This is a scheme that is more scalable and avoids some critical privacy pittfals (like keeping track which allocations are committed to avoid applications getting to read "old/different" file data.

To keep the delegation model almost unchanged this type of allocation can be achieve by delegating an allocation map per device and have the clients use it with only occasional releases.

Regards,
Julo


Dean Hildebrand <dhildebz@eecs.umich.edu>

03/08/04 11:12
Please respond to
pnfs-reqs

	
To
	pnfs-reqs@yahoogroups.com
cc
	
Subject
	Re: [pnfs-reqs] pnfs ops, draft of June 8

	




Reading the document again, I see the following info under LAYOUTGET:
>>
IMPLEMENTATION

Typically, LAYOUTGET will be called as part of a compound RPC after an
OPEN operation and results in the client having location information
for the file. The client specifies a storage type that limits what kind
of layout the server will return.  This prevents servers from issuing
layouts that are unusable by the client.
>>

A LAYOUTGET cannot be part of the same compound RPC as OPEN since it
requires an offset and length.  We have to pay the extra round trip price
unless a map for the entire file is requested (maybe on O_CREAT).
Dean


On Thu, 29 Jul 2004, Brent Welch wrote:

> Here is an update to my ops draft based on the June 7 meeting.
> I apologize for not sending it out promptly, but here 'tis.
> I do not include the discussion of why things look as they do now,
> but you will notice a slimmer set of ops.  Much slimmer.  I could
> upload this to the yahoo groups web site, but most of you may find
> simple inline text easier to deal with :-)
>
> The ops boil down to:
> 7.1 LAYOUTGET - Get Layout Information      9
> 7.2 LAYOUTCOMMIT - Commit writes made using a layout      12
> 7.3 LAYOUTRETURN - Release Layout Information      14
> 8.1 CB_LAYOUTRECALL      14
>
> Food for thought next Tuesday - see you then.
> If you have feedback before then, I'll have time Monday evening
> to go through this.
> --
> Brent Welch
> Software Architect, Panasas Inc
> Delivering the premier storage system for scalable Linux clusters
>
> www.panasas.com
> welch@panasas.com
>
>
>
>
> Yahoo! Groups Sponsor
> ADVERTISEMENT
> click here
> [rand=631492116]
>
> ________________________________________________________________________________
> Yahoo! Groups Links
>  *  To visit your group on the web, go to:
>     http://groups.yahoo.com/group/pnfs-reqs/
>
>  *  To unsubscribe from this group, send an email to:
>     pnfs-reqs-unsubscribe@yahoogroups.com
>
>  *  Your use of Yahoo! Groups is subject to the Yahoo! Terms of Service.
>
>


------------------------ Yahoo! Groups Sponsor --------------------~-->
Make a clean sweep of pop-up ads. Yahoo! Companion Toolbar.
Now with Pop-Up Blocker. Get it for free!
http://us.click.yahoo.com/L5YrjA/eSIIAA/yQLSAA/W6uqlB/TM
--------------------------------------------------------------------~->


Yahoo! Groups Links

<*> To visit your group on the web, go to:
   http://groups.yahoo.com/group/pnfs-reqs/

<*> To unsubscribe from this group, send an email to:
   pnfs-reqs-unsubscribe@yahoogroups.com

<*> Your use of Yahoo! Groups is subject to:
   http://docs.yahoo.com/info/terms/

From garth@panasas.com Thu Aug 12 13:23:11 2004
Return-Path: <garth@panasas.com>
X-Sender: garth@panasas.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 48091 invoked from network); 12 Aug 2004 20:23:09 -0000
Received: from unknown (66.218.66.218)
by m5.grp.scd.yahoo.com with QMQP; 12 Aug 2004 20:23:09 -0000
Received: from unknown (HELO PIKES.panasas.com) (65.194.124.178)
by mta3.grp.scd.yahoo.com with SMTP; 12 Aug 2004 20:23:09 -0000
Received: from [127.0.0.1] ([172.17.19.3]) by PIKES.panasas.com with SMTP (Microsoft Exchange Internet Mail Service Version 5.5.2653.13)
id 2B56WCTD; Thu, 12 Aug 2004 16:23:03 -0400
Mime-Version: 1.0 (Apple Message framework v618)
In-Reply-To: <OFABE73B1C.E89B8280-ON88256EEA.0041C1C7-88256EEA.005DE63F@il.ibm.com>
References: <OFABE73B1C.E89B8280-ON88256EEA.0041C1C7-88256EEA.005DE63F@il.ibm.com>
Content-Type: text/plain; charset=US-ASCII; delsp=yes; format=flowed
Message-Id: <4AB38983-EC76-11D8-BADC-000A95A94F04@panasas.com>
Content-Transfer-Encoding: 7bit
Date: Thu, 12 Aug 2004 08:43:01 -0700
To: pnfs-reqs@yahoogroups.com
X-Mailer: Apple Mail (2.618)
X-eGroups-Remote-IP: 65.194.124.178
From: Garth Gibson <garth@panasas.com>
Subject: Re: [pnfs-reqs] pnfs ops, draft of June 8
X-Yahoo-Group-Post: member; u=169457820
X-Yahoo-Profile: garth_a_gibson

Delegating an allocation map to the client, allowing the client to
allocate from the map for multiple files open for write, seems to me to
be a large chunk of functionality and recovery specification. I hope
I'm wrong on this, because we are trying for a small amount of NFSv4
semantics that will nevertheless lead to a lot of functionality when
applied to block, object and file storage devices. Can you suggest a
thin but safe specification of this multi-file delegated block
allocation, Julian?

garth

On Aug 8, 2004, at 1:05 PM, Julian Satran wrote:

>
> Following this thread and the F2F discussion it became clear that for
> the block mode the model you are considering for allocation is having
> the matadata do specific allocation per file "withing the confines" of
> a
> layout. Although this model keeps the optimisation of the layout with
> the metadata server it is hardly the only one possible and arguably
> scalable (e.g., creating temporary and/or small files involves always
> an
> interaction with the MDS). An alternative mode that could be acceptable
> and is used in the block mode is to make allocations "global - per FS
> and client" and delegate the exact allocation to the client. This is a
> scheme that is more scalable and avoids some critical privacy pittfals
> (like keeping track which allocations are committed to avoid
> applications getting to read "old/different" file data.
>
> To keep the delegation model almost unchanged this type of allocation
> can be achieve by delegating an allocation map per device and have the
> clients use it with only occasional releases.
>
> Regards,
> Julo
>
>
>
> Dean Hildebrand <dhildebz@eecs.umich.edu>
>
>
> 03/08/04 11:12
>
>
> Please respond to
> pnfs-reqs
>
>
> To
> pnfs-reqs@yahoogroups.com
>
> cc
>
> Subject
> Re: [pnfs-reqs] pnfs ops, draft of June 8
>
>
>
>
>
>
>
> Reading the document again, I see the following info under LAYOUTGET:
>>>
> IMPLEMENTATION
>
> Typically, LAYOUTGET will be called as part of a compound RPC after an
> OPEN operation and results in the client having location information
> for the file. The client specifies a storage type that limits what kind
> of layout the server will return. This prevents servers from issuing
> layouts that are unusable by the client.
>>>
>
> A LAYOUTGET cannot be part of the same compound RPC as OPEN since it
> requires an offset and length. We have to pay the extra round trip
> price
> unless a map for the entire file is requested (maybe on O_CREAT).
> Dean
>
>
> On Thu, 29 Jul 2004, Brent Welch wrote:
>
>> Here is an update to my ops draft based on the June 7 meeting.
>> I apologize for not sending it out promptly, but here 'tis.
>> I do not include the discussion of why things look as they do now,
>> but you will notice a slimmer set of ops. Much slimmer. I could
>> upload this to the yahoo groups web site, but most of you may find
>> simple inline text easier to deal with :-)
>>
>> The ops boil down to:
>> 7.1 LAYOUTGET - Get Layout Information 9
>> 7.2 LAYOUTCOMMIT - Commit writes made using a layout 12
>> 7.3 LAYOUTRETURN - Release Layout Information 14
>> 8.1 CB_LAYOUTRECALL 14
>>
>> Food for thought next Tuesday - see you then.
>> If you have feedback before then, I'll have time Monday evening
>> to go through this.
>> --
>> Brent Welch
>> Software Architect, Panasas Inc
>> Delivering the premier storage system for scalable Linux clusters
>>
>> www.panasas.com
>> welch@panasas.com
>>
>>
>>
>>
>> Yahoo! Groups Sponsor
>> ADVERTISEMENT
>> click here
>> [rand=631492116]
>>
>>
> _______________________________________________________________________
> _
> ________
>> Yahoo! Groups Links
>> * To visit your group on the web, go to:
>> http://groups.yahoo.com/group/pnfs-reqs/
>>
>> * To unsubscribe from this group, send an email to:
>> pnfs-reqs-unsubscribe@yahoogroups.com
>>
>> * Your use of Yahoo! Groups is subject to the Yahoo! Terms of
> Service.
>>
>>
>
>
>
>
> Yahoo! Groups Links
>
>
>
>
>
>
>
>
> Yahoo! Groups Sponsor
>
> ADVERTISEMENT
>
> <http://us.ard.yahoo.com/SIG=129oc6kl0/
> M=298184.5285298.6392945.3001176/
> D=groups/S=1705701014:HM/EXP=1092071235/A=2164331/R=0/SIG=11eaelai9/
> *htt
> p://www.netflix.com/Default?mqso=60183351> click here
>
> <http://us.adserver.yahoo.com/l?M=298184.5285298.6392945.3001176/
> D=group
> s/S=:HM/A=2164331/rand=468494106>
>
> _____
>
> Yahoo! Groups Links
>
>
> * To visit your group on the web, go to:
> http://groups.yahoo.com/group/pnfs-reqs/
> <http://groups.yahoo.com/group/pnfs-reqs/>
>
>
> * To unsubscribe from this group, send an email to:
> pnfs-reqs-unsubscribe@yahoogroups.com
> <mailto:pnfs-reqs-unsubscribe@yahoogroups.com?subject=Unsubscribe>
>
>
> * Your use of Yahoo! Groups is subject to the Yahoo! Terms of
> Service <http://docs.yahoo.com/info/terms/> .
>
>

From julian_satran@il.ibm.com Fri Aug 13 00:32:24 2004
Return-Path: <Julian_Satran@il.ibm.com>
X-Sender: Julian_Satran@il.ibm.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 93450 invoked from network); 13 Aug 2004 07:32:23 -0000
Received: from unknown (66.218.66.218)
by m24.grp.scd.yahoo.com with QMQP; 13 Aug 2004 07:32:23 -0000
Received: from unknown (HELO mtagate2.de.ibm.com) (195.212.29.151)
by mta3.grp.scd.yahoo.com with SMTP; 13 Aug 2004 07:32:21 -0000
Received: from d12nrmr1507.megacenter.de.ibm.com (d12nrmr1507.megacenter.de.ibm.com [9.149.167.1])
by mtagate2.de.ibm.com (8.12.10/8.12.10) with ESMTP id i7D7WGgB140850
for <pnfs-reqs@yahoogroups.com>; Fri, 13 Aug 2004 07:32:16 GMT
Received: from d12ml102.megacenter.de.ibm.com (d12av02.megacenter.de.ibm.com [9.149.165.228])
by d12nrmr1507.megacenter.de.ibm.com (8.12.10/NCO/VER6.6) with ESMTP id i7D7WG2x202734
for <pnfs-reqs@yahoogroups.com>; Fri, 13 Aug 2004 09:32:16 +0200
In-Reply-To: <4AB38983-EC76-11D8-BADC-000A95A94F04@panasas.com>
To: pnfs-reqs@yahoogroups.com
Cc: pnfs-reqs@yahoogroups.com
MIME-Version: 1.0
X-Mailer: Lotus Notes Release 6.5.2 June 01, 2004
Message-ID: <OF46493756.92FA6540-ONC2256EEF.00269E0A-C2256EEF.00296653@il.ibm.com>
Date: Fri, 13 Aug 2004 10:32:10 +0300
X-MIMETrack: Serialize by Router on D12ML102/12/M/IBM(Release 6.5.1| March 5, 2004) at
13/08/2004 10:32:15,
Serialize complete at 13/08/2004 10:32:15
Content-Type: multipart/alternative; boundary="=_alternative 0026A639C2256EEF_="
X-eGroups-Remote-IP: 195.212.29.151
From: Julian Satran <julian_satran@il.ibm.com>
Subject: Re: [pnfs-reqs] pnfs ops, draft of June 8
X-Yahoo-Group-Post: member; u=64714603
X-Yahoo-Profile: julian_satran

ADVERTISEMENT

I will try - Julo


Garth Gibson <garth@panasas.com>

12/08/04 18:43
Please respond to
pnfs-reqs

	
To
	pnfs-reqs@yahoogroups.com
cc
	
Subject
	Re: [pnfs-reqs] pnfs ops, draft of June 8

	




Delegating an allocation map to the client, allowing the client to  
allocate from the map for multiple files open for write, seems to me to  
be a large chunk of functionality and recovery specification.  I hope  
I'm wrong on this, because we are trying for a small amount of NFSv4  
semantics that will nevertheless lead to a lot of functionality when  
applied to block, object and file storage devices.  Can you suggest a  
thin but safe specification of this multi-file delegated block  
allocation, Julian?

garth

On Aug 8, 2004, at 1:05 PM, Julian Satran wrote:

>
>  Following this thread and the F2F discussion it became clear that for
> the block mode the model you are considering for allocation is having
> the matadata do specific allocation per file "withing the confines" of  
> a
> layout. Although this model keeps the optimisation of the layout with
> the metadata server it is hardly the only one possible and arguably
> scalable (e.g., creating temporary and/or small files involves always  
> an
> interaction with the MDS). An alternative mode that could be acceptable
> and is used in the block mode is to make allocations "global - per FS
> and client" and delegate the exact allocation to the client. This is a
> scheme that is more scalable and avoids some critical privacy pittfals
> (like keeping track which allocations are committed to avoid
> applications getting to read "old/different" file data.
>
> To keep the delegation model almost unchanged this type of allocation
> can be achieve by delegating an allocation map per device and have the
> clients use it with only occasional releases.
>
> Regards,
> Julo
>
>
>
> Dean Hildebrand <dhildebz@eecs.umich.edu>
>
>
> 03/08/04 11:12
>
>
> Please respond to
> pnfs-reqs
>
>
> To
> pnfs-reqs@yahoogroups.com
>
> cc
>
> Subject
> Re: [pnfs-reqs] pnfs ops, draft of June 8
>
>                  
>
>
>
>
>
> Reading the document again, I see the following info under LAYOUTGET:
>>>
> IMPLEMENTATION
>
> Typically, LAYOUTGET will be called as part of a compound RPC after an
> OPEN operation and results in the client having location information
> for the file. The client specifies a storage type that limits what kind
> of layout the server will return.  This prevents servers from issuing
> layouts that are unusable by the client.
>>>
>
> A LAYOUTGET cannot be part of the same compound RPC as OPEN since it
> requires an offset and length.  We have to pay the extra round trip
> price
> unless a map for the entire file is requested (maybe on O_CREAT).
> Dean
>
>
> On Thu, 29 Jul 2004, Brent Welch wrote:
>
>> Here is an update to my ops draft based on the June 7 meeting.
>> I apologize for not sending it out promptly, but here 'tis.
>> I do not include the discussion of why things look as they do now,
>> but you will notice a slimmer set of ops.  Much slimmer.  I could
>> upload this to the yahoo groups web site, but most of you may find
>> simple inline text easier to deal with :-)
>>
>> The ops boil down to:
>> 7.1 LAYOUTGET - Get Layout Information      9
>> 7.2 LAYOUTCOMMIT - Commit writes made using a layout      12
>> 7.3 LAYOUTRETURN - Release Layout Information      14
>> 8.1 CB_LAYOUTRECALL      14
>>
>> Food for thought next Tuesday - see you then.
>> If you have feedback before then, I'll have time Monday evening
>> to go through this.
>> --
>> Brent Welch
>> Software Architect, Panasas Inc
>> Delivering the premier storage system for scalable Linux clusters
>>
>> www.panasas.com
>> welch@panasas.com
>>
>>
>>
>>
>> Yahoo! Groups Sponsor
>> ADVERTISEMENT
>> click here
>> [rand=631492116]
>>
>>
> _______________________________________________________________________
> _
> ________
>> Yahoo! Groups Links
>>  *  To visit your group on the web, go to:
>>     http://groups.yahoo.com/group/pnfs-reqs/
>>
>>  *  To unsubscribe from this group, send an email to:
>>     pnfs-reqs-unsubscribe@yahoogroups.com
>>
>>  *  Your use of Yahoo! Groups is subject to the Yahoo! Terms of
> Service.
>>
>>
>
>
>
>
> Yahoo! Groups Links
>
>
>
>
>
>
>
>
> Yahoo! Groups Sponsor                
>
> ADVERTISEMENT
>
> <http://us.ard.yahoo.com/SIG=129oc6kl0/
> M=298184.5285298.6392945.3001176/
> D=groups/S=1705701014:HM/EXP=1092071235/A=2164331/R=0/SIG=11eaelai9/
> *htt
> p://www.netflix.com/Default?mqso=60183351> click here                
>
> <http://us.adserver.yahoo.com/l?M=298184.5285298.6392945.3001176/
> D=group
> s/S=:HM/A=2164331/rand=468494106>                  
>
>   _____
>
> Yahoo! Groups Links
>
>
> *                 To visit your group on the web, go to:
> http://groups.yahoo.com/group/pnfs-reqs/
> <http://groups.yahoo.com/group/pnfs-reqs/>
>
>
> *                 To unsubscribe from this group, send an email to:
> pnfs-reqs-unsubscribe@yahoogroups.com
> <mailto:pnfs-reqs-unsubscribe@yahoogroups.com?subject=Unsubscribe>
>
>
> *                 Your use of Yahoo! Groups is subject to the Yahoo! Terms of
> Service <http://docs.yahoo.com/info/terms/> .
>
>



------------------------ Yahoo! Groups Sponsor --------------------~-->
Make a clean sweep of pop-up ads. Yahoo! Companion Toolbar.
Now with Pop-Up Blocker. Get it for free!
http://us.click.yahoo.com/L5YrjA/eSIIAA/yQLSAA/W6uqlB/TM
--------------------------------------------------------------------~->


Yahoo! Groups Links

<*> To visit your group on the web, go to:
   http://groups.yahoo.com/group/pnfs-reqs/

<*> To unsubscribe from this group, send an email to:
   pnfs-reqs-unsubscribe@yahoogroups.com

<*> Your use of Yahoo! Groups is subject to:
   http://docs.yahoo.com/info/terms/

From black_david@emc.com Mon Sep 13 14:31:59 2004
Return-Path: <Black_David@emc.com>
X-Sender: Black_David@emc.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 90291 invoked from network); 13 Sep 2004 21:31:56 -0000
Received: from unknown (66.218.66.172)
by m1.grp.scd.yahoo.com with QMQP; 13 Sep 2004 21:31:56 -0000
Received: from unknown (HELO mailhub.lss.emc.com) (168.159.2.31)
by mta4.grp.scd.yahoo.com with SMTP; 13 Sep 2004 21:31:55 -0000
Received: from mxic2.corp.emc.com (mxic2.corp.emc.com [128.221.12.9])
by mailhub.lss.emc.com (Switch-3.1.6/Switch-3.1.6) with ESMTP id i8DLVrQY007934
for <pnfs-reqs@yahoogroups.com>; Mon, 13 Sep 2004 17:31:53 -0400 (EDT)
Received: by mxic2.corp.emc.com with Internet Mail Service (5.5.2653.19)
id <S39Y8FWW>; Mon, 13 Sep 2004 17:31:53 -0400
Message-ID: <B459CE1AFFC52D4688B2A5B842CA35EA07E5C8BF@corpmx14.corp.emc.com>
To: pnfs-reqs@yahoogroups.com
Date: Mon, 13 Sep 2004 17:31:43 -0400
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2653.19)
Content-Type: text/plain
X-PMX-Version: 4.6.1.107272, Antispam-Core: 4.6.1.106808, Antispam-Data: 2004.9.13.1
X-PerlMx-Spam: Gauge=, SPAM=0%, Report='EMC_FROM_0 -0, __TLG_EMC_ENVFROM_0 0, __IMS_MSGID 0, __HAS_MSGID 0, __SANE_MSGID 0, NO_REAL_NAME 0, __TO_MALFORMED_2 0, SUBJECT_MONTH 0, SUBJECT_MONTH_2 0, __MIME_VERSION 0, __ANY_IMS_MUA 0, EXCHANGE_SERVER 0, __HAS_X_MAILER 0, __IMS_MUA 0, __CT_TEXT_PLAIN 0, __CT 0, __C230066_P5 0, __MIME_TEXT_ONLY 0, EMC_BODY_1 -5'
X-eGroups-Remote-IP: 168.159.2.31
From: black_david@emc.com
Subject: Pittsburgh meeting Sep. 30th
X-Yahoo-Group-Post: member; u=82420288
X-Yahoo-Profile: dlb237

ADVERTISEMENT
My notes say we scheduled a pNFS face-to-face in
Pittsburgh for Sep. 30th. Is this still a go?

Thanks,
--David

----------------------------------------------------
David L. Black, Senior Technologist
EMC Corporation, 176 South St., Hopkinton, MA 01748
+1 (508) 293-7953 FAX: +1 (508) 293-7786
black_david@emc.com Mobile: +1 (978) 394-7754
----------------------------------------------------

From Thomas.Talpey@netapp.com Tue Sep 14 10:56:00 2004
Return-Path: <Thomas.Talpey@netapp.com>
X-Sender: Thomas.Talpey@netapp.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 10318 invoked from network); 14 Sep 2004 17:55:59 -0000
Received: from unknown (66.218.66.216)
by m13.grp.scd.yahoo.com with QMQP; 14 Sep 2004 17:55:59 -0000
Received: from unknown (HELO mx01.netapp.com) (198.95.226.53)
by mta1.grp.scd.yahoo.com with SMTP; 14 Sep 2004 17:55:59 -0000
Received: from frejya.corp.netapp.com (frejya [10.57.157.119])
by mx01.netapp.com (8.12.10/8.12.10/NTAP-1.4) with ESMTP id i8EHt7FC001026
for <pnfs-reqs@yahoogroups.com>; Tue, 14 Sep 2004 10:55:07 -0700 (PDT)
Received: from svlexc02.hq.netapp.com (svlexc02.corp.netapp.com [10.57.157.136])
by frejya.corp.netapp.com (8.12.9/8.12.9/NTAP-1.5) with ESMTP id i8EHt6t8019725
for <pnfs-reqs@yahoogroups.com>; Tue, 14 Sep 2004 10:55:06 -0700 (PDT)
Received: from lavender.hq.netapp.com ([10.56.11.75]) by svlexc02.hq.netapp.com with Microsoft SMTPSVC(5.0.2195.6713);
Tue, 14 Sep 2004 10:55:06 -0700
Received: from exnane01.hq.netapp.com ([10.97.0.61]) by lavender.hq.netapp.com with Microsoft SMTPSVC(5.0.2195.6713);
Tue, 14 Sep 2004 10:55:06 -0700
Received: from tmt.netapp.com ([10.97.6.31]) by exnane01.hq.netapp.com with Microsoft SMTPSVC(6.0.3790.0);
Tue, 14 Sep 2004 13:55:04 -0400
Message-Id: <6.1.2.0.2.20040914135338.038523a0@exnane01.nane.netapp.com>
X-Nil:
Date: Tue, 14 Sep 2004 13:54:55 -0400
To: pnfs-reqs@yahoogroups.com
In-Reply-To: <B459CE1AFFC52D4688B2A5B842CA35EA07E5C8BF@corpmx14.corp.emc
.com>
References: <B459CE1AFFC52D4688B2A5B842CA35EA07E5C8BF@corpmx14.corp.emc.com>
Mime-Version: 1.0
Content-Type: text/plain; charset="us-ascii"
X-OriginalArrivalTime: 14 Sep 2004 17:55:05.0044 (UTC) FILETIME=[F71C5940:01C49A83]
X-eGroups-Remote-IP: 198.95.226.53
From: "Talpey, Thomas" <Thomas.Talpey@netapp.com>
Subject: Re: [pnfs-reqs] Pittsburgh meeting Sep. 30th
X-Yahoo-Group-Post: member; u=44154239
X-Yahoo-Profile: tmtymailu

At 05:31 PM 9/13/2004, black_david@emc.com wrote:
>
>My notes say we scheduled a pNFS face-to-face in
>Pittsburgh for Sep. 30th. Is this still a go?

I can't go, personally. Given that we've made the decision to
move this into IETF, should we consider cancelling, and planning
something in Washington DC at IETF-61?

Tom.


From garth@panasas.com Tue Sep 14 11:09:32 2004
Return-Path: <garth@panasas.com>
X-Sender: garth@panasas.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 84004 invoked from network); 14 Sep 2004 18:09:31 -0000
Received: from unknown (66.218.66.216)
by m22.grp.scd.yahoo.com with QMQP; 14 Sep 2004 18:09:31 -0000
Received: from unknown (HELO PIKES.panasas.com) (65.194.124.178)
by mta1.grp.scd.yahoo.com with SMTP; 14 Sep 2004 18:09:31 -0000
Received: from [127.0.0.1] ([172.17.19.3]) by PIKES.panasas.com with SMTP (Microsoft Exchange Internet Mail Service Version 5.5.2653.13)
id 2B56Y166; Tue, 14 Sep 2004 14:09:19 -0400
In-Reply-To: <6.1.2.0.2.20040914135338.038523a0@exnane01.nane.netapp.com>
References: <B459CE1AFFC52D4688B2A5B842CA35EA07E5C8BF@corpmx14.corp.emc.com> <6.1.2.0.2.20040914135338.038523a0@exnane01.nane.netapp.com>
Mime-Version: 1.0 (Apple Message framework v619)
Content-Type: text/plain; charset=US-ASCII; format=flowed
Message-Id: <2ECAEB36-0679-11D9-9498-000A95A94F04@panasas.com>
Content-Transfer-Encoding: 7bit
Cc: Garth Gibson <garth@panasas.com>
Date: Tue, 14 Sep 2004 14:09:12 -0400
To: pnfs-reqs@yahoogroups.com
X-Mailer: Apple Mail (2.619)
X-eGroups-Remote-IP: 65.194.124.178
From: Garth Gibson <garth@panasas.com>
Subject: Re: [pnfs-reqs] Pittsburgh meeting Sep. 30th
X-Yahoo-Group-Post: member; u=169457820
X-Yahoo-Profile: garth_a_gibson

This is still a go. CMU arrangements are made. I should send out a
reminder later today.

garth


On Sep 14, 2004, at 1:54 PM, Talpey, Thomas wrote:

> At 05:31 PM 9/13/2004, black_david@emc.com wrote:
>>
>> My notes say we scheduled a pNFS face-to-face in
>> Pittsburgh for Sep. 30th. Is this still a go?
>
> I can't go, personally. Given that we've made the decision to
> move this into IETF, should we consider cancelling, and planning
> something in Washington DC at IETF-61?
>
> Tom.
>
>
>
>
>
> Yahoo! Groups Links
>
>
>
>

From garth@panasas.com Wed Sep 15 14:14:52 2004
Return-Path: <garth@panasas.com>
X-Sender: garth@panasas.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 52594 invoked from network); 15 Sep 2004 21:14:49 -0000
Received: from unknown (66.218.66.172)
by m16.grp.scd.yahoo.com with QMQP; 15 Sep 2004 21:14:49 -0000
Received: from unknown (HELO PIKES.panasas.com) (65.194.124.178)
by mta4.grp.scd.yahoo.com with SMTP; 15 Sep 2004 21:14:48 -0000
Received: from [127.0.0.1] ([172.17.19.3]) by PIKES.panasas.com with SMTP (Microsoft Exchange Internet Mail Service Version 5.5.2653.13)
id 2B56YHRH; Wed, 15 Sep 2004 17:14:46 -0400
Mime-Version: 1.0 (Apple Message framework v619)
Content-Transfer-Encoding: 7bit
Message-Id: <4048ABC8-075C-11D9-9498-000A95A94F04@panasas.com>
Content-Type: text/plain; charset=US-ASCII; format=flowed
To: pnfs-reqs@yahoogroups.com
Date: Wed, 15 Sep 2004 14:14:38 -0700
X-Mailer: Apple Mail (2.619)
X-eGroups-Remote-IP: 65.194.124.178
From: Garth Gibson <garth@panasas.com>
Subject: Sept 30 working meeting in Pittsburgh
X-Yahoo-Group-Post: member; u=169457820
X-Yahoo-Profile: garth_a_gibson

CMU has people who know how to help organize things, so please look
here http://www.pdl.cmu.edu/pNFS/ for the details.

This will be the last non-IETF meeting, and we are requested to put
materials for this meeting into the IETF as internet drafts so they are
visible to all. Future pNFS conversations, after Sept 30, should
happen on the IETF mailing list and face-to-face meetings should be
requests for items on the NFSv4 meeting agendas.

garth



From andros@citi.umich.edu Tue Sep 21 13:37:04 2004
Return-Path: <andros@citi.umich.edu>
X-Sender: andros@citi.umich.edu
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 25309 invoked from network); 21 Sep 2004 20:37:02 -0000
Received: from unknown (66.218.66.167)
by m7.grp.scd.yahoo.com with QMQP; 21 Sep 2004 20:37:02 -0000
Received: from unknown (HELO citi.umich.edu) (141.211.133.111)
by mta6.grp.scd.yahoo.com with SMTP; 21 Sep 2004 20:37:02 -0000
Received: from citi.umich.edu (citi.umich.edu [141.211.133.111])
by citi.umich.edu (Postfix) with ESMTP
id 3C1E21BB05; Tue, 21 Sep 2004 16:37:02 -0400 (EDT)
X-Mailer: exmh version 2.5 07/13/2001 with version: MH 6.8.3 #74[UCI]
To: pnfs-reqs@yahoogroups.com
Cc: andros@citi.umich.edu
Mime-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Date: Tue, 21 Sep 2004 16:37:02 -0400
Message-Id: <20040921203702.3C1E21BB05@citi.umich.edu>
X-eGroups-Remote-IP: 141.211.133.111
From: "William A.(Andy) Adamson" <andros@citi.umich.edu>
Subject: September 30th meeting at CMU
X-Yahoo-Group-Post: member; u=169434965

dean hildebrand and i are attending the September 30th at CMU - just wondering
who else will be there!

-->Andy

From fan@rainfinity.com Tue Sep 21 14:35:28 2004
Return-Path: <fan@rainfinity.com>
X-Sender: fan@rainfinity.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 5929 invoked from network); 21 Sep 2004 21:35:26 -0000
Received: from unknown (66.218.66.218)
by m14.grp.scd.yahoo.com with QMQP; 21 Sep 2004 21:35:26 -0000
Received: from unknown (HELO mail1.rainfinity.com) (128.242.125.75)
by mta3.grp.scd.yahoo.com with SMTP; 21 Sep 2004 21:35:26 -0000
Received: from localhost (localhost.rainfinity.com [127.0.0.1])
by mail1.rainfinity.com (Postfix) with ESMTP
id 7687C104161; Tue, 21 Sep 2004 14:34:40 -0700 (PDT)
Received: from mail1.rainfinity.com ([127.0.0.1])
by localhost (mail1.rainfinity.com [127.0.0.1]) (amavisd-new, port 10024)
with ESMTP id 24363-06; Tue, 21 Sep 2004 14:34:40 -0700 (PDT)
Received: from [192.168.0.47] (hq.rainfinity.com [128.242.125.65])
by mail1.rainfinity.com (Postfix) with ESMTP
id 0232D104152; Tue, 21 Sep 2004 14:34:40 -0700 (PDT)
Message-ID: <41509E7A.5030304@rainfinity.com>
Date: Tue, 21 Sep 2004 14:34:50 -0700
User-Agent: Mozilla Thunderbird 0.6 (Windows/20040502)
X-Accept-Language: en-us, en
MIME-Version: 1.0
To: pnfs-reqs@yahoogroups.com
Cc: andros@citi.umich.edu
References: <20040921203702.3C1E21BB05@citi.umich.edu>
In-Reply-To: <20040921203702.3C1E21BB05@citi.umich.edu>
Content-Type: text/plain; charset=us-ascii; format=flowed
Content-Transfer-Encoding: 7bit
X-Virus-Scanned: by amavisd-new at rainfinity.com
X-eGroups-Remote-IP: 128.242.125.75
From: Chenggong Charles Fan <fan@rainfinity.com>
Subject: Re: [pnfs-reqs] September 30th meeting at CMU
X-Yahoo-Group-Post: member; u=37364696
X-Yahoo-Profile: fanrainfinity

I will be there.

Charles

William A.(Andy) Adamson wrote:

>dean hildebrand and i are attending the September 30th at CMU - just wondering
>who else will be there!
>
>-->Andy
>
>
>
>
>
>
>Yahoo! Groups Links
>
>
>
>
>
>


From craigev@us.ibm.com Fri Sep 24 09:39:25 2004
Return-Path: <craigev@us.ibm.com>
X-Sender: craigev@us.ibm.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 15937 invoked from network); 24 Sep 2004 16:39:24 -0000
Received: from unknown (66.218.66.167)
by m13.grp.scd.yahoo.com with QMQP; 24 Sep 2004 16:39:24 -0000
Received: from unknown (HELO e32.co.us.ibm.com) (32.97.110.130)
by mta6.grp.scd.yahoo.com with SMTP; 24 Sep 2004 16:39:24 -0000
Received: from westrelay04.boulder.ibm.com (westrelay04.boulder.ibm.com [9.17.193.32])
by e32.co.us.ibm.com (8.12.10/8.12.9) with ESMTP id i8OGdNbZ421586
for <pnfs-reqs@yahoogroups.com>; Fri, 24 Sep 2004 12:39:23 -0400
Received: from d03nm130.boulder.ibm.com (d03av04.boulder.ibm.com [9.17.195.170])
by westrelay04.boulder.ibm.com (8.12.10/NCO/VER6.6) with ESMTP id i8OGdMMI070992
for <pnfs-reqs@yahoogroups.com>; Fri, 24 Sep 2004 10:39:22 -0600
In-Reply-To: <20040921203702.3C1E21BB05@citi.umich.edu>
To: pnfs-reqs@yahoogroups.com
X-Mailer: Lotus Notes Release 6.0.2CF1 June 9, 2003
Message-ID: <OF96BF883A.69A63254-ON88256F19.005B5FBB-85256F19.005B7D8A@us.ibm.com>
Date: Fri, 24 Sep 2004 12:39:19 -0400
X-MIMETrack: Serialize by Router on D03NM130/03/M/IBM(Release 6.51HF338 | June 21, 2004) at
09/24/2004 10:39:21
MIME-Version: 1.0
Content-type: multipart/related;
Boundary="0__=07BBE58ADFC8D92B8f9e8a93df938690918c07BBE58ADFC8D92B"
X-eGroups-Remote-IP: 32.97.110.130
From: Craig Everhart <craigev@us.ibm.com>
Subject: Re: [pnfs-reqs] September 30th meeting at CMU
X-Yahoo-Group-Post: member; u=67958684

I'll be there! I'll be getting there late, like maybe 10-10:30am, but I'll be there.

Craig

Craig Everhart
+1 919 543 2169 (tie 441 2169)

Inactive hide details for "William A.(Andy) Adamson" <andros@citi.umich.edu>"William A.(Andy) Adamson" <andros@citi.umich.edu>


                        "William A.(Andy) Adamson" <andros@citi.umich.edu>

                        09/21/2004 04:37 PM
                        Please respond to
                        pnfs-reqs

	

To
	
pnfs-reqs@yahoogroups.com

cc
	
andros@citi.umich.edu

Subject
	
[pnfs-reqs] September 30th meeting at CMU
	

dean hildebrand and i are attending the September 30th at CMU - just wondering
who else will be there!

-->Andy




------------------------ Yahoo! Groups Sponsor --------------------~-->
Make a clean sweep of pop-up ads. Yahoo! Companion Toolbar.
Now with Pop-Up Blocker. Get it for free!
http://us.click.yahoo.com/L5YrjA/eSIIAA/yQLSAA/W6uqlB/TM
--------------------------------------------------------------------~->


Yahoo! Groups Links

<*> To visit your group on the web, go to:
   http://groups.yahoo.com/group/pnfs-reqs/

<*> To unsubscribe from this group, send an email to:
   pnfs-reqs-unsubscribe@yahoogroups.com

<*> Your use of Yahoo! Groups is subject to:
   http://docs.yahoo.com/info/terms/





Attachment (not stored)
pic00864.gif
Type: image/gif

From garth@panasas.com Mon Sep 27 14:27:55 2004
Return-Path: <garth@panasas.com>
X-Sender: garth@panasas.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 84254 invoked from network); 27 Sep 2004 21:27:53 -0000
Received: from unknown (66.218.66.166)
by m3.grp.scd.yahoo.com with QMQP; 27 Sep 2004 21:27:53 -0000
Received: from unknown (HELO PIKES.panasas.com) (65.194.124.178)
by mta5.grp.scd.yahoo.com with SMTP; 27 Sep 2004 21:27:53 -0000
Received: from [127.0.0.1] ([172.17.19.3]) by PIKES.panasas.com with SMTP (Microsoft Exchange Internet Mail Service Version 5.5.2653.13)
id 2B56Z114; Mon, 27 Sep 2004 17:27:51 -0400
In-Reply-To: <4048ABC8-075C-11D9-9498-000A95A94F04@panasas.com>
References: <4048ABC8-075C-11D9-9498-000A95A94F04@panasas.com>
Mime-Version: 1.0 (Apple Message framework v619)
Content-Type: text/plain; charset=US-ASCII; format=flowed
Message-Id: <1578551C-10CC-11D9-85F1-000393754F12@panasas.com>
Content-Transfer-Encoding: 7bit
Cc: Greg Ganger <ganger@ece.cmu.edu>,
Karen Lindenfelser <karen@ece.cmu.edu>,
Garth Gibson <garth@panasas.com>
Date: Mon, 27 Sep 2004 17:27:50 -0400
To: pnfs-reqs@yahoogroups.com
X-Mailer: Apple Mail (2.619)
X-eGroups-Remote-IP: 65.194.124.178
From: Garth Gibson <garth@panasas.com>
Subject: Re: [pnfs-reqs] Sept 30 working meeting in Pittsburgh
X-Yahoo-Group-Post: member; u=169457820
X-Yahoo-Profile: garth_a_gibson

Reminder: final adhoc pNFS meeting is this Thurs at CMU. After this,
all meetings will be on the ietf nfsv4 mailing lists and meeting
schedules.

See web link below for maps, transportation facilities -- since you
will be walking onto a campus, I highly recommend printing these maps
and showing them to the students as you walk around saying "how do I
get to 8220 Wean Hall".

Continental breakfast will be in 8220 Wean at 8:30 am.

At lunch Dean Hildebrand of U. Mich will be giving a pNFS related talk
in CMU's Systems Design and Implementation seminar and we are all
invited.

Our schedule ends at 4pm, allowing for people to catch a 6pm flight to
the west coast. For those that can stay a little longer, CMU's storage
group, the Parallel Data Lab (www.pdl.cmu.edu), will be offering a
poster session from 4-6pm. Thanks to Greg Ganger and the PDL team for
doing another poster session after three solid days of review
9/27-9/29.

Attendees I know about include: David Black (EMC), Andy Adamson
(U.Mich), Dean Hildebrand (U.Mich), Craig Everhardt (IBM), David Ford
(NetApp), Yuichi Yagawa (Hitachi), Charles Fan (Rainfinity), John
Howard (Sun), Tushar Tambay (Veritas), Garth Gibson (Panasas/CMU),
Brent Welch (Panasas).

See you there!

garth


On Sep 15, 2004, at 5:14 PM, Garth Gibson wrote:
> CMU has people who know how to help organize things, so please look
> here http://www.pdl.cmu.edu/pNFS/ for the details.
>
> This will be the last non-IETF meeting, and we are requested to put
> materials for this meeting into the IETF as internet drafts so they are
> visible to all. Future pNFS conversations, after Sept 30, should
> happen on the IETF mailing list and face-to-face meetings should be
> requests for items on the NFSv4 meeting agendas.
>
> garth

From garth@panasas.com Wed Sep 29 13:08:41 2004
Return-Path: <garth@panasas.com>
X-Sender: garth@panasas.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 60498 invoked from network); 29 Sep 2004 20:08:36 -0000
Received: from unknown (66.218.66.172)
by m13.grp.scd.yahoo.com with QMQP; 29 Sep 2004 20:08:36 -0000
Received: from unknown (HELO PIKES.panasas.com) (65.194.124.178)
by mta4.grp.scd.yahoo.com with SMTP; 29 Sep 2004 20:08:28 -0000
Received: from [127.0.0.1] ([172.17.19.3]) by PIKES.panasas.com with SMTP (Microsoft Exchange Internet Mail Service Version 5.5.2653.13)
id 2B56ZJWN; Wed, 29 Sep 2004 16:08:22 -0400
In-Reply-To: <1578551C-10CC-11D9-85F1-000393754F12@panasas.com>
References: <4048ABC8-075C-11D9-9498-000A95A94F04@panasas.com> <1578551C-10CC-11D9-85F1-000393754F12@panasas.com>
Mime-Version: 1.0 (Apple Message framework v619)
Content-Type: multipart/mixed; boundary=Apple-Mail-38-69699667
Message-Id: <4EF61B00-1253-11D9-85F1-000393754F12@panasas.com>
Cc: Karen Lindenfelser <karen@ece.cmu.edu>
Date: Wed, 29 Sep 2004 16:08:19 -0400
To: pnfs-reqs@yahoogroups.com
X-Mailer: Apple Mail (2.619)
X-eGroups-Remote-IP: 65.194.124.178
From: Garth Gibson <garth@panasas.com>
Subject: Re: [pnfs-reqs] Sept 30 working meeting in Pittsburgh
X-Yahoo-Group-Post: member; u=169457820
X-Yahoo-Profile: garth_a_gibson

Attendees,

If you need parking at the campus and don't know the area the safest
bet is to park in the Carnegie Museum parking lot. It'll cost $15 for
the day, but it is easy and reliable. I've attached a few more maps:
- CMU-map -- the Carnegie Museum parking lot is at Forbes and South
Craig, about where the North-South-cross is on this map
- CarnegieMuseum map - shows how to get to the Carnegie Museum parking
-- walk east on Forbes to CMU
- wean-map -- coming east on Forbes from the Carnegie Museum you will
be at the bottom of this map, walk around Hamburg Hall, through
Newell-Simon Hall (go up the stairs one floor) to Wean Hall (you enter
on the 4th floor and the meeting is on the 8th)

There is a Starbucks at South Craig and Forbes, with a better coffee
shop across South Craig, and even better coffee in the booth in the
lobby of 5th floor of Wean, and we will have less good coffee in 8220
Wean Hall.

See you tomorrow,
garth


On Sep 27, 2004, at 5:27 PM, Garth Gibson wrote:

> Reminder: final adhoc pNFS meeting is this Thurs at CMU. After this,
> all meetings will be on the ietf nfsv4 mailing lists and meeting
> schedules.
>
> See web link below for maps, transportation facilities -- since you
> will be walking onto a campus, I highly recommend printing these maps
> and showing them to the students as you walk around saying "how do I
> get to 8220 Wean Hall".
>
> Continental breakfast will be in 8220 Wean at 8:30 am.
>
> At lunch Dean Hildebrand of U. Mich will be giving a pNFS related talk
> in CMU's Systems Design and Implementation seminar and we are all
> invited.
>
> Our schedule ends at 4pm, allowing for people to catch a 6pm flight to
> the west coast. For those that can stay a little longer, CMU's
> storage group, the Parallel Data Lab (www.pdl.cmu.edu), will be
> offering a poster session from 4-6pm. Thanks to Greg Ganger and the
> PDL team for doing another poster session after three solid days of
> review 9/27-9/29.
>
> Attendees I know about include: David Black (EMC), Andy Adamson
> (U.Mich), Dean Hildebrand (U.Mich), Craig Everhardt (IBM), David Ford
> (NetApp), Yuichi Yagawa (Hitachi), Charles Fan (Rainfinity), John
> Howard (Sun), Tushar Tambay (Veritas), Garth Gibson (Panasas/CMU),
> Brent Welch (Panasas).
>
> See you there!
>
> garth
>
>
> On Sep 15, 2004, at 5:14 PM, Garth Gibson wrote:
>> CMU has people who know how to help organize things, so please look
>> here http://www.pdl.cmu.edu/pNFS/ for the details.
>>
>> This will be the last non-IETF meeting, and we are requested to put
>> materials for this meeting into the IETF as internet drafts so they
>> are
>> visible to all. Future pNFS conversations, after Sept 30, should
>> happen on the IETF mailing list and face-to-face meetings should be
>> requests for items on the NFSv4 meeting agendas.
>>
>> garth




Attachment (not stored)
CMU-map.pdf
Type: application/pdf

Attachment (not stored)
CarnegieMuseum-roadmap.gif
Type: image/gif

Attachment (not stored)
wean-map.pdf
Type: application/pdf

From bwelch@panasas.com Thu Sep 30 20:11:27 2004
Return-Path: <welch@panasas.com>
X-Sender: welch@panasas.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 39833 invoked from network); 1 Oct 2004 03:11:25 -0000
Received: from unknown (66.218.66.172)
by m17.grp.scd.yahoo.com with QMQP; 1 Oct 2004 03:11:25 -0000
Received: from unknown (HELO medlicott.panasas.com) (63.80.58.206)
by mta4.grp.scd.yahoo.com with SMTP; 1 Oct 2004 03:11:25 -0000
Received: from panasas.com (welch@localhost)
by medlicott.panasas.com (8.11.6/8.11.6) with ESMTP id i91387O18638;
Thu, 30 Sep 2004 20:08:07 -0700
Message-Id: <200410010308.i91387O18638@medlicott.panasas.com>
X-Authentication-Warning: medlicott.panasas.com: welch owned process doing -bs
To: pnfs-reqs@yahoogroups.com
Cc: welch@panasas.com
X-URL: http://www.panasas.com/
X-Face: "HxE|?EnC9fVMV8f70H83&{fgLE.|FZ^$>@Q(yb#N,Eh~N]e&]=>r5~UnRml1:4EglY{9B+
:'wJq$@c_C!l8@<$t,{YUr4K,QJGHSvS~U]H`<+L*x?eGzSk>XH\W:AK\j?@?c1o<k;j'Ei/UL)!*0
ILwSR)J\bc)gjz!rrGQ2#i*f:M:ydhK}jp4dWQW?;0{,#iWrCV$4~%e/3)$1/D
MIME-Version: 1.0
Content-Type: text/plain; charset="us-ascii"
Content-ID: <18636.1096600086.1@panasas.com>
Date: Thu, 30 Sep 2004 20:08:06 -0700
X-eGroups-Remote-IP: 63.80.58.206
X-eGroups-From: Brent Welch <welch@panasas.com>
From: Brent Welch <bwelch@panasas.com>
Subject: Notes from Sept 30
X-Yahoo-Group-Post: member; u=169551413
X-Yahoo-Profile: brent_welch_1960

(This will be one of the last postings before we complete the move
to the nfsv4 mailing list. Just wanted to capture these notes.)

These were the agenda items for today's pnfs ops discussion.
GETDEVINFO / GETDEVLIST
Group Open
Client Failures
Attribute Handling
Layout life cycle

We started with a general review for newcomers. This led to a bit
of discussion about how the client is structured. We identified
three layers on the client side: the high level "generic" layers that
deal with vnode ops and other interfaces to the host OS. The
middle layer that understands layout information. Its job is to
convert I/O requests into operations on the underlying devices
based on the layout information. The lowest level is the device
driver layer that just does raw I/O.
We coined the term "layout class" to describe families of layouts for file,
object and block access. Within each layout class there are different
possible layouts that describe how the data is organized onto
one or more storage devices. I'd like to propose the term
"layout driver" as the code module that understands a class of
layouts. (At the meeting we used the term "storage access module")

With this view, it is the middle layer (hmm, the "pnfs layer?") that
understands layouts and issues I/O to drivers. And, it is this layer
that brings us to the GETDEVICEINFO and GETDEVICELIST ops. We had
this in the early drafts, and then tried to drop it because it was
particular to this layout-class-specific layer of code. However,
essentialy all layouts have the problem of needing a fairly compact
way to refer to devices from layouts, and therefore they need a way
to map from these short device names to more complete information
required to find the device. This topic was discussed at the last
meeting, where we re-introduced GETDEVICEINFO.

At this meeting,
David Black gave us a tutorial on the world of SCSI resource discovery.
The net result is that the SCSI layer queries all available devices
and reads volume label information off the attached devices as an
absolute way to determine what devices are accessible and which
ports happen to be useful by the current host for that device.
Different hosts may use different physical addresses for the same
device, and the same device may appear at a different physical
address over time. These are the consequences of multi-path networks
and failover. That's just how it is. In this world is it somewhat
natural for the server to compare notes with its clients about the
devices that are used by the filesystem. This motivates the
GETDEVICELIST operation that the client would use to read out
the entire device table for the filesystem. Input parameters
are clientID and fsid, and the result is an array of mappings
from short device names to a storage type "information" specific
to that type.

We talked about the sequence of operations that the client takes when
it first encounters a filesystem (e.g., "mount" time)
1) Determine if the filesystem can do pNFS
2) Determine what kind of storage devices and storage protocols are involved
3) Determine if the storage devices are in fact accessible

Step (1) and (2) can be acheived by doing a GETATTR when it first
encounters a new fsid. This means it will do this when it crosses
internal mount points. The attribute would be, e.g., LAYOUT_CLASSES
and it would return an array of layout class values. These would
select from available layout drives to implement that middle "pnfs"
layer, or "layout driver" layer. Next, the client can use the new
GETDEVICELIST (if it so chooses) to enumerate the devices associated
with that filesystem. It might determine at that point that it does
not have sufficient connectivity to access storage directly. It
could fail the "mount", or it could simply make a note to itself
that it isn't worth trying out the pNFS operations on that fsid.

Last meeting Dave Noveck argued against using an fsid as an input
parameter to GETDEVICEINFO and GETDEVICELIST. I'm not sure I can
replay his argument, and it seems convenient to be able to do so
from the client's perspective.

Note also the GETDEVICELIST may not be needed by all layout drivers.
The other strategy is to wait until a new short device name
(i.e., small integer) appears in a layout, and then do a GETDEVICEINFO
op to learn more about it.

The next topic was Group Open. The motivating example is to have
1000 clients that all want to open the same file. One desired
paradigm by the HPC programming community is to be able to do
a single OPEN/LAYOUTGET operation on a "master" node, and then have
it give out the resulting layout and stateids to its 100's of slave
nodes, which then access storage directly. The previous solutions
proposed by the group include a kind of share_key on the LAYOUTGET.
However, that solution still implies that all 100's of slave nodes
do their own LAYOUTGET. Is it possible to avoid that? How does
it affect the protocol? Can we let the clients "cheat", er,
cooperate in this way? How much protection should the filesystem
provide against different MPI applications that might accidentally
share the same file? Mostly we had questions.

One fringe theory was that the MPI application could somehow get
a new clientid (of the SETCLIENTID kind) that represented the collection
of processors involved in the MPI application. This is compatible
with the typical batch processing MPI environments where N nodes
are allocated, and then a job is run against those nodes. The
master node uses this clientid to get the layout-related stateids.
It gives out the layout information to its slaves, and the underlying
filesystem allows use of the layout information by the nodes
represented by that clientID. There is somewhat tenuous connection
between the set of processors, the clientID, the layout and associated
layout stateID, and the class-specific information in the layout that
is passed to the storage device so it can ensure that only those
processors access storage.

The next topic was client failure recovery. The net result is that
we can claim leverage off the existing NFSv4 features to deal with
crashed clients. This includes that layouts have associated layout
stateids that "expire" along with the rest of the client state.
The main new issue introduced by pNFS is that the client may have to
do a lot of I/O in response to a layout recall. The client may
need to remember to send null ops to the server during this period
if it were to risk not doing anything within the lease time.
Of course, the client should only reply with its LAYOUTRETURN
after it knows its I/O has completed.

"All existing NFS security mechanisms apply to these ops". Need to
remember this standard text in our RFC.

We talked about attribute handling semantics. Mostly we agreed not
to state much. For example, a LAYOUTCOMMIT implies that the modify
time changed, but we don't need to mandate precisely when that takes
effect on the server. Ditto for access time and ctime.

The final item was layout lifetimes. The client is free to cache the
layout information for as long as the stateid is valid. The layout
is associated with a file handle, so the layout can be applied to
many different OPEN / CLOSE sessions on a file. (Even open/close
at the vop level.)
--
Brent Welch
Software Architect, Panasas Inc
Delivering the premier storage system for scalable Linux clusters

www.panasas.com
welch@panasas.com

From dnoveck@netapp.com Mon Oct 04 12:12:09 2004
Return-Path: <Dave.Noveck@netapp.com>
X-Sender: Dave.Noveck@netapp.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 65401 invoked from network); 4 Oct 2004 19:12:08 -0000
Received: from unknown (66.218.66.172)
by m21.grp.scd.yahoo.com with QMQP; 4 Oct 2004 19:12:08 -0000
Received: from unknown (HELO mx01.netapp.com) (198.95.226.53)
by mta4.grp.scd.yahoo.com with SMTP; 4 Oct 2004 19:12:08 -0000
Received: from frejya.corp.netapp.com (frejya [10.57.157.119])
by mx01.netapp.com (8.12.10/8.12.10/NTAP-1.4) with ESMTP id i94JBuFC012762;
Mon, 4 Oct 2004 12:11:56 -0700 (PDT)
Received: from svlexc01.hq.netapp.com (svlexc01.corp.netapp.com [10.57.156.135])
by frejya.corp.netapp.com (8.12.9/8.12.9/NTAP-1.5) with ESMTP id i94JBtt8025156;
Mon, 4 Oct 2004 12:11:55 -0700 (PDT)
Received: from violet.hq.netapp.com ([10.56.10.190]) by svlexc01.hq.netapp.com with Microsoft SMTPSVC(5.0.2195.6713);
Mon, 4 Oct 2004 12:11:55 -0700
Received: from exnane01.hq.netapp.com ([10.97.0.61]) by violet.hq.netapp.com with Microsoft SMTPSVC(5.0.2195.2966);
Mon, 4 Oct 2004 12:11:55 -0700
X-MimeOLE: Produced By Microsoft Exchange V6.5.7226.0
Content-class: urn:content-classes:message
MIME-Version: 1.0
Content-Type: text/plain;
charset="us-ascii"
Content-Transfer-Encoding: quoted-printable
Date: Mon, 4 Oct 2004 15:11:54 -0400
Message-ID: <C98692FD98048C41885E0B0FACD9DFB803BCD2@exnane01.hq.netapp.com>
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
Thread-Topic: [pnfs-reqs] Notes from Sept 30
Thread-Index: AcSnZFmLvbCBJy+cQKCGAdryGu41qwAO2PTQ
To: <pnfs-reqs@yahoogroups.com>
Cc: <welch@panasas.com>
X-OriginalArrivalTime: 04 Oct 2004 19:11:55.0581 (UTC) FILETIME=[037776D0:01C4AA46]
X-eGroups-Remote-IP: 198.95.226.53
X-eGroups-From: "Noveck, Dave" <Dave.Noveck@netapp.com>
From: "Noveck, Dave" <dnoveck@netapp.com>
Subject: RE: [pnfs-reqs] Notes from Sept 30
X-Yahoo-Group-Post: member; u=44152831
X-Yahoo-Profile: davidnoveck

> Last meeting Dave Noveck argued against using an fsid as an input
> parameter to GETDEVICEINFO and GETDEVICELIST. I'm not sure I can
> replay his argument, and it seems convenient to be able to do so
> from the client's perspective.

I don't think this is a big deal. I guess I would argue that it is
more in keeping with the spirit of the protocol to get the fsid from
the current fh. Any file handle within the target fs would do.
Essentially you would be specifying the filesystem which could also
be identified by using the fsid. If you have an fsid, you got it
from an fh within the fs (by doing a GETATTR) so that whenever the
client has an fsid, it must have an fh within it and it can use the
typical v4 style of specification by doing a PUTFH followed by the
GETDEVICEINFO or GETDEVICELIST.

-----Original Message-----
From: Brent Welch [mailto:bwelch@panasas.com]
Sent: Thursday, September 30, 2004 11:08 PM
To: pnfs-reqs@yahoogroups.com
Cc: welch@panasas.com
Subject: [pnfs-reqs] Notes from Sept 30


(This will be one of the last postings before we complete the move
to the nfsv4 mailing list. Just wanted to capture these notes.)

These were the agenda items for today's pnfs ops discussion.
GETDEVINFO / GETDEVLIST
Group Open
Client Failures
Attribute Handling
Layout life cycle

We started with a general review for newcomers. This led to a bit
of discussion about how the client is structured. We identified
three layers on the client side: the high level "generic" layers that
deal with vnode ops and other interfaces to the host OS. The
middle layer that understands layout information. Its job is to
convert I/O requests into operations on the underlying devices
based on the layout information. The lowest level is the device
driver layer that just does raw I/O.
We coined the term "layout class" to describe families of layouts for
file,
object and block access. Within each layout class there are different
possible layouts that describe how the data is organized onto
one or more storage devices. I'd like to propose the term
"layout driver" as the code module that understands a class of
layouts. (At the meeting we used the term "storage access module")

With this view, it is the middle layer (hmm, the "pnfs layer?") that
understands layouts and issues I/O to drivers. And, it is this layer
that brings us to the GETDEVICEINFO and GETDEVICELIST ops. We had
this in the early drafts, and then tried to drop it because it was
particular to this layout-class-specific layer of code. However,
essentialy all layouts have the problem of needing a fairly compact
way to refer to devices from layouts, and therefore they need a way
to map from these short device names to more complete information
required to find the device. This topic was discussed at the last
meeting, where we re-introduced GETDEVICEINFO.

At this meeting,
David Black gave us a tutorial on the world of SCSI resource discovery.
The net result is that the SCSI layer queries all available devices
and reads volume label information off the attached devices as an
absolute way to determine what devices are accessible and which
ports happen to be useful by the current host for that device.
Different hosts may use different physical addresses for the same
device, and the same device may appear at a different physical
address over time. These are the consequences of multi-path networks
and failover. That's just how it is. In this world is it somewhat
natural for the server to compare notes with its clients about the
devices that are used by the filesystem. This motivates the
GETDEVICELIST operation that the client would use to read out
the entire device table for the filesystem. Input parameters
are clientID and fsid, and the result is an array of mappings
from short device names to a storage type "information" specific
to that type.

We talked about the sequence of operations that the client takes when
it first encounters a filesystem (e.g., "mount" time)
1) Determine if the filesystem can do pNFS
2) Determine what kind of storage devices and storage protocols are
involved
3) Determine if the storage devices are in fact accessible

Step (1) and (2) can be acheived by doing a GETATTR when it first
encounters a new fsid. This means it will do this when it crosses
internal mount points. The attribute would be, e.g., LAYOUT_CLASSES
and it would return an array of layout class values. These would
select from available layout drives to implement that middle "pnfs"
layer, or "layout driver" layer. Next, the client can use the new
GETDEVICELIST (if it so chooses) to enumerate the devices associated
with that filesystem. It might determine at that point that it does
not have sufficient connectivity to access storage directly. It
could fail the "mount", or it could simply make a note to itself
that it isn't worth trying out the pNFS operations on that fsid.

Last meeting Dave Noveck argued against using an fsid as an input
parameter to GETDEVICEINFO and GETDEVICELIST. I'm not sure I can
replay his argument, and it seems convenient to be able to do so
from the client's perspective.

Note also the GETDEVICELIST may not be needed by all layout drivers.
The other strategy is to wait until a new short device name
(i.e., small integer) appears in a layout, and then do a GETDEVICEINFO
op to learn more about it.

The next topic was Group Open. The motivating example is to have
1000 clients that all want to open the same file. One desired
paradigm by the HPC programming community is to be able to do
a single OPEN/LAYOUTGET operation on a "master" node, and then have
it give out the resulting layout and stateids to its 100's of slave
nodes, which then access storage directly. The previous solutions
proposed by the group include a kind of share_key on the LAYOUTGET.
However, that solution still implies that all 100's of slave nodes
do their own LAYOUTGET. Is it possible to avoid that? How does
it affect the protocol? Can we let the clients "cheat", er,
cooperate in this way? How much protection should the filesystem
provide against different MPI applications that might accidentally
share the same file? Mostly we had questions.

One fringe theory was that the MPI application could somehow get
a new clientid (of the SETCLIENTID kind) that represented the collection
of processors involved in the MPI application. This is compatible
with the typical batch processing MPI environments where N nodes
are allocated, and then a job is run against those nodes. The
master node uses this clientid to get the layout-related stateids.
It gives out the layout information to its slaves, and the underlying
filesystem allows use of the layout information by the nodes
represented by that clientID. There is somewhat tenuous connection
between the set of processors, the clientID, the layout and associated
layout stateID, and the class-specific information in the layout that
is passed to the storage device so it can ensure that only those
processors access storage.

The next topic was client failure recovery. The net result is that
we can claim leverage off the existing NFSv4 features to deal with
crashed clients. This includes that layouts have associated layout
stateids that "expire" along with the rest of the client state.
The main new issue introduced by pNFS is that the client may have to
do a lot of I/O in response to a layout recall. The client may
need to remember to send null ops to the server during this period
if it were to risk not doing anything within the lease time.
Of course, the client should only reply with its LAYOUTRETURN
after it knows its I/O has completed.

"All existing NFS security mechanisms apply to these ops". Need to
remember this standard text in our RFC.

We talked about attribute handling semantics. Mostly we agreed not
to state much. For example, a LAYOUTCOMMIT implies that the modify
time changed, but we don't need to mandate precisely when that takes
effect on the server. Ditto for access time and ctime.

The final item was layout lifetimes. The client is free to cache the
layout information for as long as the stateid is valid. The layout
is associated with a file handle, so the layout can be applied to
many different OPEN / CLOSE sessions on a file. (Even open/close
at the vop level.)
--
Brent Welch
Software Architect, Panasas Inc
Delivering the premier storage system for scalable Linux clusters

www.panasas.com
welch@panasas.com




Yahoo! Groups Links

From black_david@emc.com Tue Oct 05 21:28:08 2004
Return-Path: <Black_David@emc.com>
X-Sender: Black_David@emc.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 48482 invoked from network); 6 Oct 2004 04:28:07 -0000
Received: from unknown (66.218.66.167)
by m25.grp.scd.yahoo.com with QMQP; 6 Oct 2004 04:28:07 -0000
Received: from unknown (HELO mailhub.lss.emc.com) (168.159.2.31)
by mta6.grp.scd.yahoo.com with SMTP; 6 Oct 2004 04:28:06 -0000
Received: from mxic2.corp.emc.com (mxic2.corp.emc.com [128.221.12.9])
by mailhub.lss.emc.com (Switch-3.1.6/Switch-3.1.6) with ESMTP id i964S1iT022977
for <pnfs-reqs@yahoogroups.com>; Wed, 6 Oct 2004 00:28:02 -0400 (EDT)
Received: by mxic2.corp.emc.com with Internet Mail Service (5.5.2653.19)
id <S0DYHTPY>; Wed, 6 Oct 2004 00:27:54 -0400
Message-ID: <B459CE1AFFC52D4688B2A5B842CA35EA092F8885@corpmx14.us.dg.com>
To: pnfs-reqs@yahoogroups.com
Date: Wed, 6 Oct 2004 00:27:46 -0400
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2653.19)
Content-Type: text/plain
X-PMX-Version: 4.6.1.107272, Antispam-Core: 4.6.1.106808, Antispam-Data: 2004.10.5.4
X-PerlMx-Spam: Gauge=, SPAM=0%, Report='EMC_FROM_0 -0, __TLG_EMC_ENVFROM_0 0, __IMS_MSGID 0, __HAS_MSGID 0, __SANE_MSGID 0, NO_REAL_NAME 0, __TO_MALFORMED_2 0, __MIME_VERSION 0, __ANY_IMS_MUA 0, __IMS_MUA 0, __HAS_X_MAILER 0, __CT_TEXT_PLAIN 0, __CT 0, __C230066_P5 0, __MIME_TEXT_ONLY 0, EMC_BODY_1 -5'
X-eGroups-Remote-IP: 168.159.2.31
From: black_david@emc.com
Subject: fsid usage
X-Yahoo-Group-Post: member; u=82420288
X-Yahoo-Profile: dlb237

I took a closer look at how HighRoad's FMP protocol uses fsid,
and an fsid is allowed, but not required as an input argument to
its analog to GETDEVICELIST (FMP doesn't have a GETDEVICEINFO
call). FMP's analog of GETDEVICELIST is called FMP_Mount, and
it has a number of ways to identify the filesystem - the
NFS-relevant ones are:

1) Absolute path on the fileserver from "root" on the
fileserver
2) An NFS handle to any file in the filesystem.
3) An fsid
4) The NFS mount point of the filesystem (string).

Number 1) is perhaps not the best idea, as it creates client
dependencies on the internal configuration of the fileserver,
so let me set it aside. The remaining three are useful in
different situations at the client:

2) Roughly as Dave Noveck outlines - an NFS filehandle is a
good starting point if the client knows that the handle's
in a filesystem for which the client doesn't have device
info.
3) If the client doesn't know that the handle is in a filesystem
for which the client lacks device info, the fsid is
useful. To avoid any ambiguity about which filesystem
any device shorthand refers to, FMP always returns the
fsid with the layout info. If a client gets back an fsid
it hasn't seen before, it's convenient to use that fsid
to go get the device info, since the client indexes device
info by fsid.
4) This is useful for a client that scans the mount structure
of the mounted NFS filesystems at startup to gather
relevant device info in advance. The mount string is
part of the mount operation and is convenient for getting
device info at mount time. Notice that in this case, the
client does not need to open any files to gather device info.

This suggests that both the fsid and the mount point string
are useful ways to get device info in addition to an NFS file
handle.

Thanks,
--David
----------------------------------------------------
David L. Black, Senior Technologist
EMC Corporation, 176 South St., Hopkinton, MA 01748
+1 (508) 293-7953 FAX: +1 (508) 293-7786
black_david@emc.com Mobile: +1 (978) 394-7754
----------------------------------------------------

> -----Original Message-----
> From: Noveck, Dave [mailto:dnoveck@netapp.com]
> Sent: Monday, October 04, 2004 3:12 PM
> To: pnfs-reqs@yahoogroups.com
> Cc: welch@panasas.com
> Subject: RE: [pnfs-reqs] Notes from Sept 30
>
>
>
> > Last meeting Dave Noveck argued against using an fsid as an input
> > parameter to GETDEVICEINFO and GETDEVICELIST. I'm not sure I can
> > replay his argument, and it seems convenient to be able to do so
> > from the client's perspective.
>
> I don't think this is a big deal. I guess I would argue that it is
> more in keeping with the spirit of the protocol to get the fsid from
> the current fh. Any file handle within the target fs would do.
> Essentially you would be specifying the filesystem which could also
> be identified by using the fsid. If you have an fsid, you got it
> from an fh within the fs (by doing a GETATTR) so that whenever the
> client has an fsid, it must have an fh within it and it can use the
> typical v4 style of specification by doing a PUTFH followed by the
> GETDEVICEINFO or GETDEVICELIST.

From trond.myklebust@fys.uio.no Wed Oct 06 10:55:27 2004
Return-Path: <trond.myklebust@fys.uio.no>
X-Sender: trond.myklebust@fys.uio.no
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 62563 invoked from network); 6 Oct 2004 17:55:26 -0000
Received: from unknown (66.218.66.218)
by m21.grp.scd.yahoo.com with QMQP; 6 Oct 2004 17:55:26 -0000
Received: from unknown (HELO pat.uio.no) (129.240.130.16)
by mta3.grp.scd.yahoo.com with SMTP; 6 Oct 2004 17:55:25 -0000
Received: from mail-mx6.uio.no ([129.240.10.47])
by pat.uio.no with esmtp (Exim 4.34)
id 1CFFyo-0000ai-QU
for pnfs-reqs@yahoogroups.com; Wed, 06 Oct 2004 19:53:35 +0200
Received: from 213.80-202-70.nextgentel.com ([80.202.70.213] helo=[192.168.1.102])
by smtp.uio.no with asmtp (SSLv3:RC4-MD5:128)
(Exim 4.34)
id 1CFFym-0002Ne-Sq
for pnfs-reqs@yahoogroups.com; Wed, 06 Oct 2004 19:53:33 +0200
To: pnfs-reqs@yahoogroups.com
In-Reply-To: <B459CE1AFFC52D4688B2A5B842CA35EA092F8885@corpmx14.us.dg.com>
References: <B459CE1AFFC52D4688B2A5B842CA35EA092F8885@corpmx14.us.dg.com>
Content-Type: text/plain; charset=iso-8859-1
Message-Id: <1097085203.5245.49.camel@lade.trondhjem.org>
Mime-Version: 1.0
X-Mailer: Ximian Evolution 1.4.6
Date: Wed, 06 Oct 2004 19:53:23 +0200
Content-Transfer-Encoding: quoted-printable
X-MailScanner-Information: This message has been scanned for viruses/spam. Contact postmaster@uio.no if you have questions about this scanning
X-UiO-MailScanner: No virus found
X-UiO-Spam-info: not spam, SpamAssassin (score=0, required 12)
X-eGroups-Remote-IP: 129.240.130.16
From: Trond Myklebust <trond.myklebust@fys.uio.no>
Subject: Re: [pnfs-reqs] fsid usage
X-Yahoo-Group-Post: member; u=194000208
X-Yahoo-Profile: trondmy

ADVERTISEMENT
P� on , 06/10/2004 klokka 06:27, skreiv black_david@emc.com:
> I took a closer look at how HighRoad's FMP protocol uses fsid,
> and an fsid is allowed, but not required as an input argument to
> its analog to GETDEVICELIST (FMP doesn't have a GETDEVICEINFO
> call). FMP's analog of GETDEVICELIST is called FMP_Mount, and
> it has a number of ways to identify the filesystem - the
> NFS-relevant ones are:
>
> 1) Absolute path on the fileserver from "root" on the
> fileserver
> 2) An NFS handle to any file in the filesystem.
> 3) An fsid
> 4) The NFS mount point of the filesystem (string).

There are uniqueness guarantees on filehandles that have previously not
existed for fsids. Are you planning on introducing such guarantees into
NFS?
I'm basically referring to the fact that many NFS servers currently may
change the value of the fsid upon reboot or sometimes just even on
remount.

Cheers,
Trond


From dnoveck@netapp.com Wed Oct 06 12:31:43 2004
Return-Path: <Dave.Noveck@netapp.com>
X-Sender: Dave.Noveck@netapp.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 16616 invoked from network); 6 Oct 2004 19:31:42 -0000
Received: from unknown (66.218.66.218)
by m25.grp.scd.yahoo.com with QMQP; 6 Oct 2004 19:31:42 -0000
Received: from unknown (HELO mx01.netapp.com) (198.95.226.53)
by mta3.grp.scd.yahoo.com with SMTP; 6 Oct 2004 19:31:42 -0000
Received: from hawk.corp.netapp.com (hawk [10.57.156.122])
by mx01.netapp.com (8.12.10/8.12.10/NTAP-1.4) with ESMTP id i96JVbFC021446
for <pnfs-reqs@yahoogroups.com>; Wed, 6 Oct 2004 12:31:37 -0700 (PDT)
Received: from svlexc01.hq.netapp.com (svlexc01.corp.netapp.com [10.57.156.135])
by hawk.corp.netapp.com (8.12.9/8.12.9/NTAP-1.5) with ESMTP id i96JVUf1008457
for <pnfs-reqs@yahoogroups.com>; Wed, 6 Oct 2004 12:31:37 -0700 (PDT)
Received: from lavender.hq.netapp.com ([10.56.11.75]) by svlexc01.hq.netapp.com with Microsoft SMTPSVC(5.0.2195.6713);
Wed, 6 Oct 2004 12:31:30 -0700
Received: from exnane01.hq.netapp.com ([10.97.0.61]) by lavender.hq.netapp.com with Microsoft SMTPSVC(5.0.2195.6713);
Wed, 6 Oct 2004 12:31:30 -0700
X-MimeOLE: Produced By Microsoft Exchange V6.5.7226.0
Content-class: urn:content-classes:message
MIME-Version: 1.0
Content-Type: text/plain;
charset="us-ascii"
Content-Transfer-Encoding: quoted-printable
Date: Wed, 6 Oct 2004 15:31:29 -0400
Message-ID: <C98692FD98048C41885E0B0FACD9DFB803BCEF@exnane01.hq.netapp.com>
X-MS-Has-Attach:
X-MS-TNEF-Correlator:
Thread-Topic: [pnfs-reqs] fsid usage
Thread-Index: AcSrXOQowi52+0OKS7u29a7qCXlyUQAeW/GA
To: <pnfs-reqs@yahoogroups.com>
X-OriginalArrivalTime: 06 Oct 2004 19:31:30.0550 (UTC) FILETIME=[14A10160:01C4ABDB]
X-eGroups-Remote-IP: 198.95.226.53
X-eGroups-From: "Noveck, Dave" <Dave.Noveck@netapp.com>
From: "Noveck, Dave" <dnoveck@netapp.com>
Subject: RE: [pnfs-reqs] fsid usage
X-Yahoo-Group-Post: member; u=44152831
X-Yahoo-Profile: davidnoveck

ADVERTISEMENT

> 3) If the client doesn't know that the handle is in a filesystem
> for which the client lacks device info, the fsid is
> useful. To avoid any ambiguity about which filesystem
> any device shorthand refers to, FMP always returns the
> fsid with the layout info. If a client gets back an fsid
> it hasn't seen before, it's convenient to use that fsid
> to go get the device info, since the client indexes device
> info by fsid.

In v4, how can a client have an fsid, and not have a file
handle for an object within the associated fs? How did he
get the fsid if not via a GETATTR of an fh specifying the
FSID attribute?

> 4) This is useful for a client that scans the mount structure
> of the mounted NFS filesystems at startup to gather
> relevant device info in advance. The mount string is
> part of the mount operation and is convenient for getting
> device info at mount time. Notice that in this case, the
> client does not need to open any files to gather device info.

In v4, and you have a string, you can do PUTFH, and then a
series of LOOKUP's and then you have the file handle you need
via a GETFH or you can add the GETDEVICELIST/GETDEVINFO and use
current fh directly.

So I can't see the need for these ops to use anything but the
current fh to indicate what fs they are for.

From trond.myklebust@fys.uio.no Wed Oct 06 12:50:05 2004
Return-Path: <trond.myklebust@fys.uio.no>
X-Sender: trond.myklebust@fys.uio.no
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 14297 invoked from network); 6 Oct 2004 19:50:00 -0000
Received: from unknown (66.218.66.217)
by m5.grp.scd.yahoo.com with QMQP; 6 Oct 2004 19:50:00 -0000
Received: from unknown (HELO pat.uio.no) (129.240.130.16)
by mta2.grp.scd.yahoo.com with SMTP; 6 Oct 2004 19:50:00 -0000
Received: from mail-mx6.uio.no ([129.240.10.47])
by pat.uio.no with esmtp (Exim 4.34)
id 1CFHnH-0004aw-7f
for pnfs-reqs@yahoogroups.com; Wed, 06 Oct 2004 21:49:47 +0200
Received: from 213.80-202-70.nextgentel.com ([80.202.70.213] helo=[192.168.1.103])
by smtp.uio.no with asmtp (SSLv3:RC4-MD5:128)
(Exim 4.34)
id 1CFHnB-0007lB-MS
for pnfs-reqs@yahoogroups.com; Wed, 06 Oct 2004 21:49:42 +0200
To: pnfs-reqs@yahoogroups.com
In-Reply-To: <C98692FD98048C41885E0B0FACD9DFB803BCEF@exnane01.hq.netapp.com>
References: <C98692FD98048C41885E0B0FACD9DFB803BCEF@exnane01.hq.netapp.com>
Content-Type: text/plain; charset=iso-8859-1
Message-Id: <1097092168.30728.18.camel@lade.trondhjem.org>
Mime-Version: 1.0
X-Mailer: Ximian Evolution 1.4.6
Date: Wed, 06 Oct 2004 21:49:28 +0200
Content-Transfer-Encoding: quoted-printable
X-MailScanner-Information: This message has been scanned for viruses/spam. Contact postmaster@uio.no if you have questions about this scanning
X-UiO-MailScanner: No virus found
X-UiO-Spam-info: not spam, SpamAssassin (score=0, required 12)
X-eGroups-Remote-IP: 129.240.130.16
From: Trond Myklebust <trond.myklebust@fys.uio.no>
Subject: RE: [pnfs-reqs] fsid usage
X-Yahoo-Group-Post: member; u=194000208
X-Yahoo-Profile: trondmy

P� on , 06/10/2004 klokka 21:31, skreiv Noveck, Dave:
> > 4) This is useful for a client that scans the mount structure
> > of the mounted NFS filesystems at startup to gather
> > relevant device info in advance. The mount string is
> > part of the mount operation and is convenient for getting
> > device info at mount time. Notice that in this case, the
> > client does not need to open any files to gather device info.
>
> In v4, and you have a string, you can do PUTFH, and then a
> series of LOOKUP's and then you have the file handle you need
> via a GETFH or you can add the GETDEVICELIST/GETDEVINFO and use
> current fh directly.

Note that what Dave means here is that the "current" property of
filehandles allows you to do the LOOKUPs and GETDEVICELIST/GETDEVICEINFO
in a single compound.

This is more rational than having to first get the fsid in one compound,
in order to be able to copy it into the argument for a second
GETDEVICELIST/.. compound. You could of course define a "current fsid"
in order to gain the same benefits, but that seems like overkill, given
the number of operations that want to be able to use it.

Cheers,
Trond

From bwelch@panasas.com Wed Oct 06 15:21:42 2004
Return-Path: <welch@panasas.com>
X-Sender: welch@panasas.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 94885 invoked from network); 6 Oct 2004 22:21:39 -0000
Received: from unknown (66.218.66.172)
by m19.grp.scd.yahoo.com with QMQP; 6 Oct 2004 22:21:39 -0000
Received: from unknown (HELO medlicott.panasas.com) (63.80.58.206)
by mta4.grp.scd.yahoo.com with SMTP; 6 Oct 2004 22:21:39 -0000
Received: from panasas.com (welch@localhost)
by medlicott.panasas.com (8.11.6/8.11.6) with ESMTP id i96MLdB13988
for <pnfs-reqs@yahoogroups.com>; Wed, 6 Oct 2004 15:21:39 -0700
Message-Id: <200410062221.i96MLdB13988@medlicott.panasas.com>
X-Authentication-Warning: medlicott.panasas.com: welch owned process doing -bs
X-Mailer: exmh version 2.7.1 10/4/2004 with nmh-1.0.4
To: pnfs-reqs@yahoogroups.com
In-reply-to: <1097092168.30728.18.camel@lade.trondhjem.org>
References: <C98692FD98048C41885E0B0FACD9DFB803BCEF@exnane01.hq.netapp.com>
<1097092168.30728.18.camel@lade.trondhjem.org>
Comments: In-reply-to Trond Myklebust <trond.myklebust@fys.uio.no>
message dated "Wed, 06 Oct 2004 21:49:28 +0200."
X-URL: http://www.panasas.com/
X-Face: "HxE|?EnC9fVMV8f70H83&{fgLE.|FZ^$>@Q(yb#N,Eh~N]e&]=>
r5~UnRml1:4EglY{9B+
:'wJq$@c_C!l8@<$t,{YUr4K,QJGHSvS~U]H`<+L*x?eGzSk>XH\W:AK\j?@?c1o<k;j'Ei/UL)!*0
ILwSR)J\bc)gjz!rrGQ2#i*f:M:ydhK}jp4dWQW?;0{,#iWrCV$4~%e/3)$1/D
Mime-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Date: Wed, 06 Oct 2004 15:21:38 -0700
X-eGroups-Remote-IP: 63.80.58.206
X-eGroups-From: Brent Welch <welch@panasas.com>
From: Brent Welch <bwelch@panasas.com>
Subject: Re: [pnfs-reqs] fsid usage
X-Yahoo-Group-Post: member; u=169551413
X-Yahoo-Profile: brent_welch_1960

The current ops draft, under review, uses the current
filehandle as an implicit argument in GETDEVICEINFO/LIST

>>>Trond Myklebust said:
>
> on , 06/10/2004 klokka 21:31, skreiv Noveck, Dave:
> > > 4) This is useful for a client that scans the mount structure
> > > of the mounted NFS filesystems at startup to gather
> > > relevant device info in advance. The mount string is
> > > part of the mount operation and is convenient for getting
> > > device info at mount time. Notice that in this case, the
> > > client does not need to open any files to gather device info.
> >
> > In v4, and you have a string, you can do PUTFH, and then a
> > series of LOOKUP's and then you have the file handle you need
> > via a GETFH or you can add the GETDEVICELIST/GETDEVINFO and use
> > current fh directly.
>
> Note that what Dave means here is that the "current" property of
> filehandles allows you to do the LOOKUPs and
> GETDEVICELIST/GETDEVICEINFO
> in a single compound.
>
> This is more rational than having to first get the fsid in one
> compound,
> in order to be able to copy it into the argument for a second
> GETDEVICELIST/.. compound. You could of course define a "current fsid"
> in order to gain the same benefits, but that seems like overkill, given
> the number of operations that want to be able to use it.

--
Brent Welch
Software Architect, Panasas Inc
Delivering the premier storage system for scalable Linux clusters

www.panasas.com
welch@panasas.com

From black_david@emc.com Wed Oct 06 18:20:34 2004
Return-Path: <Black_David@emc.com>
X-Sender: Black_David@emc.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 58682 invoked from network); 7 Oct 2004 01:20:33 -0000
Received: from unknown (66.218.66.172)
by m25.grp.scd.yahoo.com with QMQP; 7 Oct 2004 01:20:33 -0000
Received: from unknown (HELO mailhub.lss.emc.com) (168.159.2.31)
by mta4.grp.scd.yahoo.com with SMTP; 7 Oct 2004 01:20:33 -0000
Received: from MAHO3MSX2.corp.emc.com (maho3msx2.corp.emc.com [128.221.11.32])
by mailhub.lss.emc.com (Switch-3.1.6/Switch-3.1.6) with ESMTP id i971KUJE024933
for <pnfs-reqs@yahoogroups.com>; Wed, 6 Oct 2004 21:20:31 -0400 (EDT)
Received: by maho3msx2.isus.emc.com with Internet Mail Service (5.5.2653.19)
id <PNMCWJNH>; Wed, 6 Oct 2004 21:20:30 -0400
Message-ID: <B459CE1AFFC52D4688B2A5B842CA35EA07E5C9C5@corpmx14.us.dg.com>
To: pnfs-reqs@yahoogroups.com
Date: Wed, 6 Oct 2004 21:20:30 -0400
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2653.19)
Content-Type: text/plain;
charset="iso-8859-1"
Content-Transfer-Encoding: quoted-printable
X-PMX-Version: 4.6.1.107272, Antispam-Core: 4.6.1.106808, Antispam-Data: 2004.10.6.3
X-PerlMx-Spam: Gauge=, SPAM=0%, Report='EMC_FROM_0 -0, __TLG_EMC_ENVFROM_0 0, __IMS_MSGID 0, __HAS_MSGID 0, __SANE_MSGID 0, NO_REAL_NAME 0, __TO_MALFORMED_2 0, __MIME_VERSION 0, __ANY_IMS_MUA 0, __IMS_MUA 0, __HAS_X_MAILER 0, __CTYPE_CHARSET_QUOTED 0, __CT_TEXT_PLAIN 0, __CT 0, __CTE 0, __C230066_P5 0, __MIME_TEXT_ONLY 0, EMC_BODY_1 -5'
X-eGroups-Remote-IP: 168.159.2.31
From: black_david@emc.com
Subject: RE: [pnfs-reqs] fsid usage
X-Yahoo-Group-Post: member; u=82420288
X-Yahoo-Profile: dlb237

> P� on , 06/10/2004 klokka 06:27, skreiv black_david@emc.com:
> > I took a closer look at how HighRoad's FMP protocol uses fsid,
> > and an fsid is allowed, but not required as an input argument to
> > its analog to GETDEVICELIST (FMP doesn't have a GETDEVICEINFO
> > call). FMP's analog of GETDEVICELIST is called FMP_Mount, and
> > it has a number of ways to identify the filesystem - the
> > NFS-relevant ones are:
> >
> > 1) Absolute path on the fileserver from "root" on the
> > fileserver
> > 2) An NFS handle to any file in the filesystem.
> > 3) An fsid
> > 4) The NFS mount point of the filesystem (string).
>
> There are uniqueness guarantees on filehandles that have previously not
> existed for fsids. Are you planning on introducing such guarantees into
> NFS?

No.

> I'm basically referring to the fact that many NFS servers currently may
> change the value of the fsid upon reboot or sometimes just even on
> remount.

For reboot there are existing state revalidation protocols (stateid)
that can cope with what's going on; if the fsid changes, best bet is
to discard any saved device info and refetch. Across unmount/remount,
the client needs to discard the device info and fetch it anew on remount.

Thanks,
--David
----------------------------------------------------
David L. Black, Senior Technologist
EMC Corporation, 176 South St., Hopkinton, MA 01748
+1 (508) 293-7953 FAX: +1 (508) 293-7786
black_david@emc.com Mobile: +1 (978) 394-7754
----------------------------------------------------

From black_david@emc.com Wed Oct 06 18:23:59 2004
Return-Path: <Black_David@emc.com>
X-Sender: Black_David@emc.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 37049 invoked from network); 7 Oct 2004 01:23:59 -0000
Received: from unknown (66.218.66.166)
by m8.grp.scd.yahoo.com with QMQP; 7 Oct 2004 01:23:59 -0000
Received: from unknown (HELO mailhub.lss.emc.com) (168.159.2.31)
by mta5.grp.scd.yahoo.com with SMTP; 7 Oct 2004 01:23:58 -0000
Received: from mxic2.corp.emc.com (mxic2.corp.emc.com [128.221.12.9])
by mailhub.lss.emc.com (Switch-3.1.6/Switch-3.1.6) with ESMTP id i971NuAC019861
for <pnfs-reqs@yahoogroups.com>; Wed, 6 Oct 2004 21:23:56 -0400 (EDT)
Received: by mxic2.corp.emc.com with Internet Mail Service (5.5.2653.19)
id <S0DYJDT4>; Wed, 6 Oct 2004 21:23:56 -0400
Message-ID: <B459CE1AFFC52D4688B2A5B842CA35EA07E5C9C6@corpmx14.us.dg.com>
To: pnfs-reqs@yahoogroups.com
Date: Wed, 6 Oct 2004 21:23:55 -0400
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2653.19)
Content-Type: text/plain
X-PMX-Version: 4.6.1.107272, Antispam-Core: 4.6.1.106808, Antispam-Data: 2004.10.6.3
X-PerlMx-Spam: Gauge=, SPAM=0%, Report='EMC_FROM_0 -0, __TLG_EMC_ENVFROM_0 0, __IMS_MSGID 0, __HAS_MSGID 0, __SANE_MSGID 0, NO_REAL_NAME 0, __TO_MALFORMED_2 0, __MIME_VERSION 0, __ANY_IMS_MUA 0, __IMS_MUA 0, __HAS_X_MAILER 0, __CT_TEXT_PLAIN 0, __CT 0, __C230066_P5 0, __MIME_TEXT_ONLY 0, EMC_BODY_1 -5'
X-eGroups-Remote-IP: 168.159.2.31
From: black_david@emc.com
Subject: RE: [pnfs-reqs] fsid usage
X-Yahoo-Group-Post: member; u=82420288
X-Yahoo-Profile: dlb237

> > 3) If the client doesn't know that the handle is in a filesystem
> > for which the client lacks device info, the fsid is
> > useful. To avoid any ambiguity about which filesystem
> > any device shorthand refers to, FMP always returns the
> > fsid with the layout info. If a client gets back an fsid
> > it hasn't seen before, it's convenient to use that fsid
> > to go get the device info, since the client indexes device
> > info by fsid.
>
> In v4, how can a client have an fsid, and not have a file
> handle for an object within the associated fs? How did he
> get the fsid if not via a GETATTR of an fh specifying the
> FSID attribute?
>
> > 4) This is useful for a client that scans the mount structure
> > of the mounted NFS filesystems at startup to gather
> > relevant device info in advance. The mount string is
> > part of the mount operation and is convenient for getting
> > device info at mount time. Notice that in this case, the
> > client does not need to open any files to gather device info.
>
> In v4, and you have a string, you can do PUTFH, and then a
> series of LOOKUP's and then you have the file handle you need
> via a GETFH or you can add the GETDEVICELIST/GETDEVINFO and use
> current fh directly.
>
> So I can't see the need for these ops to use anything but the
> current fh to indicate what fs they are for.

That's workable - both of these are more in the way of convenience
for the client than absolute functional requirements. Note that
case 4) involves a bunch of extra ops for the client to get a usable
fh. For 3), the answer to how the client might have an fsid without
GETATTR is that FMP returns the fsid with any layout granted to the
client, but that's roughly equivalent to a compounded GETATTR for
the fsid.

Thanks,
--David
----------------------------------------------------
David L. Black, Senior Technologist
EMC Corporation, 176 South St., Hopkinton, MA 01748
+1 (508) 293-7953 FAX: +1 (508) 293-7786
black_david@emc.com Mobile: +1 (978) 394-7754
----------------------------------------------------

From julian_satran@il.ibm.com Sat Oct 09 09:50:11 2004
Return-Path: <Julian_Satran@il.ibm.com>
X-Sender: Julian_Satran@il.ibm.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 42133 invoked from network); 9 Oct 2004 16:50:09 -0000
Received: from unknown (66.218.66.216)
by m16.grp.scd.yahoo.com with QMQP; 9 Oct 2004 16:50:09 -0000
Received: from unknown (HELO mtagate1.de.ibm.com) (195.212.29.150)
by mta1.grp.scd.yahoo.com with SMTP; 9 Oct 2004 16:50:08 -0000
Received: from d12nrmr1507.megacenter.de.ibm.com (d12nrmr1507.megacenter.de.ibm.com [9.149.167.1])
by mtagate1.de.ibm.com (8.12.10/8.12.10) with ESMTP id i99GnlfQ160708;
Sat, 9 Oct 2004 16:49:47 GMT
Received: from d12ml102.megacenter.de.ibm.com (d12av02.megacenter.de.ibm.com [9.149.165.228])
by d12nrmr1507.megacenter.de.ibm.com (8.12.10/NCO/VER6.6) with ESMTP id i99GnkN9215744;
Sat, 9 Oct 2004 18:49:46 +0200
In-Reply-To: <200410010308.i91387O18638@medlicott.panasas.com>
To: pnfs-reqs@yahoogroups.com
Cc: pnfs-reqs@yahoogroups.com, welch@panasas.com
MIME-Version: 1.0
X-Mailer: Lotus Notes Build V70_M2_07222004 Beta 2NP July 22, 2004
Message-ID: <OF9FCFDF4B.F1C05291-ONC2256F28.005B338F-C2256F28.005C71E2@il.ibm.com>
Date: Sat, 9 Oct 2004 18:49:44 +0200
X-MIMETrack: Serialize by Router on D12ML102/12/M/IBM(Release 6.5.1| March 5, 2004) at
09/10/2004 18:49:46,
Serialize complete at 09/10/2004 18:49:46
Content-Type: multipart/alternative; boundary="=_alternative 005BE2A6C2256F28_="
X-eGroups-Remote-IP: 195.212.29.150
From: Julian Satran <julian_satran@il.ibm.com>
Subject: Re: [pnfs-reqs] Notes from Sept 30
X-Yahoo-Group-Post: member; u=64714603
X-Yahoo-Profile: julian_satran

Brent Welch <bwelch@panasas.com> wrote on 01/10/2004 05:08:06:

> One fringe theory was that the MPI application could somehow get
> a new clientid (of the SETCLIENTID kind) that represented the collection
> of processors involved in the MPI application.  This is compatible
> with the typical batch processing MPI environments where N nodes
> are allocated, and then a job is run against those nodes.  The
> master node uses this clientid to get the layout-related stateids.
> It gives out the layout information to its slaves, and the underlying
> filesystem allows use of the layout information by the nodes
> represented by that clientID.  There is somewhat tenuous connection
> between the set of processors, the clientID, the layout and associated
> layout stateID, and the class-specific information in the layout that
> is passed to the storage device so it can ensure that only those
> processors access storage.
>

I am not sure that the implied assumption that the LAYOUT information is handed "securely" to some other entity that the one authorized to access the file is acceptable. CLIENT is a machine while file access is authorized to a user.

On a more positive note if a user can be associated with a (limited) group of machines - distributing layout information to them can be acceptable (and less tenuous than creating a "superclient").

Julo

From bwelch@panasas.com Mon Oct 11 09:54:02 2004
Return-Path: <welch@panasas.com>
X-Sender: welch@panasas.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 17864 invoked from network); 11 Oct 2004 16:54:00 -0000
Received: from unknown (66.218.66.166)
by m8.grp.scd.yahoo.com with QMQP; 11 Oct 2004 16:54:00 -0000
Received: from unknown (HELO medlicott.panasas.com) (63.80.58.206)
by mta5.grp.scd.yahoo.com with SMTP; 11 Oct 2004 16:54:00 -0000
Received: from panasas.com (welch@localhost)
by medlicott.panasas.com (8.11.6/8.11.6) with ESMTP id i9BGrvE25253;
Mon, 11 Oct 2004 09:53:57 -0700
Message-Id: <200410111653.i9BGrvE25253@medlicott.panasas.com>
X-Authentication-Warning: medlicott.panasas.com: welch owned process doing -bs
X-Mailer: exmh version 2.7.1 10/4/2004 with nmh-1.0.4
To: Julian Satran <Julian_Satran@il.ibm.com>
Cc: pnfs-reqs@yahoogroups.com
In-reply-to: <OF9FCFDF4B.F1C05291-ONC2256F28.005B338F-C2256F28.005C71E2@il.ibm.com>
References: <OF9FCFDF4B.F1C05291-ONC2256F28.005B338F-C2256F28.005C71E2@il.ibm.com>
Comments: In-reply-to Julian Satran <Julian_Satran@il.ibm.com>
message dated "Sat, 09 Oct 2004 18:49:44 +0200."
X-URL: http://www.panasas.com/
X-Face: "HxE|?EnC9fVMV8f70H83&{fgLE.|FZ^$>@Q(yb#N,Eh~N]e&]=>
r5~UnRml1:4EglY{9B+
:'wJq$@c_C!l8@<$t,{YUr4K,QJGHSvS~U]H`<+L*x?eGzSk>XH\W:AK\j?@?c1o<k;j'Ei/UL)!*0
ILwSR)J\bc)gjz!rrGQ2#i*f:M:ydhK}jp4dWQW?;0{,#iWrCV$4~%e/3)$1/D
Mime-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Date: Mon, 11 Oct 2004 09:53:57 -0700
X-eGroups-Remote-IP: 63.80.58.206
X-eGroups-From: Brent Welch <welch@panasas.com>
From: Brent Welch <bwelch@panasas.com>
Subject: Re: [pnfs-reqs] Notes from Sept 30
X-Yahoo-Group-Post: member; u=169551413
X-Yahoo-Profile: brent_welch_1960

Doesn't the NFS client have this kind of problem already, regardless
of the pNFS extension? A host operating system that acts for many
users has to keep things sorted out under the hood so that if user
A gets access to a file, that user B doesn't use the wrong credentials
to get access to something they ought not to.

>>>Julian Satran said:

> Brent Welch <bwelch@panasas.com> wrote on 01/10/2004 05:08:06:
>
> > One fringe theory was that the MPI application could somehow get
> > a new clientid (of the SETCLIENTID kind) that represented the
> collection
> > of processors involved in the MPI application. This is compatible
> > with the typical batch processing MPI environments where N nodes
> > are allocated, and then a job is run against those nodes. The
> > master node uses this clientid to get the layout-related stateids.
> > It gives out the layout information to its slaves, and the underlying
> > filesystem allows use of the layout information by the nodes
> > represented by that clientID. There is somewhat tenuous connection
> > between the set of processors, the clientID, the layout and associated
> > layout stateID, and the class-specific information in the layout that
> > is passed to the storage device so it can ensure that only those
> > processors access storage.
> >
>
> I am not sure that the implied assumption that the LAYOUT information is
> handed "securely" to some other entity that the one authorized to access
> the file is acceptable. CLIENT is a machine while file access is
> authorized to a user.
>
> On a more positive note if a user can be associated with a (limited)
> group of machines - distributing layout information to them can be
> acceptable (and less tenuous than creating a "superclient").


--
Brent Welch
Software Architect, Panasas Inc
Delivering the premier storage system for scalable Linux clusters

www.panasas.com
welch@panasas.com


From julian_satran@il.ibm.com Mon Oct 11 20:04:59 2004
Return-Path: <Julian_Satran@il.ibm.com>
X-Sender: Julian_Satran@il.ibm.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 54538 invoked from network); 12 Oct 2004 03:04:57 -0000
Received: from unknown (66.218.66.167)
by m5.grp.scd.yahoo.com with QMQP; 12 Oct 2004 03:04:57 -0000
Received: from unknown (HELO mtagate2.de.ibm.com) (195.212.29.151)
by mta6.grp.scd.yahoo.com with SMTP; 12 Oct 2004 03:04:56 -0000
Received: from d12nrmr1607.megacenter.de.ibm.com (d12nrmr1607.megacenter.de.ibm.com [9.149.167.49])
by mtagate2.de.ibm.com (8.12.10/8.12.10) with ESMTP id i9C34jFW100496;
Tue, 12 Oct 2004 03:04:45 GMT
Received: from d12ml102.megacenter.de.ibm.com (d12av02.megacenter.de.ibm.com [9.149.165.228])
by d12nrmr1607.megacenter.de.ibm.com (8.12.10/NCO/VER6.6) with ESMTP id i9C34iSt107742;
Tue, 12 Oct 2004 05:04:45 +0200
In-Reply-To: <200410111653.i9BGrvE25253@medlicott.panasas.com>
To: Brent Welch <welch@panasas.com>
Cc: pnfs-reqs@yahoogroups.com
MIME-Version: 1.0
X-Mailer: Lotus Notes Build V70_M2_07222004 Beta 2NP July 22, 2004
Message-ID: <OFBE87E02F.2DBEF26E-ONC2256F2B.000F2D03-C2256F2B.0010E8ED@il.ibm.com>
Date: Tue, 12 Oct 2004 05:04:41 +0200
X-MIMETrack: Serialize by Router on D12ML102/12/M/IBM(Release 6.5.1| March 5, 2004) at
12/10/2004 05:04:44,
Serialize complete at 12/10/2004 05:04:44
Content-Type: multipart/alternative; boundary="=_alternative 000F8843C2256F2B_="
X-eGroups-Remote-IP: 195.212.29.151
From: Julian Satran <julian_satran@il.ibm.com>
Subject: Re: [pnfs-reqs] Notes from Sept 30
X-Yahoo-Group-Post: member; u=64714603
X-Yahoo-Profile: julian_satran

ADVERTISEMENT

With NFS the problem is limited to "trust your own machine OS" at most. You do not place any trust in other clients.
Authentication is per user and machine affiliation is a "local" issue.
Introducing or relying on things like CLIENTID violates this model.

Julo

Brent Welch <welch@panasas.com> wrote on 11/10/2004 18:53:57:

> Doesn't the NFS client have this kind of problem already, regardless
> of the pNFS extension?  A host operating system that acts for many
> users has to keep things sorted out under the hood so that if user
> A gets access to a file, that user B doesn't use the wrong credentials
> to get access to something they ought not to.
>
> >>>Julian Satran said:
>
>  > Brent Welch <bwelch@panasas.com> wrote on 01/10/2004 05:08:06:
>  >
>  > > One fringe theory was that the MPI application could somehow get
>  > > a new clientid (of the SETCLIENTID kind) that represented the
>  > collection
>  > > of processors involved in the MPI application.  This is compatible
>  > > with the typical batch processing MPI environments where N nodes
>  > > are allocated, and then a job is run against those nodes.  The
>  > > master node uses this clientid to get the layout-related stateids.
>  > > It gives out the layout information to its slaves, and the underlying
>  > > filesystem allows use of the layout information by the nodes
>  > > represented by that clientID.  There is somewhat tenuous connection
>  > > between the set of processors, the clientID, the layout and associated
>  > > layout stateID, and the class-specific information in the layout that
>  > > is passed to the storage device so it can ensure that only those
>  > > processors access storage.
>  > >
>  >
>  > I am not sure that the implied assumption that the LAYOUT information is
>  > handed "securely" to some other entity that the one authorized to access
>  > the file is acceptable. CLIENT is a machine while file access is
>  > authorized to a user.
>  >
>  > On a more positive note if a user can be associated with a (limited)
>  > group of machines - distributing layout information to them can be
>  > acceptable (and less tenuous than creating a "superclient").
>
>
> --
> Brent Welch
> Software Architect, Panasas Inc
> Delivering the premier storage system for scalable Linux clusters
>
> www.panasas.com
> welch@panasas.com
>
> 

From bhalevy@panasas.com Tue Oct 12 05:22:30 2004
Return-Path: <bhalevy@panasas.com>
X-Sender: bhalevy@panasas.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 13559 invoked from network); 12 Oct 2004 12:22:29 -0000
Received: from unknown (66.218.66.172)
by m15.grp.scd.yahoo.com with QMQP; 12 Oct 2004 12:22:29 -0000
Received: from unknown (HELO barrule.panasas.com) (65.194.124.178)
by mta4.grp.scd.yahoo.com with SMTP; 12 Oct 2004 12:22:29 -0000
Received: by barrule.panasas.com with Internet Mail Service (5.5.2653.19)
id <4S0FLTBH>; Tue, 12 Oct 2004 08:22:27 -0400
Message-ID: <D72776FC4B13B64E9232562572AF292BF151FB@barrule.panasas.com>
To: "'pnfs-reqs@yahoogroups.com'" <pnfs-reqs@yahoogroups.com>,
"Welch, Brent" <welch@panasas.com>
Date: Tue, 12 Oct 2004 08:22:19 -0400
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2653.19)
Content-Type: multipart/alternative;
boundary="----_=_NextPart_001_01C4B054.C5D31393"
X-eGroups-Remote-IP: 65.194.124.178
From: "Halevy, Benny" <bhalevy@panasas.com>
Subject: RE: [pnfs-reqs] Notes from Sept 30
X-Yahoo-Group-Post: member; u=169276676
X-Yahoo-Profile: benny_halevy

Well, the server can't really tell who's on the other side of the line
as long as it conforms to the NFS protocol.  The client or whatever
is there can do whatever it wishes with the information it gets.
The human customer though can use the right hardware and software
that will provide them with the level of security they need
so that unauthorized entities can't use this information.
 
The problem in pnfs arises from the fact that the layout
and security token (aka capabilities) given to the client
are used to access the storage servers directly,
thus not allowing the server who dispatched them
to enforce the security on the data path.
For example, in NFSv4 a client host can
be delegated with the right to serve
opens, locks, and I/O operation locally
without involving the server, yet the server is
always able to intercept any access to the
file when an requests get to the server, This is useful
in various circumectences, e.g. an employee
left the company and it's credentials are revoked.
 
In the pnfs world the server must communicate with
the storage servers in order to invalidate or revoke
a capability it has previously dispatched in order
to achieve the same affect as enforcing security
in-band..
 
I understand that this is hard to achieve in SANs
since the host systems as a whole must be fenced-
off to enforce security, otherwise their OS is
trusted to enforce security into parts of the
storage space (data blocks) they are allowed
to access (SCSI targets, LUNs, etc.).
 
Benny
 
 -----Original Message-----
From: Julian Satran [mailto:julian_satran@il.ibm.com]
Sent: Tuesday, October 12, 2004 5:05 AM
To: Brent Welch
Cc: pnfs-reqs@yahoogroups.com
Subject: Re: [pnfs-reqs] Notes from Sept 30


    With NFS the problem is limited to "trust your own machine OS" at most. You do not place any trust in other clients.
    Authentication is per user and machine affiliation is a "local" issue.
    Introducing or relying on things like CLIENTID violates this model.

    Julo

    Brent Welch <welch@panasas.com> wrote on 11/10/2004 18:53:57:

    > Doesn't the NFS client have this kind of problem already, regardless
    > of the pNFS extension?  A host operating system that acts for many
    > users has to keep things sorted out under the hood so that if user
    > A gets access to a file, that user B doesn't use the wrong credentials
    > to get access to something they ought not to.
    >
    > >>>Julian Satran said:
    >
    >  > Brent Welch <bwelch@panasas.com> wrote on 01/10/2004 05:08:06:
    >  >
    >  > > One fringe theory was that the MPI application could somehow get
    >  > > a new clientid (of the SETCLIENTID kind) that represented the
    >  > collection
    >  > > of processors involved in the MPI application.  This is compatible
    >  > > with the typical batch processing MPI environments where N nodes
    >  > > are allocated, and then a job is run against those nodes.  The
    >  > > master node uses this clientid to get the layout-related stateids.
    >  > > It gives out the layout information to its slaves, and the underlying
    >  > > filesystem allows use of the layout information by the nodes
    >  > > represented by that clientID.  There is somewhat tenuous connection
    >  > > between the set of processors, the clientID, the layout and associated
    >  > > layout stateID, and the class-specific information in the layout that
    >  > > is passed to the storage device so it can ensure that only those
    >  > > processors access storage.
    >  > >
    >  >
    >  > I am not sure that the implied assumption that the LAYOUT information is
    >  > handed "securely" to some other entity that the one authorized to access
    >  > the file is acceptable. CLIENT is a machine while file access is
    >  > authorized to a user.
    >  >
    >  > On a more positive note if a user can be associated with a (limited)
    >  > group of machines - distributing layout information to them can be
    >  > acceptable (and less tenuous than creating a "superclient").
    >
    >
    > --
    > Brent Welch
    > Software Architect, Panasas Inc
    > Delivering the premier storage system for scalable Linux clusters
    >
    > www.panasas.com
    > welch@panasas.com
    >
    > 

From trond.myklebust@fys.uio.no Tue Oct 12 05:48:16 2004
Return-Path: <trond.myklebust@fys.uio.no>
X-Sender: trond.myklebust@fys.uio.no
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 18406 invoked from network); 12 Oct 2004 12:48:15 -0000
Received: from unknown (66.218.66.172)
by m24.grp.scd.yahoo.com with QMQP; 12 Oct 2004 12:48:15 -0000
Received: from unknown (HELO pat.uio.no) (129.240.130.16)
by mta4.grp.scd.yahoo.com with SMTP; 12 Oct 2004 12:48:15 -0000
Received: from mail-mx6.uio.no ([129.240.10.47])
by pat.uio.no with esmtp (Exim 4.34)
id 1CHIEq-0006Hm-K8
for pnfs-reqs@yahoogroups.com; Tue, 12 Oct 2004 10:42:33 +0200
Received: from 213.80-202-70.nextgentel.com ([80.202.70.213] helo=[192.168.1.103])
by smtp.uio.no with asmtp (SSLv3:RC4-MD5:128)
(Exim 4.34)
id 1CHGiQ-0002xu-Gp
for pnfs-reqs@yahoogroups.com; Tue, 12 Oct 2004 09:04:58 +0200
To: pnfs-reqs@yahoogroups.com
In-Reply-To: <OFBE87E02F.2DBEF26E-ONC2256F2B.000F2D03-C2256F2B.0010E8ED@il.ibm.com>
References:
<OFBE87E02F.2DBEF26E-ONC2256F2B.000F2D03-C2256F2B.0010E8ED@il.ibm.com>
Content-Type: text/plain; charset=ISO-8859-1
Message-Id: <1097564692.5432.118.camel@lade.trondhjem.org>
Mime-Version: 1.0
X-Mailer: Ximian Evolution 1.4.6
Date: Tue, 12 Oct 2004 09:04:52 +0200
Content-Transfer-Encoding: quoted-printable
X-MailScanner-Information: This message has been scanned for viruses/spam. Contact postmaster@uio.no if you have questions about this scanning
X-UiO-MailScanner: No virus found
X-UiO-Spam-info: not spam, SpamAssassin (score=0, required 12)
X-eGroups-Remote-IP: 129.240.130.16
From: Trond Myklebust <trond.myklebust@fys.uio.no>
Subject: Re: [pnfs-reqs] Notes from Sept 30
X-Yahoo-Group-Post: member; u=194000208
X-Yahoo-Profile: trondmy

P� ty , 12/10/2004 klokka 05:04, skreiv Julian Satran:
> With NFS the problem is limited to "trust your own machine OS" at
> most. You do not place any trust in other clients.
> Authentication is per user and machine affiliation is a "local" issue.
> Introducing or relying on things like CLIENTID violates this model.

Why? CLIENTID is not a form of authentication even in the single client
case. In fact it carries NO security information whatsoever: it both
can, and usually is shared between processes that are using different
RPCSEC_GSS contexts on any given client.
The issue of determining whether or not that client is in fact a
clustered environment with more than one node goes way beyond the
current NFSv4 security models, and afaics will require significant
changes to those models (you'd need strong per-node security in addition
to the existing per-user).

Cheers,
Trond

From julian_satran@il.ibm.com Tue Oct 12 06:46:55 2004
Return-Path: <Julian_Satran@il.ibm.com>
X-Sender: Julian_Satran@il.ibm.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 41760 invoked from network); 12 Oct 2004 13:46:53 -0000
Received: from unknown (66.218.66.167)
by m22.grp.scd.yahoo.com with QMQP; 12 Oct 2004 13:46:53 -0000
Received: from unknown (HELO mtagate2.de.ibm.com) (195.212.29.151)
by mta6.grp.scd.yahoo.com with SMTP; 12 Oct 2004 13:46:53 -0000
Received: from d12nrmr1507.megacenter.de.ibm.com (d12nrmr1507.megacenter.de.ibm.com [9.149.167.1])
by mtagate2.de.ibm.com (8.12.10/8.12.10) with ESMTP id i9CDkpFW068544
for <pnfs-reqs@yahoogroups.com>; Tue, 12 Oct 2004 13:46:52 GMT
Received: from d12ml102.megacenter.de.ibm.com (d12av02.megacenter.de.ibm.com [9.149.165.228])
by d12nrmr1507.megacenter.de.ibm.com (8.12.10/NCO/VER6.6) with ESMTP id i9CDkpXV208866
for <pnfs-reqs@yahoogroups.com>; Tue, 12 Oct 2004 15:46:51 +0200
In-Reply-To: <1097564692.5432.118.camel@lade.trondhjem.org>
To: pnfs-reqs@yahoogroups.com
Cc: pnfs-reqs@yahoogroups.com
MIME-Version: 1.0
X-Mailer: Lotus Notes Build V70_M2_07222004 Beta 2NP July 22, 2004
Message-ID: <OF83270AF9.9AEEADC6-ONC2256F2B.0048837B-C2256F2B.004BB2CF@il.ibm.com>
Date: Tue, 12 Oct 2004 15:46:48 +0200
X-MIMETrack: Serialize by Router on D12ML102/12/M/IBM(Release 6.5.1| March 5, 2004) at
12/10/2004 15:46:50,
Serialize complete at 12/10/2004 15:46:50
Content-Type: multipart/alternative; boundary="=_alternative 00496A67C2256F2B_="
X-eGroups-Remote-IP: 195.212.29.151
From: Julian Satran <julian_satran@il.ibm.com>
Subject: Re: [pnfs-reqs] Notes from Sept 30
X-Yahoo-Group-Post: member; u=64714603
X-Yahoo-Profile: julian_satran

As far as I understand the current NFS model every RPC is authenticated (if the system is set to do so) and authentication is per user
and not per client. It is true that the client OS can forge user identities but that is acceptable (OS can access memory too) as it is limited to a client.

Across clients - that is not acceptable.

As for the issue Benny raised - I am aware that access to storage can't enforce credentials but as long as clients can be limited to specific volumes
we are within the limits of doable.

Allowing CLIENTID based authentication is not acceptable as client machines are not trusted and device map may end-up on the wrong machine and screw-up even the weak enforcement that the storage can do.

We should somewhat strengthen the LUN masking mechanism (this can be done within the block SCSI model) and not weaken it inadvertently.

Julo


Trond Myklebust <trond.myklebust@fys.uio.no>

12/10/04 09:04
Please respond to
pnfs-reqs@yahoogroups.com

	
To
	pnfs-reqs@yahoogroups.com
cc
	
Subject
	Re: [pnfs-reqs] Notes from Sept 30

	





P� ty , 12/10/2004 klokka 05:04, skreiv Julian Satran:
> With NFS the problem is limited to "trust your own machine OS" at
> most. You do not place any trust in other clients.
> Authentication is per user and machine affiliation is a "local" issue.
> Introducing or relying on things like CLIENTID violates this model.

Why? CLIENTID is not a form of authentication even in the single client
case. In fact it carries NO security information whatsoever: it both
can, and usually is shared between processes that are using different
RPCSEC_GSS contexts on any given client.
The issue of determining whether or not that client is in fact a
clustered environment with more than one node goes way beyond the
current NFSv4 security models, and afaics will require significant
changes to those models (you'd need strong per-node security in addition
to the existing per-user).

Cheers,
 Trond



------------------------ Yahoo! Groups Sponsor --------------------~-->
$9.95 domain names from Yahoo!. Register anything.
http://us.click.yahoo.com/J8kdrA/y20IAA/yQLSAA/W6uqlB/TM
--------------------------------------------------------------------~->


Yahoo! Groups Links

<*> To visit your group on the web, go to:
   http://groups.yahoo.com/group/pnfs-reqs/

<*> To unsubscribe from this group, send an email to:
   pnfs-reqs-unsubscribe@yahoogroups.com

<*> Your use of Yahoo! Groups is subject to:
   http://docs.yahoo.com/info/terms/

From trond.myklebust@fys.uio.no Tue Oct 12 08:13:47 2004
Return-Path: <trond.myklebust@fys.uio.no>
X-Sender: trond.myklebust@fys.uio.no
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 81149 invoked from network); 12 Oct 2004 15:13:45 -0000
Received: from unknown (66.218.66.166)
by m18.grp.scd.yahoo.com with QMQP; 12 Oct 2004 15:13:45 -0000
Received: from unknown (HELO pat.uio.no) (129.240.130.16)
by mta5.grp.scd.yahoo.com with SMTP; 12 Oct 2004 15:13:45 -0000
Received: from mail-mx6.uio.no ([129.240.10.47])
by pat.uio.no with esmtp (Exim 4.34)
id 1CHOLQ-0003Hj-G2
for pnfs-reqs@yahoogroups.com; Tue, 12 Oct 2004 17:13:44 +0200
Received: from 184.80-202-71.nextgentel.com ([80.202.71.184] helo=[192.168.1.101])
by smtp.uio.no with asmtp (SSLv3:RC4-MD5:128)
(Exim 4.34)
id 1CHOLN-00023K-IJ
for pnfs-reqs@yahoogroups.com; Tue, 12 Oct 2004 17:13:41 +0200
To: pnfs-reqs@yahoogroups.com
In-Reply-To: <OF83270AF9.9AEEADC6-ONC2256F2B.0048837B-C2256F2B.004BB2CF@il.ibm.com>
References:
<OF83270AF9.9AEEADC6-ONC2256F2B.0048837B-C2256F2B.004BB2CF@il.ibm.com>
Content-Type: text/plain; charset=iso-8859-1
Message-Id: <1097594015.5432.258.camel@lade.trondhjem.org>
Mime-Version: 1.0
X-Mailer: Ximian Evolution 1.4.6
Date: Tue, 12 Oct 2004 17:13:35 +0200
Content-Transfer-Encoding: quoted-printable
X-MailScanner-Information: This message has been scanned for viruses/spam. Contact postmaster@uio.no if you have questions about this scanning
X-UiO-MailScanner: No virus found
X-UiO-Spam-info: not spam, SpamAssassin (score=0, required 12)
X-eGroups-Remote-IP: 129.240.130.16
From: Trond Myklebust <trond.myklebust@fys.uio.no>
Subject: Re: [pnfs-reqs] Notes from Sept 30
X-Yahoo-Group-Post: member; u=194000208
X-Yahoo-Profile: trondmy

P� ty , 12/10/2004 klokka 15:46, skreiv Julian Satran:
> As far as I understand the current NFS model every RPC is
> authenticated (if the system is set to do so) and authentication is
> per user
> and not per client. It is true that the client OS can forge user
> identities but that is acceptable (OS can access memory too) as it is
> limited to a client.
>
> Across clients - that is not acceptable.

You are missing the point:

How do you define "across clients" in the above scenario?

AFAIK there is NOTHING in the RFC3530 which allows the server to assert
that "this information comes from client 1", whereas "that information
comes from client 2". There is NOTHING in those specs that defines some
physical boundaries beyond which certain kinds of information cannot be
shared.
All that the server may do, is assert that "this information has been
authenticated using user 1's RPCSEC_GSS context", or "this information
has been authenticated using user 2's RPCSEC_GSS context". As long as
that user is authenticated, then we trust the information. If the users
have chosen to share state together, then it is their choice, and it is
their responsability to ensure safe sharing of state information.

IOW: it is currently entirely up to the client to define its own
architecture. If it wants to define its own architecture as running
across an entire 10000 node cluster, then the NFS server does not have
any means to prevent that.


> Allowing CLIENTID based authentication is not acceptable as client
> machines are not trusted and device map may end-up on the wrong
> machine and screw-up even the weak enforcement that the storage can
> do.

Don't confuse authentication and authorization.

You can have a authorization policies that allow several authenticated
users to share the same device map.

For instance, delegations work because you authenticate on a per user
basis, but the authorization to use the delegation is granted to all
users that have access to that file (within the scope of the clientid).

Cheers,
Trond

From julian_satran@il.ibm.com Tue Oct 12 09:55:16 2004
Return-Path: <Julian_Satran@il.ibm.com>
X-Sender: Julian_Satran@il.ibm.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 44308 invoked from network); 12 Oct 2004 16:55:14 -0000
Received: from unknown (66.218.66.166)
by m25.grp.scd.yahoo.com with QMQP; 12 Oct 2004 16:55:14 -0000
Received: from unknown (HELO mtagate1.de.ibm.com) (195.212.29.150)
by mta5.grp.scd.yahoo.com with SMTP; 12 Oct 2004 16:55:13 -0000
Received: from d12nrmr1607.megacenter.de.ibm.com (d12nrmr1607.megacenter.de.ibm.com [9.149.167.49])
by mtagate1.de.ibm.com (8.12.10/8.12.10) with ESMTP id i9CGtDfQ126512
for <pnfs-reqs@yahoogroups.com>; Tue, 12 Oct 2004 16:55:13 GMT
Received: from d12ml102.megacenter.de.ibm.com (d12av02.megacenter.de.ibm.com [9.149.165.228])
by d12nrmr1607.megacenter.de.ibm.com (8.12.10/NCO/VER6.6) with ESMTP id i9CGtCXY103812
for <pnfs-reqs@yahoogroups.com>; Tue, 12 Oct 2004 18:55:12 +0200
In-Reply-To: <1097594015.5432.258.camel@lade.trondhjem.org>
To: pnfs-reqs@yahoogroups.com
Cc: pnfs-reqs@yahoogroups.com
MIME-Version: 1.0
X-Mailer: Lotus Notes Build V70_M2_07222004 Beta 2NP July 22, 2004
Message-ID: <OF71A9006A.189EA8C1-ONC2256F2B.005931AE-C2256F2B.005CF1B4@il.ibm.com>
Date: Tue, 12 Oct 2004 18:55:11 +0200
X-MIMETrack: Serialize by Router on D12ML102/12/M/IBM(Release 6.5.1| March 5, 2004) at
12/10/2004 18:55:12,
Serialize complete at 12/10/2004 18:55:12
Content-Type: multipart/alternative; boundary="=_alternative 0059B09CC2256F2B_="
X-eGroups-Remote-IP: 195.212.29.150
From: Julian Satran <julian_satran@il.ibm.com>
Subject: Re: [pnfs-reqs] Notes from Sept 30
X-Yahoo-Group-Post: member; u=64714603
X-Yahoo-Profile: julian_satran

ADVERTISEMENT
click here


Trond Myklebust <trond.myklebust@fys.uio.no> wrote on 12/10/2004 17:13:35:

>
> P� ty , 12/10/2004 klokka 15:46, skreiv Julian Satran:
> > As far as I understand the current NFS model every RPC is
> > authenticated (if the system is set to do so) and authentication is
> > per user
> > and not per client. It is true that the client OS can forge user
> > identities but that is acceptable (OS can access memory too) as it is
> > limited to a client.
> >
> > Across clients - that is not acceptable.
>
> You are missing the point:
>
>    How do you define "across clients" in the above scenario?
>

I don't think I am missing the point. Handing over authenticators is "legal" in any case.
The point is than an authenticator has to be handed over. And the authenticator is "secret"
in any of the protocols. Guessing it (it is ussualy large) is usually considered unfeasible.

The ClientID is not secret.

> AFAIK there is NOTHING in the RFC3530 which allows the server to assert
> that "this information comes from client 1", whereas "that information
> comes from client 2". There is NOTHING in those specs that defines some
> physical boundaries beyond which certain kinds of information cannot be
> shared.
> All that the server may do, is assert that "this information has been
> authenticated using user 1's RPCSEC_GSS context", or "this information
> has been authenticated using user 2's RPCSEC_GSS context". As long as
> that user is authenticated, then we trust the information. If the users
> have chosen to share state together, then it is their choice, and it is
> their responsability to ensure safe sharing of state information.
>
> IOW: it is currently entirely up to the client to define its own
> architecture. If it wants to define its own architecture as running
> across an entire 10000 node cluster, then the NFS server does not have
> any means to prevent that.
>
>
> > Allowing CLIENTID based authentication is not acceptable as client
> > machines are not trusted and device map may end-up on the wrong
> > machine and screw-up even the weak enforcement that the storage can
> > do.
>
> Don't confuse authentication and authorization.

I can hardly stand patronising remarks.

>
> You can have a authorization policies that allow several authenticated
> users to share the same device map.
>
> For instance, delegations work because you authenticate on a per user
> basis, but the authorization to use the delegation is granted to all
> users that have access to that file (within the scope of the clientid).
>
> Cheers,
>   Trond
>
>
>
> ------------------------ Yahoo! Groups Sponsor --------------------~-->
> $9.95 domain names from Yahoo!. Register anything.
> http://us.click.yahoo.com/J8kdrA/y20IAA/yQLSAA/W6uqlB/TM
> --------------------------------------------------------------------~->
>
>  
> Yahoo! Groups Links
>
> <*> To visit your group on the web, go to:
>     http://groups.yahoo.com/group/pnfs-reqs/
>
> <*> To unsubscribe from this group, send an email to:
>     pnfs-reqs-unsubscribe@yahoogroups.com
>
> <*> Your use of Yahoo! Groups is subject to:
>     http://docs.yahoo.com/info/terms/
>  
>
>
> 

From trond.myklebust@fys.uio.no Tue Oct 12 12:46:24 2004
Return-Path: <trond.myklebust@fys.uio.no>
X-Sender: trond.myklebust@fys.uio.no
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 27756 invoked from network); 12 Oct 2004 19:46:23 -0000
Received: from unknown (66.218.66.216)
by m24.grp.scd.yahoo.com with QMQP; 12 Oct 2004 19:46:23 -0000
Received: from unknown (HELO pat.uio.no) (129.240.130.16)
by mta1.grp.scd.yahoo.com with SMTP; 12 Oct 2004 19:46:22 -0000
Received: from mail-mx6.uio.no ([129.240.10.47])
by pat.uio.no with esmtp (Exim 4.34)
id 1CHSaQ-0004G9-89
for pnfs-reqs@yahoogroups.com; Tue, 12 Oct 2004 21:45:30 +0200
Received: from 184.80-202-71.nextgentel.com ([80.202.71.184] helo=[192.168.1.101])
by smtp.uio.no with asmtp (SSLv3:RC4-MD5:128)
(Exim 4.34)
id 1CHSaN-0001TG-6L
for pnfs-reqs@yahoogroups.com; Tue, 12 Oct 2004 21:45:27 +0200
To: pnfs-reqs@yahoogroups.com
In-Reply-To: <OF71A9006A.189EA8C1-ONC2256F2B.005931AE-C2256F2B.005CF1B4@il.ibm.com>
References:
<OF71A9006A.189EA8C1-ONC2256F2B.005931AE-C2256F2B.005CF1B4@il.ibm.com>
Content-Type: text/plain; charset=iso-8859-1
Message-Id: <1097610321.5432.439.camel@lade.trondhjem.org>
Mime-Version: 1.0
X-Mailer: Ximian Evolution 1.4.6
Date: Tue, 12 Oct 2004 21:45:21 +0200
Content-Transfer-Encoding: quoted-printable
X-MailScanner-Information: This message has been scanned for viruses/spam. Contact postmaster@uio.no if you have questions about this scanning
X-UiO-MailScanner: No virus found
X-UiO-Spam-info: not spam, SpamAssassin (score=0, required 12)
X-eGroups-Remote-IP: 129.240.130.16
From: Trond Myklebust <trond.myklebust@fys.uio.no>
Subject: Re: [pnfs-reqs] Notes from Sept 30
X-Yahoo-Group-Post: member; u=194000208
X-Yahoo-Profile: trondmy

ADVERTISEMENT
P� ty , 12/10/2004 klokka 18:55, skreiv Julian Satran:
>
> I don't think I am missing the point. Handing over authenticators is
> "legal" in any case.
> The point is than an authenticator has to be handed over. And the
> authenticator is "secret"
> in any of the protocols. Guessing it (it is ussualy large) is usually
> considered unfeasible.

> The ClientID is not secret.

No, but it could be made to provide basic machine authentication,
because the call to SETCLIENTID_CONFIRM does two things:

1) Causes the server to return a new clientid that is used as the
basis for the state.

2) The server looks at the RPC authentication, and derives a
principal, and an RPC security flavour, (and if the flavour is
RPCSEC_GSS - also a mechanism and a service name) and associates that
information to that clientid.

IOW it establishes a de-facto "machine credential" associated to that
particular clientid. If you were truly desperate you could use this to
authenticate clients (we are BTW already desperate enough to allow it
for RENEW and SETCLIENTID as you can see in the RENEW description on
page 201 of RFC3530).

---------

That said, I'm still failing to see why all the above is necessary.

IOW: Why do we need special server support with new SETCLIENTIDs, new
authentication schemes against the server etc in order to deal with the
MPI case?

AFAICS, the only purpose of introducing the MPI in the first place is to
_hide_ the details of the client's internal architecture from the
server, and to represent all N nodes as if they were 1 node trunked
across several transport mechanisms (or possibly even sharing a single
transport?). If it is not capable of doing this, then why even bother
looking at it?

With the current NFSv4, the only problem should be the lack of decent
support for trunking, and that is supposed to be solved by means of the
SESSION extensions in NFSv4.1.
Is pNFS adding something new that will break this?

Cheers,
Trond

From bhalevy@panasas.com Tue Oct 12 16:12:21 2004
Return-Path: <bhalevy@panasas.com>
X-Sender: bhalevy@panasas.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 44188 invoked from network); 12 Oct 2004 23:12:19 -0000
Received: from unknown (66.218.66.172)
by m17.grp.scd.yahoo.com with QMQP; 12 Oct 2004 23:12:19 -0000
Received: from unknown (HELO barrule.panasas.com) (65.194.124.178)
by mta4.grp.scd.yahoo.com with SMTP; 12 Oct 2004 23:12:19 -0000
Received: by barrule.panasas.com with Internet Mail Service (5.5.2653.19)
id <4S0FL48P>; Tue, 12 Oct 2004 19:12:18 -0400
Message-ID: <D72776FC4B13B64E9232562572AF292BF15203@barrule.panasas.com>
To: "'trond.myklebust@fys.uio.no'" <trond.myklebust@fys.uio.no>,
"'pnfs-reqs@yahoogroups.com'" <pnfs-reqs@yahoogroups.com>
Date: Tue, 12 Oct 2004 19:12:08 -0400
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2653.19)
Content-Type: text/plain;
charset="iso-8859-1"
Content-Transfer-Encoding: quoted-printable
X-eGroups-Remote-IP: 65.194.124.178
From: "Halevy, Benny" <bhalevy@panasas.com>
Subject: RE: [pnfs-reqs] Notes from Sept 30
X-Yahoo-Group-Post: member; u=169276676
X-Yahoo-Profile: benny_halevy

ADVERTISEMENT
Trond, I completely agree with you.
My approach was that to keep things simple
within the present NFSv4 spec, you need to model
the cluster of clients as a single NFSv4 client
sharing the same client id and NFSv4 state space.
The different instances can share secrets amongst
themselves and the server shouldn't care less.

With the new sessions model I think that channels
from multiple client hosts can be associated
with a single session in a way that's probably
indistinguishable from a multi-homed monolithic
client opening channels over many nic ports.

> Is pNFS adding something new that will break this?

pNFS shouldn't add anything new at the NFSv4 level
with respect to modeling the client. Security on the
client/ storage data path might be affected as you want many
client hosts to access storage using a copy of information
given to one of them (that performed the GETLAYOUT call).
If the security system is based on the client's host
address (e.g. MAC address) you're in trouble unless
all the clients can virtualize it or if you plan your
security tokens to support multi-path.

Benny

> -----Original Message-----
> From: Trond Myklebust [mailto:trond.myklebust@fys.uio.no]
> Sent: Tuesday, October 12, 2004 9:45 PM
> To: pnfs-reqs@yahoogroups.com
> Subject: Re: [pnfs-reqs] Notes from Sept 30
>
>
>
> P� ty , 12/10/2004 klokka 18:55, skreiv Julian Satran:
> >
> > I don't think I am missing the point. Handing over authenticators is
> > "legal" in any case.
> > The point is than an authenticator has to be handed over. And the
> > authenticator is "secret"
> > in any of the protocols. Guessing it (it is ussualy large)
> is usually
> > considered unfeasible.
>
> > The ClientID is not secret.
>
> No, but it could be made to provide basic machine authentication,
> because the call to SETCLIENTID_CONFIRM does two things:
>
> 1) Causes the server to return a new clientid that is used as the
> basis for the state.
>
> 2) The server looks at the RPC authentication, and derives a
> principal, and an RPC security flavour, (and if the flavour is
> RPCSEC_GSS - also a mechanism and a service name) and associates that
> information to that clientid.
>
> IOW it establishes a de-facto "machine credential" associated to that
> particular clientid. If you were truly desperate you could use this to
> authenticate clients (we are BTW already desperate enough to allow it
> for RENEW and SETCLIENTID as you can see in the RENEW description on
> page 201 of RFC3530).
>
> ---------
>
> That said, I'm still failing to see why all the above is necessary.
>
> IOW: Why do we need special server support with new SETCLIENTIDs, new
> authentication schemes against the server etc in order to
> deal with the
> MPI case?
>
> AFAICS, the only purpose of introducing the MPI in the first
> place is to
> _hide_ the details of the client's internal architecture from the
> server, and to represent all N nodes as if they were 1 node trunked
> across several transport mechanisms (or possibly even sharing a single
> transport?). If it is not capable of doing this, then why even bother
> looking at it?
>
> With the current NFSv4, the only problem should be the lack of decent
> support for trunking, and that is supposed to be solved by
> means of the
> SESSION extensions in NFSv4.1.
> Is pNFS adding something new that will break this?
>
> Cheers,
> Trond
>
>
>
> ------------------------ Yahoo! Groups Sponsor
> --------------------~-->
> Make a clean sweep of pop-up ads. Yahoo! Companion Toolbar.
> Now with Pop-Up Blocker. Get it for free!
> http://us.click.yahoo.com/L5YrjA/eSIIAA/yQLSAA/W6uqlB/TM
> --------------------------------------------------------------
> ------~->
>
>
> Yahoo! Groups Links
>
>
>
>
>
>
> 

From trond.myklebust@fys.uio.no Tue Oct 12 17:04:22 2004
Return-Path: <trond.myklebust@fys.uio.no>
X-Sender: trond.myklebust@fys.uio.no
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 94851 invoked from network); 13 Oct 2004 00:04:19 -0000
Received: from unknown (66.218.66.217)
by m1.grp.scd.yahoo.com with QMQP; 13 Oct 2004 00:04:19 -0000
Received: from unknown (HELO pat.uio.no) (129.240.130.16)
by mta2.grp.scd.yahoo.com with SMTP; 13 Oct 2004 00:04:18 -0000
Received: from mail-mx6.uio.no ([129.240.10.47])
by pat.uio.no with esmtp (Exim 4.34)
id 1CHWcg-0005jJ-7N
for pnfs-reqs@yahoogroups.com; Wed, 13 Oct 2004 02:04:06 +0200
Received: from 184.80-202-71.nextgentel.com ([80.202.71.184] helo=[192.168.1.101])
by smtp.uio.no with asmtp (SSLv3:RC4-MD5:128)
(Exim 4.34)
id 1CHWcc-0003zG-TG
for pnfs-reqs@yahoogroups.com; Wed, 13 Oct 2004 02:04:02 +0200
To: pnfs-reqs@yahoogroups.com
In-Reply-To: <D72776FC4B13B64E9232562572AF292BF15203@barrule.panasas.com>
References: <D72776FC4B13B64E9232562572AF292BF15203@barrule.panasas.com>
Content-Type: text/plain; charset=iso-8859-1
Message-Id: <1097625836.30356.33.camel@lade.trondhjem.org>
Mime-Version: 1.0
X-Mailer: Ximian Evolution 1.4.6
Date: Wed, 13 Oct 2004 02:03:56 +0200
Content-Transfer-Encoding: quoted-printable
X-MailScanner-Information: This message has been scanned for viruses/spam. Contact postmaster@uio.no if you have questions about this scanning
X-UiO-MailScanner: No virus found
X-UiO-Spam-info: not spam, SpamAssassin (score=0, required 12)
X-eGroups-Remote-IP: 129.240.130.16
From: Trond Myklebust <trond.myklebust@fys.uio.no>
Subject: RE: [pnfs-reqs] Notes from Sept 30
X-Yahoo-Group-Post: member; u=194000208
X-Yahoo-Profile: trondmy

P� on , 13/10/2004 klokka 01:12, skreiv Halevy, Benny:
> My approach was that to keep things simple
> within the present NFSv4 spec, you need to model
> the cluster of clients as a single NFSv4 client
> sharing the same client id and NFSv4 state space.
> The different instances can share secrets amongst
> themselves and the server shouldn't care less.

Exactly. That is in practice what we are doing with single noded clients
too. The authentication is done at the per-user level, but the processes
are free to share data and state provided that they pay the necessary
lip service to ACCESS/OPEN/....

> > Is pNFS adding something new that will break this?
>
> pNFS shouldn't add anything new at the NFSv4 level
> with respect to modeling the client. Security on the
> client/ storage data path might be affected as you want many
> client hosts to access storage using a copy of information
> given to one of them (that performed the GETLAYOUT call).
> If the security system is based on the client's host
> address (e.g. MAC address) you're in trouble unless
> all the clients can virtualize it or if you plan your
> security tokens to support multi-path.

So you are saying that in order to support clustering, there should be
no need to mandate any special security mechanisms beyond the fact that
it should not rely on any features that are local as far as the cluster
is concerned (global cluster features OTOH should be OK)?

Cheers,
Trond

From bhalevy@panasas.com Wed Oct 13 13:58:26 2004
Return-Path: <bhalevy@panasas.com>
X-Sender: bhalevy@panasas.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 30537 invoked from network); 13 Oct 2004 20:58:24 -0000
Received: from unknown (66.218.66.166)
by m21.grp.scd.yahoo.com with QMQP; 13 Oct 2004 20:58:24 -0000
Received: from unknown (HELO barrule.panasas.com) (65.194.124.178)
by mta5.grp.scd.yahoo.com with SMTP; 13 Oct 2004 20:58:24 -0000
Received: by barrule.panasas.com with Internet Mail Service (5.5.2653.19)
id <4S0FLX9Q>; Wed, 13 Oct 2004 16:57:37 -0400
Message-ID: <D72776FC4B13B64E9232562572AF292BF15207@barrule.panasas.com>
To: "'trond.myklebust@fys.uio.no'" <trond.myklebust@fys.uio.no>
Cc: "'pnfs-reqs@yahoogroups.com'" <pnfs-reqs@yahoogroups.com>
Date: Wed, 13 Oct 2004 16:57:37 -0400
MIME-Version: 1.0
X-Mailer: Internet Mail Service (5.5.2653.19)
Content-Type: text/plain;
charset="iso-8859-1"
Content-Transfer-Encoding: quoted-printable
X-eGroups-Remote-IP: 65.194.124.178
From: "Halevy, Benny" <bhalevy@panasas.com>
Subject: RE: [pnfs-reqs] Notes from Sept 30
X-Yahoo-Group-Post: member; u=169276676
X-Yahoo-Profile: benny_halevy

> So you are saying that in order to support clustering, there should be
> no need to mandate any special security mechanisms beyond the
> fact that
> it should not rely on any features that are local as far as
> the cluster
> is concerned (global cluster features OTOH should be OK)?

correct.

my assumptions regarding the client cluster are:
- each node is an independent NFSv4 client and
needs to open the shared file, or

- all nodes share the same NFSv4 session and therefore
are considered a single NFSv4 client. One node opens
the shared file and broadcasts/multicasts the stateid
and layout to other nodes.

my assumptions regarding storage-level security in pnfs are:
- security for file storage protocol is based on RPC
authentication & filesystem security. All instances
of the application need access to the shared file's
components. Volatile filehandles may be used as
capabilities as well (see object storage below).

- security for object storage protocol is based on
server-generated capabilities. These are not
client-host specific and can be used by any
holder of the capability (until expired/revoked).

- security for block protocol is based on SAN
configuration. All hosts should be allowed access
to the logical devices holding the data.

Benny

> -----Original Message-----
> From: Trond Myklebust [mailto:trond.myklebust@fys.uio.no]
> Sent: Wednesday, October 13, 2004 2:04 AM
> To: pnfs-reqs@yahoogroups.com
> Subject: RE: [pnfs-reqs] Notes from Sept 30
>
>
>
> P� on , 13/10/2004 klokka 01:12, skreiv Halevy, Benny:
> > My approach was that to keep things simple
> > within the present NFSv4 spec, you need to model
> > the cluster of clients as a single NFSv4 client
> > sharing the same client id and NFSv4 state space.
> > The different instances can share secrets amongst
> > themselves and the server shouldn't care less.
>
> Exactly. That is in practice what we are doing with single
> noded clients
> too. The authentication is done at the per-user level, but
> the processes
> are free to share data and state provided that they pay the necessary
> lip service to ACCESS/OPEN/....
>
> > > Is pNFS adding something new that will break this?
> >
> > pNFS shouldn't add anything new at the NFSv4 level
> > with respect to modeling the client. Security on the
> > client/ storage data path might be affected as you want many
> > client hosts to access storage using a copy of information
> > given to one of them (that performed the GETLAYOUT call).
> > If the security system is based on the client's host
> > address (e.g. MAC address) you're in trouble unless
> > all the clients can virtualize it or if you plan your
> > security tokens to support multi-path.
>
> So you are saying that in order to support clustering, there should be
> no need to mandate any special security mechanisms beyond the
> fact that
> it should not rely on any features that are local as far as
> the cluster
> is concerned (global cluster features OTOH should be OK)?
>
> Cheers,
> Trond
>
>
>
> ------------------------ Yahoo! Groups Sponsor
> --------------------~-->
> $9.95 domain names from Yahoo!. Register anything.
> http://us.click.yahoo.com/J8kdrA/y20IAA/yQLSAA/W6uqlB/TM
> --------------------------------------------------------------
> ------~->
>
>
> Yahoo! Groups Links
>
>
>
>
>
>
> 

From andros@citi.umich.edu Wed Oct 13 14:13:56 2004
Return-Path: <andros@citi.umich.edu>
X-Sender: andros@citi.umich.edu
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 46688 invoked from network); 13 Oct 2004 21:13:55 -0000
Received: from unknown (66.218.66.166)
by m22.grp.scd.yahoo.com with QMQP; 13 Oct 2004 21:13:55 -0000
Received: from unknown (HELO citi.umich.edu) (141.211.133.111)
by mta5.grp.scd.yahoo.com with SMTP; 13 Oct 2004 21:13:55 -0000
Received: from citi.umich.edu (citi.umich.edu [141.211.133.111])
by citi.umich.edu (Postfix) with ESMTP
id 6AFA01BB10; Wed, 13 Oct 2004 17:13:54 -0400 (EDT)
X-Mailer: exmh version 2.5 07/13/2001 with version: MH 6.8.3 #74[UCI]
To: pnfs-reqs@yahoogroups.com
Cc: "'trond.myklebust@fys.uio.no'" <trond.myklebust@fys.uio.no>,
andros@citi.umich.edu
In-reply-to: Your message of "Wed, 13 Oct 2004 16:57:37 EDT."
<D72776FC4B13B64E9232562572AF292BF15207@barrule.panasas.com>
Mime-Version: 1.0
Content-Type: text/plain; charset=iso-8859-1
Content-Transfer-Encoding: quoted-printable
Date: Wed, 13 Oct 2004 17:13:54 -0400
Message-Id: <20041013211354.6AFA01BB10@citi.umich.edu>
X-eGroups-Remote-IP: 141.211.133.111
From: "William A.(Andy) Adamson" <andros@citi.umich.edu>
Subject: Re: [pnfs-reqs] Notes from Sept 30
X-Yahoo-Group-Post: member; u=169434965

>
> > So you are saying that in order to support clustering, there should be
> > no need to mandate any special security mechanisms beyond the
> > fact that
> > it should not rely on any features that are local as far as
> > the cluster
> > is concerned (global cluster features OTOH should be OK)?
>
> correct.
>
> my assumptions regarding the client cluster are:
> - each node is an independent NFSv4 client and
> needs to open the shared file, or
>
> - all nodes share the same NFSv4 session and therefore
> are considered a single NFSv4 client. One node opens
> the shared file and broadcasts/multicasts the stateid
> and layout to other nodes.

i agree. the client to client communication and the potential user->kernel
interface to seed the pNFS client with the common state are not part of the
pNFS protocol.
>
> my assumptions regarding storage-level security in pnfs are:
> - security for file storage protocol is based on RPC
> authentication & filesystem security. All instances
> of the application need access to the shared file's
> components. Volatile filehandles may be used as
> capabilities as well (see object storage below).
>
> - security for object storage protocol is based on
> server-generated capabilities. These are not
> client-host specific and can be used by any
> holder of the capability (until expired/revoked).

perhaps a user's capabilities set needs to be mapped to gss_context by the
pNFS server so that when the gss_context expires, the capabilities could be
revoked

>
> - security for block protocol is based on SAN
> configuration. All hosts should be allowed access
> to the logical devices holding the data.


both OSD and block could rely on transport security (IPSEC) to gain security
properties. there is a draft(?) by mike eisler which describes a new gss
security mechanism that allows an established gss_context to use an existing
ipsec channel.

-->Andy

>
> Benny
>
> > -----Original Message-----
> > From: Trond Myklebust [mailto:trond.myklebust@fys.uio.no]
> > Sent: Wednesday, October 13, 2004 2:04 AM
> > To: pnfs-reqs@yahoogroups.com
> > Subject: RE: [pnfs-reqs] Notes from Sept 30
> >
> >
> >
> > P� on , 13/10/2004 klokka 01:12, skreiv Halevy, Benny:
> > > My approach was that to keep things simple
> > > within the present NFSv4 spec, you need to model
> > > the cluster of clients as a single NFSv4 client
> > > sharing the same client id and NFSv4 state space.
> > > The different instances can share secrets amongst
> > > themselves and the server shouldn't care less.
> >
> > Exactly. That is in practice what we are doing with single
> > noded clients
> > too. The authentication is done at the per-user level, but
> > the processes
> > are free to share data and state provided that they pay the necessary
> > lip service to ACCESS/OPEN/....
> >
> > > > Is pNFS adding something new that will break this?
> > >
> > > pNFS shouldn't add anything new at the NFSv4 level
> > > with respect to modeling the client. Security on the
> > > client/ storage data path might be affected as you want many
> > > client hosts to access storage using a copy of information
> > > given to one of them (that performed the GETLAYOUT call).
> > > If the security system is based on the client's host
> > > address (e.g. MAC address) you're in trouble unless
> > > all the clients can virtualize it or if you plan your
> > > security tokens to support multi-path.
> >
> > So you are saying that in order to support clustering, there should be
> > no need to mandate any special security mechanisms beyond the
> > fact that
> > it should not rely on any features that are local as far as
> > the cluster
> > is concerned (global cluster features OTOH should be OK)?
> >
> > Cheers,
> > Trond
> >
> >
> >
> > ------------------------ Yahoo! Groups Sponsor
> > --------------------~-->
> > $9.95 domain names from Yahoo!. Register anything.
> > http://us.click.yahoo.com/J8kdrA/y20IAA/yQLSAA/W6uqlB/TM
> > --------------------------------------------------------------
> > ------~->
> >
> >
> > Yahoo! Groups Links
> >
> >
> >
> >
> >
> >
> >
>
>
>
>
> Yahoo! Groups Links
>
>
>
>
>
>
>
> 

From garth@panasas.com Thu Oct 21 18:54:39 2004
Return-Path: <garth@panasas.com>
X-Sender: garth@panasas.com
X-Apparently-To: pnfs-reqs@yahoogroups.com
Received: (qmail 89004 invoked from network); 22 Oct 2004 01:54:38 -0000
Received: from unknown (66.218.66.218)
by m23.grp.scd.yahoo.com with QMQP; 22 Oct 2004 01:54:38 -0000
Received: from unknown (HELO PIKES.panasas.com) (65.194.124.178)
by mta3.grp.scd.yahoo.com with SMTP; 22 Oct 2004 01:54:38 -0000
Received: from [127.0.0.1] ([172.17.19.3]) by PIKES.panasas.com with SMTP (Microsoft Exchange Internet Mail Service Version 5.5.2653.13)
id 4S02HCYR; Thu, 21 Oct 2004 21:54:36 -0400
Mime-Version: 1.0 (Apple Message framework v619)
Content-Transfer-Encoding: 7bit
Message-Id: <517BAE19-23CD-11D9-A1F0-000A95A94F04@panasas.com>
Content-Type: text/plain; charset=US-ASCII; delsp=yes; format=flowed
To: pnfs-reqs@yahoogroups.com
Date: Thu, 21 Oct 2004 21:54:32 -0400
X-Mailer: Apple Mail (2.619)
X-eGroups-Remote-IP: 65.194.124.178
From: Garth Gibson <garth@panasas.com>
Subject: Time to shutdown this mailing list and move to IETF NFSv4
X-Yahoo-Group-Post: member; u=169457820
X-Yahoo-Profile: garth_a_gibson

pNFS Yahoo'rs,

I'm pleased to point you to the two new pNFS internet Drafts:

draft-gibson-pnfs-reqs-00.txt
http://www1.ietf.org/mail-archive/web/i-d-announce/current/
msg02567.html

draft-welch-pnfs-ops-00.txt
http://www1.ietf.org/mail-archive/web/i-d-announce/current/
msg02494.html

These are just first drafts of course, with many changes in their
future, and I hope you will participate in those changes. But in so
much as the IETF NFSv4 working group has invited us to move this
discussion into their ranks, we will no longer be working on pNFS
through these Yahoo groups mailing lists. From now on the email you
would have sent to the Yahoo mailing list should be sent to the IETF
nfsv4 mailing list, nfsv4@ietf.org. That means joining the list if you
are not already a member; instructions are available at
https://www1.ietf.org/mailman/listinfo/nfsv4

Welcome to the next step in pNFS.

garth

Assoc. Professor
Computer Science and Electrical and Computer Engineering Depts
Carnegie Mellon University
5000 Forbes Ave., Pittsburgh PA 15213-3891
garth.gibson@cs.cmu.edu, www.cs.cmu.edu/~garth

CTO, Panasas Inc
1501 Reedsdale St., Suite 400, Pittsburgh PA 15233
garth.gibson@panasas.com, www.panasas.com, 412-323-3500