RE: iSCSI: more on StatRN

To: "Prasenjit Sarkar/Almaden/IBM" <psarkar@almaden.ibm.com>
Subject: RE: iSCSI: more on StatRN
From: "Douglas Otis" <dotis@sanlight.net>
Date: Fri, 20 Oct 2000 15:44:10 -0700
Cc: "Randall R. Stewart" <randall@stewart.chicago.il.us>, <ips@ece.cmu.edu>
Content-Transfer-Encoding: 7bit
Content-Type: text/plain;charset="iso-8859-1"
Importance: Normal
In-Reply-To: <OF5A1997F5.91B78793-ON8825697E.007BED78@LocalDomain>
Sender: owner-ips@ece.cmu.edu
Prasenjit,

I was not trying to make a case for StatRN other than to indicate a possible
use for faster recovery.  By implementing a more deterministic means of
detecting a connection failure, many of the SCSI timeouts could be avoided
should reconnection be relatively quick.  Much of this has to do with the
overhead at making a connection in the first place of course.  Although you
indicate SCSI layer error handling is not a problem, it depends on the
application.  I prefer SCTP means of recovery to avoid much of this
transport on a transport on a transport.

Doug

> Doug:
>
> Let me rephrase the question so that you can understand: for most
> commands, the SCSI error handling mechanism will kick in before
> the iSCSI stat rn mechanism. For the remianing few, there are other
> mechanisms which make stat_rn redundant.
>
> Please make a case for stat_rn,
>
> Thanks,
> Prasenjit
>
>
>    Prasenjit Sarkar
>    Research Staff Member
>    IBM Almaden Research
>    San Jose
>
>
> "Douglas Otis" <dotis@sanlight.net>@ece.cmu.edu on 10/20/2000 11:25:26 AM
>
> Sent by:  owner-ips@ece.cmu.edu
>
>
> To:   Prasenjit Sarkar/Almaden/IBM@IBMUS
> cc:   "Randall R. Stewart" <randall@stewart.chicago.il.us>,
>       <ips@ece.cmu.edu>
> Subject:  RE: iSCSI: more on StatRN
>
>
>
> Prasenjit,
>
> I was not suggesting a change to the iSCSI by adding a means of detecting
> failure within a deterministic fashion.  It would be simpler not
> to involve
> the SCSI layer in order to invoke a response. The use of StatRN is a less
> deterministic means of detecting a connection failure using large
> differences between StatRN and ExpStatRN.  Once a recovery mechanism is in
> place, a starting point for a replay of status would use a reported
> ExpStatRN, so I expect.  How long does the target wait for an initiator to
> discover a problem and allow reconnection?  At this involves a substantial
> amount of resources, these timeouts should be defined.  The OS should not
> be
> depended upon to indicate a TCP failure.
>
> Doug
>
> As the only cause for long delays in
>      responses can be failed connections and received responses free-up
>      resources, we felt that score boarding responses at the initiator
>      could be accomplished by simple bitmaps and there is no need to
>      flow-control responses. Status acknowledgment is done by the ini-
>      tiator through ExpStatRN (Expected Status RN) and large difference
>      between StatRN and ExpStatRN indicates a failed connection.
> >
> > I'm aware tape timeouts could exceed 3 minutes or an hour, but tape
> > commands
> > are highly causal and there are existing SCSI mechanisms to deal with
> tape
> > error recovery.  Also, there are ways in SCSI to report intermediate
> > status, so you
> > really dont need a hearbeat mechanism.
> >
> > I'm really trying to understand the motivation for stat_rn,
> >
> > Prasenjit
> >
> >
> >    Prasenjit Sarkar
> >    Research Staff Member
> >    IBM Almaden Research
> >    San Jose
> >
> >
> > "Douglas Otis" <dotis@sanlight.net>@ece.cmu.edu on 10/20/2000
> 09:36:55 AM
> >
> > Sent by:  owner-ips@ece.cmu.edu
> >
> >
> > To:   Prasenjit Sarkar/Almaden/IBM@IBMUS, "Randall R. Stewart"
> >       <randall@stewart.chicago.il.us>
> > cc:   <ips@ece.cmu.edu>
> > Subject:  RE: iSCSI: more on StatRN
> >
> >
> >
> > Prasenjit,
> >
> > The timeouts for streaming devices do range beyond 3 minutes.  There are
> > also system parameters that affect the rate TCP time out as well.
> >  In other
> > words, do not expect either to timeout before the other.  With timeouts
> in
> > the 10 minute range, a heart-beat would be desired in the range of tens
> of
> > seconds if no other communications.  This should allow a
> reasonably quick
> > response to a network failure after several successive failed responses.
> > In
> > iSCSI speak, it could be an iSCSI version of Echo (ping).  SCTP has
> > Heartbeat detection.
> >
> > Doug
> >
> > > The ballpark figure for SCSI varies but by 3 minutes you can be rest
> > > assured that SCSI will give up on a command, and will have probably
> > > issued a lun/target reset.
> > >
> > > I've other arguments against the stat_rn mechanism, but I'll wait till
> > > this is resolved,
> > >
> > > Prasenjit
> > >
> > >    Prasenjit Sarkar
> > >    Research Staff Member
> > >    IBM Almaden Research
> > >    San Jose
> > >
> > >
> > > "Randall R. Stewart" <randall@stewart.chicago.il.us>@ece.cmu.edu on
> > > 10/20/2000 06:01:55 AM
> > >
> > > Sent by:  owner-ips@ece.cmu.edu
> > >
> > >
> > > To:   Prasenjit Sarkar/Almaden/IBM@IBMUS
> > > cc:   ips@ece.cmu.edu
> > > Subject:  Re: iSCSI: more on StatRN
> > >
> > >
> > >
> > > Prasenjit:
> > >
> > > Being a transportish geek I don't know what the "failure" time is
> > > on SCSI... can you give a ball-park figure?
> > >
> > > Another thought on this issue, is if SCSI retransmits, when
> > > it times out (I think it does??), this just adds more
> > > to the queue of things in TCP that are attempting to be sent.
> > >
> > > On the TCP failure side, in most cases that I have seen
> > > a TCP connection fail, I have always seen it around 3 minutes
> > > or more before the failure was report...
> > >
> > > R
> > >
> > > Prasenjit Sarkar/Almaden/IBM wrote:
> > > >
> > > > If the time TCP takes to give up on a connection is more than the
> time
> > > SCSI
> > > > takes
> > > > to give up on a command, the stat_rn mechanism would not be useful.
> > > >
> > > > While I know the values for certain operating systems, I would like
> to
> > > hear
> > > > from
> > > > people who can assert confidently that the TCP fail
> connection time <
> > > SCSI
> > > > command failure time.
> > > >
> > > > Prasenjit
> > > >
> > > >    Prasenjit Sarkar
> > > >    Research Staff Member
> > > >    IBM Almaden Research
> > > >    San Jose
> > > >
> > > > "Mallikarjun C." <cbm@rose.hp.com>@ece.cmu.edu on
> 10/19/2000 07:40:16
> > PM
> > > >
> > > > Please respond to cbm@rose.hp.com
> > > >
> > > > Sent by:  owner-ips@ece.cmu.edu
> > > >
> > > > To:   ips@ece.cmu.edu
> > > > cc:
> > > > Subject:  Re: iSCSI: Question on StatRN usage
> > > >
> > > > Julian,
> > > >
> > > > Thanks for the clarifications, I am pleased to understand that
> > > > there's no overloading of any reference #s - the usage of new
> > > > term "DataRN" in your new draft makes it a lot clearer.
> > > >
> > > > Some comments.
> > > >
> > > > >Mallikarjun and Prasanjit,
> > > > >
> > > > >Sorry for the confusion.
> > > > >
> > > > >The text is confusing and I have corrected it the new text. StatRN
> is
> > > > >mandatory (it is the only way we have to ACK status and is
> > not related
> > > to
> > > > >ordering).
> > > >
> > > > Eventhough StatRN itself may not be used by an initiator
> for ordering
> > > > (unless it
> > > > wants to order completions, for whatever reason), StatRNs are
> > > themseleves
> > > > are in a monotonically increasing order.  It is helpful to
> state this
> > > > explicitly.
> > > >
> > > > >
> > > > >As for the data the intent was to use StatRN to just number
> > > data packets
> > > > >for a given command (start with whatever you want) and have
> > them acked
> > > > with
> > > > >a NOP with the same task tag (this is important for input data
> > > for which
> > > > we
> > > > >have no other way of acking them). Those numbers are not related to
> > the
> > > > >Status numbers. No ordering or recovery is required up to command
> > > restart.
> > > > >I assume that numbers will not wrap unless a target sends more
> blocks
> > > than
> > > > >bytes (and it can!) but even then
> > > > >no harm is done.
> > > > >At recovery the restarted command will be followed by a
> NOP with the
> > > same
> > > > >initiator tag indicating what is the
> > > > >the block expected. The initiator does not have to do any
> > scoreboardong
> >
> > > > >only keep the counters.
> > > > >The target can free early resources and iSCSI can recover eve long
> > > reads.
> > > > >For writes evidently R2T does the job but it means that
> > write data can
> > > be
> > > > >recovered only with R2T.
> > > >
> > > > This implies that in case an iSCSI implementation is counting the #
> of
> > > > bytes transferred in/out during a task, it shall not assume
> > an error if
> > > > the count is the less than expected transfer size - if the retry bit
> > > > was set (This is especially true for writes, where the initiator
> > doesn't
> > > > know from which point target starts issuing R2Ts).  I would suggest
> > > adding
> > > > this comment as well to enable better interoperability.
> > > >
> > > > >Should we overload on CmdRN/ExpCmdRN to shorten recovery? I don't
> see
> > a
> > > > >need.
> > > >
> > > > NO, I don't see either.  My concern was that overloading these RNs
> for
> > > > data would become a scalability bottleneck, when a session
> > > spans mulitple
> > > > NICs.  I am glad that it's not what was intended.
> > > >
> > > > Comments on your next email:
> > > >    >The NOP message PDUs are not associated with a task, are meant
> for
> > > >    >immediate delivery, and their only purpose is synchronizing the
> > > > ordering
> > > >    >registers of the target and initiator.
> > > >
> > > > I would like to point out that NOP PDUs are indeed associated with a
> > > task!
> > > > They are associated with a task whose read data they are
> > ack'ing (given
> > > > that the DataRN is only task-unique).  Also, I would like to point
> out
> > > > that the current definition of NOP payload does not have
> > Initiator Task
> > > Tag
> > > > - it needs to be added.
> > > >
> > > > Thanks.
> > > > --
> > > > Mallikarjun
> > > > M/S 5601
> > > > Networked Storage Architecture
> > > > HP Storage Organization
> > > > Hewlett-Packard, Roseville.
> > > > cbm@rose.hp.com
> > > >
> > > > phone: (916) 785-5621
> > > > fax:   (916) 785-2875
> > >
> > > --
> > > Randall R. Stewart
> > > randall@stewart.chicago.il.us or rrs@cisco.com
> > > 815-342-5222 (cell) 815-477-2127 (work)
> > >
> > >
> > >
> >
> >
> >
> >
>
>
>
>
References:
- RE: iSCSI: more on StatRN
  - From: "Prasenjit Sarkar/Almaden/IBM" <psarkar@almaden.ibm.com>
Prev by Date: RE: iSCSI: more on StatRN
Next by Date: Re: iSCSI: more on StatRN
Prev by thread: RE: iSCSI: more on StatRN
Next by thread: Re: iSCSI: more on StatRN
Index(es):
- Date
- Thread
Home
Last updated: Tue Sep 04 01:06:36 2001
6315 messages in chronological order