SORT BY:

LIST ORDER
THREAD
AUTHOR
SUBJECT


SEARCH

IPS HOME


    [Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

    RE: iSCSI: more on StatRN



    Prasenjit,
    
    I was not trying to make a case for StatRN other than to indicate a possible
    use for faster recovery.  By implementing a more deterministic means of
    detecting a connection failure, many of the SCSI timeouts could be avoided
    should reconnection be relatively quick.  Much of this has to do with the
    overhead at making a connection in the first place of course.  Although you
    indicate SCSI layer error handling is not a problem, it depends on the
    application.  I prefer SCTP means of recovery to avoid much of this
    transport on a transport on a transport.
    
    Doug
    
    > Doug:
    >
    > Let me rephrase the question so that you can understand: for most
    > commands, the SCSI error handling mechanism will kick in before
    > the iSCSI stat rn mechanism. For the remianing few, there are other
    > mechanisms which make stat_rn redundant.
    >
    > Please make a case for stat_rn,
    >
    > Thanks,
    > Prasenjit
    >
    >
    >    Prasenjit Sarkar
    >    Research Staff Member
    >    IBM Almaden Research
    >    San Jose
    >
    >
    > "Douglas Otis" <dotis@sanlight.net>@ece.cmu.edu on 10/20/2000 11:25:26 AM
    >
    > Sent by:  owner-ips@ece.cmu.edu
    >
    >
    > To:   Prasenjit Sarkar/Almaden/IBM@IBMUS
    > cc:   "Randall R. Stewart" <randall@stewart.chicago.il.us>,
    >       <ips@ece.cmu.edu>
    > Subject:  RE: iSCSI: more on StatRN
    >
    >
    >
    > Prasenjit,
    >
    > I was not suggesting a change to the iSCSI by adding a means of detecting
    > failure within a deterministic fashion.  It would be simpler not
    > to involve
    > the SCSI layer in order to invoke a response. The use of StatRN is a less
    > deterministic means of detecting a connection failure using large
    > differences between StatRN and ExpStatRN.  Once a recovery mechanism is in
    > place, a starting point for a replay of status would use a reported
    > ExpStatRN, so I expect.  How long does the target wait for an initiator to
    > discover a problem and allow reconnection?  At this involves a substantial
    > amount of resources, these timeouts should be defined.  The OS should not
    > be
    > depended upon to indicate a TCP failure.
    >
    > Doug
    >
    > As the only cause for long delays in
    >      responses can be failed connections and received responses free-up
    >      resources, we felt that score boarding responses at the initiator
    >      could be accomplished by simple bitmaps and there is no need to
    >      flow-control responses. Status acknowledgment is done by the ini-
    >      tiator through ExpStatRN (Expected Status RN) and large difference
    >      between StatRN and ExpStatRN indicates a failed connection.
    > >
    > > I'm aware tape timeouts could exceed 3 minutes or an hour, but tape
    > > commands
    > > are highly causal and there are existing SCSI mechanisms to deal with
    > tape
    > > error recovery.  Also, there are ways in SCSI to report intermediate
    > > status, so you
    > > really dont need a hearbeat mechanism.
    > >
    > > I'm really trying to understand the motivation for stat_rn,
    > >
    > > Prasenjit
    > >
    > >
    > >    Prasenjit Sarkar
    > >    Research Staff Member
    > >    IBM Almaden Research
    > >    San Jose
    > >
    > >
    > > "Douglas Otis" <dotis@sanlight.net>@ece.cmu.edu on 10/20/2000
    > 09:36:55 AM
    > >
    > > Sent by:  owner-ips@ece.cmu.edu
    > >
    > >
    > > To:   Prasenjit Sarkar/Almaden/IBM@IBMUS, "Randall R. Stewart"
    > >       <randall@stewart.chicago.il.us>
    > > cc:   <ips@ece.cmu.edu>
    > > Subject:  RE: iSCSI: more on StatRN
    > >
    > >
    > >
    > > Prasenjit,
    > >
    > > The timeouts for streaming devices do range beyond 3 minutes.  There are
    > > also system parameters that affect the rate TCP time out as well.
    > >  In other
    > > words, do not expect either to timeout before the other.  With timeouts
    > in
    > > the 10 minute range, a heart-beat would be desired in the range of tens
    > of
    > > seconds if no other communications.  This should allow a
    > reasonably quick
    > > response to a network failure after several successive failed responses.
    > > In
    > > iSCSI speak, it could be an iSCSI version of Echo (ping).  SCTP has
    > > Heartbeat detection.
    > >
    > > Doug
    > >
    > > > The ballpark figure for SCSI varies but by 3 minutes you can be rest
    > > > assured that SCSI will give up on a command, and will have probably
    > > > issued a lun/target reset.
    > > >
    > > > I've other arguments against the stat_rn mechanism, but I'll wait till
    > > > this is resolved,
    > > >
    > > > Prasenjit
    > > >
    > > >    Prasenjit Sarkar
    > > >    Research Staff Member
    > > >    IBM Almaden Research
    > > >    San Jose
    > > >
    > > >
    > > > "Randall R. Stewart" <randall@stewart.chicago.il.us>@ece.cmu.edu on
    > > > 10/20/2000 06:01:55 AM
    > > >
    > > > Sent by:  owner-ips@ece.cmu.edu
    > > >
    > > >
    > > > To:   Prasenjit Sarkar/Almaden/IBM@IBMUS
    > > > cc:   ips@ece.cmu.edu
    > > > Subject:  Re: iSCSI: more on StatRN
    > > >
    > > >
    > > >
    > > > Prasenjit:
    > > >
    > > > Being a transportish geek I don't know what the "failure" time is
    > > > on SCSI... can you give a ball-park figure?
    > > >
    > > > Another thought on this issue, is if SCSI retransmits, when
    > > > it times out (I think it does??), this just adds more
    > > > to the queue of things in TCP that are attempting to be sent.
    > > >
    > > > On the TCP failure side, in most cases that I have seen
    > > > a TCP connection fail, I have always seen it around 3 minutes
    > > > or more before the failure was report...
    > > >
    > > > R
    > > >
    > > > Prasenjit Sarkar/Almaden/IBM wrote:
    > > > >
    > > > > If the time TCP takes to give up on a connection is more than the
    > time
    > > > SCSI
    > > > > takes
    > > > > to give up on a command, the stat_rn mechanism would not be useful.
    > > > >
    > > > > While I know the values for certain operating systems, I would like
    > to
    > > > hear
    > > > > from
    > > > > people who can assert confidently that the TCP fail
    > connection time <
    > > > SCSI
    > > > > command failure time.
    > > > >
    > > > > Prasenjit
    > > > >
    > > > >    Prasenjit Sarkar
    > > > >    Research Staff Member
    > > > >    IBM Almaden Research
    > > > >    San Jose
    > > > >
    > > > > "Mallikarjun C." <cbm@rose.hp.com>@ece.cmu.edu on
    > 10/19/2000 07:40:16
    > > PM
    > > > >
    > > > > Please respond to cbm@rose.hp.com
    > > > >
    > > > > Sent by:  owner-ips@ece.cmu.edu
    > > > >
    > > > > To:   ips@ece.cmu.edu
    > > > > cc:
    > > > > Subject:  Re: iSCSI: Question on StatRN usage
    > > > >
    > > > > Julian,
    > > > >
    > > > > Thanks for the clarifications, I am pleased to understand that
    > > > > there's no overloading of any reference #s - the usage of new
    > > > > term "DataRN" in your new draft makes it a lot clearer.
    > > > >
    > > > > Some comments.
    > > > >
    > > > > >Mallikarjun and Prasanjit,
    > > > > >
    > > > > >Sorry for the confusion.
    > > > > >
    > > > > >The text is confusing and I have corrected it the new text. StatRN
    > is
    > > > > >mandatory (it is the only way we have to ACK status and is
    > > not related
    > > > to
    > > > > >ordering).
    > > > >
    > > > > Eventhough StatRN itself may not be used by an initiator
    > for ordering
    > > > > (unless it
    > > > > wants to order completions, for whatever reason), StatRNs are
    > > > themseleves
    > > > > are in a monotonically increasing order.  It is helpful to
    > state this
    > > > > explicitly.
    > > > >
    > > > > >
    > > > > >As for the data the intent was to use StatRN to just number
    > > > data packets
    > > > > >for a given command (start with whatever you want) and have
    > > them acked
    > > > > with
    > > > > >a NOP with the same task tag (this is important for input data
    > > > for which
    > > > > we
    > > > > >have no other way of acking them). Those numbers are not related to
    > > the
    > > > > >Status numbers. No ordering or recovery is required up to command
    > > > restart.
    > > > > >I assume that numbers will not wrap unless a target sends more
    > blocks
    > > > than
    > > > > >bytes (and it can!) but even then
    > > > > >no harm is done.
    > > > > >At recovery the restarted command will be followed by a
    > NOP with the
    > > > same
    > > > > >initiator tag indicating what is the
    > > > > >the block expected. The initiator does not have to do any
    > > scoreboardong
    > >
    > > > > >only keep the counters.
    > > > > >The target can free early resources and iSCSI can recover eve long
    > > > reads.
    > > > > >For writes evidently R2T does the job but it means that
    > > write data can
    > > > be
    > > > > >recovered only with R2T.
    > > > >
    > > > > This implies that in case an iSCSI implementation is counting the #
    > of
    > > > > bytes transferred in/out during a task, it shall not assume
    > > an error if
    > > > > the count is the less than expected transfer size - if the retry bit
    > > > > was set (This is especially true for writes, where the initiator
    > > doesn't
    > > > > know from which point target starts issuing R2Ts).  I would suggest
    > > > adding
    > > > > this comment as well to enable better interoperability.
    > > > >
    > > > > >Should we overload on CmdRN/ExpCmdRN to shorten recovery? I don't
    > see
    > > a
    > > > > >need.
    > > > >
    > > > > NO, I don't see either.  My concern was that overloading these RNs
    > for
    > > > > data would become a scalability bottleneck, when a session
    > > > spans mulitple
    > > > > NICs.  I am glad that it's not what was intended.
    > > > >
    > > > > Comments on your next email:
    > > > >    >The NOP message PDUs are not associated with a task, are meant
    > for
    > > > >    >immediate delivery, and their only purpose is synchronizing the
    > > > > ordering
    > > > >    >registers of the target and initiator.
    > > > >
    > > > > I would like to point out that NOP PDUs are indeed associated with a
    > > > task!
    > > > > They are associated with a task whose read data they are
    > > ack'ing (given
    > > > > that the DataRN is only task-unique).  Also, I would like to point
    > out
    > > > > that the current definition of NOP payload does not have
    > > Initiator Task
    > > > Tag
    > > > > - it needs to be added.
    > > > >
    > > > > Thanks.
    > > > > --
    > > > > Mallikarjun
    > > > > M/S 5601
    > > > > Networked Storage Architecture
    > > > > HP Storage Organization
    > > > > Hewlett-Packard, Roseville.
    > > > > cbm@rose.hp.com
    > > > >
    > > > > phone: (916) 785-5621
    > > > > fax:   (916) 785-2875
    > > >
    > > > --
    > > > Randall R. Stewart
    > > > randall@stewart.chicago.il.us or rrs@cisco.com
    > > > 815-342-5222 (cell) 815-477-2127 (work)
    > > >
    > > >
    > > >
    > >
    > >
    > >
    > >
    >
    >
    >
    >
    
    


Home

Last updated: Tue Sep 04 01:06:36 2001
6315 messages in chronological order