SORT BY:

LIST ORDER
THREAD
AUTHOR
SUBJECT


SEARCH

IPS HOME


    [Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

    RE: iSCSI: DLB-T.30 (using SNACK w/ PDU size changes)




    I pointed out that for any of the issues raised I will say whatever I have to say on the list of issues.
    For the issue that you debate  happily I proposed a simple solution - and the text is on my site since Sunday or Monday.

    And BTW this was one of the countable few real issues on the list  - and would not categorize even this as major.

    Julo


    Black_David@emc.com
    Sent by: owner-ips@ece.cmu.edu

    07/10/2002 10:56 AM
    Please respond to Black_David

           
            To:        cbm@rose.hp.com, ips@ece.cmu.edu
            cc:        
            Subject:        RE: iSCSI: DLB-T.30 (using SNACK w/ PDU size changes)

           


    > You're right that there's a duplicate StatSN issue with the rev14 text.
    >
    > I believe that can be addressed by mandating that the initiators must
    > discard the first status PDU always for the task in question, and then
    issue
    > a follow-on status SNACK.  It may occasionally lead to discarding a good
    > status (with the right ExpDataSN that reflects the re-segmented DataSN
    > count), but that would anyway be recovered with the explicit follow-on
    status
    > SNACK.
    >
    > I don't believe we need to make BegRun=0 for these cases, nor do I believe
    > that we need a new type of data SNACK.

    Mallikarjun,

    I don't think your proposal works very well.  One problem is that
    the initiator may not know how many data PDUs it should have received
    (and hence know whether it's missing some and needs to issue a Data SNACK)
    until it processes the status PDU (e.g., suppose the last Data-In PDU
    doesn't show up).  This appears to require receiving the status
    PDU, processing it, and retroactively dropping it when discovering
    that not all the data has arrived.  That doesn't strike me as a
    good design (e.g., it prohibits an acknowledge via ExpStatSN before
    the ExpDataSN processing).

    There's also at least one weird corner case in here - suppose
    the initiator issues the Data SNACK and then changes
    MaxRecvDataSegmentLength before all the data from the SNACK has
    shown up.  Now the initiator would have to retroactively drop a status
    that it already acknowledged via ExpStatSN.  This is bad, and leads
    to one of several poor outcomes:

    - Target fails to retransmit the response because the reused
                    StatSN is less than ExpStatSN.  The initiator can't recover
                    because it can't issue a status SNACK for StatSN < ExpStatSN.
    - Target ignores ExpStatSN and sends the response anyway.  If
                    the response gets corrupted and has to be dropped, the
                    initiator again can't issue a status SNACK to recover.
    - Initiator sends a new value of ExpStatSN to the target that is
                    less than the target's current ExpStatSN.  That's a
                    serious protocol error.

    This is still an ugly corner case for a resegmenting Data SNACK
    because the size change results in the original Data SNACK stopping
    its transmission, and the initiator has to figure out that it needs
    to send a resegmenting Data SNACK - not unreasonable, as the initiator
    did change MaxRecvDataSegmentLength.  The alternative of the
    initiator not realizing the consequences of the size change and
    hence having the extra response show up and complete the wrong
    task courtesy of a reused Initiator Task Tag seems far worse.

    I'm also reminded of John Hufferd's warning that features inserted
    late in a design tend to be the most error prone.  This one (Data
    SNACK in the face of MaxRecvDataSegmentLength change) seems to have
    survived two attempts to fix it, so isolating it to its own separate
    type of Data SNACK is making more and more sense, lest it survive
    more attempts to fix it ... that way a failed fix won't
    break ordinary Data SNACKs.  The fact that there are potentially
    subtle problems here was also behind my suggestion that the only
    acceptable resegmenting Data SNACK should be "Send Everything" -
    this should be a relatively rare case, and hence a brute force
    simple robust design is in order, even if it's inefficient.

    Thanks,
    --David


    > -----Original Message-----
    > From: Mallikarjun C. [mailto:cbm@rose.hp.com]
    > Sent: Tuesday, July 09, 2002 6:49 PM
    > To: ips@ece.cmu.edu
    > Subject: iSCSI: DLB-T.30 (using SNACK w/ PDU size changes)
    >
    >

    > David,
    >
    > You're right that there's a duplicate StatSN issue with the rev14 text.
    >
    > I believe that can be addressed by mandating that the initiators must
    > discard the first status PDU always for the task in question, and then
    issue
    > a follow-on status SNACK.  It may occasionally lead to discarding a good
    > status (with the right ExpDataSN that reflects the re-segmented DataSN
    > count), but that would anyway be recovered with the explicit follow-on
    status
    > SNACK.
    >
    > I don't believe we need to make BegRun=0 for these cases, nor do I believe
    > that we need a new type of data SNACK.  
    >
    > Mallikarjun
    >
    > > [T.30] 9.16   SNACK Request
    > >
    > >    If the initiator MaxRecvDataSegmenTLength changed Data-In PDUs
    > >    requested with RunLength 0 (meaning all PDUs after this number) may
    > >    be different from the ones originally sent, in order to reflect
    > >    changes in MaxRecvDataSegmentLength. Their DataSN starts with the
    > >    requested number and is increased by 1 for each resent Data-In PDU.
    > >    If DataSN numbers change and a SCSI-Reponse PDU was sent reflecting
    > >    the DataSN before retransmission it MUST be resent to reflect the new

    > >    numbers.
    > >
    > > This was discussed on the list, but there are still some problems here:
    > > (1) If the MaxRecvDataSegmentLength has changed, the only valid Data
    > > SNACK is BegRun=0, RunLength=0 (i.e., resend everything).  Attempts
    > > to be more clever than this are an invitation to miscount Data-In
    > > PDUs and cause problems in the initiator.  Targets MUST reject
    > > all other Data SNACK requests in this situation.
    > > (2) The new SCSI-Response PDU needs a new StatSN to avoid the initiator
    > > discarding it as a duplicate.  Section 2.2.2.2 is silent on duplicate
    > > detection for StatSN, but discarding duplicates would be a reasonable
    > > thing for an initiator to do.
    > > (3) The initiator needs some way to know that a new response is coming,
    > > and specifically whether to expect one or two responses.  If it
    > > only expects one and two show up, the initiator could reuse the
    > > Task Tag once all the data arrives causing a race in which the
    > > new response could incorrectly complete an unrelated command
    > > (unlikely, but potentially nasty).
    > > This suggests calling out the <BegRun=0, RunLength=0> Data SNACK as
    having
    > > special behavior:
    > > - It may resegment Data-In PDUs to deal with MaxRecvDataSegmentLength.
    > > All other Data SNACK requests MUST NOT resegment.
    > > - It *always* generates a new SCSI Response due to the possibility
    > > of resegmentation.
    > > That's not a great solution, because if one ever sets <BegRun=0,
    > > RunLength=0> in a Data SNACK, the resulting behavior change is dramatic
    > > and unexpected.
    > > This leads to the final proposal:
    > > - Specify a new SNACK type code (3) for Resegmenting Data SNACK.  SNACK
    > > Data-In resegmentation is allowed only when this is used.  If
    > > resegmentation would be necessary for a Data SNACK (type 1),
    > > that SNACK MUST be rejected.
    > > - Both BegRun and RunLength MUST be zero for a Resegmenting Data
    > > SNACK, and (unlike reserved fields) these MUST be checked by
    > > the receiver (target).
    > > - A new SCSI Response is always generated as a result of a Resegmenting
    > > Data-In SNACK, and it has its own StatSN number to deal with the
    > > fact that the number of Data-In PDUs may have changed, causing
    > > a change to the ExpDataSN value.  This new response also needs
    > > to be marked to distinguish it from a response that may have
    > > been generated earlier (so the initiator knows to wait for the
    > > new response) - using a bit in the flags field for this seems
    > > wrong, so specifying a new Response code value (0x02 - see 9.4.3)
    > > seems like a reasonable way to accomplish this.
    > > - Data SNACK (type 1) now has consistent behavior - it MUST NOT
    resegment
    > > and MUST NOT generate a new SCSI response, ever.
    > > This approach also has the potentially useful property of making it easy
    > > to yank out the Resegmenting Data SNACK wart if we ever put restrictions
    > > on the interaction of MaxRecvDatasegmentLength and Data SNACKs (yes,
    that's
    > > a hint ... this has gotten messy enough that forbidding Data SNACKs when
    > > MaxRecvDataSegmentLength has changed needs to be considered as a
    possible
    > > alternative).
    > >
    > > This issue also affects some text in 9.16.3.
    >
    >  
    >




Home

Last updated: Wed Jul 10 15:18:54 2002
11248 messages in chronological order