possible parsing problem with OSI-DS 23 (Distinguished Names)

Allan Cargille <Allan.Cargille@cs.wisc.edu> Wed, 03 February 1993 17:23 UTC

Received: from ietf.nri.reston.va.us by IETF.CNRI.Reston.VA.US id aa07594; 3 Feb 93 12:23 EST
Received: from CNRI.RESTON.VA.US by IETF.CNRI.Reston.VA.US id aa07589; 3 Feb 93 12:23 EST
Received: from haig.cs.ucl.ac.uk by CNRI.Reston.VA.US id aa15068; 3 Feb 93 12:23 EST
X400-Received: by mta haig.cs.ucl.ac.uk in /PRMD=uk.ac/ADMD=gold 400/C=gb/; Relayed; Wed, 3 Feb 1993 17:04:47 +0000
Date: Wed, 3 Feb 1993 17:04:47 +0000
X400-Originator: osi-ds-request@cs.ucl.ac.uk
X400-Recipients: non-disclosure:;
X400-MTS-Identifier: [/PRMD=uk.ac/ADMD=gold 400/C=gb/; haig.cs.uc.281:03.01.93.17.04.47]
Priority: Non-Urgent
DL-Expansion-History: osi-ds@cs.ucl.ac.uk ; Wed, 3 Feb 1993 17:04:47 +0000;
Sender: ietf-archive-request@IETF.CNRI.Reston.VA.US
From: Allan Cargille <Allan.Cargille@cs.wisc.edu>
Message-ID: <930203105649*/G=Allan/S=Cargille/OU=cs/O=uw-madison/PRMD=xnren/C=us/@MHS>
To: s.kille@isode.com
Cc: osi-ds@cs.ucl.ac.uk, "Allan C." <Allan.Cargille@cs.wisc.edu>
Subject: possible parsing problem with OSI-DS 23 (Distinguished Names)

Hi Steve,

I think that the syntax presented in "A String Representation of
Distinguished Names" (OSI-DS 23 (v5) and draft-ietf-osids-distnames-05.txt)
needs to be modified slightly.  The question was raised in my mind when
I was considering how to parse such strings.

I think that the problem has to do with double quotation marks ("),
when this character occurs in a field value.  Obviously this would
only occur in bizarre cases.  There is nothing to distinguish whether
the double quotes are *part* of the value, or if they *terminate* the
value.  For example, I believe the following is ambiguous:

    (a) CN=joe, OU="ba"d"

or even worse,

    (b) CN=joe, OU="ba"d",OU="va"lue"

I believe that (b) could be legally parsed two ways, as two
attributes:

    CN=joe
    OU=ba"d",OU="val"ue
       ^^^^^^^^^^^^^^^^  where this is *one* string value

or as three attributes:

    CN=joe
    OU=ba"d
    OU=va"lue

If this is a problem, I thought of several solutions:

    1.  require that a double-quote mark in a value must always
	be included in <pair> notation (as \"\).  This would transform
	example (a) above to

	    CN=joe, OU="ba\"\d"

	I prefer this solution.  I don't think it would change the
	grammar much.

    2.  when a double-quote character is used in a value, require that
	two such characters be used.  This would transform example (a)
	above to

	    CN=joe, OU="ba""d"

Well, I just wanted to raise the question, and let the experts worry
about the solution.

Cheers,

allan
--
Allan Cargille                  Computer Sciences Department
Internet X.400 Project          University of Wisconsin-Madison
Voice +1 (608) 262-5084         1210 West Dayton Street
Fax   +1 (608) 262-9777         Madison, WI   53706   USA
Internet: cargille@cs.wisc.edu          
X.400:    S=Cargille; OU=cs; O=UW-Madison; PRMD=xnren; ADMD= ; C=us