[MarkLogic Dev General] Interpretation of URI in document creation

Schouten, Edgar J. (RB-NL) Edgar.Schouten at reedbusiness.nl
Mon Mar 10 23:28:12 PST 2008


Hi Colin,
 
Good question. I've inserted a variety of documents with an URI based on some tag in the document (the <title>-tag, I think) and I actually ended up having carriage-return characters in some URI's.
 
Regards
EdgarS

________________________________

Van: general-bounces at developer.marklogic.com [mailto:general-bounces at developer.marklogic.com] Namens Gerety, Colin
Verzonden: maandag 10 maart 2008 18:45
Aan: General Mark Logic Developer Discussion
Onderwerp: [MarkLogic Dev General] Interpretation of URI in document creation



  I'm a little unclear about what is allowed as a document URI in Mark Logic.

 

  The XQuery/XSchema/XLink specifications indicate to me that the URI passed to XQuery functions like fn:doc and xdmp:document-insert should be a URI as defined in RFC2396 including % escaping of non-ascii characters.   This does not appear to be what Mark Logic is doing.

 

   Mark Logic seems to interpret the URI string without any escaping so that % sequences are just taken as characters (not interpreted as escapes) and that non-ascii characters, like ü are legal in the URI.   Experimentation indicates that even some of the excluded characters from RFC 2396 ("<" | ">" | "#" | "%" | <">

 ) are allowed.  In fact, the only disallowed character I have found is "#".  

 

   Is this correct and, if so, does that mean that a Mark Logic URI can contain any (printable?) Unicode character except the "#"?

 

Colin Gerety
Flatirons Solutions Corporation
(970) 980-5372 (cell)
Colin.Gerety at FlatironsSolutions.com

 

-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://xqzone.marklogic.com/pipermail/general/attachments/20080311/7cde201a/attachment.html


More information about the General mailing list