[MarkLogic Dev General] Interpretation of URI in document creation
Gerety, Colin
Colin.Gerety at flatironssolutions.com
Mon Mar 10 09:45:12 PST 2008
I'm a little unclear about what is allowed as a document URI in Mark Logic.
The XQuery, XML Schema, and XLink specifications indicate to me that the URI passed to functions like fn:doc and xdmp:document-insert should be a URI as defined in RFC 2396, including % escaping of non-ASCII characters. This does not appear to be what Mark Logic is doing.
Mark Logic seems to interpret the URI string without any unescaping, so that % sequences are just taken as literal characters (not interpreted as escapes) and non-ASCII characters, like ü, are legal in the URI. Experimentation indicates that even some of the characters excluded by RFC 2396 ("<" | ">" | "#" | "%" | <">) are allowed. In fact, the only disallowed character I have found is "#".
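
To make the distinction concrete, here is a minimal Python sketch (not Mark Logic code) of the two interpretations. The sample URI and the dict standing in for the document store are assumptions made up for illustration; the only point is that a store which treats the URI as an opaque string sees the raw form and the RFC 2396 escaped form as two different keys.

```python
from urllib.parse import quote, unquote

# Hypothetical document URI containing a non-ASCII character, a space and a literal "%".
raw_uri = "/docs/müller 100%.xml"

# RFC 2396 style: non-ASCII and excluded characters are %-escaped (UTF-8 octets).
escaped_uri = quote(raw_uri, safe="/")

# Stand-in for a store that keys documents on the URI string exactly as given,
# without interpreting % sequences (the behaviour described above).
store = {raw_uri: "<doc/>"}

# The escaped form is a different string, so an opaque lookup does not find it.
found_by_escaped = escaped_uri in store

# Only after explicit unescaping do the two forms refer to the same key.
found_after_unescape = unquote(escaped_uri) in store

print(escaped_uri)
print(found_by_escaped, found_after_unescape)
```

If the server really does no unescaping, a client that %-escapes its URIs before calling xdmp:document-insert would have to use exactly the same escaped string with fn:doc to get the document back.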
Is this correct and, if so, does that mean that a Mark Logic URI can contain any (printable?) Unicode character except "#"?
Colin Gerety
Flatirons Solutions Corporation
(970) 980-5372 (cell)
Colin.Gerety at FlatironsSolutions.com