[MarkLogic Dev General] Diacritic search
Gnana Arasan
gnanaarasanm at gmail.com
Tue Sep 30 12:08:09 PDT 2008
Hi,
We are inserting XML (UTF-8) content using
session.insertContent(uri, inputstream, options). By default the encoding
option is UTF-8 (ML version 3.5-2). For example, the person name josé is
stored. In cq, using doc(uri), the content appears as JosÃ(c).
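One possible explanation for the JosÃ(c) display (an assumption, not something confirmed for this setup) is that the UTF-8 bytes of é are decoded as ISO-8859-1 at some stage, either on insert or on display. The Java sketch below only illustrates that effect with the standard library. The class and method names are hypothetical, and it does not touch XCC or MarkLogic.

```java
import java.nio.charset.StandardCharsets;

// Hypothetical demo class: shows how UTF-8 bytes read as ISO-8859-1
// turn one accented character into two.
class MojibakeDemo {

    // Encode the text as UTF-8, then (wrongly) decode those bytes as ISO-8859-1.
    static String decodeUtf8AsLatin1(String text) {
        byte[] utf8 = text.getBytes(StandardCharsets.UTF_8);
        return new String(utf8, StandardCharsets.ISO_8859_1);
    }

    // Reverse the mistake: recover the original bytes and decode them as UTF-8.
    static String repair(String garbled) {
        byte[] original = garbled.getBytes(StandardCharsets.ISO_8859_1);
        return new String(original, StandardCharsets.UTF_8);
    }

    public static void main(String[] args) {
        String name = "Jos\u00e9";                    // "José", written with an escape
        String garbled = decodeUtf8AsLatin1(name);    // "Jos" + U+00C3 + U+00A9
        System.out.println(garbled.length());         // 5 characters instead of 4
        System.out.println(repair(garbled).equals(name));
    }
}
```

If the stored value really has five characters, it would be a different string from José for any query, which would fit the two distinct values seen in the lexicon.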
cts:element-value-match(xs:QName("first-name")
,"jos*","diacritic-sensitive")
For the above query I am getting two results, JosÃ(c) and José, instead of just José.
cts:element-value-match(xs:QName("first-name")
,"josé","diacritic-sensitive")
result -> josé
However:
cts:search(collection("myColl")//person,cts:element-value-query(
xs:QName("fdr:first-name"),cts:element-value-match(xs:QName("first-name")
,"José","diacritic-sensitive")))
result -> empty
cts:search(collection("myColl")//person,cts:element-value-query(
xs:QName("fdr:first-name"),cts:element-value-match(xs:QName("first-name")
,"JosÃ(c)","diacritic-sensitive")))
result -> person node
Am I missing some collation concept somewhere? In the admin configuration,
an element-range-index on first-name is added with the default collation
http://marklogic.com/collation/
Should diacritics be stored with a different collation? If so, which
collation should that be?
My aim is to store a name with a diacritic, like josé, and retrieve it as josé.
Thanks in advance,
Gnana Arasan.M