[MarkLogic Dev General] UTF -8 Encoding Exception
Geert Josten
geert.josten at dayon.nl
Mon Jun 25 23:26:46 PDT 2012
Hi Abhishek,
Did you try xdmp:unquote with repair-full option? There are also some
format option that might interest you.
http://community.marklogic.com/pubs/5.0/apidocs/Ext-5.html#xdmp:unquote
Kind regards,
Geert
*Van:* general-bounces at developer.marklogic.com [mailto:
general-bounces at developer.marklogic.com] *Namens *Abhishek53 S
*Verzonden:* maandag 25 juni 2012 16:01
*Aan:* MarkLogic Developer Discussion
*Onderwerp:* Re: [MarkLogic Dev General] UTF -8 Encoding Exception
Hi Geert,
Thanks for prompt reply! Is there any way to convert Non UTF 8 encoded file
to UTF -8 encoded through some different API? The downloaded text file has
invalid XML characters like  which needs to be pre-processed before
updating this to a XML file.
Thanks
Abhishek Srivastav
Systems Engineer
Tata Consultancy Services
Cell:- +91-9883389968
Mailto: abhishek53.s at tcs.com
Website: http://www.tcs.com
____________________________________________
Experience certainty. IT Services
Business Solutions
Outsourcing
____________________________________________
From:
Geert Josten <geert.josten at dayon.nl>
To:
MarkLogic Developer Discussion <general at developer.marklogic.com>
Date:
06/25/2012 06:41 PM
Subject:
Re: [MarkLogic Dev General] UTF -8 Encoding Exception
Sent by:
general-bounces at developer.marklogic.com
------------------------------
Hi Abhishek,
The encoding option is not to specify a target encoding for conversion, but
to specify the encoding of the file you try to download. So, you should
figure out which encoding file-location.txt itself has, and just specify
that..
Kind regards,
Geert
*Van:* general-bounces at developer.marklogic.com [mailto:
general-bounces at developer.marklogic.com] *Namens *Abhishek53 S*
Verzonden:* maandag 25 juni 2012 14:51*
Aan:* MarkLogic Developer Discussion*
Onderwerp:* [MarkLogic Dev General] UTF -8 Encoding Exception
Hi Folks,
I am having issue in downloading non UTF 8 encoded text file from file
server. I am using http-get method to download text files and then updating
the text inside XML documents.
How to convert non UTF 8 to UTF 8 encoded?
Sample Code
xdmp:http-get("file-location.txt",
<options xmlns="xdmp:document-get">
<encoding>utf-8</encoding>
</options>
)
Exception: XDMP-DOCUTF8SEQ: -- document is not UTF-8 encoded
Please let me know your suggestion
Thanks
Abhishek Srivastav
Systems Engineer
Tata Consultancy Services
Cell:- +91-9883389968
Mailto: abhishek53.s at tcs.com
Website: http://www.tcs.com
____________________________________________
Experience certainty. IT Services
Business Solutions
Outsourcing
____________________________________________
=====-----=====-----=====
Notice: The information contained in this e-mail
message and/or attachments to it may contain
confidential or privileged information. If you are
not the intended recipient, any dissemination, use,
review, distribution, printing or copying of the
information contained in this e-mail message
and/or attachments to it are strictly prohibited. If
you have received this communication in error,
please notify us by reply e-mail or telephone and
immediately and permanently delete the message
and any attachments. Thank you
_______________________________________________
General mailing list
General at developer.marklogic.com
http://community.marklogic.com/mailman/listinfo/general
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://community.marklogic.com/pipermail/general/attachments/20120626/46201f52/attachment-0001.html
More information about the General
mailing list