[Corona] extractContent

Clark Richey Clark.Richey at marklogic.com
Mon Dec 12 12:55:32 PST 2011


This isn't Corona its the ISYS filters that are transforming the binary content. This won't cause an error as the elements aren't typed per se. However, if you wanted to apply a dateTime range index then yes, you might need to normalize the values. 

Sent from my iPhone

On Dec 12, 2011, at 15:42, "Scott Conroy" <conroys at avalonconsult.com> wrote:

> Can someone verify that the extractContent capability for binary files
> is a bit hit-or-miss when it comes to element naming and type?
> 
> For example, for PDF's, I get:
> 
> <corona:modDate>2011/09/20 00:03:43Z</corona:modDate>
> 
> For a Word file, I get:
> 
> <corona:lastSavedDate>2011-11-28T18:25:00Z</corona:lastSavedDate>
> 
> I believe the first of those two example will error with an invalid
> cast as xs:date, though I didn't check.
> 
> I imagine the transform issue is because of the underlying library
> rather than Corona itself.
> _______________________________________________
> Corona mailing list
> Corona at developer.marklogic.com
> http://developer.marklogic.com/mailman/listinfo/corona


More information about the Corona mailing list