[MarkLogic Dev General] Data profiling on large datasets

Alex Jouravlev alexj at businessabstraction.com
Sat Mar 28 00:05:17 PDT 2015


Hi everybody,

I am trying to list all top-level element types using

> fn:distinct-values(/*[name()])


The database has about 400,000 documents, but only about a dozen top-level
element types.
The Query Console returns:
[1.0-ml] XDMP-EXPNTREECACHEFULL:
fn:distinct-values(fn:collection()//*[fn:name(.)]) -- Expanded tree cache
full on host hp5

I am running it on a Win8 laptop with 8 GB of RAM and 16 GB of paging space,
with plenty of free disk space. I have already expanded the tree cache to
8 GB, which is more than the data I have.

What am I missing?

Alex Jouravlev
Director, Business Abstraction Pty Ltd
Phone:       +61-(2)-8003-4830
Mobile:     +61-4-0408-3258
Web: http://www.businessabstraction.com
LinkedIn: http://au.linkedin.com/in/alexjouravlev/

