[MarkLogic Dev General] Query weights
Darin McBeath
ddmcbeath at yahoo.com
Tue May 8 05:02:27 PDT 2007
You might want to try cts:element-word-query ...
depending on what indexes you have enabled, TF and IDF
will be based only on the QName (and specified
phrase-through descendants).
For cts:element-query, TF and IDF are based on the
entire document (fragment).
At least that is my understanding.
Darin.
--- Peter Hickman <peter.hickman at semantico.com> wrote:
> Michael Blakeley wrote:
> > I think the interesting point for your question is
> that scores are
> > calculated based on inverse document frequency
> (IDF) as well as term
> > frequency (TF). If that doesn't suit your
> application, you can choose
> > an alternative scoring technique: try score-logtf,
> or score-simple, as
> > options to cts:search() -
> >
>
http://developer.marklogic.com/pubs/3.1/apidocs/SearchBuiltins.html#search
>
> > has more information.
> >
>
> Sorry for the lateness of the reply (stuff cropping
> up at home, then a
> bank holiday, etc etc etc). I have tried
> "score-logtfidf",
> "score-logtf", and even "score-simple". And although
> they change the
> scores they do not seem to change the ordering. The
> problem as I see it
> is that a weighting that applies to documents that
> matched on dc:title
> seems to be applied to documents that do not match
> the dc:title. Given,
>
>
cts:element-query(xs:QName("dc:title"),cts:word-query("bach",(),16)),
>
cts:element-query(xs:QName("opp:body"),cts:word-query("bach"))
>
> in a cts:or-query should boost the score of
> documents that match "bach"
> in the dc:title element and not boost the score for
> documents that do
> not. However the examples show that the score of
> documents that do not
> have "bach" in the dc:title element are being
> boosted along with those
> that do. This is confusing and makes me feel like I
> have no idea as to
> what is going on. My understanding is that results
> 11 and 12 should not
> have been boosted, but they were. I need to know why
> if I am to make use
> of this facility.
>
> --
> Peter Hickman.
>
> Semantico, Lees House, 21-23 Dyke Road, Brighton BN1
> 3FE
> t: 01273 722222
> f: 01273 723232
> e: peter.hickman at semantico.com
> w: www.semantico.com
>
> _______________________________________________
> General mailing list
> General at developer.marklogic.com
> http://xqzone.com/mailman/listinfo/general
>
More information about the General
mailing list