[MarkLogic Dev General] RE: General Digest, Vol 47, Issue 12
Christopher Welch
Christopher.Welch at marklogic.com
Fri May 16 08:51:17 PDT 2008
A general approach to this problem is to write an XQuery to build a new dictionary XML file using the words in the word lexicon, and then point your config file to your custom dictionary file.
Cheers!
Chris
-----Original Message-----
From: general-bounces at developer.marklogic.com [mailto:general-bounces at developer.marklogic.com] On Behalf Of Mariano Grau Calín
Sent: Friday, May 16, 2008 3:42 AM
To: General Mark Logic Developer Discussion
Subject: RE: [MarkLogic Dev General] RE: General Digest, Vol 47, Issue 12
Hello Chris:
I have enabled word lexicon in my database and load a dictionary based in it.
The problem is there is a lot of terms are not really spanish words.
I only want that marklogic suggest words when my search text contain words that not belong to spanish dictionary.
Thanks,
Mariano
-----Mensaje original-----
De: general-bounces at developer.marklogic.com [mailto:general-bounces at developer.marklogic.com] En nombre de Christopher Welch
Enviado el: jueves, 15 de mayo de 2008 17:46
Para: General Mark Logic Developer Discussion
Asunto: RE: [MarkLogic Dev General] RE: General Digest, Vol 47, Issue 12
Hi Mariano,
lib-search is a base library that you can customize to suit your needs so although there is no built-in functionality for word lexicons within lib-search, let's discuss how you would like to use word lexicons in your application.
Can you provide a bit more information about how you would like to use word lexicons? Are you building type-ahead search, or analyzing words in a set of search results? Maybe we can help walk you through.
Cheers!
Chris
-----Original Message-----
From: general-bounces at developer.marklogic.com [mailto:general-bounces at developer.marklogic.com] On Behalf Of Mariano Grau Calín
Sent: Thursday, May 15, 2008 8:33 AM
To: General Mark Logic Developer Discussion
Subject: RE: [MarkLogic Dev General] RE: General Digest, Vol 47, Issue 12
Hello,
How can i use the Word lexicon of my database in lib-search?
I have configured config section in lib-search-custom.xqy with uri of sample dictionary and work well. Is there a built-in uri for word lexicon of my database?
Thanks,
Mariano Grau
-----Mensaje original-----
De: general-bounces at developer.marklogic.com [mailto:general-bounces at developer.marklogic.com] En nombre de Kelly Stirman
Enviado el: miércoles, 14 de mayo de 2008 21:19
Para: general at developer.marklogic.com
Asunto: [MarkLogic Dev General] RE: General Digest, Vol 47, Issue 12
The spell correction functionality in MarkLogic employs the Double
Metaphone algorithm:
http://en.wikipedia.org/wiki/Double_Metaphone
This is a more modern and more sophisticated approach to phonetic
matches than soundex.
You can load one of the sample dictionaries on the developer site, your
own, or use the word lexicon of your database to generate a list of
terms that exist across your documents.
Kelly
-----Original Message-----
From: general-bounces at developer.marklogic.com
[mailto:general-bounces at developer.marklogic.com] On Behalf Of
general-request at developer.marklogic.com
Sent: Wednesday, May 14, 2008 3:00 PM
To: general at developer.marklogic.com
Subject: General Digest, Vol 47, Issue 12
Send General mailing list submissions to
general at developer.marklogic.com
To subscribe or unsubscribe via the World Wide Web, visit
http://xqzone.com/mailman/listinfo/general
or, via email, send a message with subject or body 'help' to
general-request at developer.marklogic.com
You can reach the person managing the list at
general-owner at developer.marklogic.com
When replying, please edit your Subject line so it is more specific
than "Re: Contents of General digest..."
Today's Topics:
1. what is marklogic (Vikash Ranjan)
2. Fuzzy and/or phonetic searching (Steve Mallen)
----------------------------------------------------------------------
Message: 1
Date: Wed, 14 May 2008 15:05:28 +0530
From: "Vikash Ranjan" <ranjanvikash07 at gmail.com>
Subject: [MarkLogic Dev General] what is marklogic
To: general at developer.marklogic.com
Message-ID:
<65a592e0805140235g772b48fck3a58f39135cba67d at mail.gmail.com>
Content-Type: text/plain; charset="iso-8859-1"
Hi, I am new to this Marklogic what to know some more information
regarding
Marklogic. Please anyone let me know.
Thanks.
-------------- next part --------------
An HTML attachment was scrubbed...
URL:
http://xqzone.marklogic.com/pipermail/general/attachments/20080514/5e038
076/attachment-0001.html
------------------------------
Message: 2
Date: Wed, 14 May 2008 15:53:49 +0100
From: Steve Mallen <Steve.Mallen at semantico.com>
Subject: [MarkLogic Dev General] Fuzzy and/or phonetic searching
To: general at developer.marklogic.com
Message-ID: <482AFCFD.7050804 at semantico.com>
Content-Type: text/plain; charset=ISO-8859-1; format=flowed
Hi folks,
I've been looking through the developer docs to try to find out if I can
do fuzzy searching or any type of phonetic searching in XQuery with Mark
Logic.
Does anyone know if there any functions to determine similarities and
distance between strings - e.g. soundex, levenstein, metaphone?
Specifically, I'd like to be able to do lucene-style fuzzy searches
based on levenstein distance (for example, in Lucene, a search for
"roam~" will find words like "foam" and "roams"). The spellcheck module
looks like it does something similar, but I'm not sure what the
implementation is based on? How does it find words from a dictionary
that are spelt similarly to the search term? Is there any developer
control over this?
I'd also like to be able to do phonetic searches, so that, for example,
a search for "fiziks" would match "physics" since they are phonetically
similar. A few relational databases support "soundex" searches, and
SOLR supports the use of various phonetic transcription algorithms. I
guess that I could create an index of phonetic transcriptions during
content load, and do lookups based on that, but it would be good if
there was something I could use 'out-of-the-box'.
Could anyone shed any light on this?
Many thanks,
-Steve
------------------------------
_______________________________________________
General mailing list
General at developer.marklogic.com
http://xqzone.com/mailman/listinfo/general
End of General Digest, Vol 47, Issue 12
***************************************
_______________________________________________
General mailing list
General at developer.marklogic.com
http://xqzone.com/mailman/listinfo/general
_______________________________________________
General mailing list
General at developer.marklogic.com
http://xqzone.com/mailman/listinfo/general
_______________________________________________
General mailing list
General at developer.marklogic.com
http://xqzone.com/mailman/listinfo/general
_______________________________________________
General mailing list
General at developer.marklogic.com
http://xqzone.com/mailman/listinfo/general
More information about the General
mailing list