[MarkLogic Dev General] cpf question
Mike Sokolov
sokolov at ifactory.com
Mon Jul 21 07:46:00 PDT 2008
I have set up a very simple content processing pipeline; basically there
is a single action to be taken whenever any document is updated or
created. I've gotten this to work well enough when starting with an
empty database. However now I am in a state where I can't seem to get
my actions to be triggered by updating the already-existing documents.
I have two questions:
1) If you load all your documents, and then install cpf, load your
actions and set up a pipeline, will this trigger processing? Is there
some way to get documents to be processed at that point?
2) What does "TaskServer: state is already successor so not updating for
<uri>" mean? It sounds as if TaskServer is saying: hey I processed this
document already, so (even though you just updated it), I'm not going to
process it again. How can I set things up so updates work?
OK, third question: what is the recommended "best practice" for updating
all the documents, in order to get the maximum loading throughput. This
is in a development system, so I can handle some downtime. Do you wipe
the database first and then reload?
I am attaching a copy of my simple pipeline in case it has some relevance...
-Mike
-------------- next part --------------
A non-text attachment was scrubbed...
Name: pipe.xml
Type: text/xml
Size: 2039 bytes
Desc: not available
Url : http://xqzone.marklogic.com/pipermail/general/attachments/20080721/37109916/pipe.xml
More information about the General
mailing list