[MarkLogic Dev General] cpf question

Mike Sokolov sokolov at ifactory.com
Mon Jul 21 07:46:00 PDT 2008


I have set up a very simple content processing pipeline; basically there 
is a single action to be taken whenever any document is updated or 
created.  I've gotten this to work well enough when starting with an 
empty database.  However, I am now in a state where I can't seem to get 
my actions to trigger when I update the already-existing documents.  
I have two questions:

1) If you load all your documents, and then install CPF, load your 
actions and set up a pipeline, will this trigger processing of the 
documents that are already loaded?  Is there some way to get those 
documents processed at that point?

2) What does "TaskServer: state is already successor so not updating for 
<uri>" mean?  It sounds as if TaskServer is saying: hey I processed this 
document already, so (even though you just updated it), I'm not going to 
process it again.  How can I set things up so updates work?

OK, third question: what is the recommended "best practice" for updating 
all the documents in order to get the maximum loading throughput?  This 
is in a development system, so I can handle some downtime.  Do you wipe 
the database first and then reload?

I am attaching a copy of my simple pipeline in case it has some relevance...

-Mike
-------------- next part --------------
A non-text attachment was scrubbed...
Name: pipe.xml
Type: text/xml
Size: 2039 bytes
Desc: not available
Url : http://xqzone.marklogic.com/pipermail/general/attachments/20080721/37109916/pipe.xml

