[MarkLogic Dev General] cpf question

Mike Sokolov sokolov at ifactory.com
Mon Jul 21 08:51:30 PDT 2008


Thanks for your quick response, Mary.

In the course of digging a bit further, I found I had neglected to 
attach my pipeline to the domain.  I think things are moving along OK now.

-Mike


Mary Holstege wrote:
> On Mon, 21 Jul 2008 07:46:00 -0700, Mike Sokolov 
> <sokolov at ifactory.com> wrote:
>
>> I have set up a very simple content processing pipeline; basically there
>> is a single action to be taken whenever any document is updated or
>> created.  I've gotten this to work well enough when starting with an
>> empty database.  However now I am in a state where I can't seem to get
>> my actions to be triggered by updating the already-existing documents.
>> I have two questions:
>>
>> 1) If you load all your documents, and then install cpf, load your
>> actions and set up a pipeline, will this trigger processing?  Is there
>> some way to get documents to be processed at that point?
>
> CPF responds to events, so to reprocess existing content, you need
> to touch each of those documents in order to get CPF to notice them.
>
>> 2) What does "TaskServer: state is already successor so not updating for
>> <uri>" mean?  It sounds as if TaskServer is saying: hey I processed this
>> document already, so (even though you just updated it), I'm not going to
>> process it again.  How can I set things up so updates work?
>
> This is benign.  It was put in to trace cases where multiple actions
> were interfering with each other, but it turns out the way it is testing
> for that case was flawed and it gets reported more than it should.
>
>> OK, third question: what is the recommended "best practice" for updating
>> all the documents, in order to get the maximum loading throughput.  This
>> is in a development system, so I can handle some downtime. Do you wipe
>> the database first and then reload?
>
> That is maybe the simplest, but all you need to do to get CPF to pay 
> attention
> is to touch each document; if they have multiple fragments, that could be
> cheaper than a full reload.
>
>> I am attaching a copy of my simple pipeline in case it has some 
>> relevance...
>
> What other pipelines do you have attached to the domain?  You will
> need the status handling pipeline.
>
> //Mary
> _______________________________________________
> General mailing list
> General at developer.marklogic.com
> http://xqzone.com/mailman/listinfo/general


More information about the General mailing list