CORB (Content Reprocessing in Bulk) is a multi-threaded Java tool for bulk reprocessing of documents stored in MarkLogic. In a nutshell, CORB works from a list of documents in a database and performs operations against those documents. These operations can include generating a report across all documents, manipulating individual documents, or a combination of the two.
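The general pattern described above (take a list of document URIs, then apply an operation to each one across several threads) can be sketched in plain Java. This is only an illustration of the idea, not CORB's actual code or API: the class name `BulkReprocessSketch`, the `reprocess` method, and the sample URIs are all hypothetical, and the per-document operation is a stand-in function rather than an XQuery module run against MarkLogic.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.Callable;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;
import java.util.function.Function;

// Hypothetical sketch of "list of URIs + worker threads"; not CORB's implementation.
public class BulkReprocessSketch {

    // Applies `process` to every URI using a fixed pool of worker threads
    // and returns the results in the same order as the input list.
    static List<String> reprocess(List<String> uris, int threadCount,
                                  Function<String, String> process) throws Exception {
        ExecutorService pool = Executors.newFixedThreadPool(threadCount);
        try {
            // Submit one task per document URI.
            List<Future<String>> pending = new ArrayList<>();
            for (String uri : uris) {
                Callable<String> task = () -> process.apply(uri);
                pending.add(pool.submit(task));
            }
            // Collect results; Future.get() blocks until each task finishes.
            List<String> results = new ArrayList<>();
            for (Future<String> future : pending) {
                results.add(future.get());
            }
            return results;
        } finally {
            pool.shutdown();
        }
    }

    public static void main(String[] args) throws Exception {
        // Assumed sample URIs; a real run would select these from the database.
        List<String> uris = List.of("/docs/a.xml", "/docs/b.xml", "/docs/c.xml");
        // Stand-in operation; a real job would transform or report on each document.
        List<String> report = reprocess(uris, 2, uri -> uri + " processed");
        report.forEach(System.out::println);
    }
}
```

Collecting the futures in submission order keeps the report aligned with the input list even though the documents may be processed in any order by the worker threads.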

CORB was originally developed by Michael Blakeley and submitted to the Open Source Community (OSC). CORB 1.0 provides basic functionality for selecting documents and applying an XQuery to them using multiple threads. In 2014, Bhagat Bandlamudi released CORB2 to the OSC. CORB2 extends CORB with considerable new functionality.

Learn More

CoRB Video Tutorial

Learn the basics of CoRB (Content Reprocessing in Bulk), walk through the steps for using it, and watch a demonstration of CoRB in action.

Configuration & Requirements

Find out which libraries and configuration are required to run CORB with MarkLogic. Also learn how to write a custom URI module, with some sample invocations.

GitHub Repositories

Explore the GitHub repositories and projects that have benefited from the work of the MarkLogic developer community. We encourage contributions.
