Products: MarkLogic Content Pump (MLCP)

MarkLogic Content Pump (MLCP) is an open-source, Java-based command-line tool. MLCP provides the fastest way to import, export, and copy data to or from MarkLogic databases. It is designed for integration and automation in existing workflows and scripts.

Downloads

Release 11.3.1 binaries zip package — 55 MB (SHA1)
Release 11.3.1 source zip package — 2.4 MB (SHA1)
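
Each download lists a SHA1 digest, so a downloaded package can be checked before it is unpacked. The following is a minimal sketch, assuming either `sha1sum` or `shasum` is installed; the helper names `sha1_of` and `verify_sha1` and the file name in the usage comment are hypothetical.

```shell
# Compute the SHA-1 digest of a file with whichever common tool is available.
sha1_of() {
  if command -v sha1sum >/dev/null 2>&1; then
    sha1sum "$1" | awk '{print $1}'
  else
    shasum -a 1 "$1" | awk '{print $1}'
  fi
}

# Exit status 0 only if the file's digest matches the expected digest.
# The expected digest is lowercased so either letter case is accepted.
verify_sha1() {
  actual=$(sha1_of "$1")
  expected=$(printf '%s' "$2" | tr 'A-F' 'a-f')
  [ "$actual" = "$expected" ]
}

# Usage (hypothetical file name; paste the digest shown on the download page):
# verify_sha1 mlcp-11.3.1-bin.zip "<digest>" && unzip mlcp-11.3.1-bin.zip
```

Chaining `unzip` after the check means a corrupted or tampered download is never unpacked.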

MarkLogic Content Pump on GitHub

Major Features

Bulk load local files
Split and load large, aggregate XML files or delimited text
Bulk load triples or quads from RDF files
Archive and restore database contents across environments
Copy subsets of data between databases
Load documents from HDFS, including Hadoop sequence files
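
As an illustration of copying a subset of data between databases, the sketch below assembles an `mlcp.sh copy` command restricted to one collection. It only prints the command and does not run it. The hosts, port, credentials, and collection name are hypothetical, and the option names follow the copy options as described in the MLCP User Guide; verify them against the version you install.

```shell
# Build (but do not run) an mlcp copy command for a single collection.
# All hosts, ports, credentials, and the collection name are hypothetical.
build_copy_cmd() {
  src_host=$1
  dst_host=$2
  collection=$3
  echo "mlcp.sh copy" \
    "-input_host $src_host -input_port 8006" \
    "-input_username user -input_password '*******'" \
    "-output_host $dst_host -output_port 8006" \
    "-output_username user -output_password '*******'" \
    "-collection_filter $collection"
}

# Print the command for review before running it by hand.
build_copy_cmd staging.example.com prod.example.com invoices
```

Printing the command first makes it easy to review in a script or log before any data moves.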

Getting Started

Get started with the MLCP online training course provided by MarkLogic University. You can also consult the MLCP User Guide for reference.

Requirements

To get started moving data with MLCP, download and unpack the binaries. If you are interested in hacking on MLCP or looking at its internals, you can also download the Apache 2.0 licensed source.

Installation

To create your first import script, make sure you have an XDBC server attached to your database (the examples below assume one running on port 8006). From the command line, run the following, substituting your own host, port, credentials, and input path:

mlcp.sh import \
 -host localhost \
 -port 8006 \
 -username user \
 -password ******* \
 -input_file_path /data/latest
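
For automation, the same settings can be kept in an options file instead of being repeated on every command line. The sketch below is hedged: it assumes the `-options_file` option and the one-option-or-value-per-line layout described in the MLCP User Guide, and the file name `import.opts` is a hypothetical choice. The command is only printed, not executed.

```shell
# Write the import options to a file, one option name or value per line.
# The file name import.opts is hypothetical.
cat > import.opts <<'EOF'
-host
localhost
-port
8006
-username
user
-input_file_path
/data/latest
EOF

# Print the command that would use the file; remove "echo" to run it.
# The password is kept out of the file and passed separately.
echo mlcp.sh import -options_file import.opts -password '*******'
```

Keeping the password out of the options file lets the file be checked into version control alongside other scripts.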

To export the contents of that same database into a platform-independent archive (add a filter option such as -collection_filter to limit the export to a subset):

mlcp.sh export \
 -host localhost \
 -port 8006 \
 -username user \
 -password ******* \
 -output_type archive \
 -output_file_path /out/collections \
 -compress true
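
An exported archive can later be loaded into another environment with an import. The sketch below prints (but does not run) such a restore command. The target host and the helper name `build_restore_cmd` are hypothetical, and `-input_file_type archive` is used as described in the MLCP User Guide; check the guide for your version before relying on it.

```shell
# Build (but do not run) an mlcp import command that restores an archive.
# The target host, port, and credentials are hypothetical placeholders.
build_restore_cmd() {
  target_host=$1
  archive_dir=$2
  echo "mlcp.sh import" \
    "-host $target_host -port 8006" \
    "-username user -password '*******'" \
    "-input_file_type archive" \
    "-input_file_path $archive_dir"
}

# Restore the archive written by the export example into another host.
build_restore_cmd qa.example.com /out/collections
```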

To import all triples from an N-Triples formatted file named example.nt:

mlcp.sh import \
 -host localhost \
 -port 8006 \
 -username user \
 -password ******* \
 -input_file_path example.nt \
 -mode local \
 -input_file_type RDF
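
To try the triples import without existing data, a small N-Triples file can be created by hand. The sketch below writes two made-up statements to example.nt and counts them; the subjects and objects under example.org are illustrative only.

```shell
# Create a tiny N-Triples file: one statement per line, each ending in " ."
# The example.org resources are made up for illustration.
cat > example.nt <<'EOF'
<http://example.org/alice> <http://xmlns.com/foaf/0.1/name> "Alice" .
<http://example.org/alice> <http://xmlns.com/foaf/0.1/knows> <http://example.org/bob> .
EOF

# Count the statements as a quick sanity check before importing.
grep -c ' \.$' example.nt
```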

This is a small sampling of the rich set of capabilities MLCP provides. For much more information, sample code, and examples, explore its documentation.

Other Versions

Notice about log4j: MarkLogic Content Pump (MLCP) v10.0.8.2 and v9.0.13.7 contain the fix for the runtime log4j vulnerability.

MarkLogic 10

Release 10.0.11.1 binaries zip package — 50 MB (SHA1)
Release 10.0.11.1 source zip package — 2.4 MB (SHA1)

Maven

Dependencies

<dependency>
  <groupId>com.marklogic</groupId>
  <artifactId>mlcp</artifactId>
  <version>10.0.11.1</version>
</dependency>

MarkLogic 9

Release 9.0.13.8 binaries zip package — 36 MB (SHA1)
Release 9.0.13.8 source zip package — 1.4 MB (SHA1)

Maven

Dependencies

<dependency>
  <groupId>com.marklogic</groupId>
  <artifactId>mlcp</artifactId>
  <version>9.0.13.8</version>
</dependency>

MarkLogic 8

Release 8.0.9 binaries zip package — 29 MB (SHA1)
Release 8.0.9 source zip package — 430 KB (SHA1)
Release 8.0.9 binaries zip package for use with MapR — 31 MB (SHA1)

Maven

Dependencies

<dependency>
  <groupId>com.marklogic</groupId>
  <artifactId>mlcp</artifactId>
  <version>8.0.9</version>
</dependency>
