MarkLogic Connector for Hadoop 1.1-3

com.marklogic.mapreduce
Class PropertyOutputFormat

java.lang.Object
  extended by org.apache.hadoop.mapreduce.OutputFormat<KEYOUT,VALUEOUT>
      extended by com.marklogic.mapreduce.MarkLogicOutputFormat<DocumentURI,MarkLogicNode>
          extended by com.marklogic.mapreduce.PropertyOutputFormat
All Implemented Interfaces:
MarkLogicConstants, org.apache.hadoop.conf.Configurable

public class PropertyOutputFormat
extends MarkLogicOutputFormat<DocumentURI,MarkLogicNode>

A MarkLogicOutputFormat that writes document properties.

Use this class to store MapReduce results as properties on documents in a MarkLogic database. This class expects output key-value pairs where the key is a DocumentURI and the value is a MarkLogicNode describing the property to be added to the document at the key URI.

Control whether the inserted property replaces or adds to existing document properties by setting the configuration property output.property.optype. By default, any existing properties are replaced with the new one.

By default, properties are created only for documents that already exist in the database. Set the configuration property output.property.alwayscreate to true to create properties even if the target document does not exist.
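The configuration described above can be sketched as a small job-setup helper. This is a minimal sketch, not the connector's own sample code: the class PropertyJobSetup and its configure method are hypothetical names, and the value "ADD_PROPERTY" is an assumption about the names defined by PropertyOpType. The property keys are taken from the MarkLogicConstants fields PROPERTY_OPERATION_TYPE and OUTPUT_PROPERTY_ALWAYS_CREATE listed in the Field Summary, and the example assumes the Hadoop and connector jars are on the classpath.

```java
import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.mapreduce.Job;

import com.marklogic.mapreduce.DocumentURI;
import com.marklogic.mapreduce.MarkLogicConstants;
import com.marklogic.mapreduce.MarkLogicNode;
import com.marklogic.mapreduce.PropertyOutputFormat;

// Hypothetical helper class for illustration; not part of the connector.
public class PropertyJobSetup {

    // Configures an existing Job to write its output as document properties.
    public static void configure(Job job) {
        Configuration conf = job.getConfiguration();

        // Assumption: "ADD_PROPERTY" is the PropertyOpType name that adds to
        // existing properties instead of replacing them. Check PropertyOpType
        // for the values your connector version accepts.
        conf.set(MarkLogicConstants.PROPERTY_OPERATION_TYPE, "ADD_PROPERTY");

        // Create properties even when the target document does not exist.
        conf.setBoolean(MarkLogicConstants.OUTPUT_PROPERTY_ALWAYS_CREATE, true);

        // Output pairs are (DocumentURI, MarkLogicNode), as this class expects.
        job.setOutputFormatClass(PropertyOutputFormat.class);
        job.setOutputKeyClass(DocumentURI.class);
        job.setOutputValueClass(MarkLogicNode.class);
    }
}
```

A real job would presumably also need the output connection settings, such as those keyed by OUTPUT_HOST, OUTPUT_PORT, OUTPUT_USERNAME, and OUTPUT_PASSWORD in MarkLogicConstants, before it is submitted.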

See Also:
PropertyOpType

Field Summary
static org.apache.commons.logging.Log LOG
           
 
Fields inherited from class com.marklogic.mapreduce.MarkLogicOutputFormat
conf
 
Fields inherited from interface com.marklogic.mapreduce.MarkLogicConstants
ADVANCED_MODE, BASIC_MODE, BATCH_SIZE, BIND_SPLIT_RANGE, CONTENT_TYPE, DEFAULT_BATCH_SIZE, DEFAULT_CONTENT_TYPE, DEFAULT_MAX_SPLIT_SIZE, DEFAULT_OUTPUT_CONTENT_ENCODING, DEFAULT_OUTPUT_XML_REPAIR_LEVEL, DEFAULT_PROPERTY_OPERATION_TYPE, DOCUMENT_SELECTOR, INDENTED, INPUT_DATABASE_NAME, INPUT_HOST, INPUT_KEY_CLASS, INPUT_LEXICON_FUNCTION_CLASS, INPUT_MODE, INPUT_PASSWORD, INPUT_PORT, INPUT_QUERY, INPUT_SSL_OPTIONS_CLASS, INPUT_USE_SSL, INPUT_USERNAME, INPUT_VALUE_CLASS, MAX_SPLIT_SIZE, MR_NAMESPACE, NODE_OPERATION_TYPE, OUTPUT_CLEAN_DIR, OUTPUT_COLLECTION, OUTPUT_CONTENT_ENCODING, OUTPUT_CONTENT_LANGUAGE, OUTPUT_CONTENT_NAMESPACE, OUTPUT_DIRECTORY, OUTPUT_FAST_LOAD, OUTPUT_FOREST_HOST, OUTPUT_HOST, OUTPUT_KEY_TYPE, OUTPUT_KEY_VARNAME, OUTPUT_NAMESPACE, OUTPUT_PASSWORD, OUTPUT_PERMISSION, OUTPUT_PORT, OUTPUT_PROPERTY_ALWAYS_CREATE, OUTPUT_QUALITY, OUTPUT_QUERY, OUTPUT_SSL_OPTIONS_CLASS, OUTPUT_STREAMING, OUTPUT_TOLERATE_ERRORS, OUTPUT_USE_SSL, OUTPUT_USERNAME, OUTPUT_VALUE_TYPE, OUTPUT_VALUE_VARNAME, OUTPUT_XML_REPAIR_LEVEL, PATH_NAMESPACE, PROPERTY_OPERATION_TYPE, RECORD_TO_FRAGMENT_RATIO, SPLIT_END_VARNAME, SPLIT_QUERY, SPLIT_START_VARNAME, SUBDOCUMENT_EXPRESSION, TXN_SIZE
 
Constructor Summary
PropertyOutputFormat()
           
 
Method Summary
 void checkOutputSpecs(org.apache.hadoop.conf.Configuration conf, com.marklogic.xcc.ContentSource cs)
           
 org.apache.hadoop.mapreduce.RecordWriter<DocumentURI,MarkLogicNode> getRecordWriter(org.apache.hadoop.mapreduce.TaskAttemptContext context)
           
 
Methods inherited from class com.marklogic.mapreduce.MarkLogicOutputFormat
checkOutputSpecs, getConf, getForestHostMap, getOutputCommitter, queryForestHostMap, setConf
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Field Detail

LOG

public static final org.apache.commons.logging.Log LOG

Constructor Detail

PropertyOutputFormat

public PropertyOutputFormat()

Method Detail

getRecordWriter

public org.apache.hadoop.mapreduce.RecordWriter<DocumentURI,MarkLogicNode> getRecordWriter(org.apache.hadoop.mapreduce.TaskAttemptContext context)
        throws IOException,
               InterruptedException
Specified by:
getRecordWriter in class org.apache.hadoop.mapreduce.OutputFormat<DocumentURI,MarkLogicNode>
Throws:
IOException
InterruptedException

checkOutputSpecs

public void checkOutputSpecs(org.apache.hadoop.conf.Configuration conf,
                             com.marklogic.xcc.ContentSource cs)
                      throws IOException
Specified by:
checkOutputSpecs in class MarkLogicOutputFormat<DocumentURI,MarkLogicNode>
Throws:
IOException


Copyright © 2013 MarkLogic Corporation. All Rights Reserved.

Complete online documentation for MarkLogic Server, XQuery and related components may be found at developer.marklogic.com