com.marklogic.mapreduce
Class NodeOutputFormat
java.lang.Object
org.apache.hadoop.mapreduce.OutputFormat<KEYOUT,VALUEOUT>
com.marklogic.mapreduce.MarkLogicOutputFormat<NodePath,MarkLogicNode>
com.marklogic.mapreduce.NodeOutputFormat
- All Implemented Interfaces:
- MarkLogicConstants, org.apache.hadoop.conf.Configurable
public class NodeOutputFormat
- extends MarkLogicOutputFormat<NodePath,MarkLogicNode>
MarkLogicOutputFormat for Node.
Use this class to store MapReduce results as XML nodes in a MarkLogic
database. Use this class with output key-value pairs where the key is
NodePath and the value is a MarkLogicNode to be
inserted into the database relative to the node path in the key.
Where the node is inserted relative to the path is controlled by the
output.node.optype configuration
property. You must set this property when using NodeOutputFormat.
- See Also:
NodeOpType,
MarkLogicConstants.NODE_OPERATION_TYPE
|
Field Summary |
static org.apache.commons.logging.Log |
LOG
|
| Fields inherited from interface com.marklogic.mapreduce.MarkLogicConstants |
ADVANCED_MODE, BASIC_MODE, BATCH_SIZE, BIND_SPLIT_RANGE, CONTENT_TYPE, DEFAULT_BATCH_SIZE, DEFAULT_CONTENT_TYPE, DEFAULT_MAX_SPLIT_SIZE, DEFAULT_OUTPUT_CONTENT_ENCODING, DEFAULT_OUTPUT_XML_REPAIR_LEVEL, DEFAULT_PROPERTY_OPERATION_TYPE, DOCUMENT_SELECTOR, INDENTED, INPUT_DATABASE_NAME, INPUT_HOST, INPUT_KEY_CLASS, INPUT_LEXICON_FUNCTION_CLASS, INPUT_MODE, INPUT_PASSWORD, INPUT_PORT, INPUT_QUERY, INPUT_SSL_OPTIONS_CLASS, INPUT_USE_SSL, INPUT_USERNAME, INPUT_VALUE_CLASS, MAX_SPLIT_SIZE, MR_NAMESPACE, NODE_OPERATION_TYPE, OUTPUT_CLEAN_DIR, OUTPUT_COLLECTION, OUTPUT_CONTENT_ENCODING, OUTPUT_CONTENT_LANGUAGE, OUTPUT_CONTENT_NAMESPACE, OUTPUT_DIRECTORY, OUTPUT_FAST_LOAD, OUTPUT_FOREST_HOST, OUTPUT_HOST, OUTPUT_KEY_TYPE, OUTPUT_KEY_VARNAME, OUTPUT_NAMESPACE, OUTPUT_PASSWORD, OUTPUT_PERMISSION, OUTPUT_PORT, OUTPUT_PROPERTY_ALWAYS_CREATE, OUTPUT_QUALITY, OUTPUT_QUERY, OUTPUT_SSL_OPTIONS_CLASS, OUTPUT_STREAMING, OUTPUT_TOLERATE_ERRORS, OUTPUT_USE_SSL, OUTPUT_USERNAME, OUTPUT_VALUE_TYPE, OUTPUT_VALUE_VARNAME, OUTPUT_XML_REPAIR_LEVEL, PATH_NAMESPACE, PROPERTY_OPERATION_TYPE, RECORD_TO_FRAGMENT_RATIO, SPLIT_END_VARNAME, SPLIT_QUERY, SPLIT_START_VARNAME, SUBDOCUMENT_EXPRESSION, TXN_SIZE |
| Methods inherited from class java.lang.Object |
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
LOG
public static final org.apache.commons.logging.Log LOG
NodeOutputFormat
public NodeOutputFormat()
getRecordWriter
public org.apache.hadoop.mapreduce.RecordWriter<NodePath,MarkLogicNode> getRecordWriter(org.apache.hadoop.mapreduce.TaskAttemptContext context)
throws IOException,
InterruptedException
- Specified by:
getRecordWriter in class org.apache.hadoop.mapreduce.OutputFormat<NodePath,MarkLogicNode>
- Throws:
IOException
InterruptedException
checkOutputSpecs
public void checkOutputSpecs(org.apache.hadoop.conf.Configuration conf,
com.marklogic.xcc.ContentSource cs)
throws IOException
- Specified by:
checkOutputSpecs in class MarkLogicOutputFormat<NodePath,MarkLogicNode>
- Throws:
IOException
Copyright © 2013 MarkLogic Corporation. All Rights Reserved.
Complete online documentation for MarkLogic Server,
XQuery and related components may be found at
developer.marklogic.com