com.marklogic.mapreduce
Class NodeInputFormat
java.lang.Object
  org.apache.hadoop.mapreduce.InputFormat<KEYIN,VALUEIN>
    com.marklogic.mapreduce.MarkLogicInputFormat<NodePath,MarkLogicNode>
      com.marklogic.mapreduce.NodeInputFormat
- All Implemented Interfaces:
MarkLogicConstants
public class NodeInputFormat
extends MarkLogicInputFormat<NodePath,MarkLogicNode>
A MarkLogicInputFormat for XML nodes.
Use this class when XML nodes in a MarkLogic database are the input
to a MapReduce job. This format produces key-value pairs in which the
key is the NodePath of a node and the value is that node, as a MarkLogicNode.
- See Also:
LinkCountInDoc
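As a rough illustration of the usage described above, the following sketch configures a job that reads nodes through NodeInputFormat and counts them by node name. It is not taken from the connector's samples. The class names NodeNameCount and NameMapper, the host, port, credentials, and the "//title" path expression are all hypothetical placeholders. It also assumes that MarkLogicNode.get() returns the underlying org.w3c.dom.Node and that Job.getInstance is available in the Hadoop release in use (older releases construct the job with new Job(conf, name)).

```java
import java.io.IOException;

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.Path;
import org.apache.hadoop.io.IntWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Job;
import org.apache.hadoop.mapreduce.Mapper;
import org.apache.hadoop.mapreduce.lib.output.FileOutputFormat;
import org.apache.hadoop.mapreduce.lib.output.TextOutputFormat;
import org.apache.hadoop.mapreduce.lib.reduce.IntSumReducer;
import org.w3c.dom.Node;

import com.marklogic.mapreduce.MarkLogicConstants;
import com.marklogic.mapreduce.MarkLogicNode;
import com.marklogic.mapreduce.NodeInputFormat;
import com.marklogic.mapreduce.NodePath;

// Hypothetical driver: counts the nodes delivered by NodeInputFormat, by node name.
public class NodeNameCount {

    // Receives (NodePath, MarkLogicNode) pairs and emits (node name, 1).
    public static class NameMapper
            extends Mapper<NodePath, MarkLogicNode, Text, IntWritable> {
        private static final IntWritable ONE = new IntWritable(1);
        private final Text name = new Text();

        @Override
        protected void map(NodePath key, MarkLogicNode value, Context context)
                throws IOException, InterruptedException {
            // Assumption: get() exposes the wrapped DOM node.
            Node node = (value == null) ? null : value.get();
            if (node != null) {
                name.set(node.getNodeName());
                context.write(name, ONE);
            }
        }
    }

    public static void main(String[] args) throws Exception {
        Configuration conf = new Configuration();

        // Placeholder connection settings; replace with real values.
        conf.set(MarkLogicConstants.INPUT_HOST, "localhost");
        conf.set(MarkLogicConstants.INPUT_PORT, "8006");
        conf.set(MarkLogicConstants.INPUT_USERNAME, "user");
        conf.set(MarkLogicConstants.INPUT_PASSWORD, "password");

        // Basic mode with a hypothetical path expression selecting the input nodes.
        conf.set(MarkLogicConstants.INPUT_MODE, MarkLogicConstants.BASIC_MODE);
        conf.set(MarkLogicConstants.SUBDOCUMENT_EXPRESSION, "//title");

        Job job = Job.getInstance(conf, "node name count");
        job.setJarByClass(NodeNameCount.class);
        job.setInputFormatClass(NodeInputFormat.class);
        job.setMapperClass(NameMapper.class);
        job.setCombinerClass(IntSumReducer.class);
        job.setReducerClass(IntSumReducer.class);
        job.setOutputKeyClass(Text.class);
        job.setOutputValueClass(IntWritable.class);

        // Results are written as text files under the directory given in args[0].
        job.setOutputFormatClass(TextOutputFormat.class);
        FileOutputFormat.setOutputPath(job, new Path(args[0]));

        System.exit(job.waitForCompletion(true) ? 0 : 1);
    }
}
```

Running this requires the connector and Hadoop jars on the classpath and a reachable MarkLogic server. The configuration keys are referenced through the MarkLogicConstants fields rather than as literal property names, so the sketch does not depend on the exact strings behind them.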
Fields inherited from interface com.marklogic.mapreduce.MarkLogicConstants
ADVANCED_MODE, BASIC_MODE, BATCH_SIZE, BIND_SPLIT_RANGE, CONTENT_TYPE, DEFAULT_BATCH_SIZE, DEFAULT_CONTENT_TYPE, DEFAULT_MAX_SPLIT_SIZE, DEFAULT_OUTPUT_CONTENT_ENCODING, DEFAULT_OUTPUT_XML_REPAIR_LEVEL, DEFAULT_PROPERTY_OPERATION_TYPE, DOCUMENT_SELECTOR, INDENTED, INPUT_DATABASE_NAME, INPUT_HOST, INPUT_KEY_CLASS, INPUT_LEXICON_FUNCTION_CLASS, INPUT_MODE, INPUT_PASSWORD, INPUT_PORT, INPUT_QUERY, INPUT_SSL_OPTIONS_CLASS, INPUT_USE_SSL, INPUT_USERNAME, INPUT_VALUE_CLASS, MAX_SPLIT_SIZE, MR_NAMESPACE, NODE_OPERATION_TYPE, OUTPUT_CLEAN_DIR, OUTPUT_COLLECTION, OUTPUT_CONTENT_ENCODING, OUTPUT_CONTENT_LANGUAGE, OUTPUT_CONTENT_NAMESPACE, OUTPUT_DIRECTORY, OUTPUT_FAST_LOAD, OUTPUT_FOREST_HOST, OUTPUT_HOST, OUTPUT_KEY_TYPE, OUTPUT_KEY_VARNAME, OUTPUT_NAMESPACE, OUTPUT_PASSWORD, OUTPUT_PERMISSION, OUTPUT_PORT, OUTPUT_PROPERTY_ALWAYS_CREATE, OUTPUT_QUALITY, OUTPUT_QUERY, OUTPUT_SSL_OPTIONS_CLASS, OUTPUT_STREAMING, OUTPUT_TOLERATE_ERRORS, OUTPUT_USE_SSL, OUTPUT_USERNAME, OUTPUT_VALUE_TYPE, OUTPUT_VALUE_VARNAME, OUTPUT_XML_REPAIR_LEVEL, PATH_NAMESPACE, PROPERTY_OPERATION_TYPE, RECORD_TO_FRAGMENT_RATIO, SPLIT_END_VARNAME, SPLIT_QUERY, SPLIT_START_VARNAME, SUBDOCUMENT_EXPRESSION, TXN_SIZE
Method Summary
org.apache.hadoop.mapreduce.RecordReader<NodePath,MarkLogicNode> createRecordReader(org.apache.hadoop.mapreduce.InputSplit arg0, org.apache.hadoop.mapreduce.TaskAttemptContext context)
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
NodeInputFormat
public NodeInputFormat()
createRecordReader
public org.apache.hadoop.mapreduce.RecordReader<NodePath,MarkLogicNode> createRecordReader(org.apache.hadoop.mapreduce.InputSplit arg0,
        org.apache.hadoop.mapreduce.TaskAttemptContext context)
    throws IOException, InterruptedException
- Specified by:
createRecordReader in class org.apache.hadoop.mapreduce.InputFormat<NodePath,MarkLogicNode>
- Throws:
IOException
InterruptedException
Copyright © 2013 MarkLogic Corporation. All Rights Reserved.
Complete online documentation for MarkLogic Server,
XQuery and related components may be found at
developer.marklogic.com