The MarkLogic connector for Apache Spark is an Apache Spark 3 connector that supports reading data from and writing data to MarkLogic. Within any Spark 3 environment, the connector enables users to easily query for data in MarkLogic, manipulate it using widely-known Spark operations, and then write results back to MarkLogic or disseminate them to another system. Data can also be easily imported into MarkLogic by first reading it from any data source that Spark supports and then writing it to MarkLogic.
Reading Data:
Writing Data:
Reprocess Data
To learn more about the project and get started visit the MarkLogic Spark documentation.
By continuing to use this website you are giving consent to cookies being used in accordance with the MarkLogic Privacy Statement.