The MarkLogic Connector for Spark 2 makes it fast and easy to implement Spark jobs that ingest data into, and export data from, a MarkLogic Data Hub.
Apache Spark is an in-memory, distributed data processing engine for analytical applications, including machine learning, SQL, streaming, and graph processing. As a unified analytics tool, it is used primarily by data engineers and data scientists to build scalable data pipelines that span diverse data sources such as object stores, RDBMSs, HDFS, and NoSQL databases.
Step through the written tutorial for the Spark connector to get started with ingesting and exporting data.