You are here

Apache Spark Python Library

Apache Spark is a fast cluster computing system supporting interactive queries with SQL, machine learning, and graph computation all handled through the Spark API. The Apache Spark Python Library enables developers to quickly write programs in Python that access a unified engine in order to process large amounts of data. Supported by the Apache Software Foundation, the Python library comes well documented.