Apache Spark is a fast cluster computing system that supports interactive SQL queries, machine learning, and graph computation, all through the Spark API. The Apache Spark Python library (PySpark) lets developers quickly write Python programs that use this unified engine to process large amounts of data. The project is supported by the Apache Software Foundation, and the Python library is well documented.
As part of a concerted effort to make the SAP HANA in-memory computing platform more appealing to developers, SAP this week announced a more modular approach to exposing SAP HANA services in the cloud. SAP is also moving to open an application store for SAP HANA applications offered by both SAP and third-party developers.