Apache Spark is a fast cluster computing system supporting interactive queries with SQL, machine learning, and graph computation all handled through the Spark API. The Apache Spark Python Library enables developers to quickly write programs in Python that access a unified engine in order to process large amounts of data. Supported by the Apache Software Foundation, the Python library comes well documented.
Arguably, Salesforce.com brought the software-as-a-service (SaaS) concept mainstream. Today, if software isn't available as a service, it's considered old school. But software -- as a service or not -- is just a container. What makes software valuable has always been what it does to data. Now, in the same spirit of service-oriented architectures and SaaS, a new concept is emerging, Data-as-a-Service (DaaS).