Amazon has announced the general availability of AWS Lake Formation. Lake Formation is a managed service that eases developers' ability to build, secure and manage data lakes. Amazon achieves its goal through automation. Functions automated through Lake Formation include collecting, cleaning, cataloging, and securing data for analytics.
"Our customers tell us that Amazon S3 is the ideal place to house their data lakes, which is why AWS hosts more data lakes than anyone else - with tens of thousands and growing every day," Raju Gulabani, AWS Vice President of Databases, Analytics, and Machine Learning, commented in a press release. "That's why we built AWS Lake Formation, so customers can spend more time learning from their data and innovating, rather than wrestling that data into functioning data lakes."
Data lakes remove silos within an organization's various datasets of interest. Once the silos are removed, an organization can better understand and analyze its data in a single, central location. But, getting data into a single location is only one step in the process. This data must be readied for analysis which is why Amazon built the Lake Formation product.
Customers can collect and bring their data into a single data lake from various, disparate resources. Lake Formation has tools and templates for which users can leverage to collect such data. Once collected, the data is automatically classified and prepared as needed to achieve the goal of the data lake. Customers can utilize a number of tools to analyze the collected data (e.g. Amazon Redshift, Amazon Athena, AWS Glue, Amazon QuickSight, etc.).