The data lakehouse is the next generation of the data warehouse and data lake, designed to meet today’s complex and ever-changing analytics, machine learning, and data science requirements. This book covers the essential topics prior to building the full methodology for the data lakehouse.
Learn about the features and architecture of the data lakehouse, along with its powerful analytical infrastructure. Appreciate how the universal common connector blends structured, textual, analog, and IoT data. Maintain the lakehouse for future generations through Data Lakehouse Housekeeping and Data Future-proofing.
Incorporate data catalogs, data lineage tools, and open source software into your architecture to ensure your data scientists, analysts, and end users live happily ever after. Deep dive into one specific implementation of a data lakehouse: the Databricks Lakehouse Platform.