- Added by: literator
- Date: 27-11-2023, 17:37
Title: Delta Lake: Up and Running: Modern Data Lakehouse Architectures with Delta Lake (Final)
Authors: Bennie Haelen, Dan Davis
Publisher: O’Reilly Media, Inc.
Year: 2024
Pages: 267
Language: English
Format: pdf (true), epub (true)
Size: 12.3 MB
With the surge in Big Data and AI, organizations can rapidly create data products. However, the effectiveness of their analytics and Machine Learning models depends on the data's quality. Delta Lake's open source format offers a robust lakehouse framework over platforms like Amazon S3, ADLS, and GCS. This practical book shows data engineers, data scientists, and data analysts how to get Delta Lake and its features up and running. The ultimate goal of building data pipelines and applications is to gain insights from data. You'll understand how your storage solution choice determines the robustness and performance of the data pipeline, from raw data to insights. The code examples in the book range from snippets that can be used in a PySpark shell to those designed to be run with a complete end-to-end notebook. In this book, all code snippets will be in Python, SQL, and, where necessary, shell commands.