Fundamentals of Data Engineering

Fundamentals of Data Engineering
内容简介:

Data engineering has grown rapidly in the past decade, leaving many software engineers, data scientists, and analysts looking for a comprehensive view of this practice. With this practical book, you'll learn how to plan and build systems to serve the needs of your organization and customers by evaluating the best technologies available in the framework of the data engineering lifecycle.

Authors Joe Reis and Matt Housley walk you through the data engineering lifecycle and show you how to stitch together a variety of cloud technologies to serve the needs of downstream data consumers. You'll understand how to apply the concepts of data generation, ingestion, orchestration, transformation, storage, governance, and deployment that are critical in any data environment regardless of the underlying technology.

This book will help you:

Assess data engineering problems using an end-to-end data framework of best practices

Cut through marketing hype when choosing data technologies, architecture, and processes

Use the data engineering lifecycle to design and build a robust architecture

Incorporate data governance and security across the data engineering lifecycle


Joe Reis is a business-minded data nerd who’s worked in the data industry for 20 years, with responsibilities ranging from statistical modeling, forecasting, machine learning, data engineering, data architecture, and almost everything else in between. Joe is the CEO and Co-Founder of Ternary Data, a data engineering and architecture consulting firm based in Salt Lake City, Utah...

作者简介:
下载地址:
下载Fundamentals of Data Engineering
标签:
文章链接:https://www.dushupai.com/book-content-28022.html(转载时请注明本文出处及文章链接)
读书评论: 更多
  • 三七李
    08-08
    读了大半年,获得了一个总体概念,蛮好的,知道了很多不知道的东西,大数据的软件咋这么多…
  • 流光
    03-18
    前后部分对调一下可能会读得更顺畅。还是提供了一些能面试的时候bb的东西
  • 2024
    07-02
    early release 版本写得挺像通讯约稿的/"The data engineer we discuss in this book can be described more precisely as a data lifecycle engineer."/工业界以数据为对象的生产实践中,数据科学和数据工程的分野。/Data Mesh, Serving Data for Analytics, Machine Learning, and Reverse ETL/中间有几章可以当作checklist
猜你喜欢: