Data pipelines are the foundation for success in data analytics. Moving data from numerous diverse sources and transforming it to provide context is the difference between having data and actually gaining value from it. This pocket reference defines data pipelines and explains how they work in today's modern data stack.
You'll learn common considerations and key decision points when implementing pipelines, such as batch versus streaming data ingestion and build versus buy. This book addresses the most common decisions made by data professionals and discusses foundational concepts that apply to open source frameworks, commercial products, and homegrown solutions.
You'll learn:
What a data pipeline is and how it works
How data is moved and processed on modern data infrastructure, including cloud platforms
Common tools and products used by data engineers to build pipelines
How pipelines support analytics and reporting needs
Considerations for pipeline maintenance, testing, and alerting
Product details
Publisher : O'Reilly Media; 1st edition (March 16, 2021)
Language : English
Paperback : 274 pages
ISBN-10 : 1492087831
ISBN-13 : 978-1492087830
Item Weight : 2.31 pounds
Dimensions : 4 x 0.75 x 7 inches
Best Sellers Rank: #67,984 in Books (See Top 100 in Books)
#7 in Data Warehousing (Books)
#13 in Data Modeling & Design (Books)
#21 in Data Processing
Customer Reviews: 4.5
361 ratings
When you purchase through links on our site, we may earn an affiliate commission at no cost to you.