Automating Data Quality Monitoring at Scale
Paperback

$152.99

The world's businesses ingest a combined 2.5 quintillion bytes of data every day. But how much of this vast amount of data--used to build products, power AI systems, and drive business decisions--is poor quality or just plain bad? This practical book shows you how to ensure that the data your organization relies on contains only high-quality records.

Most data engineers, data analysts, and data scientists genuinely care about data quality, but they often don't have the time, resources, or understanding to create a data quality monitoring solution that succeeds at scale. In this book, Jeremy Stanley and Paige Schwartz from Anomalo explain how you can use automated data quality monitoring to cover all your tables efficiently, proactively alert on every category of issue, and resolve problems immediately.

This book will help you:
Learn why data quality is a business imperative
Understand and assess unsupervised learning models for detecting data issues
Implement notifications that reduce alert fatigue and let you triage and resolve issues quickly
Integrate automated data quality monitoring with data catalogs, orchestration layers, and BI and ML systems
Understand the limits of automated data quality monitoring and how to overcome them
Learn how to deploy and manage your monitoring solution at scale
Maintain automated data quality monitoring for the long term

In Shop
Out of stock
Shipping & Delivery

$9.00 standard shipping within Australia
FREE standard shipping within Australia for orders over $100.00
Express & International shipping calculated at checkout

MORE INFO
Format
Paperback
Publisher
O'Reilly Media
Country
United States
Date
30 January 2024
Pages
170
ISBN
9781098145934
