Best Sellers in Books
Discover the most popular and best selling products in Books based on sales

Disclosure: I get commissions for purchases made through links in this website
Databases & Big Data - Data Science on the Google Cloud Platform: Implementing End-to-End Real-Time Data Pipelines: From Ingest to Machine Learning

Description

Book Synopsis: Learn how easy it is to apply sophisticated statistical and machine learning methods to real-world problems when you build using Google Cloud Platform (GCP). This hands-on guide shows data engineers and data scientists how to implement an end-to-end data pipeline with cloud native tools on GCP.

Throughout this updated second edition, you'll work through a sample business decision by employing a variety of data science approaches. Follow along by building a data pipeline in your own project on GCP, and discover how to solve data science problems in a transformative and more collaborative way.

You'll learn how to:

  • Employ best practices in building highly scalable data and ML pipelines on Google Cloud
  • Automate and schedule data ingest using Cloud Run
  • Create and populate a dashboard in Data Studio
  • Build a real-time analytics pipeline using Pub/Sub, Dataflow, and BigQuery
  • Conduct interactive data exploration with BigQuery
  • Create a Bayesian model with Spark on Cloud Dataproc
  • Forecast time series and do anomaly detection with BigQuery ML
  • Aggregate within time windows with Dataflow
  • Train explainable machine learning models with Vertex AI
  • Operationalize ML with Vertex AI Pipelines

Read more

Details

Are you ready to take your data science skills to the next level? Look no further than the "Data Science on the Google Cloud Platform" book. With this updated second edition, you'll discover how easy it is to implement end-to-end real-time data pipelines using Google Cloud Platform (GCP). Whether you're a data engineer or a data scientist, this hands-on guide will equip you with the tools and knowledge needed to solve real-world problems with sophisticated statistical and machine learning methods.

One of the standout features of this book is its focus on using cloud native tools on GCP. By following along with a sample business decision, you'll learn how to build your own data pipeline and automate data ingest using Cloud Run. With the help of Data Studio, you'll be able to create interactive dashboards to visualize your data. Additionally, you'll explore a wide range of data science approaches, including real-time analytics using Pub/Sub, Dataflow, and BigQuery, and Bayesian modeling with Spark on Cloud Dataproc.

Imagine being able to forecast time series and detect anomalies with BigQuery ML, or train explainable machine learning models with Vertex AI. This book covers it all. What sets it apart is its emphasis on collaboration and transforming the way you approach data science problems. Say goodbye to siloed work and hello to a more integrated and productive workflow.

Ready to revolutionize your data science projects? Get your hands on the "Data Science on the Google Cloud Platform" book now and start building highly scalable data and ML pipelines with GCP. Take the first step towards unlocking the power of cloud-native tools and transforming the way you solve data science problems.

Click here to get your copy now.

Disclosure: I get commissions for purchases made through links in this website