Best Sellers in Books
Discover the most popular and best selling products in Books based on sales

Disclosure: I get commissions for purchases made through links in this website
Databases & Big Data - Effective Data Science Infrastructure: How to make data scientists productive

Description

Book Synopsis: Simplify data science infrastructure to give data scientists an efficient path from prototype to production.In Effective Data Science Infrastructure you will learn how to: Design data science infrastructure that boosts productivity Handle compute and orchestration in the cloud Deploy machine learning to production Monitor and manage performance and results Combine cloud-based tools into a cohesive data science environment Develop reproducible data science projects using Metaflow, Conda, and Docker Architect complex applications for multiple teams and large datasets Customize and grow data science infrastructure

Effective Data Science Infrastructure: How to make data scientists more productive is a hands-on guide to assembling infrastructure for data science and machine learning applications. It reveals the processes used at Netflix and other data-driven companies to manage their cutting-edge data infrastructure. In it, you'll master scalable techniques for data storage, computation, experiment tracking, and orchestration that are relevant to companies of all shapes and sizes. You'll learn how you can make data scientists more productive with your existing cloud infrastructure, a stack of open source software, and idiomatic Python. The author is donating proceeds from this book to charities that support women and underrepresented groups in data science.

Purchase of the print book includes a free eBook in PDF, Kindle, and ePub formats from Manning Publications.

About the technology

Growing data science projects from prototype to production requires reliable infrastructure. Using the powerful new techniques and tooling in this book, you can stand up an infrastructure stack that will scale with any organization, from startups to the largest enterprises.

About the book

Effective Data Science Infrastructure teaches you to build data pipelines and project workflows that will supercharge data scientists and their projects. Based on state-of-the-art tools and concepts that power data operations of Netflix, this book introduces a customizable cloud-based approach to model development and MLOps that you can easily adapt to your company's specific needs. As you roll out these practical processes, your teams will produce better and faster results when applying data science and machine learning to a wide array of business problems.

What's inside

  • Handle compute and orchestration in the cloud
  • Combine cloud-based tools into a cohesive data science environment
  • Develop reproducible data science projects using Metaflow, AWS, and the Python data ecosystem
  • Architect complex applications that require large datasets and models, and a team of data scientists

About the reader

For infrastructure engineers and engineering-minded data scientists who are familiar with Python.

About the author

At Netflix, Ville Tuulos designed and built Metaflow, a full-stack framework for data science. Currently, he is the CEO of a startup focusing on data science infrastructure.

Table of Contents

  1. Introducing data science infrastructure
  2. The toolchain of data science
  3. Introducing Metaflow
  4. Scaling with the compute layer
  5. Practicing scalability and performance
  6. Going to production
  7. Processing data
  8. Using and operating models
  9. Machine learning with the full stack

Read more

Details

Boost Data Scientist Productivity with Effective Data Science Infrastructure

Want to make your data scientists more productive? Look no further than Effective Data Science Infrastructure. This comprehensive guide reveals the secrets used by industry-leading companies like Netflix to manage their cutting-edge data infrastructure. From designing scalable data pipelines to monitoring performance, this book provides the tools and techniques you need to streamline your data science projects. With your existing cloud infrastructure, open-source software, and idiomatic Python, you can empower your team to achieve faster and better results. Don't miss out on this opportunity to supercharge your data science efforts.

Learn more

Effortlessly Scale Your Data Science Projects

Are you struggling to handle the complex infrastructure requirements of your data science projects? Effective Data Science Infrastructure is here to help. With this book, you'll discover how to develop reproducible data science projects using Metaflow, Conda, and Docker. Whether you're dealing with large datasets, deploying machine learning to production, or managing multiple teams, this hands-on guide has got you covered. By implementing the techniques outlined in this book, you can build a customizable cloud-based data science environment that scales with your organization's needs. Don't let infrastructure limitations hold back your data science potential.

Unlock the power of scalable infrastructure today

Support Women and Underrepresented Groups in Data Science

Not only will you benefit from the wealth of knowledge in Effective Data Science Infrastructure, but you'll also be supporting a great cause. The author of this book is donating proceeds to charities that promote diversity and inclusion in data science. By purchasing this book, you're making a difference in the lives of women and underrepresented groups in the tech industry. Get involved and enhance your data science skills while contributing to a more inclusive future.

Join us in making a positive impact

Disclosure: I get commissions for purchases made through links in this website