BIBLIO is the largest independent book marketplace in the world, with over 100 million books.

Skip to content

Practical Statistics for Data Scientists: 50+ Essential Concepts Using R and Python

Practical Statistics for Data Scientists: 50+ Essential Concepts Using R and Python

Click to view full size.

Practical Statistics for Data Scientists: 50+ Essential Concepts Using R and Python Paperback - 2020

by Peter Bruce, Andrew Bruce, Peter Gedeck

Add to wish list
  • New
  • Paperback
New

Description

Statistical methods are a key part of data science, yet few data scientists have formal statistical training. Courses and books on basic statistics rarely cover the topic from a data science perspective. The second edition of this popular guide adds comprehensive examples in Python, provides practical guidance on applying statistical methods to data science, tells you how to avoid their misuse, and gives you advice on what's important and what's not.

Many data science resources incorporate statistical methods but lack a deeper statistical perspective. If you're familiar with the R or Python programming languages and have some exposure to statistics, this quick reference bridges the gap in an accessible, readable format.

With this book, you'll learn:

Why exploratory data analysis is a key preliminary step in data science

How random sampling can reduce bias and yield a higher-quality dataset, even with big data

How the principles of experimental design yield definitive answers to questions

How to use regression to estimate outcomes and detect anomalies

Key classification techniques for predicting which categories a record belongs to

Statistical machine learning methods that "learn" from data

Unsupervised learning methods for extracting meaning from unlabeled data

About the Author

Peter Bruce is the Founder and Chief Academic Officer of the Institute for Statistics Education at Statistics.com, which offers about 80 courses in statistics and analytics, roughly half of which are aimed at data scientists. He has authored or co-authored several books in statistics and analytics, and he earned his Bachelor's degree at Princeton, and Masters degrees at Harvard and the University of Maryland.

^

Andrew Bruce, Principal Research Scientist at Amazon, has over 30 years of experience in statistics and data science in academia, government and business. The co-author of Applied Wavelet Analysis with S-PLUS, he earned his bachelor's degree at Princeton, and PhD in statistics at the University of Washington

^

Peter Gedeck, Senior Data Scientist at Collaborative Drug Discovery, specializes in the development of machine learning algorithms to predict biological and physicochemical properties of drug candidates. Co-author of Data Mining for Business Analytics, he earned PhD's in Chemistry from the University of Erlangen-Nürnberg in Germany and Mathematics from Fernuniversität Hagen, Germany.

Ask the seller a question Add to wish list
On sale A$44.67 (was A$49.63 )
Free Delivery to USA
Standard delivery: 7 to 15 days
More delivery options
Ships from Anaira Enterprises (Andaman and Nicobar Islands, India)

On sale

More books like this are on offer from Anaira Enterprises at 10% off.

Details

About Anaira Enterprises Andaman and Nicobar Islands, India

Biblio member since 2023

Brand new books at best whole-sale price.

Terms of Sale: 30 day return guarantee, with full refund including original shipping costs for up to 30 days after delivery if an item arrives misdescribed or damaged.

Browse books from Anaira Enterprises

Reader reviews for Practical Statistics for Data Scientists: 50+ Essential Concepts Using R and Python

From the publisher

Statistical methods are a key part of data science, yet few data scientists have formal statistical training. Courses and books on basic statistics rarely cover the topic from a data science perspective. The second edition of this popular guide adds comprehensive examples in Python, provides practical guidance on applying statistical methods to data science, tells you how to avoid their misuse, and gives you advice on what's important and what's not.

Many data science resources incorporate statistical methods but lack a deeper statistical perspective. If you're familiar with the R or Python programming languages and have some exposure to statistics, this quick reference bridges the gap in an accessible, readable format.

With this book, you'll learn:

  • Why exploratory data analysis is a key preliminary step in data science
  • How random sampling can reduce bias and yield a higher-quality dataset, even with big data
  • How the principles of experimental design yield definitive answers to questions
  • How to use regression to estimate outcomes and detect anomalies
  • Key classification techniques for predicting which categories a record belongs to
  • Statistical machine learning methods that "learn" from data
  • Unsupervised learning methods for extracting meaning from unlabeled data

About the author

Peter Bruce is the Founder and Chief Academic Officer of the Institute for Statistics Education at Statistics.com, which offers about 80 courses in statistics and analytics, roughly half of which are aimed at data scientists. He has authored or co-authored several books in statistics and analytics, and he earned his Bachelor's degree at Princeton, and Masters degrees at Harvard and the University of Maryland.

Andrew Bruce, Principal Research Scientist at Amazon, has over 30 years of experience in statistics and data science in academia, government and business. The co-author of Applied Wavelet Analysis with S-PLUS, he earned his bachelor's degree at Princeton, and PhD in statistics at the University of Washington

Peter Gedeck, Senior Data Scientist at Collaborative Drug Discovery, specializes in the development of machine learning algorithms to predict biological and physicochemical properties of drug candidates. Co-author of Data Mining for Business Analytics, he earned PhD's in Chemistry from the University of Erlangen-Nrnberg in Germany and Mathematics from Fernuniversitt Hagen, Germany

tracking-