Reddit Books

The most-mentioned books on r/datascience

Reddit is a social media and news aggregation website that ranks content based on a voting system. People worldwide post content (usually links, but also original content), and other users can “upvote” or “downvote” posts, pushing the most interesting content to the top. 

It’s a place where you can find groups of like-minded people. Reddit calls these groups subreddits, and they cover different topics, including niche interests, politics, hobbies, and thousands of other topics people want to talk about.

Since its launch in 2005, the site has become one of the most popular social media sites with millions of monthly active users. 

We've processed billions of comments to find the books most mentioned on Reddit. 

 

Table of Contents

 

The most-mentioned books on r/datascience

Data Science for Business35 users
Python for Data Analysis15 users
An Introduction to Statistical Learning15 users
Data Science from Scratch15 users
Elements of Statistical Learning Data Mining, Inference, and Predic...13 users
Data Smart13 users
Hands-On Machine Learning with Scikit-Learn and TensorFlow12 users
Hands-On Machine Learning with Scikit-Learn, Keras, and TensorFlow11 users
Clean Code10 users
Storytelling with Data10 users
Trustworthy online controlled experiments9 users
Designing Data-Intensive Applications9 users
Weapons of Math Destruction8 users
The Visual Display of Quantitative Information8 users
Applied Predictive Modeling8 users
Statistical Rethinking8 users
Practical Statistics for Data Scientists7 users
Ace the Data Science Interview6 users
The Data Warehouse Toolkit6 users
Python Data Science Handbook6 users
The Signal and the Noise5 users
Cracking the Coding Interview5 users
Data Scientists at Work5 users
Discovering Statistics Using R5 users
Lean Analytics5 users
Doing Data Science5 users
Statistics For Data Scientists5 users
Applied Linear Statistical Models4 users
Doing Bayesian Data Analysis4 users
Pattern Recognition and Machine Learning4 users
How to Lie with Statistics4 users
Naked Statistics4 users
Programming Collective Intelligence4 users
SQL in 10 Minutes, Sams Teach Yourself4 users
Interactive Data Visualization for the Web4 users
Python Machine Learning4 users
Everybody Lies3 users
Data Mining3 users
Music In The United States3 users
Probabilistic Graphical Models3 users
Database Design for Mere Mortals3 users
All of Statistics3 users
The Book of Why3 users
Data Analysis Using SQL and Excel3 users
The Data Science Handbook3 users
Web Scraping with Python3 users
Fluent Python3 users
Machine Learning with Python Cookbook3 users
Extending The Linear Model With R3 users
Practical Data Science with R3 users
Deep Learning with Python3 users
Build a Career in Data Science3 users
R for Marketing Research and Analytics3 users

 

Our methodology

Book mentions are found by checking every reddit comment for links to Amazon, Goodreads, Google Books, and O'Rielly Media. Non-link mentions are too tricky to parse in an unbiased way: books with short, simple titles like The Road get massively under- or over-counted.

When a user mentions the same book multiple times in the same subreddit, we only count the top-voted comment. This curbs users shilling a particular book, and gives a more representative feel for an entire subreddit.

 

Who are we

Find accountability partners, and study online courses & books with other learners. Moocable helps you find your next course/book/problem set, and lets you find study partners.

Junaid Khan

Junaid Khan

Junaid Khan is the founder of Moocable - the platform to help learner find their next MOOC, and study partners. A passionate learner, he struggled with self-learning.