Reddit Books

The most-mentioned books on r/dataengineering

Reddit is a social media and news aggregation website that ranks content based on a voting system. People worldwide post content (usually links, but also original content), and other users can “upvote” or “downvote” posts, pushing the most interesting content to the top. 

It’s a place where you can find groups of like-minded people. Reddit calls these groups subreddits, and they cover different topics, including niche interests, politics, hobbies, and thousands of other topics people want to talk about.

Since its launch in 2005, the site has become one of the most popular social media sites with millions of monthly active users. 

We've processed billions of comments to find the books most mentioned on Reddit. 

Table of Contents

 

The most-mentioned books on r/dataengineering

Designing Data-Intensive Applications35 users
The Data Warehouse Toolkit28 users
Fundamentals of Data Engineering12 users
Data Pipelines Pocket Reference10 users
Agile Data Warehouse Design8 users
Spark7 users
Building a Scalable Data Warehouse with Data Vault 2. 05 users
Star Schema The Complete Reference4 users
Data governance3 users
T-SQL Querying3 users
The data warehouse ETL toolkit3 users
Database Internals3 users
Learning Spark3 users
T-SQL Fundamentals3 users
Big data3 users
Data Engineering with AWS3 users

Our methodology

Book mentions are found by checking every reddit comment for links to Amazon, Goodreads, Google Books, and O'Rielly Media. Non-link mentions are too tricky to parse in an unbiased way: books with short, simple titles like The Road get massively under- or over-counted.

When a user mentions the same book multiple times in the same subreddit, we only count the top-voted comment. This curbs users shilling a particular book, and gives a more representative feel for an entire subreddit.

 

Who are we

Find accountability partners, and study online courses & books with other learners. Moocable helps you find your next course/book/problem set, and lets you find study partners.

Junaid Khan

Junaid Khan

Junaid Khan is the founder of Moocable - the platform to help learner find their next MOOC, and study partners. A passionate learner, he struggled with self-learning.