When we discuss with our customers and our community of data analysts we always come up with a common list if go-to resources that everybody uses. Some days ago we wrote a short but awesome list of newsletters about Big Data and Data Science that you should be aware of.
In this post, I gathered some of the best blogs and websites that will prove useful for every data analyst. I tried to include blogs/sources which are up to date and not look “dead”. Additions are welcome too ☺
So let’s get started (in alphabetical order)…
Cross Validated is part of the Stack Exchange network. It is a Q&A site about statistics, machine learning, data analysis, data mining and data visualization.
Links: Cross Validated
Data Science Central
Data Science Central (DSC) is a thriving community of data scientists and data / big data experts and practitioners. It contains a large number of posts, questions, data sets, training material and more.
Links: Data Science Central
A blog by Curt Monash about data management, BI, and analytic technologies. There is a lot of material for stuff like: Amazon and its cloud like Amazon Redshift,Cassandra,Kafka and Confluent or PostgreSQL.
Facebook Data Science Blog
Facebook Data Science Blog is actually a Facebook page. But it is a goldmine as it is the official page of the data scientist teams working at Facebook.
Links: Facebook Data Science Blog
Open – New York Times
This is an interesting blog. It is about code written by New York Times development team. They cover everything from their internal projects and products, along with data analysis, Machine Learning, and data science.
Links: Open – New York Times
O’Reilly Data Radar
O’Reilly is the one stop shop in anything from software engineering to data. In their blog, you can find a huge amount of information about data and big data along with many events, opinions and offers. O’Reilly Data Radar (with a new website) is a super resource to follow.
Links: O’Reilly Data Radar
There are many topics related to Data Analysis and Data Science topics in Quora. There is an active community answering questions and having discussions on various Data Science topics.
R-Bloggers is a content aggregator from feeds of blogs writing about R. If you are an R fan then you already know R-bloggers, if not it is great to follow it to stay up to date.
Reddit has some great subreddits about Machine Learning, Data Science and Data analysis.
Simply Statistics is a blog run by three biostatistics professors (Rafa Irizarry, Roger Peng, and Jeff Leek). They write about statistics (obviously) data analysis and more. You may also find posts like: What is software engineering for data science? or The relativity of raw data
Links: Simply Statistics
Statistical Modeling, Causal Inference, and Social Science
In this blog, you are going to find A LOT of practical examples for data analysis, statistics, and modeling. It is updated often as there are six people involved and sponsored by 10+ organizations like Columbia University, National Institute of Health or Sloan Foundation.