Uncategorized
-
More notebooks for Think Stats
More notebooks for Think Stats As I mentioned in the previous post, I am getting ready to teach Data Science in…
Read More » -
Why hierarchical models are awesome, tricky, and Bayesian
Thomas originally posted this article here at http://twiecki.github.io Hierarchical models are underappreciated. Hierarchies exist in many data sets and modeling them appropriately…
Read More » -
Third batch of notebooks for Think Stats
As I mentioned in the previous post and the one before that, I am getting ready to teach Data Science in the spring,…
Read More » -
Descriptive Analysis of MLST Data for MRSA
During one of my summers, I had the opportunity to conduct some research on the prevalence of methicillin-resistant Staphylococcus aureus (MRSA) in…
Read More » -
Last batch of notebooks for Think Stats
Getting ready to teach Data Science in the spring, I am going back through Think Stats and updating the Jupyter notebooks. Each chapter…
Read More » -
2017 Data Science in Review, Topic Modeling
This blogpost is about topic modeling using data from this blog, zambiatek.com. From this, combined with the most visited articles of…
Read More » -
Mine Like Amazon with Market Basket Analysis
Pattern mining is an incredibly simple but powerful technique for discovering cooccurrences in large datasets. The most common approach to…
Read More » -
Building a Microservice for Twitter Real-Time Data Collection and Sentiment Analysis.
First of all, I would like to point out that the skill of building MVP and microservices for a data…
Read More » -
Tips for Linear Regression Diagnostics
I like to call linear regression the data scientist’s “workhorse.” It may not be sexy, but it’s a tried and…
Read More » -
Introducing PyMC Labs: Saving the World with Bayesian Modeling
After I left Quantopian in 2020, something interesting happened: various companies contacted me inquiring about consulting to help them with…
Read More »