Uncategorized
-
Emily Webber of AWS on Pretraining Large Language Models
As newer fields emerge within data science and the research is still hard to grasp, sometimes it’s best to talk…
Read More » -
More notebooks for Think Stats
More notebooks for Think Stats As I mentioned in the previous post, I am getting ready to teach Data Science in…
Read More » -
Why hierarchical models are awesome, tricky, and Bayesian
Thomas originally posted this article here at http://twiecki.github.io Hierarchical models are underappreciated. Hierarchies exist in many data sets and modeling them appropriately…
Read More » -
Third batch of notebooks for Think Stats
As I mentioned in the previous post and the one before that, I am getting ready to teach Data Science in the spring,…
Read More » -
Descriptive Analysis of MLST Data for MRSA
During one of my summers, I had the opportunity to conduct some research on the prevalence of methicillin-resistant Staphylococcus aureus (MRSA) in…
Read More » -
Last batch of notebooks for Think Stats
Getting ready to teach Data Science in the spring, I am going back through Think Stats and updating the Jupyter notebooks. Each chapter…
Read More » -
2017 Data Science in Review, Topic Modeling
This blogpost is about topic modeling using data from this blog, zambiatek.com. From this, combined with the most visited articles of…
Read More » -
Mine Like Amazon with Market Basket Analysis
Pattern mining is an incredibly simple but powerful technique for discovering cooccurrences in large datasets. The most common approach to…
Read More » -
Building a Microservice for Twitter Real-Time Data Collection and Sentiment Analysis.
First of all, I would like to point out that the skill of building MVP and microservices for a data…
Read More » -
Tips for Linear Regression Diagnostics
I like to call linear regression the data scientist’s “workhorse.” It may not be sexy, but it’s a tried and…
Read More »









