99 Lessons on Data Analysis from Placing Top 5 in 5 Kaggle Analytics Challenges

I share how I won prizes in Kaggle Analytics challenges / competitions and list all my tips on exploratory data analysis and data visualizations.

All About Pandas Groupby Explained with 25 Examples

groupby is one of the most frequently used pandas functions in data analysis. It groups data points (i.e. rows) based on the distinct values in the given column or columns. We…

Data Analyst’s guide in handling flooding data ad-hoc requests

As data analysts, especially in organizations where the team is not yet data-savvy, we often need to deal with numerous ad-hoc data requests from various stakeholders.

3 Underappreciated Skills to Make You a Next-Level Python Programmer

Top Python Developer Skills You Must Have in 2022. Become a successful Python developer and boost your career with these coveted skills many engineers lack.

Simple random sampling: is it actually simple?

No matter how hard you may try to forget your STAT101 course, you’ll likely default to simple random sampling (SRS) as your knee-jerk approach. It was, after all, an assumption you were told…

Named Entity Recognition with Deep Learning (BERT) — The Essential Guide

From data preparation to model training, and how to tag your own sentences!

5 Biases & Fallacies Data Scientists Should Beware of (and How to Avoid Them)

One of the hardest things about working with data is dealing with the fallacies and biases that plague both the data itself and how we interpret it. Because of the hundreds of biases and…

The Joy of A/B Testing, Part II: Advanced Topics

A/B testing is one of the most critical steps in Machine Learning production: we only want to roll out a new ML model if it can be proven to be better in production. In Part I of this series we…

A Step-By-Step Guide To Summarizing Audio Files in Python

As the name suggests, summarization is the process of generating a concise summary of a given piece of information. This information can appear as text, audio, video, pictures, etc. In other words…

5 important talks you might have missed at JuliaCon 2022

JuliaCon 2022 wrapped up on Saturday, July 30th, with the annual virtual hackathon. As one of the conference organizers, I usually spend the live conference days during JuliaCon putting out fires…