Random Forest or XGBoost? It is Time to Explore LCE

A new machine learning method that further enhances the prediction performance of the state-of-the-art Random Forest and XGBoost. LCE combines their strengths and adopts a complementary diversification approach to obtain a better generalizing predictor.

towardsdatascience.com 15 hours ago

Controlling for “X”?

An intuitive approach into what it really means to control for covariates in a linear regression model via the Frisch-Waugh-Lovell theorem.

towardsdatascience.com 15 hours ago

Another Look at Exploratory Data Analysis with Excel’s Ideas Feature

TLDR: If you are not a data scientist, you too can perform exploratory data analysis. And any data scientists who previously found the distribution of Excel’s automated exploratory data analysis…

towardsdatascience.com 16 hours ago

Complete Detailed Tutorial on Linear Regression in Python for Beginners

Linear regression is the most basic type of machine learning. It is based on the simple straight-line formula we have all learned in middle school. Though there are a lot of other more…

medium.com_towards-ai 18 hours ago

Want to Succeed Doing ML in the Real World?

Read on to learn a couple of tricks that will make your life easier and increase your chances of success doing ML in the real world

3 Rookie Mistakes to Avoid with Python Lists

Of all the coding languages, Python is probably the most beginner-friendly. It’s intuitive, well-documented, and easy to learn. However, all languages have quirks, which can be difficult to spot…

BigQuery Remote Functions, Cloud Functions 2.0, and Plus Codes Revisited

Years ago, I went to a conference hosted by Google and discovered Plus Codes. In a nutshell, it’s an addressing system that generates an alphanumeric code to represent locations on the world. It’s…

Performing Data Validation at Scale with Soda Core

An in-depth look into the past and present of data validation, and how you can leverage today's tools to ensure data quality at scale.

AWS SageMaker X HuggingFace X AWS QuickSight

This is a solution that demonstrates how to train and deploy a pre-trained Huggingface model on AWS SageMaker and publish an AWS QuickSight Dashboard that visualizes the model performance over the…

Learning Gadfly by Creating Beautiful Seaborn Plots in Julia

One of the libraries I used a lot for drawing attractive and informative statistical graphics in Python was Seaborn. One of my favourite packages for data visualisation in Julia is Gadfly. It is…