How to perform anomaly detection with the LOF Algorithm

The LOF algorithm is an anomaly detection method that borrows concepts from the K-nearest neighbors algorithm.

Automated Detection of Data Quality Issues

This article is the second in a series about cleaning data using Large Language Models (LLMs), with a focus on identifying errors in tabular data sets. The sketch outlines the methodology we’ll…

Process Pandas DataFrames with a Large Language Model

Nowadays, it is easy to use different large language models (LLMs) via the web interface or the public API. But can we seamlessly integrate an LLM into the data analysis process and use the model…

Step by Step Basics: Code Autodocumentation

A Data Scientist holds many responsibilities when working on a project, and one that is usually left until the last minute is documentation. Perhaps you’re diligent with writing docstrings for…

How to Ensure AI Models Reflect the Richness of Human Diversity

Insights from bridging data science and cultural understanding. “How to Ensure AI Models Reflect the Richness of Human Diversity” is published by John Loewen, PhD in Towards AI.

Four Data Engineering Projects That Look Great on your CV

In this story, I would like to speak about data engineering career paths and data projects that look great on any CV. If you are an aspiring data practitioner not only willing to learn new tools and…

Customizing RStudio Container with Docker Compose

In this post, we will review the steps of setting up a Docker-Compose workflow to launch an RStudio Server inside a container. We will introduce the Docker Compose settings process and discuss when…

Evolving Chess Puzzles

Evolutionary Algorithms (EAs) are a subset of AI that solve problems using methods inspired by biological evolution. From optimizing neural networks to resource scheduling, they have a stunning range…

Are Language Models Benchmark Savants or Real-World Problem Solvers?

In the realm of education, the best exams are those that challenge students to apply what they’ve learned in new and unpredictable ways, moving beyond memorizing facts to demonstrate true…

Top Important Computer Vision Papers for the Week from 11/03 to 17/03

Every week, several top-tier academic conferences and journals showcase innovative research in computer vision, presenting exciting breakthroughs in various subfields such as image recognition…