Start-Up Data Mesh Blueprint: 3 Steps for Becoming a Data-Driven Start-Up through the Data Mesh

Start-Up Data Mesh Blueprint: 3 Steps for Becoming a Data-Driven Start-Up through the Data Mesh. A dead-simple 3-step blueprint for start-ups to get data-driven right through the data mesh.

Getting Started with BigQuery SQL User Defined Functions (SQL UDFs)

Getting Started with BigQuery SQL User Defined Functions (SQL UDFs). A powerful and fundamental building block which enables custom extension of the core BigQuery platform functionality.

Exploring Alternatives to Python’s Built-in Data Structures

Python‘s built-in data structures serve as the backbone for many operations. That being said, users that leverage these data structures soon have to come to terms with their limitations.

Image Color Extraction with Python in 4 Steps

Colors help get people’s attention at first sight. They are helpful in communicating messages, for example, using red and blue colors to express high and low temperatures or green and red colors to…

SourceCodeAI — how to handle Train-Inference mismatch

Source code AI has many unique features which differentiate it from the more general NLP applications (like the common practice to heavily process the input prior to feeding it to the model). One of…

First, We Must Discover. Then, We Can Explore.

Back in the slammin’ 70s, John Tukey published Exploratory Data Analysis, through which he championed the idea of playing around with our datasets before jumping into hypothesis testing. Tukey argued…

Creating autocomplete search forms

User-friendly search masks are almost indispensable in today’s flood of data. In this article we will see how to develop a responsive search for translation purposes using Tkinter and compare it with…

5 Ways to Use Histograms with Machine Learning Algorithms

Feature engineering is the process of using domain knowledge to create features that make machine learning algorithms work better. This is a crucial part of applied machine learning and is often the…

High scalable fast search with Haystack

In a previous post we created a simple “string match search” to find German/English translations. In a figurative sense, we can also call this translation table a kind of FAQ. FAQ search is said to…

Stat Stories: Variable Transformation to Generate New Distributions

Often we are required to generate a new distribution or density function with closed forms from a given distribution. Programmatically, it might be easy, but sometimes looking beyond just numbers is…