Common data, such as the well-known iris or penguin dataset, used for analysis are pretty simple since they have only a few categorical variables. By the way, real-world data can be more complex and…
This is the sixth part in our series of posts on the topic of analyzing and optimizing PyTorch models using PyTorch Profiler and TensorBoard. In this post we will tackle one of the more complicated…
Data-driven organizations often run hundreds or thousands of experiments at any given time, but what is the net impact of all of these experiments? A naive approach is to sum the difference-in-means…
I have encountered many situations in the past where I wanted to get Pub/Sub messages into a BigQuery table, but I never managed to find a particularly simple way of doing this. You could set up a…
In the field of signal processing, source separation describes the task of breaking down an audio signal into multiple source audio signals. This concept is not only relevant for music, but also for…
In today’s data-driven world, organizations often face challenges with diverse and inconsistent data sources. Entity resolution, also called record linkage or deduplication, helps identify and merge…
Various tools can be used to debug Python code, from the simplest print() function, via static but more advanced icecream and its sibling ycecream, to the various interactive debuggers that IDEs…
I’ve been thinking about the “Human vs. AI” debate. Not because I’ve been taking an active part in it. Still, I want to share with you how I take advantage of the AI tools. And I’ll show you how I…
Ever since the emergence of ChatGPT, I’ve been thinking how ChatGPT would influence the world of optimization and Operations Research (OR). There has been news about ChatGPT passing high school and…
I have often seen beautiful population maps circulating online; however, I usually got stuck at some technical parts, like visualizing other map segments than shown in the tutorial or turning the…