How to Work Remotely and Not Feel Isolated

It’s no secret that many vacancies in IT & data are remote. On one hand, this means not wasting time in traffic jams on the way to work and back home. But on the other hand, for many employees the…

How to Build a Generative AI Tool for Information Extraction from Receipts

Paper receipts come in all sorts of styles and formats and represent an interesting target for automated information extraction. They also provide a wealth of itemized costs that, if aggregated into…

Linear Regressions for Causal Conclusions

I suppose most of us have heard the statement “correlation doesn’t imply causation” multiple times. It often becomes a problem for analysts since we frequently can see only correlations but still…

Design an Easy-to-Use Deep Learning Framework

Deep learning frameworks are extremely transitory. If you compare the deep learning frameworks people use today with those of eight years ago, you will find the landscape is completely different…

Overwriting in Python: Tricky. Dangerous. Powerful

I’ve overwritten plenty of objects in Python. If you’ve spent your share of hours coding in Python, you have, too. This is because, at least in Python, overwriting objects lies at the core of…

ORPO: Preference Optimization without the Supervised Fine-tuning (SFT) Step

There are now many methods to align large language models (LLMs) with human preferences. Reinforcement learning from human feedback (RLHF) was one of the first and brought us ChatGPT, but RLHF is…

Statistics for Data Science 101 Series — An Overview

Following the Data Analytics 101 Series, the Statistics 101 Series will delve into a burst of articles that explain the usage and importance of statistics in Data Science. I will try my best to…

Google’s CodeGemma: I am not Impressed

CodeGemma represents a significant advancement in the realm of code generation and completion, stemming from Google’s broader Gemma model family. As a fine-tuned version of the Gemma-7b model…

The Controversy of AI Greeting Cards

Today I am taking a slight detour from my usual data analysis to indulge one of my favorite hobbies: crafting personalized greeting cards, now with a significant boost from Artificial Intelligence…

Mixture of Experts

What you know about Mixture of Experts is wrong. We are not using this technique because each model is an expert on a specific topic. In fact, each of these so-called experts is not an individual…