Soft Skills You Need For A Career in Data in 2024

In the rapidly expanding technological world, data-related careers have strongly emerged in the market over the past few years, and the demand for a professional with skills to understand and…

The Math Behind “The Curse of Dimensionality”

In the realm of machine learning, handling high-dimensional vectors is not just common; it’s essential. This is illustrated by the architecture of popular models like Transformers. For instance, BERT…

Tiny Llama — a Performance Review and Discussion

TinyLlama is an open-source project that trains a small language model of around 1.1B parameters. The project aims to have a language model capable of performing tasks a full LLM like Llama 2 can…

Label Studio Customized Backend for Semiautomatic Image Segmentation Labeling

· Introduction · Overview ∘ Goal ∘ Why semiautomatic? ∘ Entering Label Studio ∘ 1 frontend + 2 backends · Implementation (Local) ∘ 1. Install git and docker & download backend code ∘ 2. Set up…

The Infamous Attention Mechanism in the Transformer Architecture

It all started with a problem. How do you play around with sequential data?! People had the architecture to work with regression and classification problems, but sequential data was very different…

How to build an Air-gapped LLM-based AI Chatbot in Containers Step-by-Step

As AI tools become increasingly popular, they play an important role in boosting our productivity in everyday tasks. In this post, I will show you how to build an Air-gapped LLM-based AI Chatbot in Containers Step-by-Step.

A Comprehensive Comparison Between OPen-Source Vector Databases

Semantic search and retrieval-augmented generation (RAG) are revolutionizing the way we interact online. However, the backbone enabling these groundbreaking advancements is often overlooked: vector…

Introduction to Multi-Stage Image Build for Python

I never paid too much attention to the size of my images until I started deploying my code into Github Actions using containers. The math here is simple: the bigger the size of your container, the…

How I Landed a Spotify Internship: My Guide to Get Into Top Tech

So I’m something of an internship specialist myself. My colleagues even used to joke about the fact that I was becoming a senior intern. So if you’re a student dreaming to join Spotify, this article…

How Machine Learning Detects Phishing Attacks

Machine learning has special applications in phishing detection that can protect Internet users from harm. Especially since the COVID-19 epidemic, we have relied on the Internet for so many of our…