A Comprehensive Comparison Between OPen-Source Vector Databases

Semantic search and retrieval-augmented generation (RAG) are revolutionizing the way we interact online. However, the backbone enabling these groundbreaking advancements is often overlooked: vector…

Introduction to Multi-Stage Image Build for Python

I never paid too much attention to the size of my images until I started deploying my code into Github Actions using containers. The math here is simple: the bigger the size of your container, the…

How I Landed a Spotify Internship: My Guide to Get Into Top Tech

So I’m something of an internship specialist myself. My colleagues even used to joke about the fact that I was becoming a senior intern. So if you’re a student dreaming to join Spotify, this article…

How Machine Learning Detects Phishing Attacks

Machine learning has special applications in phishing detection that can protect Internet users from harm. Especially since the COVID-19 epidemic, we have relied on the Internet for so many of our…

Pandas vs Polars? Bid Adieu to Pandas and Switch To Polars!

Dealing with tabular data? Pandas is the first Python library you’ll come across when dealing with tabular data preprocessing. It’s one of the most popular libraries used in Data Science. Pandas is…

Supercharging Prompt Engineering via Symbolic Program Search

It’s no secret that much of the success of LLMs still depends on our ability to prompt them with the right instructions and examples. As newer generation LLMs become more and more powerful, prompts…

Demystifying the Receiver Operating Characteristic (ROC) Curve

One of the most common and widely applicable themes in Artificial Intelligence is Supervised Learning, in particular the Classification task. For instance, if we have a mixture of cat and dog images…

Some Technical Notes About Llama 3

Since the debut of the original version, Llama has become one of the foundational blocks of the open source generative AI space. I prefer to use the term “open models,” given that these releases are…

Llama 3 Matches GPT-4 Performance with Less Parameters

Llama 3 uses 75 times this amount, aiming to create a strong yet compact model for simpler use and inference. Meta found that Llama 3 didn’t learn as well as expected, even with lots of data. This…

Deep Dive into Self-Attention by Hand✍︎

As of today, the world has been swept over by the power of transformers. Not the likes of ‘Robtimus Prime’ but the ones that constitute neural networks. And that power is because of the concept of…