Get Underlined Text from Any PDF with Python

PDF data extraction can be a real headache, and it gets even trickier when you’re trying to snag underlined text — believe it or not, there aren’t any go-to solutions or libraries that handle this…

Best Resources to Learn & Understand Evaluating LLMs

Large language models (LLMs) are gaining increasing popularity in both academia and industry, owing to their unprecedented performance in various applications. As LLMs continue to play a vital role…

Modeling Slowly Changing Dimensions

What are Slowly Changing Dimensions and how to implement them in Data Warehouse. Difference between SCD Types 0, 1, 2, 3 and 4 and how they affect pipelines

Demystifying Shiny Modules by Transforming a Bigfoot Sightings App Modular

When I discovered Shiny years ago, I got immediately hooked. Shiny is an R package to build interactive web applications that can run R code in the backend. I was fascinated by the ability it…

Courage to Learn ML: Tackling Vanishing and Exploding Gradients (Part 2)

Welcome back to a new chapter of “Courage to Learn ML.” For those new to this series, this series aims to make these complex topics accessible and engaging, much like a casual conversation between a…

Cyclical Encoding: An Alternative to One-Hot Encoding for Time Series Features

Cyclical Encoding: An Alternative to One-Hot Encoding for Time Series Features. Cyclical encoding provides your model with the same information using significantly less features.

Analyzing MRI Scans With AI (Tensorflow) Is Easier Than You Think

A few weeks ago, I had an MRI scan. That’s when it occurred to me to wonder how complicated it would be to evaluate MRI images with the help of AI. I had always thought that this was a complex…

Turn Llama 3 into an Embedding Model with LLM2Vec

The embedding model is a critical component of retrieval-augmented generation (RAG) for large language models (LLMs). They encode the knowledge base and the query written by the user.

On Handling Precalculated Hierarchical Data in Power BI

While hierarchies are a familiar concept with data, some sources deliver their data in an unusual format. But what happens when we get pre-aggregated values?

SaaS-based Engineering Tool Onboarding with AI Assistance

A Prototype solution to explore improving engineering tool onboarding experience by leveraging LLM and multi-agents with LangChain and LangGraph